Infrastructure Engineer II

Job Description

About the Attentive Team
Have you ever received text message marketing from your favorite brand with an incredible offer? Did you know that effective text message marketing is by far the highest ROI marketing channel? And did you know that customers are increasingly preferring to interact with brands through text? That is what we do at Attentive. We help the world’s largest brands interact with their clients at the right time and with the right message. Our platform powers more than 400 million messages per day, approaching 100 billion per year.

We’re building big things!  Check out our tech blog here: https://tech.attentive.com/

About the Role
Our Search Platform team is the backbone of Attentive’s data infrastructure, processing, storing, and optimizing data at massive scale and speed. We handle billions of events from over 100 million customers daily, enabling near-real-time data insights and AI-driven capabilities through our Data, Optimization, and ML Platforms. Joining our team offers a high-growth career opportunity to work with some of the world’s most talented engineers in a high-performance and high-impact culture.

We’re looking for a self-motivated Infrastructure Engineer to join our Data Infrastructure team. In this role, you'll take ownership of the foundational components powering our data storage and streaming services, ensuring they are resilient, scalable, manageable, and highly available. You'll work closely with our microservices architecture teams, empowering them to move with confidence in our data’s integrity and availability. If you’re excited by the challenge of working with a diverse set of technologies—from streaming (Kafka, Pulsar) and query engines (Trino, Druid) to persistent data stores (Postgres, MySQL, OpenSearch)—this could be the perfect fit.


What You'll Accomplish
  • Design, deploy, and maintain scalable, secure, and extensible big data systems to power our core products and enable business growth
  • Develop automation strategies that streamline data services and ensure a stable infrastructure for all engineering teams
  • Champion DevOps practices to foster a culture of quality and enhance developer productivity
  • Lead peer design reviews, code reviews, and knowledge-sharing sessions to elevate team collaboration and technical expertise
  • Continuously improve system reliability, availability, and scalability through your technical leadership and expertise

  • Your Expertise
  • 3+ years of experience as a Data Infrastructure Engineer or in similar roles like Platform Engineer, SRE, DevOps, or Backend Engineer with a data storage focus
  • Experience managing and operating data platform technologies like PostgreSQL, MySQL, OpenSearch, Kafka, Druid, Trino, OpenMetadata, and Redis
  • Proficiency in managing and operating cloud infrastructure at scale using Kubernetes, Terraform, and Datadog. Familiarity with cloud computing concepts like access control and instance management
  • Strong problem-solving skills with the ability to analyze trade-offs, collaborate with developer peers, and adopt a solutions-oriented approach
  • Experience with high availability and disaster recovery strategies in a cloud environment, along with monitoring and logging tools.
  • You embrace blameless post-incident analysis and are committed to continuous improvement

  • What We Use
  • We use a wide variety of technology and we’re always looking for the best solutions to solve hard problems
  • Attentive’s backend is Java / Spring Boot microservices, built with Gradle, and hosted via AWS
  • Our infrastructure is Kubernetes-centric, with Argo CD leveraged for deployments, Terraform for configuration management, and Spacelift as configuration control
  • As a Data Team, we support a number of different data storage solutions across several paradigms
  • MySQL and PostgreSQL as Relational Database solutions
  • Redis and Valkey as the Cache offering
  • AWS DynamoDB and PlanetScale are Horizontally Scalable Database solutions
  • Apache Druid and Trino for Real-Time Analytics
  • Apache Pulsar as a Streaming Data platform
  • Apache Airflow and Temporal as Workflow engines
  • AWS S3 as Object storage