Our Platform Infrastructure team is the backbone of everything we do at Attentive, providing a resilient and cost-effective platform that seamlessly handles billions of events from over 100 million customers daily. We own everything from compute, persistence, and networking to observability and deployments. Joining our team offers a high-growth career opportunity to collaborate with some of the world’s most talented engineers in a high-performance, high-impact culture
We’re looking for a collaborative, self-motivated, and highly driven Software Engineer to join our Compute and Network team. Our team provides the compute and networking platform with reusable, opinionated cloud components and tools that follow well-defined patterns and opinionated interfaces. This gives engineering teams across Attentive safe, reliable leverage through strong guardrails and a stable underlying infrastructure. We are responsible for building and operating the core compute and networking infrastructure that underpins Attentive’s microservices architecture
Our platform is built on AWS and cloud-native technologies such as Kubernetes and its ecosystem, Istio, ArgoCD, Terraform-based Infrastructure as Code, and Cloudflare at the edge.
What You’ll Accomplish
Design, develop, and operate software that powers Attentive’s compute and networking platform, enabling automation, reliability, and scale
Analyze, troubleshoot, and resolve complex compute, networking, and platform infrastructure issues
Orchestrate and operate Kubernetes infrastructure across multiple networks and AWS accounts
Own our Infrastructure-as-Code orchestration that enables engineering teams to consume AWS services through automation and self-serve capabilities
Develop automation workflows and tooling that improve team efficiency and reduce operational overhead
Analyze and improve the efficiency, scalability, reliability, and stability of core system resources
Partner with other teams and business stakeholders to deliver business initiatives
Provide technical leadership by initiating impactful discussions, shaping design decisions, influencing best practices, and driving initiatives to completion with strong ownership
Collaborate with partner teams on large-scope engineering projects and facilitate alignment across teams on outcomes, priorities and timelines
Your Expertise
You have been working in the areas of Platform Engineering / Site Reliability Engineering / DevOps / Infrastructure for a while, and have a strong understanding of best practices
You have collaborated effectively across engineering teams, communicate clearly, and drive alignment on technical decisions
You have demonstrated ownership and accountability in designing, building, and operating critical systems
You have experience building infrastructure in a microservices architecture
You have a background in software development and are passionate about bringing that experience to bear on the world of infrastructure
You have strong experience with Infrastructure-as-Code and a preference for building self-serve cloud platforms
You understand the importance of CI/CD in building high-performing teams and have worked with tools like Jenkins, CircleCI, and ArgoCD
You are passionate about observability and worked with tools such as Datadog, Prometheus, Splunk or similar
You are very familiar with containers and container orchestration and have direct experience with vanilla Docker as well as Kubernetes as both a user and as an administrator
You have a strong understanding of networking fundamentals and can troubleshoot or design solutions across OSI layers (L3–L7)
You are committed to mentorship and continuous learning—willing to coach others while receiving feedback openly and using it to grow
What We Use
Our infrastructure runs primarily in Kubernetes hosted in AWS’s EKS
Infrastructure tooling includes Istio, Datadog, Terraform, CloudFlare, and Helm
Our backend is Java / Spring Boot microservices, built with Gradle, coupled with things like DynamoDB, AirFlow, Postgres, and Redis, hosted via AWS
Our frontend is built with React and TypeScript, and uses best practices like GraphQL, Storybook, Radix UI, Vite, esbuild, and Playwright
Our automation is driven by custom and open source machine learning models, lots of data and built with Python, Metaflow, HuggingFace 🤗, PyTorch, TensorFlow, and Pandas