Our Platform Infrastructure team is the backbone of everything we do at Attentive, providing a resilient and cost-effective platform that seamlessly handles billions of events from over 100 million customers daily. We own everything from compute, persistence, and networking to observability and deployments. Joining our team offers a high-growth career opportunity to collaborate with some of the world’s most talented engineers in a high-performance, high-impact culture.
As a Staff Engineer in Production Engineering, you will have the opportunity to work across all of the engineering teams at Attentive, focusing on implementing and driving cost optimizations across our Software Stack. You will collaborate and work with engineers, product managers and our finance team. Your work will drive millions of cost savings, provide enhanced visibility and improve our cost model.
What You'll Accomplish
Spearhead FinOps initiatives to increase cost transparency and optimize resource utilization across teams
Provide guidance and mentorship to engineers on cost-optimization best practices
Collaborate with teams across the organization to implement cost-saving measures
Analyze, troubleshoot, coordinate, and resolve complex infrastructure issues
Proactively lead infrastructure initiatives that have a company-wide impact
Partner with engineering teams to influence cost-effective architectural and design choices
Develop automation workflows to enhance team productivity and efficiency
Foster a culture of continuous improvement by encouraging innovative thinking and challenging established norms
Your Expertise
Bachelor's degree in Computer Science, Engineering, or related field; Master’s degree preferred
2+ years of experience in FinOps or cloud cost management
5+ years of experience in cloud infrastructure management, development or related roles
Proficiency in some of the following technologies: AWS, Datadog, Kubernetes, Vantage, Terraform, Java
Strong analytical and problem-solving skills with a keen attention to detail
Excellent communication and leadership abilities
Ability to work collaboratively in a cross-functional team environment
What We Use
Our infrastructure runs primarily in Kubernetes hosted in AWS’s EKS
Infrastructure tooling includes Istio, Datadog, Terraform, CloudFlare, and Helm
Our backend is Java / Spring Boot microservices, built with Gradle, coupled with things like DynamoDB, AirFlow, Postgres, and Redis, hosted via AWS
Our frontend is built with React and TypeScript, and uses best practices like GraphQL, Storybook, Radix UI, Vite, esbuild, and Playwright
Our automation is driven by custom and open source machine learning models, lots of data and built with Python, Metaflow, HuggingFace 🤗, PyTorch, TensorFlow, and Pandas