Job Description
About the Role
Our Platform Infrastructure team is the backbone of everything we do at Attentive, providing a resilient and cost-effective platform that seamlessly handles billions of events from over 100 million customers daily. We own everything from compute, persistence, and networking to observability and deployments. Joining our team offers a high-growth career opportunity to collaborate with some of the world’s most talented engineers in a high-performance, high-impact culture.
As a Staff Engineer in Production Engineering, you will have the opportunity to work across all of the engineering teams at Attentive, focusing on implementing and driving cost optimizations across our Software Stack. You will collaborate and work with engineers, product managers and our finance team. Your work will drive millions of cost savings, provide enhanced visibility and improve our cost model.
What You'll Accomplish
Spearhead FinOps initiatives to increase cost transparency and optimize resource utilization across teamsProvide guidance and mentorship to engineers on cost-optimization best practicesCollaborate with teams across the organization to implement cost-saving measuresAnalyze, troubleshoot, coordinate, and resolve complex infrastructure issuesProactively lead infrastructure initiatives that have a company-wide impactPartner with engineering teams to influence cost-effective architectural and design choicesDevelop automation workflows to enhance team productivity and efficiencyFoster a culture of continuous improvement by encouraging innovative thinking and challenging established normsYour Expertise
Bachelor's degree in Computer Science, Engineering, or related field; Master’s degree preferred2+ years of experience in FinOps or cloud cost management5+ years of experience in cloud infrastructure management, development or related rolesProficiency in some of the following technologies: AWS, Datadog, Kubernetes, Vantage, Terraform, JavaStrong analytical and problem-solving skills with a keen attention to detailExcellent communication and leadership abilitiesAbility to work collaboratively in a cross-functional team environmentWhat We Use
Our infrastructure runs primarily in Kubernetes hosted in AWS’s EKSInfrastructure tooling includes Istio, Datadog, Terraform, CloudFlare, and HelmOur backend is Java / Spring Boot microservices, built with Gradle, coupled with things like DynamoDB, Kinesis, AirFlow, Postgres, Planetscale, and Redis, hosted via AWSOur frontend is built with React and TypeScript, and uses best practices like GraphQL, Storybook, Radix UI, Vite, esbuild, and PlaywrightOur automation is driven by custom and open source machine learning models, lots of data and built with Python, Metaflow, HuggingFace 🤗, PyTorch, TensorFlow, and Pandas