Job Description

About the Attentive Team

Have you ever received a text message from your favorite brand with an incredible offer? Did you know that text message marketing delivers the highest ROI of any marketing channel? And that more customers than ever prefer to connect with brands via text? That is what we do at Attentive. We empower the world’s leading brands to engage with their customers at the right moment, with the right message. Our platform powers more than 400 million messages every day, approaching 100 billion annually.

We’re building big things! Check out our tech blog here: https://tech.attentive.com/

About the Role

Our Platform Infrastructure team is the backbone of everything we do at Attentive, providing a resilient and cost-effective platform that seamlessly handles billions of events from over 100 million customers daily. We own everything from compute, persistence, and networking to observability and deployments. Joining our team offers a high-growth career opportunity to collaborate with some of the world’s most talented engineers in a high-performance, high-impact culture.

As part of the Infrastructure and Platform organization, the Production Engineering Team is focused on delivering a fast and reliable platform that empowers Attentive engineers to deliver solutions quickly and safely. We build scalable systems that automate routine tasks so we can focus on other impactful efforts. Reliability, scalability, and security are our areas of expertise. We focus on release, observability, and cost optimization. Our mission is to create robust platforms and tools that allow stakeholders to concentrate on delivering exceptional products.

As an Engineering Manager, you will lead a team of engineers, taking a strategic role in designing and implementing solutions that enhance the reliability and scalability of our systems, while mentoring others and influencing technical roadmaps across the organization. You will also remain hands-on, contributing to code and technical design.

What You'll Accomplish

Lead and Manage a Team: Recruit, mentor, and develop a team of production engineers. Conduct performance reviews, provide feedback, and support career growth

Craft the team’s roadmap: align with organizational goals, define vision and work with stakeholders across the organization

Design and Deliver High-Impact Solutions: Design and implement systems that enhance reliability, observability, traceability, and incident management, ensuring the platform scales effectively. Remain hands-on with coding and technical design

Lead Strategic Initiatives: Take ownership of cross-team collaborations and drive impactful projects by providing technical leadership and guidance

Partner Across Teams: Collaborate with engineers from AI/ML, Data, Platform, and Product teams to develop best-in-class services

Establish Standards and Best Practices: Define and enforce production standards, processes, and tools to ensure operational excellence

Champion Reliability Goals: Advocate for and implement SLIs, SLOs, and other reliability-focused metrics across the engineering organization

Mentorship and Knowledge Sharing: Guide and mentor team members, fostering technical growth and helping to develop the next generation of engineering leaders

Innovate and Inspire: Drive continuous improvement by bringing creative ideas and challenging the status quo

Your Expertise

3+ years of experience in Production Engineering, Backend Engineering, SRE, DevOps or similar role

2+ years of experience in a management or team lead role

Proficient Problem-Solver: Strong coding ability in at least one language (e.g., Golang, Python, Java, Typescript) with the capability to solve complex issues through code

Track Record of Success: Demonstrated experience delivering medium to large-scale projects that drive meaningful improvements in platform reliability and scalability

Reliability Expertise: Deep understanding of production reliability concepts, including SLIs, SLOs, and incident management

Strong Communicator: Excellent verbal and written communication skills with the ability to influence and collaborate across technical and non-technical teams

Fast-Paced Experience: Familiarity with working in dynamic, reliability-focused production environments (preferred)

What We Use

Our infrastructure runs primarily in Kubernetes hosted in AWS’s EKS

Infrastructure tooling includes Istio, Datadog, Terraform, CloudFlare, and Helm

Our backend is Java / Spring Boot microservices, built with Gradle, coupled with things like DynamoDB, Kinesis, AirFlow, Postgres, Planetscale, and Redis, hosted via AWS

Our frontend is built with React and TypeScript, and uses best practices like GraphQL, Storybook, Radix UI, Vite, esbuild, and PlaywrightOur automation is driven by custom and open source machine learning models, lots of data and built with Python, Metaflow, HuggingFace 🤗, PyTorch, TensorFlow, and Pandas

Pathward

IT Software Engineer Sr

We are a hybrid, remote-office company dedicated to growing our talent anywhere! We have onsite locations in: Sioux Falls, SD, Scottsdale, AZ, Troy, MI, Franklin, TN, ;

engineer
dev

Cyberark

Staff Production Engineer

Company DescriptionAbout CyberArk: CyberArk (NASDAQ: CYBR), is the global leader in Identity Security. Centered on privileged access management, CyberArk provides the mos;

engineer

Sinch

Senior Frontend Software Engineer

Sinch is a Customer Communication Cloud company, directly powering meaningful conversations at scale across messaging, voice and email to help businesses deliver unified,;

front end
senior
dev
engineer
react

Grafana Labs

Senior Backend Engineer - Loki Query (Remote, USA)

This is a remote opportunity and we would be interested in applicants from USA time zones only at this time. Senior Backend Software Engineer - Databases What is Grafana ;

senior
backend
engineer

Engineering Manager, Production Engineering

Job Description

USA Only

Software development Product Manager

17 hours ago