Job Description
We are looking for a Senior Site Reliability Engineer to join our growing Platform Infrastructure group, Site Reliability Engineering team! Reporting to the Engineering Manager - Infrastructure, you'll apply your technical and domain expertise to solve complex technical and business challenges; respond to and assist with production incidents in collaboration with product teams; participate in design discussions, code reviews, and project-related team meetings; and work with other engineers to develop innovative solutions that meet business needs concerning functionality, performance, observability, scalability, and reliability.
You Will:
Build, deploy, and maintain observability platforms to enable teams to self-serve their metrics gathering and dash-boarding needsLead software and system design initiatives by leveraging cloud-native design patterns and injecting your cloud expertise into the entire development lifecyclePartner with other teams to iterate on and improve BenchSci’s Incident Response processesHelp other teams to respond, mitigate, and remediate production incidentsHelp other teams write effective post-mortems and improve our reliability culture and processesWork with your team, Staff Engineers, and Engineering Managers to help promote SRE best practicesHelp reduce toil and improve developer productivity by automating our team and business processesPartner with engineering and product stakeholders and other cross-functional teams to devise and refine requirementsCommunicate cross-cutting decisions to all potentially impacted teamsYou Have:
5+ years of experience working as a Senior Site Reliability Engineer preferredExpert knowledge of incident response, observability, and reliability tools and techniques in a cloud-native environment (Google Cloud is preferred, but AWS experience is also valuable)Experience with cloud design patterns (Google Cloud is considered an asset) and developing specialized application stacks on cloud services (Python backend, TypeScript frontend)Experience working in Python and JavaScript/TypeScript codebasesEagerness to share your own ideas, and openness to those of others