We are seeking a Site Reliability Engineer (SRE) with deep expertise in monitoring, observability, and reliability engineering to support systems running across on-premises infrastructure and Google Cloud Platform (GCP).
This role is primarily responsible for designing, operating, and improving monitoring, alerting, and observability platforms, with a strong focus on Grafana and Kubernetes environments.
As a secondary responsibility, this role provides backup coverage for the Application Support team during periods of resource constraints or major incidents, offering L2/L3 technical support when required.
Monitoring & Observability (Core Focus)
Site Reliability Engineering
Kubernetes & Platform Reliability
Requirements
Technology Stack:
Nice to have:
Benefits
At Devsu, we believe in creating an environment where you can thrive both personally and professionally. By joining our team, you’ll enjoy:
Join Devsu and discover a workplace that values your growth, supports your well-being, and empowers you to make a global impact.