Are you ready to join a fun, nimble team that thrives on collaboration and innovation?
At Azul, we are dedicated to advancing our technology and infrastructure, and we are looking for passionate individuals to be part of our journey. As a member of our team, you will have the opportunity to work alongside talented Engineers who are committed to building and maintaining a secure and high-performance cloud infrastructure.
What You'll Do (aka the Responsibilities)
Manage connectivity between within and across multiple Cloud providers (AWS, GCP, and Azure)
Support Cloud and on-premise Kubernetes stacks
Design and implement IT infrastructure
You will develop and support CI/CD pipelines
Develop and support observability and alerting infrastructure
Work with a team of Cloud Operations Engineers to help build and maintain the systems and code that allow us to provide an always available, secure, and performant cloud infrastructure
Work with internal Engineering Teams to support the deployment and monitoring of their products
Automate monitoring of cloud infrastructure using Open Telemetry, Prometheus, Grafana and other observability tools
Deploy/provision new cloud infrastructure using automation like terraform, argocd, helm, ansible, boto3 (Python)
Develop automated remediation for system faults to remove points of failure in cloud infrastructure
Evaluate and make recommendations about stacks, tooling, and engineering best-practices
What You'll Bring (aka Education and Experience)
Bachelor's degree in computer science, Engineering, or a related field, or equivalent work experience.
5+ years of experience in a DevOps or Site Reliability Engineering (SRE) role, with a proven track record of managing large-scale infrastructure.
Linux proficiency
Familiarity with OpenStack
A strong understanding of networking. The ability to diagnose and understand network issues. (BGP, IPsec, VXLAN, Geneve, 802.1Q, etc.)
Expertise in AWS, Azure, GCP, and cloud-native technologies.
In-depth knowledge of CI/CD tools (Jenkins, GitLab CI, ArgoCD, etc.) and best practices.
Experience with infrastructure-as-code tools, such as Terraform, CloudFormation, Ansible, etc.
Experience with containerization (Docker) and orchestration tools (Kubernetes, OpenShift).