As a Senior DevOps Engineer, you will help design, build, and maintain a stable and efficient infrastructure to optimize service delivery cross production throughout the development lifecycle. Additionally, this role will monitor, troubleshoot, maintain, and continuously improve building, packaging and deployment processes. The Senior DevOps Engineer will collaborate within the Cloud Ops team as well as with QA and development to troubleshoot performance issues.
Job Responsibilities
Design and implement advanced CI/CD pipelines for complex applications.
Develop and maintain infrastructure automation scripts and templates.
Troubleshoot and resolve complex issues related to systems, DBs and applications.
Implement advanced security measures and compliance standards.
Mentor junior team members.
Manage and optimize cloud resources for cost-efficiency.
Lead or participate in incident response and disaster recovery efforts.
Create run-books for troubleshooting
Participate in on-call rotation
Other duties as assigned
Required Qualifications
Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent experience).
Minimum of 5 years of experience
In-depth knowledge of DevOps principles, practices, and technologies including Amazon Web Services (AWS) and Terraform or other Infrastructure as Code (IaC)
Strong proficiency of Linux operating systems and troubleshooting OS issues
Strong understanding of setting up and managing monitoring tools (such as Prometheus, Grafana, Datadog, Nagios, Open Telemetry, ELK, or similar tool)
Advanced knowledge of scripting and automation utilizing Python
Proficiency with version control systems (e.g., Git).
Strong experience with automation and scripting languages (e.g., Python, Bash, or Ruby).
In-depth familiarity with configure management tools (Ansible or Chef)
Deep understanding of:
Networking concepts and principles
Version Control Systems (such as Git)
CI/CD tools such as Jenkins, Gitlab CI/CD, Github or similar tool
Containerization and orchestration (Docker, Kubernetes).
Proficiency with cloud platforms (AWS, Azure, or Google Cloud).
Knowledge related to security best practices including identity and access management, encryption, and vulnerability assessments
Strong problem-solving, collaboration, teamwork and communication skills.
Eagerness to learn and adapt to new technologies and tools.