Job Description
The Principal AWS Cloud Engineer leads and manages a team of cloud engineers and DevOps, fostering collaboration and innovation to deliver complex cloud solutions. This role involves coordinating with various stakeholders to plan, design, develop, test, and maintain cloud-based applications and infrastructure. The engineer provides technical leadership in AWS architecture, ensures the scalability, security, and cost-efficiency of cloud systems, and maintains high standards of quality.
Key responsibilities include overseeing the implementation of new systems, ensuring timely project delivery, and continually updating technical knowledge. The role also involves maintaining detailed documentation, understanding the release process, and performing system maintenance.
Your responsibilities include:
- Leading and managing a team of AWS cloud engineers, fostering a collaborative and innovative environment to deliver complex cloud solutions.
- Coordinating with scrum masters, project managers, and other engineers to plan, design, develop, test, and maintain cloud-based applications and infrastructure.
- Providing technical leadership in all aspects of AWS cloud architecture and solutions, ensuring best practices and efficient use of AWS services.
- Overseeing the design, development, and implementation of new cloud systems and enhancements to existing systems, ensuring scalability, security, and cost-efficiency.
- Ensuring all cloud engineering projects are delivered on time and meet the agreed-upon quality standards and performance metrics.
- Continually updating technical knowledge and skills, particularly in AWS services and cloud technologies, and sharing them with the team.
- Maintaining high standards of cloud solution quality within the team by establishing and promoting good practices and habits.
- Providing regular performance feedback, coaching, and career development opportunities to team members, fostering their growth and professional development.
- Facilitating root cause analysis of cloud system issues, implementing resolution strategies, and ensuring proactive measures are taken to prevent future occurrences.
- Developing and maintaining detailed and comprehensive documentation and diagrams
- Being knowledgeable in the release process and overall architecture of our systems, ensuring smooth deployment and integration of new features and updates.
- Performing system maintenance procedures even during non-business hours and over the weekends
- Participating in an on-call coverage for support of systems outside of normal business hours
- Ensuring high availability of internal and production systems
- Ensuring that company and business unit standard server technologies (Linux and Windows), proper performance, optimization, and capacity standards are implemented
Requirements
To qualify for the role, you should have experience in most of the following:
- Demonstrates a can-do attitude, flexibility, and a willingness to learn and grow in new areas.
- Problem solver; experiments with new approaches and techniques to improve work processes.
- Team player with strong communication skills
The candidate should also have experience in most of the following:
- AWS Cloud Computing Solutions (Required)
- Terraform and other laC tools
- Experience with Jenkins Pipeline Scripting
- Docker, Apache, TomCat, Wordpress, and Testlink
- Networking, IDS/IPS, and SAN
- Server computing (Linux and Windows administration)
- Virtualization (HyperV, VirtualBox, and VMWare)
- Encryption
- VPN (SonicWall/OpenVPN)
- ITIL Standards
- DB Administration (MySQL)
- Command line scripting: BASH PowerShell, DOS Batch, etc.
- Source code management: GIT and GIT services like Bitbucket
Development skills that employ/use other applicable requirements such as:
- not just being able to navigate the AWS website dashboard but being able to use AWS in code scripts
- not just being able to do docker click-drag & drops but being able to create and modify docker YAML files
- not just being able to tweak IP tables but being able to create scripts to automate or tweak IP tables