Job Description
At Planner 5D we are looking for a Senior / Lead DevOps specialist to join our team and work on improving and maintaining our IT infrastructure.
Our tech stack: Docker Swarm, Kubernetes, GitLab, nginx, PHP, MySQL, MariaDB, TiDB, Hazelcast, MongoDB, Redis, RabbitMQ, Ansible, Python, Bash
Responsibilities:
- Management, purchase, … of infrastructure hardware/services (Hetzner, Amazon, Google Cloud, LeaderGPU)
- Research and testing of new large-scale solutions (for example TiDB, Kafka, Hadoop, ...)
- Launch, maintenance and monitoring of:
  - Front-end servers, load balancing, mirrors, CDN and DNS (nginx, HAProxy, PowerDNS, ...)
  - Application servers (PHP)
  - Container orchestration (Docker Swarm and Kubernetes)
  - TiDB, MongoDB, MariaDB Galera cluster
  - File storage clusters (GlusterFS, Ceph, ...)
  - Worker servers (rendering, AI, ...), using Ansible, Docker, Kubernetes
  - Monitoring and alerting (Grafana, Prometheus, Loki, ...)
  - Cloud services (BigQuery, SES, ...)
  - Caching and queue servers (Redis, Hazelcast, RabbitMQ, ...)
  - Multiple datacenters around the world
  - Other internal services / servers (VPN, analytics, mailing, ...)
  - Virtual and dedicated servers
- Infrastructure as code: Ansible, Docker, Docker Swarm, Kubernetes, GitLab CI/CD
- Programming for infrastructure using Python; understanding code and how it uses servers/services, in order to architect infrastructure solutions and find issues (mostly in Python, sometimes in other languages like Java, PHP, …)
- Knowing and implementing best practices in security, performance optimization, and maintainability
- Managing and mentoring the infrastructure team, improving the working process, and participating in proactive infrastructure management
Requirements:
- Strong programming skills, ideally with past experience as a software engineer (Java, Python, etc.)
- Deep expertise in Linux administration (Ubuntu, Debian, CentOS), plus networking and system security knowledge
- Expertise in Docker, Kubernetes, and similar tools, including container orchestration and scaling
- On-premise infrastructure expertise: understanding of server clusters, database management, caching, and networking
- Experience with databases and their administration, message brokers, load balancers, front-end servers, … and other servers commonly used in production stacks (MariaDB, MongoDB, Redis, RabbitMQ, Kafka, nginx, HAProxy, …)
- Hands-on experience with infrastructure as code tools (GitLab CI/CD, Ansible, Kubernetes)
- Monitoring, logging, backup and disaster recovery (Grafana, Prometheus, Loki, …)
- A strong engineering mindset: the ability to bridge development and infrastructure needs, and to sort out and understand new and old technologies
- Experience in mentoring, process improvement, and proactive infrastructure management and architecture

This position has the potential to grow into a Team Lead role.