At Red Canary, we believe data is the key to outsmarting cyber threats. We’re building a state-of-the-art security data lake to empower our customers and teams across Red Canary and we are looking for expertise to join and help shape the future of this technology. You’ll be part of a high-impact, lean team dedicated to building governance and automation solutions that keep our data lake optimized, scalable, and secure. As the center of excellence for Red Canary’s production data lake, this team ensures seamless access, drives continuous evolution, and upholds the highest standards of governance and operational efficiency.
As a Staff Data Engineer, you will be a key contributor to our mission to harness data for securing organizations. Your technical expertise will directly influence Red Canary's ability to detect and respond to cybersecurity threats at scale. By building and optimizing robust data pipelines and frameworks, you will ensure that our platform continues to deliver actionable, data-driven insights to our customers.
Your work will focus on solving complex data challenges, streamlining data processing, and delivering high-quality data solutions that power Red Canary's detection engines. You will contribute significantly to the evolution of our data engineering practices and the enhancement of our cybersecurity capabilities.
What You'll Do
Design, develop, and maintain data pipelines and workflows to support scalable and reliable data processing with cybersecurity considerations at the forefront.
Develop and implement comprehensive data strategies that address the needs of our customers, enhancing Red Canary’s ability to detect and respond to cybersecurity threats.
Collaborate closely with other departments to integrate cybersecurity data insights into our overall service offering.
Oversee the enhancement of data quality, reliability, and security, ensuring our data infrastructure is robust, scalable, and aligned with industry best practices.
Research and implement modern tools and technologies to optimize data pipelines and storage solutions.
Troubleshoot and resolve data pipeline and infrastructure challenges to ensure operational continuity.
Actively contribute to team discussions on architecture, technical designs, and future roadmap items.
What You'll Bring
Demonstrated impact at scale in data engineering, with a strong background in cybersecurity or a related field.
Expertise in programming and scripting languages such as Python, SQL, and Scala or PySpark.
Experience building and maintaining data processing systems using cloud platforms, especially AWS (Glue, EMR, Athena, S3, Redshift).
Strong understanding of modern data storage formats like Parquet and Delta Lake, and open table formats (Iceberg, Hudi).
Familiarity with big data frameworks (Apache Spark, Kafka) and orchestration tools like Apache Airflow.
Experience working with ETL/ELT pipelines and transformation tools (e.g., dbt).
A solid understanding of containerization and orchestration technologies, such as Docker and Kubernetes.
Strong problem-solving skills, with the ability to debug and optimize complex data workflows.
Effective communication skills for working collaboratively across technical and non-technical teams.
A strong desire to mentor and develop others, sharing your expertise to elevate the team's capabilities.