Senior Software Engineer - Data Products

Job Description

Reddit is a community of communities. It’s built on shared interests, passion, and trust and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 97M+ daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit redditinc.com.

The Data Snapshot team is looking to hire a Senior Software Engineer who is excited to solve large scale batch and streaming data challenges.

Our community of users generates over 100B analytics events per day, each of which is ingested by the Data Infrastructure team into a data warehouse that sees 55,000+ daily queries. We utilize this data to enable both batch and streaming data usage at the company. The team also owns our Streaming Platform that is build using Kafka

As a senior engineer, you will help develop our Snapshot product used to deliver high quality data to partners. You will partner with teams around Reddit to create and execute a strategy to ensure data quality and consistency at scale. 

In your day-to-day, you can expect to:

  • Refine and maintain our data infrastructure technologies to support batch and real-time processing of hundreds of millions of users.
  • Own the tools we use to ingest, store and improve data quality.
  • Design, Build and Deliver end-to-end data solutions to improve the reliability, scalability, latency and efficiency of Reddit’s Data Platform.
  • Implement automation for key elements of the development process, including data quality, managing alerts and handling critical infrastructure operations.
  • Guide and support fellow engineers within the team by serving as a mentor, while actively contributing to the sharing of knowledge through training sessions and comprehensive documentation.

Who you might be:

  • 4+ years of coding experience in a production setting writing clean, maintainable, and well-tested code.
  • Excellent communication skills to collaborate with stakeholders in engineering, data science, machine learning, and product.
  • Experience with programming languages such as Scala, Go, Java, or Python with expertise in SQL languages like BigQuery, SparkSQL or Postgres.
  • Experience working with Terraform, Helm, Kafka, Flink, CDC, Airflow, Prometheus, Docker, Kubernetes, and CI/CD.
  • Degree in Computer Science or equivalent experience.
  • Excellent communication skills tailored for effective collaboration within both a service-oriented team and the broader organizational context

Benefits:

  • Comprehensive Healthcare Benefits
  • 401k Matching
  • Workspace benefits for your home office
  • Personal & Professional development funds
  • Family Planning Support
  • Flexible Vacation (please use them!) & Reddit Global Wellness Days
  • 4+ months paid Parental Leave
  • Paid Volunteer time off

#LI-CK2 #LI-Remote