Senior Software Engineer, Data

Job Description

We are looking for a Senior Software Data Engineer to join our growing Data Team! Reporting to the Engineering Manager, you will evolve our data models in several styles of datastores and operationalize production-grade data pipelines. As part of this role, you'll collaborate with a world-class team, experience growth and mentorship, and apply data engineering solutions to shape the future of scientific discovery.


You Will:
  • Collaborate with Machine Learning engineers, Fullstack engineers, and the Science team to solve complex document mining challenges, helping us capture and model additional scientific experiments
  • Use your experience to help define and apply best practices for a broad platform of technologies in a cloud-based environment
  • Lead or consult on the authoring of engineering design proposals following the unified Platform Stream roadmap at BenchSci
  • Leverage a deep understanding of the business context and the team’s goals to make independent technical decisions in the face of open-ended requirements
  • Proactively identify new opportunities (from both internal and external sources) and advocate for and implement improvements to the current state of projects
  • Respond with urgency to operational issues and drive urgency in your own team, owning resolution within your sphere of responsibility
  • Challenge the status quo and propose newer technologies or ways of working
  • Scale data pipelines to allow our data to go from research to platform quickly and reliably
  • Work with sources that contain both semi-structured and unstructured data

You Have:
  • A degree in Computer Science/Engineering or a related field within science
  • 5+ years of experience working as a software developer in the industry
  • Proficiency with Python
  • Proficiency with SQL
  • Experience with event-driven architecture using Pub/Sub
  • A track record of building high-quality, maintainable code

Nice to Have:
  • Experience working with GCP
  • ML/Data science exposure
  • Experience working with Auth0 and Terraform
  • Experience working with the LlamaIndex framework and with conversational AI