Tecton’s Realtime serving team builds high-performance online infrastructure that power AI inference at 100K+ QPS with millisecond latency. Our services have tight SLAs on availability and latency and are often used in mission-critical applications by enterprises.
This position is open to candidates based anywhere in the United States. You can work in one of our hub offices in San Francisco, New York City, Seattle or work fully remotely from outside those areas within the US.
Responsibilities
Develop and communicate a clear 18-month technical vision to align the team and guide our development efforts
Architect and implement solutions to scale our serving platform to handle millions of requests per second with low latency and high availability
Evolve Tecton’s query execution engine to support complex, multi-stage queries with user-defined Directed Acyclic Graphs (DAGs)
Build an integrated observability solution that provides an exceptional operational experience with logs, metrics, and traces
Launch our serving infrastructure across multiple cloud platforms, ensuring compliance with security protocols and data residency requirements
Assess and prioritize tasks, demonstrating a keen awareness of performance-critical areas
Qualifications
7+ years of experience in programming, debugging, and performance tuning distributed and/or highly concurrent software systems
Degree in Computer Science, Software Engineering, or a related field, or equivalent practical experience, with strong proficiency in building high throughput infrastructure
Experience with Database Query Engines
Experience with at least one of AWS, GCP
Experience with low latency online storage like DynamoDB, Redis, and BigTable
Experience with Data warehouses like Snowflake, BigQuery, Object Storage like S3