Reporting to the Engineering Manager, this Senior/Staff Software Engineer - Registry Platform will own the technical architecture and evolution of our Registry product. This role will lead the critical transition from our legacy Model Registry to a scalable, organization-wide Global Registry platform, while also architecting for a future that supports the extensive needs of large foundation model builders and customers focused on ML model fine-tuning, compliance, system-of-record requirements, and comprehensive monitoring capabilities.
This Senior/Staff Software Engineer will partner closely with product managers, ML platform engineers, and the design team to build and implement a robust, performant registry system that meets the growing demands of enterprise ML teams while maintaining a seamless user experience during the transition.
Responsibilities:
Lead the architectural transformation of our Registry platform, designing systems that scale to support millions to billions of artifacts for our customers.
Dive deep into system architecture to uncover optimization opportunities, solve complex bugs, and make difficult technical decisions that balance short-term fixes with long-term platform health.
Identify and refactor areas of technical debt, improving code maintainability and developer productivity.
Mentor a team of 3-4 Software Engineers through complex technical challenges, elevating their capabilities through code reviews and design guidance.
Establish engineering patterns and best practices that balance immediate feature delivery with long-term platform health.
Collaborate with product and design to translate complex user requirements into elegant technical implementations.
Interface directly with customers to understand their ML workflow challenges, gathering feedback and incorporating it into the product development cycle.
Practice agile methodologies with users at the center, maintaining a closed feedback loop from customer need to technical implementation to validation.
Requirements:
7+ years of software engineering experience, with 3+ years leading significant technical initiatives or architectural changes
Strong proficiency in Python
Strong proficiency in at least one compiled language (C++, C, C#, Go), with Go experience being a significant bonus
Expert-level TypeScript knowledge with demonstrated ability to implement deep frontend optimizations and complex state management
Advanced experience with GraphQL APIs, query optimization, and sophisticated data fetching strategies
Demonstrated success improving performance and maintainability of complex web applications
Track record of mentoring engineers and elevating team capabilities through technical leadership
Comfort diving into complex systems to diagnose and solve difficult bugs across the full stack
Exceptional communication skills, with the ability to explain complex technical concepts to various audiences
Benefits:
🏝️ Flexible time off
🩺 Medical, dental, and vision coverage for employees and their families
🏠 Remote-first culture with in-office flexibility in San Francisco
💵 Home office budget with a new high-powered laptop
🥇 Truly competitive salary and equity
🚼 12 weeks of parental leave (U.S. specific)
📈 401(k) (U.S. specific)
Supplemental benefits may be available depending on your location
Why join us?
Top-tier machine learning teams rely on our tools for their daily work at companies including OpenAI, Toyota Research Institute, Lyft, Samsung, and Pandora.
You'll never stop learning. This role gives you first-hand experience talking with leading researchers in the field, understanding their problems, and directly shaping the product direction.
Our experienced founding team previously built and sold ML tools at Figure Eight, and their deep knowledge of our industry, empathy for our users, and skillful management are driving W&B to success.
Customers genuinely benefit from our tool. Here's a quote from Wojciech Zaremba, Cofounder and Robotics Lead, OpenAI: "W&B allows to scale up insights from a single researcher to the entire team, and from a single machine to hundreds of them."