Staff ML Infrastructure Engineer
Location
Oakland, CA | United States
Job description
Job Title: Staff Software Engineer - ML Infrastructure
Location: Hybrid out of SF Bay Area
Compensation/Benefits: $200,000 - $250,000 + generous PTO, 401(k) with match, equity plan
Requirements: 5+ years of experience in software engineering focused on infrastructure design. Strong experience with distributed computing frameworks such as Apache Spark and Ray.
Backed by one of the most renowned VCs in the industry, we have developed a site intelligence platform that helps safety and operations leaders see unseen risks, make strategic decisions, and prevent workplace incidents before they happen. Our client base includes Fortune 500 companies across major industries worldwide.
Due to our growth, we are seeking a highly skilled and experienced Staff ML Infrastructure Engineer to join our team.
What You'll Do
- Build and maintain cloud infrastructure and distributed systems for MLOps.
- Build out an internal training framework that makes research with large distributed models easy and fun.
- Design and develop systems to support Voxel's ML development, with a focus on computer vision applications.
- Provide technical guidance, mentorship, and project management support.
Must-Haves
- Bachelor's degree in Computer Science or a related field.
- 5+ years of experience in software engineering, with a focus on infrastructure design.
- Experience designing large, highly available distributed systems with Kubernetes.
- Proven experience in designing complex systems and strong software engineering skills.
- Strong understanding of and experience with distributed computing frameworks such as Apache Spark and Ray.
- Proficiency in containerization technologies like Docker and orchestration platforms like Kubernetes.
- Demonstrated expertise in DevOps practices, with a focus on ML.
- Experience building ML tools that researchers love.
- Knowledge of advanced ML operations techniques, such as model deployment and monitoring.
- Experience with pipeline automation and data management in ML workflows.
- Previous experience building ML systems and working on ML-adjacent teams.
What's in it for you
- $200,000 - $250,000 a year + Equity
- Generous health, dental, and vision insurance.
- Highly competitive paid parental leave and support system.
- Generous paid time off and/or flexible work arrangements.
- 401(k) retirement plan, HSA options, and pre-tax Commuter Card.
Applicants must be authorized to work in the U.S.
Preferred Skills
ML
AI
Infrastructure
Apache
Spark
Ray
Docker
Kubernetes
ML Infrastructure
DevOps
Job tags
Salary
$200k - $250k