Mlops Engineer
Company | Trunk Tools |
---|---|
Location | San Francisco, CA, USA, Austin, TX, USA, Remote in USA, New York, NY, USA |
Salary | $140000 – $200000 |
Type | Full-Time |
Degrees | Bachelor’s, Master’s |
Experience Level | Senior |
Requirements
- BS/MS in Computer Science, Data Science, or related technical discipline
- 5+ years experience in ML Operations, with at least 3 years focused on scalable AI/ML deployments
- Strong proficiency with cloud infrastructure (preferably AWS), container technologies (Docker, Kubernetes), and modern MLOps frameworks
- Extensive experience managing GPU and CPU resources for specialized AI workloads (Computer Vision, NLP, or LLM fine-tuning)
- Practical experience with data quality and performance monitoring in production ML environments
Responsibilities
- Develop and manage infrastructure for distributed model training (e.g., SageMaker, Ray, Kubernetes)
- Deploy ML models using containerization (Docker), orchestration tools (Kubernetes, ECS), and serving frameworks
- Integrate ML workflows seamlessly with CI/CD pipelines for efficient model building, testing, and deployment
- Create and maintain robust data and ML pipelines using Prefect, Airflow, or custom orchestration tools
- Implement comprehensive experiment tracking (MLflow, Weights & Biases) and observability systems (Arize, Evidently)
- Establish effective monitoring, logging, and governance practices for ML systems.
Preferred Qualifications
- Experience designing data architectures optimized for AI/ML (vector, graph databases)
- Familiarity with RAG systems and agentic applications