Senior MLOps Engineer Job at DeepRec.ai, San Jose, CA

  • DeepRec.ai
  • San Jose, CA

Job Description

Senior MLOps Engineer

We are hiring an MLOps Engineer for a fast-moving AI startup that is building a world-class AI-powered video platform.

The company is looking for a skilled, hands-on MLOps Engineer to join its growing team. You will play a critical role in deploying, scaling, and maintaining its machine learning infrastructure, supporting a range of tools that enable the controlled generation of high-quality animated videos.

Key Responsibilities

  • Design, deploy, and maintain scalable training and data-processing pipelines on distributed compute clusters (e.g., Slurm, Kubernetes, or cloud-native equivalents).
  • Optimize inference systems for latency and cost in a production setting.
  • Collaborate closely with ML researchers and engineers to productionize deep learning models.
  • Implement robust monitoring, logging, and alerting systems for model performance and infrastructure reliability.
  • Automate model testing, validation, and deployment processes across staging and production environments.
  • Ensure efficient usage of compute resources, including GPU clusters, and help identify bottlenecks or cost-saving opportunities.

Requirements

  • Proven experience in MLOps, ML infrastructure, or related roles.
  • Deep expertise in deploying and maintaining ML training pipelines on distributed systems.
  • Strong knowledge of inference optimization techniques, especially in reducing latency and cost at scale.
  • Proficiency with cloud platforms (AWS, GCP, Azure) and orchestration tools (Kubernetes, Docker).
  • Experience working with GPU scheduling, distributed training (e.g., PyTorch DDP), and model serving frameworks (e.g., Triton, TorchServe).
  • Familiarity with CI/CD for ML workflows.
  • Strong Python skills and experience with ML/DL frameworks like PyTorch or TensorFlow.

Bonus Points

  • Experience working in the creative media or animation industry.
  • Exposure to video processing, generative AI, or large-scale content production systems.
  • Experience collaborating with research teams or integrating research code into production pipelines.

Please apply for more information.
