Machine Learning - Model Serving Job at Alexander Chapman, Fremont, CA

bnVpekkvV3JuS05CUGd2YkY0dWw4UHRxd1E9PQ==
  • Alexander Chapman
  • Fremont, CA

Job Description

We are working with a company building intuitive, voice-first AI systems that blend natural interaction with powerful model performance. Founded by leaders from Meta, Oculus, and Google, they’re creating a new class of consumer devices powered by speech, vision, and LLMs.

The Role

You’ll help optimize and scale the inference stack, working across model serving, performance tuning, and deployment to support real-time, multimodal AI.

What You’ll Do

  • Improve serving systems for LLMs, speech, and vision models.
  • Optimize throughput, latency, and cost using advanced techniques like batching, caching, and kernel tuning.
  • Extend frameworks like VLLM or SGLang to push the limits of performance.
  • Collaborate with training teams to deploy faster, lighter models.
  • Experiment with compilers and hardware backends to boost efficiency.

What We’re Looking For

  • Strong experience with PyTorch or similar ML frameworks.
  • Deep knowledge of model serving and systems performance.
  • Skilled in low-level debugging, bottleneck analysis, and server optimization.
  • Familiar with VLLM, Ray, or deploying inference workloads at scale.
  • Comfortable owning complex infrastructure projects end to end.
  • Background in computer science or related field from a top-tier university (e.g. Stanford, MIT, Ivy League).
  • Experience at a top tech company (e.g. FAANG) or a successful, high-growth startup.

They’re looking for curious, impact-driven engineers ready to push what’s possible with real-time AI.

Job Tags

Similar Jobs

Sanford Health

Physician - Internal Medicine Job at Sanford Health

 ...state-of-the-art clinic, projected to open in the spring of 2021 in the heart of Grand Forks. This 2-story clinic includes an on stage, off stage design for optimum patient satisfaction and efficiency. Practice Detail: * Clinic hours are Mon-Fri, 8:00 AM-5:00 PM * Hospital... 

Gotham Enterprises Ltd

Licensed Marriage and Family Therapist (LMFT) Job at Gotham Enterprises Ltd

 ...Licensed Marriage and Family Therapist (LMFT) Make a Meaningful Impact in California Looking to apply your skills as an LMFT in a setting where you can make a real difference? We have an opportunity for a licensed professional to support individuals, couples, and... 

Archer Travel

Remote Work-From-Home Travel Agent Job at Archer Travel

 ...About the Role: Are you passionate about travel and love helping others plan their dream...  ...to join our team as Remote Travel Agents! No experience is required - we provide full...  ...you'll have the flexibility to work from home, set your own hours, and earn based on your... 

Disability Solutions

Adjunct Instructor - Law & Paralegal Studies, CECH Criminal Justice Job at Disability Solutions

 ...discovering, creating, and reporting knowledge to a diverse set of students. The Law & Paralegal Studies Program in the School of Criminal Justice at the University of Cincinnati is looking for Adjunct Instructors to teach classes in the Law & Paralegal Studies program.... 

Emcor Uk

BMS Manager Job at Emcor Uk

 ...and Steam systems at the GSK Stevenage site, with potential expansion to other sites. You will have management responsibility for the BMS/Controls Team, collaborating with the BMS Systems Manager and client representatives to ensure all aspects of the BMS and controlled...