Machine Learning Engineer
About Moonvalley
Moonvalley's mission is to solve Visual Intelligence in the age of generative AI. We are building technology that can tell stories, scale creativity, and understand both the physics and semantics of the world. With Marey, our first high-definition foundation model trained exclusively on licensed data, we are powering the next era of cinematic, commercial, and enterprise-grade creation.
Our team is an unprecedented convergence of talent across industries. Our elite AI scientists from Deepmind, Google, Microsoft, Meta & Snap, have decades of collective experience in machine learning and computational creativity. We have also established the first AI-enabled movie studio in Hollywood, filled with accomplished filmmakers and visionary creative talent. We work with the top producers, actors, and filmmakers in Hollywood as well as creative-driven global brands. So far we've raised over $100M+ from world-class investors including General Catalyst, Bessemer, Khosla Ventures & YCombinator – and we're just getting started.
Job Summary
We’re looking for a Machine Learning Engineer to design and scale the infrastructure powering our generative video and visual models, particularly for our latest product verv.fm . This role combines deep ML engineering expertise with hands-on experience in modern generative systems like Stable Diffusion, Comfy, and Flux.
You’ll build data ingestion pipelines, develop visual preprocessing workflows, and fine-tune generative models with production-level reliability. You’ll also help drive prompt-aware systems and intelligent feedback loops that elevate both the quality and control of our media generation stack.
What you’ll do
Design and implement data workflows for ingestion, cleaning, validation, filtering, and quality scoring
Fine-tune and deploy generative models like Stable Diffusion and Flux Loras and build large-scale workflows using tools such as Comfy
Experience training and working with video models like WAN, VACE, etc
Integrate computer vision techniques (segmentation, mask ops, object tracking) into generation pipelines
Build high-throughput pipelines for frame extraction, captioning, and visual data processing at scale
Develop and experiment with context- and prompt-aware model orchestration strategies
Contribute to observability and monitoring across the ML data lifecycle
Collaborate with infra teams to scale across GPU-backed infrastructure and serverless environments (e.g., Fal.ai)
Work across a fast-paced, evolving product environment with creative and technical inputs
(Bonus) Explore self-supervising or agentic workflows for automated pipeline feedback and improvement
What we’re looking for
Strong experience in ML engineering with a focus on computer vision, generative media, or multimodal systems
Hands-on experience fine-tuning or deploying generative models (e.g., SD, Flux, Comfy)
Proficiency in Python and asynchronous API development
Familiarity with image/video-specific challenges: frame alignment, codec handling, perceptual quality scoring
Experience with scalable data systems using tools like Airflow, Spark, or Ray
Solid understanding of GPU infrastructure and model deployment best practices
Knowledge of prompt engineering and context-driven model behavior
Comfortable working in ambiguity and bridging infrastructure and modeling challenges
Bonus: Experience with foundation model training pipelines or agentic systems
Availability for at least 4 hours of overlap with US Eastern Time
What we offer (compensation & benefits)
Competitive salary and equity
Private health coverage
Pension contribution (UK, Canada, US)
Unlimited paid vacation
Fully-distributed, async-first culture
Hardware setup of your choice
Stipends for phone, internet, and meals
In our team, we approach our work with the dedication similar to Olympic athletes. Anticipate occasional late nights and weekends dedicated to our mission. We understand this level of commitment may not suit everyone, and we openly communicate this expectation.
If you're motivated by deeply technical problems, a seemingly never-ending uphill battle and the opportunity to build (and own) a generational technology company, we can give you what you're looking for.
All business roles at Moonvalley are hybrid positions by default, with some fully remote depending on the job scope. We meet a few times every year, usually in London, UK or North America (LA, Toronto) as a company.
If you're excited about the opportunity to work on cutting-edge AI technology and help shape the future of media and entertainment, we encourage you to apply. We look forward to hearing from you!
The statements contained in this job description reflect general details as necessary to describe the principal functions of this job, the level of knowledge and skill typically required and the scope of responsibility. It should not be considered an all-inclusive listing of work requirements. Individuals may perform other duties as assigned, including work in other functional areas to cover absences, to equalize peak work periods, or to otherwise balance organizational work
Moonvalley AI is proud to be an equal opportunity employer. We are committed to providing accommodations. If you require accommodation, we will work with you to meet your needs.
Please be assured we'll treat any information you share with us with the utmost care, only use your information for recruitment purposes and will never sell it to other companies for marketing purposes. Please review our privacy policy and job applicant privacy policy located here for further information.
About the job
Apply for this position
Machine Learning Engineer
About Moonvalley
Moonvalley's mission is to solve Visual Intelligence in the age of generative AI. We are building technology that can tell stories, scale creativity, and understand both the physics and semantics of the world. With Marey, our first high-definition foundation model trained exclusively on licensed data, we are powering the next era of cinematic, commercial, and enterprise-grade creation.
Our team is an unprecedented convergence of talent across industries. Our elite AI scientists from Deepmind, Google, Microsoft, Meta & Snap, have decades of collective experience in machine learning and computational creativity. We have also established the first AI-enabled movie studio in Hollywood, filled with accomplished filmmakers and visionary creative talent. We work with the top producers, actors, and filmmakers in Hollywood as well as creative-driven global brands. So far we've raised over $100M+ from world-class investors including General Catalyst, Bessemer, Khosla Ventures & YCombinator – and we're just getting started.
Job Summary
We’re looking for a Machine Learning Engineer to design and scale the infrastructure powering our generative video and visual models, particularly for our latest product verv.fm . This role combines deep ML engineering expertise with hands-on experience in modern generative systems like Stable Diffusion, Comfy, and Flux.
You’ll build data ingestion pipelines, develop visual preprocessing workflows, and fine-tune generative models with production-level reliability. You’ll also help drive prompt-aware systems and intelligent feedback loops that elevate both the quality and control of our media generation stack.
What you’ll do
Design and implement data workflows for ingestion, cleaning, validation, filtering, and quality scoring
Fine-tune and deploy generative models like Stable Diffusion and Flux Loras and build large-scale workflows using tools such as Comfy
Experience training and working with video models like WAN, VACE, etc
Integrate computer vision techniques (segmentation, mask ops, object tracking) into generation pipelines
Build high-throughput pipelines for frame extraction, captioning, and visual data processing at scale
Develop and experiment with context- and prompt-aware model orchestration strategies
Contribute to observability and monitoring across the ML data lifecycle
Collaborate with infra teams to scale across GPU-backed infrastructure and serverless environments (e.g., Fal.ai)
Work across a fast-paced, evolving product environment with creative and technical inputs
(Bonus) Explore self-supervising or agentic workflows for automated pipeline feedback and improvement
What we’re looking for
Strong experience in ML engineering with a focus on computer vision, generative media, or multimodal systems
Hands-on experience fine-tuning or deploying generative models (e.g., SD, Flux, Comfy)
Proficiency in Python and asynchronous API development
Familiarity with image/video-specific challenges: frame alignment, codec handling, perceptual quality scoring
Experience with scalable data systems using tools like Airflow, Spark, or Ray
Solid understanding of GPU infrastructure and model deployment best practices
Knowledge of prompt engineering and context-driven model behavior
Comfortable working in ambiguity and bridging infrastructure and modeling challenges
Bonus: Experience with foundation model training pipelines or agentic systems
Availability for at least 4 hours of overlap with US Eastern Time
What we offer (compensation & benefits)
Competitive salary and equity
Private health coverage
Pension contribution (UK, Canada, US)
Unlimited paid vacation
Fully-distributed, async-first culture
Hardware setup of your choice
Stipends for phone, internet, and meals
In our team, we approach our work with the dedication similar to Olympic athletes. Anticipate occasional late nights and weekends dedicated to our mission. We understand this level of commitment may not suit everyone, and we openly communicate this expectation.
If you're motivated by deeply technical problems, a seemingly never-ending uphill battle and the opportunity to build (and own) a generational technology company, we can give you what you're looking for.
All business roles at Moonvalley are hybrid positions by default, with some fully remote depending on the job scope. We meet a few times every year, usually in London, UK or North America (LA, Toronto) as a company.
If you're excited about the opportunity to work on cutting-edge AI technology and help shape the future of media and entertainment, we encourage you to apply. We look forward to hearing from you!
The statements contained in this job description reflect general details as necessary to describe the principal functions of this job, the level of knowledge and skill typically required and the scope of responsibility. It should not be considered an all-inclusive listing of work requirements. Individuals may perform other duties as assigned, including work in other functional areas to cover absences, to equalize peak work periods, or to otherwise balance organizational work
Moonvalley AI is proud to be an equal opportunity employer. We are committed to providing accommodations. If you require accommodation, we will work with you to meet your needs.
Please be assured we'll treat any information you share with us with the utmost care, only use your information for recruitment purposes and will never sell it to other companies for marketing purposes. Please review our privacy policy and job applicant privacy policy located here for further information.