Working Nomads

Senior Machine Learning Engineer - Agents data

Canva

Full-time
Austria
machine learning
engineer
python
aws
artificial intelligence

Company Description

Join the team redefining how the world experiences design.

Servus, hey, g'day, mabuhay, kia ora, 你好, hallo, vítejte!

Thanks for stopping by. We know job hunting can be time-consuming and you're probably keen to find out what's on offer, so we'll get straight to the point.

Where and how you can work

Our flagship campus is in Sydney, Australia, but Austria is home to part of our European operations. You have a choice in where and how you work: we trust our Canvanauts to choose the balance that empowers them and their team to achieve their goals.

Fun fact: a big part of our Austrian operations is developing the AI product within Canva to help reimagine how artificial intelligence can be used in design. Pretty cool, huh?

Job Description

At Canva, our mission is to empower the world to design. We’re building AI that feels magical and lands real impact for millions of people, helping anyone create with confidence. We're looking for a Senior Machine Learning Engineer to own the data foundations that power our multimodal agent research: building the pipelines, datasets, and tooling that turn ambitious research ideas into trainable reality.

About the team

We are a cutting-edge post-training team developing new multimodal agentic systems. We work across multimodal modeling, post-training, and design agents: we explore multimodal agentic architectures, build scalable training and evaluation loops, and partner closely with product and platform teams to turn breakthroughs into delightful product features.

About the role

You'll be responsible for the data lifecycle that fuels our agent research: from collection and curation through to preprocessing, quality assurance, and delivery into training pipelines. You'll work closely with research scientists to understand what data is needed, then design and build the systems to make it happen—reliably and at scale. You'll have significant autonomy over how data problems get solved, while aligning on what problems matter most with the broader team.

What you’ll be doing in this role

  • Design and build data pipelines for agent training: collection, filtering, deduplication, formatting, and versioning across text, image, and multimodal sources.

  • Develop tooling for dataset construction—including human annotation workflows, synthetic data generation, and preference data collection for RLHF/DPO-style training.

  • Own data quality: build validation frameworks, monitor for drift and contamination, and establish standards that make datasets trustworthy and reproducible.

  • Create evaluation datasets and benchmarks in collaboration with researchers—curating task distributions that surface real failure modes.

  • Build and maintain infrastructure for efficient data loading, storage, and retrieval at scale (S3, distributed systems, streaming pipelines).

  • Collaborate with research scientists to translate research requirements into concrete data specifications, and iterate as experiments reveal new needs.

  • Document datasets thoroughly: provenance, known limitations, intended use cases, and versioning history.

  • Profile and optimize research code for training and inference efficiency, and implement comprehensive test coverage for data pipelines and ML workflows to ensure reliability and catch regressions early.

  • Elevate codebase quality through code reviews, refactoring, and establishing engineering best practices that help research velocity scale sustainably.

  • Contribute to team roadmaps by identifying data bottlenecks and proposing solutions that unblock research velocity.

You're likely a match if you have

  • Strong software engineering skills in Python, with experience building production-grade data pipelines and ML DevOps.

  • Practical experience with prompt engineering—designing, testing, and refining prompts for reliable LLM/VLM outputs.

  • Experience with ML data workflows: large-scale data processing and loading (Ray or similar), data versioning, and format considerations for training (tokenization, batching, sharding).

  • Hands-on experience working with data pipelines for large-scale distributed ML training runs.

  • Familiarity with annotation tooling and human-in-the-loop data collection (Label Studio or internal systems).

  • Understanding of ML training requirements—you know what 'good data' looks like for LLM/VLM fine-tuning and can anticipate downstream issues.

  • Experience loading and writing large datasets to/from cloud infrastructure (AWS) and distributed storage systems.

  • Strong communication skills: you can work with researchers to scope ambiguous problems and translate needs into actionable plans.

  • A collaborative approach: you're comfortable taking ownership and iterating quickly.

Nice to have

  • Experience with preference data collection for RLHF or reward modeling.

  • Familiarity with multimodal data (image-text pairs, video, design assets).

  • Experience building synthetic data generation pipelines using LLMs.

  • Background in data quality metrics and monitoring systems.

  • Contributions to dataset releases or benchmarks in the ML community.

Additional Information

What's in it for you?

Achieving our crazy big goals motivates us to work hard - and we do - but you'll experience lots of moments of magic, connectivity and fun woven throughout life at Canva, too. We also offer a stack of benefits to set you up for every success in and outside of work.

Here's a taste of what's on offer:

  • Equity packages - we want our success to be yours too

  • Inclusive parental leave policy that supports all parents & carers

  • An annual Vibe & Thrive allowance to support your wellbeing, social connection, home office setup & more

  • Flexible leave options that empower you to be a force for good, take time to recharge, and support you personally

Check out lifeatcanva.com for more info.

Other stuff to know

We make hiring decisions based on your experience, skills and passion, as well as how you can enhance Canva and our culture. When you apply, please tell us the pronouns you use and any reasonable adjustments you may need during the interview process.

Please note that interviews are predominantly conducted virtually. 


About the job

Full-time
Austria
Senior Level
Posted 6 hours ago

Working Nomads curates remote digital jobs from around the web.

© 2025 Working Nomads.