MENU
  • Remote Jobs
  • Companies
  • Go Premium
  • Job Alerts
  • Post a Job
  • Log in
  • Sign up
Working Nomads logo Working Nomads
  • Remote Jobs
  • Companies
  • Post Jobs
  • Go Premium
  • Get Free Job Alerts
  • Log in

Site Reliability Engineer

Phaidra

Full-time
USA, Canada
$93k-$178k per year
engineer
devops
javascript
python
docker
The job listing has expired. Unfortunately, the hiring company is no longer accepting new applications.

To see similar active jobs please follow this link: Remote Development jobs

Who You Are

Phaidra is looking for a driven Site Reliability Engineer to be a part of our engineering team. You are bold and creative, and have deep empathy for customers who may not be tech-savvy. You will work in the Infrastructure Engineering team to build and maintain world class infrastructure. You will have the opportunity to make an immediate impact with your work and guide the product and team as we grow.

*We are seeking a team member located within one of the following areas: USA and Canada

  • In the United States, we accept applicants located in the following states: California, Colorado, Connecticut, Georgia, Indiana, Maryland, Minnesota, Missouri, Nebraska, New York, North Carolina, Pennsylvania, South Carolina, Tennessee, Texas, Virginia, Washington. 

  • In Canada, we accept applicants located in the following provinces: Ontario, British Columbia, and Alberta.

Responsibilities

The ideal candidate has expertise building and managing cloud infrastructure on AWS, GCP or Azure and has good knowledge of Kubernetes, CI/CD and Observability. Your responsibilities will include flavors of Infrastructure Engineering, MLOps, SRE and DevOps. As a Site Reliability Engineer, it will be expected of you to be an Individual Contributor (IC) in the Infrastructure Engineering team.

  • You will help build and maintain infrastructure for:

    • Large-scale data ingestion and processing.

    • Distributed model training, evaluation and inference.

    • Automating the end-to-end system for continuous improvement and deployment.

    • Developer environments and build systems.

    • Multi-cloud deployments.

  • You will work with cloud services like AWS, Azure, GCP.

  • You will work with Cloud Native technologies like Kubernetes, Prometheus and gRPC.

  • You will help build CI/CD infrastructure, pipelines and take part in DevOps duties.

  • You will apply SRE principles for observability, SLOs, automation and change management.

  • You will write and maintain tooling and documentation for infrastructure, supported applications and processes.

  • Build and maintain cross-functional relationships with internal teams to drive initiatives.

Key Qualifications

  • 5+ years of work experience.

  • Bachelors or Masters in Computer Science, or equivalent experience.

  • Proven experience automating Cloud and Networking infrastructure on AWS, GCP or Azure.

  • Good understanding of Linux-based Operating Systems, Containerisation and Orchestration technologies like Docker and Kubernetes.

  • Experience with Terraform or other configuration management tools like Jsonnet, Kapitan, Helm or Kustomize.

  • Experience with Monitoring stacks such as Prometheus, Influx, Stackdriver or Zabbix.

  • Programming experience, ideally with Python, Go or Bash scripting.

  • Experience with writing Kubernetes Operators.

  • Good understanding of DevOps, SRE principles and Platform Engineering.

  • Share our company values: curiosity, ownership, transparency & directness, outcome-based performance, and customer empathy.

Preferred Skills & Experience

  • Expertise with multi and hybrid cloud environments.

  • Experience with Software Engineering.

  • Expertise with some parts of our tech stack is a big plus.

  • Experience in automating scalable multi-tenant systems architectures with high availability, fault tolerance, performance tuning, monitoring, and statistics/metrics collection.

Our Stack

  • Languages - (Backend) Python, Go; (Frontend) JavaScript/TypeScript, React; Customer SDK & Clients - C# .NET

  • PyTorch

  • Cypress

  • Docker, Kubernetes, Terraform & Kapitan

  • Custom Kubernetes Operators (with kopf)

  • Gitlab CI, ArgoCD, Atlantis, Vercel

  • GCP - GKE, PubSub, CloudSQL, BigTable, Postgres, etc.

  • Ray.io

  • REST & gRPC micro-services

  • Poetry, Pantsbuild

Onboarding

In your first 30 days...

  • You will be immersed in an onboarding program that introduces you to Phaidra and our product.

  • You will spend time in the Engineering org, learning how the teams operate, interact, and approach problems.

  • You will read various parts of our handbook and familiarize yourself with the documentation culture at Phaidra.

  • You will set up your development environment and start working on an onboarding exercise that will introduce you to various parts of our code and infrastructure base.

  • You will learn about how we use agile and be able to navigate our sprint boards and backlogs.

  • You will learn about various team standards and development & release processes.

  • You will start to learn about our system architecture and infrastructure.

By your first 60 days...

  • You will have a solid understanding of what Phaidra does and how we do it.

  • You will have met with team members across Phaidra and started building relationships that will help you be successful at your job.

  • You will have completed the onboarding exercise and will be on your way to completing your first production task.

By your first 90 days...

  • You will have been fully integrated in the team and with team members across the company.

  • You will get a more in-depth understanding of our system architecture and infrastructure.

  • You will have completed your first on-call experience helping monitor and improve our production environments.

  • You will have become an expert with our tooling.

  • You will have started to contribute to knowledge sharing throughout Phaidra.

General Interview Process

All of our interviews are held via Google Meet, and an active camera connection is required.

  • Initial screening interview with a People Operations team member (30 minutes)

  • Meeting with Director, Infrastructure Engineering (30 minutes)

  • Take Home Exercise

  • Meeting with Site Reliability Engineers (60 minutes)

  • Meeting with VP of Engineering (60 minutes)

  • Culture fit interview with Phaidra’s co-founders (30 minutes)

Base Salary Range

  • US Residents: $92,800-$178,000/year

  • Canada Residents: CA$113,600-CA$180,000/year

This position will also include equity.

These are best faith estimates of the base salary range for this position. It is important to note that the salary bands provided are inclusive of multiple levels and the actual candidate level will be determined during the interview process. In addition to this, other factors such as experience, education, and location will be taken into consideration when deciding final compensation.

About the job

Full-time
USA, Canada
$93k-$178k per year
14 Applicants
Posted 7 months ago
engineer
devops
javascript
python
docker
Enhancv advertisement

30,000+
REMOTE JOBS

Unlock access to our database and
kickstart your remote career
Join Premium

Site Reliability Engineer

Phaidra
The job listing has expired. Unfortunately, the hiring company is no longer accepting new applications.

To see similar active jobs please follow this link: Remote Development jobs

Who You Are

Phaidra is looking for a driven Site Reliability Engineer to be a part of our engineering team. You are bold and creative, and have deep empathy for customers who may not be tech-savvy. You will work in the Infrastructure Engineering team to build and maintain world class infrastructure. You will have the opportunity to make an immediate impact with your work and guide the product and team as we grow.

*We are seeking a team member located within one of the following areas: USA and Canada

  • In the United States, we accept applicants located in the following states: California, Colorado, Connecticut, Georgia, Indiana, Maryland, Minnesota, Missouri, Nebraska, New York, North Carolina, Pennsylvania, South Carolina, Tennessee, Texas, Virginia, Washington. 

  • In Canada, we accept applicants located in the following provinces: Ontario, British Columbia, and Alberta.

Responsibilities

The ideal candidate has expertise building and managing cloud infrastructure on AWS, GCP or Azure and has good knowledge of Kubernetes, CI/CD and Observability. Your responsibilities will include flavors of Infrastructure Engineering, MLOps, SRE and DevOps. As a Site Reliability Engineer, it will be expected of you to be an Individual Contributor (IC) in the Infrastructure Engineering team.

  • You will help build and maintain infrastructure for:

    • Large-scale data ingestion and processing.

    • Distributed model training, evaluation and inference.

    • Automating the end-to-end system for continuous improvement and deployment.

    • Developer environments and build systems.

    • Multi-cloud deployments.

  • You will work with cloud services like AWS, Azure, GCP.

  • You will work with Cloud Native technologies like Kubernetes, Prometheus and gRPC.

  • You will help build CI/CD infrastructure, pipelines and take part in DevOps duties.

  • You will apply SRE principles for observability, SLOs, automation and change management.

  • You will write and maintain tooling and documentation for infrastructure, supported applications and processes.

  • Build and maintain cross-functional relationships with internal teams to drive initiatives.

Key Qualifications

  • 5+ years of work experience.

  • Bachelors or Masters in Computer Science, or equivalent experience.

  • Proven experience automating Cloud and Networking infrastructure on AWS, GCP or Azure.

  • Good understanding of Linux-based Operating Systems, Containerisation and Orchestration technologies like Docker and Kubernetes.

  • Experience with Terraform or other configuration management tools like Jsonnet, Kapitan, Helm or Kustomize.

  • Experience with Monitoring stacks such as Prometheus, Influx, Stackdriver or Zabbix.

  • Programming experience, ideally with Python, Go or Bash scripting.

  • Experience with writing Kubernetes Operators.

  • Good understanding of DevOps, SRE principles and Platform Engineering.

  • Share our company values: curiosity, ownership, transparency & directness, outcome-based performance, and customer empathy.

Preferred Skills & Experience

  • Expertise with multi and hybrid cloud environments.

  • Experience with Software Engineering.

  • Expertise with some parts of our tech stack is a big plus.

  • Experience in automating scalable multi-tenant systems architectures with high availability, fault tolerance, performance tuning, monitoring, and statistics/metrics collection.

Our Stack

  • Languages - (Backend) Python, Go; (Frontend) JavaScript/TypeScript, React; Customer SDK & Clients - C# .NET

  • PyTorch

  • Cypress

  • Docker, Kubernetes, Terraform & Kapitan

  • Custom Kubernetes Operators (with kopf)

  • Gitlab CI, ArgoCD, Atlantis, Vercel

  • GCP - GKE, PubSub, CloudSQL, BigTable, Postgres, etc.

  • Ray.io

  • REST & gRPC micro-services

  • Poetry, Pantsbuild

Onboarding

In your first 30 days...

  • You will be immersed in an onboarding program that introduces you to Phaidra and our product.

  • You will spend time in the Engineering org, learning how the teams operate, interact, and approach problems.

  • You will read various parts of our handbook and familiarize yourself with the documentation culture at Phaidra.

  • You will set up your development environment and start working on an onboarding exercise that will introduce you to various parts of our code and infrastructure base.

  • You will learn about how we use agile and be able to navigate our sprint boards and backlogs.

  • You will learn about various team standards and development & release processes.

  • You will start to learn about our system architecture and infrastructure.

By your first 60 days...

  • You will have a solid understanding of what Phaidra does and how we do it.

  • You will have met with team members across Phaidra and started building relationships that will help you be successful at your job.

  • You will have completed the onboarding exercise and will be on your way to completing your first production task.

By your first 90 days...

  • You will have been fully integrated in the team and with team members across the company.

  • You will get a more in-depth understanding of our system architecture and infrastructure.

  • You will have completed your first on-call experience helping monitor and improve our production environments.

  • You will have become an expert with our tooling.

  • You will have started to contribute to knowledge sharing throughout Phaidra.

General Interview Process

All of our interviews are held via Google Meet, and an active camera connection is required.

  • Initial screening interview with a People Operations team member (30 minutes)

  • Meeting with Director, Infrastructure Engineering (30 minutes)

  • Take Home Exercise

  • Meeting with Site Reliability Engineers (60 minutes)

  • Meeting with VP of Engineering (60 minutes)

  • Culture fit interview with Phaidra’s co-founders (30 minutes)

Base Salary Range

  • US Residents: $92,800-$178,000/year

  • Canada Residents: CA$113,600-CA$180,000/year

This position will also include equity.

These are best faith estimates of the base salary range for this position. It is important to note that the salary bands provided are inclusive of multiple levels and the actual candidate level will be determined during the interview process. In addition to this, other factors such as experience, education, and location will be taken into consideration when deciding final compensation.

Working Nomads

Post Jobs
Premium Subscription
Sponsorship
Free Job Alerts

Job Skills
API
FAQ
Privacy policy
Terms and conditions
Contact us
About us

Jobs by Category

Remote Administration jobs
Remote Consulting jobs
Remote Customer Success jobs
Remote Development jobs
Remote Design jobs
Remote Education jobs
Remote Finance jobs
Remote Legal jobs
Remote Healthcare jobs
Remote Human Resources jobs
Remote Management jobs
Remote Marketing jobs
Remote Sales jobs
Remote System Administration jobs
Remote Writing jobs

Jobs by Position Type

Remote Full-time jobs
Remote Part-time jobs
Remote Contract jobs

Jobs by Region

Remote jobs Anywhere
Remote jobs North America
Remote jobs Latin America
Remote jobs Europe
Remote jobs Middle East
Remote jobs Africa
Remote jobs APAC

Jobs by Skill

Remote Accounting jobs
Remote Assistant jobs
Remote Copywriting jobs
Remote Cyber Security jobs
Remote Data Analyst jobs
Remote Data Entry jobs
Remote English jobs
Remote Spanish jobs
Remote Project Management jobs
Remote QA jobs
Remote SEO jobs

Jobs by Country

Remote jobs Australia
Remote jobs Argentina
Remote jobs Brazil
Remote jobs Canada
Remote jobs Colombia
Remote jobs France
Remote jobs Germany
Remote jobs Ireland
Remote jobs India
Remote jobs Japan
Remote jobs Mexico
Remote jobs Netherlands
Remote jobs New Zealand
Remote jobs Philippines
Remote jobs Poland
Remote jobs Portugal
Remote jobs Singapore
Remote jobs Spain
Remote jobs UK
Remote jobs USA


Working Nomads curates remote digital jobs from around the web.

© 2025 Working Nomads.