Working Nomads

SRE Lead

First Advantage

Full-time
USA
$120k-$150k per year
devops
python
aws
architecture
azure

At First Advantage (Nasdaq: FA), people are at the heart of everything we do, from our customers and partners to our greatest advantage: our team members. Operating with empathy and compassion, First Advantage fosters a global inclusive workforce devoted to the diverse voices that make up our talent and products. Our team members empower each other to be their authentic selves and treat all with respect, integrity, and fairness. Say hello to a rewarding career, and come join a leading provider of mission-critical background screening solutions to some of the most recognized Fortune 100 and Global 500 brands.

First Advantage is a global leader in background screening, identity, and verification solutions. As we continue to scale our digital platforms and modern cloud-native infrastructure, we are seeking a highly skilled and forward-thinking Lead Site Reliability Engineer (SRE) to drive reliability, resilience, and operational excellence across our systems. The Lead SRE will be responsible for guiding reliability strategy, overseeing complex incident response, improving observability, strengthening automation and CI/CD practices, and partnering closely with engineering teams to embed SRE principles throughout the organization.

This role requires a deep understanding of modern cloud architecture, including both Azure and AWS, as well as expertise in Linux systems, monitoring technologies, and root-cause analysis. This is a senior hands-on engineering role, ideal for someone who enjoys solving difficult problems at scale and mentoring others while driving meaningful improvements to uptime, performance, and customer experience.

What You'll Do:

  • Site Reliability & Platform Stability
      - Lead reliability initiatives across multiple high-availability, large-scale SaaS systems, ensuring platform uptime, performance, and resilience.
      - Build and maintain distributed systems, infrastructure components, and automation tooling to ensure consistent, reliable delivery of production services.
      - Champion proactive reliability engineering, holistic system monitoring, and continuous operational improvements.
      - Partner with architecture, engineering, and operations teams to define SLAs, SLOs, and SLIs.

  • Cloud Engineering (Azure & AWS)
      - Architect, build, and maintain cloud infrastructure using best practices.
      - Guide cloud migrations, cost optimization, and resilience engineering across multi-cloud environments.
      - Implement and enforce cloud security, compliance, and governance standards.

  • DevOps, CI/CD, and Automation
      - Create and maintain CI/CD pipelines using GitHub Actions, Azure DevOps, Jenkins, or equivalent.
      - Automate deployments using IaC tools (Terraform, Bicep, CloudFormation).
      - Reduce manual operational burden through automation and self-service tooling.

  • Monitoring, Observability & Performance
      - Implement observability stacks covering metrics, logs, traces, and synthetic checks.
      - Standardize monitoring practices using industry tooling.
      - Perform performance analysis, load testing, and optimization.

  • Incident Response & Management
      - Serve as Incident Commander for major production incidents.
      - Define and improve incident management processes.
      - Ensure clear communication during outages and lead technical bridges.
      - Deliver high-quality RCAs with actionable follow-ups.

  • Root-Cause Analysis (RCA) & Continuous Improvement
      - Drive deep, data-driven RCAs and long-term reliability improvements.
      - Identify and eliminate systemic issues and operational toil.

  • Leadership, Collaboration & Mentorship
      - Provide technical leadership across teams.
      - Mentor engineers and promote SRE best practices.
      - Foster strong cross-functional partnerships.

What You'll Need to be Successful:

  • 7+ years in SRE, DevOps, Platform Engineering, or Cloud Engineering.

  • Strong expertise in Azure and AWS.

  • Proficiency in CI/CD, automation, and release engineering.

  • Deep monitoring, logging, and observability experience.

  • Incident response leadership experience.

  • Proven RCA experience.

  • Strong Linux skills.

  • Scripting skills (Python, Bash, PowerShell, Go).

  • IaC experience.

  • Strong systems and networking fundamentals.

  • Additional Preferred Qualifications
      - Experience with large-scale distributed systems.
      - Message queues or event streaming knowledge.
      - Familiarity with incident management frameworks.
      - Multi-cloud enterprise experience.
      - Kubernetes, ECS, AKS, or EKS exposure.

Why First Advantage is Your Next Big Career Move

First Advantage is going through a technology transformation! We are looking for experts who are excited to work with advanced technologies and provide best-in-class user experiences, drive the development and deployment of scalable solutions, and smoothly guide our agile teams and clients through meaningful changes as we continue to expand our impact.

What Are You Waiting For? Apply Today!

You have learned a little about us today; we want to learn about you! If you think this position and our company are a great fit for your areas of interest and expertise, tell us about yourself by applying now!

The salary range for this position is approximately $120,000 - $150,000 base annually. This range reflects our good faith estimate to pay fairly as to what our ideal candidates are likely to expect, and we tailor our offers within the range based on the selected candidate's experience, industry knowledge, technical and communication skills, and other factors that may prove relevant during the interview process.

United States Equal Opportunity Employment:

First Advantage is proud to be a global leader in removing barriers and supporting our community members to ensure the changing demographics of the workforce are reflected in our hiring and employment practices. We value all of our candidates, employees, and clients, and place great emphasis on hiring and supporting qualified individuals in each role. We are an equal opportunity employer. We do not discriminate on the basis of race, color, ethnicity, ancestry, religion, sex, national origin, sexual orientation, age, citizenship status, marital status, disability, gender identity, gender expression, veteran status, genetic information, or any other area protected by applicable law.

About the job

Full-time
USA
Senior Level
$120k-$150k per year
Posted 10 hours ago


Working Nomads curates remote digital jobs from around the web.

© 2026 Working Nomads.