Intermediate Site Reliability Engineer - Durability

GitLab

Full-time
Anywhere
engineer
architecture
front end
html
saas
The job listing has expired. Unfortunately, the hiring company is no longer accepting new applications.

Site Reliability Engineer, Durability

An overview of this role

As a Site Reliability Engineer (SRE) at GitLab, you are responsible for keeping all user-facing services and other GitLab production systems running smoothly. SREs are a blend of pragmatic operators and software craftspeople who apply sound engineering principles, operational discipline, and mature automation to our operating environments and the GitLab codebase.

GitLab SREs specialize in systems (operating systems, storage subsystems, networking), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems.

What you’ll do  

  • Design and implement highly scalable infrastructure to support the needs of current and future GitLab.com.

  • Collaborate closely with cross-functional teams and other teams throughout Infrastructure on projects to drive GitLab’s future.

  • Respond to incidents on an on-call rotation (our team is distributed globally, so you are only on call during your daytime hours!) and participate in incident reviews.

  • Act as a subject matter expert within the GitLab Infrastructure department, specializing in our edge services and Kubernetes workloads.

  • Automate every operational task.

What you’ll bring 

  • Experience with the Kubernetes ecosystem including Helm.

  • Google Cloud Platform expertise, specifically around networking, GKE configuration, and scaling.

  • Experience with Terraform infrastructure as code.

  • Experience with configuration management tools such as Ansible and Chef.

  • Programming skills in Go or Ruby.

  • Ability to clearly define problems and think beyond initial solutions, looking at how to make things better in the future.

  • A drive for automating everything.

  • Ability to be a manager of one and have a strong bias for action.

  • An independent, proactive, and self-organized mindset.

  • An ability to clearly communicate asynchronously.

  • Excitement about doing something different every day, from project work to production change requests to emergency response.

About the team

Durability is responsible for safeguarding and securing customer data stored by the GitLab application, and sets guidelines for data access. Running the largest GitLab instance in existence (and, in fact, one of the largest single-tenancy open-source SaaS sites on the Internet) means we constantly face unique and rewarding challenges that directly impact our users every day. Our future is all about increasing automation so we can continue to scale with enterprise-level expectations around reliability and availability. Thanks to our Transparency value, you can see how we work on our team page or even see what we’re working on.

About the job

Full-time
Anywhere
19 Applicants
Posted 5 months ago
Working Nomads curates remote digital jobs from around the web.

© 2025 Working Nomads.