Senior Site Reliability Engineer
To see similar active jobs please follow this link: Remote Development jobs
Senior Site Reliability Engineer, Foundations
An overview of this role
As a Senior Site Reliability Engineer (SRE) at GitLab, you are responsible for keeping all user-facing services and other GitLab production systems running smoothly. SREs are a blend of pragmatic operators and software craftspeople that apply sound engineering principles, operational discipline, and mature automation to our operating environments and the GitLab codebase.
GitLab SREs specialize in systems (operating systems, storage subsystems, networking), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems.
What you’ll do
Design and implement highly scalable infrastructure to support the needs of current and future GitLab.com.
Drive complex projects and objectives to improve the resilience and reliability of the GitLab product.
Provide deep domain knowledge, radiating that knowledge through asynchronous communication in all the work that you do.
Bring your influence and leadership to the team to grow Intermediate and Associate SREs.
Collaborate closely with cross-functional teams and other teams throughout Infrastructure on projects to drive GitLab’s future.
Influence the teams roadmap and work with engineering and product counterparts to define the teams future work.
Respond to incidents on an on call rotation (our team is distributed globally, so you only are on call during your daytime hours!) and participate in incident review.
What you’ll bring
Advanced experience with the Kubernetes ecosystem including Helm.
Google Cloud Platform expertise, specifically around networking, GKE configuration, and scaling.
Advanced experience with Terraform infrastructure as code.
Experience with configuration management tools such as Ansible and Chef.
Foundational understanding of programming skills in Go or Ruby.
Ability to clearly define problems and think beyond initial solutions, looking at how to make things better in the future.
A vision and drive for automating everything.
Ownership and accountability in everything that you do.
Ability to be a manager of one and have a strong bias for action.
An ability to clearly communicate asynchronously.
Excitement to be doing something different every day from project work to production change requests to mentoring to emergency response.
About the team
Foundations develops, owns, and runs the lifecycle of core infrastructure for GitLab.com. Running the largest GitLab instance in existence (and in fact, one of the largest single-tenancy open-source SaaS sites on the Internet) means we are constantly faced with unique and rewarding challenges that directly impact our users every day. Our future is all about increasing automation so we can continue to scale even bigger with enterprise level expectations around reliability and availability. Thanks to our Transparency value, you can see how we work on our team page. You can even see what we’re working on or listen in on some of our meetings.
#LI-BC2
Senior Site Reliability Engineer
To see similar active jobs please follow this link: Remote Development jobs
Senior Site Reliability Engineer, Foundations
An overview of this role
As a Senior Site Reliability Engineer (SRE) at GitLab, you are responsible for keeping all user-facing services and other GitLab production systems running smoothly. SREs are a blend of pragmatic operators and software craftspeople that apply sound engineering principles, operational discipline, and mature automation to our operating environments and the GitLab codebase.
GitLab SREs specialize in systems (operating systems, storage subsystems, networking), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems.
What you’ll do
Design and implement highly scalable infrastructure to support the needs of current and future GitLab.com.
Drive complex projects and objectives to improve the resilience and reliability of the GitLab product.
Provide deep domain knowledge, radiating that knowledge through asynchronous communication in all the work that you do.
Bring your influence and leadership to the team to grow Intermediate and Associate SREs.
Collaborate closely with cross-functional teams and other teams throughout Infrastructure on projects to drive GitLab’s future.
Influence the teams roadmap and work with engineering and product counterparts to define the teams future work.
Respond to incidents on an on call rotation (our team is distributed globally, so you only are on call during your daytime hours!) and participate in incident review.
What you’ll bring
Advanced experience with the Kubernetes ecosystem including Helm.
Google Cloud Platform expertise, specifically around networking, GKE configuration, and scaling.
Advanced experience with Terraform infrastructure as code.
Experience with configuration management tools such as Ansible and Chef.
Foundational understanding of programming skills in Go or Ruby.
Ability to clearly define problems and think beyond initial solutions, looking at how to make things better in the future.
A vision and drive for automating everything.
Ownership and accountability in everything that you do.
Ability to be a manager of one and have a strong bias for action.
An ability to clearly communicate asynchronously.
Excitement to be doing something different every day from project work to production change requests to mentoring to emergency response.
About the team
Foundations develops, owns, and runs the lifecycle of core infrastructure for GitLab.com. Running the largest GitLab instance in existence (and in fact, one of the largest single-tenancy open-source SaaS sites on the Internet) means we are constantly faced with unique and rewarding challenges that directly impact our users every day. Our future is all about increasing automation so we can continue to scale even bigger with enterprise level expectations around reliability and availability. Thanks to our Transparency value, you can see how we work on our team page. You can even see what we’re working on or listen in on some of our meetings.
#LI-BC2