Cloud Site Reliability Engineer
As a Cloud Site Reliability Engineer, you’ll be at the forefront of innovation, working on our cloud products platform to ensure stability and optimal performance.
Location & Time Zone: This role is 100% remote as long as you’re in the EET or Central European time zone (+/- 2 hours).
Interview Process: 4 stages - 45 min HR conversation -> 1h Values/Technical discussion -> Home Task -> 45 min Technical interview
Technologies: Linux, Kubernetes, CI/CD, Prometheus, Helm, Bash
Reporting to: Cloud SRE Team Leader
Your team: You’ll be part of a team of 7 (Cloud SRE Lead, 5 Cloud SRE Engineers, and YOU!)
What will make your journey with us amazing?
A supportive manager who cares about your well-being and is invested in your professional growth.
A culture of continuous learning, with clear targets and feedback.
A global company with over 2500 employees located in more than 26 countries around the world, including offices in 3 countries: Ukraine, Portugal, and India.
What will you do:
The Cloud SRE team supports our cloud system, takes care of monitoring platforms, and provides 24x7 'Always On' support through on-call rotations. We automate manual processes, enhance monitoring tools, maintain documentation, and collaborate with other teams to ensure effective service delivery to customers.
What will you bring:
-Kind, empathetic, and collaborative personality, willing to learn and share knowledge openly.
-Proficiency in command-line interfaces, *nix systems (Linux, e.g. Ubuntu), and Git.
-Experience working with Kubernetes clusters, both Docker- and CRI-O-based, and familiarity with Helm charts.
-Deep understanding of monitoring tools such as Prometheus, Grafana, and Alertmanager.
-Demonstrated expertise in scripting (Bash).
-A proactive approach to taking ownership, supporting new ideas, and following through from ideation to post-release support.
-An autonomous and flexible working style, able to contribute independently and collaboratively, with strong research and analytical skills for informed decision-making.
-And as a bonus—we value a good sense of humor!
Will be a plus:
-Knowledge of Rancher/RKE2
-Experience with CI/CD tools like ArgoCD and FluxCD
-Experience with Ansible and Terraform
-Knowledge of programming languages like Python, Go, and PHP
-Experience with any ChatOps solutions (including AI-powered)
What’s in it for you:
Embrace a 100% remote lifestyle with this opportunity!
Work with flexibility in a supportive environment where you have the autonomy to manage your time, while also staying connected with the team through daily check-ins and shared office hours. We value collaboration and commitment to team goals, balancing independence with structured support to ensure we all succeed together.
-Invest in your growth with dedicated learning resources and support.
-Thrive in a culture rooted in truth, trust, and transparency.
-Unleash your creativity and explore new ideas with 2 dedicated R&D days each month!
-Stay ahead of the curve with weekly team knowledge-sharing sessions.
-Dedicated budget for training, conferences, and certifications.
-Escape the meeting marathon with 3 meeting-free days per week.
-Enjoy generous vacation policies to recharge when you need it.
-Be a part of a unique team, not just another 'cloud-shop' - we run our own infrastructure!
#NamecheapCareers
#HackYourCareer
#equalopportunity