Site Reliability Engineer (Postgres Database)
To see similar active jobs please follow this link: Remote System Administration jobs
**Candidate Note: English communication skills (verbal/written) required, and this position is 100% remote for candidates based in Brazil**
Budgets= max of 150,000 BRL/Annually
We are looking for an Jr. Site Reliability Engineer/Database Reliability Engineer with some Postgres experience and possesses an understanding of how to leverage SRE/DBRE best practices. Ideal candidates will take pride in improving the daily lives of customers, support engineers, and software engineers.
Your impact will be:
You will collaborate and guide our Engineering teams to ensure our applications and Infrastructure are stable and reliable
You will continuously refine monitoring processes, thresholds, and configuration (SLO/SLI)
Ensure observability and compliance is built in from the start
Create tools our team can use to do their jobs more efficiently
Identify critical metrics to monitor and alert on to surface actual user issues
Apply data modeling and predictive analysis to anticipate issues
Document solutions, SRE architectural patterns, and best practices to ensure that developers have guidance and insight as needed.
Collaborate with product and development teams to define SLOs and corresponding SLIs
Work closely with development and support teams to solve production escalation cases, as well as conducting post escalation reviews to document learnings and take actions to continuously improve existing processes.
What you will bring:
2-3+ years of experience working as a Site Reliability Engineer, DevOps Engineer, or a Software Engineer
Experience in Python, Go, Javascript, Bash or PowerShell, etc.
Experience with DBRE or DBA practices.
Postgres DBA technical knowledge would be ideal.
You feel at home in concepts like Infrastructure as Code and CI/CD using Terraform
Understanding of how to monitor cloud native Kubernetes environments and their workloads
Experience with observability tools, such as Prometheus/Thanos, Azure Log Analytics Workspace, Grafana, AWS Cloudwatch, DataDog, etc.
Experience with automating alerts to enable quick response and real-time collaboration between various teams.
What will give you an edge:
Experience with tools such as GitHub, TeamCity, Azure DevOps, etc.
Experience with chaos engineering
Site Reliability Engineer (Postgres Database)
To see similar active jobs please follow this link: Remote System Administration jobs
**Candidate Note: English communication skills (verbal/written) required, and this position is 100% remote for candidates based in Brazil**
Budgets= max of 150,000 BRL/Annually
We are looking for an Jr. Site Reliability Engineer/Database Reliability Engineer with some Postgres experience and possesses an understanding of how to leverage SRE/DBRE best practices. Ideal candidates will take pride in improving the daily lives of customers, support engineers, and software engineers.
Your impact will be:
You will collaborate and guide our Engineering teams to ensure our applications and Infrastructure are stable and reliable
You will continuously refine monitoring processes, thresholds, and configuration (SLO/SLI)
Ensure observability and compliance is built in from the start
Create tools our team can use to do their jobs more efficiently
Identify critical metrics to monitor and alert on to surface actual user issues
Apply data modeling and predictive analysis to anticipate issues
Document solutions, SRE architectural patterns, and best practices to ensure that developers have guidance and insight as needed.
Collaborate with product and development teams to define SLOs and corresponding SLIs
Work closely with development and support teams to solve production escalation cases, as well as conducting post escalation reviews to document learnings and take actions to continuously improve existing processes.
What you will bring:
2-3+ years of experience working as a Site Reliability Engineer, DevOps Engineer, or a Software Engineer
Experience in Python, Go, Javascript, Bash or PowerShell, etc.
Experience with DBRE or DBA practices.
Postgres DBA technical knowledge would be ideal.
You feel at home in concepts like Infrastructure as Code and CI/CD using Terraform
Understanding of how to monitor cloud native Kubernetes environments and their workloads
Experience with observability tools, such as Prometheus/Thanos, Azure Log Analytics Workspace, Grafana, AWS Cloudwatch, DataDog, etc.
Experience with automating alerts to enable quick response and real-time collaboration between various teams.
What will give you an edge:
Experience with tools such as GitHub, TeamCity, Azure DevOps, etc.
Experience with chaos engineering