Manager - PostgreSQL DBaaS Support & Site Reliability Engineering
Job Description:
We are seeking an experienced Manager of PostgreSQL DBaaS Support and Site Reliability Engineering (SRE) to lead a team responsible for the availability, reliability, and performance of our PostgreSQL Database-as-a-Service platform. This role combines technical leadership, operational excellence, and people management to ensure our DBaaS meets enterprise-grade standards while delivering outstanding customer support.
You will drive the evolution of our Hybrid Manager infrastructure, manage escalations from enterprise customers. The right person will be responsible for ticket metrics, meeting service level agreements and best practices. Experience with SOC compliance is a plus.
What your impact will be:
Leadership & Team Management
Lead, mentor, and grow a distributed team of PostgreSQL DBaaS support engineers and SREs.
Define career paths, set goals, and provide regular feedback to build a high-performance team.
Foster a culture of ownership, accountability, and continuous improvement.
Operational Excellence
Oversee 24x7 support and incident management for the PostgreSQL DBaaS platform.
Drive SRE best practices: SLAs, SLOs, SLIs, budgets, incident retrospectives, and postmortems.
Ensure compliance with SOC 2, HIPAA, GDPR, and other regulatory frameworks.
Collaborate with Product and Engineering teams to influence roadmaps and drive platform improvements.
Technical & Strategic Execution
Own the reliability, scalability, and performance of PostgreSQL clusters in production.
Drive automation of provisioning, monitoring, backup/recovery, patching, and upgrades.
Partner with architecture teams to define best practices for schema design, indexing, performance tuning, and replication strategies.
Guide incident response, root cause analysis, and long-term remediation.
Develop dashboards, runbooks, and playbooks to enhance operational visibility and reduce mean time to recovery (MTTR).
What you will bring:
7+ years of experience in PostgreSQL administration, support, or engineering, with at least 3 years in leadership or management.
Proven track record managing DBaaS platforms or large-scale PostgreSQL deployments.
Deep knowledge of high availability, replication, partitioning, and performance tuning in PostgreSQL.
Strong understanding of SRE principles, including monitoring, alerting, incident response, and service level objectives.
Experience with Kubernetes, container orchestration, and cloud providers (AWS, GCP, Azure).
Familiarity with Terraform, Ansible, or similar automation tools.
Strong communication and stakeholder management skills.
What will give you an edge:
Prior experience managing 24x7 global support teams.
Knowledge of multi-tenant DBaaS architectures.
Experience with security, compliance, and audit frameworks (SOC 2, HIPAA, FedRAMP).
Familiarity with observability stacks (Prometheus, Grafana, ELK, Datadog).
Programming/scripting proficiency in Python, Go, or Bash.
Manager - PostgreSQL DBaaS Support & Site Reliability Engineering
Job Description:
We are seeking an experienced Manager of PostgreSQL DBaaS Support and Site Reliability Engineering (SRE) to lead a team responsible for the availability, reliability, and performance of our PostgreSQL Database-as-a-Service platform. This role combines technical leadership, operational excellence, and people management to ensure our DBaaS meets enterprise-grade standards while delivering outstanding customer support.
You will drive the evolution of our Hybrid Manager infrastructure, manage escalations from enterprise customers. The right person will be responsible for ticket metrics, meeting service level agreements and best practices. Experience with SOC compliance is a plus.
What your impact will be:
Leadership & Team Management
Lead, mentor, and grow a distributed team of PostgreSQL DBaaS support engineers and SREs.
Define career paths, set goals, and provide regular feedback to build a high-performance team.
Foster a culture of ownership, accountability, and continuous improvement.
Operational Excellence
Oversee 24x7 support and incident management for the PostgreSQL DBaaS platform.
Drive SRE best practices: SLAs, SLOs, SLIs, budgets, incident retrospectives, and postmortems.
Ensure compliance with SOC 2, HIPAA, GDPR, and other regulatory frameworks.
Collaborate with Product and Engineering teams to influence roadmaps and drive platform improvements.
Technical & Strategic Execution
Own the reliability, scalability, and performance of PostgreSQL clusters in production.
Drive automation of provisioning, monitoring, backup/recovery, patching, and upgrades.
Partner with architecture teams to define best practices for schema design, indexing, performance tuning, and replication strategies.
Guide incident response, root cause analysis, and long-term remediation.
Develop dashboards, runbooks, and playbooks to enhance operational visibility and reduce mean time to recovery (MTTR).
What you will bring:
7+ years of experience in PostgreSQL administration, support, or engineering, with at least 3 years in leadership or management.
Proven track record managing DBaaS platforms or large-scale PostgreSQL deployments.
Deep knowledge of high availability, replication, partitioning, and performance tuning in PostgreSQL.
Strong understanding of SRE principles, including monitoring, alerting, incident response, and service level objectives.
Experience with Kubernetes, container orchestration, and cloud providers (AWS, GCP, Azure).
Familiarity with Terraform, Ansible, or similar automation tools.
Strong communication and stakeholder management skills.
What will give you an edge:
Prior experience managing 24x7 global support teams.
Knowledge of multi-tenant DBaaS architectures.
Experience with security, compliance, and audit frameworks (SOC 2, HIPAA, FedRAMP).
Familiarity with observability stacks (Prometheus, Grafana, ELK, Datadog).
Programming/scripting proficiency in Python, Go, or Bash.