Senior Site Reliability Engineer - Feegow
Company Description
Welcome to the good side of tech 👋
You might have heard of us under a different name: Feegow, Doctoralia or Docplanner. The names differ depending on the country we are located in, but we are all part of Docplanner Group.
It all started over 12 years ago when we asked ourselves: is anyone in healthcare thinking about patients? We jumped in and empowered patients by letting them leave and read reviews about their visits. We then provided doctors with the technology to manage bookings easily and save time, so they could devote themselves to what they always wanted: treating patients. And today is the day in which we ask you: wanna join us in the next step of making the healthcare experience more human?
Where does Feegow fit here?
In 2022 Feegow joined the Docplanner Group, a health-tech company. We are dedicated to developing innovative solutions and promoting access to top-tier management for physicians seeking to take their practices to the next level, as well as managers in search of a robust solution that covers end-to-end operations. Our ultimate goal is to create a world where everyone can access high-quality healthcare.
Docplanner Group at scale
We are leaders in 13 countries so far, and more than 80 million patients trust us every month. 130,000 specialists believe in us and our product, and so do leading venture capital funds such as Point Nine Capital, Goldman Sachs Asset Management, and One Peak Partners. And yet, while employing over 2,500 people all over the globe, we have managed to keep the startup mindset we started with over 10 years ago.
And why should you join us?
Because it feels good to tell your family and your friends how you made the world a little bit better. You go to bed knowing that what you do matters, and that your talents align with your beliefs.
We want to make the healthcare experience more human, and that starts with you being you. We believe that considering the diversity of human experience makes a better healthcare experience for all. We’re not just different: we embrace diversity. We will encourage you to bring your whole self to work, and that includes not coming to the office at all if you prefer not to, as we're 100% remote-friendly (you will work 100% from home, but you will also be welcome in our offices in Curitiba and Rio de Janeiro).
Job Description
We are looking for a Senior Site Reliability Engineer to play a key role in the evolution of our platform. You will be responsible for leading initiatives that improve the scalability, reliability, observability, and security of our systems. This role goes beyond just maintaining infrastructure – we’re looking for someone to raise the engineering bar, proactively identify bottlenecks, and unlock the autonomy of the entire engineering team.
By optimizing our infrastructure and maintaining system reliability, you will ensure that our digital healthcare platform operates smoothly and effectively. This will contribute to the overall user experience, which is vital to our mission of making healthcare accessible and efficient. Your role will involve implementing security policies that keep our users' data safe and protected, which is crucial to maintaining trust in our services.
You’ll act as a technical reference, working closely with the DevOps Manager at Feegow and the global PMS Platform Team at Doctoralia, contributing not only to day-to-day operations but to the strategic vision of our DevOps practices.
In this role, you will:
Lead and guide the evolution of our infrastructure and deployment pipelines.
Support the engineering teams with architectural decisions and production-readiness practices.
Work on complex and high-impact projects, including our journey to a global infrastructure.
Act as a driver of change, fostering a culture of observability, automation, and operational excellence.
Collaboration & Leadership
Act as the go-to person for DevOps topics across engineering squads.
Support and mentor engineers on platform engineering principles, CI/CD, observability, incident response, and production reliability.
Partner with SRE, platform, and product teams to unlock delivery and reliability goals.
Contribute to architectural discussions and ensure systems are designed with scalability, security, and operational readiness in mind.
Actively support and mentor other team members in platform-related topics (CI/CD, automation, dockerization, etc.) to increase the team’s autonomy.
Proactively cooperate with SREs during investigations on the efficiency and reliability of production systems.
Infrastructure & Automation
Manage infrastructure using AWS, Kubernetes (EKS), ArgoCD, and Terraform.
Evolve and maintain CI/CD pipelines to support fast and safe deployments.
Automate repetitive tasks and proactively improve system resilience.
Observability & Incident Management
Improve the monitoring and alerting culture within engineering teams using tools like Datadog.
Lead post-mortems and drive follow-ups from incidents, ensuring continuous improvement.
Ensure SLAs, SLOs, and system health indicators are well defined and visible.
Monitoring
Monitor system performance and troubleshoot issues to ensure high availability and reliability.
Ensure there’s necessary alerting around your team’s systems.
Security & Compliance
Champion DevSecOps practices, proactively identifying and mitigating risks.
Support the enforcement of security baselines and compliance across systems.
Expectations
Strong hands-on experience in DevOps, SRE, or Platform Engineering.
Proven ability to lead infrastructure and reliability improvements in complex systems.
Advanced skills in Kubernetes, AWS, Terraform, CI/CD, and observability tooling.
Experience with service ownership, incident management, and root cause analysis.
Ability to influence teams and drive DevOps best practices in a growing organization.
Proactive mindset, sense of urgency, and strong communication skills.
Qualifications
Proficiency with infrastructure as code tools like Terraform.
Experience with containerization and orchestration tools, particularly Docker and Kubernetes.
Understanding of AWS and its services.
Familiarity with CI/CD tools such as ArgoCD or similar.
Hands-on experience with Datadog (analyzing systems running in production, understanding its metrics and monitoring capabilities).
Excellent problem-solving and troubleshooting skills.
Good communication in English (B2-level) to cooperate with worldwide peers.
Understanding of security best practices and compliance requirements.
Nice to Have
Experience in regulated environments (e.g., healthcare, finance).
Experience mentoring junior engineers and fostering DevOps culture across teams.
Exposure to multi-region, multi-cloud, or hybrid infrastructure scenarios.
Additional Information
Working hours are from Monday to Friday, from 9 am to 6 pm;
We have compensatory time off (Banco de Horas);
Food/Market Voucher;
Medical, Dental, and Group Life Insurance;
Pet Plan;
iFeel app, for emotional comfort;
Gympass for you and up to 3 people!
Creditas: Payroll loan services, eligible after 6 months of employment;
Stock Options: eligible after 6 months of employment (5-year grace period);
Birthday Day Off;
Daycare Assistance;
Partnership Club, with discounts at teaching institutions such as colleges and language learning services;
Referral Program offers up to R$600 per person who stays with us for more than 6 months;
Leave of Absence/Time-off: in the event of the passing of loved ones, we offer 10 days off; if your pet passes away, we offer 2 days. Got married? 7 days of rest! Did the baby arrive? We offer 30 days for Dads and 6 months for Moms.