Data Reliability Engineer III
Summary
The DRE Team is responsible for building, maintaining, and evolving cloud-native and containerized infrastructure dedicated to hosting and integrating data products and services.
You will play a crucial role in increasing our platform maturity by supporting other squads and collaborating on multidisciplinary projects, with a primary focus on availability, security, and scalability.
This senior position requires an intermediate to advanced understanding of systems design, cloud infrastructure, networking, and databases or data-related technologies.
What you'll do
Build and maintain data infrastructure using IaC (Terraform), following best practices to provide reusable modules and blueprints ready to be used by other data professionals.
Create and improve data CI/CD pipeline templates, emphasizing usability and a fast-paced development cycle for other data professionals.
Lead infrastructure initiatives and support the entire development lifecycle of new data products and services.
Provide and support observability and alerting for the entire data stack.
Improve our data stack and help drive new data products and services, always focusing on security, scalability, and resilience.
Participate in production support activities: offer insights and alternatives during troubleshooting, solve problems, eliminate technical debt, remove toil, and provide feedback to improve our data infrastructure and services.
Minimum Qualifications
Experience with Cloud (preferably AWS), Kubernetes, CI/CD, and Observability (Monitoring, Logging, and Tracing)
Basic knowledge of Streaming, Databases, and Containerized Applications
Desirable: experience with Big Data, NoSQL Databases, Python or Shell (scripting), Spark, and Airflow
Core Benefits
Remote work
Flexible hours
Gympass
Meal & Food vouchers
Remote work financial support
Life Insurance
Medical and Dental Assistance
Employee child care benefit: daycare
Vidalink partnership
Day off (Birthday)
Support for studying languages
50% off AWS and GCP certifications
Technologies we use day to day
AWS (S3, Glue, EMR, Lambda, Kinesis, Firehose, EKS, SNS, SQS and others)
GCP (BigQuery)
Databricks (Spark)
Airflow (Job Scheduler)
Codefresh and ArgoCD (CI/CD)
Grafana Cloud (Logging and Monitoring)
Kubernetes Hosted Apps (Trino, Superset, OpenMetadata, ClickHouse, and others coming)
Looker (BI/Analytics)
Languages (Go, Python and Shell Script)
