Staff Software Engineer - Observability

Reddit

Full-time
USA
$207k-$289k per year
software engineering
engineer
saas
kubernetes
communication
The job listing has expired. Unfortunately, the hiring company is no longer accepting new applications.

The Observability (OBS) team is looking to hire a Staff Software Engineer who thrives at the intersection of infrastructure and software development. The team owns a suite of tools that help engineers understand their creations, built primarily on open-source solutions running at scale. We’re active users of and contributors to Prometheus, Thanos, Grafana, Vector, and more.

Monitoring

We run a monitoring stack at Reddit that processes billions of data points a minute. Our stack is one of the larger Prometheus/Thanos/Grafana deployments in the world, and with this come unique challenges of scale for these systems. Fun problems include performance engineering on a distributed query system and product thinking around new features that remove user pain from this stack.

Logging

We also operate a hybrid logging system that combines open-source components (Vector) with a SaaS search backend. The team is working to provide new features and deliver more reliable, scalable logging in the future.

Distributed Tracing

We’re in the midst of releasing a tracing product for internal use at Reddit, based on OTEL, ClickHouse, and Grafana. There will be ongoing work to scale this platform and add features.

As a member of the Observability team, your work will span these domains, which are rich with challenging infrastructure and software engineering problems. Your work will directly impact hundreds of millions of users around the world. Join us and help build the future of Reddit!

In your day-to-day, you can expect to:

  • Work collaboratively with a team of software engineers to create and maintain the foundational platform for running Reddit’s infrastructure.

  • Deliver software to improve the availability, scalability, latency, and efficiency of observability components.

  • Contribute feedback to the technical and strategic direction of eventing at Reddit.

  • Automate critical aspects of the event-driven development process.

  • Share on-call responsibilities. 

  • Contribute upstream changes to the open-source projects we use.

You have:

  • 7+ years of experience developing internet-scale software, preferably in the context of infrastructure.

  • Familiarity with distributed systems development; familiarity with any of the specific tools we use (Prometheus, Thanos, Grafana, Vector, ClickHouse, OTEL, Loki) is a bonus.

  • Experience developing on top of Kubernetes or similar distributed systems.

    • Kubernetes controller or operator development experience is a huge plus.

  • Strong troubleshooting capabilities surrounding both systems and software.

  • Experience engineering large systems, tracking work, and being a self-starter on projects.

  • Excellent communication skills to collaborate with a service-oriented team and company.

Benefits:

  • Comprehensive Healthcare Benefits

  • 401k Matching

  • Workspace benefits for your home office

  • Personal & Professional development funds

  • Family Planning Support

  • Flexible Vacation (please use it!) & Reddit Global Wellness Days

  • 4+ months paid Parental Leave

  • Paid Volunteer time off

#LI-remote, #LI-JS5

Posted 8 months ago

Working Nomads curates remote digital jobs from around the web.

© 2025 Working Nomads.