Staff Software Engineer - Core Services (Data focused)
About The Team:
Scylla, Postgres, ClickHouse, and Redis comprise the core of our data infrastructure. As such, they need to be performant and reliable for all our use cases.
As a Scylla & DB focused Software Engineer, you will primarily take ownership of our Scylla clusters and ensure they are used and managed with best practices at scale. This includes improving uptime, latency, restorability, partitioning, etc. You will also work closely with other engineers in moving and replicating data to optimal data stores.
Your secondary responsibilities will include learning Postgres and ClickHouse (schema and query design, partitioning strategies, etc.), along with writing microservices.
Please consider applying even if you don’t meet all of the desired qualifications.
What You'll Do
Optimize the scaling strategy for our high-performance Scylla clusters
Design and optimize schemas, queries, indices, caches and more, primarily for Scylla, eventually for Postgres and ClickHouse
Conduct data analysis and performance monitoring to identify areas for optimization and enhancement
Collaborate closely with fellow engineers to architect and implement services required to back OneSignal’s product offerings
Actively participate in peer code reviews and Technical Design Spec reviews, providing valuable technical insights to continuously improve our code base
Work with the team to efficiently resolve production issues and ensure the system scales smoothly to meet the growing demands of our customers
Stay up-to-date with the latest industry trends and technologies, incorporating new ideas into our engineering processes
Participate in production on-call rotation
Debug production issues by utilizing metrics, logs, and distributed traces
Design synchronous and asynchronous APIs for communicating between services in a large distributed system
What You'll Bring:
Significant experience optimizing Scylla at scale
At least 8 years of experience working as a software engineer
Ability to work on a self-motivated, high-performing team
Experience operating reliable production systems at scale
Easily bored by running tasks by hand, with the ability to automate such tasks
Experience with a distributed event streaming framework such as Apache Kafka
Experience with Rust, Golang, or other lower-level languages. Those with backgrounds in interpreted languages may have a steeper learning curve with Rust, but if you have worked on core components of very high-throughput systems, please apply
Ability to reason about how data flows through distributed systems
Experience with Docker and Kubernetes
We value a variety of experiences, and these are not required. It would be an added bonus if you have experience in any of the following:
Experience with any of Redis, Kafka, Postgres, ClickHouse, gRPC
Experience profiling applications to improve performance in terms of time/CPU/memory
Experience debugging issues in distributed systems
The New York and California base salary for this full-time position is between $190,000 and $210,000. Your exact starting salary is determined by a number of factors such as your experience, skills, and qualifications. In addition to base salary, we also offer a competitive equity program and comprehensive and inclusive benefits.