MENU
  • Remote Jobs
  • Companies
  • Go Premium
  • Job Alerts
  • Post a Job
  • Log in
  • Sign up
Working Nomads logo Working Nomads
  • Remote Jobs
  • Companies
  • Post Jobs
  • Go Premium
  • Get Free Job Alerts
  • Log in

Senior Site Reliability Engineer - Observability

ScienceLogic

Full-time
USA
engineer
devops
python
sql
project management
Apply for this position

Senior Site Reliability Engineer, Observability

Reston, VA or Remote

This position can be remote within the U.S.

Who we are...

ScienceLogic is going through a product transformation and the Site Reliability team is at the forefront of it. We are responsible for the design, deployment, and maintenance of the Cloud Infrastructure used for running the company’s revenue generating go-forward SaaS product line. 

ScienceLogic’s current SaaS product is a single tenancy, highly available and secure platform used by many customers for achieving their AIOps objectives. Cloud Operations leads the SaaS portfolio from the front by onboarding new customers on their own dedicated instance of the product, performing capacity planning, platform maintenance, upgrades, security and triaging incident response for the SaaS platform.

Overall, we’re passionate about automation and solving complex business and technology challenges. Our team combines SRE, DevOps, Software Development and Information Security knowledge to help make Cloud operations agile, elastic inside the security and governance framework boundaries. If you are well versed in cloud technologies, have an automation mindset and are ardent follower of the SRE discipline…then our team will be benefited by your skillset!

What we're looking for...

We’re seeking an experienced Site Reliability Engineer who is passionate about building and owning modern monitoring and observability solutions at scale. You’ll play a key role in designing proactive monitoring strategies, defining SLIs/SLOs, automating detection and remediation, and improving platform reliability across our SaaS environment.

The ideal candidate is a hands-on engineer with strong cloud, automation, and scripting experience, deep familiarity with tools like Prometheus, AWS CloudWatch, and New Relic, and a collaborative mindset. You enjoy solving complex problems, mentoring others, and continuously improving systems before issues impact customers.

What you'll be doing...

  • Be a key contributor on an Agile development team, collaboratively realizing business value through iterative software development lifecycle

  • Build and execute the monitoring strategy for ScienceLogic SaaS infrastructure

  • Define, deploy, and maintain system and service monitors

  • Be the authority for various monitoring technologies like Prometheus, AWS Cloudwatch, Scylla manager, New Relic to provide next generation monitoring solutions for ScienceLogic SaaS

  • Employ advanced monitoring practices and technologies to detect and automatically resolve platform issues before they impact the customer’s experience.

  • Participate in architecture and operations reviews

  • Identify and automate measurement of operations SLAs, SLOs using SLIs

  • Triage incident response, document SOPs, Runbooks and train NOC team members

  • Participate in shared on-call manager rotation for escalations during incidents and outages, occasionally during off hours

  • Provide dash boarding and analytics solutions to internal teams based on requirements

Qualities you possess...

  • 8+ years of software development or site reliability engineering or equivalent experience

  • Skilled at problem solving, algorithms, and data structures

  • Building tools and scripting frameworks from scratch

  • Working with Cloud Automation tools like CloudFormation, Terraform, CDK, aws-cli

  • Scripting languages like Python, Groovy, PowerShell, Bash, Perl etc.

  • Configuration automation using Ansible or equivalent tools

  • Exposure to Windows and Linux administration skills

  • Project management tools like Jira, Trello

  • Prior experience in dealing with Datastore technologies like Postgres, MySQL, SQL, DynamoDB is desirable

  • Familiarity with basic networking, security and cloud engineering concepts

  • Team player who is eager to help others to succeed through mentoring and leading by example

  • Highly collaborative with effective written and verbal communication skills

Benefits & Perks

  • Comprehensive medical, dental and vision plans

  • 401(k) plan with employer match

  • Flexible Paid Time Off (FTO) so that you can take the time that you need to re-energize

  • Volunteer Time Off (VTO) - take two days off per calendar year to volunteer with your preferred charitable organization

  • 5-year Service Milestone Sabbatical

  • Paid parental leave

  • Generous employee referral bonus program

  • Pet insurance

  • HQ Office centrally located in Reston Town Center featuring a well-stocked kitchen with rotating snacks and beverages, and catered lunch on Thursdays

  • Regular virtual company-wide events, including cooking classes, yoga, meditation and more

  • Mentorship and professional development opportunities with experienced product marketing leaders

  • The opportunity to learn and develop from some of the best and brightest minds in the industry!

Don’t meet every single requirement? Studies have shown that women and people of color are less likely to apply to jobs unless they meet every single qualification. At ScienceLogic, we are dedicated to building a diverse, inclusive and authentic workplace, so if you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyway. You may be just the right candidate for this or other roles.

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or any other applicable legally protected characteristics in the location in which you are applying

About ScienceLogic

ScienceLogic is a leader in IT Operations Management, providing modern IT operations with actionable insights to resolve and predict problems faster in a digital, ephemeral world. Its solution sees everything across cloud and distributed architectures, contextualizes data through relationship mapping, and acts on this insight through integration and automation.

www.sciencelogic.com

All ScienceLogic employees have the responsibility to protect information assets, adhere to access controls, report suspicious activity, and comply with security and privacy policies.

#LI-Remote

Apply for this position
Bookmark Report

About the job

Full-time
USA
Senior Level
Posted 6 days ago
engineer
devops
python
sql
project management

Apply for this position

Bookmark
Report
Enhancv advertisement
+ 1,284 new jobs added today
30,000+
Remote Jobs

Don't miss out — new listings every hour

Join Premium

Senior Site Reliability Engineer - Observability

ScienceLogic

Senior Site Reliability Engineer, Observability

Reston, VA or Remote

This position can be remote within the U.S.

Who we are...

ScienceLogic is going through a product transformation and the Site Reliability team is at the forefront of it. We are responsible for the design, deployment, and maintenance of the Cloud Infrastructure used for running the company’s revenue generating go-forward SaaS product line. 

ScienceLogic’s current SaaS product is a single tenancy, highly available and secure platform used by many customers for achieving their AIOps objectives. Cloud Operations leads the SaaS portfolio from the front by onboarding new customers on their own dedicated instance of the product, performing capacity planning, platform maintenance, upgrades, security and triaging incident response for the SaaS platform.

Overall, we’re passionate about automation and solving complex business and technology challenges. Our team combines SRE, DevOps, Software Development and Information Security knowledge to help make Cloud operations agile, elastic inside the security and governance framework boundaries. If you are well versed in cloud technologies, have an automation mindset and are ardent follower of the SRE discipline…then our team will be benefited by your skillset!

What we're looking for...

We’re seeking an experienced Site Reliability Engineer who is passionate about building and owning modern monitoring and observability solutions at scale. You’ll play a key role in designing proactive monitoring strategies, defining SLIs/SLOs, automating detection and remediation, and improving platform reliability across our SaaS environment.

The ideal candidate is a hands-on engineer with strong cloud, automation, and scripting experience, deep familiarity with tools like Prometheus, AWS CloudWatch, and New Relic, and a collaborative mindset. You enjoy solving complex problems, mentoring others, and continuously improving systems before issues impact customers.

What you'll be doing...

  • Be a key contributor on an Agile development team, collaboratively realizing business value through iterative software development lifecycle

  • Build and execute the monitoring strategy for ScienceLogic SaaS infrastructure

  • Define, deploy, and maintain system and service monitors

  • Be the authority for various monitoring technologies like Prometheus, AWS Cloudwatch, Scylla manager, New Relic to provide next generation monitoring solutions for ScienceLogic SaaS

  • Employ advanced monitoring practices and technologies to detect and automatically resolve platform issues before they impact the customer’s experience.

  • Participate in architecture and operations reviews

  • Identify and automate measurement of operations SLAs, SLOs using SLIs

  • Triage incident response, document SOPs, Runbooks and train NOC team members

  • Participate in shared on-call manager rotation for escalations during incidents and outages, occasionally during off hours

  • Provide dash boarding and analytics solutions to internal teams based on requirements

Qualities you possess...

  • 8+ years of software development or site reliability engineering or equivalent experience

  • Skilled at problem solving, algorithms, and data structures

  • Building tools and scripting frameworks from scratch

  • Working with Cloud Automation tools like CloudFormation, Terraform, CDK, aws-cli

  • Scripting languages like Python, Groovy, PowerShell, Bash, Perl etc.

  • Configuration automation using Ansible or equivalent tools

  • Exposure to Windows and Linux administration skills

  • Project management tools like Jira, Trello

  • Prior experience in dealing with Datastore technologies like Postgres, MySQL, SQL, DynamoDB is desirable

  • Familiarity with basic networking, security and cloud engineering concepts

  • Team player who is eager to help others to succeed through mentoring and leading by example

  • Highly collaborative with effective written and verbal communication skills

Benefits & Perks

  • Comprehensive medical, dental and vision plans

  • 401(k) plan with employer match

  • Flexible Paid Time Off (FTO) so that you can take the time that you need to re-energize

  • Volunteer Time Off (VTO) - take two days off per calendar year to volunteer with your preferred charitable organization

  • 5-year Service Milestone Sabbatical

  • Paid parental leave

  • Generous employee referral bonus program

  • Pet insurance

  • HQ Office centrally located in Reston Town Center featuring a well-stocked kitchen with rotating snacks and beverages, and catered lunch on Thursdays

  • Regular virtual company-wide events, including cooking classes, yoga, meditation and more

  • Mentorship and professional development opportunities with experienced product marketing leaders

  • The opportunity to learn and develop from some of the best and brightest minds in the industry!

Don’t meet every single requirement? Studies have shown that women and people of color are less likely to apply to jobs unless they meet every single qualification. At ScienceLogic, we are dedicated to building a diverse, inclusive and authentic workplace, so if you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyway. You may be just the right candidate for this or other roles.

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or any other applicable legally protected characteristics in the location in which you are applying

About ScienceLogic

ScienceLogic is a leader in IT Operations Management, providing modern IT operations with actionable insights to resolve and predict problems faster in a digital, ephemeral world. Its solution sees everything across cloud and distributed architectures, contextualizes data through relationship mapping, and acts on this insight through integration and automation.

www.sciencelogic.com

All ScienceLogic employees have the responsibility to protect information assets, adhere to access controls, report suspicious activity, and comply with security and privacy policies.

#LI-Remote

Working Nomads

Post Jobs
Premium Subscription
Sponsorship
Reviews
Job Alerts

Job Skills
Jobs by Location
Jobs by Experience Level
Jobs by Position Type
Jobs by Salary
API
Scam Alert
FAQ
Privacy policy
Terms and conditions
Contact us
About us

Jobs by Category

Remote Administration jobs
Remote Consulting jobs
Remote Customer Success jobs
Remote Development jobs
Remote Design jobs
Remote Education jobs
Remote Finance jobs
Remote Legal jobs
Remote Healthcare jobs
Remote Human Resources jobs
Remote Management jobs
Remote Marketing jobs
Remote Sales jobs
Remote System Administration jobs
Remote Writing jobs

Jobs by Position Type

Remote Full-time jobs
Remote Part-time jobs
Remote Contract jobs

Jobs by Region

Remote jobs Anywhere
Remote jobs North America
Remote jobs Latin America
Remote jobs Europe
Remote jobs Middle East
Remote jobs Africa
Remote jobs APAC

Jobs by Skill

Remote Accounting jobs
Remote Assistant jobs
Remote Copywriting jobs
Remote Cyber Security jobs
Remote Data Analyst jobs
Remote Data Entry jobs
Remote English jobs
Remote Entry Level jobs
Remote Spanish jobs
Remote Project Management jobs
Remote QA jobs
Remote SEO jobs

Jobs by Country

Remote jobs Australia
Remote jobs Argentina
Remote jobs Belgium
Remote jobs Brazil
Remote jobs Canada
Remote jobs Colombia
Remote jobs France
Remote jobs Germany
Remote jobs Ireland
Remote jobs India
Remote jobs Japan
Remote jobs Mexico
Remote jobs Netherlands
Remote jobs New Zealand
Remote jobs Philippines
Remote jobs Poland
Remote jobs Portugal
Remote jobs Singapore
Remote jobs Spain
Remote jobs UK
Remote jobs USA


Working Nomads curates remote digital jobs from around the web.

© 2026 Working Nomads.