Senior DevOps Engineer
Senior DevOps Engineer
As a Senior DevOps Engineer at SuccessKPI, you will be a key member of the Engineering team, responsible for designing, building, and maintaining the infrastructure that supports our SaaS analytics platform. You will champion automation, reliability, security, and scalability as you optimize cloud-based environments and drive best practices across CI/CD pipelines, monitoring, and infrastructure-as-code. This is a hands-on role that combines deep technical expertise with cross-functional collaboration to ensure seamless deployment and operational excellence in a high-growth, fast-paced environment.
Job Location: Remote Work, USA. Candidates must be in minimal driving distance to McLean, VA
Why work for SuccessKPI:
· Opportunity to work for an organization that prides itself on offering a diverse and dynamic culture where employees are proud to work
· Opportunity to work for a fast-growth global company in the rapidly growing analytics space
· Opportunity for career development and growth opportunities as we grow and scale
· Opportunity to build industry relationships and work alongside seasoned industry experts
· Opportunity to work with our leadership team to strategize, collaborate, and solve customer challenges every day - YOU HAVE A VOICE AT SUCCESSKPI!
What You’ll Do:
Infrastructure Design & Management
Design, implement, and maintain scalable, reliable, and secure infrastructure on cloud platforms (primarily AWS).
Create new infrastructure or environments to meet evolving customer and product demands.
Monitor infrastructure performance and availability, ensuring high uptime and efficiency.
Apply infrastructure-as-code principles using tools such as Terraform, AWS CloudFormation, or the Serverless framework.
CI/CD & Deployment
Build, maintain, and optimize CI/CD pipelines for application deployments using tools like Jenkins, Bitbucket Pipelines, or equivalent.
Automate and standardize release processes to support frequent, reliable, and fast software delivery.
Support production release and bug-fix deployments, including environment configurations.
Automation & Scripting
Develop scripts and tooling (using Python, Bash, Node.js, etc.) to automate infrastructure management, deployments, and operational tasks.
Champion continuous improvement in automation to reduce manual effort and improve reliability.
Monitoring, Logging & Troubleshooting
Implement and manage robust monitoring and logging systems using AWS CloudWatch, Datadog, Dynatrace, or custom solutions.
Proactively identify, troubleshoot, and resolve infrastructure and application issues before they impact end users.
Participate in on-call rotations for critical production systems support.
Security & Compliance
Apply security best practices across all infrastructure layers to ensure secure operations.
Conduct routine security audits and vulnerability assessments to maintain compliance with applicable standards and frameworks.
Collaboration & Cross-Functional Support
Partner closely with development teams to support application architecture, deployments, and infrastructure decisions.
Collaborate with QA, Product, and Customer Support teams to resolve customer-impacting issues and improve system reliability.
What You’ll Bring - Required qualifications to be successful in this role:
Education & Experience
Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field.
8+ years of experience in a DevOps, Site Reliability Engineer (SRE), or Operations Engineer role in a cloud-first environment.
Technical Skills
Strong hands-on experience with AWS cloud services and infrastructure management.
Proficiency in scripting and development (Python, Bash, Node.js).
Experience with containerization and orchestration tools such as Docker, Kubernetes, or AWS ECS.
Deep familiarity with CI/CD tools: Jenkins, GitLab CI, CircleCI, Bitbucket Pipelines, or similar.
Proficient in infrastructure-as-code: Terraform, AWS CloudFormation, Serverless framework, etc.
Solid understanding of networking, security, Linux system administration, and cloud architecture patterns.
Soft Skills
Excellent analytical, problem-solving, and troubleshooting abilities.
Strong communication and collaboration skills across technical and non-technical stakeholders.
Self-starter with a proactive mindset and a passion for continuous improvement.
Comfortable working independently and as part of a distributed team.
Senior DevOps Engineer
Senior DevOps Engineer
As a Senior DevOps Engineer at SuccessKPI, you will be a key member of the Engineering team, responsible for designing, building, and maintaining the infrastructure that supports our SaaS analytics platform. You will champion automation, reliability, security, and scalability as you optimize cloud-based environments and drive best practices across CI/CD pipelines, monitoring, and infrastructure-as-code. This is a hands-on role that combines deep technical expertise with cross-functional collaboration to ensure seamless deployment and operational excellence in a high-growth, fast-paced environment.
Job Location: Remote Work, USA. Candidates must be in minimal driving distance to McLean, VA
Why work for SuccessKPI:
· Opportunity to work for an organization that prides itself on offering a diverse and dynamic culture where employees are proud to work
· Opportunity to work for a fast-growth global company in the rapidly growing analytics space
· Opportunity for career development and growth opportunities as we grow and scale
· Opportunity to build industry relationships and work alongside seasoned industry experts
· Opportunity to work with our leadership team to strategize, collaborate, and solve customer challenges every day - YOU HAVE A VOICE AT SUCCESSKPI!
What You’ll Do:
Infrastructure Design & Management
Design, implement, and maintain scalable, reliable, and secure infrastructure on cloud platforms (primarily AWS).
Create new infrastructure or environments to meet evolving customer and product demands.
Monitor infrastructure performance and availability, ensuring high uptime and efficiency.
Apply infrastructure-as-code principles using tools such as Terraform, AWS CloudFormation, or the Serverless framework.
CI/CD & Deployment
Build, maintain, and optimize CI/CD pipelines for application deployments using tools like Jenkins, Bitbucket Pipelines, or equivalent.
Automate and standardize release processes to support frequent, reliable, and fast software delivery.
Support production release and bug-fix deployments, including environment configurations.
Automation & Scripting
Develop scripts and tooling (using Python, Bash, Node.js, etc.) to automate infrastructure management, deployments, and operational tasks.
Champion continuous improvement in automation to reduce manual effort and improve reliability.
Monitoring, Logging & Troubleshooting
Implement and manage robust monitoring and logging systems using AWS CloudWatch, Datadog, Dynatrace, or custom solutions.
Proactively identify, troubleshoot, and resolve infrastructure and application issues before they impact end users.
Participate in on-call rotations for critical production systems support.
Security & Compliance
Apply security best practices across all infrastructure layers to ensure secure operations.
Conduct routine security audits and vulnerability assessments to maintain compliance with applicable standards and frameworks.
Collaboration & Cross-Functional Support
Partner closely with development teams to support application architecture, deployments, and infrastructure decisions.
Collaborate with QA, Product, and Customer Support teams to resolve customer-impacting issues and improve system reliability.
What You’ll Bring - Required qualifications to be successful in this role:
Education & Experience
Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field.
8+ years of experience in a DevOps, Site Reliability Engineer (SRE), or Operations Engineer role in a cloud-first environment.
Technical Skills
Strong hands-on experience with AWS cloud services and infrastructure management.
Proficiency in scripting and development (Python, Bash, Node.js).
Experience with containerization and orchestration tools such as Docker, Kubernetes, or AWS ECS.
Deep familiarity with CI/CD tools: Jenkins, GitLab CI, CircleCI, Bitbucket Pipelines, or similar.
Proficient in infrastructure-as-code: Terraform, AWS CloudFormation, Serverless framework, etc.
Solid understanding of networking, security, Linux system administration, and cloud architecture patterns.
Soft Skills
Excellent analytical, problem-solving, and troubleshooting abilities.
Strong communication and collaboration skills across technical and non-technical stakeholders.
Self-starter with a proactive mindset and a passion for continuous improvement.
Comfortable working independently and as part of a distributed team.
