Director of Engineering - Infrastructure
About the position
Ditto is at an inflection point. We're about to hit a massive scaling phase, and we need an exceptional engineering leader to ensure our infrastructure can support this hyper-growth journey. This is a unique opportunity to join a company at the beginning of its rapid expansion and shape the future of our technical foundation.
As Director of Engineering - Infrastructure, you will lead and unify our Site Reliability Engineering (SRE), Platform, and CI Infrastructure teams. You'll be responsible for creating a holistic view of our infrastructure ecosystem, from our SRE processes to the architecture of our customer-facing, self-service, multi-cloud Kubernetes platform, ensuring we can scale reliably and efficiently to meet the demands of our enterprise customers.
This role is critical to our success: our infrastructure is the foundation that enables us to deliver on our promises to the large enterprise customers we've courted. You'll work closely with our Cloud team (Big Peer, Portal, Pare, Data Integrations) and report to the VP of Engineering. Your mission will be to strengthen the bond between the SRE and Platform teams, foster collaboration, and build a world-class infrastructure organization that can support Ditto's ambitious growth plans.
As Director of Engineering - Infrastructure, you will:
Lead and manage our SRE, Platform, and CI Infrastructure teams, including their managers and key ICs such as architects and senior staff engineers
Design, own, and execute the vision for platform excellence at Ditto
Play a central role in transforming Ditto’s engineering culture into one that prioritizes the reliability and resilience of our mission-critical software. Communicate and articulate this mission across the entire company in all-hands meetings, presentations, and working sessions, and through the strategic objectives you set
Consolidate infrastructure knowledge and expertise to reduce cognitive load across the organization and create scalable processes
Develop and execute a comprehensive infrastructure strategy that prepares Ditto for the massive scale implied by our current sales trajectory
Strengthen the bond between SRE and Platform teams, fostering collaboration and shared ownership
Establish and maintain a healthy, sustainable working cadence for all infrastructure teams while reducing on-call incident frequency
Partner with the upcoming Cloud team leadership to ensure seamless integration of infrastructure services
Implement best practices for cloud infrastructure management, focusing on AWS with consideration for multi-cloud strategies
Lead the optimization of Kubernetes across our infrastructure stack
Build and mentor a high-performing team of infrastructure engineers and managers
Collaborate with Product, Sales, and other Engineering teams to align infrastructure capabilities with business needs
Manage infrastructure budgets and make strategic decisions about tooling and technology investments
Establish SLOs, monitoring, and alerting strategies that ensure reliability at scale
Champion infrastructure as code, automation, and self-service capabilities
Requirements:
10+ years of experience in cloud SRE/Platform engineering, with deep expertise in distributed systems and infrastructure at scale
5+ years of experience with Kubernetes in production environments
5+ years of experience managing SRE/Platform engineering teams
Proven track record of managing teams of 10+ people and scaling engineering organizations
Strong AWS experience is required; experience with GCP and Azure is highly desired
Advanced knowledge of cloud best practices and infrastructure automation
Excellent verbal and written communication skills with the ability to explain complex technical concepts to diverse audiences
Strong conflict management and leadership skills
Experience managing through rapid growth phases and organizational change
Strategic thinking abilities with a focus on business outcomes
Track record of successfully reducing operational overhead while improving system reliability
Experience with CI/CD pipelines and developer productivity tooling
Understanding of security best practices and compliance requirements
Nice to have:
Experience in companies going through hyper-growth phases
Background in edge computing or distributed database technologies
Experience with multi-region and multi-cloud architectures
Knowledge of mesh networking or peer-to-peer systems
Previous experience working with enterprise customers with high reliability requirements