Manager II - Production Engineering
Production Engineering at Pinterest is an evolution of our Site Reliability Engineering organization, blending systems and software engineering with a focus on scaling, resiliency, reliability, performance, and efficiency. Our organization accomplishes this by building & integrating software, increasing automation, and infusing our knowledge & best practices into our Platform products so we can scale our large distributed systems to keep our customers happy and our Pinners inspired. We also do this by developing short- and long-term embedded engagements with our engineering partners to help remove barriers, up-level reliability & best practices, and maintain a consistently high bar for reliability in a fast-paced, ever-changing environment. We are always on a mission to improve reliability while also increasing engineering velocity and reducing toil and KTLO impact for both ourselves and our customers: fast, efficient, quality - we will accomplish all three!
What you’ll do:
Lead our engineers to deliver the highest-impact work across engineering, ensuring we infuse best practices for reliability, scalability, performance, and efficiency into our products
Drive technical architecture discussions, including leading discussions and making decisions about technologies or applications you have no previous experience with
Continuously assess your team’s performance, address and coach under-performance, and recognize and promote high performance
Create an inclusive and welcoming workplace where every team member feels valued and supported
Foster an environment of open and honest communication where team members are safe to fail, encourage risk taking with a fail-fast mentality, and establish forums where team members can share their ideas
Empower engineers to develop their careers, matching their strengths with projects tailored to their skill levels, long-term skill development, personalities, and work styles
Create an inspiring team charter and direction that align with the goals of the broader Production Engineering organization
Develop strong partnerships with Product & Program Management partners across infrastructure by communicating a clear and impactful vision and priorities
Establish team norms around planning, execution, and continuous improvement
What we’re looking for:
3+ years of experience managing teams within SRE, Production Engineering, or other Platform/Infrastructure organizations
Customer obsession: demonstrated ability to work cohesively and build relationships with partners across engineering disciplines, and to influence without authority
Familiarity with SDLC concepts and use cases, including SCM tools, build platforms, test frameworks, and CI/CD products
Familiarity with the usage and high-level architecture of data platform technologies such as relational databases, storage & caching, key-value stores, time series data stores, etc.
Strong domain expertise in reliability concepts and best practices with the ability to innovate and provide thought leadership and direction in this problem space
Hands-on familiarity with public cloud platforms such as AWS, GCP, or Azure
Knowledge of Linux systems internals and networking
Ability to thrive in a highly ambiguous environment, be self-sufficient, and ruthlessly prioritize the highest-impact projects
Familiarity with infrastructure technologies such as Docker, Kubernetes, TensorFlow, Elasticsearch, ZooKeeper, and infrastructure as code (e.g., Terraform, Puppet, Chef, Ansible, Salt, Fabric)
Heavy bias toward action; able to drive resolution and make quick decisions, balancing a data-driven approach with your experience & judgment
In-Office Requirement Statement:
We let the type of work you do guide the collaboration style. That means we're not always working in an office, but we continue to gather for key moments of collaboration and connection.
This role will need to be in the office for in-person collaboration 1-2 times/quarter and therefore can be situated anywhere in the country.
Relocation Statement:
This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model.