Reddit is a community of communities where people can dive into anything through experiences built around their interests, hobbies, and passions. Our mission is to bring community, belonging, and empowerment to everyone in the world. Reddit users submit, vote, and comment on content, stories, and discussions about the topics they care about the most. From pets to parenting, with over 100,000 active communities and over 70 million daily active users, it is home to the most open and authentic conversations on the internet. For more information, visit redditinc.com.

As the Director of Site Reliability, you will be responsible for ensuring the overall performance, availability and resilience of our infrastructure. This includes implementing and maintaining monitoring systems,collaborating with cross-functional teams to address performance bottlenecks and continuously improving the reliability and scalability of our systems to meet the evolving needs of our users. 

How You’ll have impact: 

Reporting into the global Head of Infrastructure, your peers and customers will be all of the other engineering directors at Reddit.  You will partner with a multitude of stakeholders to understand Reddit’s core service priorities across all of our product lines, and will guide design, development, and adoption of a scalable, reliable, and low latency core service stack. This stack will operate in multiple cloud environments, and provide an API platform for Reddit to rapidly deliver reliable, performant, and efficient services to our end users. 

You will be accountable for building, growing, and mentoring a world-class team of engineers to help Reddit reach its goal of bringing community and belonging to everyone.

What You’ll Do

  • Develop, drive and execute a long term vision and strategy for Site Reliability to be leveraged by all of Reddit’s products. 
  • Support multiple Reddit product teams with expertise and engineering development to optimize availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning.
  • Establish a stronger Platform-Product interface for feature tracking and prioritization, and provide an opinionated and trusted voice for guiding these decisions.
  • Coordinate across product and engineering teams to understand and widely socialize Reddit’s SRE priorities across all of our products. 
  • Support the reliable operation of these systems as a Platform for Reddit products, and allow us to rapidly deliver reliable, performant, and efficient services to our end users. 
  • Evolve our backend tech stack using modern and internal supported options (Golang, Redis, etc)
  • Lead, manage and grow high-caliber engineering teams.  
  • Provide mentorship and growth opportunities for team members and leaders to evolve in their roles at a company scale. 
  • Set and support a culture of metrics driven Quality, with efficient processes and strong transparency. 
  • Drive a cycle of virtuous improvement with blame-free postmortems.  

What We Look For

  • 6+ years experience of managing teams of site reliability and infrastructure engineers.
  • 10+ years of experience developing internet-scale software, preferably in infrastructure roles.
  • Experience designing, deploying, building or managing distributed systems of significant scale 
  • Professional experience and capability with essential cloud infrastructure systems (Kuberrnetes, AWS, GCE).
  • Track record of assembling high functioning organization
  • Strong organizational skills, the ability to prioritize tasks and keep projects on schedule.
  • BS degree in Computer Science, similar technical field of study or equivalent practical experience.

Benefits:

  • Pension Savings plan 
  • Medical Plan
  • Short term sickness benefits 
  • WIA excess and WGA gap insurance 
  • Workspace benefits for your home office 
  • Personal & Professional development funds
  • Family Planning Support 
  • Flexible Vacation & Reddit Global Days Off

#LI-remote, #LI-JS5

Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, please contact us at ApplicationAssistance@Reddit.com.

Apply for this Job

* Required

resume chosen  
(File types: pdf, doc, docx, txt, rtf)
cover_letter chosen  
(File types: pdf, doc, docx, txt, rtf)


Please reach out to our support team via our help center.
Please complete the reCAPTCHA above.