D

Engineering Manager - Security Incident Response (EMEA)

Datadog
11 days ago
Full-time
Remote
Worldwide
Remote Engineering

The Security Incident Response team is part of our Resilience Engineering organisation and plays a vital role in keeping Datadog safe. Our goal is to ensure that Datadog is prepared for and efficiently responds to security-related incidents, ensuring that threats to our systems and data are contained as fast as possible. We also partner with teams after incidents by leveraging them as opportunities to learn. By focusing on our ability to adapt and fix systemic problems, we contribute towards a larger culture of building resilience in our people and systems.

As an Engineering Manager, you will help us realize this mission by leading a talented group of engineers who are committed to driving Datadog’s incident response capabilities to the next level. Along with building tools and automation to streamline our efficiency, you will work with key stakeholders across Datadog to ensure we are focusing our efforts in the right areas and are measuring how we improve. As part of a leadership team, you will be active in shaping our organizational strategy and culture. 

At Datadog, we place value in our office culture - the relationships and collaboration it builds and the creativity it brings to the table. We operate as a hybrid workplace to ensure our Datadogs can create a work-life harmony that best fits them.

What You’ll Do: 

  • Lead and mentor a team of experienced incident responders who are passionate about building a culture of security and resilience at Datadog. Help engineers grow to the next level and continuously provide them opportunities to develop. 
  • Serve as a hands-on leader during incidents. Lead under pressure, make decisions in ambiguous situations, and collaborate across several teams to drive towards resolution. Be on-call in our secondary rotation (along with around 5 other leaders), which is escalated to when responders need help with resourcing or decision-making.
  • Ensure the team is triaging alerts and signals in Datadog Cloud SIEM consistently and to a high level so that we can respond to emerging threats. Partner with our Threat Detection team to tune and calibrate these signals so they’re delivering value.
  • Build tools, systems, and processes to ensure Datadog is maturing its security incident response capabilities. Ensure that our operational capabilities are measured and communicated with stakeholders.
  • Lead post-incident analysis efforts so that engineers at Datadog learn from security incidents, ensuring postmortems are blameless and actionable. Ensure we are capturing follow-up items that repair systematic issues and preve