A

Staff Software Engineer - Platform & Infrastructure

Abnormalsecurity
18 days ago
Full-time
Remote
Worldwide
Remote Engineering

About the Role

Enterprises of all sizes trust Abnormal Security’s cloud products to stop cybercrime—and these products are only as powerful as the platform they run on. The Platform Infrastructure team builds and operates the core systems that make Abnormal’s AI-driven detection and prevention possible: delivering reliability, scalability, and security at cloud scale.

We’re looking for a Staff Software Engineer to lead foundational efforts across multiple areas of Platform Infrastructure. In this role, you’ll guide a high-performing team, shape the roadmap for a true self-service infrastructure platform, and drive ambitious technical projects that use AI to automate and elevate how we build and operate our systems.

The ideal candidate:

  • Tackles complex, ambiguous problems and turns them into actionable plans.
  • Leads by example and dives deep when needed.
  • Embodies our VOICE values and builds software that delights customers.
  • Earns trust across Engineering, Product, and Design through thoughtful collaboration.

Team mission: Build and evolve the core infrastructure—compute, orchestration, and data platform—that powers Abnormal’s AI/ML products at scale. We treat platforms as products: usable, reliable, secure, and cost-efficient.

What you will do

  • Shape the core areas of Platform Infrastructure such as compute (EC2/EKS, autoscaling, container runtime) and orchestration (Kubernetes, workload APIs, multi-cluster, policy/quotas), as well as data platform (streaming, batch, durable storage, data tooling)—with demonstrated depth in at least two of these.
  • Design and drive platform architecture & roadmap to support Abnormal’s expanding AI/ML portfolio—scaling seamlessly across services, tenants, and regions.
  • Partner deeply with product & ML workflows to make pragmatic trade-offs, accelerating our shift to a platform-first operating model and enabling self-service.
  • Raise the bar on operational excellence (SLOs, availability, performance, incident response, change management, on-call hygiene) and help teams consistently meet it.
  • Act as the team’s technical lead: define quarterly roadmaps, de-risk delivery, mentor engineers, and land high-leverage, cross-team initiatives.
  • Champion AI-native software development,