Senior Platform Engineer
Socket
About Us
Socket helps devs and security teams ship faster by cutting out security busywork. Thousands of orgs use Socket to safely find, audit, and manage open source code. Our customers — from Anthropic to xAI, and Figma to Vercel — love Socket (just check out their tweets https://socket.dev/love to see for yourself!)
Founded by Feross Aboukhadijeh https://www.linkedin.com/in/feross/, a long-time open source maintainer with software downloaded over a billion times a month, Socket has raised $65M in funding https://socket.dev/blog/series-b from top angels, operators, and security leaders.
About the Role
- Help us scale. Socket ships daily to an ever-growing group of customers. As our customer base grows, we’re building features and systems at lightning speed, and we need to ensure that our infrastructure and tooling meets that demand. You’ll be critical part of the team that supports this growth.
- Grow the team and culture: As an early member of the team, you will form the defining DNA for the company's culture and our future team. Our ability to build a market-defining product is solely dependent on the culture we foster.
- Set up foundational frameworks: You'll join at the genesis of something totally new and come into a fast-paced environment. We value process-driven systems that enable us to work smarter as we scale, and you'll build out systems that will serve as guide rails for the engineering team.
What You'll Do
- Partner closely with our engineers to debug production issues, improve performance, and design systems that scale reliably
- Own and evolve Socket’s infrastructure, with a focus on reliability, performance, and cost as we scale
- Help define and evolve SLIs and SLOs for new and existing systems, turning reliability into something that can be measured and improved
- Debug, maintain, and improve our deployment pipeline, including addressing failures in production and driving meaningful improvements over time
- Build and maintain observability across our systems (metrics, logs, traces) to support faster detection and resolution of issues
- Participate in an on-call rotation and drive incident reviews with an emphasis on concrete follow-ups and system improvements
What You'll Bring
- 5+ years of software development experience, including 1+ year in a DevOps or SRE role.
- Comfortable working on a distributed, cross-functional team where priorities shift and the problems change day to day
- Experience scaling and operating production web applications, preferably in a TypeScript / NodeJS environment
- Strong knowledge of relational databases, with Postgres preferred
- Hands-on experience building and using observability systems (Prometheus/Mimir, OpenTelemetry, Grafana)
- Experience with container orchestration (Docker, Kubernetes)
- Practical experience managing infrastructure-as-code with Terraform
- Experience running systems in a cloud environment, with GCP preferred
- Experience building