Software Engineer, Infrastructure Platform
Posted 21ds ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
Software Engineer building and operating internal platform services at Docker. Focus on automation, reliability, and Kubernetes in a remote-first engineering team.
Responsibilities:
- Build and operate internal platform services and APIs in Go, including provisioning, quotas and policies, cost insights, and platform workflows.
- Deliver golden paths for self-serve onboarding and day-2 operations, including access, deployment setup, observability defaults, and governance guardrails.
- Partner with teams to drive adoption through clear docs, examples, and measurable outcomes.
- Codify infrastructure with Terraform and GitOps practices, and contribute to platform tooling in Go.
- Define and improve SLOs, alerting, and operational readiness. Participate in incident response and preventive follow-ups.
- Help standardize safe delivery patterns, including testing gates, canaries, and rollback triggers, so deployments are routine and low-risk.
- Operate and scale multi-tenant EKS clusters and traffic and ingress systems to deliver secure, reliable routing.
- Evaluate and adopt improvements with a bias toward incremental rollout and measurable impact.
- Build and iterate on agentic workflows that reduce operational toil, including triage support, context gathering, safe runbook execution, and remediation suggestions.
- Integrate automation into delivery and operations in a way that is safe, observable, and auditable.
- You’ll join an on-call rotation after onboarding and shadowing, and participate in incident response during your shifts.
- We aim for sustainable on-call through good alerting, automation, and blameless postmortems focused on prevention.
Requirements:
- 4+ years of backend software engineering experience building large-scale cloud or distributed systems
- Strong software development skills in Go or a similar language, including design, testing, debugging, and code review.
- Experience shipping and operating cloud services in production, often 3+ years. We hire for skill and impact, not years alone.
- Solid foundation in Linux, networking fundamentals, and cloud security.
- Experience building operational automation, including AI-assisted or agentic workflows, with an emphasis on safety, guardrails, and auditability.
- Clear written and verbal communication in a remote environment, including RFCs, incident writeups, and async collaboration.
Benefits:
- Freedom & flexibility; fit your work around your life
- Designated quarterly Whaleness Days plus end of year Whaleness break
- Home office setup; we want you comfortable while you work
- 16 weeks of paid Parental leave
- Technology stipend equivalent to $100 net/month
- PTO plan that encourages you to take time to do the things you enjoy
- Training stipend for conferences, courses and classes
- Equity; we are a growing start-up and want all employees to have a share in the success of the company
- Docker Swag
- Medical benefits, retirement and holidays vary by country
- Remote-first culture, with offices in Seattle and Paris

















