DevOps, Infrastructure Engineer
Posted 49 days ago
Job Description
A tech company is hiring a DevOps / Infrastructure Engineer to manage its observability platform. The role covers designing and maintaining observability tools and practices across engineering teams.
Responsibilities:
- Design, build, and maintain our observability platform—metrics, logs, traces, and everything in between
- Get hands-on with infrastructure: deploy services, troubleshoot incidents, and fix things when they break (because they will)
- Instrument applications and services to capture meaningful telemetry data that drives real insights
- Build dashboards and alerting systems that teams actually use—not just noise generators
- Dive into production issues, correlate data across systems, and lead root cause analysis
- Champion observability best practices across engineering teams and help developers instrument their own code
- Automate everything you can: infrastructure provisioning, deployment pipelines, and operational runbooks
- Work closely with SRE and development teams to improve system reliability and performance
- Evaluate and integrate new observability tools and technologies as the landscape evolves
Requirements:
- 3+ years of experience in DevOps, Infrastructure, or SRE roles—with real production battle scars
- Deep hands-on experience with observability tools: Prometheus, Grafana, Datadog, New Relic, Splunk, ELK stack, Jaeger, or similar
- Strong proficiency with cloud platforms (AWS, GCP, or Azure) and infrastructure-as-code (Terraform, Pulumi, CloudFormation)
- Solid scripting and automation skills (Python, Bash, Go, or similar)
- Experience with containerisation and orchestration (Docker, Kubernetes)
- Understanding of distributed systems, microservices architectures, and the unique observability challenges they present
- Familiarity with CI/CD pipelines and GitOps workflows
- Excellent troubleshooting skills—you're the person who doesn't give up until you've found the root cause
Benefits:
- Competitive salary and equity package
- Flexible working arrangements
- Learning and development budget
- Modern tech stack and the autonomy to make real impact
- A team that values doing things properly over just doing things quickly