Lead Infrastructure Engineer
Posted 13hrs ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
Lead Infrastructure Engineer for NexGen Cloud optimizing OpenStack and Kubernetes for GPU workloads. Lead a team and ensure platform reliability in a rapidly scaling environment.
Responsibilities:
- Own and drive the design, deployment, and operation of OpenStack and Kubernetes clusters optimised for GPU workloads
- Lead and develop a team of 4–5 infrastructure engineers, setting clear direction and standards
- Build and improve infrastructure through automation — IaC, GitOps, and CI/CD pipelines
- Ensure platform reliability through strong monitoring, observability, and incident management practices
- Collaborate closely with DevOps, Product, and Support teams to align infrastructure with real-world customer needs
- Take ownership of operational governance including incident, problem, and change management
- Identify opportunities to simplify, standardise, and scale systems as the platform grows
- Communicate clearly with leadership on platform performance, risks, and improvements
Requirements:
- Strong hands-on experience operating OpenStack in production environments
- Experience running production-grade Kubernetes clusters — ideally bare-metal or private cloud
- Solid Linux, networking, and storage fundamentals with a pragmatic troubleshooting approach
- Experience with infrastructure automation, CI/CD, and Git-based workflows
- Proven leadership or mentoring experience within infrastructure or platform teams
- Experience managing incidents and coordinating response during critical service events
- Strong communication skills, particularly translating technical issues for non-technical stakeholders.
- Experience integrating Kubernetes with OpenStack (Nice to Have)
- Exposure to GPU infrastructure, HPC, or large-scale compute platforms (Nice to Have)
- Familiarity with advanced networking or cloud-native ecosystems (Nice to Have)
- Contributions to open-source or cloud-native communities (Nice to Have)
Benefits:
- Competitive salary and annual discretionary bonus scheme
- Employee wellbeing benefits
- 25 days of holiday, plus public holidays
- Flexible working arrangements (remote or hybrid, depending on role and location)
- Real ownership and autonomy, with the trust to take initiative and experiment
- The opportunity to make a visible, meaningful impact as we scale
- Clear career progression and growth opportunities in a fast-growing company
- A collaborative, international culture built on trust, transparency, and ownership
- The chance to help shape NexGen Cloud's team, culture, and future alongside ambitious, mission-driven colleagues


















