Lead Infrastructure Engineer

Posted 12hrs ago

Employment Information

Education
Salary
Experience
Job Type

Report this job

Job expired or something wrong with this job?

Job Description

Lead Infrastructure Engineer managing OpenStack and Kubernetes for NexGen Cloud's GPU infrastructure. Overseeing a team to optimize performance and scalability in a cloud-based environment.

Responsibilities:

  • Own and drive the design, deployment, and operation of OpenStack and Kubernetes clusters optimised for GPU workloads
  • Lead and develop a team of 4–5 infrastructure engineers, setting clear direction and standards
  • Build and improve infrastructure through automation — IaC, GitOps, and CI/CD pipelines
  • Ensure platform reliability through strong monitoring, observability, and incident management practices
  • Collaborate closely with DevOps, Product, and Support teams to align infrastructure with real-world customer needs
  • Take ownership of operational governance including incident, problem, and change management
  • Identify opportunities to simplify, standardise, and scale systems as the platform grows
  • Communicate clearly with leadership on platform performance, risks, and improvements

Requirements:

  • Strong hands-on experience operating OpenStack in production environments
  • Experience running production-grade Kubernetes clusters — ideally bare-metal or private cloud
  • Solid Linux, networking, and storage fundamentals with a pragmatic troubleshooting approach
  • Experience with infrastructure automation, CI/CD, and Git-based workflows
  • Proven leadership or mentoring experience within infrastructure or platform teams
  • Experience managing incidents and coordinating response during critical service events
  • Strong communication skills, particularly translating technical issues for non-technical stakeholders.

Benefits:

  • Competitive salary and annual discretionary bonus scheme
  • Employee wellbeing benefits
  • 25 days of holiday, plus public holidays
  • Flexible working arrangements (remote or hybrid, depending on role and location)
  • Real ownership and autonomy, with the trust to take initiative and experiment
  • The opportunity to make a visible, meaningful impact as we scale
  • Clear career progression and growth opportunities in a fast-growing company
  • A collaborative, international culture built on trust, transparency, and ownership
  • The chance to help shape NexGen Cloud's team, culture, and future alongside ambitious, mission-driven colleagues