Senior Site Reliability Engineer, Node Platform

Posted 121ds ago

Employment Information

Industry

Education

Salary

Experience

Job Type

Location

Report this job

Job expired or something wrong with this job?

Job Description

Senior Site Reliability Engineer designing infrastructure primitives for decentralized networks. Collaborate on Kubernetes-based control planes and improve operational efficiency.

Responsibilities:

You will design and build the infrastructure primitives that define how Chainlink Decentralized Oracle Networks (DONs) scale across internal systems and the decentralized ecosystem.
You will help create the CRE (Kubernetes-based) control plane that enables:
Deterministic horizontal scaling of DONs
Safe and repeatable infrastructure expansion
Improved operational efficiency and scalability
You will develop the core infrastructure components, including Kubernetes Operators and scaling automation, that Product teams will adopt and then might later be distributed to external node operators to improve decentralized scaling.

Requirements:

6–9+ years in SRE / Platform / Infrastructure Engineering
Proven experience scaling Kubernetes in high-throughput production environments
Deep knowledge of:
Scheduler behavior
StatefulSets & persistent workloads
Autoscaling strategies (HPA, VPA, KEDA, custom scaling)
Resource management & performance tuning
Multi-cluster and multi-region architectures
Experience in diagnosing production failures at the cluster scale
Strong Terraform or Crossplane experience
GitOps workflows (ArgoCD / Flux) experience
CI/CD reliability experience
Automation-first mindset
AWS production experience
Proficiency in Go (strongly preferred) or equivalent systems language.

Benefits:

All roles with Chainlink Labs are global and remote-based.
We carefully review all applications and aim to provide a response to every candidate within two weeks after the job posting closes.
Commitment to Equal Opportunity

41min

Ingeniero/a Cloud DevOps

Cloud DevOps Engineer specializing in Microsoft environments for a full-remote project with IRIUM. Seeking candidates with Azure experience and strong English skills.

Senior Site Reliability Engineer, Node Platform

Employment Information

Report this job

Job Description

Responsibilities:

Requirements:

Benefits:

Chainlink Labs

Report this job

Similar Jobs

IRIUM

IRIUM

Envision Healthcare

Pear Tree.

Veeam Software

Veeam Software

Arclin

EXL

Celonis

Upstart

Leidos

Ascensus

The Home Depot

Stord

DraftKings Inc.

Smarthis

Addvisor Group

Blackpoint Cyber

Blackpoint Cyber

MyFitnessPal