Manager, Site Reliability Engineering

Posted 143ds ago

Employment Information

Industry

Education

Salary

Experience

Job Type

Location

Report this job

Job expired or something wrong with this job?

Job Description

Manager of Site Reliability Engineering leading infrastructure and engineering excellence initiatives at Jellyvision. Elevating SRE organization and driving technology optimization for healthcare benefits technology.

Responsibilities:

Directly manage a team of onshore and offshore software engineers
Lead and elevate our existing SRE team to world-class performance standards, advancing career development and technical excellence
Optimize and mature our established SRE practices, enhancing SLO/SLI frameworks, error budget management, and incident response effectiveness
Strengthen our culture of reliability and observability, driving higher standards for continuous improvement across all engineering teams
Refine existing on-call processes, escalation procedures, and post-incident reviews to accelerate learning and prevent recurring issues
Drive an AI-first agenda, leveraging AI tooling to address key pain points and improve speed to market
Partner with Product and Engineering leadership to help deliver the core product technology roadmap, balancing feature delivery with reliability and scalability requirements
Drive strategic decisions on technology consolidation and simplification to reduce operational overhead and costs
Lead technology platform evaluations and migrations that align with business objectives and cost optimization goals
Implement comprehensive monitoring, alerting, and observability solutions across all systems
Establish reliability engineering practices, load testing, and capacity planning
Drive automation initiatives that reduce manual toil and improve system reliability
Create and maintain disaster recovery and business continuity plans
Work closely with Platform & Infrastructure, Product Development, and Security teams to ensure aligned priorities
Collaborate with Finance and Operations teams on cost optimization and resource planning initiatives
Present technical strategies and progress to executive leadership

Requirements:

6+ years of software engineering experience with 2+ years in SRE, DevOps, or infrastructure leadership roles
Proven experience building and scaling SRE teams at high-growth technology companies
Deep expertise in cloud platforms (AWS, GCP, Azure), containerization (Kubernetes, Docker), and Infrastructure as Code
Strong background in distributed systems, microservices architecture, and database technologies
Experience with monitoring and observability tools (Dynatrace, DataDog, New Relic, etc.)
Experience with AI automation and Workflow optimization tools
Demonstrated success leading engineering teams composed of FTEs and offshore contractors
Track record of driving significant cost reductions through technology optimization and consolidation
Experience managing complex technical roadmaps with competing priorities and resource constraints
Strong analytical skills with the ability to make data-driven decisions on technology investments
Excellent written and verbal communication skills with the ability to present to executive audiences
Experience translating technical concepts into business impact and ROI metrics
Proven ability to influence cross-functional teams and drive consensus on technical decisions.

Benefits:

Check out our benefits here!

Manager, Site Reliability Engineering

Employment Information

Report this job

Job Description

Responsibilities:

Requirements:

Benefits:

Jellyvision

Report this job

Similar Jobs

South Geeks

Digibee

Salesforce

Proofpoint

Oscilar

Expleo Group

Jusbrasil

Verity Group

Visionary Integration Professionals (VIP)

Xenon Seven

ZigZag Offshoring

Ford Motor Company

easybill GmbH

DonWeb

IRIUM

TechInsights

Keiki

Hewlett Packard Enterprise

General Dynamics Information Technology

TechInsights