Manager, Site Reliability Engineering

Posted 98 days ago

Job Description

The Manager of Site Reliability Engineering leads infrastructure and engineering excellence initiatives at Jellyvision, elevating the SRE organization and driving technology optimization for its healthcare benefits technology.

Responsibilities:

  • Directly manage a team of onshore and offshore software engineers
  • Lead and elevate our existing SRE team to world-class performance standards, advancing career development and technical excellence
  • Optimize and mature our established SRE practices, enhancing SLO/SLI frameworks, error budget management, and incident response effectiveness
  • Strengthen our culture of reliability and observability, driving higher standards for continuous improvement across all engineering teams
  • Refine existing on-call processes, escalation procedures, and post-incident reviews to accelerate learning and prevent recurring issues
  • Drive an AI-first agenda, leveraging AI tooling to address key pain points and improve speed to market
  • Partner with Product and Engineering leadership to help deliver the core product technology roadmap, balancing feature delivery with reliability and scalability requirements
  • Drive strategic decisions on technology consolidation and simplification to reduce operational overhead and costs
  • Lead technology platform evaluations and migrations that align with business objectives and cost optimization goals
  • Implement comprehensive monitoring, alerting, and observability solutions across all systems
  • Establish reliability engineering practices, load testing, and capacity planning
  • Drive automation initiatives that reduce manual toil and improve system reliability
  • Create and maintain disaster recovery and business continuity plans
  • Work closely with Platform & Infrastructure, Product Development, and Security teams to ensure aligned priorities
  • Collaborate with Finance and Operations teams on cost optimization and resource planning initiatives
  • Present technical strategies and progress to executive leadership

Requirements:

  • 6+ years of software engineering experience with 2+ years in SRE, DevOps, or infrastructure leadership roles
  • Proven experience building and scaling SRE teams at high-growth technology companies
  • Deep expertise in cloud platforms (AWS, GCP, Azure), containerization (Kubernetes, Docker), and Infrastructure as Code
  • Strong background in distributed systems, microservices architecture, and database technologies
  • Experience with monitoring and observability tools (Dynatrace, Datadog, New Relic, etc.)
  • Experience with AI automation and workflow optimization tools
  • Demonstrated success leading engineering teams composed of FTEs and offshore contractors
  • Track record of driving significant cost reductions through technology optimization and consolidation
  • Experience managing complex technical roadmaps with competing priorities and resource constraints
  • Strong analytical skills with the ability to make data-driven decisions on technology investments
  • Excellent written and verbal communication skills with the ability to present to executive audiences
  • Experience translating technical concepts into business impact and ROI metrics
  • Proven ability to influence cross-functional teams and drive consensus on technical decisions

Benefits:

  • Check out our benefits here!