SRE Production Support
Posted 9hrs ago
Job Description
This SRE Production Support role sits in NBCUniversal's AIOps group and supports digital media delivery systems and automation. You will collaborate with other teams to improve system reliability and performance.
Responsibilities:
- Designing, implementing, and providing full-stack lifecycle support for digital asset delivery systems
- Tuning, optimizing the performance of, and troubleshooting delivery applications
- Assisting with the scoping, design, and implementation of media delivery project initiatives, under supervisor and team lead guidance
- Participating in incident cause analysis and assisting with remediation and design efforts to improve reliability and prevent future failures
- Working closely with DevOps teams to identify, understand, and develop monitoring for key system health and performance metrics
- Writing code and scripts to automate everything possible
Requirements:
- Experience with Site Reliability Engineering best practices and principles (Resiliency, Observability, Availability, Scalability)
- Experience with Agile DevOps methodologies, processes, and associated software (ServiceNow Agile, Jira)
- Experience with digital asset delivery systems (Signiant, Aspera)
- Experience supporting and administering Linux OS versions and features
- Experience with public cloud service offerings (AWS, Azure, Google Cloud)
- Familiarity with network technology concepts (TCP/IP, UDP, IPv4, IPv6, DNS, SSL, firewalls, F5 LTM)
- Familiarity with automation, CI/CD pipelines, and software testing (Ansible, Terraform, Puppet, Chef, and Jenkins)
- Experience with version control management (Git, GitHub)
- Experience with scripting-language development (Python, Node.js, Perl)
- BS in computer science or related field
- At least 3 years of experience supporting high-volume, large-scale environments
Benefits:
- Flexible work arrangements
- Professional development opportunities