Site Reliability Engineer 5 – Live Encoding SRE
Posted 85ds ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
Site Reliability Engineer at Netflix supporting live streaming operations and managing the reliability of live streaming pipelines. Collaborating with cross-functional teams on live event lifecycles and innovations.
Responsibilities:
- Support our live streaming pipeline team and day-to-day live-streaming operations for Netflix
- Responsible for the reliability of our live streaming pipeline (transmission, encoding, packaging, origin)
- Instrumenting end to end observability and visualizing the data to achieve the desired availability at scale
- Working with cross functional teams in the preparation, validation, and execution of live streaming focused initiatives
- Impact multiple areas of the live event lifecycle, from the planning phase through testing and event launch days
- Lead innovation initiatives, driving new features that will enhance our live streaming services, encoding & content delivery
- Drive continual improvement in resilience, observability, monitoring, instrumentation, and automation with the primary goal to maintain highly scalable and reliable services worldwide
- Implement, automate, execute, and analyze results from a broad range of live streaming delivery focused functional, performance, resilience, and fault injection testing
- Coordinate, collaborate, and partner across multiple stakeholders for the smooth execution of live-streaming events
- Aggregate, analyze, and correlate large amounts of server and application performance data
- Use the innovative Netflix Big Data platform as a highly flexible, specialized and efficient toolset for service delivery optimization and system reliability improvements
- Participate in an on-call rotation and be able to work with flexible hours based on the live events schedule
Requirements:
- 5+ years service reliability/operational experience running large scale, high performance systems & internet services with focus on live-streaming and video-on-demand (VOD) delivery
- Experience with video transport protocols such as RTP, RTMP, SRT, UDP, Zixi, RIST, HLS, MPEG-DASH
- Knowledge of and proven experience with HTTP cache/proxy technologies
- Experience supporting live-streaming delivery at scale
- Expert-level knowledge of Unix or Linux system engineering fundamentals (networking, storage, operating systems) at scale
- Proficient understanding of networking principles, transport, and application protocols, especially TCP/IP, BGP, DNS, TLS, and HTTP/S
- Experience with using distributed analytic processing technologies (Hive, Presto/Trino, Spark SQL, etc)
- Proficient in a programming language such as Python or Go
- Preferred - B.S. in Computer Science, Electrical or Computer Engineering (or equivalent professional experience)
Benefits:
- Health Plans
- Mental Health support
- 401(k) Retirement Plan with employer match
- Stock Option Program
- Disability Programs
- Health Savings and Flexible Spending Accounts
- Family-forming benefits
- Life and Serious Injury Benefits
- Paid leave of absence programs
- Full-time hourly employees accrue 35 days annually for paid time off to be used for vacation, holidays, and sick paid time off
- Full-time salaried employees are immediately entitled to flexible time off



















