Site Reliability Engineer
Posted 14ds ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
Site Reliability Engineer ensuring system stability and troubleshooting issues at Motorola Solutions. Collaborating with teams to maintain high-performance applications and infrastructure.
Responsibilities:
- Diagnose complex, intermittent, and high-impact issues to maintain system stability
- Research and utilize advanced diagnostic tools to troubleshoot ongoing customer issues within live production environments
- Identify single points of failure in the architecture to re-design systems for maximum redundancy and auto-recovery
- Analyze application source code in Java and Angular to identify memory leaks, race conditions, or inefficient logic
- Propose and implement code fixes directly to improve long-term system reliability rather than simply filing bug tickets
- Adjust kernel parameters and network stack configurations to optimize low-level system performance
- Build internal tooling to empower other engineering teams to self-serve their infrastructure needs
- Develop high-quality automation to ensure that manually solved problems are never repeated
- Tweak database queries and application thread pools to tune the performance of the entire software stack
- Serve as a critical member of the on-call rotation to respond to and mitigate major system outages
- Lead incident command efforts during high-pressure situations to restore service and protect critical data flows
- Conduct post-incident reviews to convert outages into actionable architectural improvements
Requirements:
- 5+ years of experience with Java or Angular to debug and patch application-level reliability issues
- 5+ years of experience with Linux Internals to tune kernel parameters and network stack configurations
- Advanced English Proficiency
- Expertise in Infrastructure as Code (IaC) to build automated, repeatable environments
- Expertise in Database Optimization to refine complex queries and improve data retrieval speeds
- 5+ years of experience in a Site Reliability Engineering or DevOps role to manage high-availability production environments
- 5+ years of experience with Cloud Infrastructure to design resilient and scalable architectures
- Experience in Incident Command to lead the resolution of mission-critical system outages
- Bachelor’s degree in Computer Science, Software Engineering, or a related technical field
Benefits:
- Health insurance
- Flexible work arrangements
















