Site Reliability Engineer

Posted 2ds ago

Employment Information

Education
Salary
Experience
Job Type

Report this job

Job expired or something wrong with this job?

Job Description

Site Reliability Engineer at Pathlock ensuring scalability and reliability of applications on Azure and Kubernetes. Automating infrastructure and collaborating with teams for optimal performance.

Responsibilities:

  • Design, build, and improve CI/CD pipelines for applications and infrastructure
  • Develop automation frameworks that reduce manual effort and increase consistency.
  • Configure and optimize cloud infrastructure to align with security, scalability, and performance best practices.
  • Collaborate with development teams to remove deployment blockers and improve delivery workflows.
  • Monitor reliability and performance, identify issues early, and implement data-driven improvements to increase uptime and efficiency.
  • Participate in on-call rotations and drive incident resolution with clear postmortems and preventive actions.
  • Maintain technical documentation for pipelines, configurations, and runbooks.
  • Perform readiness assessments and validation tests before production rollouts.
  • Implement Infrastructure as Code using Terraform and ARM templates with version control and reproducibility.
  • Troubleshoot complex deployment, provisioning, and performance issues across multicloud and containerized environments.

Requirements:

  • 3 to 5 years in SRE or DevOps roles operating production systems
  • Hands-on experience running production workloads on Kubernetes in a cloud environment, including cluster design, autoscaling, upgrades, and network policies.
  • Proven CI/CD delivery using GitHub Actions or Jenkins, including promotion across environments, approvals, and rollback strategies.
  • Infrastructure as Code expertise with Terraform and ARM templates, including modules, remote state, workspaces, and policy enforcement.
  • Strong scripting in PowerShell, Bash, or Python for automation and diagnostics.
  • GitOps experience with Argo CD or Flux, managing multi-environment application delivery and drift remediation.
  • Containerization with Docker and Kubernetes, including health probes, PodDisruptionBudgets, resource quotas, HorizontalPodAutoscaler, and operators.
  • Networking fundamentals with cloud network security practices such as VNet design, NSGs, Private Link, and ingress controllers.
  • Working knowledge of cloud security and compliance, including least privilege, secrets management, audit trails, and control evidence.
  • Excellent written and spoken English.
  • Ability to collaborate across US time zone.