Senior Engineer, Site Reliability
Posted 1 hour ago
Job Description
The Senior SRE designs and implements automation for Syniti's cloud-hosted SaaS platform, collaborating with multiple teams to ensure compliance and scalability across Azure and AWS environments.
Responsibilities:
- Design and build automated cloud infrastructure for Syniti-hosted SaaS workloads across Azure and AWS.
- Implement and manage CI/CD pipelines using GitHub Actions and ArgoCD GitOps workflows for multi-environment deployments (dev, preprod, prod, GovCloud).
- Develop and maintain Terraform modules for infrastructure provisioning, including EKS/AKS clusters, Aurora PostgreSQL, Redis, OpenSearch, and S3/Azure Storage.
- Integrate and maintain observability frameworks (Prometheus, Grafana, Loki, Mimir, Jaeger) and enforce structured logging of application, audit, and security events.
- Support and extend Istio service mesh configurations including mTLS policies, AuthorizationPolicies, and SPIFFE-based workload identity.
- Implement and maintain supply chain security controls: container image signing (Cosign), SBOM generation (Syft), provenance attestation (SLSA), and vulnerability scanning (Trivy, Inspector, Snyk).
- Collaborate with security teams to meet FedRAMP High, Cyber Essentials+, NIST 800-53, and SecNumCloud control objectives across Azure and AWS environments.
- Provide L3 incident response and lead root cause analysis efforts for application-tier outages across the Northstar platform.
- Support the automated compliance gate (11 controls: NAC, FIPS, STIG, IRSA, SAST, DAST, SBOM, Sign, Vuln, Audit, Auth) and ensure application services pass all gates.
- Manage Kubernetes workload autoscaling (Karpenter, KEDA, HPA) and pod security policies (Kyverno) for production services.
Requirements:
- 10+ years of experience in SRE, DevOps, or Cloud Engineering.
- 5+ years of hands-on experience with Microsoft Azure including AKS, Storage, Monitor, and IAM/Entra ID.
- 3+ years of hands-on experience with AWS (EKS, RDS/Aurora, IAM, S3, SQS, Cognito).
- 3+ years of Terraform module development for cloud infrastructure provisioning.
- 3+ years in CI/CD tooling (GitHub Actions, ArgoCD, or equivalent GitOps platforms).
- Strong Kubernetes operations experience: cluster management, pod security, autoscaling, troubleshooting.
- Experience with service mesh technologies (Istio, Envoy, or Linkerd) and workload identity (SPIFFE/SPIRE, IRSA, or workload identity federation).
- Proficiency in scripting languages: Python, Bash, or PowerShell. Go or .NET experience a plus.
- Experience implementing regional compliance controls (FedRAMP, SOC 2, Cyber Essentials+, or equivalent).
- Understanding of Zero Trust principles, mTLS, service identity, and network segmentation.
- Experience with observability stacks (Prometheus, Grafana, Loki, or equivalent) and distributed tracing.
- Familiarity with container supply chain security (image signing, SBOM, vulnerability scanning) is a plus.
Benefits:
- Trust in your talent.
- Growth opportunities.
- Supportive environment.
- Recognition of individual achievements.
- Commitment to inclusion and diversity.