Site Reliability Engineer
Posted 5hrs ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
Site Reliability Engineer optimizing reliability, scalability, and performance for Luupli's AWS cloud infrastructure. Collaborating with teams to enhance automation and incident management.
Responsibilities:
- Collaborate with software engineering and operations teams to design, build, and maintain cloud-based infrastructure using AWS and Terraform
- Implement and enhance infrastructure-as-code (IaC) practices using Terraform to ensure reproducibility and scalability of infrastructure components
- Develop and maintain monitoring solutions to proactively identify performance bottlenecks, system outages, and other potential issues
- Participate in incident response and root cause analysis efforts to drive continuous improvement and prevent future incidents
- Optimise system performance, reliability, and cost efficiency through continuous monitoring, performance tuning, and capacity planning
- Identify opportunities to automate manual processes and improve system resilience
- Utilise Python or Bash scripting to create and maintain automation tools for various operational tasks and deployments
- Implement and improve continuous integration and continuous deployment (CI/CD) pipelines
- Collaborate with security teams to implement best practices for securing cloud infrastructure and services
- Ensure compliance with relevant industry standards and regulations
- Support CI/CD pipelines for application deployments and updates
- Contribute to the design and implementation of deployment strategies that promote zero-downtime releases
- Maintain clear and up-to-date documentation for infrastructure configurations, processes, and incident resolution procedures
- Participate in knowledge sharing with team members to enhance overall expertise and skill sets
Requirements:
- Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience)
- Proven experience as a Site Reliability Engineer or similar role
- Extensive experience with Amazon Web Services (AWS) and its core services (EC2, S3, RDS, IAM, etc.)
- Strong proficiency in infrastructure-as-code (IaC) tools, with a focus on Terraform
- Proficient in scripting with Python or Bash for automation and operational tasks
- Solid understanding of networking principles and protocols
- Knowledge of CI/CD pipelines and related tools
Benefits:
- equity-only position
- opportunity to gain a stake in a rapidly growing company
- contribute directly to its success



















