Senior SRE, Databricks
Posted 7hrs ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
Senior Data Infrastructure Engineer focusing on building and automating data platform on Google Cloud Platform. Ensuring security, scalability and efficiency in data infrastructure and governance.
Responsibilities:
- Create, maintain and evolve Terraform modules for provisioning the data infrastructure.
- Manage state management, workspaces and infrastructure versioning best practices.
- Ensure reproducible, auditable and traceable infrastructure via code.
- Provision and manage networks (VPCs, subnets, firewall rules) following security best practices.
- Configure IAM: roles, policies and service accounts with the principle of least privilege.
- Manage Google Cloud Storage (GCS) as the storage layer for the data platform.
- Ensure compliance with cloud security and governance policies.
- Provision and configure Databricks workspaces via Terraform/IaC.
- Manage clusters, jobs, notebooks and permissions on the platform.
- Integrate Databricks with GCP infrastructure (service accounts, VPC, GCS, IAM).
- Build and maintain CI/CD pipelines for infrastructure (GitHub Actions or similar).
- Apply GitOps practices: all infrastructure changes via Pull Request with review and automated validation.
- Ensure secure and auditable deployment across multiple environments (dev/staging/prod).
- Implement secrets and credential management following best practices (Secret Manager, Vault, etc.).
- Automate and standardize environments to ensure consistency and eliminate manual configuration.
- Support the data team with reliable, self-service infrastructure.
Requirements:
- Infrastructure as Code (IaC): Terraform with modules, state management and best practices.
- Google Cloud Platform (GCP): VPC, subnets, firewall rules, IAM (roles, policies, service accounts), GCS.
- Databricks: provisioning workspaces, managing clusters, jobs and notebooks.
- Version control with Git and GitOps practices.
- CI/CD pipelines (GitHub Actions, GitLab CI, Azure DevOps or similar).
- Cloud security: IAM, secrets, access policies and compliance.
- Automation and standardization of data environments.
- BigQuery: modeling, optimization and integration with the data platform. [**DIFFERENTIAL**]
- Apache Spark / PySpark for distributed processing. [**DIFFERENTIAL**]
- Delta Lake and Lakehouse architecture. [**DIFFERENTIAL**]
- Knowledge in Data Engineering and data pipelines (ETL/ELT). [**PLUS**]
- Kubernetes (GKE) for workload orchestration. [**PLUS**]
- FinOps: cloud cost optimization, rightsizing, reservations. [**PLUS**]
- Other IaC tools: Pulumi, Cloud Deployment Manager. [**PLUS**]
- Observability: monitoring, logging and alerting (Cloud Monitoring, Datadog, etc.). [**PLUS**]
Benefits:
- 🏥 Porto Seguro Health Insurance
- 🦷 Porto Seguro Dental Insurance
- 💰 Profit Sharing (PLR)
- 👶 Childcare Allowance
- 🍽️ Alelo Food and Meal Vouchers
- 💻 Home Office Allowance
- 📚 Partnerships with Educational Institutions
- 🚀 Support for Certifications, including Cloud
- 🎁 Livelo Points
- 🏋️♂️ TotalPass
- 🧘♂️ Mindself
- __ Temporary project until December, with the possibility of extension into 2027.__


















