Data Engineering Lead
Posted 46ds ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
Data Engineering Lead overseeing design and development of next-generation data platform at MediaRadar. Spearheading data architecture and leading a team of data engineers for large-scale projects.
Responsibilities:
- Design and supervise the implementation of comprehensive data pipelines utilizing Azure Databricks and PySpark.
- Direct a team of data engineers, performing code reviews, offering technical expertise, and cultivating a culture of ongoing learning.
- Develop high-performance schemas in PostgreSQL and refine complex SQL queries for large datasets.
- Establish and apply optimal practices for data ingestion, transformation, and storage (Delta Lake/Lakehouse patterns).
- Collaborate closely with Data Analysts, Architects, and Product Managers to convert business requirements into technical specifications.
- Promote the implementation of CI/CD, unit testing, and automated monitoring to achieve 99.9% data reliability.
- Ensure data quality, governance, and compliance through validation, documentation, and secure practices.
- Continuously improve data systems for enhanced performance, reliability, and scalability.
- Effectively engage within an agile, cross-functional project team.
Requirements:
- Azure Databricks:
- ○ Expert-level experience managing workspaces, clusters, and job scheduling.
- ○ Solid understanding of data lakehouse architectures and Delta Lake.
- ○ Proven experience in Performance Tuning, Spark Optimization and Cost Reduction.
- PySpark: Advanced proficiency in Spark DataFrame APIs and Spark SQL for large-scale data processing involving various data formats.
- SQL Mastery: Exceptional ability to write, tune, and troubleshoot complex queries.
- PostgreSQL: Hands-on experience with relational database design, indexing, and performance optimization.
- ETL/ELT Frameworks: Proven track record of building scalable data pipelines from scratch.
- Workflow Orchestration: Experience with Apache Airflow for managing complex task dependencies.
- Containerization: Familiarity with Azure Kubernetes Service (AKS) for deploying containerized data services.
- Infrastructure as Code (IaC): Knowledge of Terraform or Bicep for managing Azure resources.
- 10+ years of experience in Data Engineering or Software Engineering.
- 3+ years as a formal technical Lead managing an agile team and implementing E2E solutions.
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
- Strong communication skills with the ability to explain complex technical concepts to non-technical stakeholders.
- Strong problem-solving skills and attention to detail.















