Lead Data Engineer
Posted 88 days ago
Job Description
As Lead Data Engineer at Greenbox Capital, you will design scalable data solutions on Azure for small businesses. You will collaborate with teams to ensure high-quality, governed data pipelines that support critical business decisions.
Responsibilities:
- Design, develop, and maintain scalable data pipelines and ETL processes using Azure Data Factory, Azure Databricks, and other Azure services.
- Own the data engineering framework, including pipeline patterns, orchestration standards, and reusable components.
- Collaborate with data scientists, software engineers, analysts, and other stakeholders to understand data requirements and deliver high-quality data solutions.
- Define, document, and enforce best practices for ADF, Databricks, Spark, and data modeling.
- Implement and maintain data storage solutions using Azure SQL Database, Azure Data Lake Storage, and Azure Cosmos DB.
- Ensure data quality and integrity by implementing data validation, cleansing, and transformation processes.
- Implement data quality checks, validation frameworks, and monitoring for critical data assets.
- Design and support governance patterns leveraging Databricks Unity Catalog and Azure-native controls.
- Develop and maintain documentation for data engineering processes and solutions.
Requirements:
- Bachelor’s degree in Computer Science, Information Technology, or a related field.
- 5+ years of experience in data engineering, with demonstrated ownership of production systems.
- 3+ years of experience in the Azure ecosystem.
- Proficiency in Azure Data Factory, Azure Databricks, Azure SQL Database, Azure Data Lake Storage, Azure Cosmos DB, and Databricks Unity Catalog.
- Substantial experience working with relational and NoSQL databases, JSON and XML data, and REST APIs.
- Deep hands-on experience with Azure Data Factory (orchestration, patterns, parameterization), Azure Databricks / Apache Spark (PySpark, performance tuning, cluster design), Azure Data Lake Storage, and Azure SQL.
- Advanced programming experience in Python and SQL.
- Solid understanding of data modeling, ETL/ELT design, and analytical data platforms.
- Experience with Azure DevOps or GitHub and CI/CD pipelines.
- Experience designing and deploying data governance frameworks from the ground up.
- Proven experience owning data engineering frameworks, not individual pipelines.
Benefits:
- Competitive Pay - We know your worth and we pay accordingly.
- Flexible PTO - Work hard, rest well. Take the time you need to recharge.
- Full Benefits Package - Health, dental, and vision.
- Smart, Supportive Teammates - Collaborate with sharp minds who are kind, driven and uphold our core values: Commitment, Communication, Teamwork, Service and Integrity!