Data Engineer
Posted 11hrs ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
Data Engineer responsible for managing critical pipelines and customer identity data assets for Europe's leading platform in e-commerce. Collaborating with data engineers and analysts to deliver data-driven solutions.
Responsibilities:
- Own the end-to-end pipeline that creates the unified customer_uuid across Books & Media and Fashion
- Maintain and evolve our customer identity master data with a strong focus on accuracy, reliability, and production quality
- Improve our probabilistic identity resolution model and make matching decisions measurable, transparent, and explainable
- Build scalable and cost-efficient data pipelines across BigQuery, GCS, and Cloud Run Jobs
- Introduce diagnostics, monitoring, and structured validation for every relevant model change
- Identify and resolve edge cases in customer matching logic before they become production issues
- Work closely with business and technical stakeholders to turn complex matching challenges into robust data solutions
Requirements:
- 5+ years of experience in production data engineering
- Strong experience with BigQuery and advanced SQL in large-scale analytical environments
- Strong Python skills for production-grade data engineering
- Solid Airflow experience and a strong understanding of reliable orchestration patterns
- Hands-on experience with incremental pipelines and idempotent data processing
- Experience with probabilistic record linkage or entity resolution in production
- Strong understanding of data quality, matching logic, and precision/recall trade-offs
- A careful, structured, and ownership-driven way of working
- Strong communication skills and the ability to explain technical decisions clearly
Benefits:
- Healthcare insurance
- Educational budget
- Challenging tasks and professional development, knowledge & best practice sharing


















