Senior Data Engineer – AI/ML
Posted 98ds ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
Data Engineer for iFood's AI/ML Platforms, collaborating on data solutions and managing robust data pipelines. Join a leading Latin American tech company focused on innovation in food delivery, fintech and more.
Responsibilities:
- Collaborate with stakeholders across multiple companies in the iFood ecosystem to map data systems, define ingestion strategies, and coordinate secure, governed data sharing for collaborative AI projects.
- Design, build, and operate reliable batch and streaming pipelines that ingest, transform, and store large volumes of structured and unstructured data on platforms such as AWS S3.
- Define and manage data access policies and controls, ensuring secure and appropriate access to datasets across platforms (AWS S3, Databricks) aligned with governance and compliance requirements.
- Implement data quality frameworks and validation checks tailored to each data source in the ecosystem.
- Develop and apply best practices for organization, cataloging, and comprehensive metadata management optimized for efficient training of LLMs and for analytics.
- Build and manage ETL/ELT workflows using distributed computing frameworks such as Spark on Databricks or Amazon EMR.
- Monitor, analyze, and proactively optimize costs associated with data storage, processing, and transfer across the platform.
- Troubleshoot pipeline issues, optimize queries, and document solutions to enable team autonomy.
Requirements:
- Languages: Advanced Python and SQL with strong development experience.
- Big Data: Proficiency in Spark (Databricks/EMR) and knowledge of Ray.
- Cloud: Strong experience with AWS (primary); knowledge of GCP is a plus.
- Databases: Experience with relational databases and NoSQL (MongoDB).
- Concepts: Expertise in ETL/ELT and data modeling for production environments.
- Experience: Solid background in data engineering with a focus on production readiness and reliability.
- Direct experience supporting multiple external clients or business units in complex environments.
- Knowledge of workflow orchestration tools (Airflow, Prefect, Luigi) for automation of processes.
- Experience with graph databases (Neo4j, Amazon Neptune) for advanced relationship modeling.
- Knowledge of containerization and Kubernetes for scalable data workloads.
Benefits:
- We do not discriminate on the basis of disability, gender, sexual orientation, race/ethnicity, age, national origin, family status, or appearance.
- We have voluntary employee groups (foodlovers) that discuss topics such as race, gender, LGBTQI+, and people with disabilities.
- We work in a highly dynamic and versatile environment.















