Data Engineer – AI, Spark, Databricks, Healthcare

Posted 6ds ago

Employment Information

Education
Salary
Experience
Job Type

Report this job

Job expired or something wrong with this job?

Job Description

Data Engineer managing advanced Spark scripts for data validation and integration at Cotiviti. Collaborating with engineering teams to ensure data integrity and compliance in healthcare data processes.

Responsibilities:

  • Create, maintain and execute intermediate to advanced Spark scripts for data management and data validation, and data integration.
  • Create, maintain and execute basic to intermediate SQL scripts for data management and data validation.
  • Optimize the queries to improve the efficiency of daily tasks.
  • Perform data analysis and identify any issues.
  • Work with other groups such as Engineering team, DBA, Cloud ops, etc. to troubleshoot and resolve any environmental or network issues that impact your work.
  • Extend your support to after – hours or weekends as needed.
  • Create and maintain data pipelines as needed.
  • Validates the tasks results to ensure that all the requirements are met.
  • Adhere to all the industry level and organization level compliance rules and regulations to maintain data integrity.
  • Complete individual productivity tracking.
  • Complete task assignments using department ticketing system within assigned deadline.
  • Achieve organizational and individual goals as identified in performance reviews and goal setting exercises.
  • Complete all special projects and other duties as assigned.

Requirements:

  • Bachelor’s degree in Computer Science, Information Technology or equivalent work experience
  • 3+ years of working knowledge of big data technologies (Spark, S3, Kafka, Ray, Hadoop, etc.)
  • 2+ years of working knowledge of big data / cloud technologies (Databricks, AWS, Azure, Hadoop, Spark, Snowflake etc.)
  • 3+ years of working knowledge of cloud (AWS, Azure, GCP, OCI etc.)
  • 3+ years of working knowledge of RDBMS (Oracle, MS SQL, Vertica, etc.) and experience using SQL, PL/SQL or other data integration/ETL tools
  • Any Databricks / AWS certifications is a big plus
  • Familiarity with data pipeline orchestration tools (e.g., Airflow, Databricks Workflows)
  • 3+ years of data analysis
  • Preferably in the Healthcare industry of enrollment, medical claims and/or pharmacy claims
  • Proficient in Microsoft Office Suite applications PowerPoint, Word, Excel and Outlook
  • Flexible work schedule
  • Experience with project management tools like JIRA
  • Databricks and/or Snowflake environment familiarity a plus

Benefits:

  • medical, dental, vision, disability, and life insurance coverage
  • 401(k) savings plans
  • paid family leave
  • 9 paid holidays per year
  • 17-27 days of Paid Time Off (PTO) per year