Data Engineer

Company:  Cypress HCM
Location: Plano
Closing Date: 20/10/2024
Hours: Full Time
Type: Permanent
Job Requirements / Description

Data Engineer


The Data Platform team builds and delivers data assets that serve our client’s customers in order to have visibility into their plan costs and clinical outcomes. The team collaborates closely with other departments within the organization such as Data Science and the Analytics team.


The Data Engineer (L2 equivalent) role will work with the team to decide on the best path for execution on analysis in collaboration with other team members. It requires deep problem solving skills in determining root causes and resolving issues across data quality, skills in multi-tasking across multiple priorities, and ability to work cross-collaboratively with customers and stakeholders.


Responsibilities :

  • Data Pipelines - Create new pipelines and improve/maintain existing pipelines using Spark (Pyspark, Spark SQL)
  • Data Modeling - Partner with analytic consumers to design logical and physical schemas, improve existing data models and build new ones
  • Cross-functional Collaboration - Interface with Product, Engineering, Data Science, Analytics/BI, and Operations to understand their data needs, providing both consultative and data engineering solutions for consumers
  • Build data expertise and own data quality across various business domains including healthcare claims and member experience
  • Leverage best in industry practices to build the next generation data ecosystem to collect, move, store and analyze data
  • Agile oriented development experience


Skills and qualifications include:

  • BS degree in Computer Science or related technical field, or equivalent practical experience
  • 2+ years proven work experience as a data engineer, working with at least one programming language (e.g. Scala, Python/PySpark) plus SQL expertise
  • 2+ years experience with schema design, dimensional data modeling, and large-scale data warehousing architecture
  • Expertise in building data pipelines through efficient ETL design, implementation and maintenance
  • Background working with distributed data systems such as Spark, Presto, Hive, and Redshift. Experience with schedulers/workflow management tools is a plus
  • Excellent communication skills to collaborate with stakeholders in Engineering, Product, Data Science, Analytics/BI, and Operations


Compensation: $40 - $55.00 per hour

ID#: 2015

Apply Now
Share this job
Cypress HCM
An error has occurred. This application may no longer respond until reloaded. Reload 🗙