TheData Engineer role is responsible for planning, developing and maintenance of the research and development data infrastructure, consisting of multiple database architectures. The role is a member of the bioinformatics team, working to ensure database integrity while automating systems to ingest data from various data sources.
This role is pivotal in designing and maintaining a scalable and secure data infrastructure that drives critical research and development initiatives. By ensuring data integrity and automating data ingestion from various sources, the position plays a key role in streamlining processes and enabling high-quality data analysis. It supports cross-functional teams and directly impacts the company's ability to make data-driven decisions and advance scientific discoveries.
Location note: This role requires on-site work at our Carlsbad office with hybrid flexibility and candidates must live within commuting distance.
Responsibilities
- Create and maintain data pipeline architecture, maintaining comprehensive documentation of data sources, data flows and data models
- Scope and develop process improvements, including automation of manual data ingestion processes
- Develop and maintain data governance procedures, ensuring data quality, consistency, and governance through the implementation of data standards and best practices.
- Collaborate with various teams (Laboratory Information Systems, Clinical Affairs, Research & Development and Billing) to prioritize, plan and execute project tasks aligned with company objectives
- Manage data security within R&D data systems using best practices
- Coordinate with bioinformatics team to execute data curation for statistical analysis and associated documentation
Minimum Qualifications
- Expertise in SQL using SSMS and MySQL, including TSQL for updating tables, inserting data, and creating views.
- Proficiency in understanding database structures through information schemas and primary keys.
- Knowledge of database maintenance best practices and the ability to execute them.
- Skill in manually normalizing data and automating data imports into appropriate tables.
- Ability to collaborate with various teams (Laboratory Information Systems, Clinical Affairs, R&D) to understand and implement their projects within the desired timeframes.
- Data Visualization: Create dashboards and reports using tools like Power BI.
Preferred Qualifications:
- Competence in using MirthConnect/JavaScript and handling various file types (.csv, XML, JSON) for data transfer and transformation.
- Experience with Python or R programming for data manipulation, analysis, and automation.
- Experience working with biological data, including familiarity with handling large, complex datasets in different units and formats typical in research or clinical environments.
Pay range: $43.27/hr - $52.88/hr
#J-18808-Ljbffr