Job title - Senior Consultant-Data Engineer
Career Level - D2
Leverage technology to impact patients and ultimately save lives. Do you have expertise in, and passion for, information technology? Would you like to apply your expertise to impact the IT strategy in a company that follows the science and turns ideas into life-changing medicines? If so, AstraZeneca might be the one for you!
ABOUT ASTRAZENECA
AstraZeneca is a global, science-led, patient-focused biopharmaceutical company that focuses on the discovery, development, and commercialization of prescription medicines for some of the world’s most serious diseases. But we’re more than one of the world’s leading pharmaceutical companies. At AstraZeneca, we’re dedicated to being a Great Place to Work.
ABOUT OUR IT TEAM
It’s a dynamic and results-oriented environment to work in – but that’s why we like it. There are countless opportunities to learn and grow, whether that’s exploring new technologies in hackathons, or redefining the roles and work of colleagues, forever. Shape your own path, with support all the way. Diverse minds that work cross-functionally and broadly together.
At AstraZeneca, we're not just about creating life-changing medicines; we're about creating a culture of innovation and collaboration. We have taken on an ambitious goal of revolutionizing antibody discovery at AstraZeneca by significantly reducing the time it takes to discover a clinical candidate using world-class technology and advanced data & AI capabilities. To support this initiative, we are building Augmented Biologics Discovery Platform within the R&D IT. We're looking for a Data Engineer with a focus on data testing and validation. In this role, you will join a global team of engineers (software, data, MLOps), architects, BAs, PMs, in our Augmented Biologics Platform to support biologics and antibody drug discovery.
The following will form part of the role:
- Responsible for designing and implementing software-based solutions to make our science easier to do, easier to learn from, and offer faster delivery and higher quality across biologics discovery.
- Design and build novel products and features to address long-standing problems in drug discovery with a focus on producing amazing user experiences that are more than the sum of their parts.
- Collaborate with product, design, data science, and our scientific teams to build cutting-edge experiences and services.
- Propose and implement changes to our data models, core architecture, and codebase. Develop data products with state-of-the-art data technologies even if you’ve not worked with those before.
- Advocate and advance modern, agile software development practices and help develop and evangelize a vibrant software engineering culture.
- Collaborate in the transformation of an estate of siloed systems into an ecosystem that delivers great user experiences.
- Plan, implement, and support core infrastructure development collaboratively with an overall objective to improve the scalability, reliability, performance, and availability.
- Passionate to stay on top of tech trends, experiment with and learn new technologies, participate in internal & external technology communities, and mentor other members of the engineering community.
- Working with cutting-edge technology stack in a cloud environment.
The skills required:
- Experience designing, implementing data products supporting bespoke analytics and AI applications.
- Proficiency in languages such as Python, Java for data manipulation, analysis, and engineering tasks.
- Strong hands-on experience on python framework (Ex: Dataframes, Pandas, Regix, OracleDB etc.)
- Proficiency in database management systems such as SQL and NoSQL databases (e.g., Oracle, MySQL, PostgreSQL, MongoDB) for data storage and retrieval.
- Strong knowledge of creating and consuming APIs (FastAPI, RESTful, etc.) is important for integrating the backend with other systems and services.
- Familiarity with cloud platforms such as AWS for deploying and managing data engineering solutions.
- Solid experience using data-related services in AWS Data services like DMS, SCT, RDS, Aurora, Redshift etc.
- Experience of data analysis – profiling, investigating, interpreting, and documenting data structures.
- Experience in performance tuning SQL and understanding ETL pipelines. Knowledge of ETL processes and tools (e.g., Informatica, Talend, etc.) for data ingestion, transformation, and loading into target data systems.
- Extensive experience in troubleshooting data issues, analyzing end-to-end data pipelines and working with users in resolving issues.
- Good experience in consuming or exposing web services (i.e. SOAP, REST).
- Proficiency in using version control systems like Git for managing codebase changes and collaboration with other data engineers and developers.
- Excellent verbal and written skills for effective communication with a variety of personas including engineers, testers, architects, product managers, scientists, etc.
The following skills would be advantageous for your application but are not considered essential:
- Design, develop, and deploy production-grade scalable data products using container technologies like Docker and Kubernetes with a focus on data testing and validation.
- Production experience delivering CI/CD pipelines (Jenkins, ArgoCD, TravisCI, Git).
- Experience working in Bioinformatics / Computational or molecular biology domain.