Job Description
Guardant’s HPC team builds and operates the computational technology backbone of the company.
This includes scalable data storage that holds petabytes of genomics data, high-performance compute clusters running a custom bioinformatics pipeline in production and R&D environments, and the software infrastructure that hosts an ecosystem of services for internal data processing and external data integration. To facilitate Guardant Health’s fast growth in the next few years, the HPC team is looking for a strong technical engineer who can help maintain and grow the HPC infrastructure during its aggressive expansion, while working with the corporate IT, SQA, and DevOps/SRE teams.
This role can be performed remotely part of the time, but at a minimum it requires a very hands-on, on-premises presence when on rotation.
In this role, you will primarily:
Help manage multiple HPC clusters and cluster file systems.
Help research, develop, and implement the next HPC solution.
Troubleshoot the production system stack down to the source code level (e.g., shell scripts, Python, and others).
Maintain, monitor, and support the infrastructure environment and/or facilities.
Use and maintain enhanced production monitoring and additional capabilities.
Support improvements for increased system reliability and performance.
Support, in a senior role, multiple systems or applications of medium to high complexity (complexity defined by size, technology used, and system feeds and interfaces) with multiple concurrent users, ensuring control, integrity, and accessibility.
Work with offsite consultants to maintain the infrastructure.
Work with vendors to troubleshoot, upgrade, and repair systems as needed.
Participate in a 24/7 on-call rotation.