HPC Software Engineer II

Company:  The University of Texas at Dallas
Location: Richardson
Closing Date: 27/10/2024
Salary: £100 - £125 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

Functional Title: High Performance Computing Software Engineer II

Department: High Performance Computing

Salary Range: $90K-$105K DOQ

Pay Basis: Monthly

Position Status: Regular full-time

Location:

Posting Open Date: 07/31/2024

Posting Close Date: Open Until Filled

Desired Start Date: 08/19/2024

Job Summary: Reporting to the Director High Performance Computing (HPC) Facilitation, this is a mid-level HPC systems software engineer position with a required background in HPC environments. To collaborate with and support our customers, this engineer must have demonstrated a consultative customer service attitude in prior roles in similar organizations. Primary responsibilities include:

  • Installing and managing software to support researchers.
  • Lifecycle management of software including patching and versioning.
  • Tailoring installations and management according to researcher needs while leveraging HPC systems capabilities.
  • Training users in software use through workshops.
  • Preparing software documentation.
  • Troubleshooting user software issues.
  • Planning software use in HPC environments.

The applicant must have broad industry knowledge of hardware and/or software services involved in building and operating HPC environments.

Minimum Education and Experience:

  • High school diploma with eight (8) years recent applicable experience.
  • Associate Degree with six (6) years recent applicable experience.
  • Bachelor’s Degree with four (4) years recent applicable experience.

Preferred Education and Experience:

  • Master’s degree in Computer Science or equivalent with two years of experience in corresponding research services, support efforts, products, and technologies.
  • Current knowledge of HPC best practices and systems deployment and maintenance.
  • Troubleshooting methodology and awareness of industry standards.
  • Excellent interpersonal, written, and verbal communication skills.
  • Good technical documentation, architecture diagramming, and organizational skills.
  • Ability to manage support tickets and prioritize varied scope, scale, and technical requirements.
  • Familiarity with data center operations fundamentals in networking and power.
  • Experienced in Linux operating system for installing and managing software.
  • Ability to package scientific software into RPMs and containers.
  • Familiarity with Apptainer/Singularity HPC/Docker/Kubernetes.

Essential Duties and Responsibilities:

  • Be a software support engineer for the university HPC cluster or services with support from core HPC team for complex scenarios.
  • Respond to user tickets from faculty and students.
  • Act as a role model in demonstrating integrity and ethical behavior.
  • Assist in the development and implementation of internal policies for HPC and cyber infrastructure.
  • Perform annual updates and expert level software coding (prefer Python, Linux Shell, etc.) in at least two languages.
  • Perform installation, configuration, updating, performance monitoring, and troubleshooting of software on HPC systems.
  • Assist users with optimizing their software use for underlying system capabilities.
  • Assist customers with porting and optimizing code.
  • Ability to package scientific software into RPMs and integrate with Lmod.
  • Able to compile, test, and install related open-source scientific software packages as requested.

Knowledge, Skills and Ability:

  • Familiarity with at least two high-performance cluster operating systems.
  • Familiarity with large scale high performance parallel file storage systems.
  • Experience with installing and supporting open source and commercial research related software.
  • Experience with all related dev ops tools.
  • Experience with SLURM job scheduler.

Physical Demands and Working Conditions:

  • On-call availability for quickly responding to system software emergencies.
  • Hybrid remote work available for Texas residents.
  • Sitting for extended periods and handling computer components.
  • Occasional lifting and transporting of moderately heavy objects.
  • Visa sponsorship is not available.

What We Can Offer:

UT Dallas is an Equal Opportunity Employer with an employee-friendly and supportive work environment. Our comprehensive compensation and benefits package includes:

  • Internal Training
  • BCBS PPO Medical insurance – 100% paid for full-time employees
  • PPO and DHMO Dental Insurance Plan
  • Vision Insurance
  • Long and short-term disability
  • TRS Retirement Plan
  • Voluntary Retirement Plan Options
  • Paid time off
  • Paid Holidays
  • Paid Winter Break
  • Fertility Benefits
  • Remote Work Options

All UT Dallas employees have access to various professional development opportunities.

If you are looking for a rewarding career opportunity with great benefits, look no further! Join our team!

Special Instructions Summary:

  • All employees serve as a representative of the University and are expected to display respect, civility, professional courtesy, and discretion in all interactions.
  • The University of Texas at Dallas is committed to providing an educational, living, and working environment that is welcoming and inclusive.

Supplemental Questions:

Required fields are indicated with an asterisk (*).

Required Documents:

  • Resume
  • Cover Letter/Letter of Application

Optional Documents:

  • Veteran Employment Preference - Form DD-214

Human Resources,
800 West Campbell Road, AD3.418
Richardson,
TX 75080-3021

#J-18808-Ljbffr
Apply Now
Share this job
The University of Texas at Dallas
  • Similar Jobs

  • HPC Software Engineer II

    Richardson
    View Job
  • HPC Software Engineer II

    Richardson
    View Job
  • HPC Software Engineer II

    Richardson
    View Job
  • Software Engineer II (Richardson, TX)

    Richardson
    View Job
  • Software Engineer

    Richardson
    View Job
An error has occurred. This application may no longer respond until reloaded. Reload 🗙