HPC Software Engineer II

Company:  University of Texas at Dallas
Location: Richardson
Closing Date: 16/10/2024
Salary: £100 - £125 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

Posting Details

Posting Number: S05840P

Position Title: HPC Software Engineer II

Functional Title: High Performance Computing Software Engineer II

Department: High Performance Computing

Salary Range: $90K-$105K DOQ

Pay Basis: Monthly

Position Status: Regular full-time

Location: Richardson

Posting Open Date: 07/31/2024

Open Until Filled: Yes

Desired Start Date: 08/19/2024

Job Summary

Reporting to the Director High Performance Computing (HPC) Facilitation, this is a mid-level HPC systems software engineer position with a required background in HPC environments. The engineer must demonstrate a consultative customer service attitude in prior roles in similar organizations. Primary responsibilities include:

  1. Installing and managing software to support researchers.
  2. Lifecycle management of software including patching and versioning.
  3. Tailoring installation and management according to researcher needs while leveraging HPC systems capabilities.
  4. Training users in the use of software through workshops.
  5. Preparing software documentation.
  6. Troubleshooting user software issues.
  7. Planning software use in HPC environments.

The applicant must have broad industry knowledge of hardware and/or software services involved in building and operating HPC environments.

Minimum Education and Experience

High school diploma with eight (8) years recent applicable experience.
Associate Degree with six (6) years recent applicable experience.
Bachelor's Degree with four (4) years recent applicable experience.

Preferred Education and Experience

Preferred Qualifications:

  • Master's degree in Computer Science or equivalent with two years of experience in corresponding research services, support efforts, products, and technologies.
  • Current knowledge of HPC best practices and systems deployment and maintenance.
  • Troubleshooting methodology and awareness of industry standards.
  • Excellent interpersonal, written, and verbal communication skills.
  • Good technical documentation, architecture diagramming, and organizational skills.
  • Ability to manage support tickets and prioritize considering varied scope, scale, and technical requirements.
  • Familiarity with data center operations fundamentals in networking and power.
  • Experience in Linux operating system for installing and managing software.
  • Ability to package scientific software into RPMs and containers (and integrate with Lmod for module loading).
  • Familiarity with Apptainer/Singularity HPC/Docker/Kubernetes.

Essential Duties and Responsibilities

  • Be a software support engineer for the university HPC cluster or services with support from the core HPC team for complex scenarios.
  • Respond to user tickets from faculty and students. Level 2 support experience at a scale of 1 to 3 with 3 being a senior specialist.
  • Act as a role model in demonstrating integrity and ethical behavior in working with confidential and university information.
  • Assist in the development and implementation of internal policies, rules, and operation procedures for HPC and cyber infrastructure.
  • Perform annual updates and expert level software coding (prefer Python, Linux Shell, etc.) in at least two or more languages.
  • Perform installation, configuration, updating, performance monitoring, and troubleshooting of software on HPC systems.
  • Assist users with optimizing their software use for the underlying system capabilities.
  • Assist customers with porting and optimizing code including selection and usage of appropriate development tools.
  • Ability to package scientific software into RPMs and integrate with Lmod.
  • Able to compile, test, and install various open-source scientific software packages as requested by research faculty, staff, and students.

Knowledge, Skills and Ability:

  • Familiarity with at least two high-performance cluster operating systems such as OpenHPC, ROCKS, Bright/Nvidia Cluster Manager.
  • Familiarity with large scale high performance parallel file storage systems such as WEKA, VAST, GPFS, BGFS, CEPH.
  • Experience with installing and supporting open source and commercial research related software, Python, R, Matlab, Mathworks, Julia, Ansys, Intel, nVidia CUDA and GCC compilers.
  • Experience with all related dev ops tools such as GitHub, GitLab, Ansible, package management tools for RPM and/or DEB package building.
  • Experience with SLURM job scheduler.

Physical Demands and Working Conditions

On-call availability for quickly responding to and resolving system software emergencies, both during regular and emergency off-hours.

Hybrid remote work available for Texas residents with further discussion and agreement.

Sitting for extended periods. Dexterity of hands and fingers to operate a computer keyboard, mouse, and power tools, and to handle other computer components.

Occasional lifting and transporting of moderately heavy objects, such as computers, switches, and peripherals.

Visa sponsorship is not available.

What We Can Offer

UT Dallas is an Equal Opportunity Employer with an employee-friendly and supportive work environment. Our comprehensive compensation and benefits package includes:

  • Competitive Salary
  • Tuition Benefits
  • Internal Training
  • BCBS PPO Medical insurance - 100% paid for full-time employees
  • PPO and DHMO Dental Insurance Plan - PPO plans include ortho benefits
  • Vision Insurance
  • Long and short-term disability
  • TRS Retirement Plan - defined benefit plan offering lifetime annuity upon retirement
  • Voluntary Retirement Plan Options - additional savings opportunities with Tax-Sheltered Annuity Plans and Deferred Compensation Plans
  • Dental/Vision/AD&D
  • Paid time off
  • Paid Holidays
  • Paid Winter Break
  • Fertility Benefits
  • Remote Work Options - approved by department

All UT Dallas employees have access to various professional development opportunities, including a membership to Academic Impressions, LinkedIn Learning, and UT Dallas Bright Leaders Program.

Visit for more information.

If you are looking for a rewarding career opportunity with great benefits, look no further! Join our team!

Special Instructions Summary

Important Message

1) All employees serve as a representative of the University and are expected to display respect, civility, professional courtesy, consideration of others, and discretion in all interactions with members of the UT Dallas community and the general public.
2) The University of Texas at Dallas is committed to providing an educational, living, and working environment that is welcoming, respectful, and inclusive of all members of the university community. UT Dallas does not discriminate on the basis of race, color, religion, sex (including pregnancy), sexual orientation, gender identity, gender expression, age, national origin, disability, genetic information, or veteran status in its services, programs, activities, employment, and education, including in admission and enrollment. EOE, including disability/veterans. The University is committed to providing access, equal opportunity, and reasonable accommodation for individuals with disabilities. To request reasonable accommodation in the employment application and interview process, contact the ADA Coordinator. For inquiries regarding nondiscrimination policies, contact the Title IX Coordinator.

#J-18808-Ljbffr
Apply Now
Share this job
University of Texas at Dallas
  • Similar Jobs

  • HPC Software Engineer II

    Richardson
    View Job
  • HPC Software Engineer II

    Richardson
    View Job
  • HPC Software Engineer II

    Richardson
    View Job
  • Software Engineer

    Richardson
    View Job
  • Software Engineer

    Richardson
    View Job
An error has occurred. This application may no longer respond until reloaded. Reload 🗙