General Description:
We are seeking a DevOps Software Engineer – HPC Specialist to join our DevOps team, which supports and maintains high-performance computing (HPC) environments and secure CI/CD infrastructure for scientific research. This role demands expertise in Linux cluster administration, the Slurm workload manager, and DevOps tools such as GitLab CI/CD, Python, and JFrog Artifactory, all within a highly secure, air-gapped environment. You’ll also document complex systems and processes clearly for a variety of technical and non-technical audiences.
Essential Duties:
Administer and troubleshoot Linux-based HPC clusters running Slurm.
Manage and maintain Slurm configurations and job scheduling policies.
Collaborate with researchers to support scalable and automated scientific workflows.
Monitor and optimize HPC performance, capacity, and reliability.
Develop and automate cluster management tasks, including node provisioning, software deployment, and user environment setup.
Administer and troubleshoot CI/CD infrastructure across open and air-gapped networks.
Contribute to Infrastructure-as-Code (IaC) automation and system administration.
Collaborate with developers, system administrators, and research staff to support integrated platforms.
Write and maintain high-quality technical documentation.
Participate in Agile team activities to support iterative problem-solving and project delivery.
Required Skills:
Proven ability to communicate complex technical concepts clearly in both written and verbal formats.
Hands-on experience administering Slurm in HPC environments.
Knowledge of HPC environment architecture and common challenges in scientific computing.
Strong Linux system administration skills.
Proficiency in Python and at least one scripting language (e.g., Bash or PowerShell).
Experience with software packaging and environment management (e.g., Conda) in HPC contexts.
Strong troubleshooting, analytical, and problem-solving abilities.
Familiarity with air-gapped or high-security computing environments.
Experience working in research or scientific computing environments is highly desired.
Required Education:
BS with 2 years of experience, or MS, in computer science, computer engineering, or a related field. Candidates with different experience levels will be considered for other positions.
Special Requirements:
U.S. citizenship
Must be able to obtain and maintain a U.S. Government security clearance as required.
This is an on-site position due to the need to work with air-gapped networks and sensitive information.
Compensation:
The base salary range for this full-time position is $99,705 - $124,683 + bonus + benefits.
Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range during the hiring process. Please note that the compensation details listed reflect the base salary only, and do not include potential bonus or benefits.
We are proud to be an EEO/AA employer M/F/D/V. We maintain a drug-free workplace and perform pre-employment substance abuse testing.