-
Boston, Massachusetts
-
Posted Date: 09/12/2025
-
Consulting
-
$0-$0
Job Description
We are seeking an experienced Linux Systems Administrator with a strong background in managing high-performance computing (HPC) environments. This individual will play a key role in ensuring the reliability, scalability, and performance of complex compute clusters that support research and enterprise workloads.
Key Responsibilities:
- Oversee the maintenance and administration of a Linux-based HPC environment.
- Configure, monitor, and optimize cluster workloads using job scheduling tools (e.g., Slurm).
- Support users with job submissions, troubleshooting, and permissions management.
- Install, configure, and maintain Linux-based software environments to support a range of applications.
- Collaborate with vendors and internal teams to ensure hardware and software stability.
- Execute system upgrades, migrations, and validation procedures to maintain a secure, high-performing environment.
- Monitor cluster health, proactively resolve performance issues, and respond to system incidents.
- Manage and support HPC storage platforms, ensuring reliable data access for large-scale workloads.
- Provide technical expertise for diagnosing hardware-related issues across supported systems.
- 10+ years of experience as a Linux Systems Administrator, ideally with HPC exposure.
- Hands-on expertise with workload schedulers (preferably Slurm).
- Strong background in shell scripting, cron jobs, and general automation practices.
- Familiarity with HPC storage solutions such as WekaFS and CEPH.
- Experience supporting or integrating Open OnDemand portals.
- Ability to troubleshoot and work directly with server hardware (experience with Dell systems a plus).
- Excellent problem-solving skills with a proactive approach to system health and user needs.
- Ability to work on-site several days per week as part of a hybrid schedule.
- Experience in academic, research, or enterprise HPC environments.
- Broader scripting knowledge (Python, Perl, etc.) for advanced automation.
- Exposure to modern HPC trends such as GPU acceleration, containerization, or cloud/HPC hybrid solutions.
Apply Now
Recruiter
Clayton Minnich
clayton.minnich@avidtr.com
Job ID: JN -092025-17366
Find Jobs Faster
Login or sign up to create customized job alerts to be notified first.
Share this job