• Location Icon

    Boston, Massachusetts

  • Post Date Icon

    Posted Date: 09/12/2025

  • Catagory

    Consulting

  • Pay Icon

    $0-$0

Job Description

We are seeking an experienced Linux Systems Administrator with a strong background in managing high-performance computing (HPC) environments. This individual will play a key role in ensuring the reliability, scalability, and performance of complex compute clusters that support research and enterprise workloads.
Key Responsibilities:

  • Oversee the maintenance and administration of a Linux-based HPC environment.
  • Configure, monitor, and optimize cluster workloads using job scheduling tools (e.g., Slurm).
  • Support users with job submissions, troubleshooting, and permissions management.
  • Install, configure, and maintain Linux-based software environments to support a range of applications.
  • Collaborate with vendors and internal teams to ensure hardware and software stability.
  • Execute system upgrades, migrations, and validation procedures to maintain a secure, high-performing environment.
  • Monitor cluster health, proactively resolve performance issues, and respond to system incidents.
  • Manage and support HPC storage platforms, ensuring reliable data access for large-scale workloads.
  • Provide technical expertise for diagnosing hardware-related issues across supported systems.
Qualifications:
  • 10+ years of experience as a Linux Systems Administrator, ideally with HPC exposure.
  • Hands-on expertise with workload schedulers (preferably Slurm).
  • Strong background in shell scripting, cron jobs, and general automation practices.
  • Familiarity with HPC storage solutions such as WekaFS and CEPH.
  • Experience supporting or integrating Open OnDemand portals.
  • Ability to troubleshoot and work directly with server hardware (experience with Dell systems a plus).
  • Excellent problem-solving skills with a proactive approach to system health and user needs.
  • Ability to work on-site several days per week as part of a hybrid schedule.
Skills & Experience Advantageous:
  • Experience in academic, research, or enterprise HPC environments.
  • Broader scripting knowledge (Python, Perl, etc.) for advanced automation.
  • Exposure to modern HPC trends such as GPU acceleration, containerization, or cloud/HPC hybrid solutions.

Apply Now

Recruiter

Clayton Minnich

clayton.minnich@avidtr.com

Job ID: JN -092025-17366

Share this job

Related Jobs