HPC Sr. Systems Engineer

$85,500 - $149,800/Yr

Johns Hopkins - Baltimore, MD

posted 2 months ago

Full-time - Mid Level
Baltimore, MD
Educational Services

About the position

The HPC Sr. Systems Engineer at Johns Hopkins University is responsible for the strategic planning, design, testing, organization, and implementation of advanced technology projects for the Advanced Research Computing at Hopkins (ARCH) group. This role involves the administration of high-performance computing (HPC) clusters and storage systems, ensuring their stability and availability for researchers. The engineer will also engage in technological research and training to enhance the HPC environment and support the needs of the academic community.

Responsibilities

  • Design, organize, plan, test and implement cutting-edge hardware designs for an HPC environment.
  • Extensively document systems processes for user accessibility and staff backup.
  • Provide stable solutions for HPC resources.
  • Maintain job scheduling and storage allocation systems and policies.
  • Maintain extensive monitoring systems for proactive responses to failures and performance data logging.
  • Provide general system administration backup and escalation for other staff.
  • Consult with building engineers on new facilities under MARCC control.
  • Assist with facilities-related issues affecting MARCC.
  • Ensure resources meet community needs and are highly available.
  • Manage inventory of resources in coordination with vendors.
  • Automate user account creation, management, and purging.
  • Contribute to planning sessions on network and security issues for MARCC.
  • Implement network configuration and security measures.
  • Create and maintain a stable, secure operating system and software environment.
  • Implement and maintain secure measures to protect data subject to restrictions.
  • Manage data access restrictions on a per user and group basis.
  • Implement and maintain monitoring measures for data and system access.
  • Perform other systems tasks as assigned by supervisor.
  • Architect future clusters and plan the retirement of aging systems.
  • Offer technical advice on new HPC projects.
  • Develop custom tools and contribute to open-source efforts.
  • Research and implement new technologies for HPC.
  • Test and vet new technology in support of HPC efforts.
  • Work with vendors to procure prototypes and demo units.
  • Continuously evaluate new tools and technologies for clusters.
  • Attend training to increase knowledge and skills.

Requirements

  • Bachelor's Degree.
  • Six years related experience in HPC or similar roles.
  • Proven experience deploying large-complex scale projects.
  • In-depth knowledge of HPC cluster hardware and management software.
  • Expert knowledge of Unix/Linux systems administration.
  • Expert knowledge of networking and network security principles in an HPC environment.
  • Use of configuration management tools for maintaining large-scale systems.
  • Ability to troubleshoot and support job scheduling and resource management systems.
  • Advanced knowledge of Linux, Apache, SQL, PHP/Python/Perl (LAMP) technology/toolkits.

Nice-to-haves

  • Seven years managing Linux servers with HPC clusters experience.
  • Experience as a high-level Linux system administrator.
  • Familiarity with configuration of the HPC software stack including MPI and OpenMP.
  • Experience with open-source software compilation.
  • In-depth knowledge of TCP/IP networking and related protocols.
  • Experience with scientific application management packages.
  • Excellent scripting skills in Python, Perl, shell.
  • Programming skills in C, C++, or scientific language.
  • Experience with MySQL or MariaDB database programming.
  • Expert level knowledge of configuration management and monitoring tools.

Benefits

  • Competitive salary range of $85,500 - $149,800 annually based on experience.
  • Full-time position with a 37.5 hours/week schedule.
  • Hybrid work location options.
Job Description Matching

Match and compare your resume to any job description

Start Matching
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service