JobsEQ by Chmura Logo

HPC Systems - Linux Administrator

Johnson Service Group

Location: Tampa, Florida
Type: Full-Time, Non-Remote
Posted on: February 17, 2021
This job is no longer available from the source.

Employee Type:
Full-Time

Location:
Tampa, FL

Job Type:
Other Area(s)

Experience:
Not Specified

Date Posted:
2/17/2021
Job Description JobID: 382521
Johnson Service Group is looking for a High Performance Systems Administrator (Linux) for a 6-month Contract to DIRECT hire role in Tampa, FL.
These three things are an absolute along with a matching of the requirements below:
• Experience with Linux/UNIX environment
• Experience scheduling and reviewing in a HPC environment
• Some Linux Desktop Ticket Support
• Computation resource to multi user and parsed out resources
Job Description
• Designs, develops, evaluates, and modifies software packages for the solution of engineering problems and for the support of R&D.
• Analyzes existing systems and formulates logic for new systems.
• Devises logical procedures, prepares flow charts, performs coding, tests, and debugs programs.
• Will provide input for the documentation of new or existing programs, and determines system specifications, input/output processes, and working parameters for hardware/software compatibility.
• Contribute to decisions on policies, procedures, expansion strategies, and product evaluations for the HPC resources.
• The role is focused primarily on working with Cluster system. The individual must exercise their own judgment for daily implementation and maintenance.
• Applies technical expertise and background to work within a team of pure and applied scientists and software engineers to consult for and support scientific researchers who use resources.
• Creates and optimizes computational solutions to the specific scientific computing needs of each constituency; ensuring that the appropriate technology resources are identified and utilized optimally. This position consults on applicable software packages and algorithms, and assists in optimizing them for scalability and (massive) parallelization as needed.
TYPE OF EXPERIENCE:
Required Qualifications:
• Minimum of 5 years experience designing, developing and successfully administering or supporting unix-based systems
• Well-versed in cluster development methodologies in particular open source operating systems, tools, languages and frameworks for cluster environments.
Proficient in one or more of the following languages:
• Python
• Shell scripting (i.e., BASH, CSH, KSH, etc.)
• Java
• C/C++
Proficient in one or more of the following tools:
• GIT
• Trac
• Modules
• Fabric/Puppet or other code deployment tools
Familiarity in one or more of the following Packages:
• Matlab (or any open source similar tools like Octave, SciLab)
• Familiarity with one or more of the following HPC cluster scheduling software:
• Altair PBS Works, OpenPBS, or MOAB suites.
• Working knowledge of computer hardware, networking concepts and tools
Preferred Qualifications:
• Experience with the Rocks Avalanche Installer for distributing OS images to cluster compute nodes is preferred
• Experience with cluster filesystems and enterprise storage systems (e.g., Isilon).
• Experience working in a scientific computing environment, particularly in an academic or research setting.
• Ability to use logical, principled approach in troubleshooting software and/or hardware problems.
• Ability to navigate a heterogeneous computing environment including UNIX/Linux, Mac OS and Windows
SOFTWARE:
• Configures, debugs and ensures stable operation of High Performance Computing cluster tools such as Rocks, Avalanche, Ganglia, TORQUE and MOAB utilities.
• Configures, optimizes and ensures stable operation of MATLAB and open source equivalents such as Octave and SciLab.
• In addition, ensures that any added module or toolbox needed for this software is working properly, including necessary licenses.
• Configures, optimizes and ensures stable operation and availability of Message Passing Interface (MPI) such as OpenMP libraries and utilities.
• Configures, optimizes and ensures stable operation and availability of the R statistical package and necessary extra modules needed by the cluster users.
• Configures, optimizes and ensures stable operation of GPU related software and libraries such as CUDA and others.
• Configures, optimizes and ensures stable operation of specific software needed for mathematical oncology, bioinformatics, bio-statistics and any other groups of research involved in the use of the cluster.
• Creates deployment scripts that facilitate the deployment of commercial off-the-shelf (COTS) and custom applications.
• Estimates time and effort involved in realizing new cluster capabilities for enterprise-level resource allocation, project planning and forecasting purposes.
• Automates solutions for routine tasks such as system deployments, database backups and open source software provisioning.
• Collaborates with software developers to build, maintain, test and deploy user-friendly web-based interfaces that simplify scientists' views of their data and workflows.
• Educates cluster user community on the optimal use of the cluster's computational resources via one-on-one collaboration, workshops and preparation of relevant documentation and tutorials.
• Performs other duties as assigned.
#D600