HPC Linux Systems Administrator at Kforce Inc

Posted in Other about 3 hours ago.

Location: Boulder, Colorado





Job Description:


RESPONSIBILITIES:

Kforce has a client that is seeking an HPC Linux Systems Administrator in Boulder, CO.

Overview:
Our work depends on a Systems Engineer joining our team to support the National Oceanic and Atmospheric Administration (NOAA) Research and Development High Performance Computing Systems (RDHPCS) customer at the NOAA Global Systems Laboratory in Boulder, Colorado.

The qualified candidate will bring their hands-on technical and system administration expertise on-site to maintain the operational readiness and availability of NOAA's high performance computing systems, manage and support new technology insertions, and provide remote technical support and collaboration with our other supported NOAA sites at Fairmont, West Virginia and Princeton, New Jersey.

We are looking for an individual to join Kforce's team to deploy, operate, and support leading-edge technology for NOAA RDHPCS. Specific technology training will be provided.

How a Systems Engineer advisor will make an impact:


  • Apply current systems administrative skills

  • Learn and deploy new technologies

  • Develop and deploy monitoring capabilities

  • Develop and implement tools for cluster administration

  • Provide technical support with a team of HPC System and Storage Administrators to resolve operational issues

  • Solve problems and troubleshoot independently to advance quickly toward viable resolutions

  • Perform hardware break/fix support, which may include node, blade, or board-level replacements, replacement of backplanes, failed DIMMs, hard drives, controller boards, failed cables, network switches, and other failed components

  • Manage and maintain spare part inventories

  • Perform tracking, shipping, and receiving of vendor RMAs

  • Develop, improve, and enhance user and system administration online documentation repositories

  • Support HPC system users by leveraging the helpdesk ticketing system







REQUIREMENTS:



  • Bachelor's degree or 8+ years of experience

  • Experience with Systems Administration or IT support with diverse responsibilities

  • Hands-on experience with computer hardware maintenance and troubleshooting, such as identifying and replacing failed processors, DIMMs, disk drives, PCIe cards, and other field-replaceable components

  • Programming or scripting knowledge in at least one language (e.g., Bash, Perl, Python)

  • Demonstrated experience deploying and managing large-scale HPC systems using OS provisioning tools (e.g., xCAT, Warewulf)

  • Demonstrated experience using configuration management tools (e.g., Ansible, Puppet)

  • Linux system administration experience (e.g., RedHat or Rocky Linux)

  • Batch management/scheduling experience, Slurm preferred

  • Network interconnect configuration and monitoring experience (e.g., InfiniBand, Ethernet)

  • Strong writing skills for technical documents, system procedures, user wikis, and FAQs

  • Applicants selected will be subject to a government security investigation and must meet eligibility requirements for access to classified information

Other specific skills or competencies:

  • Team player with the ability to work with a diverse team in both local and remote technical support environments

  • Resourceful with initiative to perform independent technical troubleshooting and identify/recommend solutions and improvements

  • Willingness and motivation to learn, grow, and retain and apply knowledge acquired towards future projects

  • Disciplined troubleshooting skills balanced with creative problem-solving skills to tackle highly complex large-scale technical problems

  • Attention to detail in areas such as time management, pre-planning, analytical thinking, observation, and active listening






The pay range is the lowest to highest compensation we reasonably in good faith believe we would pay at posting for this role. We may ultimately pay more or less than this range. Employee pay is based on factors like relevant education, qualifications, certifications, experience, skills, seniority, location, performance, union contract and business needs. This range may be modified in the future.



We offer comprehensive benefits including medical/dental/vision insurance, HSA, FSA, 401(k), and life, disability & AD&D insurance to eligible employees. Salaried personnel receive paid time off. Hourly employees are not eligible for paid time off unless required by law. Hourly employees on a Service Contract Act project are eligible for paid sick leave.



Note: Pay is not considered compensation until it is earned, vested and determinable. The amount and availability of any compensation remains in Kforce's sole discretion unless and until paid and may be modified in its discretion consistent with the law.



This job is not eligible for bonuses, incentives or commissions.



Kforce is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status.



By clicking "Apply Today" you agree to receive calls, AI-generated calls, text messages or emails from Kforce and its affiliates, and service providers. Note that if you choose to communicate with Kforce via text messaging the frequency may vary, and message and data rates may apply. Carriers are not liable for delayed or undelivered messages. You will always have the right to cease communicating via text by using key words such as STOP.




