Experience Inc. Jobs

Job Information

Cognizant Technology Solutions HPC Engineer in New York, New York

Cognizant is seeking a HPC Engineer for a full-time opportunity in New York City, NY.

The salary range for this role is between $133,000 to $154,000 depending on experience and qualifications of the candidate.

Applications will be accepted till 2/22/2025.

Benefits: Cognizant offers the following benefits for this position, subject to applicable eligibility requirements:

  • Medical/Dental/Vision/Life Insurance
  • Paid holidays plus Paid Time Off
  • 401(k) plan and contributions
  • Long-term/Short-term Disability
  • Paid Parental Leave
  • Employee Stock Purchase Plan

Disclaimer: The benefits information is accurate as of the date of this posting. Cognizant reserves the right to modify this information at any time, subject to applicable law.

The HPC Engineer will support day-to-day operations of large-scale parallel file systems, deploy and maintain Linux HPC infrastructure across multiple data centers.

Key Responsibilities:

* Design architect and oversee implementation of Linux based HPC clusters and storage.

* Deploy physical hardware using HPC deployment tools and configuration and orchestration tools (Ansible).

* Parallel file system (GPFS) performance tuning monitoring and troubleshooting.

* Perform systems benchmarking and developing automated tests for the HPC environment ensuring the reliability and efficiency of our computational infrastructure.

* InfiniBand network maintenance and troubleshooting.

* Automate and monitor the HPC user lifecycle process.

* Slurm installation configuration performance tuning and troubleshooting.

* Plan design and implement a transition from the LSF scheduler to Slurm.

* Manage the Slurm scheduler and translate Research policies into scheduler configurations.

* Consult with faculty and students to develop research pipelines for use on the HPC cluster.

* Develop and maintain user lifecycle software suite in Python implement CI/CD pipeline

* Test and automate upgrades of critical system applications using Ansible and shell scripts.

Qualifications:

* Experience working in large-scale research based HPC environment

* Proven experience working with distributed file storage solutions (i.e. GPFS)

* Experience with deploying and troubleshooting Linux Operating Systems (RHEL/CentOS)

* Experience with Scripting and Automation (Ansible Python Shell Scripting)

* Solid understanding of job schedulers (LSF/SLURM)

* Experience with GPU-based compute infrastructure (including CUDA

* Ability to communicate effectively with clinician's researchers and other team members to develop technological solutions.

Cognizant is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected Veteran status, age, or any other characteristic protected by law.

Minimum Salary: 34320.00 Maximum Salary: 34320.00 Salary Unit: Yearly

DirectEmployers