Lead HPC Engineer

EPAM Systems β€’ Barra do Garças, Brazil

Company

EPAM Systems

Location

Barra do Garças, Brazil

Type

Full Time

Job Description

We are seeking a Lead HPC Engineer to oversee the day-to-day operations and engineering activities within our HPC environment.
The ideal candidate will be a skilled engineer with extensive experience in deploying and optimizing HPC infrastructure. This role involves working with our L3 HPC infrastructure engineering team to support the exploitation of an HPC cluster for our Scientific research team. Preference will be given to candidates located in India, although the position is open to applicants from any location.

#LI-DNI

Responsibilities

  • Support and maintain the HPC infrastructure
  • Implement infrastructure automation using IaC (Infrastructure as Code)
  • Resolve incidents and participate in software and hardware upgrades
  • Manage job scheduling and resource allocation with HPC job schedulers
  • Configure and install Bright Cluster Manager
  • Maintain and optimize GPFS/Lustre file systems
  • Oversee InfiniBand/OmniPath network interconnect configurations
Requirements

Want more jobs like this?

Get jobs in Barra do GarΓ§as, Brazil delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.
  • 7+ years of experience as an HPC general technical expert
  • Background in engineering or HPC system development
  • Proficiency in supporting and configuring HPC infrastructure
  • Proficiency in Linux (any rpm-based) including kernel modules compilation and debugging tools like strace, coredump, and tcpdump
  • Skills in managing HPC job schedulers including IBM LSF and Slurm
  • Competency in configuring and installing Bright Cluster Manager
  • Familiarity with both GPFS and Lustre file systems
  • Understanding of InfiniBand and OmniPath network interconnect technologies
Nice to have
  • Experience with hardware diagnostics, upgrades, and tuning including HCA InfiniBand and disk arrays from Lustre, Vast, IBM
  • Skills in infrastructure monitoring using Zabbix, Splunk, or Grafana
  • Familiarity with Easybuild
  • Experience working in a GxP environment
  • Knowledge of Jira and ServiceNow

Apply Now

Date Posted

01/25/2025

Views

0

Back to Job Listings ❀️Add To Job List Company Info View Company Reviews
Neutral
Subjectivity Score: 0

Similar Jobs

Partner Customer Success Manager - Qlik

Views in the last 30 days - 0

View Details

Operador(a) de Garantia - São Caetano do Sul - General Motors

Views in the last 30 days - 0

View Details

Market Development Manager - Brazil - Waters

Views in the last 30 days - 0

View Details

Materials Planner - GE Vernova

Views in the last 30 days - 0

View Details

Lead Manufacturing Specialist 2 - Prod Process and Equip_AVI - GE Aerospace

Views in the last 30 days - 0

View Details

Sales Specialist, Google Workspace for Education, Google Cloud - Google

Views in the last 30 days - 0

View Details