Prabu Sekar — Software Engineer
With over two years at Core42, I specialize in deploying and configuring high-performance computing (HPC) clusters. My work centers on using Slurm to manage computational workflows across thousands of NVIDIA H100/H200 and AMD GPGPUs, and on validating performance with NCCL, RCCL, and HPL benchmarks. I also integrate Azure-based HPC clusters with distributed storage solutions for workloads. I diagnose and resolve complex issues across Slurm clusters, InfiniBand networks, and NVIDIA DGX/HGX systems, and I provide L3 operational support for HPC platforms, with a focus on system reliability and performance optimization.
Stackforce AI infers this person is a High Performance Computing Infrastructure Specialist.
Location: Abu Dhabi, United Arab Emirates
Experience: 6 yrs 6 mos
Skills
- High Performance Computing (HPC)
- Slurm Workload Manager
- Linux System Administration
Career Highlights
- Expert in deploying and configuring HPC clusters.
- Proficient in managing computational workflows with Slurm.
- Strong background in troubleshooting complex system issues.
Work Experience
Core42
Engineer - HPC Systems (2 yrs 3 mos)
Lenovo PCCW Solutions
Senior System Engineer (1 yr 2 mos)
Nanyang Technological University
Research Engineer, HPC (3 yrs 1 mo)
HARRINGTON HPC MICROSYSTEMS LTD
HPC Systems Engineer (2 yrs 5 mos)
Micropoint Computers Ltd.
HPC Technical Support Engineer (2 yrs 6 mos)
Indian Institute of Technology, Madras
Project Technician in HPC (2 yrs 10 mos)
Education
Master of Technology - MTech at Birla Institute of Technology and Science, Pilani
Bachelor of Technology - BTech at Birla Institute of Technology and Science, Pilani
Diploma at State Board of Technical Education and Training