R

RAHUL KUMAR JAISWAL

AI Researcher

Bengaluru, Karnataka, India9 yrs 1 mo experience

Key Highlights

  • 9 years of High Performance Computing expertise.
  • Proficient in managing and optimizing HPC clusters.
  • Strong background in automation and system monitoring.
Stackforce AI infers this person is a High Performance Computing specialist in the Fintech sector.

Contact

Skills

Core Skills

High Performance Computing (hpc)Cluster ManagementCloud Computing

Other Skills

Active Directory in WindowsAnsibleAuditingComputer NetworkingComputer SecurityCray XC40Cyber ForensicsDockerDomain Name System (DNS)Dynamic Host Configuration Protocol (DHCP)ELK StackFTPGangliaGrafanaHPC

About

* Around 9 years of High Performance Computing administration experience. * Excellent knowledge in designing, prototyping and deploying HPC clusters. * Strong understanding of cluster resource managers, job schedulers, clusterware and GPU computing. * Experience in benchmarking and performance optimization of large-scale HPC systems. * Experience in installing and managing high performance storage and network interconnects. * Extensive experience in developing Linux installers for cluster software and OS deployment and automation. * Experience in building computer labs ground up, capacity planning and installing racks. * Extensive experience in troubleshooting Linux OS, filesystems, cluster hardware, scripting and GPU computing hardware. * Ability to create, maintain and implement scripts in order to reduce administrative efforts. * Ability to operate in multi-platform, multi operating system, multi-component environment utilizing a large number of server builds and configurations. * Experienced in project management skills and the demonstarted ability to drive for results. * Excellent interpersonal, communication, customer interaction, documentation skills and design making ability. * Experienced in organising and delivering the HPC systems related workshops to the users.

Experience

9 yrs 1 mo
Total Experience
2 yrs 4 mos
Average Tenure
11 mos
Current Experience

Nvidia

Senior AI-HPC Cluster Engineer

May 2025Present · 11 mos

Graviton research capital llp

Senior HPC Engineer

May 2023Jul 2025 · 2 yrs 2 mos · Gurugram, Haryana, India

  • Managed and maintained the compute clusters used for research and trading simulations, ensuring optimal performance, uptime, and resource utilization.
  • Configured and tuned Slurm workload manager to efficiently schedule thousands of compute jobs daily. Created custom Slurm scripts and automated job monitoring and debugging.
  • Developed and maintained Ansible playbooks for provisioning, configuration management, and automated deployment of compute/storage/network resources.
  • Set up and maintained system health monitoring using tools like Grafana, Prometheus, ELK Stack, and custom alerting mechanisms to detect failures and anomalies proactively.
  • Administered high-performance file systems including Lustre and WekaIO, handling performance tuning, upgrades, quota management, and incident troubleshooting.
  • Worked closely with quant researchers, analysts, and developers to resolve system issues, debug stuck jobs, and optimize application workflows.
  • Maintained secure access controls, audited system usage, and ensured compliance with internal security policies.
  • Collaborated with DevOps and trading infrastructure teams to support CI/CD pipelines, containerized environments, and low-latency trading systems.
SlurmAnsibleGrafanaPrometheusELK StackLustre+3

Tata consultancy services

HPC and Cloud Consultant

Jun 2022May 2023 · 11 mos · Pune, Maharashtra, India · On-site

  • HPC System Engineer
Rancher Kubernetes ManagementInfinibandLustre File SystemKubernetesDockerSlurm+7

Hewlett packard enterprise

System Specialist

Jan 2021May 2022 · 1 yr 4 mos · India

Indian institute of technology, delhi

HPC System Engineer

Jun 2017Jan 2021 · 3 yrs 7 mos · New Delhi, Delhi, India

  • On deputation from CDAC, Pune to IIT Delhi.

Centre for development of advanced computing (c-dac)

Project Engineer (HPC)

Feb 2017Jan 2021 · 3 yrs 11 mos · Pune, Maharashtra, India

Network bulls

CCNA (Routing and Switching)

Jul 2014Sep 2014 · 2 mos

Education

Centre for Development of Advanced Computing (C-DAC)

PG Diploma in IT Infrastructure

Jan 2016Jan 2017

I.E.T, Dr. Ram Manohar Lohia Awadh University, Ayodhya (Faizabad)

Bachelor of Technology (B.Tech.) — Information Technology

Jan 2012Jan 2016

Bishop George School and College

Intermediate (ISC Board) — Mathematics and Science

Mar 2011Present

Bishop George School and College

High School (ICSE Board) — General

Mar 2009Present

Stackforce found 100+ more professionals with High Performance Computing (hpc) & Cluster Management

Explore similar profiles based on matching skills and experience