Hari Hud — Software Engineer
Senior Software Engineer with 10+ years of experience building scalable distributed systems, MLOps platforms, and cloud-native infrastructure, most recently at NVIDIA. Expertise in backend development, CI/CD, DevOps, infrastructure automation, and large-scale data pipelines. Proven track record of delivering 10× performance improvements, reducing processing time from weeks to days, and building secure, production-grade platforms used across organizations.
For the past 7+ years, I have been with NVIDIA, contributing to various projects within the platform development team. Currently, I am building a unified MLOps platform for model development, data preparation, model training, and evaluation. My main focus is an evaluation tool and evaluator microservice for assessing models such as NeMo, Megatron, and LLaMA.
One of my key contributions was developing ClusterForge, a modern platform built on NVIDIA's Kaizen Framework, Temporal, Go, Java, and Kubernetes. I was responsible for developing REST APIs, a workflow engine, and Ansible playbooks to manage BCP clusters on both NVIDIA's infrastructure and external data centers.
I also worked on a DevSecOps project, where I built a security service that helps developers identify and resolve security vulnerabilities before release. It covers container scanning, open-source dependency scanning, static code analysis, secret detection, and infrastructure-as-code security checks. Another significant project was AKUC, a non-disruptive Kubernetes upgrade controller designed to upgrade Kubernetes clusters without interrupting workloads. Additionally, I developed a managed CI/CD platform that handles integration, deployment, security, and testing of applications throughout their lifecycle. I also have experience developing command-line interfaces (CLIs) in several programming languages and implementing infrastructure as code.
Before joining NVIDIA, I spent 3 years at GS Lab, Pune, as a Cloud Orchestration Engineer, where I developed workflows for VM lifecycle management using Python, Django, OpenStack, and VMware vRA/vRO. Overall, I am passionate about software development and have a strong background in platform development, cloud orchestration, CI/CD, infrastructure as code, MLOps, and DevOps.
Stackforce AI infers this person is a Backend-heavy Fullstack Engineer specializing in MLOps and Cloud Infrastructure.
Location: Pune City, Maharashtra, India
Experience: 9 yrs 7 mos
Skills
- Cloud Computing
- Infrastructure Automation
- MLOps
- Machine Learning
- DevOps
- Cloud Orchestration
- Full Stack Development
Career Highlights
- 10× performance improvements in data processing.
- Expertise in building scalable MLOps platforms.
- Proven track record in CI/CD and DevSecOps.
Work Experience
NVIDIA
Senior System Software Engineer, AI (3 mos)
Senior System Software Engineer, LLM MLOps & Speech AI (2 yrs 8 mos)
System Software Engineer (3 yrs 7 mos)
GS Lab
Sr. Software Engineer, Cloud Orchestration (8 mos)
Software Engineer, Full Stack (2 yrs 5 mos)
Education
B.Tech at Shri Guru Gobind Singhji Institute of Engineering and Technology, Vishnupuri, Nanded
Diploma at PES Polytechnic, Chhatrapati Sambhajinagar (Aurangabad, MH)
SSC at ZPPS Gadiwat, Chhatrapati Sambhajinagar (Aurangabad, MH)