Sushil Dubey

AI Researcher

Bengaluru, Karnataka, India · 9 yrs 11 mos experience
Most Likely To Switch · AI ML Practitioner

Key Highlights

  • 8+ years in AI software engineering.
  • Expert in optimizing AI frameworks and models.
  • Led significant MLPerf submissions for Intel.
Stackforce AI infers this person is a highly skilled AI Software Engineer with expertise in deep learning and compiler optimization.

Skills

Core Skills

AI Software Development, Deep Learning, Compiler Development, Software Engineering, GPU Programming, Software Development

Other Skills

Large Language Models (LLM), PyTorch, Performance Optimization, C++, Compiler Optimization, CUDA, Generative AI, Data Structures, Algorithms, Research, Programming, Debugging, Java, C, Linux

About

I’m an AI Software Engineer with 8+ years of experience building AI frameworks, full-stack deep learning systems, and performance-optimized AI training and inference. Over the past five years at Intel (Habana Labs), I’ve contributed across the full AI software stack: developing PyTorch device plugins, writing deep learning kernels (GEMM, convolution, attention, MoE), enabling large-scale distributed training, and improving end-to-end model performance on Intel Gaudi hardware. I’ve played a key role in enabling and optimizing the open-source LLaMA, DeepSeek, and Stable Diffusion models on Intel Gaudi using 3D parallelism (tensor, pipeline, and data) together with expert parallelism in Megatron-LM. I also led four MLPerf training and inference submissions on Intel AI accelerators, delivering performance improvements.

Before Intel, I worked at Synopsys, contributing to HDL compiler development for FPGA synthesis in C++. Earlier, at TIFR (CERN CMS project), I developed GPU-accelerated parallel clustering and path-finding algorithms for high-throughput CMS detector data processing.

I hold a Master’s in Artificial Intelligence from the Indian Institute of Science (IISc), with strong mathematical grounding in generative models, LLMs, and diffusion models. I also bring strong problem-solving skills, solid fundamentals in data structures, algorithms, C++, Python, PyTorch internals, and AI frameworks, and experience working with large, production-grade codebases.

Experience

9 yrs 11 mos
Total Experience
3 yrs 3 mos
Average Tenure
6 yrs 2 mos
Current Experience

Intel Corporation

AI Software Solution Engineer

Mar 2020 - Present · 6 yrs 2 mos · Bengaluru

  • AI model enablement using the PyTorch framework, and performance optimization for Intel AI accelerators across training and inference workloads.
  • Kernel/operator optimization for LLM, diffusion, and RecSys models; profiling-driven throughput/latency improvements.
  • Distributed training and inference scaling using TP/PP/DP/EP parallelism with mixed-precision BF16/FP8.
  • Led 4 MLPerf Training and Inference submission cycles end-to-end for key benchmarks including Stable Diffusion and DLRM.
  • Leading SGLang Diffusion enablement on Intel GPU for image and video generation models.
Large Language Models (LLM), PyTorch, AI Software Development, Deep Learning

Synopsys Inc

R&D Engineer, II

Dec 2017 - Mar 2020 · 2 yrs 3 mos · Bangalore

  • Maintained and enhanced a legacy HDL compiler for an FPGA protocompiler suite in C++, ensuring stability and production readiness.
  • Implemented incremental Verilog feature enhancements, including interface-related language support across unified compiler components.
  • Drove root-cause analysis, bug fixes, and regression prevention for customer-reported issues in collaboration with QA and field teams.
C++, Compiler Optimization, Compiler Development, Software Engineering

Tata Institute of Fundamental Research, Mumbai

Research Fellow

Jun 2016 - Dec 2017 · 1 yr 6 mos · Mumbai, Maharashtra, India

  • Patatrack: developed a software package for accelerated pixel tracking at the CMS High-Level Trigger, using the GPU-enabled CMSSW framework, C++, and CUDA, at CERN.

Education

Indian Institute of Science (IISc)

Master of Technology - MTech — Artificial Intelligence

University of Mumbai

Bachelor's degree — Computer Engineering

Jan 2012 - Jan 2016
