Lokesh Koshale — Co-Founder
I develop performance-critical software and algorithms. I am a GPU Optimization and Algorithms Engineer with over five years of industry experience at KLA, specializing in CUDA kernel design, GPU performance engineering, and AI/ML acceleration. My expertise lies in achieving significant performance gains for complex computational problems in computer vision, deep learning, and large-scale parallel algorithms, consistently exceeding the performance of state-of-the-art frameworks. My contributions to best-in-class GPU optimizations have been recognized by experts at NVIDIA. My technical foundation bridges traditional algorithms, machine learning, and LLM-based code generation. I am highly skilled in CUDA, Triton, C/C++, Python, TensorFlow, and PyTorch, with a focus on deep-level optimizations, including assembly-level PTX kernels and advanced distributed ML training techniques. As an engineer, I thrive on tackling challenging problems in HPC, parallel computing, and algorithm design to scale solutions across multi-node, multi-GPU heterogeneous systems.
Stackforce AI infers this person is a GPU Optimization and AI/ML Engineering expert in high-performance computing.
Location: Chennai, Tamil Nadu, India
Experience: 7 yrs 7 mos
Skills
- GPU Optimization
- Performance Engineering
Career Highlights
- Expert in GPU optimization and algorithm design.
- Proven track record in AI/ML acceleration.
- GPU optimization contributions recognized by experts at NVIDIA.
Work Experience
KLA
Algorithm Engineer (5 yrs 8 mos)
Intern Algorithm and AI (6 mos)
Intern Algorithm Engineering (2 mos)
eClerx
Intern Software Development (2 mos)
EdarLabs
Founder and CTO (1 yr 11 mos)
Machadalo
Android Developer (2 mos)
Education
Dual Degree (BTech + MTech) at Indian Institute of Technology, Madras
JEE coaching at ALLEN
Higher secondary school at Jawahar Navodaya Vidyalaya - JNV