Sunay Hegde

Co-Founder

Mumbai, Maharashtra, India0 mo experience

Key Highlights

  • Developed Air.rs for LLM inference on consumer GPUs.
  • Expert in low-level systems and performance optimization.
  • Open to collaborations in AI infrastructure.
Stackforce AI infers this person is a specialist in AI infrastructure with a focus on performance optimization.

Contact

About

I built Air.rs to run 70B models — which need ~35GB VRAM at Q4 — on an RTX 4090 with only 24GB, using a triple-buffered pipeline where the GPU executes layer N while PCIe uploads layer N+1. I gravitate toward low-level systems, algorithms, and anything at the intersection of performance and intelligence. The language matters less than the problem — though lately that language is Rust. I'm always open to: — Open source collaborations in AI infrastructure and systems — Conversations with researchers and engineers building real models — Opportunities where technical depth actually matters

Education

SVKM's NMIMS Mukesh Patel School of Technology Management & Engineering

Bachelor of Technology — Computer Science

Jan 2022Jan 2028