Sumit Sardana โ Software Engineer
๐ Building Scalable AI/ML Infrastructure | Architecting High-Performance AI/ML Observability Systems Iโm a Senior Software Engineer passionate about designing and building scalable AI/ML systems with a strong focus on observability, data pipelines, and performance optimization. I thrive at the intersection of engineering and product, driving technical innovation while ensuring real-world impact. What I Do ๐น AI/ML Observability & Monitoring โ Architected and implemented a scalable ML Monitoring platform, optimizing data transformation pipelines to enhance performance and cost efficiency. ๐น Scalable Data Pipelines โ Engineered high-performance ML Observability pipelines, achieving 4-5X speed & cost improvements in user data transformation. ๐น LLM Observability & Metadata Management โ Developed a robust metadata tracking layer to monitor LLM evaluation and operational status. ๐น Optimized Query Execution for ML Metrics โ Integrated core ML Observability metrics into a proprietary query execution engine, enhancing real-time monitoring capabilities. ๐น SQL for AI Monitoring โ Designed and implemented SQL-based solutions for efficient retrieval and analysis of observability data. ๐น Data Sketching & Performance Benchmarking โ Conducted in-depth benchmarking of quantile-based data sketches, comparing t-Digest and q-Digest for ML Observability. Analyzed algorithmic trade-offs, optimizing for accuracy, memory efficiency, and query performance in large-scale monitoring workloads. ๐น Schema Design & Storage Efficiency โ Created a normalized table schema for storing aggregated observability statistics at scale. I love solving high-impact engineering challenges that push the boundaries of AI/ML infrastructure. Whether it's designing efficient query execution, optimizing large-scale data pipelines, or driving product decisions, I am always focused on building for scale and efficiency.
Stackforce AI infers this person is a SaaS-focused engineer with expertise in AI/ML infrastructure and observability.
Location: San Francisco, California, United States
Experience: 8 yrs 1 mo
Skills
- Software Design
- Distributed Systems
- Large Language Models (llm)
- Data Ingestion
- Machine Learning
Career Highlights
- Architected scalable ML observability systems.
- Achieved 4-5X speed improvements in data transformation.
- Led development of enterprise-scale ML solutions.
Work Experience
Snowflake
Senior Software Engineer (11 mos)
Senior Software Engineer (1 yr 2 mos)
TruEra
Senior Software Engineer (9 mos)
Software Engineer (1 yr)
nference
Staff Engineer (10 mos)
Senior Software Engineer (6 mos)
Software Engineer (1 yr 11 mos)
Nutanix
Systems Reliability Engineer (7 mos)
Systems Reliability Engineer - Intern (5 mos)
MakeMyTrip
Software Engineer Intern - Gofro.com (1 mo)
ThinkSys Inc
Summer Intern (1 mo)
Education
Bachelor of Technology (B.Tech.) at Vellore Institute of Technology