A

Ayush Kumar

Software Engineer

San Francisco, California, United States3 yrs 3 mos experience
AI Enabled

Key Highlights

  • Built automated root cause analysis framework improving failure triage accuracy.
  • Developed advanced log observability system reducing manual triage time significantly.
  • Contributed to India's first driverless car project focusing on path planning algorithms.
Stackforce AI infers this person is a SaaS and AI/ML specialist with a focus on distributed systems and cloud infrastructure.

Contact

Skills

Core Skills

Distributed SystemsLarge-scale Data ProcessingAi ObservabilityLog CompressionCloud Application DevelopmentDashboard DevelopmentPath PlanningRoute PlanningAlgorithm Development

Other Skills

Distributed DatabasesLog ObservabilitySearch Engine DevelopmentiOS DevelopmentConversational Agent DevelopmentShape RecognitionMathematical ModelingSparse Matrix Computation

Experience

3 yrs 3 mos
Total Experience
10 mos
Average Tenure
4 mos
Current Experience

Anyscale

Software Engineer

Feb 2026Present · 4 mos · San Francisco, California, United States

  • Ray Data
  • https://github.com/ray-project/ray/
Distributed SystemsDistributed DatabasesLarge-scale Data Processing

Machine learning department at cmu

Teaching Assistant

Aug 2025Dec 2025 · 4 mos · Pittsburgh, Pennsylvania, United States

  • 10-605/805: Machine Learning with Large Datasets, Prof. William Cohen
  • Led recitations, office hours, and course Q&A for a graduate (MS/PhD) class on distributed ML

Nutanix

Intern, Member of Technical Staff

May 2025Aug 2025 · 3 mos · San Francisco Bay Area

  • Software Engineering - AI Observability, Nutanix AI
  • Built an agentic framework for automated root cause analysis of failures involving Nutanix Cloud Infrastructure, improving failure triage accuracy from ~35% to 76%
  • Designed an end-to-end log observability system, leveraging Spark for telemetry ingestion, LangGraph for model/tool orchestration, and Ollama for efficient inference
  • Developed a distributed tracing module to localize root cause failures and summarize entity traces by analyzing error logs, cutting down manual triage time from 45-60 minutes to <1 minute
  • Built LiteLog (judged winners, 2025 Intern Hackathon), a log compression and search engine enabling advanced (timestamp/node/service/level-filtered), fast querying on compressed log indices, reducing storage requirements of failure triage by upto 93%

Carnegie mellon university

Collaborator @ Catalyst Research Group

Feb 2025Dec 2025 · 10 mos · Pittsburgh, Pennsylvania, United States

  • Contributing to Mirage, a tool that generates fast GPU kernels for PyTorch programs through superoptimization techniques
  • Working on program partitioning framework to split and merge operators in a computational graph

Sap

2 roles

Software Engineer Intern

Jan 2024Jul 2024 · 6 mos · Bengaluru, Karnataka, India

  • BTP SDK for iOS backend engineering team
  • Developed software tools for cloud application development on iOS and visionOS
  • Implemented support for a new development landscape (Edge), designed and implemented authentication flow using SAML for this landscape
  • Integrated SAP's Identity Authentication Service (IAS) with the SDK Assistant, allowing users to setup default authentication, single-sign on, and user management for their apps from an interactive wizard

Summer Intern

Jun 2023Aug 2023 · 2 mos · Bengaluru, Karnataka, India

  • Built an interactive dashboard to streamline internal communications and event management workflows
  • Worked on a context-aware conversational agent to answer queries about employee benefits, along with a vector database for knowledge search

Cistup (centre for infrastructure, sustainable transportation and urban planning) @ iisc

Research Intern

Dec 2022Feb 2023 · 2 mos · Bengaluru, Karnataka, India

  • Developing mathematical models and transit route planning algorithms to predict congestion on Bangalore's BMTC bus transportation network
  • Parallelized route assignment pipeline using OpenMP and attained significant (68x) runtime speedup
  • Refactored data structures to optimise memory and runtime performance of multi-criteria route planning algorithm (McRAPTOR)
  • Developed mapping and visualization module to plot network-wide congestion over time

Chennai mathematical institute

Research Intern

Feb 2022Jun 2022 · 4 mos

  • Working under the guidance of Prof. Mandayam Srivas on algorithms to prove permutation invariance in deep neural networks
  • Implemented efficient SVD algorithm for sparse matrices, ported solver code from Python to C++

Project manas

AI Subsystem Member

May 2021Jun 2023 · 2 yrs 1 mo · Udupi, Karnataka, India

  • Member of AI (Perception and Planning) department of Project MANAS: a student project team building India's first driverless car.
  • Worked on coverage path planning algorithms for an autonomous drone for AUVSI SUAS 2022
  • Developed a shape recognition model to classify aerial images of targets using invariant Hu moments

Education

Carnegie Mellon University

Master's in Computational Data Science

Aug 2024Dec 2025

Manipal Institute of Technology

Bachelor of Technology - BTech — Computer Engineering

Jan 2020Jan 2024

The Cathedral and John Connon School

ISC

Jan 2007Jan 2020

Stackforce found 100+ more professionals with Distributed Systems & Large-scale Data Processing

Explore similar profiles based on matching skills and experience