Aayush Devgan

AI Researcher

Pune, Maharashtra, India9 yrs experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in building scalable data platforms and pipelines.
  • Proven track record in AI-driven automation and ML systems.
  • Strong leadership in cross-functional data initiatives.
Stackforce AI infers this person is a Data & AI leader with expertise in Fintech and Healthcare analytics.

Contact

Skills

Core Skills

Artificial Intelligence (ai)Data PipelinesReliability EngineeringData EngineeringMachine LearningData Governance

Other Skills

AI Agent DevelopmentAgentic AI DevelopmentAgile Project ManagementAmazon RedshiftAmazon Web Services (AWS)AmundsenApache AirflowApache KafkaApache SparkAzure DatabricksCockroachDBData AnalyticsData MeshData ModelingData Warehousing

About

I’m a Data & AI leader with 9+ years of experience building scalable data platforms and data pipelines, along with production grade ML systems and AI driven automation across consumer tech, fintech, healthcare, and enterprise environments. My work spans real time and batch data pipelines, large scale data platforms, ML systems, and reliability engineering. I’ve led initiatives ranging from building foundational data infrastructure, governance, and high throughput pipelines to developing ML and LLM powered systems for analytics, intelligence, and operational automation consistently reducing operational overhead, improving system reliability, and increasing engineering productivity at scale. I’ve built and scaled data systems from scratch at startups, delivered modern data platforms for global clients, and owned end to end execution including platform cost optimization, platform reliability, and productization. This breadth allows me to turn ambiguous problems into clear, outcome driven roadmaps with real business impact. Currently at DoorDash, I focus on applying AI to improve data platform reliability, operational excellence, production on call support, and platform cost efficiency at scale helping reduce MTTR, operational toil, and infrastructure overhead across critical production systems. Previously at Relcu, I led end to end data and ML systems spanning analytics platforms, intelligence features, and data monetization, while also driving platform resource optimization to ensure scalable, sustainable growth. If you’d like to get in touch feel free to reach out on aayushdevgan@gmail.com or DM me on LinkedIn.

Experience

Doordash

Senior AI Engineer

May 2025Present · 10 mos

  • Part of Data Reliability and Engineering Excellence team.
  • Built AI powered on call support systems using logs, metrics, traces, and lineage to automate debugging, RCA, and guided troubleshooting, driving ~60% improvements in MTTR/MTTD across production platform incidents.
  • Launched AI generated incident handovers and post mortems, reducing documentation effort by ~80% with ~90% accuracy, saving ~30 minutes on average per incident and improving on call transition quality across 9+ global teams.
  • Led the production rollout of an AI first support agent leveraging LLMOps platform providing Pipeline run status, SLA validation, observability insights, and runbook guidance, achieving ~50 MAU across engineering teams.
  • Supported a global shift to an India primary on call model by standardizing cross region handoffs and escalation, resulting in a ~70% reduction in US off hours interrupts and increasing India team ownership from 2.5% to 77%, significantly lowering on call burden and improving response quality.
  • Contributed to an initiative that reduced US off hours interrupts by ~70% and increased India team ownership from ~2.5% to ~80%, significantly lowering on call burden and improving response quality.
  • Drove a platform cost optimization initiative across Snowflake and data lake infrastructure, targeting ~$1.5M+ in annualized savings by partnering with vendors and collaborating cross-functionally across engineering teams.
Agentic AI DevelopmentArtificial Intelligence (AI)Large Language Models (LLM)Large Language Model Operations (LLMOps)Retrieval-Augmented Generation (RAG)Prompt Engineering+12

Relcu

Lead Data and ML Engineer

Jun 2023May 2025 · 1 yr 11 mos

  • First hire in the data team.
  • Designed and launched Relcu’s data monetization strategy (pricing, packaging, customer validation) in close partnership with the CEO, CPO, CDO, converting data from a support function into a recurring revenue stream of ~$5K per client per month.
  • Scaled Relcu’s Data & ML platform from 0 → 30+ paying customers, owning platform reliability, performance, and cost efficiency while strengthening product market fit in the financial services CRM space.
  • Built and led a high ownership Data team, establishing delivery standards, architectural decision clarity, and coaching practices that supported rapid platform scaling.
  • Introduced AI / ML powered conversational intelligence (transcription, summarization, diarization, sentiment analysis, keyword extraction) and multi channel engagement scoring across calls, text, and email, saving clients ~3,000 hours of manual work per month and improving sales effectiveness.
  • Engineered data ingestion and export frameworks integrating 10+ third-party sources (including Datadog and Twilio), enabling seamless data consumption for clients and empowering internal teams with richer operational and customer insights.
  • Worked closely with product and design to surface ML and data insights directly in the application UI, and built no code, data generation, and masking tools that accelerated client onboarding, reduced engineering dependency, and improved testing at scale.
  • Owned monthly cost tracking, budgeting, and optimization for Data & ML infrastructure, ensuring efficient resource usage as the platform scaled in customers and data volume.
Data EngineeringMLOpsSentiment AnalysisProduct ManagementData AnalyticsData Governance+10

Thoughtworks

Senior Data Engineer

Nov 2021Jun 2023 · 1 yr 7 mos

  • Worked with UK and South East Asia Clients providing premium tech consultancy.
  • Advised global enterprise clients across the UK, Japan, Singapore, and Germany, translating complex legacy constraints into practical data platform, governance, and cloud modernization decisions.
  • Worked on modernization of a data mesh platform on AWS for an 80 year old enterprise, processing 2000+ TB of data and supporting 10 teams across 3 continents, enabling faster, evidence based marketing decision making through automated quality checks, cataloging, and governance.
  • Designed and deployed a cloud native data platform on Azure using Databricks, Data Factory, and Power BI, while architecting end to end infrastructure with Terraform (CI/CD, high availability, monitoring, release management, migrations), improving analytics reliability, performance, and operational efficiency.
  • Implemented full-stack observability using OpenTelemetry, enabling centralized logging, metrics, and distributed tracing to improve production visibility and incident diagnosis.
  • Mentored interns and early-career engineers and led workshops on data privacy, data mesh, and governance, strengthening engineering practices across teams.
Data MeshData GovernanceAzure DatabricksAmazon Web Services (AWS)Google Cloud Platform (GCP)Microsoft Azure+11

Fi

Senior Data Engineer

Jul 2020Nov 2021 · 1 yr 4 mos

  • Second hire in the data team.
  • Built and scaled a near real time events pipeline handling 100M+ events per day with 99.9% data accuracy, powering user funnels and analytics in near real time while meeting strict SLOs on an open source stack.
  • Designed and scaled batch data pipelines supporting 2,000+ entities and vendors, enabling reliable metrics, dashboards, and reporting across business and operations teams.
  • Engineered a SQL query engine framework over the data lake, enabling flexible, performant analytical querying for data analysts and data scientists.
  • Built a comprehensive data validation framework and implemented a schema registry to enforce event contracts, manage schema evolution, and significantly reduce data quality regressions in production.
  • Improved data discoverability and literacy by launching a data discovery platform using Amundsen, increasing visibility and adoption of data assets across the organization.
  • Partnered with data scientists to develop PySpark based ML pipelines, providing scalable data access patterns and infrastructure support for multiple ML use cases.
  • Built end to end observability and deployment automation for data and ML platforms (alerting, monitoring, logging, CI/CD), improving operational reliability and reducing manual intervention during releases.
Data GovernanceData AnalyticsDevOpsData ModelingData PipelinesData Warehousing+15

Urban company

Senior Software Development Engineer

Apr 2019Jun 2020 · 1 yr 2 mos

  • First hire in the data platform team.
  • Built Urban Company’s Customer Data Platform from scratch, ingesting transactional, clickstream, and server events to support real time and batch analytics for data, product, business, operations, sales, and marketing teams.
  • Designed and launched the company’s first Master Data Management platform, aggregating data from multiple vendors and internal systems with an intuitive onboarding UI, establishing a single source of truth across the organization.
  • Developed event ingestion microservices integrating sources such as Appsflyer and Google Ads, enabling reliable attribution, funnel analysis, and growth analytics at scale.
  • Owned and optimized a 200+ TB data warehouse, improving query performance, reliability, and cost efficiency as data volume and usage grew.
  • Led major platform modernization initiatives, migrating the data platform from Python → Spark, the data warehouse from Redshift → Snowflake, and BI tooling from Tableau → Looker, significantly improving scalability, performance, and analyst productivity.
Data GovernanceData AnalyticsScalaData ModelingData PipelinesAmazon Redshift+13

Innovaccer

Member of Technical Staff

Jan 2017Mar 2019 · 2 yrs 2 mos

  • Early hire in the data product team.
  • Early hire in the data product team, building resilient, large scale healthcare analytics systems for US healthcare clients using HBase, HDFS, Spark, and Scala, supporting mission critical, data driven products.
  • Developed and scaled Patient 360, integrating patient data from multiple heterogeneous clinical and operational sources into a unified patient view used across analytics and product workflows.
  • Designed and optimized a rule based record linkage and deduplication framework, enabling accurate patient identity resolution and providing a single enterprise wide identifier for downstream systems.
  • Built RESTful APIs using Scala Play Framework to verify, create, and manage patient records, streamlining data access and improving reliability of patient data ingestion workflows.
  • Implemented a custom Spark job executor to schedule and manage batch processing workloads, improving processing reliability and operational control for large scale healthcare datasets.
Data GovernanceData AnalyticsScalaData PipelinesAgile Project ManagementData Warehousing+9

Education

The University of Texas at Austin

Master of Science - MS — Computer Science

Vellore Institute of Technology

Bachelor of Technology - BTech — Computer Science

Dr. Virendra Swarup Public School

High School and Secondary School — Computer Science

Stackforce found 100+ more professionals with Artificial Intelligence (ai) & Data Pipelines

Explore similar profiles based on matching skills and experience