A

Aasavari Kakne

AI Researcher

Noida, Uttar Pradesh, India4 yrs experience

Key Highlights

  • Achieved 51% TAT reduction for steel clients.
  • Ranked #2 in industry leaderboard for RecSys 2023.
  • Published research on privacy-preserving recommendation systems.
Stackforce AI infers this person is a Senior AI Research Engineer specializing in machine learning and data analysis for AI-driven solutions.

Contact

Skills

Core Skills

Machine LearningDeep LearningData Analysis

Other Skills

PythonLaTexgitTensorFlowPyTorchKerasC++Statistics

About

Hello 👋✨ I am a Senior AI Research Engineer at Attentive AI where I lead a team of talented AI Research Engineers and Subject Matter Experts to deliver AI-automated steel takeoffs. We recently achieved 51% TAT reduction for our steel clients and are baking many exciting features for year 2026. Stay tuned for more updates!✨ I worked as an AI engineer at Intel’s AISE applied ML team where I focus on LLMs (𝗟𝗟𝗠-𝗮𝘀-𝗮-𝗷𝘂𝗱𝗴𝗲 𝗳𝗼𝗿 𝗼𝗽𝗲𝗻𝘀𝗼𝘂𝗿𝗰𝗲 𝗮𝗻𝗱 𝗮𝗻𝗻𝗼𝘁𝗮𝘁𝗶𝗼𝗻 𝗳𝗿𝗲𝗲 𝗲𝘃𝗮𝗹𝘂𝗮𝘁𝗶𝗼𝗻 𝗼𝗳 𝗥𝗔𝗚 on Gaudi2). Prior to this, I worked on 'Preprocessing pipeline of large datasets for LLM pretraining comprising of Near deduplication, PII removal and Global deduplication'. We quantified efficacy of our Near Deduplication and benchmarked it at F-1 score of 0.99861. ✨ I also dabble in GNNs and am grateful to receive ‘𝗔𝗜𝗦𝗘 𝗗𝗲𝗽𝗮𝗿𝘁𝗺𝗲𝗻𝘁 𝗥𝗲𝗰𝗼𝗴𝗻𝗶𝘁𝗶𝗼𝗻 𝗔𝘄𝗮𝗿𝗱’ for 2024 for being a core contributor to our RecSys 2023 submission that ranked #2 𝗶n the industry leaderboard. 𝗢𝘂𝗿 𝘀𝗼𝗹𝘂𝘁𝗶𝗼𝗻 𝗶𝘀 𝗽𝘂𝗯𝗹𝗶𝘀𝗵𝗲𝗱 𝗶𝗻 𝗔𝗖𝗠 - ‘Graph Enhanced Feature Engineering for Privacy Preserving Recommendation Systems’ . 𝗪𝗲 𝗮𝗹𝘀𝗼 𝗳𝗶𝗹𝗲𝗱 𝗮 𝗨𝗦 𝗽𝗮𝘁𝗲𝗻𝘁 for ‘Construction of bipartite graphs from Privacy Preserved datasets’ as part of our RecSys 2023 submission. ✍️ Prior to Intel, I completed 𝗠𝗦 𝗮𝘁 𝗦𝘁𝗮𝗻𝗳𝗼𝗿𝗱 in Applied Math. I appreciated learning from classes like - 𝗡𝗟𝗣, 𝗡𝗟𝗨, 𝗖𝗩, 𝗚𝗡𝗡𝘀 𝗮𝗻𝗱 𝗠𝗟. Our blog ‘Fantastic graphs and how to complete them’ for GNN class was selected to be published by Prof. Jure Leskovec at Stanford. 𝗜 𝗮𝗹𝘀𝗼 𝗲𝗻𝗷𝗼𝘆𝗲𝗱 𝗧𝗔𝗶𝗻𝗴 𝗳𝗼𝗿 𝗰𝗹𝗮𝘀𝘀𝗲𝘀 - 𝗦𝗼𝗳𝘁𝘄𝗮𝗿𝗲 𝗘𝗻𝗴𝗶𝗻𝗲𝗲𝗿𝗶𝗻𝗴, 𝗔𝗱𝘃𝗮𝗻𝗰𝗲𝗱 𝗦𝗼𝗳𝘁𝘄𝗮𝗿𝗲 𝗘𝗻𝗴𝗶𝗻𝗲𝗲𝗿𝗶𝗻𝗴 𝗮𝗻𝗱 𝗡𝗟𝗨 𝗮𝘁 𝗦𝘁𝗮𝗻𝗳𝗼𝗿𝗱. In this time, I also got to work on - Weak Supervised networks, Semi Supervised networks, Multimodal representation learning, Transfer Learning techniques, Generative Adversarial Networks, and Knowledge Graph completion during my projects and internships. ✍️ Prior to Stanford, I completed 𝗕.𝗧𝗲𝗰𝗵 𝗶𝗻 𝗠𝗮𝘁𝗵 𝗮𝗻𝗱 𝗖𝗼𝗺𝗽𝘂𝘁𝗶𝗻𝗴 𝗮𝘁 𝗜𝗜𝗧 𝗗𝗲𝗹𝗵𝗶 where my favourite classes were probability, statistics and linear algebra. 𝗜 𝗹𝗼𝘃𝗲𝗱 𝘁𝗼 𝗧𝗔 𝗳𝗼𝗿 - 𝘀𝘁𝗮𝘁𝗶𝘀𝘁𝗶𝗰𝗮𝗹 𝗶𝗻𝗳𝗲𝗿𝗲𝗻𝗰𝗲 𝗮𝗻𝗱 𝗰𝗮𝗹𝗰𝘂𝗹𝘂𝘀 𝗰𝗹𝗮𝘀𝘀𝗲𝘀 𝗮𝘁 𝗜𝗜𝗧-𝗗. During my third year at IIT-D, I studied at University of Waterloo as part of IIT-D’s foreign exchange program. UW’s Differential equations and ‘learning how to learn’ classes were amazing.

Experience

4 yrs
Total Experience
1 yr 4 mos
Average Tenure
1 yr 4 mos
Current Experience

Attentive.ai

2 roles

Senior AI Research Engineer

Promoted

Dec 2025 – Present · 5 mos · Noida, Uttar Pradesh, India

  • Leading a team of AI engineers along with industry experts to generate high-quality steel takeoffs at a lightning speed
Machine LearningDeep LearningPythonData Analysis

Research Engineer II

Jan 2025 – Dec 2025 · 11 mos · Noida, Uttar Pradesh, India

  • my current project focuses on the visual understanding and reasoning capabilities given extra long context inputs for Large Language Models and large multimodal models.
Machine LearningDeep LearningPython

Intel corporation

AI software solutions engineer

Aug 2022 – Nov 2024 · 2 yrs 3 mos · Santa Clara County, California, United States · Hybrid

  • Accepted 'AISE Department Recognition Award' in 2024
  • ✨ 𝗔𝘂𝘁𝗼𝗘𝘃𝗮𝗹 - 𝗟𝗟𝗠-𝗮𝘀-𝗮-𝗷𝘂𝗱𝗴𝗲 𝗳𝗼𝗿 𝗼𝗽𝗲𝗻𝘀𝗼𝘂𝗿𝗰𝗲 & 𝗮𝗻𝗻𝗼𝘁𝗮𝘁𝗶𝗼𝗻-𝗳𝗿𝗲𝗲 𝗲𝘃𝗮𝗹𝘂𝗮𝘁𝗶𝗼𝗻 𝗼𝗳 𝗥𝗲𝘁𝗿𝗶𝗲𝘃𝗮𝗹 𝗔𝘂𝗴𝗺𝗲𝗻𝘁𝗲𝗱 𝗚𝗲𝗻𝗲𝗿𝗮𝘁𝗶𝗼𝗻 with metrics - factualness, answer correctness, answer relevance, context relevance & context recall.
  • Performed long-context inference using Mixtral-MoE 8*7b, Llama3 8b on Gaudi2 AI accelarator chip to provide rating and reasoning for each metric.
  • ✨ 𝗗𝗮𝗻𝗶𝘀𝗵 𝗡𝗲𝘄𝘀 𝗥𝗲𝗰𝗼𝗺𝗺𝗲𝗻𝗱𝗮𝘁𝗶𝗼𝗻 𝗳𝗼𝗿 𝗥𝗲𝗰𝗦𝘆𝘀 𝟮𝟬𝟮𝟰 - Implemented Pinterest’s TransAct model in FuxiCTR framework on Gaudi2.
  • ✨ 𝗟𝗟𝗠𝗮𝗮𝘀: worked on cpu-distributed data preprocessing pipeline for SlimPajama, Pile and Falcon Refined Web dataset
  • Comprises of our adaption of SlimPajama Near Deduplication, PII removal & Global Deduplication using Ray
  • Quantified efficacy of Near Deduplication to achieve 0.99861 F-1 score by implementing a TF-IDF based model
  • Measured data quality after preprocessing using multi-processed GPT-3 quality scorer & implemented Gopher Quality Filter
  • Proved that Near deduplication improved data quality for PILE (NIH)
  • Global deduplication did not affect data quality for SlimPJ (ArXiv, Wikipedia)
  • Significant levels of PII removal harmed data quality for SlimPajama (Stack Exchange, GitHub)
  • ✨ 𝗥𝗮𝗻𝗸𝗲𝗱 #𝟮 𝗥𝗲𝗰𝗦𝘆𝘀 𝟮𝟬𝟮𝟯 challenge organized by ShareChat - 𝗽𝗮𝗽𝗲𝗿 𝗮𝗰𝗰𝗲𝗽𝘁𝗲𝗱 𝘁𝗼 𝗔𝗖𝗠 'Graph Enhanced Feature Engineering for Privacy Preserving recommendation systems.
  • 𝗨𝗦 𝗽𝗮𝘁𝗲𝗻𝘁 𝗳𝗶𝗹𝗲𝗱 for - Construction of bipartite graphs from Privacy Preserved datasets.
  • Stabilized training of supervised GNN while disabling message passing from test edges to achieve indepedent evaluation.
  • ✨ 𝗔𝗰𝗵𝗶𝗲𝘃𝗲𝗱 𝟬.𝟵𝟰 𝗔𝗨𝗖𝗣𝗥 (𝘀𝘁𝗮𝘁𝗲-𝗼𝗳-𝘁𝗵𝗲-𝗮𝗿𝘁) 𝗳𝗼𝗿 𝗚𝗡𝗡-𝗲𝗻𝗵𝗮𝗻𝗰𝗲𝗱 𝗖𝗿𝗲𝗱𝗶𝘁 𝗖𝗮𝗿𝗱 𝗙𝗿𝗮𝘂𝗱 𝗗𝗲𝘁𝗲𝗰𝘁𝗶𝗼𝗻 usecase for TabFormer dataset.
Machine LearningDeep LearningPythonData Analysis

Stanford university

3 roles

Teaching Assistant, Natural Language Understanding

Mar 2022 – Sep 2022 · 6 mos

  • Assisting in teaching of Lexical semantics, distributed representations of meaning, relation extraction, semantic parsing, sentiment analysis, dialogue agents

Teaching Assistant, Advanced Software Development

Jan 2022 – Mar 2022 · 2 mos

  • Teaching how to write highly performant, scalable and sophisticated software systems in C++

Teaching Assistant, Software Development

Sep 2021 – Dec 2021 · 3 mos

  • Teaching how to design and implement efficient, readable, and reusable software systems

Sprinklr

Machine Learning Intern

Mar 2021 – Aug 2021 · 5 mos · New Delhi, Delhi, India

  • To generate synthetic dataset for logo detection model, I implemented state-of-the-art HIC-GAN model. Automatically scaled, rotated and placed the logo foreground on the background image.
Machine LearningPython

Nptel

Teaching Assistant

Aug 2018 – Sep 2018 · 1 mo · New Delhi, Delhi, India

  • Designed exams on Order Statistics, Estimators, Hypothesis Testing and Confidence Intervals

Indian institute of technology, delhi

Teaching Assistant, freshmen Calculus

Jul 2018 – Nov 2018 · 4 mos · New Delhi, Delhi, India

Nobroker.com

Automatic Description Generation | Data Science Intern

May 2018 – Jul 2018 · 2 mos · Bengaluru Area, India

  • Automatically generated human-like, information-rich descriptions for real estate property profiles.
  • Incorporated SEO, accelerated the process 200x & saved 100+ human hours every day. Model can be easily extended to different languages, cities and platforms.

Education

Stanford University

Master of Science - MS — Applied Mathematics

Jan 2019 – Jan 2022

Indian Institute of Technology, Delhi

Bachelor of Technology - BTech — Mathematics and Computing

Jan 2015 – Jan 2019

University of Waterloo

Exchange student — Mathematics

Jan 2017 – Jan 2017

Stackforce found 100+ more professionals with Machine Learning & Deep Learning

Explore similar profiles based on matching skills and experience