Abheesht Sharma

AI Researcher

Bengaluru, Karnataka, India5 yrs 8 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in building scalable ML solutions.
  • Top contributor to KerasNLP with significant impact.
  • Award-winning researcher in AI and ML.
Stackforce AI infers this person is a Machine Learning Engineer with expertise in SaaS and E-commerce.

Contact

Skills

Core Skills

Machine LearningDeep LearningData AnalysisNetworking

Other Skills

API DevelopmentAlgorithmsAnomaly DetectionApache SparkApplied SciencesArtificial Intelligence (AI)CC++Computer VisionData CurationData PipelinesFraud DetectionGraph Representation LearningICD CodingImage Classification

About

At Google, I am a member of the Keras/JAX team. I am currently building the following (open source is alive and kickin’): 1. KerasHub: A multi-backend, pre-trained modelling library, housing all popular LLMs, and efficient ways to train them; 2. KerasRS: A library for building Recommender Systems in Keras; 3. Tunix: A JAX-based LLM post-training library (RL, distillation, PEFT, etc.). Before joining Google, I was an Applied Scientist at Amazon Science, where I worked in the Traffic Quality team. My work involved devising ML techniques and algorithms and deploying them at scale to catch advertisement bots. Outside of work, I’m a voracious reader, gobbling up anything to do with fantasy or science fiction. Mostly, you’ll find me with a vacant expression on my face, concocting a novel plot, or singing melodies.

Experience

Google

Machine Learning Engineer

Jan 2025Present · 1 yr 2 mos · Bengaluru, Karnataka, India

  • Serving the ML community with Keras!
KerasMachine LearningDeep Learning

Amazon

2 roles

Applied Scientist

Dec 2023Jan 2025 · 1 yr 1 mo

  • Worked on building and productionising realtime ML solutions for e-commerce Ad Fraud Detection for billion-scale data on Sponsored Ads and Twitch.
  • Developed algorithms for fraud detection such as (a) GPT-style model which takes in user-level activity sequences, (b) R-GCN model to detect colluding entities committing fraud and (c) DNN-based algorithms which use activity-based features to catch click and impression bots.
Machine LearningAlgorithmsData Analysis

Research Scientist

Sep 2022Dec 2023 · 1 yr 3 mos

  • Built and deployed a DNN-based algorithm, which uses activity-based features (click frequency and velocity) and distils model scores into interpretable fraud entity lists to detect sophisticated click bots.
  • Collaborated on building a realtime hybrid system of heuristics and ML algorithms to detect sophisticated crawler bots using network request level features, leading to detection of crawler impressions. Paper accepted at KDD Workshop on AI-Enabled Cybersecurity Analytics.
Deep LearningFraud DetectionData AnalysisMachine Learning

Nutanix

2 roles

Member of Technical Staff-II

Jul 2022Sep 2022 · 2 mos

  • Built a networking performance testing library from scratch to compare different networking stacks.
  • Used by entire team to compare multiple stacks like Open vSwitch (OVS) and OVS-Data Plane Dev Kit (OVS-DPDK), and make important decisions like which stack to migrate to, etc.
NetworkingPerformance Testing

Intern, Member of Technical Staff

Aug 2021Jul 2022 · 11 mos

  • Implemented an end-to-end pipeline for fetching counters for Access Control Lists (ACLs) for telemetry purposes.
  • Designed the APIs, RPCs, and implemented the logic for fetching the counters by interfacing with the OVS and the Open Virtual Network (OVN).
API DevelopmentTelemetryNetworking

Google summer of code

Contributor | TensorFlow

May 2022Sep 2022 · 4 mos

  • Worked with the Keras team [Google] on KerasNLP, Keras’ native NLP offering and a multi-backend Keras, i.e., new Keras version able to run on top of PyTorch, TensorFlow, JAX.
  • Top contributor to KerasNLP, involved at every stage of development, and made important contributions – models like DistilBERT, BART, DeBERTa, Whisper, etc., efficient XLA-compatible text generation APIs, NLP metrics, LoRA and so on.
  • Invited to give technical talks on Keras (and Deep Learning, in general) at Google events around the country and outside. Received the Google AI/ML Community Award and the Google Open Source Peer Bonus Award.
  • Paper accepted at JMLR 2024 (Open Source track).
KerasNLPMachine Learning

Kerasnlp (tensorflow)

Contributor

Mar 2022Nov 2023 · 1 yr 8 mos · Remote

  • Worked with the Keras team [Google] on KerasNLP, Keras’ native NLP offering and a multi-backend Keras, i.e., new Keras version able to run on top of PyTorch, TensorFlow, JAX.
  • Top contributor to KerasNLP, involved at every stage of development, and made important contributions – models like DistilBERT, BART, DeBERTa, Whisper, etc., efficient XLA-compatible text generation APIs, NLP metrics, LoRA and so on.
  • Invited to give technical talks on Keras (and Deep Learning, in general) at Google events around the country and outside. Received the Google AI/ML Community Award and the Google Open Source Peer Bonus Award.
  • Paper accepted at JMLR 2024 (Open Source track).
KerasNLPDeep LearningMachine Learning

Carnegie mellon university

Research Collaborator

Jul 2021Sep 2022 · 1 yr 2 mos · Remote · Remote

  • Built an ICD coding framework which provides a streamlined pipeline for preprocessing, training, and evaluation for the automatic ICD coding task, including an interactive demo and explanation of outputs using popular interpretability methods.
  • Paper accepted at EMNLP 2022.
ICD CodingMachine Learning

Appcair

Research Collaborator

Mar 2021May 2022 · 1 yr 2 mos · Goa, India · Remote

  • Worked on infusing super-pixel-based knowledge (higher-order perceptual groups of pixels) into deep neural networks for the image classification task.
  • Used a CNN to deal with spatial information present in the image, and a Graph Attention Network (GAT) model to deal with relational superpixel information.
  • Won the best short paper award at ACM-SE 2022.
Image ClassificationDeep LearningMachine Learning

Laboratory for computational social systems, iiit-delhi

Research Collaborator

Jan 2021Sep 2021 · 8 mos · Delhi, India · Remote

  • First exposure to Vision+Language models.
  • Focused on building a dataset of political memes - built a local annotation tool, and curated memes data from Reddit.
  • Designed multimodal models (ViT + BERT, VLBERT, etc. as encoders) to locate political explanations of a given meme in a piece of text.
Deep LearningAnomaly DetectionMachine Learning

Indian institute of technology, madras

Research Intern

Apr 2020Nov 2020 · 7 mos

  • Developed a novel architecture dubbed "3DBLES‑UNet" which uses the concept of "Bilateral Grids" for estimation of depth maps with prominent, accentuated edges.
  • Paper accepted at IC3D 2020.

Birla institute of technology and science, pilani - goa campus

2 roles

Undergraduate Teaching Assistant (Operations Research)

Jan 2020May 2020 · 4 mos

Undergraduate Teaching Assistant (Mathematics-III)

Aug 2019Dec 2019 · 4 mos

Csir-ceeri

Summer Research Intern

May 2019Jul 2019 · 2 mos · Pilani, Rajasthan · On-site

  • Worked on a popular Deep Learning problem "Anomaly Detection in Videos of Daily Living using Deep Learning" and simulated state-of-the-art results.
  • Trained a GAN on dynamic images generated from videos from the URFD dataset (for fall detection).

Education

Birla Institute of Technology and Science, Pilani - Goa Campus

Bachelor's — Computer Science

Aug 2017Jul 2022

Birla Institute of Technology and Science, Pilani - Goa Campus

Master's — Mathematics

Aug 2017Jul 2022

Stackforce found 100+ more professionals with Machine Learning & Deep Learning

Explore similar profiles based on matching skills and experience