Aditya Kallappa

AI Researcher

Bangalore, Karnataka, India7 yrs 8 mos experience

Key Highlights

Expert in developing and fine-tuning Large Language Models.
Pioneered India's first foundational language model for 12 languages.
Led innovative projects in NLP and Computer Vision.

Stackforce AI infers this person is a Deep Learning and NLP specialist in the AI industry.

Contact

Skills

Core Skills

Natural Language Processing (nlp)Large Language Models (llm)Computer VisionDeep LearningMachine Learning

Other Skills

AlgorithmsAutomatic Speech RecognitionC (Programming Language)C++Data AnalysisDistributed TrainingEmbedded CJavaPython (Programming Language)SQLText-to-SpeechTransformer ModelsVoice Activity Detection

About

Innovative Deep Learning Researcher with hands-on experience in NLP, Computer Vision (CV), and Generative AI. Specialized in developing and fine-tuning Large Language Models (LLMs), custom tokenizers, and AI benchmarks for multilingual and multimodal applications. Skilled in pretraining, alignment, distributed training (DeepSpeed, FSDP), and efficiency optimization (PEFT, LoRA, Quantization). Passionate about advancing AI for Indic languages, optimizing LLM instruction following, reasoning, and alignment with human preferences to drive real-world impact.

Experience

7 yrs 8 mos

Total Experience

1 yr 7 mos

Average Tenure

1 yr 2 mos

Current Experience

Dp world

Group Senior Data Scientist

Mar 2025 – Present · 1 yr 2 mos · Bengaluru, Karnataka, India · Hybrid

Krutrim

2 roles

Research Engineer 2

Apr 2024 – Mar 2025 · 11 mos · Bengaluru, Karnataka, India · On-site

1. LLM Training

Data Scientist

Jul 2023 – Mar 2024 · 8 mos · Bengaluru, Karnataka, India · On-site

1. Foundational Large Language Model (LLM)
a. Integral member of the core team that built Krutrim - India's 1st Foundational Language Model that supports upto 12 Indian Languages
b. Achieved superior token to word ratio by training tokenizers for various Indian languages, surpassing existing systems' efficiency.
c. Implemented advanced techniques for cleaning and filtering Indian language data, surpassing the effectiveness of existing methods.
d. Established a robust pipeline for deduplication of Indian language data, optimizing processes by leveraging existing methods.
1. Live Streaming with Multilingual Translation and Lip Syncing:
a. Spearheaded video lip syncing part of live streaming project with translation and lip syncing in five different Indian languages, enhancing accessibility and engagement.
b. Incorporated Voice Activity Detection (VAD), Automatic Speech Recognition (ASR), Lip Sync, and Live Streaming using FFMPEG to create a live multilingual translation and lip-sync video, broadcasted seamlessly in real-time settings.

Transformer ModelsNatural Language Processing (NLP)Data AnalysisDistributed TrainingLarge Language Models (LLM)Text-to-Speech

Ola

Data Scientist

Jul 2022 – Jun 2023 · 11 mos · Bengaluru, Karnataka, India · On-site

1. Automated Detection of Battery Cell Defects (2 months):
a. Spearheaded the development of an automated system for detecting physical defects in battery cells.
b. Implemented classification as well as detection to enhance accuracy and efficiency.
c. Collaborated with Ola Electric Factory as well as Automation Team to ensure seamless integration.
2. Traffic Signal Detection Using YOLOV5 (2 months):
a. Achieved an impressive 85% accuracy in the detection of traffic signals using YOLOv5.
b. Collaborated closely with annotators to ensure high-quality labeled data for continuous training with daily annotated data to increase accuracy.
c. Established a workflow utilizing the DataLoop annotation tool to automatically retrieve newly annotated images and facilitate automated model training.
3. Automatic Speech Recognition (ASR) with OpenAI's Whisper Model (2 Months):
a. Managed the inferencing of OpenAI's Whisper Model across multiple GPUs for conducting transcriptions of call center calls.
b. Implemented optimizations to enhance speed and resource efficiency.
4. Language Model Fine-Tuning for FAQ Responses (1 months):
a. Fine-tuned open-source Language Models (LLMs) to provide accurate responses to frequently asked questions (FAQs).
b. Leveraged LoRA (Low Rank Approximation) technique to effectively fine-tune models, enhancing model performance and efficiency.
b. Leveraged PyTorch's distributed package (torchrun) for parallel training across multiple nodes.
5. Language Model Fine-Tuning for Chatbot in Customer Care (3 months):
a. Successfully finetuned open-source LLMs to serve as a chatbot in a customer care environment for specific problems.
b. Applied DeepSpeed's ZeRO (Zero Redundancy Optimizer) Technique(s) to enhance model training efficiency.
c. Structured multi-turn customer care data to ensure the model behaves like a ChatBot rather than a simple QA bot.