Shounak Das

AI Researcher

Mumbai, Maharashtra, India1 yr 1 mo experience
AI EnabledAI ML Practitioner

Key Highlights

  • Developed advanced AI frameworks for real-world applications.
  • Proficient in multiple domains including NLP and Computer Vision.
  • Contributed to research with publications in premier venues.
Stackforce AI infers this person is a skilled AI and Machine Learning professional with a focus on Generative AI and NLP.

Contact

Skills

Core Skills

Large Language Models (llm)Generative Artificial IntelligenceNatural Language Processing (nlp)Optical Character Recognition (ocr)Topic ModelingGenerative AiRetrieval-augmented Generation (rag)Machine LearningGenerative Adversarial Networks (gans)

Other Skills

AlgorithmsAmazon Web Services (AWS)Analytical SkillsArtificial Intelligence (AI)Azure DatabricksC (Programming Language)C++CUDACascading Style Sheets (CSS)Computer ScienceComputer VisionControl and ComputingData AnalyticsData CleaningData Science

About

Hello, I'm Shounak Das, a Final-Year Undergraduate in Electrical Engineering at IIT Bombay, also pursuing a minor in AI, Machine Learning, and Data Science. I enjoy exploring how technology and intelligent systems can make our lives better. I've worked on many projects in areas like NLP, Generative AI, LLMs, and Computer Vision. I'm also proficient at DSA & programming and am skilled in Signal Processing and Communication Systems. During my internships, I gained hands-on experience solving real-world AI challenges: At Fujitsu Research, I developed a proactive RCA framework on ~100k-line Warrior logs and built the InstructRAG pipeline with Gemini, fine-tuning Mamba-2 with LoRA. At Intel, I built an OCR pipeline for Hindi, Telugu, and Kannada using ViT and IndicBERT for NIC, leveraged WhisperX for multilingual audio labeling, and optimized LLM inference with OpenVINO and IPEX. At Swiggy, I developed a BERTopic + LLM pipeline for trend detection and a spell-correction system for Instamart. Beyond academics, I’ve contributed to research in domain adaptation and medical image analysis, with publications in premier venues such as ICML, MIDL, ISBI, and ICPR. I’m excited to apply my skills to impactful projects and am currently looking for full-time roles in Machine Learning, Data Science, and Software Engineering starting in 2026. Let’s connect and explore how we can innovate together! Feel free to reach me at shounakd56@gmail.com

Experience

1 yr 1 mo
Total Experience
6 mos
Average Tenure
--
Current Experience

Fujitsu

AI Research Intern

May 2025Jul 2025 · 2 mos · On-site

  • Developed a proactive RCA framework on ~100k-line Warrior logs for next-step prediction & failure diagnosis
  • Built the InstructRAG pipeline using Gemini to synthesize QA data and fine-tuned Mamba-2 with LoRA
  • Overcame LLM context limits by using hypergraphs with Personalized PageRank and similarity compression
  • Achieved 42% BLEU and 4.4/5 LLM-as-a-Judge score via hypergraph embeddings and cross-attention fusion
Large Language Models (LLM)Retrieval-Augmented Generation (RAG)Generative Artificial Intelligence

Intel corporation

AI Solutions Engineering Intern

Jul 2024May 2025 · 10 mos

  • Pipelined an OCR system for Hindi, Telugu & Kannada using ViT to extract robust image features
  • Integrated IndicBERT for context-aware recognition, boosting regional digitization for NIC (Govt of India)
  • Utilized WhisperX for multilingual audio labeling, enabling accurate transcription alignment & timestamps
  • Optimized inference speed by 67% (LLaMA-3 8B) and 80% (Mistral 7B) using OpenVINO and IPEX
Large Language Models (LLM)Natural Language Processing (NLP)Optical Character Recognition (OCR)Performance Engineering

Swiggy

Data Science Intern

Jul 2024Aug 2024 · 1 mo

  • Developed a robust topic modeling framework using BERTopic and an Azure OpenAI LLM agent to predict emerging trends in events and items based on Q2C, enhancing trend prediction & analytics
  • Built a spell-error dataset from Instamart SQL database using edit distance, decompounding, and phonetics
  • Applied unigram and bigram probability models with fuzzy logic, achieving up to 83% spell correction accuracy
Topic ModelingLarge Language Models (LLM)Natural Language Processing (NLP)Generative AIAzure DatabricksSnowflake+1

Ibm

LLM Intern

May 2024Jun 2024 · 1 mo

  • Optimized vector databases (ChromaDB & Weaviate) for Retrieval-Augmented Generation (RAG) in Large Language Models (LLM), improving indexing efficiency & search accuracy.
  • Integrated TraceLoop & IBM Instana for observability of VectorDB searches & real-time performance analytics.
Large Language Models (LLM)Generative AIRetrieval-Augmented Generation (RAG)

Gmac intelligence

Machine Learning Intern

Dec 2023Jan 2024 · 1 mo · San Diego, California, United States · Remote

  • Actively participated in the prestigious MLCommons AlgoPerf Training Algorithms Benchmark Competition, a global event aimed at fostering innovation in machine learning algorithms
  • Leveraged 6 diverse datasets-Criteo 1TB-clickthrough rate prediction, FastMRI-reconstruction,ImageNet-image classification,LibriSpeech-speech recognition, OGBG-molecular property prediction, & WMT-translation
  • Worked on developing & optimizing a novel machine learning training algorithm under fixed hardware and lower-level software environments of 8 different workloads
Machine Learning

Murven design solutions

Generative AI Engineer

Dec 2022Apr 2023 · 4 mos

  • Implemented Generative AI models like Deforum Stable Diffusion & VQGAN for various applications, including text-to-image, image-to-image & image-to-animation generation.
  • Explored Variational Autoencoders (VAEs) and advanced flow-based models like RealNVP, to enhance the diversity & quality of generated content, leading to more realistic and creative outputs in various applications.
  • Collaborated with team to enhance diversity & quality of generated content giving more realistic, creative outputs.
  • Developed & deployed a prompt-based API backend mechanism using Amazon Web Services(AWS)
Generative AIGenerative Adversarial Networks (GANs)Stable DiffusionAmazon Web Services (AWS)

Education

Indian Institute of Technology, Bombay

Integrated Dual Degree (B.Tech + M.Tech)

Oct 2021May 2026

Indian Institute of Technology, Bombay

Minor Degree : Machine Learning — Artificial Intelligence and Data Science

Oct 2021May 2026

Bhartiya Vidya Bhavan-Delhi Kendra

CBSE Senior Secondary Examination (Class 12th)

Mar 2019Mar 2021

Bhartiya Vidya Bhavan-Delhi Kendra

CBSE Secondary School Examination (Class 10th)

Mar 2018Mar 2019

Stackforce found 100+ more professionals with Large Language Models (llm) & Generative Artificial Intelligence

Explore similar profiles based on matching skills and experience