Sagar Sarkale

Co-Founder

Bengaluru, Karnataka, India8 yrs 3 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Founder of Quickcall.dev, driving AI innovations.
  • Designed advanced content moderation systems with high accuracy.
  • Expert in building scalable AI solutions across multiple industries.
Stackforce AI infers this person is a highly skilled AI architect specializing in scalable machine learning solutions.

Contact

Skills

Core Skills

Generative AiLarge Language Models (llm)Recommender SystemsMlopsNatural Language Processing (nlp)Data Science

Other Skills

Document AIComputer VisionAutoencodersPyTorchTensorFlowNumPyPandas (Software)Moderation SystemC++ProgrammingCBootstrapJavaDatabasesLinux

About

Building in the AI chaos.

Experience

8 yrs 3 mos
Total Experience
1 yr 7 mos
Average Tenure
5 mos
Current Experience

Quickcall dev

Founder

Dec 2025Present · 5 mos · Remote

  • • Agentic engineering and productivity for developers

Medpiper | smallstepai | people+ai | yral

Artificial Intelligence Consultant

Dec 2023Nov 2025 · 1 yr 11 mos · Bengaluru, Karnataka, India · Hybrid

  • 𝗬𝗥𝗔𝗟 · AI Consultant
  • Jan 2025 · Nov 2025 · 11 mos · Remote
  • Building scalable GenAI stack and ML systems for feed
  • Designed RAG-based content moderation system achieving 86.8% accuracy (26% improvement) using Phi-3.5 4B model with dynamic prompt generation and similarity search
  • Optimized LLM inference using SGLang with KV caching, achieving 40-90% cache hit rates and 100-200 tokens/sec throughput on T4 GPU (16GB VRAM)
  • Deployed cost-efficient inference infrastructure: T4x2 (19,440 req/USD) vs A100 (24,444 req/hr) with comprehensive hardware benchmarking
  • Built scalable user clustering recommendation system reducing computational complexity from 10^12 to 10^6 user-item pairs through strategic segmentation
  • Implemented multi-stage candidate generation using Apriori algorithms, IoU scoring, and embedding-based similarity for personalized content delivery
  • Architected end-to-end ML pipeline with Cloud Composer (Airflow), GCP BigQuery, Redis caching, and real-time event processing for million scale interactions
  • Self-hosted talking head video generation system, supporting image-to-video and video-to-video avatar synthesis
  • 𝘀𝗺𝗮𝗹𝗹𝘀𝘁𝗲𝗽.𝗮𝗶 · Founder
  • Dec 2023 · May 2025 · 1 yr 6 mos · Remote
  • Built Misal · Marathi LLM:
  • Pretrained and finetuned Misal 7Bn, 1Bn parameter models with custom tokenizer
  • Custom LLM evaluation for regional language
  • Impact : Outperformed ChatGPT 3.5 in reading comprehension
  • 𝗣𝗲𝗼𝗽𝗹𝗲+𝗮𝗶 · AI Consultant
  • Nov 2024 · Dec 2024 · 2 mos · Bengaluru · On-site
  • Multilingual LLM · Evals:
  • Conducted evaluation of 15+ Indic LLM benchmarks, mapping coverage gaps across 22 Indian languages
  • Authored strategic roadmap for 10 trillion token collection across Indian languages
  • 𝗠𝗲𝗱𝗣𝗶𝗽𝗲𝗿 𝗧𝗲𝗰𝗵𝗻𝗼𝗹𝗼𝗴𝗶𝗲𝘀 · AI Consultant
  • Jan 2024 · Jun 2024 · 6 mos · Remote
  • Built health records extraction platform with sub-second batch latency
  • Impact : Reduced report processing time from 10 min to 1 min (10x gains)
Generative AILarge Language Models (LLM)MLOpsRecommender SystemsDocument AI

Tekion corp

Data Scientist

Aug 2022Jul 2023 · 11 mos · Bengaluru, Karnataka, India

  • Document AI - Table Extraction
  • Developed a robust solution for extracting rows and columns from tables in images
  • Trained a mask-RCNN based model to identify the structure of table components
  • Reconstruction of table components for consumption
  • Automobile Service Recommendation
  • Conducted comprehensive EDA on seasonal trends in services taken
  • Discovered key associations between different services taken
  • Generated recommendations for individual vehicles based on their unique service histories.
  • Impact : Significant increase in service add to cart and service taken rates
Natural Language Processing (NLP)Data ScienceRecommender SystemsComputer Vision

Pratilipi

2 roles

Data Scientist

Mar 2021Aug 2022 · 1 yr 5 mos · Bengaluru, Karnataka, India

  • Modelling user item interactions using autoencoders
  • Leveraging bottle neck vectors as embeddings
  • Creating a querying model to fetch similar interactions
  • Built a collaborative filtering model
  • Impact : High impact in reads and improvements in monetization observed
  • Creating hooks for increasing interaction at the end of each content
  • Leveraging Author similarity and common interactions of users
  • Generating relevant author embeddings to capture category information
  • Impact : 2x growth in reads after a user completed a read
  • Next authors to follow model
  • Applied clustering and probabilistic approach to compute “follow author” recommendations
  • Impact : 20% increase in follow action from author profile page
  • Category personalisation "For you" section
  • Captured category interests and subcategory interests of users
  • Personalised various category combinations
  • Impact : Multi front impact on top line, monetisation and author follows observed
  • Conducted multiple experiments in "For you" section of the app
  • Tested multiple hypothesis to increase reads
  • Conducted analysis to get insights of user behaviour across multiple geographies
Natural Language Processing (NLP)AutoencodersPyTorchTensorFlowNumPyPandas (Software)+1

Data Science Intern

Dec 2020Feb 2021 · 2 mos · Bengaluru, Karnataka, India

Parva

Machine Learning Intern

Jun 2020Sep 2020 · 3 mos · India

Manastu space

Machine Learning Research Engineer

Dec 2018Apr 2019 · 4 mos · Work from home

  • To develop Machine learning tools for algorithmic trading.

Expertshub

Summer Intern

Jun 2018Jun 2018 · 0 mo · Pune

  • Internship organised by Expertshub.
  • Various machine learning algorithms hands on experience. Deep learning
  • model implemented using tensorflow API on given project.

Dr.babasaheb ambedkar technological university, lonere - raigad

CS Graduate

Jun 2016Jan 2020 · 3 yrs 7 mos · Lonere

Education

Dr. Babasaheb Ambedkar Technological University

Bachelor of Technology - BTech — Computer Engineering

Jan 2016Jan 2020

Stackforce found 100+ more professionals with Generative Ai & Large Language Models (llm)

Explore similar profiles based on matching skills and experience