Siddharth Sahani

AI Researcher

Bengaluru, Karnataka, India10 yrs 1 mo experience
AI ML PractitionerHighly Stable

Key Highlights

  • Led ML strategy across merged business units
  • Built ML teams from ground up with 15+ engineers
  • Achieved significant revenue outcomes through ML
Stackforce AI infers this person is a Machine Learning expert in Adtech and Logistics with a strong focus on product development.

Contact

Skills

Core Skills

Machine LearningDeep LearningProduct DevelopmentMlopsData EngineeringMentoringProduct ManagementData ScienceReinforcement LearningRobotics

Other Skills

SaaS DevelopmentDistributed ComputingTransformersHigh Performance Computing (HPC)Model DevelopmentGenerative AI ToolsTransformer ModelsSQLApache KafkaApache SparkCatboostVowpal WabbitElasticsearchAirflowBig Data

About

AI/ML engineering leader with 10 years of experience building, shipping, and scaling production ML systems across adtech, logistics, legal-tech, and ed-tech. Currently leading ML strategy and cross-functional execution across three merged business units at a European DSP processing 50B+ daily bid requests under sub-15ms latency. Track record of building ML teams from the ground up (up to 15+ engineers), owning end-to-end product delivery from experimentation to production, and directly driving revenue outcomes. Deep hands-on expertise spanning real-time bidding, reinforcement learning, LLM/RAG systems, computer vision, and NLP paired with the business acumen to translate ML capabilities into measurable P&L impact.

Experience

10 yrs 1 mo
Total Experience
2 yrs 2 mos
Average Tenure
1 yr 4 mos
Current Experience

Appodeal, inc.

Principal Machine Learning Tech Lead

Jan 2025Present · 1 yr 4 mos · India · Remote

  • Stabilized the real-time Predictor system during a 10x load surge (30B→60B daily requests, 5x bidding strategies per campaign), reducing bid timeouts from 8% to 0.04% and enabling 200-model serving under 15ms latency
  • Designed cross-layer A/B testing framework with experiment isolation to eliminate model contamination across 1000 campaigns and 20 mobile objects, building the business case for coordinated experimentation infrastructure
  • Built MLflow-based model registry with CI/CD governance, moving from ad-hoc deployments to validated promotion pipelines with 2–3 minute deployment cycles
  • Developed bid automation system with lambda controllers for dynamic campaign optimization, addressing 76% campaign underperformance on ROAS targets
  • Optimized Bid-Shading training pipelines to process 1.5 trillion samples/day (10x increase) for pricing optimization across the exchange
  • Acting as the integration layer for models development and deployment between Data Science, Account Management, Bidder, DevOps, and Data Engineering teams across all three business units
SaaS DevelopmentDistributed ComputingTransformersHigh Performance Computing (HPC)Model DevelopmentGenerative AI Tools+54

Kayzen

2 roles

Senior Machine Learning Engineer

Promoted

Mar 2023Jan 2025 · 1 yr 10 mos

  • Led a team of 4 ML Engineers and 3 DevOps engineers; served as the critical integration point between DS, Platform Bidder, DevOps, and Data Engineering.
  • Shipped 12 advertiser-specific micro-models with robust pipelines, achieving 10% CPI improvement with a framework scalable to 100+ models without manual intervention
  • Drove 3x deployment throughput (9→28 models/year) through pipeline optimization and process improvements
  • Developed session depth, device fingerprinting, and budget pacing features powering new data-centric optimization strategies
Generative AI ToolsSQLHigh Performance Computing (HPC)Product DevelopmentTransformersAWS SageMaker+10

Machine Learning Engineer II

Mar 2021Feb 2023 · 1 yr 11 mos

  • Optimised training/shipping pipelines such that the number of ML model deployments grew 3-fold from 9 models in 2021 to 28 models in 2022, when our scale was growing at a tremendous pace and DS:MLE ratio was still 5:1
  • Re-organised & optimised numerous data pipelines and legacy codebase to fit the changing needs of the ML algorithms while adding along the way features like CPA-Retargeting optimisation models, enhancing A/B testing framework for flexible experimentation, progressing the CPI optimisation models, and bifurcating models by traffic type, segment etc
  • Worked on deploying 4 major models : User Counter, Taxonomy & App Categories, Campaign BusinessType, & User level feature model which cumulatively gave a bump of 25% in desired CPI & IPM metrics
  • Improved the resilience of the existing pipelines with enhanced monitoring, alerting and integrations which saved ETAs on Incidents detection and resolution 2x faster than back in early 2021.
Generative AI ToolsVowpal WabbitSQLMySQLHigh Performance Computing (HPC)Apache Kafka+12

Great learning

AI/ML Mentor

Apr 2020Mar 2021 · 11 mos · Remote

  • Mentored and taught over 250+ data science aspirants, under Machine Learning, Recommender Systems, Deep Learning & Natural Language processing course.
  • Mentored a batch of 15 Industry Leaders who were in their career transformation journey into AI. Guided them in enabling and identifying AI/ML opportunities in their current role & also helped them with an overall industry outlook.
Reinforcement LearningMachine LearningNatural Language ProcessingMLOpsDeep LearningMentoring

Shipmnts

4 roles

ML Lead & Product Owner

Feb 2020Feb 2021 · 1 yr · Ahmedabad, Gujarat, India

  • Led the remote product roll-out in Alpha stage in 15 countries during onset of Covid in March 2020, across the largest Shipping Line in the world, leading NVOCC in India, and top AirForwarder in India. Took charge of Customer Onboarding, User Acceptance Testing, Deployment strategies, training & documentation material culminating in 40 daily active customers within 4 months.
  • Designed a feedback loop mechanism for continuous training of ML models, which would capture the corrections made by the end-user and pass them as training data in a stratified manner. This enables the models to stay up to date and counter data drift in production
  • Worked on cutting edge DocumentAI solutions to utilise the multimodal transformer-based architectures to solve problems like key-value pair extraction and mapping with advanced architectures like LayoutLM, LayoutLMv2, DocFormer, etc.
Long Short-term Memory (LSTM)SQLPostgreSQLProduct DevelopmentTransformersTransformer Models+8

Senior Data Scientist & MLE

Promoted

Feb 2019Feb 2020 · 1 yr · Ahmedabad, Gujarat, India

  • Led the AI product roadmap, strategy, technical guidance for both Document Extraction product and the AI needs for Shipmnts Suite
  • Built an end-to-end solution using custom YOLOv3 inspired architecture for object detection and localisation along with an orientation detection model followed by a classifier pipeline to process information from complex non-traditional images (stamps, logos, etc).
  • Delivered an end to end product feature on document classification for limited data, this project included in-depth benchmarking of various ML and DL approaches for document classification and concluded with two scalable approaches, one is a custom CNN based architecture on the text and the other is a custom CNN based model which utilizes both texts as well as an image as inputs. This was deployed as a product where customers can seamlessly train models by just uploading the data.
  • Designed and implemented an end-to-end platform for page stream segmentation (pagination) based on a custom deep network that utilizes both texts as well as image features to compare if two consecutive pages are a part of the same document or not. The entire system which has this as one of the components is now under Patent review status.
SQLPostgreSQLProduct DevelopmentTransformersTransformer ModelsElasticsearch+7

Data Scientist & MLE

Promoted

Feb 2018Jan 2019 · 11 mos · Ahmedabad, Gujarat, India

  • Worked on Document Condition assessment from Image Perspective. Owned and delivered multiple Deep and Machine Learning models from inception to the Customer Feedback stage that involved semi-automated training of all Machine Learning models part of the solution
  • Lead team for creating internal ML framework for automating rudimentary ML/DL tasks for Non-ML personnel in the company
  • Lead Computer Vision team for our product to solve Document Processing problem statements from Computer Vision perspective. ie, Table extraction from documents - from Dataset collection to Experiment tracking using MLflow and DVC till deployable application using tfserve. I had worked on State of Art architectures like Yolo, EfficientDdet for detection and Segmentation use cases.
SQLPostgreSQLKerasProduct DevelopmentModel DevelopmentTransformer Models+9

Data Scientist

Jan 2017Jan 2018 · 1 yr · Ahmedabad, Gujarat, India

  • Being the first Data Scientist on board, I explored the numerous areas where repetitive processes were rampant with Forwarders & CHAs be it email, physical documents, or phone calls for quotations
  • Understood the pain points of Freight Forwarders, Clearing House Agents on the mundane yet important documentation work they have to do for creating Customs & Carrier Documentation
  • Built an NER (Named entity recognition) pipeline for information extraction from Emails to speed up the process of Shipment Creation
  • Performed EDA (Exploratory Data Analysis) on the Shipping documents to extract important insights using rule based techniques
SQLMySQLSeabornProduct DevelopmentDockerDeep Learning+6

Infocusp

Machine Learning Engineer

Jun 2016Jan 2017 · 7 mos · Ahmedabad Area, India

  • Worked on a project ContractSifter, a distributed web-application to extract information from legal documents, for our client LegalSifter.
  • My responsibilities included:
  • Pre-processing, cleansing, and verifying the integrity of data used for training.
  • Text Mining using unsupervised and semi-supervised models.
  • Optimizing classifiers (sifters) using machine learning and feature engineering.
  • Building a platform to enhance data-collection, annotation, and sifter development process.
  • Implementing multi-GPU training in TensorFlow in our project
Deeplearning4jLong Short-term Memory (LSTM)TkinterSQLProduct DevelopmentMachine Learning+8

Zaya learning labs

Machine Learning Researcher

Dec 2015May 2016 · 5 mos · Mumbai Area, India

  • Used Markov models for random text generation and applied them to build a novel, sensible question generator.
  • Built a back-propagation Neural Network for Upper and Lowercase alphabet recognizer, which gave percentage correctness of the hand-drawn letter
  • Developed an algorithm to determine a set of litmus questions that determine a child’s intellectual level in the Concepts from previous, before entering the next grade thus providing a customized child-centric blended learning.
  • Worked with POS tagger, Parse trees, Dependency trees using Stanford’s Core NLP package to build pragmatic question templates. Involved Answer word Classification, which was implemented through Decision Trees.
  • Dived into Topic Modelling, Latent Semantic Analysis, understood the beautiful Math behind it, implemented it with and without popular packages like Gensim.
Data Science

Indian institute of technology, madras

Undergraduate Research Fellow

May 2015May 2016 · 1 yr · Chennai Area, India

  • Implementing Bipolar Disorder of Basal Ganglia with Extended Reinforcement Learning Model
  • Designed and implemented the classical Bandit and Grids World problem under varying concentrations of Dopamine & Seretonin
  • Appreciated the above testbed scenarios in Value and Utility based Decision making
  • Continued the work by modifying the update equations in pursuit of finding a faster medication than the already existing exercises
Reinforcement LearningQ LearningNeuroscienceJavaStatistics

Srm team robocon

2 roles

Core Team Member

Mar 2014Aug 2014 · 5 mos · Chennai Area, India

  • Implemented navigation of omni-directional y-chassis robot that gave about 30% increase in navigation
  • Led the Recruitment Drive for Robocon 2015. Handled public relations, planned coding quizzes & conducted interviews to select 5 ace coders from 150 applicants.
  • On-boarded new hires and mentored them on Robocon 2014's problem statement.
  • Introduced Reinforcement learning as a training strategy and assisted in the initial design research and components procurement for Robocon 2015
Reinforcement LearningRoboticsArduino

Autonomous Robot Programmer

Mar 2013Feb 2014 · 11 mos · Chennai Area, India

  • Worked on Robocon 2013 contest's problem statement and ironed out bugs in the legacy code. (video linked)
  • Learnt to use IMU, Rotary encoders and LiDARs.
  • Optimized the hardware and software electronics of the 3 wheel pick and place robot with PS2 interface for manual driving.
  • Contributed to the functioning of two robots according to the problem statement of Robocon'14.
  • One was a manually controlled robot (Parent Robot) capable of picking and placing the autonomous robot (Child Robot) to specific places like see-saw, swing and pole walk. It was also capable of pushing one end of a see-saw & pushing the swing. It was driven on a 4WD Omni Wheel system and manipulated using pneumatic actuators. The Child Robot was capable of walking on a set of uniformly arranged poles using ultrasonic sensor based pole detection and pneumatically actuated grippers.
  • Won the award for "Best Economical" Robot out of 92 teams that participated in India
  • Won Best college club award at Aarush (SRM University tech fest) 2014's exhibition

Education

SRM IST Chennai

Bachelor of Technology (B.Tech.) — Computer Science and Engineering

Jan 2012Jan 2016

Omkarananda Sarawati Nilayam, Rishikesh

12th (Sr. Secondary) — Science

Jan 2011Jan 2012

Omkarananda Sarawati Nilayam, Rishikesh

10th (Secondary)

Jan 2009Jan 2010

Stackforce found 100+ more professionals with Machine Learning & Deep Learning

Explore similar profiles based on matching skills and experience