Santosh Sawant

Lead ML Engineer

Bengaluru, Karnataka, India17 yrs 10 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in Large Language Models and Generative AI.
  • Proven track record in developing scalable ML systems.
  • Strong leadership in cross-functional AI project delivery.
Stackforce AI infers this person is a Generative AI and Machine Learning expert with a focus on Data Analytics and Telecommunications.

Contact

Skills

Core Skills

Large Language Models (llm)Deep LearningMlops

Other Skills

Agile MethodologiesAmazon Web Services (AWS)Apache FlinkApache KafkaApache SparkApache Spark StreamingC#C++Cloud ComputingComputer VisionDistributed SystemsHelm ChartsInternet Protocol Suite (TCP/IP)JavaJavaScript

About

LLM Architect learning to innovate, optimize, and scale the next generation of large language models.

Experience

Philips

Senior Solutions Architect, Generative AI

Sep 2024Present · 1 yr 6 mos · Bangalore Urban, Karnataka, India · Hybrid

Tredence inc.

Senior Machine Learning Architect

Sep 2022Sep 2024 · 2 yrs · Bengaluru, Karnataka, India

  • Led a cross-functional team to deliver a cutting-edge Generative AI platform and products in the Data Analytics, Healthcare and ESG domain.
  • Fine-tuning LLM with multi LoRA adaptor for domain specific task in Retail, CPG and Supply Chain domain
  • Developed distributed cloud GPU training approaches for LLMs models, including data distribution editing, data quality improvements, and representation learning with self-supervision.
  • Experience in reading papers and implementing algorithms described in papers to increase performance, quality, data management, and accuracy of AI systems.
  • Designed lean proofs of concepts (POC) to answer targeted business questions using Gen.AI.
  • Design, Developed and integrated various large-scale, distributed machine learning systems for production ready Gen.AI services.
Knowledge Graph-Based RecommendationLarge Language Models (LLM)PyTorchvLLMRecommender SystemsDeep Learning+1

Parallel wireless

Lead Machine Learning Engineer

Dec 2021Sep 2022 · 9 mos · Bengaluru, Karnataka, India

  • Develop end-to-end ml pipeline in hybrid cloud using kubeflow pipeline (model training), mlflow (experiment tracking and model registry), kserve (model serving), feast feature store and jenkins CICD.
  • Developed a key performance index forecasting system using LSTM with 70% accuracy; entailing reduced network downtime with improved log-collection triggering.
  • Defined and executed specific ML workflow, which includes data collection, sampling, model building and training, metrics definition and evaluation.
  • Develop a framework for experimental tracking using mlflow tracking and integrated its logs with AutoML for hyperparameter tuning.
  • Develop model monitoring framework using Evidently AI (data, model, target and custom drift) and raise alerts for decision on model retraining.
TensorFlowPyTorchKubernetesComputer VisionKubeflowMLflow+5

Ola (ani technologies pvt. ltd)

Principal Engineer

Dec 2015Dec 2021 · 6 yrs · Bengaluru, Karnataka, India

Apache Spark StreamingDeep Learning

Zynga

SDET - II

Jul 2011Jul 2015 · 4 yrs · Bangalore, India

Pengala

SDET - II

Jul 2010Jun 2011 · 11 mos · Bangalore

Emc

SDET - I

Nov 2007Jun 2010 · 2 yrs 7 mos · Bangalore

Education

Visvesvaraya Technological University

MTech — Computer Intergated Manufacturing (CIM)

Jan 2005Jan 2007

Visvesvaraya Technological University

BE — Industrial Production and Managemantal Eng

Jan 2003Jan 2005

Stackforce found 100+ more professionals with Large Language Models (llm) & Deep Learning

Explore similar profiles based on matching skills and experience