Sanjib Khetan

Data Scientist

Bengaluru, Karnataka, India7 yrs 2 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in Explainable AI and deep learning techniques.
  • Developed innovative AI solutions for diverse industries.
  • Strong background in product development and data science.
Stackforce AI infers this person is a Data Scientist specializing in AI and Machine Learning across various sectors.

Contact

Skills

Core Skills

Deep LearningRecommender SystemsPredictive AnalyticsNatural Language GenerationComputer VisionNatural Language ProcessingMachine Learning

Other Skills

Large Language Models (LLM)AsposeWord EmbeddingsCluster AnalysisPandas (Software)Apache SparkPredictive ModelingObject-Oriented Programming (OOP)Scikit-LearnData StructuresRandom ForestArtificial Neural NetworksSentiment AnalysisNamed Entity Recognition (NER)Neural Networks

About

πŸ‘‹πŸ½ Hi, I am Sanjib πŸ“š I have total 8 years of experience in solving complex problem using AI and deep learning techniques. πŸ“š I have 4 years experience in Research and Development team of Wipro and Sony India Software Centre which allows me to be well versed with a breadth of Data Science concepts and pushed me to grab the depth of each algorithms. I have designed and built the Explainable AI platform to explain all the ML/DL black box models. I have also published a paper in ADCOM conference on Explainable AI. πŸ“š I have also worked for sometime in EdTech domain to have the proper understanding of ML/DL product lifestyle from the data gathering to entire product deployment and functioning. I have worked on Image/Text Search and Recommendation Engine, Content Moderation Engine for the organization. πŸ“š I have also worked in Product Development, where I built products by leveraging the AI capabilities of LLMs to solve business problems. For Ex: I developed solutions for Business Summary Generation, PowerPoint Generation based on User Prompts, and Named Entity Recognition. I have also Fine-Tuned existing Open-Source LLM models on custom datasets, which involved everything from Data Creation to Model Fine-tuning. πŸ’» As for my future, I hope to lead entire product development cycle and want to become an AI Engineer with a diverse team where I can use my knowledge of tech, business, psychology and logical thinking to solve user problems. πŸ” In my free time, you would find me at coffee shop discussing about tech or different business strategy with use cases with same minded people. I love writing poems and stories, exploring diverse areas. I also love to teach/guide people, students in my free time. πŸ’ͺ🏽 My strengths are Logic Building, Mathematics, Problem Solving, Data Science, AI Model Building. πŸ‘¨β€πŸ’» π—§π—˜π—–π—› π—¦π—§π—”π—–π—ž 𝐀𝐒 πƒπžπ―πžπ₯𝐨𝐩𝐦𝐞𝐧𝐭: ➀ Computer Vision: Image Classification, Object Detection, Segmentation, Tracking, GAN, CNN, Deep Neural Networks, Residual Networks, YOLO Networks. ➀ NLP and ML Model: LLM, Transformer, Regression, Clustering, Machine Learning Algorithms, Word Embeddings, Transformers, Random Forest and Trees, Probability Methods. ➀ Libraries: Tensorflow 2.0, Pytorch, Keras, OpenCV, PIL, BERT, Spacy. ➀ Use Cases: Search and Recommendation Engine, Content Moderation, Model Explanation, Object Tracking, Detection, Business Summary Generation, Named Entity Recognition. π–πžπ› πƒπžπ―πžπ₯𝐨𝐩𝐦𝐞𝐧𝐭 ➀ SQL - MySQL, NoSQL, MongoDB ➀ Backend Development – Python3, Flask

Experience

7 yrs 2 mos
Total Experience
1 yr 5 mos
Average Tenure
--
Current Experience

Prezent

Senior Data Scientist

May 2023 – Oct 2024 Β· 1 yr 5 mos Β· Bengaluru, Karnataka, India Β· Remote

  • Entire Search Platform:
  • Build the Product Search Platform from Semantic Search to a Hybrid Search having a combination of Keyword Search and Semantic Search with the recommendation system. We have also Implemented BLISS paper approach for our platform which uses minimal computation to build the index and also reduces the Search time.
  • Charts Summarization in a Power point presentation :
  • Built a Module for the synthesis feature, which takes user uploaded deck and finds the Charts presents in it. It finds the chart type and Generates Insights from those Charts which describes the key points and the purpose of the chart.
Deep LearningRecommender SystemsLarge Language Models (LLM)Aspose

Tredence inc.

Senior Data Scientist

Sep 2021 – May 2023 Β· 1 yr 8 mos

  • Business Insights Generation Tool For E-Commerce Data.
  • Tech : Time-Series Analysis, Anomaly Detection, Clustering, K-Means, Z-Score, DBSCAN, NLG.
  • Built and Deploy a Tool which Generates all the Business Insights combing the Sales, Traffic and Customers Data. We have built different set of Algorithmic Trees which uses different ML methods and Business Rules to generate all the Insights which replaces an Analyst’s work. We have used DBSCAN, I-FOREST, Z-SCORE, K-Means Clustering and other methods for Insight Generation.
Word EmbeddingsRecommender SystemsCluster AnalysisPredictive AnalyticsNatural Language GenerationPandas (Software)+5

Innominds

Data Scientist

Apr 2021 – Aug 2021 Β· 4 mos Β· Bangalore Urban, Karnataka, India Β· Hybrid

  • Content Moderation for Forum:
  • Tech : NLP, Sentence Transformer, Python, Convolution Layer
  • Built the moderation pipeline using deep learning model which can automatically moderate the content for the forum. This model restricts the negative content as per the business requirement to be posted on the forum. We have cleaned the entire data and build the dataset for model training including synthetic data generation. We have also built model with different set of approach and compare result with different Approaches which includes building the entire pipeline starting from getting the data and filter it by using moderation.
  • Entire Search Pipeline for Question. (Text and Image Similarity Search)
  • Tech : NLP, Flask, Sentence Embedding, Faiss, MongoDB, ResNet50, Clustering.
  • Built the entire pipeline for question to question search. We takes the question and find the similar question from our database to map it with the corresponding Learning Objective. We have designed the entire structure and implemented the pipeline for search functionalities. It Took the entire mapped data and build FAISS model on top the embedding vector to search.
Random ForestComputer VisionWord EmbeddingsArtificial Neural NetworksRecommender SystemsNatural Language Generation+8

Freelance

Machine Learning Engineer / Data Scientist

Jan 2021 – Apr 2021 Β· 3 mos Β· Remote

  • Automatic Key Value pair extraction from a pdf Document:
  • Tech : Semantic Segmentation, Pytesseract, Clustering, Silhouette Score.
  • Extracting the key value pair text provided inside the tables with the help of deep learning and puts the bounding box around each word. it also builds the relationship between each key and different values for that key(column wise) present inside a table. it detects the table and all it s column automatically then finds all the key value pair and builds the relationship among all those key value pair.
Word EmbeddingsArtificial Neural NetworksRecommender SystemsCluster AnalysisNatural Language GenerationPandas (Software)+4

Sony india software centre

Deep Learning Engineer

Aug 2019 – Jan 2021 Β· 1 yr 5 mos Β· Greater Bengaluru Area

  • Correcting AI (Improving detection accuracy using Representer Function):
  • Improving the accuracy of a Object Detection model with the help of Representer Function Algorithm. We have built an algorithm on top of YOLO algorithm and Representer function Algorithm which can improve the correctness of the detection model. It includes modifying the YOLO V3 loss function and YOLO V3 optimisation function with its implementation. Built different object detection model for different dataset. Understanding of and implementation of complete YOLOV3 architecture, training, optimisation process with Representer function algorithm. Complete design and development of the framework to improve the accuracy of object detection model.
  • Build and Explain Video Classification Model using 3D-Convolutional Neural Network:
  • Built a Toolbox which opens the AI decisions of a Video Classifier which takes a video segment(for ex: playing a cricket shot or riding bicycle etc) and the model as an input and explains the prediction result. It also provides the contribution of each pixel of each frame towards the classification and tells what are the important region in the input which contributes more towards classification.
  • Built different Video Classification models for the use. Proposed the idea of explaining video classification models. Understanding of different algorithms and methods used to explain video classification model and building of different video classification model for different datasets.
Computer VisionOpenCVArtificial Neural NetworksObject-Oriented Programming (OOP)Data StructuresNeural Networks

Wipro limited

2 roles

Machine Learning Engineer

Apr 2017 – Aug 2019 Β· 2 yrs 4 mos Β· Bengaluru, Karnataka, India

  • Working with WIPRO Research and Development team to build new and in demand products with the help of Machine Learning and AI.
  • Misclassification Dectection and Correction in Deep learning Models:
  • Built a Toolbox to show why a mis-classification occurs(Layer-wise visualization), where exactly the test image deviates, gets mis-classified and how can we overcome it by showing the Movement of the image from the starting layer to the end layer and show which filter plays the role for deviation(mis-classification) and also generates textual explanations for misclasification.
  • Deep Learning Library to Generate LRP:
  • Building a Deep Learning optimized Framework to generate heat-map(from LRP) which supports the model of any architecture and framework(currently supports TensorFlow, PyTorch, Keras). It includes the implementation of all the deep learning layer from scratch for ex: CNN, FC, ReLu, Pooling layers etc.
  • Sentiment Analysis on Indian Languages: (Word2vec, CNN, LSTM, Fasttext):
  • Classification of sentences into different categories such as positive, negative, and neutral on indian languages using sentiment analysis. Built the classifier model to classify the sentences in to different categories. Used Word2vec and Fasttext embedding for indian language and also handled
  • unseen word problem. Used CNN and LSTM to build the model.
  • Automation of Reimbursement Portal: (OCR, CNN, Text processing):
  • Designed a pipeline for automated claim response system which automates the claim reimbursement process and helps the team to optimize response time by minimizing the manual intervention. Built complete preprocessing code and AI model to fetch the information from the inputs. Built the entire pipeline of the automation tool of claim reimbursement process.
Computer VisionOpenCVWord EmbeddingsArtificial Neural NetworksCluster AnalysisPandas (Software)+6

Trainee

Dec 2016 – Mar 2017 Β· 3 mos Β· Bengaluru, Karnataka, India

  • Completed Training on Python, AI and different Machine learning Techniques .
  • Building Pneumonia Detection Classifier:
  • Built a Deep Learning Classifier by using Inception V3 network architecture to detect whether
  • someone is sick or not from their chest X-ray or their radiology report which classifies takes the X-ray image as an input and classifies it in pneumonia positive or negative classes.
Pandas (Software)Scikit-Learn

Education

Veer Surendra Sai University Of Technology ( Formerly UCE ), Burla

Bachelor of Technology - BTech β€” Computer Science

Jan 2012 – Jan 2016

Jawahar Navodaya Vidyalaya - JNV

High School Diploma β€” Science

Jan 2004 – Jan 2011

Stackforce found 100+ more professionals with Deep Learning & Recommender Systems

Explore similar profiles based on matching skills and experience