S

Shobhandeb Paul

Data Scientist

Kolkata, West Bengal, India4 yrs 10 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in NLP and machine learning for insurance.
  • Developed automated solutions improving customer support.
  • Enhanced customer segmentation leading to increased campaign effectiveness.
Stackforce AI infers this person is a Data Scientist specializing in Insurance and Fintech with strong NLP and machine learning expertise.

Contact

Skills

Core Skills

Natural Language Processing (nlp)Machine LearningData AnalysisPredictive AnalyticsProduct DevelopmentFinancial AnalysisData ScienceWeb DevelopmentCloud Computing

Other Skills

AWSAmazon Web Services (AWS)AngularAngular Command Line Interface (CLI)BERTopic ModellingCascading Style Sheets (CSS)CommunicationComputer VisionCritical ThinkingDeep LearningEmotional IntelligenceFlaskGenAIGitHub ActionsHDBSCAN

About

Data Scientist with over 3.7 years of experience specializing in the insurance industry, with expertise in Anomaly Detection, Customer Segmentation, and Intelligent Character Recognition (ICR)-based form processing. Currently at Fractal, I contribute to a team focused on optimizing NLP and machine learning pipelines, enhancing efficiency through code optimization and applying new methodologies. My recent work involves developing and refining unsupervised and supervised NLP solutions for customer support, utilizing advanced clustering and BERT-based models to uncover key insights in customer interactions.

Experience

4 yrs 10 mos
Total Experience
1 yr 10 mos
Average Tenure
1 yr 2 mos
Current Experience

Cognizant

Associate Data Scientist

Apr 2025Present · 1 yr 2 mos · Kolkata, West Bengal, India

Fractal

Data Scientist

Jun 2024Apr 2025 · 10 mos · Bengaluru, Karnataka, India · Hybrid

  • Improving customer support by reducing chat/call transfers, callbacks, and repeat contacts, focusing on analyzing chat and call transcripts to pinpoint core issues behind escalations. Using an unsupervised approach, I first generated embeddings with sentence transformers and then clustered data with HDBSCAN to reveal key topics behind transfer problems. BERTopic modeling extracted topics from each cluster, while large language models (LLMs) provided relevant labels for these topics through instruction finetuning with zero-shot and few-shot learning. To manage vast data volumes, I initially built pipelines on sample data and then scaled up by creating a supervised multi-class classifier.
  • This involved fine-tuning a prebuilt BERT model by adapting the final layer architecture to accommodate multiple classes, using customized weights based on the LLM-labeled clustered data. This supervised model was then used to tag the remaining data, automating the categorization process. This comprehensive approach enabled proactive issue resolution, improved first-contact resolution rates, and significantly enhanced the overall customer support experience.
Natural Language Processing (NLP)Sentence TranformerHDBSCANInstruction Finetuning of LLM - zero-shot,few-shot learningUMAPBERTopic Modelling+4

Tata consultancy services

3 roles

Data Scientist

Promoted

Jul 2023Jun 2024 · 11 mos · Kolkata, West Bengal, India

  • Delivered business-tech insights in insurance by analyzing diverse RFP responses, crafting insightful dashboards thus improving future RFP responses. Also, deployed an Angular App for streamlined access, enhancing data utilization.
  • Processed 50 GB of policyholder data, achieving a 10x size reduction10x size reduction via Parquet transformation.
  • Employed pandas, scikit-learn, matplotlib, and seaborn to efficiently analyze, visualize, and extract insights from expansive datasets.
  • Contributed & enhanced customer segmentation model for targeted marketing. Utilized advanced data analysis tools including pandas, scikit-learn, and TensorFlow, resulting in a 25% increase in campaign effectiveness through data-driven insights and optimized marketing strategies.
  • Optimized operational efficiency by creating an automated claims review workflow that intelligently directed claims to relevant teams based on their severity,led to a notable reduction in processing times by 30%.
  • Employed & integrated machine learning algorithms to accurately identify potentially fraudulent claims, resulting in a significant reduction of false positives by 20% and substantial cost savings.
  • Contributed by developing a predictive model that evaluated risk factors & forecasted claim probability, boosting underwriting precision by 15% and streamlining risk assessment procedures.
  • Working closely with the TCS BaNCS Pre-Sales Team, understanding the requirement of the customers to develop P&C and Health Products for demonstration.
  • Bagged 5th position in the GenAI Experience challenge hosted within TCS.
Financial AnalysisProduct DevelopmentUser Interface DesignPredictive AnalyticsMLOpsCritical Thinking+14

AI/ML Developer

Promoted

Jul 2022Jul 2023 · 1 yr · Kolkata, West Bengal, India

  • Working closely with the TCS BaNCS Pre-Sales Team, understanding the requirement of the customers to develop P&C and Health Products for demonstration.
  • Segregating the Policies issued from various region as per client given records and performing statistics
  • and finding out insights for the same.
  • Task in making the P&C and Health Product more enhanced and market ready depending upon the
  • needs of various regions, worldwide.
Financial AnalysisProduct DevelopmentUser Interface DesignPredictive AnalyticsCritical ThinkingProfessional Skills+9

Assistant System Engineer Trainee

Jul 2021Jun 2022 · 11 mos · Kolkata, West Bengal, India

  • TCS BaNCS Insurance Digital Platform: Front-End Development, Core TCS BaNCS System
  • Client: TCS Internal Product Development
  • From the scratch development, testing and bug fixing of TCS Bancs Insurance product which includes
  • Dashboard, New Business Issuance and Claim section for end-point customers, a complete end to end
  • Insurance requirement any Insurance Company to manage its flow.
  • Using the Core BaNCS system to develop a DEMO environment of P&C and Health products of TCS
  • BaNCS, to meet the requirements for end-point customers.
  • POC: Collecting and Converting Data from Oracle DB to Parquet file format and performing analytics
  • Used the cx_Oracle library to extract the data from the Oracle Database and then using the Spark to
  • convert the entire DB to Parquet file format.
  • Using the Pandas library, the Parquet file was converted into a DataFrame.
  • Basic charts and graphs were then derived from Parquet file using the Matplotlib, Seaborn and Pandas
  • library in order to get the insights on the behavior of the Policy Holders.
AngularFlaskPredictive AnalyticsSpring BootPython (Programming Language)SQL+4

Ineuron.ai

Project Intern

Dec 2022Jan 2023 · 1 mo · Bengaluru, Karnataka, India · Remote

  • Project Overview:
  • The goal of this project was to develop a cloud-based insurance premium predictor using AWS, GitHub Actions for Continuous Integration/Continuous Deployment (CI/CD), and Streamlit for creating a web application. The model would predict the insurance premium for a given customer based on various features . This model would be used by insurance companies to provide instant quotes to their customers and help them make informed decisions.
Financial AnalysisAmazon Web Services (AWS)Machine LearningStatisticsPredictive AnalyticsInsurance+5

Education

Guru Nanak Institute of Technology , Kolkata

Bachelor of Technology - BTech — Electronics and Communications Engineering

Jan 2017Jan 2021

iNeuron.ai

Full Stack Data Scientist — Data Science

Sep 2022Apr 2024

Stackforce found 100+ more professionals with Natural Language Processing (nlp) & Machine Learning

Explore similar profiles based on matching skills and experience