Soumyabrata Pal

AI Researcher

Bengaluru, Karnataka, India9 yrs 7 mos experience
Highly Stable

Key Highlights

  • Expert in Statistical Machine Learning and Non-convex Optimization.
  • Designed algorithms for scalable personalization in machine learning.
  • Proven track record in developing robust meta learning algorithms.
Stackforce AI infers this person is a Machine Learning Specialist with a focus on Healthcare and SaaS applications.

Contact

Skills

Core Skills

Machine LearningCollaborative FilteringMeta LearningDeep Learning

Other Skills

Low Rank Matrix CompletionDeep Neural NetworksSequence to Sequence ModelsClusteringLinear RankingDynamic ProgrammingLSTMJavaScriptNode.jsCInformation TheoryNumber TheoryReal AnalysisC++Algorithms

About

My research interests are Statistical Machine Learning with a focus on Non-convex Optimization and Online Learning. More concisely, I like Statistical recovery/reconstruction problems under different reasonable structural assumptions on the data generating mechanism such as sparsity, low-rank, presence of latent clusters among others. Nowadays, I am working on designing algorithms in offline/online/hybrid systems aimed at incorporating personalization efficiently at scale. Most of my work so far can be categorized into five topics namely 1) Scalable Personalization via Low Rank and Sparse Decomposition 2) Multi-agent Online Learning via Collaborative Filtering 3) Latent Variable models - Mixtures of Linear Regression, Linear Classifiers and Distributions 4) Generative models for Graph Clustering - Geometric Block Model and 5) Active learning for Semi-supervised clustering - Disjoint Clusters, Overlapping Clusters and Fuzzy Clusters

Experience

9 yrs 7 mos
Total Experience
3 yrs 6 mos
Average Tenure
2 yrs 5 mos
Current Experience

Adobe

2 roles

Research Scientist 2

Feb 2025Present · 1 yr 4 mos · Bengaluru, Karnataka, India

Research Scientist

Jan 2024Feb 2025 · 1 yr 1 mo · Bengaluru, Karnataka, India

Google

Visiting Faculty Researcher

Jan 2022Dec 2023 · 1 yr 11 mos · Bengaluru, Karnataka, India

  • Designed and analyzed Interactive Machine learning models for fast personalized recommendations via
  • Online Collaborative Filtering algorithms. Our approach uses low rank matrix completion techniques with
  • provable theoretical guarantees. Our designed algorithm is scheduled to be implemented for predicting available call-slots to provide advice for 2.2 million underprivileged expectant mothers. In simulations, our algorithm is providing a 20% reduction in cost over non-collaborative algorithms in the above application.
  • Designed Robust Meta Learning Algorithms with privacy guarantees for multi-task learning/meta-learning in a novel Low Rank and Sparse framework that carefully model parameter sharing across users. Used
  • fast Alternating Minimization techniques with provable guarantees. Implemented designed algorithm for
  • fine-tuning pre-trained Deep Neural Network thus personalizing Google’s Email Ranking program - led to
  • > 2% increase over baseline MRR obtained without any personalization (across > 100k enterprises).
Machine LearningCollaborative FilteringLow Rank Matrix CompletionMeta LearningDeep Neural Networks

Amazon

Applied Scientist Intern

Feb 2020May 2020 · 3 mos · Berkeley, California, United States

  • Worked on PECOS and Sequence to sequence deep learning models in order to predict the queries of a user given the previous query in a prefix-free setting. I completed a detailed evaluation of both these models on random traffic data and came up with conclusive evidence to suggest that Seq2seq performed better than PECOS.
  • I designed a new two-stage model that combined PECOS and Seq2seq by clustering the ELMO embedding of each word in the vocabulary and training the seq2seq model on the clusters as tokens. In the second stage, I trained a linear ranker to rank the items in each cluster.
  • The new model had the best Weighted BLEU score beating the state-of-the-art pre-trained models and also obtained a comparable MRR score. Further, this model performed better than all other naive baselines.
Deep LearningSequence to Sequence ModelsClusteringLinear RankingMachine Learning

Ernst & young

PhD AI Intern

May 2019Aug 2019 · 3 mos · Palo Alto

  • Worked on document segmentation in order to recover Unit of Analysis (UoA) from a single page invoice. I developed an end-to-end trainable deep learning model using LSTM that learns the similarity between two bounding boxes of words present in a document.
  • From the resulting fully connected weighted graph, I bypassed the necessity for the graph clustering problem (and therefore any hyperparameters) by developing a novel dynamic programming algorithm for inference. The DP algorithm was completely parameter-free and therefore domain independent.
  • I also designed a fast version of this algorithm that is able to achieve highly accurate results on documents and required an approximate time of only 20 seconds.
Deep LearningDynamic ProgrammingLSTMMachine Learning

University of massachusetts amherst

Graduate Research Assistant

Aug 2016Nov 2021 · 5 yrs 3 mos · Amherst (MA), United States

  • I completed my Ph.D under the supervision of Dr. Arya Mazumdar at the University of Massachusetts Amherst. My thesis titled "Mixture Models in Machine Learning" can be found at this link https://web.cs.umass.edu/publication/docs/2022/UM-CS-PhD-2022-001.pdf

Samsung r&d institute india - bangalore private limited

Summer Research Intern

May 2015Jul 2015 · 2 mos · Greater Bengaluru Area

  • Worked on open source codes of XTK viewer and Rest-3D server and integrated them and
  • modified them to include a large number of features
  • Extensive work on javascript and Node.js to build both server side and client side systems that
  • interact with each other by sending and responding to requests.
  • Guide:-Ashes Dhanna Ganguly
JavaScriptNode.js

Indian academy of sciences

Summer Research Intern

May 2014Jul 2014 · 2 mos

  • Worked on the establishment of a Line of Sight radio link for horizontal communication using the
  • modern DVB-S and DVB-S2 technology used for satellite communication.
  • This work, if implemented in reality, will lead to a quasi-error free horizontal communication
  • Guide: Prof. Dr. Rabindranath Bera, Member of the Indian Academy of Sciences, India
  • See project Development of Communication Model

Education

University of Massachusetts Amherst

Doctor of Philosophy (Ph.D.) — Computer Science

Jan 2016Jan 2022

Indian Institute of Technology, Kharagpur

Bachelor of Technology (B.Tech) — Electronics and Electrical Communication Engineering

Jan 2012Jan 2016

Hemsheela Model School, Durgapur

CBSE Board

Jan 2010Jan 2012

St.Xavier’s School, Durgapur

Indian Certificate of Secondary Education (CISCE)

Jan 2010Jan 2010

Indian Institute of Technology, Kharagpur

Bachelor of Technology - BTech

Aug 2012Jul 2016

Stackforce found 100+ more professionals with Machine Learning & Collaborative Filtering

Explore similar profiles based on matching skills and experience