Aryan Singh

AI Researcher

Noida, Uttar Pradesh, India11 yrs 10 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • 10 years of data science experience across multiple industries.
  • 5 patents in visual artifact creation using generative AI.
  • 17 data products developed and running in production.
Stackforce AI infers this person is a Data Science expert with extensive experience in AI-driven solutions across various industries.

Contact

Skills

Core Skills

Natural Language Processing (nlp)Machine LearningComputer VisionEngineering ManagementData Science

Other Skills

ARIMAAlgorithmsApache KafkaApache PigApache SparkArtificial Neural NetworksBig Data AnalyticsCCSSCatboostCollaborative FilteringConcept ExtractionConvolutional Neural Networks (CNN)Core JavaDash

About

Howdy ! My name is Aryan and I am working as Senior Data & Applied Scientist at Microsoft where I am revolutionizing the creator landscape by powering flagship product Microsoft Designer using computer vision and NLP solutions. https:\\designer.microsoft.com I have 10 years of experience in data science for some of the biggest companies in retail, transportation, e commerce, leisure and healthcare domain. I have spent a year each in US and UAE solving problems with data. I have also worked with clients in Denmark, UK and Switzerland. My experiences range from building a smart crop model using mobile net V2 to contextual QA and information retrieval system for Novartis value access teams to building ranking and recommendation models using reinforcement learning. I currently have 5 patents in the area of visual artifact creation using generative AI, 4 papers published in the area of computer vision and NLP. Currently, more than 17 data products developed by teams I have either lead or been part of are running in production spanning across Computer vision, Demand Forecasting, Marketing Analytics, Customer Analytics, Text Summarization, QA Systems and recommender systems. Technology wise I have experience in building scalable ML systems using Py spark and Hive. Also, I have expertise in Python, R, SQL, Scikit-Learn, Keras, and Pytorch. In my free time I love playing chess and my favorite opening is Sicilian Defense.

Experience

11 yrs 10 mos
Total Experience
1 yr 7 mos
Average Tenure
4 yrs 7 mos
Current Experience

Microsoft

3 roles

Senior Applied AI Scientist 2

Aug 2025Present · 10 mos · Noida, Uttar Pradesh, India

Senior Applied AI Scientist

Aug 2023Aug 2025 · 2 yrs · Noida, Uttar Pradesh, India

  • 1. Sticker generation model using Dalle 3 with zero shot post processing using open cv.
  • 2. Contextual Font Recommendation Model: Lightweight low latency Mini LM V2 for matching design elements to fonts. Works via few shot inference on data generated through GPT 3.5.
  • 3. Font Classification on OCR regions of AI generated designs with a fine tuned Mobile Net V3 network.
  • 4. Generative designs using Dall E 3, SAM etc.
  • 5. Text to collage and album generation using GPT 3.5, Dall E 3 and Open cv.
Natural Language Processing (NLP)Machine LearningDeep LearningPyTorchComputer VisionLarge-scale Projects

Applied AI Scientist 2

Nov 2021Aug 2023 · 1 yr 9 mos · Noida, Uttar Pradesh, India

  • Providing state of the art solutions on NLP and Vision for Microsoft Designer: https://designer.microsoft.com
  • Ranking font, color and image suggestions using RL (contextual bandit models)
  • Abstractive and Extractive Title generation model given design description
  • Hashtag generation using GPT-3.5
  • Low Latency smart crop for videos
  • Custom named entity recognition model for specialized effects
Natural Language Processing (NLP)Machine LearningDeep LearningComputer VisionReinforcement LearningTransformers

Novartis

Manager, Data Science

Apr 2020Nov 2021 · 1 yr 7 mos · Hyderabad, Telangana, India

  • 1. Contextual Question Answering engine for HTA* reports.
  • 2. Data Science engine for personalized tactic sequencing and content:
  • AI/ML backed sales attribution from brick to HCP level.
  • Comprehensive 360 view of the HCP along with micro clustering.
  • Ensemble of predictive model and optimizer for personalized channel sequencing.
  • Affinity models for personalized content.
  • 3. CAPE(Capabilities For Analyzing Price Estimates)
  • Innovative, integrated and interpretable tool built on python dash backed by Catboost and KNN models to provide Price range, Price and sales evolution of a prospective brand launch at Novartis.
  • It does this by providing the top analogs for the to-be launched brand.
  • I lead the conceptualization as well as development of the tool. Technologies Used - Python, Catboost, KNN, Plotly, Dash.
  • 4. Dynamic Targeting - Identify the best promotional strategy for Novartis Oncology brands to dynamically target the stakeholders. Technologies Used - Python, Catboost, Hive, Impala.
  • 5. Congress and Event Optimization: Optimize the congress and event invitations and the roles of the invitees using machine learning. Technologies Used - Excel, Decision Tree, Python.
Engineering ManagementLarge-scale Projects

Udemy

Author

Jan 2020Dec 2020 · 11 mos

  • Authored and reviewed courses on Python and Machine Learning. Machine learning for finance is one of them with a rating of 4.5/5.
Data ScienceNatural Language Processing (NLP)Machine LearningPython (Programming Language)

Packt

Author

Jan 2020Dec 2020 · 11 mos

  • Authored and reviewed courses on Machine Learning and Deep Learning.

Upgrad.com

Teaching Assistant - NLP

Jul 2019Oct 2019 · 3 mos · Mumbai Metropolitan Region

  • Helping students with doubt clearing sessions on NLP.

Publicis sapient

Senior Data Scientist

Feb 2019Apr 2020 · 1 yr 2 mos · Dubai, Dubai, United Arab Emirates

  • UAE based Cinemas: Introduced new combos in the candybar menu to increase combo incidence by over 10 percent and revenue per admission by 4 AED. Techniques Used: Python, FP Growth, Apriori, A/B Testing(T-Test, Chi-Square), Power BI, SQL, Vertica.
  • UAE based Cinemas: Developed a Customer 360 vision by introducing Segmentation, CLV and Churn models. Techniques Used: Python, R, Pareto/BG-NBD, LSTM, Logistic Regression, XGBoost, K- means, Tableau, SQL, Vertica.
  • UAE based Leisure and Entertainment business: Revamped the pricing of different packages being offered to the customers in order to increase the NPS value and revenue. Techniques Used: Python, Simulated Annealing, Tableau
  • UAE based retailer: Enabling the retail business by forecasting demand across 250 + stores across 2000 SKUs and decreasing the shortage by over 30 percent in 3 months of launch. Techniques Used: Python, LSTM, Prophet, DTW.
  • U.S based investment bank: Build a knowledge graph and search mechanism to power the chatot. Techniques Used: Python, Spacy NER, Concept Extraction, Word2Vec and Neo4j.
  • Sapient COE: Develop a generalized Media Mix Modelling accelerator which analyzes contribution, elastic, decay and carryover effect to recommend optimum budget allocation. Techniques Used: Python, Linear Regression, L1-L2, Elastic Net, Scipy Optimization.
  • Sapient COE: Developed recurrent capsule networks for extractive summarization. Techniques Used: NLP, NLU, Deep Learning, LSTM, Capsule Networks, Machine Learning, Tensorflow, Python, Bleu Score, Rogue N Metrics, Tensorboard.

Tapchief

Expert - NLP and Computer Vision

Jan 2019Sep 2019 · 8 mos · Gurugram, Haryana, India

  • Mentoring students and industry professionals on chatbot and computer vision related problems including entity and intent extraction, semantic segmentation and image classification.

Globallogic

2 roles

Senior Machine Learning Engineer

Jul 2017Feb 2019 · 1 yr 7 mos

  • Client: Fortune 500 Retailer Of USA
  • Procured the rating and review data across omni-channels and collating it with previous product demand for inventory forecasting across stores.
  • Lead the team to build an end to end solution for categorizing and ranking the vendors to fulfill each shipment saving over 20 % in vendor default and shipment quality control issues.
  • Technologies: Python, Collaborative Filtering, K-Mode clustering, XGBoost, ARIMA, Logistic Regression, Multinomial Naive Bayes, Feed Forward Neural Network, Keras, LSTM.
  • Client: Major Manufacturing Company Of Germany
  • Lead the effort to predict the future requirement of spare parts at various distribution centres of the client.
  • Technologies: Python, RNN, LSTM, Keras, Pytorch, XGBoost, Time Series Forecasting, Pandas, Numpy, SGD, ARIMA.
  • Client: Major publication of U.S.A
  • Developed a text pruning solution to free up space on the front page of the newspaper by using unsupervised NLP techniques.
  • Technologies: Python, NLP, Lex Rank, PyText rank, Gensim, Pandas, Scikit Learn, Bleu score, Rogue-N metrics.
  • Client: Major Ecommerce player in India
  • Developed a CNN based model using the Resnet 50 architecture to identify the label from the
  • from product images.
  • Technologies: Python, Deep Learning, Keras, SGDR, Transfer Learning, Computer Vision.
Data ScienceNatural Language Processing (NLP)Big Data AnalyticsTime Series Analysis

Machine Learning Engineer

Jul 2015Jul 2017 · 2 yrs

  • Electricity consumption intelligence for a Canada based client:
  • Analysis of the electricity consumption data using linear regression in multiple variables.
  • Predicting the future electricity consumption using devised model.
  • Detecting anomalies in consumption pattern by using spring batch and K Means Clusteirng analysis.
  • Key Responsibilities:
  • Leading a team of 6 for end to end integration and delivery of the application.
  • Conceiving design, frameworks and technology stack for the app.
  • Developed the data analysis model for prediction of future electricity consumption.
  • Technologies:
  • R, Statistics, Machine Learning, Big Data, MongoDB, Spring Batch.
  • Project: Gloria(https://www.globallogic.com/our-work/gloria-chatbot/)
  • Developed a closed domain voice driven chatbot paired with IoT system to control the devices around.
  • Technologies: RASA, Bluetooth Low Energy module, Arduino, Spring Boot, Gradle.
  • Wallet Application
  • Wrote Splunk queries and spring batch jobs to generate descriptive analysis insights from streaming log files.
  • Improved the accuracy of data access layer by 20 to 30 percent by caching the retrieval of more than 20 million data objects nearing 20 GB.
  • Developed and designed highly scalable and high throughput offer prediction model using xgboost classifier.
  • Single handedly developed a tool to schedule periodic health checks of the REST services.
  • Chat Application for the support associates in the retail store
  • Technologies Used: Spring Boot, Gradle, REST, Mosquitto Broker, Cassandra database.
  • Responsibility: Creating the RESTful web services to be consumed by the android and ios chat client. Configuring the MQTT broker to persist the messages into cassandra.
Machine LearningPython (Programming Language)Core JavaJava Enterprise EditionXGBoost

Tata consultancy services

Assistan System Engineer

Aug 2014Jul 2015 · 11 mos · Gurugram, Haryana, India

  • AP Moller - Maersk
  • Optimisation of possible routes for evacuation of containers from a surplus
  • site to a deficit one using SAS. Showing these routes on google maps.
  • Cleaning up of huge csv data and load into the oracle database by writing ETL jobs.

Education

Indian Institute of Technology Hyderabad

Master of Technology - MTech — Artificial Intelligence

Jun 2020Jun 2023

International Institute of Information Technology Bangalore

Post Graduate Diploma in Machine Learning and AI — Artificial Intelligence

Jan 2018Jan 2019

Guru Nanak Dev University

Bachelor of Technology (B.Tech.)

Jan 2010Jan 2014

Stackforce found 100+ more professionals with Natural Language Processing (nlp) & Machine Learning

Explore similar profiles based on matching skills and experience