Vatsal Parsaniya โ AI Researcher
As a Data Scientist at Embibe, I collaborate with the product team within the Discovery Search Science group to transform intricate business challenges into data science problem statements. My role revolves around enhancing the search experience for users by optimizing multilingual search outcomes across a wide range of customer products and internal tools. I have over 3 years of expertise in Information Retrieval, NLP, and micro-service development, with hands-on experience in innovating new products using Data Science and Machine Learning. Some of my notable achievements include developing a scalable text entity extraction algorithm that identifies academic entities from multiple ontology datasets and synonym dictionaries, and designing a Retrieval Augmented Generation system for search and chatbot applications that retrieves academic content with ontologies information and dynamically generates responses. I am proficient in various data stores, development tools, backend tools, observability tools, and frameworks and models for NLP and ML. I am passionate about multilingual search systems and uncovering valuable insights from complex data. โข ๐๐๐ญ๐ ๐๐ญ๐จ๐ซ๐๐ฌ : Elasticsearch, Milvus, Solr, MongoDB, Redis, PostgreSQL โข ๐๐๐ฏ๐๐ฅ๐จ๐ฉ๐ฆ๐๐ง๐ญ ๐๐จ๐จ๐ฅ๐ฌ : Git, Curl, Jupyter Notebook, PyCharm, Postman โข ๐๐๐๐ค๐๐ง๐ ๐๐จ๐จ๐ฅ๐ฌ : FastAPI, Airflow, Docker, Azure, Jenkins(CI/CD) โข ๐๐๐ฌ๐๐ซ๐ฏ๐๐๐ข๐ฅ๐ข๐ญ๐ฒ : Newrelic, Loggly, Pyinstrument ๐๐ญ๐๐ญ๐ข๐ฌ๐ญ๐ข๐๐ฌ / ๐๐๐๐ก๐ข๐ง๐ ๐๐๐๐ซ๐ง๐ข๐ง๐ / ๐๐๐๐ฉ ๐๐๐๐ซ๐ง๐ข๐ง๐ / ๐๐๐ : โ ๐ ๐ซ๐๐ฆ๐๐ฐ๐จ๐ซ๐ค๐ฌ ๐๐จ๐ซ ๐๐๐ : NLTK, Spacy, PyTorch, Pandas, Scikit-Learn, Text Blob โ ๐๐๐ ๐๐จ๐๐๐ฅ๐ฌ ๐๐ฌ๐๐ : BERT, RoBERTa, Elastic-ELSER, ALBERT, T5, LLM โ ๐๐จ๐๐๐ฅ ๐๐๐ฉ๐ฅ๐จ๐ฒ๐ฆ๐๐ง๐ญ & ๐๐ข๐๐๐๐ฒ๐๐ฅ๐ : NVIDIA Triton, MLflow โ ๐๐ ๐๐ฅ๐ ๐จ๐ซ๐ข๐ญ๐ก๐ฆ๐ฌ ๐๐ฆ๐ฉ๐ฅ๐๐ฆ๐๐ง๐ญ๐๐ : Linear Regression, Logistic Regression, XGBoost, KNN, KMeans, PCA, TSNE, TF-IDF, Word2Vec, Ensemble Algorithms, Topic Modeling โ ๐๐ข๐ฌ๐ฎ๐๐ฅ๐ข๐ณ๐๐ญ๐ข๐จ๐ง : Elastic-Kibana, Metabase, Matplotlib, Seaborn, Plotly โ ๐๐ฉ๐ฉ๐ฅ๐ข๐๐๐ญ๐ข๐จ๐ง ๐๐๐ฆ๐จ : Gradio, Streamlit
Stackforce AI infers this person is a Data Scientist specializing in NLP and search optimization within the SaaS industry.
Location: Bangalore, Karnataka, India
Experience: 7 yrs 6 mos
Skills
- Natural Language Processing (nlp)
- Search Engine Optimization
- Microservice Development
- Machine Learning
- Conversational Ai
Career Highlights
- Expert in optimizing multilingual search systems.
- Developed scalable entity extraction algorithms.
- Proficient in NLP and machine learning technologies.
Work Experience
PW (PhysicsWallah)
Senior Data Scientist (1 yr 10 mos)
Embibe
Data Scientist (2 yrs 1 mo)
Jr. Data Scientist (4 mos)
Data Science Intern (3 mos)
Intellica.AI
Machine Learning Engineer (3 mos)
Machine Learning Intern (7 mos)
Cretus- The Robotics and Automation Club of PDPU
Advisor (1 yr 1 mo)
Event Management Head (10 mos)
Committee Member (1 yr 1 mo)
Education
Bachelor of Engineering - BE at Pandit Deendayal Energy University