Mohd Muttalib

Data Scientist

Bengaluru, Karnataka, India · 4 yrs 8 mos experience
Highly Stable · AI ML Practitioner

Key Highlights

  • Kaggle Expert with a strong competitive track record.
  • Proficient in Machine Learning and Data Analysis.
  • Active contributor to open-source projects on GitHub.
Stackforce AI infers this person is a Data Scientist with expertise in Machine Learning and Data Analysis in the SaaS industry.

Skills

Core Skills

Machine Learning · Data Analysis

Other Skills

Python (Programming Language) · Data Structures · Natural Language Processing (NLP) · Chatbots · PyTorch · Random Forest · Data Pipelines · Deep Learning · Computer Vision · Machine Learning Algorithms · SQL · Algorithms · Advanced Excel · Advanced SQL · PowerBI

About

Self-motivated, hardworking, and diligent professional with 2.5 years of experience as a Data Scientist. Currently working as a Data Scientist, I am eager to thrive in a challenging environment where I can demonstrate my skills, apply my knowledge, and contribute to the organization's growth. I am particularly interested in Artificial Intelligence and seek opportunities in Data Science, Machine Learning, Data Analysis, and related fields where I can contribute to the employer's success while building my own capabilities.

TECHNICAL SKILLS: Data Structures & Algorithms, Machine Learning, Deep Learning, Natural Language Processing, SQL, Python, Statistical Modelling, Classification, Clustering, Regression, Feature Engineering, Data Visualization.

Programming Languages: Python (proficient), Java (basics).

Frameworks & Libraries: TensorFlow, TensorFlow Lite, Keras, NumPy, Pandas, PyTorch, scikit-learn, OpenCV, Matplotlib, Seaborn, Plotly Dash, re, Pandas Profiling, Flask, Heroku, Streamlit.

Software & Tools: Git, GitHub, PyCharm, VS Code, Jupyter Notebook, Hugging Face, Roboflow.

Excellent communication skills, quick to pick up new skills, hard-working, and with excellent knowledge of core subjects. Participated in various sports events, including Annual Sports Day at school, and in various cultural events at school and college.

You can reach me at:
GitHub repository: https://github.com/MMuttalib1326
Kaggle: https://www.kaggle.com/mohdmuttalib

Experience

Total Experience: 4 yrs 8 mos
Average Tenure: 4 yrs 8 mos
Current Experience: 4 yrs 8 mos

Almabetter

4 roles

Netflix Movies and TV Shows Clustering (Unsupervised Learning)

Mar 2023 – May 2023 · 2 mos

  • Developed an unsupervised ML model that clusters comparable titles by matching their text-based attributes.
  • Used one-hot encoding to transform the data and evaluated feature correlation.
  • Experimented with the Elbow method, hierarchical clustering, and Silhouette analysis to find the optimal number of clusters.
  • Performed topic modeling using LDA and LSA to uncover the latent topics of the content in the calculated 3 clusters.
  • Performed EDA and implemented clustering algorithms on over 7,700 records of Netflix movies and TV shows, identifying well-separated clusters in high-dimensional space.
  • Determined the optimal number of clusters using the Elbow method and Silhouette scores: 4 for K-Means and 2 for hierarchical clustering.
  • Processed the textual features using NLP techniques, including text cleaning, tokenization, text normalization, and TF-IDF vectorization, followed by PCA to reduce the resulting sparse matrix of over 46,000 attributes.
  • Skills: k-means clustering · Hierarchical Clustering · PCA · Scikit-Learn · Data Analysis · NumPy · Seaborn
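
The Silhouette analysis mentioned above can be illustrated with a minimal, standard-library sketch. It is not the project's code: the toy 2-D points, the two labelings, and the function names `euclidean` and `silhouette_score` are assumptions made for illustration. A real pipeline would more likely call scikit-learn's `silhouette_score` on the PCA-reduced TF-IDF matrix.

```python
import math


def euclidean(p, q):
    """Euclidean distance between two equal-length numeric sequences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def silhouette_score(points, labels):
    """Mean silhouette coefficient over all points (closer to 1 is better)."""
    # Group point indices by cluster label.
    clusters = {}
    for idx, label in enumerate(labels):
        clusters.setdefault(label, []).append(idx)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    total = 0.0
    for i, label in enumerate(labels):
        own = clusters[label]
        if len(own) == 1:
            continue  # convention: a point alone in its cluster contributes 0
        # a: mean distance to the other members of the point's own cluster
        a = sum(euclidean(points[i], points[j]) for j in own if j != i) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster
        b = min(
            sum(euclidean(points[i], points[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != label
        )
        denom = max(a, b)
        if denom > 0:
            total += (b - a) / denom
    return total / len(points)


# Toy data (illustrative only): two visibly separated groups of points.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
good_labels = [0, 0, 0, 1, 1, 1]  # matches the visible grouping
bad_labels = [0, 1, 0, 1, 0, 1]   # mixes the two groups

good_score = silhouette_score(points, good_labels)
bad_score = silhouette_score(points, bad_labels)
print(f"good labeling: {good_score:.3f}, bad labeling: {bad_score:.3f}")
```

A labeling that keeps the two groups apart scores close to 1, while one that mixes them scores much lower. Comparing such scores across candidate cluster counts is the idea behind choosing the number of clusters this way.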

NYC-Taxi-Trip-Duration-Prediction

Feb 2023 – Mar 2023 · 1 mo

  • The dataset consists of cab trip records and is based on the 2016 NYC cab trip data.
  • The dataset was originally published by the NYC Taxi and Limousine Commission.
  • It contains trip information such as pickup time, geo-coordinates, and number of passengers.
  • The task was to build a model that predicts the total ride duration of taxi trips in New York City from individual trip attributes.
  • The same dataset can also be explored for further insights.
  • The main goal of the project was to determine the factors affecting taxi trip duration and service.
  • Before visualization, the data was analyzed and checked for missing values, which were treated.
  • Data visualization showed that most trips took between 10 and 30 minutes to complete.
  • Among the models compared, the XGBoost Regressor was the most suitable for predicting trip duration.
  • This type of prediction and research in the cab booking segment helps companies increase profit.
  • Predicting bookings and peak hours is very important for cab providers.
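
As a hedged illustration of how trip attributes such as pickup time and geo-coordinates could be turned into model inputs, the sketch below derives a great-circle distance and two time features. The feature choice, the timestamp format, the function names `haversine_km` and `trip_features`, and the sample coordinates are all assumptions. The profile does not state which features the project actually used.

```python
import math
from datetime import datetime

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    h = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    # Clamp to 1.0 to guard against floating-point overshoot before asin.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, h)))


def trip_features(pickup_time, pickup_lat, pickup_lon, dropoff_lat, dropoff_lon):
    """Build a small feature dict from one trip record (hypothetical field layout)."""
    ts = datetime.strptime(pickup_time, "%Y-%m-%d %H:%M:%S")
    return {
        "pickup_hour": ts.hour,
        "pickup_weekday": ts.weekday(),  # Monday is 0
        "distance_km": haversine_km(pickup_lat, pickup_lon, dropoff_lat, dropoff_lon),
    }


# Made-up sample trip inside Manhattan, for illustration only.
features = trip_features("2016-03-14 17:24:55", 40.7679, -73.9822, 40.7656, -73.9646)
print(features)
```

Features like these would then be passed, together with the passenger count, to a regressor such as the XGBoost model named above.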

Personal & Professional Development (AlmaX)

Oct 2022 – Oct 2023 · 1 yr

  • Mastering acquired skills such as Advanced Excel, Advanced SQL, PowerBI, Tableau, Looker Studio, and Python.
  • Working on personal development and building a professional portfolio.
  • Challenging myself every day to raise the bar of my understanding.
  • Learning personal branding and developing business communication skills for smooth professional interaction.
  • Exploring leading libraries, tools, technologies, and frameworks such as Big Data, Hadoop, Hive, Airflow, Apache, PySpark, ETL, Snowflake, OpenCV, spaCy, Web Scraping, Azure, AWS, Streamlit, Flask, Keras, TensorFlow, and Deep Learning.
  • Skills: Advanced Excel · Advanced SQL · PowerBI · Tableau · Looker Studio · Python (+19 more)

EDA on Telecom churn

Oct 2022 – Nov 2022 · 1 mo

  • Steps performed:
  • Collect the required data from the Telecom Churn dataset.
  • Clean the collected dataset by removing irrelevant and duplicate data and any missing or null values.
  • Prepare the dataset for analysis by transforming and encoding the data into a suitable format.
  • Visualize the data using graphs and charts to identify trends and patterns. This can help to gain insights into user behavior and preferences.
  • Conclusions made:
  • 1. Completed EDA on the Telecom churn dataset, analyzed the factors affecting customer churn, and came up with a business strategy.
  • 2. Derived useful insights from the data for churn prevention.
  • Skills: Exploratory Data Analysis · Visualization · Matplotlib · Seaborn · Python (Programming Language)
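
The cleaning and analysis steps listed above can be sketched in plain Python. This is an assumption-laden toy, not the project's notebook: the column names (`customer_id`, `plan`, `churn`), the sample rows, and the helper names `clean` and `churn_rate_by` are hypothetical, and the same steps would more commonly be written with pandas.

```python
from collections import defaultdict


def clean(rows):
    """Drop rows with missing values and exact duplicate rows, keeping order."""
    seen, kept = set(), []
    for row in rows:
        if any(value is None or value == "" for value in row.values()):
            continue  # missing or null value
        key = tuple(sorted(row.items()))
        if key in seen:
            continue  # exact duplicate of an earlier row
        seen.add(key)
        kept.append(row)
    return kept


def churn_rate_by(rows, column):
    """Fraction of churned customers for each distinct value of `column`."""
    counts = defaultdict(lambda: [0, 0])  # value -> [churned, total]
    for row in rows:
        counts[row[column]][0] += 1 if row["churn"] == "yes" else 0
        counts[row[column]][1] += 1
    return {value: churned / total for value, (churned, total) in counts.items()}


# Toy records (illustrative only): one duplicate and one row with a missing value.
rows = [
    {"customer_id": 1, "plan": "intl", "churn": "yes"},
    {"customer_id": 1, "plan": "intl", "churn": "yes"},
    {"customer_id": 2, "plan": "intl", "churn": "no"},
    {"customer_id": 3, "plan": "basic", "churn": "no"},
    {"customer_id": 4, "plan": "basic", "churn": None},
    {"customer_id": 5, "plan": "basic", "churn": "no"},
    {"customer_id": 6, "plan": "basic", "churn": "yes"},
]

cleaned = clean(rows)
rates = churn_rate_by(cleaned, "plan")
print(len(cleaned), rates)
```

Per-group churn rates like these are the kind of summary that charts in an EDA would visualize.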

Kaggle

2 roles

Kaggle Expert

Aug 2022 – Present · 3 yrs 9 mos

  • I am proud to be a Kaggle Expert. My skills in programming, statistical analysis, and machine learning algorithms have been honed through consistent high performance in Kaggle's challenging competitions. As a Kaggle Expert, I am committed to contributing to the data science community by sharing my knowledge through tutorials, blogs, and discussions. This designation is a testament to my dedication and hard work in the field, and I am eager to continue my growth and impact as a leader in data science.
  • Skills: Machine Learning · Python (Programming Language) · Data Structures · Natural Language Processing (NLP) · Chatbots · PyTorch (+6 more)

Kaggle Contributor

Sep 2021 – Present · 4 yrs 8 mos

  • I participated in numerous competitions, using my skills in machine learning, data analysis, and visualization to deliver high-quality results. I also actively engaged with the Kaggle community by sharing my insights and techniques, helping others to learn and improve their skills.
  • Skills: Machine Learning · SQL · Python (Programming Language) · Data Structures · Algorithms · Deep Learning (+1 more)

GitHub

Open Source Developer

Mar 2021 – Present · 5 yrs 2 mos · Bengaluru, Karnataka, India

  • I have been actively involved in open-source contributions on GitHub, exploring and contributing to various machine-learning projects.
  • Through this experience, I have been able to gain a better understanding of the open-source development process, hone my coding and problem-solving skills, and make valuable connections with other developers.
  • I love the idea of open source and I am looking forward to continuing to contribute to the open-source community in the future.
  • Skills: Machine Learning · SQL · Python (Programming Language) · Natural Language Processing (NLP) · Deep Learning · Computer Vision (+1 more)

Education

Maulana Azad National Urdu University, Hyderabad

Bachelor of Technology - BTech — Computer Software Engineering

Jan 2018 – Jan 2022

Aligarh Muslim University

INTERMEDIATE

Jan 2016 – Jan 2018

Aligarh Muslim University

HIGH SCHOOL

Jan 2014 – Jan 2016

CHILDREN SENIOR SECONDARY SCHOOL AZAMGARH

SCHOOLING

Jan 2006 – Jan 2014
