Ashish Kumar

CEO

San Francisco, California, United States17 yrs 10 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • 10+ years in Data Science and Machine Learning.
  • Authored two books on predictive analytics and data science.
  • Led over 25 successful data science projects.
Stackforce AI infers this person is a Data Science and AI Solutions expert with extensive experience in multiple industries.

Contact

Skills

Core Skills

Data ScienceAi SolutionsText AnalyticsProduct ManagementAnalytics ManagementPredictive MaintenanceText ClassificationGeospatial Analytics

Other Skills

AI AgentsAWSAirflowAlgorithmsAnalysisAnalytical SkillsAnalyticsApacheArtificial Intelligence (AI)Big DataBusiness AnalysisBusiness DevelopmentCCloud ComputingCloud Development

About

~Data Science & Machine Learning professional with 10+ years of work-ex ~ Currently based in SF Bay Area ~ Github - https://github.com/ashishbt08b004/Experiments ~Author, Learning Predictive Analytics with Python. Author, Mastering Pandas. Both published by PACKT. ~ Rich hands-on and project management experience (25+ projects) in creating Machine Learning solutions. ~ Versatile (75+ leads) pre-sales, solutions architecture, PoC development and data science consulting experience. ~ Lucid Communication of complex data science concepts using both verbal and written medium. ~ Experienced in conduction webinars and training sessions. ~BTech, IIT Madras |Young India Fellow 2012-13, Ashoka University ~ Financial Modeling| Business pitch expert Programming Language/Tools : [Python (polars, asyncio, gRPC), Rust, R, SQL], [Apache Iceberg, Parquet, Delta Lake, DuckDB, SNS], [vLLMs, Paged Attention, Flash Attention, Langchain, CrewAI] [Databricks, PostgreSQL, Redshift, Elastic Search, Spark, MongoDB], [Azure, AWS, Docker, Kubernetes], [JIRA, Confluence] Classical ML Algorithms : Linear & Logistic Regression; Survival Analysis, DBSCAN, k-Means; kNN; Naïve Bayes; SVM; Shapelets; Latent Dirichlet Allocation, Linear Discriminant Analysis, Latent Semantic Analysis, Hypothesis Testing, ANOVA, RFM, BTYD, Apriori, Voronoi, Geospatial Analytics, Tf-IDF/Word2Vec, CNN, RNN Gen AI Skills - Transformers, Flash/Cross/Masked/Self/Paged Attention, RAGs, Pinecone, GPU optimization, Multimodal RAGs, CLIP, Diffusion, Variational Autoencoders, Finetuning, Transformers library, tiktoken, bitsandbytes Business Domains: Manufacturing, FinTech, EdTech, Transportation & Logistics, Healthcare, Pharma, Retail, E-Commerce, Big Data Security, Urban Mobility. Presentation: Communicating ML results & business insights by creating dashboards/analytics products/report in Streamlit, Gradio, Power BI, JuPyter Notebook, MS-Excel, PowerPoint etc. What excites me: ~ Working on challenging and intellectually stimulating business problems to solve through data science ~ Working with emotionally intelligent people who can think beyond money & themselves ~ Perpetual learning. Student for life. ~ Debate over data, public policies, philosophy, pop culture

Experience

17 yrs 10 mos
Total Experience
2 yrs 10 mos
Average Tenure
7 yrs 5 mos
Current Experience

Indium software

4 roles

Chief Data Scientist

Promoted

Apr 2023Present · 3 yrs 2 mos

  • Work with BFSI, Semiconductor and Aerospace clients to build data pipelines and AI solutions
  • Expertise in LLMs, Python (pandas, numpy, pytest, asyncio) , Databricks (Delta Lake, Unity Catalog etc for data lineages and ACID compliances), SQL, Airflow, Lakehouse and OTFs
  • Expertise in Explainable AI, LLM Evaluations and Responsible AI
  • Domain expertise in wealth management, mutual funds, ETFs and mathemtical optimizations
  • Convert business requirement to Data Science/Engineering problems; AI and Data Strategy
  • Team & Stakeholder management
PythonLLMsDatabricksSQLAirflowData Science+1

Principal Data Scientist

Promoted

Jan 2019Apr 2023 · 4 yrs 3 mos

  • Spearheading the R&D and Product Management of the cutting-edge Text Analytics product offering Text Extraction, Summarization and Classification for enterprises. Get in touch to know more.
  • Text Extraction - Tabular and Peripheral data from PDFs, Images, Word Docs and Websites
  • Text Summarization - Latent Topic Modeling, Keyphrase Detection, Custom Named Entity Recognition, Text Matching and Clustering
  • Text Classification - Classify documents in categories for better organisation.
Text AnalyticsPythonLatent Topic ModelingNamed Entity RecognitionProduct Management

Senior Manager, Analytics

Jan 2018Jan 2019 · 1 yr

  • Project Management for 5 running projects and pre-sales for qualified leads.
  • Grew a large manufacturing account 4X in terms of team and ticket size. ~100% project extension success rate.
  • Pre-sales; Requirement-Capability matching by talking to US/Europe clients; Contributing to Scope/SoW of projects; Deciding the team structure.
  • Architecting the analytics solutions and then implementing them with the teams.
  • Abnormal Chamber Detection & Predictive Maintenance - Led development of IoT based analytics product to detect abnormal chambers for a semiconductor wafer manufacturer in CA, USA. Algorithm & tools - Mahalanobis distance, Hoteling T2, R-Shiny
  • Product Categorisation of 50mn+ products - Led categorization of 45mn+ products using text classification ML algorithms improving search results for an e-commerce firm in SF,US. Deployed the solution as an API. Algorithms & tools - Naive Bayes, SVM, Hierarchical Classification, Python, AWS.
  • Dropout/Failure/Withdrawl prediction - Implemented Dropout/Failure prediction for students from a classroom course for a learning management company in US. Algorithm & tools - Xgboost, R.
  • Intelligence Layer to knowledge on web - Led the development of NLP and text analytics based solutions – topic map for documents, document/publisher similarity, topic classification, entity recognition etc for a knowledge solutions company in Massachussets, US. Algorithms & tools - Latent Dirichlet Allocation, Nmaed Entity Recognition, Neural Networks, Python, D3.js.
  • Market & Product Analytics for a Fintech company - Conducted product/market analytics for a Fintech firm in SF,USA and devised growth strategies for the firm. Algorithms & tools - RFM Analysis, k-means/DBSCAN clustering, A/B Testing, R, Python, Periscope
  • Customer personas for an Event Management Company - Clustering customers into groups with a marketable personas for better targeted marketing. ALgorithms & tools - k-means, DBSCAN, tSNE, Expectation Maxamization, R, R Shiny.
Project ManagementPredictive MaintenanceText ClassificationRAnalytics ManagementData Science

Program Manager, Analytics

Jan 2016Jan 2018 · 2 yrs

  • Overview: Part of the senior management team driving sales, digital marketing, strategy and delivery of the firm. Looked after 10+ analytics engagements, devised and implemented various algorithms to help clients. Led research & PoC for 10+ prospective projects. Played active role in 25+ pre-sales conversations. Groomed and managed the 30+ membered Data Science team. Wrote blogs.
  • Geospatial analytics for the largest taxi-ride provider company in South East Asia| Tools & Techniques: Hive, R Shiny, Leaflets, Haversine formula.
  • Automated Hive Query Generation for a Big Data security product company based in San Francisco Bay area| Tools & Techniques: Hive, Impala, SparkQL, R Studio, Apriori
  • Text based entity mapping and search for Manhattan-based BI provider| Tools & Techniques: Python NLTK, Levenshtein distance, Elastic Search
  • Historical Price Analytics for products for a price-comparison e-commerce website based in San Francisco|| Tools & Techniques: Elastic Search, Kibana, Timelion
  • Growth Analytics for a price-comparison e-commerce website based in San Francisco|| Tools & Techniques: Google Analytics, Google Tag Manager, Mixpanel
  • Electricity demand forecasting based on weather data using Generalised Additive Models for a clean energy client based in NY| Tools & Techniques: R Studio, Time Series Analysis
  • Detection of defective pieces using power consumption time-series data|Tools & Techniques: R Studio, kNN, Azure ML Studio, Python
  • Data Science Training for employees of national statistical agency in UK|Tools & Techniques: Python, NLTK
  • Blogs:
  • 1) http://www.noahdatatech.com/leveraging-your-gps-data-using-geospatial-analytics/
  • 2) http://www.noahdatatech.com/iot-analytics
AnalyticsGeospatial AnalyticsText AnalyticsRAnalytics ManagementData Science

Great learning

Course Mentor

Jan 2020Apr 2023 · 3 yrs 3 mos

  • Mentoring students on their Data Science course.

Tex.ai

Principal Data Scientist

Jan 2019May 2023 · 4 yrs 4 mos

  • Leading R & D and Product Development of teX.ai.
  • teX.ai has been recognised as 'Top 25 ML startups to watch out for in 2020 by Forbes.'

Packt

Author, Mastering Pandas

Jan 2019Jan 2019 · 0 mo

Simplilearn

Trainer & SME

Jan 2019Jan 2019 · 0 mo

  • Trained a batch of 40 students in a virtual classroom on Advanced Machine Learning concepts using Python as the implementation tool.
  • Trainer NPS in top 5% percentile. Got offered to join the premium pool of trainers at Simplilearn.

Zeef

Curator, Data Science Page

Oct 2015Dec 2015 · 2 mos · Chennai Area, India

  • Curated a peer-reviewed page for data science containing list of online material needed to self-train oneself as a Data Scientist.
  • The page has 100+ links, 2.5K+ views, 600+ clicks and has gained significant traction in the data science community.

Packt

Author, Learning Predictive Analytics with Python

Jun 2015Mar 2016 · 9 mos · Chennai Area, India

  • Wrote a 350-pager/9 chapter book demonstrating the concepts of predictive analytics with Python emphasizing on Data Cleaning, Wrangling, Modeling, Validation, and Visualization. Used publicly available datasets to develop original content
  • Liaised with a team of editors, reviewers, experts, designers etc. Got an offer to write the next title from the publisher.
  • Rated must-read for data science enthusiasts by Analytics India Magazine . Garnered positive reviews on Amazon 50K USD in revenue. 300+ copies sold in the first 3 months. Selected for display and early sale at the PACKT publication website during the Python week.
  • Got an offer to write the next title from the publisher. Selected to become part of a PACKT video course.

Tiger analytics

Senior Analyst

May 2014Jan 2016 · 1 yr 8 mos · Chennai Area, India

  • Project 1: Wheel failure forecasting for a leading railroad car pooling client
  • Developed a SAS model to forecast wheel failures using Survival Analysis on historical failure data of 30 years from a leading railroad car pooling client. The model could predict the number of failures with an accuracy of around 1.5-2%.
  • Developed a tool with VBA-based MS Excel interface to visualise and summarize the results of the aforementioned model. This tool is used by the client to plan quarterly spend on maintenance & repair.
  • Tools/Techniques used: SAS, VBA, R; Survival Analysis, Monte Carlo Simulations, Seasonality
  • Project 2: Merchant, Customer and Sales analytics for an Online Payments client
  • Integrated telesales data of a newly acquired firm with the Salesforce data of the client in Teradata using Python
  • Analysed transactions data from the client’s customers, built Tableau dashboards and a Python simulation to notify the customers of their operational and financial status
  • Tools/ Techniques used: Teradata/SQL, Python, Tableau
  • Project 3: Impact assessment of a packaging change on sales for a Pharmaceutical manufacturer
  • Assessed the impact of a new packaging method on drug sales for 100+ SKUs
  • Calculated the lift in drug sales and contribution of drug sales to the total sales attributed to the new packaging
  • Project 4: Marketing-mix modeling and RoI calculations for a retail Medicare product manufacturer
  • Assessed the contribution of 30+ marketing stimuli on sales. Calculated the RoIs of the various marketing stimuli
  • Project 5: Content development for training module on Data Science with R
  • Created and validated the assignments by solving them for a couple of chapters in the training module
  • Project 6: Time Series model automation for for a leading US based railroad car pooling client
  • Implemented ARIMA model for cost forecasting for a railroad car company
  • Wrote an algorithm to automate the ARIMA implementation to find out the optimum ARIMA parameters

Sughavazhvu healthcare

Business Analyst

Jun 2013May 2014 · 11 mos · Thanjavur Area, India

  • Designed the Excel-based interactive MIS for SV. This MIS became indispensable tool for performance review and management across the network. Analysed the MIS data on Excel on a monthly basis for performance management and business insights.
  • Created detailed financial model and funding proposal for a Mobile clinic and semi-urban clinic. The model garnered a substantial funding for SV.
  • Led the efforts to design (interior and exterior), drafting the operational plan, negotiating with vendors and on-the ground execution of the Mobile clinic on the ground.
  • Designed interactive Open Data Kit XML-based survey forms for data collection of Cardio Vascular Diseases Risk factors. Led the implementation of the activity. Analysed monthly data to target at-risk patients and manage agent performances.
  • Designed the Excel-based interactive MIS for SV. Standardised the SQL queries and data-parsing algorithms needed to update the MIS. Analysed the MIS data on Excel on a monthly basis for performance management and business insights.
  • Got selected to talk about SV’s disruptive healthcare model at a social entrepreneur summit in Nairobi,Kenya organized by Ashoka foundation

Ashoka university

Young India Fellow

May 2012May 2013 · 1 yr · New Delhi Area, India

  • Young India Fellowship is a one year, multi-disciplinary, (Liberal Arts+Leadership) postgraduate flagship programme, run by Ashoka University.
  • Got a full scholarship of Rs.5.5 lakhs to pursue the program
  • Was one among the 97 Fellows selected out of 3000 applicants in this class, on full scholarship.I studied Group Dynamics, Leadership, Business, Statistics, Arts Appreciation, Sociology, Philosophy etc.
  • Worked with British High Commission for mapping India's FDI outflow footprint.
  • Wrote papers on similarities between ideals of Gandhi and Islam, weddings in Bihar, reimagination of Shakespeare's Tempest in the context of Naxalism in India
  • Interviewed Madhubani painter Bharti Dayal and presented a report on Madhubani painting
  • Made a movie on an NGO (Chintan) working with ragpickers in Delhi.

Energy alternatives india/oilgae

Research Analyst

Dec 2010Jan 2011 · 1 mo · Chennai

  • Made a report on feasibility of a certain strain of algae to produce biodiesel. The report became a part of the final submission made to NTPC for alternative energy sources.

National centre for biological sciences

Research Associate

May 2010Jul 2010 · 2 mos · Bangalore

  • A project in developmental genetics.
  • Created a genetic screen to identify responsible Transcription Factors (TF) for sensillae development. Identified 4 such TFs out of 20.
  • Designed the project flow of the experiment, scheduled the crosses of the insects and prepared the antennae glass-slides.
  • Found 2 TFs playing important roles in antennae development.
  • Worked under the guidance of famous scientist Dr.Veronica Rodrigues.

Indian institute of technology of madras

Project Associate

Jul 2008May 2012 · 3 yrs 10 mos · Chennai

  • Worked on a industry sponsored project by Caterpillar with Mathematics department of IITM.
  • Worked on a Game Theory project with a Department of Management Studies professor.
  • Worked on a project on Mutual Fund performance analysis in Indian markets with a Department of Management Studies professor.

Education

Indian Institute of Technology, Madras

B.Tech

Jan 2008Jan 2012

Ashoka University

Young India Fellowship — Post Graduate Certificate in Liberal Arts and Studies

Jan 2012Jan 2013

Edvancer Eduventures Pvt. Ltd.

Certified Business Analytics Professional — Business Statistics

Jan 2013Jan 2014

Kendriya Vidyalaya

High School — Science

Stackforce found 100+ more professionals with Data Science & Ai Solutions

Explore similar profiles based on matching skills and experience