Nirmal Budhathoki

AI Researcher

Seattle, Washington, United States16 yrs 3 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Expert in building machine learning models for cybersecurity.
  • Experienced mentor in data science and machine learning.
  • Proven track record in reducing false positives in threat analysis.
Stackforce AI infers this person is a Cybersecurity and Data Science expert with a strong focus on machine learning applications.

Contact

Skills

Core Skills

Machine LearningData ScienceCoaching & MentoringData AnalysisNetwork SecurityCybersecurityMilitary OperationsLogistics

Other Skills

AWS SageMakerAdvisory BoardsApache AirflowApache SparkBig DataBloggingCommunicationConference SpeakingCreativity and InnovationCustomer ServiceData ModelingData VisualizationDecision SciencesEmotional IntelligenceExploratory Data Analysis

About

Nirmal is a Data Scientist who yearns for solving problems that actually impact product strategy and business outcomes. As a self-described realist, he believes that with pragmatism and high-quality data, Data Science can be a very powerful tool. If you are preparing for ML interview, I have something to share: https://onlyoneoutlier.gumroad.com/l/decodingML Nirmal has experience working for both government and private industries. He had worked as Security Data Analyst for Department of Navy, and had also served in US Army. He loves working in the rare intersection between security and data science, where there are ample challenges to tackle. If you have stalked me this far, you have to hit that follow button if you have not done yet :) Jokes a side, I love to share frequent posts on data science, machine learning, mentoring and career coaching, so following me won’t hurt. Want to have some data science career chat regarding interviews, resume reviews or anything? You're welcome to reserve a time at: https://topmate.io/nirmal_budhathoki I also write data science blogs in substack, please subscribe at: https://onlyoneoutlier.substack.com/

Experience

16 yrs 3 mos
Total Experience
2 yrs 9 mos
Average Tenure
4 yrs 2 mos
Current Experience

Microsoft

Senior Data & Applied Scientist

Apr 2022Present · 4 yrs 2 mos

  • Currently working as Data Scientist in Microsoft Security.
  • Building anomaly detection model using unsupervised techniques on user or application service behavior and activities based on identities used for login and access of resources.
  • Formulating problem statement, gathering project requirements, and defining success criteria, in collaboration with product managers and other stakeholders. Data-> Modeling journey is incomplete without business understanding.
  • Building data pipeline and ETL jobs to bring multiple data sources into integration, and create aggregated views as a part of feature engineering.
SQLMachine LearningGenerative AIPythonApache SparkCommunication+1

Great learning

Data Science Mentor

Feb 2022Present · 4 yrs 4 mos

  • Conducting the mentored learning sessions for the cohorts of Data Science and Machine Learning certification program from MiT Institute for Data, Systems, and Society (IDSS) in collaboration with great learning.
  • Link to the program: https://idss.mit.edu/about-us/
  • I also started to mentor the post graduate certification program on AI and ML for business applications with UT Austin.
  • Link: https://www.mygreatlearning.com/pg-program-online-artificial-intelligence-machine-learning
Data ScienceCoaching & MentoringTeachingCommunicationMachine Learning Algorithms

Vmware carbon black

Senior Data Scientist

May 2021Apr 2022 · 11 mos · United States

  • Data Scientist at Security Business Unit of VMware Carbon Black.
  • Built prevalence-based scoring model that effectively suppresses the noise (false positives alerts) by x% , saving the time and resources for threat analysts and hunters
  • Worked on building an unsupervised machine learning model to cluster the command lines, identifying anomalous command lines that can result in malicious behavior (algorithms explored in this project are- Doc2Vec, UMAP, K-Means, and HDBSCAN)
  • Created data transformation pipeline in Apache Airflow to orchestrate and schedule the jobs for batch preprocessing, which leads to effective feature engineering for training ML models
  • Designed architecture and threat models, in collaboration with engineering team to productize ML model
  • Worked with ML engineers to understand, establish and follow CI/CD pipeline for MLOps
PySparkPython (Programming Language)Statistical ResearchApache AirflowCommunicationAWS SageMaker+2

Microsoft

Data & Applied Scientist

Dec 2019May 2021 · 1 yr 5 mos

  • Worked as a Data Scientist for Azure Core Security Service team under Microsoft Cloud + AI.
  • Worked on detection models to find patterns, detect advanced threats, bad actor techniques, anomalous or suspicious activity, and active risks to Azure network and systems
  • Conducted end-to-end data analytics and reporting on UDP based DDoS attacks using unsupervised learning methods like clustering and pattern analysis to detect and fix the vulnerabilities like OS baseline configuration, open ports/protocols in network security groups
  • Performed exploratory data analysis on internet exposed endpoints, curate the raw data from different sources for machine attribution, and generate reports providing actionable insights
  • Created a baseline model of Incidents data to help calculate and optimize mean TTA/ TTM (Time To Acknowledge, and Time To Mitigate) metrics for security incidents, meeting the SLAs on those metrics, and closing gaps based on root causes like ticket routing/transfer issues
  • Expanded the knowledge of data science within the team by mentoring data analysts to help them learn/grow in the data science domain, and collaborated with data engineers/developers to create, manage and monitor the data pipeline for the ETL workflow
Data AnalysisKusto Query Language (KQL)Network SecurityPredictive Modeling

Wells fargo

Data Scientist

Jan 2019Jan 2020 · 1 yr

  • Worked on building machine learning model for Credit Card debt collections to predict whether customers will contact and pay once their account is delinquent. This model scores the conditional probability based on historical contacts data of each customer across different communication channels like phone call, email, mobile or desktop app, branch visits, and text data. Completed data ingestion phase- working along with data SMEs from various departments to collect and ingest the training data, sampling the data and defining the base population. Worked with data engineers in the team to build data flow diagram, entity relation diagram; and use those artifacts to streamline the data pipeline and complete data transformation/validation.
  • Tools & Technologies: Python, Git, SQL, Jupyter notebook, Pyspark, H20 ML, SkLearn, SAS
Exploratory Data AnalysisMachine LearningFeature EngineeringDecision SciencesQuantitative Analysis (Finance)PostgreSQL+1

Navwar

Security Data Science

Sep 2014Mar 2019 · 4 yrs 6 mos

  • Collecting software scan data from network and process it to identify and remove vulnerable & non-compliant applications using data analytics and modeling.
  • Network traffic data analysis (approx. 10K endpoints streaming gigabytes of data everyday in Research Development & Test Environment): cleaning/sampling the data, creating predictive models to categorize, block and remediate the targeted attacks and threat vectors.
  • Work with security operation teams like forensics and incident management to collect required data, and perform analysis to aid their investigations.
  • Understand/ follow policies regarding information/cyber security like NIST, FIPS, RMF, FedRamp and others.
  • Tools & Technologies: Python, PostgreSQL, JIRA collaboration tool, Splunk log analytics, Vulnerability Management System (VMS), Enterprise Mission Assurance Support Service (eMASS), Host Based Security System (HBSS), Assured Compliance Assessment System (ACAS)
Data ModelingThreat AnalysisBig DataCybersecurityRisk AssessmentData Science

Us army

US Army Veteran

Feb 2010Aug 2014 · 4 yrs 6 mos · Fort Irwin, California

  • United States Army Service member.
Emotional IntelligenceMilitary LogisticsMilitary OperationsPredictive MaintenanceReport WritingSupply Chain Optimization+1

Education

UC San Diego

Master of Science — Data Science and Engineering

Jan 2016Jan 2018

Brandman University

Master of Business Administration (M.B.A.)

Jan 2012Jan 2014

Marist University

Master of Science — Information Systems

Jan 2007Jan 2009

Nepal College of Information Technology

Bachelors of Engineering — Computer Science

Jan 2002Jan 2006

Stackforce found 100+ more professionals with Machine Learning & Data Science

Explore similar profiles based on matching skills and experience