Aman Singh Thakur

AI Researcher

Seattle, Washington, United States · 4 yrs 9 mos experience
Most Likely To Switch

Key Highlights

  • Expert in building scalable ML systems.
  • Proven success in fraud detection model improvements.
  • Strong background in LLM evaluation and benchmarking.
Stackforce AI infers this person is a Machine Learning Engineer with expertise in AI Research and Financial Technology.

Skills

Core Skills

Machine Learning, MLOps, Large Language Models (LLM), Data Engineering, Data Science

Other Skills

PyTorch, LangChain, Agents SDK, Model fine-tuning, Fraud detection, Synthetic dataset generation, Data analysis, Python (Programming Language), SQL, MongoDB, Data Visualization, CNN, DevOps, Python, Docker

About

I'm an MLE 2 at Amazon. At AWS SageMaker AI, I was building agentic workflows for model customization, spanning model fine-tuning, evaluation, and deployment using tools like PyTorch, LangChain, and the Agents SDK. I specialize in building scalable ML systems, with experience across model training and serving infrastructure, ETL and data pipelines, real-time anomaly detection, and multi-agent orchestration using MCP and A2A protocols. I also work on fraud detection systems and ML infrastructure at scale.

Previously, I worked as a Graduate Researcher at Meta, evaluating LLM alignment and judge reliability across models like Llama 2, Mistral, and GPT-4 for tasks including QA and ranking. I've also completed research internships at SLAC National Accelerator Laboratory, Stanford and Indian Institute of Technology (IIT), Kharagpur, where I developed ML-driven systems for online modeling, predictive analytics, and scientific simulation.

Beyond research, I bring industry experience from Goldman Sachs and Morgan Stanley, where I built credit risk models, real-time anomaly detection systems, NLP pipelines, and large-scale data infrastructure powering some of the world's largest financial platforms.

I hold an MS in Computer Science from UMass Amherst, with a focus on Distributed Systems, Deep Learning, and LLMs. I'm an active contributor to open source, with Google Summer of Code projects at CERN-HSF and Lund University, building ML tools with PyTorch, Spark, and autoencoders for scientific data compression and online physics modeling.

Experience

Total Experience: 4 yrs 9 mos
Average Tenure: 1 yr 7 mos
Current Experience: 1 yr 11 mos

Amazon

MLE 2

Jul 2024 - Present · 1 yr 11 mos · Seattle, Washington, United States · Hybrid

  • Built SageMaker model customization agent to allow for agentic model fine-tuning, synthetic dataset generation & model evaluation.
  • Revamped fraud detection model for AWS Mechanical Turk, improving accuracy from 82% to 97%.
PyTorch, LangChain, Agents SDK, Model fine-tuning, Fraud detection, Machine Learning, +1 more

Meta

Graduate Researcher

Feb 2024 - Jul 2024 · 5 mos · Remote

  • Evaluated LLM-as-Judge alignment across Llama 2, Mistral, and GPT-4; published at ACM GEM².
  • Benchmarked judge reliability across 1000+ A/B tests using Kappa, Scott's Pi, Pearson correlation, etc.
Python (Programming Language), Machine Learning, Large Language Models (LLM), SQL, Data Science, MongoDB

Google Summer of Code

ML Intern

Jun 2023 - Oct 2023 · 4 mos · Remote

  • Built 2D/3D Autoencoder CNNs into Baler with 95%+ accuracy and 200%+ compression ratio for text/image/video files.
Data Engineering, Data Visualization, CNN, Data Science, Python (Programming Language), PyTorch, +1 more

SLAC National Accelerator Laboratory

ML Intern

Jun 2023 - Sep 2023 · 3 mos · Stanford, California, United States · On-site

  • Migrated Keras models to PyTorch for online ML modeling, achieving a 10x speedup over physics simulations.
  • Streamlined CI/CD with Docker/Kubernetes, cutting deployment time by 50%.
DevOps, MLOps, Machine Learning, Python, Docker, MongoDB, +3 more

Goldman Sachs

2 roles

Associate

Promoted

Dec 2021 - Jul 2022 · 7 mos

  • Built liquidity risk and anomaly detection models monitoring $500Bn+ exposure.
  • Designed data pipelines validating 100+ contracts across 20Bn+ daily data points.

Analyst

Oct 2020 - Nov 2021 · 1 yr 1 mo

  • Updated quantitative models during Brexit transition, enabling $10Bn+ in impacted client accounts to continue brokerage and deposit operations.
Data Engineering, Agile Project Management, Java, Data Analysis, Data Science, C#, +3 more

Morgan Stanley

Analyst

Jul 2019 - Sep 2020 · 1 yr 2 mos · Mumbai Metropolitan Region

  • Built NLP pipeline extracting trade signals from emails/CSV/XML, onboarding 50+ hedge funds/enterprises.
  • Reduced trade processing time by 40% via an Angular onboarding UI and Spring backend.
Data Engineering, Java, Spring Boot, JavaScript, Data Science, Angular, +2 more

Education

University of Massachusetts Amherst

Master of Science - MS — Computer Science

Sep 2022 - May 2024

Manipal Institute of Technology

Bachelor of Technology - BTech — Computer Science and Engineering

Jan 2015 - Jan 2019
