Aman Singh Thakur

AI Researcher

Seattle, Washington, United States4 yrs 9 mos experience

Most Likely To Switch

Key Highlights

Expert in building scalable ML systems.
Proven success in fraud detection model improvements.
Strong background in LLM evaluation and benchmarking.

Stackforce AI infers this person is a Machine Learning Engineer with expertise in AI Research and Financial Technology.

Contact

Skills

Core Skills

Machine LearningMlopsLarge Language Models (llm)Data EngineeringData Science

Other Skills

PyTorchLangChainAgents SDKModel fine-tuningFraud detectionSynthetic dataset generationData analysisPython (Programming Language)SQLMongoDBData VisualizationCNNDevOpsPythonDocker

About

I'm a MLE 2 at Amazon. At AWS Sagemaker AI, I was building agentic workflows for model customization — spanning model fine-tuning, evaluation, and deployment using tools like PyTorch, LangChain, and the Agents SDK. I specialize in building scalable ML systems, with experience across model training and serving infrastructure, ETL and data pipelines, real-time anomaly detection, and multi-agent orchestration using MCP and A2A protocols. I also work on fraud detection systems and ML infrastructure at scale. Previously, I worked as a Graduate Researcher at Meta, evaluating LLM alignment and judge reliability across models like Llama 2, Mistral, and GPT-4 for tasks including QA and ranking. I've also completed research internships at SLAC National Accelerator Laboratory, Stanford and Indian Institute of Technology (IIT), Kharagpur, where I developed ML-driven systems for online modeling, predictive analytics, and scientific simulation. Beyond research, I bring industry experience from Goldman Sachs and Morgan Stanley, where I built credit risk models, real-time anomaly detection systems, NLP pipelines, and large-scale data infrastructure powering some of the world's largest financial platforms. I hold an MS in Computer Science from UMass Amherst, with a focus on Distributed Systems, Deep Learning, and LLMs. I'm an active contributor to open-source, with Google Summer of Code projects at CERN-HSF and Lund University — building ML tools with PyTorch, Spark, and Autoencoders for scientific data compression and online physics modeling.

Experience

4 yrs 9 mos

Total Experience

1 yr 7 mos

Average Tenure

1 yr 11 mos

Current Experience

Amazon

MLE 2

Jul 2024 – Present · 1 yr 11 mos · Seattle, Washington, United States · Hybrid

Built SageMaker model customization agent to allow for agentic model fine-tuning, synthetic dataset generation & model evaluation.
Revamped fraud detection model for AWS Mechanical Turk, improving accuracy from 82% to 97%

PyTorchLangChainAgents SDKModel fine-tuningFraud detectionMachine Learning+1

Google summer of code

ML Intern

Jun 2023 – Oct 2023 · 4 mos · Remote

• Built 2D/3D Autoencoder CNNs into Baler with 95%+ accuracy and 200%+ compression ratio for text/image/video files.

Data EngineeringData VisualizationCNNData SciencePython (Programming Language)PyTorch+1

Slac national accelerator laboratory

ML Intern

Jun 2023 – Sep 2023 · 3 mos · Stanford, California, United States · On-site

• Migrated Keras to PyTorch for online ML modeling, 10x speedup over physics simulations. Streamlined CI/CD with Docker/Kubernetes, cutting deployment time 50%.

DevOpsMLOpsMachine LearningPythonDockerMongoDB+3

Goldman sachs

2 roles

Associate

Promoted

Dec 2021 – Jul 2022 · 7 mos

Built liquidity risk and anomaly detection models monitoring $500Bn+ exposure.
Designed data pipelines validating 100+ contracts across 20Bn+ daily data points.

Analyst

Oct 2020 – Nov 2021 · 1 yr 1 mo

• Updated quantitative models during Brexit transition, enabling $10Bn+ in impacted client accounts to continue brokerage and deposit operations.

Data EngineeringAgile Project ManagementJavaData AnalysisData ScienceC#+3

Morgan stanley

Analyst

Jul 2019 – Sep 2020 · 1 yr 2 mos · Mumbai Metropolitan Region

• Built NLP pipeline extracting trade signals from emails/CSV/XML, onboarding 50+ hedge funds/enterprises. Reduced trade processing time 40% via Angular onboarding UI + Spring backend

Data EngineeringJavaSpring BootJavaScriptData ScienceAngular+2