Pradeep . — AI Researcher

I am an AI/ML Software Engineer with 5+ years of hands-on experience building and operating production-grade machine learning systems across retail, financial services, and data platforms. My work focuses on applied machine learning, distributed systems, and MLOps, with end-to-end ownership from data ingestion and feature engineering to production deployment, monitoring, and continuous optimization. I have built scalable AI systems using Python, SQL, PySpark, NumPy, Pandas, PyTorch, TensorFlow, Hugging Face Transformers, and scikit-learn, working across computer vision, NLP, semantic search, and Retrieval-Augmented Generation (RAG). I approach ML as a systems engineering problem, balancing model accuracy with latency, throughput, reliability, and cost in real-world environments. Currently at Sam’s Club, I work on real-time computer vision platforms using NVIDIA DeepStream and OpenCV, processing 8–12M video frames per day with sub-120 ms end-to-end latency. I designed event-driven pipelines using RabbitMQ and Kafka, sustaining 15K+ events/sec with 99.95% delivery reliability. I also built embedding-based retrieval services using Hugging Face models and Milvus, enabling low-latency similarity search across 50M+ vectors. Performance optimizations in Redis-based caching and feature lookups reduced retrieval latency by 35–50%. Previously at Wells Fargo, I built a production-grade RAG system using GPT-4, Claude, LLaMA, LangChain, Hugging Face embeddings, and Elasticsearch, reducing fraud investigation time by 45–60%. I developed NLP pipelines with TensorFlow and spaCy processing 1.0–1.4M documents/day at 90–93% accuracy, and implemented semantic search using PySpark and FAISS with sub-280 ms query latency across 18M+ records. Streaming pipelines using Kafka and Spark handled 250–400 GB/day of unstructured data. Across roles, I have deployed ML services using Docker, Kubernetes, Triton Inference Server, MLflow, AWS, and Azure, with monitoring via Prometheus, Grafana, and Tableau. I collaborate closely with product, platform, and data teams, enjoy translating ambiguous problems into scalable systems, and care deeply about building AI solutions that are robust, maintainable, and trusted in production.

Stackforce AI infers this person is a Machine Learning Engineer with expertise in Retail and Fintech sectors.

Experience: 2 yrs 10 mos

Skills

Machine Learning
Mlops
Natural Language Processing (nlp)

Career Highlights

5+ years of experience in AI/ML systems
Expertise in real-time computer vision and NLP
Proven track record in MLOps and system optimization

Work Experience

Sam's Club

AI/ML software engineer (1 yr 3 mos)

Wells Fargo

Machine Learning Engineer (6 mos)

George Mason University Career Services

Data Engineer (7 mos)

Kantar

Software Engineer (MLOps) (2 yrs 3 mos)

Education

Master's degree at George Mason University

Bachelor of Engineering - BE at Shanmugha Arts, Science, Technology and Research Academy

Pradeep .

AI Researcher

United States2 yrs 10 mos experience

AI EnabledAI ML Practitioner

Key Highlights

5+ years of experience in AI/ML systems
Expertise in real-time computer vision and NLP
Proven track record in MLOps and system optimization

Stackforce AI infers this person is a Machine Learning Engineer with expertise in Retail and Fintech sectors.

Contact

Skills

Core Skills

Machine LearningMlopsNatural Language Processing (nlp)

Other Skills

PythonSQLPySparkNumPyPandasPyTorchTensorFlowHugging Face Transformersscikit-learnNVIDIA DeepStreamOpenCVRabbitMQKafkaRedisMilvus

About

I am an AI/ML Software Engineer with 5+ years of hands-on experience building and operating production-grade machine learning systems across retail, financial services, and data platforms. My work focuses on applied machine learning, distributed systems, and MLOps, with end-to-end ownership from data ingestion and feature engineering to production deployment, monitoring, and continuous optimization. I have built scalable AI systems using Python, SQL, PySpark, NumPy, Pandas, PyTorch, TensorFlow, Hugging Face Transformers, and scikit-learn, working across computer vision, NLP, semantic search, and Retrieval-Augmented Generation (RAG). I approach ML as a systems engineering problem, balancing model accuracy with latency, throughput, reliability, and cost in real-world environments. Currently at Sam’s Club, I work on real-time computer vision platforms using NVIDIA DeepStream and OpenCV, processing 8–12M video frames per day with sub-120 ms end-to-end latency. I designed event-driven pipelines using RabbitMQ and Kafka, sustaining 15K+ events/sec with 99.95% delivery reliability. I also built embedding-based retrieval services using Hugging Face models and Milvus, enabling low-latency similarity search across 50M+ vectors. Performance optimizations in Redis-based caching and feature lookups reduced retrieval latency by 35–50%. Previously at Wells Fargo, I built a production-grade RAG system using GPT-4, Claude, LLaMA, LangChain, Hugging Face embeddings, and Elasticsearch, reducing fraud investigation time by 45–60%. I developed NLP pipelines with TensorFlow and spaCy processing 1.0–1.4M documents/day at 90–93% accuracy, and implemented semantic search using PySpark and FAISS with sub-280 ms query latency across 18M+ records. Streaming pipelines using Kafka and Spark handled 250–400 GB/day of unstructured data. Across roles, I have deployed ML services using Docker, Kubernetes, Triton Inference Server, MLflow, AWS, and Azure, with monitoring via Prometheus, Grafana, and Tableau. I collaborate closely with product, platform, and data teams, enjoy translating ambiguous problems into scalable systems, and care deeply about building AI solutions that are robust, maintainable, and trusted in production.

Experience

2 yrs 10 mos

Total Experience

1 yr 5 mos

Average Tenure

--

Current Experience

Sam's club

AI/ML software engineer

Feb 2025 – Present · 1 yr 3 mos · Remote

PythonSQLPySparkNumPyPandasPyTorch+10

Wells fargo

Machine Learning Engineer

Aug 2024 – Feb 2025 · 6 mos · Remote

GPT-4ClaudeLLaMALangChainHugging Face embeddingsElasticsearch+8

George mason university career services

Data Engineer

Oct 2023 – May 2024 · 7 mos · Fairfax, Virginia, United States · On-site

Kantar

Software Engineer (MLOps)

May 2020 – Aug 2022 · 2 yrs 3 mos · Bangalore Urban, Karnataka, India · On-site

Education

George Mason University

Master's degree

Aug 2022 – May 2024

Shanmugha Arts, Science, Technology and Research Academy

Bachelor of Engineering - BE — Electronics & Instrumentation Engineering

Jun 2016 – May 2020

Stackforce found 100+ more professionals with Machine Learning & Mlops

Explore similar profiles based on matching skills and experience

Ravindra Rao

Senior Manager

at Cognizant

Bengaluru, India14 yrs 10 mos exp

Data ScienceMachine Learning

Sagnik Majumder

Senior Software Engineer

at AMD

Bengaluru, India4 yrs 8 mos exp

Aparna Mehrotra

Product Technical Lead

at Infosys

Bengaluru, India0 mo exp

Shriya Shenoy

Project Assistant

at Sigma Consultants

Bengaluru, India1 yr 8 mos exp

Large Language Models (LLM)

Varshith Bathini

Senior AI Engineer

at Livepeer Studio

Hyderabad, India5 yrs 9 mos exp

Elastic Stack (ELK)KubernetesLanguage ModelsLarge Language Models (LLM)Spring Boot

Akshay Singh

Data Scientist

at Intel Corporation

Bengaluru, India3 yrs 11 mos exp

Machine LearningData Engineering

Dr. Habib Shaikh, PhD (AI)

Vice President

at Northern Trust

Pune, India8 yrs 1 mo exp

AI/MLJava Technologies

Shrivatsa Kanchi

Software Engineer

at Bosch Global Software Technologies

Bengaluru, India4 yrs 11 mos exp

Python (Programming Language)