Pradeep .

AI Researcher

United States2 yrs 10 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • 5+ years of experience in AI/ML systems
  • Expertise in real-time computer vision and NLP
  • Proven track record in MLOps and system optimization
Stackforce AI infers this person is a Machine Learning Engineer with expertise in Retail and Fintech sectors.

Contact

Skills

Core Skills

Machine LearningMlopsNatural Language Processing (nlp)

Other Skills

PythonSQLPySparkNumPyPandasPyTorchTensorFlowHugging Face Transformersscikit-learnNVIDIA DeepStreamOpenCVRabbitMQKafkaRedisMilvus

About

I am an AI/ML Software Engineer with 5+ years of hands-on experience building and operating production-grade machine learning systems across retail, financial services, and data platforms. My work focuses on applied machine learning, distributed systems, and MLOps, with end-to-end ownership from data ingestion and feature engineering to production deployment, monitoring, and continuous optimization. I have built scalable AI systems using Python, SQL, PySpark, NumPy, Pandas, PyTorch, TensorFlow, Hugging Face Transformers, and scikit-learn, working across computer vision, NLP, semantic search, and Retrieval-Augmented Generation (RAG). I approach ML as a systems engineering problem, balancing model accuracy with latency, throughput, reliability, and cost in real-world environments. Currently at Sam’s Club, I work on real-time computer vision platforms using NVIDIA DeepStream and OpenCV, processing 8–12M video frames per day with sub-120 ms end-to-end latency. I designed event-driven pipelines using RabbitMQ and Kafka, sustaining 15K+ events/sec with 99.95% delivery reliability. I also built embedding-based retrieval services using Hugging Face models and Milvus, enabling low-latency similarity search across 50M+ vectors. Performance optimizations in Redis-based caching and feature lookups reduced retrieval latency by 35–50%. Previously at Wells Fargo, I built a production-grade RAG system using GPT-4, Claude, LLaMA, LangChain, Hugging Face embeddings, and Elasticsearch, reducing fraud investigation time by 45–60%. I developed NLP pipelines with TensorFlow and spaCy processing 1.0–1.4M documents/day at 90–93% accuracy, and implemented semantic search using PySpark and FAISS with sub-280 ms query latency across 18M+ records. Streaming pipelines using Kafka and Spark handled 250–400 GB/day of unstructured data. Across roles, I have deployed ML services using Docker, Kubernetes, Triton Inference Server, MLflow, AWS, and Azure, with monitoring via Prometheus, Grafana, and Tableau. I collaborate closely with product, platform, and data teams, enjoy translating ambiguous problems into scalable systems, and care deeply about building AI solutions that are robust, maintainable, and trusted in production.

Experience

2 yrs 10 mos
Total Experience
1 yr 5 mos
Average Tenure
--
Current Experience

Sam's club

AI/ML software engineer

Feb 2025Present · 1 yr 3 mos · Remote

PythonSQLPySparkNumPyPandasPyTorch+10

Wells fargo

Machine Learning Engineer

Aug 2024Feb 2025 · 6 mos · Remote

GPT-4ClaudeLLaMALangChainHugging Face embeddingsElasticsearch+8

George mason university career services

Data Engineer

Oct 2023May 2024 · 7 mos · Fairfax, Virginia, United States · On-site

Kantar

Software Engineer (MLOps)

May 2020Aug 2022 · 2 yrs 3 mos · Bangalore Urban, Karnataka, India · On-site

Education

George Mason University

Master's degree

Aug 2022May 2024

Shanmugha Arts, Science, Technology and Research Academy

Bachelor of Engineering - BE — Electronics & Instrumentation Engineering

Jun 2016May 2020

Stackforce found 100+ more professionals with Machine Learning & Mlops

Explore similar profiles based on matching skills and experience