Pritam Pandit

AI Researcher

Manchester, Connecticut, United States9 yrs experience
AI EnabledAI ML Practitioner

Key Highlights

  • Designed end-to-end AI systems for retrieval and ranking.
  • Led multi-agent systems improving AI visibility by 20%.
  • Achieved $100K savings through data quality improvements.
Stackforce AI infers this person is a SaaS-focused AI Engineer with expertise in data-driven solutions and predictive analytics.

Contact

Skills

Core Skills

Ai EngineeringMlopsAi Content GenerationMachine LearningData EngineeringData SciencePredictive Analytics

Other Skills

Apache AirflowSemantic SearchOpenAI APIHugging Face ProductsVector DatabasesLlamaIndexQdrantpayload indexingmulti-vector rankingDatabricks ProductsLLMsreal-time trackingA/B TestingVersion ControlPostgreSQL

About

I design end-to-end AI systems—retrieval, ranking, and LLM pipelines—that are observable, explainable, and cost-aware. Recent: productionized hybrid retrieval with multi-vector ranking, RAG, and LLM explainability; led a multi-agent POC; drove $100K NLP savings. Background includes Amazon (Alexa), Nielsen (media analytics), Ecolab (advanced analytics). IC-focused. NightOwl Consulting (AI Engineer): Built multi-vector ranking using Gemini embeddings (3072-d) with weighted fusion; hybrid retrieval in Qdrant (dense similarity + metadata filtering) and skill-overlap reranking; RAG with GPT-4 batch classification via structured prompts. Implemented LLM orchestration (JSONL, async polling, exponential backoff, auto-retry), NL query parsing with Gemini/OpenAI fallback, and LLM-powered explainability (rationale + evidence). Deployed on Render with FastAPI/Streamlit, GitHub Actions CI/CD, cron scheduling, and environment management; focus on latency, reliability, and observability. Vibe Engine AI (AI/ML Engineer): IC lead for multi-agent system (Selenium crawling → Pinecone embeddings → competitor SOV analytics) across closed LLMs; shipped agent-generated content and tracking that lifted AI visibility ~20%. Stack: Python, Pinecone, Selenium, prompt engineering, evaluation, dashboards. Amazon (ML Data Associate): Improved Alexa intent diversity by 10% and model output diversity by 40% via five novel data augmentation strategies across 50+ conversations; supported NLP/intent classification quality. Nielsen (Data Scientist): Increased pipeline efficiency 40% using AWS Glue + SQL transformations; delivered 10+ QuickSight dashboards over multi-TB datasets; reduced data quality issues 15% with NLP preprocessing/deduplication, saving $100K (cost optimization and data reliability). Focus on ETL/ELT, data modeling, and stakeholder reporting. Ecolab (Sr. Technical Engineer, Advanced Analytics): Built predictive analytics POC supporting a $16M initiative; deployed Power BI KPI scorecards that improved operational efficiency and compliance 30%; time-series risk modeling cut troubleshooting time 20%. Stack: Azure, SQL, Power BI, Python. Keywords and signals: AI Engineer, applied AI, LLMs (GPT-4, Gemini), RAG, hybrid retrieval, vector search (Qdrant, Pinecone), embeddings, ranking, prompt engineering, LLM orchestration, batch pipelines, observability, explainability, evaluation, CI/CD (GitHub Actions), FastAPI, Streamlit, AWS, Azure, SQL, ETL/ELT, data modeling, A/B testing, experimentation, cost optimization, stakeholder communication, production systems.

Experience

9 yrs
Total Experience
2 yrs
Average Tenure
11 mos
Current Experience

Nightowl consulting

AI/ML Engineer

Jun 2025Present · 11 mos · Boston, Massachusetts, United States · On-site

  • Deployed 3 production RAG hybrid search systems over 200K pages on Qdrant with payload indexing and sub-ms latency; serves 20 concurrent users, saving team 10+ hrs/week.
  • Designed multi-vector semantic ranking using Gemini embeddings (3072-d) with weighted vector fusion, improving retrieval relevance 30%.
  • Built ETL and classification pipelines with LlamaIndex and GPT-4 for parsing unstructured docs and US mortgage domain tagging at scale; daily batch on Cloud Scheduler.
  • Engineered NL query parsing (Gemini + OpenAI fallback) to extract structured intent/filters with LLM-powered explainability for rationale and evidence.
  • Productionized on GCP (Cloud Run, Cloud Storage, Cloud Scheduler) with FastAPI and GitHub CI/CD for automated deployment.
Apache AirflowSemantic SearchOpenAI APIHugging Face ProductsVector DatabasesLlamaIndex+2

Vibe engine ai

AI/ML Engineer

Jan 2025Jun 2025 · 5 mos · Boston, Massachusetts, United States

  • Led GEO/AEO research and translated findings into production POC in 3 months; internal testing showed 20% Share of Voice lift over 90 days.
  • Architected core visibility engine that autonomously queries 4 LLMs (ChatGPT, Claude, Gemini, Perplexity) to track and score brand Share of Voice in real-time.
  • Built 4-agent system using CrewAI/Langchain (Auditor, Researcher, Copywriter, Analyst) to scrape client sites, identify semantic gaps, and generate optimized content.
  • Led AI content generation via MCP server across Reddit, LinkedIn, Twitter, and blogs for GEO/AEO optimization.
OpenAI APIDatabricks ProductsVector DatabasesAI EngineeringAI Content Generation

Amazon

ML Data Associate

Oct 2024Dec 2024 · 2 mos · Boston, Massachusetts, United States

  • Optimized Alexa chatbot AI by analyzing 50+ user conversations, resulting in 10% improvement in intent diversity.
  • Improved AI model output diversity by 40% with the help of 5 novel data augmentation strategies.

Heronai

ML Engineer

Jun 2024Aug 2024 · 2 mos · Cambridge, Massachusetts, United States · On-site

  • Implemented automated data cleaning workflows using pandas, enhancing data accuracy and integrity, which improved reporting efficiency by 20% within 2 months.
  • Powered analytics platform with proprietary Financial KPI by Ingesting processed data to Postgres SQL, helping users gain financial health visibility.
  • Conducted product experiments including A/B testing and competitor analysis, presenting results to CEO which informed user feedback collection strategy.
  • Developed Chrome extension integrating AI chat functionality, enhancing user experience for 1000 active users.
  • Conducted in-depth research on cutting-edge tools, techniques, and scientific literature for fine-tuning LLM models in financial applications, accelerating the company's AI strategy development.
Machine LearningA/B TestingVersion ControlData SciencePostgreSQLData Pipelines+5

Questrom school of business, boston university

AI Research Assistant

Aug 2023Jun 2024 · 10 mos · Boston, Massachusetts, United States

  • Benchmarked REaLTabFormer against GAN-based models (CTGAN) to evaluate statistical similarity, privacy preservation, and downstream ML utility.
  • Designed a comprehensive evaluation framework to assess synthetic data quality across fidelity, privacy risk, and model performance metrics.
  • Implemented privacy attack simulations including membership inference and attribute inference to validate privacy preservation guarantees.
Amazon AthenaA/B TestingAmazon QuickSightStatistical Data AnalysisVisualizationProduct Analytics+16

Nielsen

Data Scientist

Jun 2022Jun 2023 · 1 yr · India

  • Designed and published 10+ dashboards using AWS QuickSight, applying advanced SQL joins, filters, and calculated fields to derive insights from multi-TB datasets.
  • Improved data pipeline efficiency by 40% and enhanced reporting accuracy media analytics clients, leveraging AWS Glue and SQL-driven data transformations.
  • Saved $100K by reducing data quality issues by 15% through NLP-driven text preprocessing and deduplication for high-volume datasets.
R (Programming Language)Time Series ForecastingRoot Cause AnalysisVisualizationBusiness Intelligence (BI)Data Pipelines+8

Ecolab

Sr. Data Scientist (Advanced Analytics)

Jul 2016May 2022 · 5 yrs 10 mos · India

  • Translated business use case into predictive analytics frameworks. Validated the model by building POC for predicting asset performance using python for $16M project.
  • Developed and deployed interactive Power BI dashboards and KPI scorecards for predictive analytics, resulting in a 30% boost in operational efficiency and compliance score.
  • Improved performance of enterprise analytics platform by optimizing data ingestion from Azure and building scalable data models using SQL and enterprise data warehouse sources.
  • Reduced troubleshooting time by 20% recommending actionable insights by Performing time series analysis and improving system reliability by predicting risk.
  • Increased data cadence resulting in visibility of crucial client assets performance by 10%, Automated manual operations with Excel and custom tools.

Education

Questrom School of Business, Boston University

Master in Business Analytics

Aug 2023Sep 2024

North Maharashtra University

Bachelor of Engineering (B.E.) — Chemical Engineering

Jan 2012Jan 2016

Stackforce found 100+ more professionals with Ai Engineering & Mlops

Explore similar profiles based on matching skills and experience