Shweta Smriti Tripathi

Associate Consultant

New York, New York, United States3 yrs experience
AI EnabledAI ML Practitioner

Key Highlights

  • Built high-impact AI systems reducing invalid outputs from 28% to 6%
  • Engineered Kafka-to-Snowflake pipelines for 10M+ daily events
  • Automated TB-scale data refreshes from 8 hours to 30 minutes
Stackforce AI infers this person is a Data Engineer with expertise in SaaS and AI-driven solutions.

Contact

Skills

Core Skills

Database Management System (dbms)Python (programming Language)Generative Ai (langchain/langgraph)Mlops & ReliabilitySnowflake CloudBusiness IntelligenceData VisualizationMachine Learning

Other Skills

Arduino IDEAzure Data FactoryBusiness AnalysisC (Programming Language)ChromaClaudeCommunicationCritical ThinkingCustomer Relationship Management (CRM)Customer SuccessCybersecurityData WarehousingFastAPIFastAPI & MLOpsFinancial Reporting

About

I’m a Data Science grad student at Columbia who’s tired of seeing great models die in Jupyter notebooks. My focus is on the "how" of turning Generative AI into reliable, agentic systems that actually solve business bottlenecks. With a background in building data systems that actually survive the real world, my experience bridges the gap between raw data engineering and applied AI, moving beyond notebooks to build reliable, production-grade tools. Whether it's modeling 10M+ daily events into Snowflake at Providence or automating massive Spark pipelines for 4M+ records at Dell, I focus on making high-scale data both accessible and actionable. Right now, I’m focused on how Agentic AI is turning LLMs into functional partners rather than just chat interfaces. I’ve built RAG-based copilots that map complex business questions to grounded SQL queries and developed planners that handle everything from macro-tracking to schedule optimization. I love the challenge of cutting through the noise, like reducing invalid AI outputs from 28% to 6%, and I’m looking for a summer role where I can build more of these high-impact, act instead of just predict systems.

Experience

3 yrs
Total Experience
1 yr 6 mos
Average Tenure
--
Current Experience

Columbia climate school

Teaching Assistant

Jan 2026Present · 5 mos · New York, United States

  • CLMTG5053: Computing and Research Methods for Climate Data Science

Columbia university department of computer science

Graduate Teaching Assistant

Sep 2025Dec 2025 · 3 mos · New York, United States · On-site

  • CS 4111: Introduction to Databases
MySQLDatabase Management System (DBMS)Python (Programming Language)MongoDBFlask

Providence india

Senior AI Data Engineer

Jul 2024Jul 2025 · 1 yr · Hyderabad, Telangana, India · On-site

  • Agentic AI & RAG: Built an agentic LLM + RAG copilot for 15+ analysts, integrating SQL lookups and automated ticket drafting to reduce handle time by 28%.
  • Scale & Infrastructure: Engineered Kafka-to-Snowflake pipelines for 10M+ daily security events, utilizing star schema modeling and date partitioning to cut compute costs by 35%.
  • MLOps & Reliability: Tuned anomaly detection models (0.87 F1) to cut critical backlogs by 75% while managing GenAI releases via AWS canary deploys and evaluation gates.
  • Operational Excellence: Automated TB-scale refreshes from 8 hours to 30 minutes and enforced data contracts with dbt, lifting on-time SLA from 91% to 99%.
Generative AI (LangChain/LangGraph)RAG & Vector DatabasesSnowflake CloudCybersecurityAzure Data FactoryMicrosoft Azure Machine Learning+5

Dell technologies

Analyst, Business Intelligence

Jun 2022Jul 2024 · 2 yrs 1 mo · Bengaluru, Karnataka, India

  • Pipeline Engineering: Automated PySpark and SQL ETL pipelines for 5M+ transactions, reducing manual effort by 75% and shrinking processing windows by 60%.
  • Data Integrity: Increased pipeline trust by 60% and sustained 99.5% uptime by implementing automated schema-drift checks and SQL validation monitors.
  • Global Analytics: Shipped and maintained 30+ Power BI dashboards across EMEA and LATAM, reducing metric discrepancy escalations by 55% through standardized KPI definitions.
  • Stakeholder Impact: Collaborated with Sales Ops to stabilize daily GTM refreshes and lower pilot churn by 6% through data-driven performance insights.
Natural Language Processing (NLP)Machine LearningPySparkRobotic Process Automation (RPA)Business AnalysisMicrosoft Power BI+5

Highradius

Product Expert Analyst Intern

Sep 2021Jun 2022 · 9 mos · Hyderabad, Telangana, India

  • Collaborated with cross-functional teams to determine project KPIs, ensuring effective tracking and reporting.
  • Organised and streamlined files, spreadsheets, and reports, enhancing data accessibility and usability.
  • Analysed survey data to derive actionable insights, contributing to informed decision-making processes.
  • Developed problem-solving skills by identifying solutions and making data-driven decisions.
Customer SuccessSoftware as a Service (SaaS)Microsoft OfficeCustomer Relationship Management (CRM)Financial Reporting

Dell technologies

Business Intelligence Intern

May 2021Jul 2021 · 2 mos · Bengaluru, Karnataka, India

  • Developed an early warning system using SQL and Dell internal tools to identify data anomalies.
  • Ensured data accuracy and integrity by implementing solutions with Teradata for efficient storage.
  • Collaborated with cross-functional teams to enhance data quality, leading to improved decision-making processes.
Data VisualizationPython (Programming Language)Teradata SQLSQL Server Management StudioMicrosoft Power BI

Highradius

Machine Learning Intern

Jan 2021Mar 2021 · 2 mos · Bhubaneswar, Orissa, India

  • Model Optimization: Implemented a credit risk and anomaly flagging system, improving alert precision by 22% through segment-level error analysis.
  • Feature Engineering: Engineered Python and SQL preparation flows for 5K+ samples, enhancing label consistency by 18%.
  • Full-Stack Development: Designed and implemented an Invoice Management application using Python, ReactJs, and JDBC.
Machine Learning (Classification Models)LLMsStatistical Data AnalysisPython (Programming Language)React.jsJavaScript+1

Education

Columbia University

Master's degree — Data science

Aug 2025Dec 2026

KIIT - Kalinga Institute of Industrial Technology

Bachelor of Technology - BTech — Computer Science

Jan 2018Jan 2022

Stackforce found 100+ more professionals with Database Management System (dbms) & Python (programming Language)

Explore similar profiles based on matching skills and experience