Sana Gupta

Co-Founder

San Jose, California, United States · 4 yrs 4 mos experience

Key Highlights

  • Engineered ETL pipelines for 200,000+ health records.
  • Developed computer vision algorithms for robotics.
  • Benchmarked LLMs to improve AI model performance.
Stackforce AI infers this person is a Data Science and AI specialist with a focus on healthcare and robotics.

Skills

Core Skills

Generative AI · Data Science · Machine Learning · Computer Vision

Other Skills

Data Cleaning · Bayesian Methods · Cross-team Collaboration · Data Visualization · Extract, Transform, Load (ETL) · Collaborative Problem Solving · Analytical Skills · Project Planning · Employee Training · Customer Service · Team Leadership · Logistics Management · Project Management · Laboratory Skills · Research Design

About

I believe that data is only as powerful as the stories it tells and the problems it solves. Currently, I am a Data Science and Cognitive Science student at UC San Diego, where I am diving deep into the mathematical foundations of Probabilistic Modeling, Systems for Scalable Analytics, and Machine Learning. My goal is simple: to bridge the gap between raw, unstructured data and actionable engineering solutions.

Unlike many undergraduates who only work with clean datasets, I thrive in the messy reality of real-world data. I have engineered ETL pipelines to normalize over 200,000 health records, audited thousands of weekly outputs for multimodal AI models, and developed computer vision algorithms for robotics.

My technical foundation is built on a mix of Data Engineering, Analytics, and AI:

► Generative AI & LLMs: At Mercor, I currently work on the front lines of AI, benchmarking Large Language Models and refining prompt engineering strategies to improve model performance and reduce hallucinations.
► Data Engineering & Pipelines: I have architected Python-based ETL pipelines to process massive datasets, improving system efficiency by 30% and ensuring reproducibility for downstream modeling.
► Computer Vision: As a Robotics Software Engineer, I implemented "Depth Anything" models and processed 100k+ images to give humanoid robots accurate depth perception in dynamic environments.

I am actively seeking internships in Data Science, Data Engineering, or Analytics for Summer 2026. If you are looking for a candidate who is comfortable moving from statistical theory to deploying code in production, let’s connect.

🛠 Core Competencies: Python (Pandas, NumPy, Scikit-learn), SQL, R, Generative AI, Computer Vision (OpenCV), Bayesian Methods, Git, Linux/Unix.

Experience

Total Experience: 4 yrs 4 mos

HealthEdge

Project Management Intern

Jun 2026 - Present · 0 mo

  • Summer 2026

Mercor

AI Data Quality Analyst

Aug 2025 - Dec 2025 · 4 mos

  • Designed and executed scalable evaluation protocols to benchmark Generative AI models, standardizing prompt engineering workflows for 3,000+ weekly outputs.
  • Optimized Machine Learning training datasets by resolving annotation inconsistencies, directly reducing model hallucination rates.
  • Conducted data profiling and quality analysis on multimodal outputs to identify anomalies, strengthening dataset reliability for NLP fine-tuning.
  • Established a rigorous feedback loop with QA leads to refine evaluation rubrics, improving labeling consistency across 1,000+ annotators.
Generative AI

University of California, Santa Cruz

Student Statistical Data Analyst

Jun 2025 - Sep 2025 · 3 mos

  • Engineered an automated ETL data pipeline in Python to normalize 200,000+ biological health records, reducing processing time by 30%.
  • Architected a Bayesian modeling pipeline to automate cause-of-death classification, utilizing informative priors to improve prediction accuracy over baseline methods.
  • Implemented validation testing and rigorous cleaning protocols to resolve inconsistencies in heterogeneous datasets, ensuring data integrity for downstream backend modeling.
  • Translated complex statistical findings into actionable insights for interdisciplinary research teams to guide study parameters.
Data Cleaning · Generative AI · Machine Learning · Bayesian Methods · Data Science · Cross-team Collaboration · +2 more

Slugbotics

Robotics Software Engineer - Computer Vision

Sep 2024 - Jun 2025 · 9 mos

  • Developed and deployed Deep Learning models (Depth Anything) using OpenCV to solve complex depth-estimation challenges in dynamic environments.
  • Engineered scalable Python pipelines to process and structure 100,000+ images, optimizing algorithms for real-time humanoid perception and reducing system latency.
  • Integrated perception modules and troubleshot software to ensure reliable humanoid operation across diverse operational scenarios.
  • Managed codebase and version control using Git, conducting peer code reviews to maintain high standards for modularity and clean code.
Data Cleaning · Machine Learning · Data Science · Cross-team Collaboration · Data Visualization · Extract, Transform, Load (ETL) · +1 more

UC Santa Cruz Science

Student Data Analyst - Research Analytics

Jun 2024 - Sep 2024 · 3 mos · Santa Cruz, California, United States

  • Cross-referenced 30 marine stranding datasets with several years of public records to verify accuracy.
  • Identified trends and discrepancies, which improved data collection and increased process efficiency by 15%.
  • Communicated findings to research teams, translating technical insights for diverse audiences.
Data Cleaning · Cross-team Collaboration

University of California, Santa Cruz

Undergraduate Representative - Committee of Planning and Budgeting

Oct 2023 - Jun 2024 · 8 mos

Collaborative Problem Solving · Analytical Skills · Cross-team Collaboration

UC Irvine - California State Summer School for Mathematics & Science

Student Researcher

Jun 2022 - Aug 2022 · 2 mos

  • Isolated and identified bacteria, then researched their stress response and its impact on resilience to other stressors. Presented findings at a research symposium.
Laboratory Skills · Research Design · Scientific Communications · Cross-team Collaboration

Saratoga Star Aquatics

Deck Supervisor

Jun 2021 - Sep 2023 · 2 yrs 3 mos · Saratoga, California, United States

Project Planning · Employee Training · Customer Service

Tutoring Young Minds

Founder & President

May 2020 - Jun 2023 · 3 yrs 1 mo · San Francisco Bay Area

  • Tutoring Young Minds is a 501(c)(3) non-profit organization whose mission is to provide educational mentorship to those who require support. With the services of experienced high school volunteers, Tutoring Young Minds provides free 1-on-1 tutoring services to middle and elementary school students nationally.
Team Leadership · Logistics Management · Project Management

Education

UC San Diego

Bachelor of Science (BS), Data Science

Jan 2025 - Jan 2027

University of California, Santa Cruz

Jan 2023 - Jan 2025

Lynbrook High School

Jan 2019 - Jan 2023
