S

Saumya Gupta

AI Researcher

Boston, Massachusetts, United States3 yrs 9 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Expert in developing AI models for healthcare applications.
  • Proficient in full-stack development with a focus on ML pipelines.
  • Strong leadership skills demonstrated through mentoring and team collaboration.
Stackforce AI infers this person is a Fintech and Healthcare-focused AI Developer with strong full-stack capabilities.

Contact

Skills

Core Skills

Artificial Intelligence (ai)Machine LearningFull-stack DevelopmentSoftware DevelopmentComputer Vision

Other Skills

ARMNNARMNN ConverterAdobe Experience ManagerAdobe Experience Manager (AEM)Amazon Web Services (AWS)Apache KafkaArtificial Neural NetworkBack-End Web DevelopmentBack-end DevelopmentBioinformaticsC (Programming Language)C++Cascading Style Sheets (CSS)Chain of Thought PromptingCommunication

About

Currently pursuing a Master’s in Artificial Intelligence at Khoury College of Computer Sciences and working as an AI Research Associate at Northeastern’s Institute for Experiential AI. I lead cutting-edge research on large language models (LLMs) for bioinformatics, developing models that tackle challenges in genomics and alternative splicing prediction hence bringing AI innovation to healthcare and life sciences. In parallel, I work on advancing geometric deep learning and flow matching algorithms, applying them to enhance semantic representation and generation tasks. These efforts aim to push the boundaries of what's possible in multimodal AI and generative modeling With 2+ years of industry experience at Razorpay and Rebel Foods, I’ve built scalable ML pipelines, production-grade backend systems, and worked on impactful AI solutions across domains. From hallucination detection in LLMs to building RAG systems with explainability, my work blends deep research with real-world impact. Proficient in a range of languages, tools, and technologies, I bring a unique blend of technical expertise, leadership skills, and a proactive approach to problem-solving. - Skilled in Python, Java, Golang | PyTorch, Hugging Face, Airflow | Transformers, CNNs, RNNs - Known for rapid execution, system-level thinking, and a passion for solving hard AI problems. I’m a lifelong learner, a tech geek who loves reading and exploring new horizons, and I’m always excited about the next adventure life has in store! Let’s connect if you’re working at the intersection of AI research and engineering or just want to talk LLMs!

Experience

3 yrs 9 mos
Total Experience
11 mos
Average Tenure
1 yr 10 mos
Current Experience

Institute for experiential ai at northeastern university

2 roles

Graduate Research Associate

Jan 2025Present · 1 yr 5 mos · Boston, Massachusetts, United States · Hybrid

Artificial Intelligence Research Coop

Jul 2024Dec 2024 · 5 mos · Boston, Massachusetts, United States · Hybrid

  • Developing Large Language Models to predict alternative splicing, leveraging multi-GPU training on highly imbalanced, complex datasets to improve detection and understanding of splicing events in pre-mRNA sequences, enabling more accurate insights and advancements in genetic research.
  • Leveraging Symmetric Latent Spaces in Deep Learning: Applying geometric deep learning principles, particularly equivariance and invariance, to develop Latent Diffusion Models. This work aims to exploit these models for tasks like uncertainty quantification.
Large Language ModelsMulti-GPU TrainingGeometric Deep LearningLatent Diffusion ModelsArtificial Intelligence (AI)Machine Learning

Khoury college of computer sciences

Khoury Graduate Teaching Assistant

Jan 2024Apr 2024 · 3 mos · Boston, Massachusetts, United States · On-site

  • Guiding students through Foundations of AI with interactive tutorials, clarifying concepts, and supporting their learning journey.
  • Assisting professors in course development, grading, and fostering a collaborative environment for students to excel in AI fundamentals.
  • Dedicated to creating an inclusive atmosphere, encouraging peer interaction, and promoting teamwork to enhance the overall educational experience.

Razorpay

Full Stack ML Engineer

Apr 2022Aug 2023 · 1 yr 4 mos

  • Spearheaded Golang development within the Optimizer - Payments Team at Razorpay, focusing on customer-facing projects that seamlessly addressed fundamental requirements of payment operations.
  • Collaborated with cross-functional teams, including product managers, to enhance the Optimizer product. Facilitated merchants in creating and defining diverse payment gateways, enabling them to prioritize and strategically balance payment loads through rule-based configurations.
  • Orchestrated the containerization of microservices using Docker and executed deployment on Kubernetes clusters through meticulously crafted helm charts.
  • Pioneered the design and implementation of robust integration tests for multiple microservices, in addition to creating unit tests for innovative product features, ensuring a reliable and scalable platform.
  • Notably, engineered a groundbreaking feature allowing customers to link their Paytm wallets on Razorpay’s checkout, streamlining subsequent payments through a single click. This significantly enhanced user experience and engagement.
  • Demonstrated leadership by mentoring an intern for a two-month duration, guiding them in developing a charge-back prediction model. The model utilized a simple Artificial Neural Network (ANN), achieving noteworthy success in predicting the probability of transactions resulting in charge-backs.
GolangDockerKubernetesIntegration TestingFull-Stack DevelopmentMachine Learning

Rebel foods (formerly faasos)

Software Development Engineer

Jul 2021Apr 2022 · 9 mos

  • Led back-end application development in Rebel Foods' In-order team, overseeing critical microservices for kitchen staff, food orders, and inventory management.
  • Designed and normalized a schema for efficient storage of food order, store, and kitchen staff details, fostering scalability.
  • Achieved a significant milestone by implementing a feature that allowed kitchen staff to cancel orders with validated reasons, preventing revenue loss due to false cancellations and showcasing a commitment to system integrity.
Back-end DevelopmentSchema DesignMicroservicesSoftware DevelopmentMachine Learning

Deloitte india (offices of the us)

Full Stack Development Intern

Jan 2021Jul 2021 · 6 mos

  • Contributed as a full-stack developer in Deloitte's Customers and Marketing team, focusing on projects for the Adobe client.
  • Designed and developed interactive web pages using Adobe Experience Manager, showcasing a proficiency in creating engaging user interfaces.
  • Demonstrated expertise in backend development by creating multiple REST APIs using Java, enhancing the functionality and connectivity of web applications for an optimal user experience.
Full-Stack DevelopmentREST APIsAdobe Experience ManagerSoftware Development

Indian institute of information technology

AI Research Intern

May 2020Jul 2020 · 2 mos · Prayagraj, Uttar Pradesh, India · Remote

  • Conducted a comprehensive comparative study focused on General Adversarial Networks (GANs) utilized in the transformation of text captions into images, leveraging the Caltech bird dataset.
  • Innovatively devised a novel performance evaluation method for these GANs, employing t-Distributed Stochastic Neighbor Embedding (t-SNE), a feature dimensionality reduction algorithm. This method involved plotting both the original images and images generated by GANs and analyzing the overlapping areas to provide a nuanced assessment of performance.
  • Demonstrated a keen analytical approach in evaluating the efficacy of various GANs for text-to-image generation, contributing to the advancement of research methodologies in the field of artificial intelligence and image synthesis.
GANsPerformance EvaluationArtificial Intelligence (AI)Computer Vision

Ocean energy

Computer Vision Engineer

Apr 2020May 2020 · 1 mo · Taloje Panchnad, Navi Mumbai · Remote

  • Developed a project focused on the detection of traffic rule violations through video inputs, emphasizing the development of a robust system.
  • Engineered a sophisticated system capable of identifying vehicles violating traffic signals, capturing their number plates, and subsequently converting the extracted license plate image data into text.
  • Implemented a seamless integration with a database, ensuring the organized storage of pertinent information, thereby contributing to the enhancement of traffic monitoring and enforcement systems.
Traffic Rule DetectionVideo ProcessingComputer VisionArtificial Intelligence (AI)

Samsung electronics

Samsung Prism Developer

Dec 2019Aug 2020 · 8 mos · Bangalore

  • Engaged in a project within the 'Device Intelligence' domain, titled "Design and Implement Manual Partitioning in ARMNN Converter," as part of a collaborative team of three under the guidance of Professor Delhibabu R. at VIT University.
  • Specialized in the ML/AI worklet area, where the primary focus was enhancing the ARMNN converter by introducing manual partitioning techniques.
  • Successfully contributed to the team's efforts in exploring and implementing advanced techniques within the ML/AI field, showcasing a commitment to research and innovation under the guidance of Professor Delhibabu R. at VIT University.
Manual PartitioningARMNN ConverterArtificial Intelligence (AI)Machine Learning

Binplus technologies (p) limited

Web Development Intern

May 2019Jun 2019 · 1 mo · Jhansi, Uttar Pradesh, India

  • Functioned as a full-stack developer at Binplus Technologies, actively contributing to the development of user-friendly websites with a focus on both frontend interfaces and backend APIs.
  • Built a significant project by designing and implementing a dynamic dashboard featuring multiple statistical charts generated from real-time data. Notably, integrated an advanced search bar with autofill functionality, providing users with seamless and efficient data exploration.
  • Awarded for exceptional hard work and dedication with a Certificate of Excellence at Binplus Technologies, for the valuable contributions made to projects and overall team success.
Full-Stack DevelopmentDynamic Dashboard DesignSoftware Development

Education

Khoury College of Computer Sciences

Master's degree — Artificial Intelligence

Sep 2023Dec 2025

Vellore Institute of Technology

Bachelor of Technology — Computer Science

Jan 2017Jan 2021

Stackforce found 100+ more professionals with Artificial Intelligence (ai) & Machine Learning

Explore similar profiles based on matching skills and experience