Diya Khetarpal

Lead ML Engineer

Hansi, Haryana, India0 mo experience
AI EnabledAI ML Practitioner

Key Highlights

  • Top 6 finalist in Google Girl Hackathon.
  • Selected in top 10 teams out of 50,000+ for Campus Beats ZS.
  • Ranked among top 500 for AWS AIML nanodegree program.
Stackforce AI infers this person is a skilled AI and Machine Learning developer with a focus on innovative solutions.

Contact

Skills

Core Skills

Natural Language ProcessingGenerative AiComputer VisionEdge AiMachine LearningFront-end Development

Other Skills

AWS BedrockAlgorithmsAmazon Web Services (AWS)Artificial Intelligence (AI)Artificial Neural NetworksBERTBotpressCascading Style Sheets (CSS)Chatbot DevelopmentCommunicationData AnalysisData AnalyticsData Privacy ComplianceData ScienceData Structures

About

Hi, I’m Diya, a final-year student at Maharaja Agrasen Institute of Technology, passionate about building impactful solutions at the intersection of AI, machine learning, and software engineering. Over the course of my journey, I’ve worked on diverse projects across generative AI, natural language processing, computer vision, and no-code ML platforms. I have hands-on expertise in LangChain, LangGraph, and PyTorch, along with experience in vector databases (Milvus, Pinecone, Weaviate), cloud services (AWS Bedrock), and edge AI systems (Jetson, Dockerized microservices). I’m also an open-source contributor to SunPy (6.1 release), where I collaborated with the global developer community to improve scientific computing tools. Beyond technical skills, I thrive in team environments, bringing creativity and problem-solving to projects that matter. Some highlights of my journey so far: Top 6 finalist in Google Girl Hackathon, recognized for building an innovative ML-based healthcare solution. Selected in top 10 teams out of 50,000+ teams who applied for Campus Beats ZS Selected among top 500 out of 60,000 people for AWS AIML nanodegree program based on theme Future Data Engineer I love to explore emerging AI frameworks, push the limits of what’s possible with LLMs and generative AI, and translate complex research into real-world applications. Always eager to collaborate on innovative ideas and contribute to the growing AI ecosystem.

Experience

Etech global services

GenerativeAI Intern

Apr 2025Oct 2025 · 6 mos · Texas, United States · Remote

  • As a Generative AI Intern at eTech Global Services, I had the opportunity to work on multiple cutting-edge AI projects that combined natural language processing, information retrieval, and large language model integration.
  • I contributed to the development of a Personal Information Redaction Bot, designed to automatically detect and redact sensitive data from text, ensuring compliance with data privacy requirements. I also worked on building a Recruitment Bot to streamline candidate interaction and hiring processes, as well as a Knowledge Base Chatbot that leveraged Retrieval-Augmented Generation (RAG) for accurate and context-driven responses from ingested organizational data.
  • During this internship, I gained hands-on experience with vector databases like Milvus, Pinecone, and Weaviate, optimizing retrieval performance for large-scale embeddings. I also explored AWS Bedrock for deploying and experimenting with foundation models, integrating them into scalable AI-driven solutions.
  • This role gave me practical exposure to the end-to-end lifecycle of generative AI applications, from data ingestion and retrieval pipelines to model deployment and real-world use cases. It sharpened my ability to solve business problems using AI and strengthened my skills in LLM fine-tuning, RAG pipelines, and cloud-based AI services.
Natural Language ProcessingInformation RetrievalLarge Language Model IntegrationVector DatabasesAWS BedrockGenerative AI

Iitd-aia foundation for smart manufacturing

AIML Intern

Sep 2024Jan 2025 · 4 mos · Delhi, India · On-site

  • As an AI/ML Intern at IIT Delhi, I worked on a World Food Programme (WFP) project focused on building a cost-efficient, reliable, and intelligent surveillance system for food security monitoring across multiple warehouses in India.
  • I contributed to designing a Cost-Optimized Surveillance System by leveraging Jetson Edge Computing and Dockerized microservices, reducing deployment costs by nearly 60% across 8 warehouses. I implemented real-time motion detection using ResNet models on the Jetson Orion Nano device, achieving 97% accuracy with ~800ms latency, which further triggered YOLOv8-based anomaly detection for activity monitoring.
  • To ensure system robustness, I engineered an Edge-to-Cloud Reliability Architecture, deploying Dockerized camera services with automated reattachment (3-retry logic) integrated with Firebase, successfully managing 37 concurrent image streams across 8+ edge devices.
  • This experience gave me deep exposure to computer vision, edge AI deployment, microservice architectures, and cloud-edge integration, while working on a project that directly contributed to global food security goals.
Jetson Edge ComputingDockerized MicroservicesReal-Time Motion DetectionComputer VisionEdge AI

Dblockchainers

NLP Research Intern

Jun 2024Sep 2024 · 3 mos · India · Remote

  • As an NLP Research Intern at DBlockchainers, I worked on building an end-to-end NLP experimentation framework to train, evaluate, and optimize a wide range of language models. Over the course of my internship, I implemented and trained models from the ground up, starting with classical embeddings like Word2Vec, GloVe, and FastText, and progressing to transformer-based architectures such as BERT and RoBERTa, all within a PyTorch Lightning framework.
  • A major focus of my work was on fact-checking datasets such as ChartFact and ChartCheck, where I experimented with embedding fusion techniques (e.g., combining GloVe and FastText) to improve the representation of tabular and textual data. I also developed a large-scale evaluation engine capable of testing models across multiple datasets, using Hydra for configuration management and strategies like negative sampling to enhance model generalization and recall.
  • On the deployment side, I optimized trained models for real-world inference by applying ONNX Runtime-based quantization and other efficiency techniques, significantly reducing latency while keeping accuracy stable. I also incorporated preprocessing and tokenization pipelines using NLTK and Hugging Face Transformers, ensuring that the models could handle diverse input formats consistently.
  • This internship deepened my expertise in transformer architectures, NLP evaluation pipelines, and model optimization for production, while allowing me to explore the research side of fact verification and information consistency in natural language processing.
NLP Experimentation FrameworkTransformer ArchitecturesModel OptimizationFact-Checking DatasetsNatural Language ProcessingMachine Learning

Ibm

FEWD

Jun 2024Jul 2024 · 1 mo · Online · Remote

  • As part of the IBM Front-End Web Development (FEWD) SkillBuild Program, I collaborated with a team on “Water Watch”, a platform that enables local communities to raise and track water-related issues. I specifically worked on building the issue management pages in React, where users could upload images, raise tickets, and check issue status. This project not only strengthened my front-end development skills but also gave me valuable exposure to team collaboration, problem-solving, and building user-focused solutions.
ReactFront-End DevelopmentTeam Collaboration

Xander corp.

Machine Learning Engineer

Mar 2024May 2024 · 2 mos · Delhi, India · Remote

  • As an AI/ML Engineer at Xander, I worked on building a no-code machine learning platform where users could simply upload datasets, and the system would automatically identify the best-suited model and generate it for them. My primary contributions included automating regression and classification tasks, streamlining the model selection and training pipeline, and integrating conversational interfaces.
  • I also developed and integrated a chatbot using Botpress, enabling users to interact with the platform more intuitively and receive real-time guidance throughout the ML workflow. This experience strengthened my skills in end-to-end ML automation, chatbot development, and no-code platform design, while contributing to making AI accessible for non-technical users.
No-Code Machine LearningChatbot DevelopmentModel Training PipelineMachine Learning

Education

Maharaja Agrasen Institute Of Technology, Delhi

Bachelor of Technology - BTech — Computer Science

Nov 2022Aug 2026

Shree kali devi vidya mandir hansi

12th non med — General Studies

Apr 2012Apr 2022

Stackforce found 100+ more professionals with Natural Language Processing & Generative Ai

Explore similar profiles based on matching skills and experience