Dishant Padalia

Co-Founder

Amherst, Massachusetts, United States3 mos experience

Key Highlights

  • Innovated evaluation methods for AI models.
  • Developed advanced document recognition systems.
  • Led impactful government projects in computer vision.
Stackforce AI infers this person is a Machine Learning Engineer specializing in AI and Computer Vision applications.

Contact

Skills

Core Skills

Machine LearningNatural Language ProcessingComputer Vision

Other Skills

Geographical bias evaluationText-to-image modelsFeature extractionCLIP Zero Shot classificationVision TransformerMultilingual document recognitionSynthetic dataset generationLayoutLMTableNetFastAPIEfficientNetBreast tumor detectionPre-processing pipelineYOLO modelLicense plate detection

About

🚀 AI Enthusiast | Software Engineer | Graduate Student Researcher at Meta Platforms, Inc. | MSc Computer Science at UMass Amherst Welcome to my LinkedIn page! I'm Dishant Padalia, currently pursuing a Master’s degree Computer Science at the University of Massachusetts Amherst. With a focus on Machine Learning, AI, and Natural Language Processing, I am dedicated to harnessing the power of AI to address and solve complex challenges in the real world 🌟 Professional Journey: - Meta Platforms, Inc. (Graduate Student Researcher): Led the enhancement of text-to-image models like Stable Diffusion and DALLE by developing novel evaluation methods for geographical biases and innovative feature extraction techniques. - Outmarket AI (Software Engineer Intern): Implemented a Retrieval Augmented Generation (RAG) Text2SQL model, significantly improving database querying efficiency with advanced prompt engineering and integration of large language models including GPT-4. - IIT Bombay (Machine Learning Intern): Developed a Vision Transformer model that significantly improved multilingual document recognition and streamlined data labeling processes, contributing to published research and boosting OCR performance. 🛠️ Technical Toolkit: Proficient in Python, Java, C++, TensorFlow, PyTorch, and more. I am always on the lookout for new technologies and methodologies that can push the boundaries of AI and machine learning. 💡 I thrive on innovation and collaboration. Whether it's leading a critical project or participating in groundbreaking research, I bring a blend of technical acumen and creative problem-solving to achieve outstanding results. 🔗 Let's Connect! I’m keen on connecting with like-minded professionals and leaders in tech who are passionate about the future of AI and technology. Feel free to reach out if you want to collaborate or just share insights and ideas!

Experience

3 mos
Total Experience
3 mos
Average Tenure
--
Current Experience

Outmarket ai

Founding Engineer

Feb 2024Present · 2 yrs 4 mos · San Francisco Bay Area · Hybrid

Meta

Graduate Student Researcher

Feb 2024Jun 2024 · 4 mos · Remote

  • 1. Innovated an evaluation method to measure geographical biases in text-to-image models like Stable Diffusion and DALLE, enhancing precision and diversity calculations using decoupled object and background representations in images
  • 2. Implemented Segment-Anything model and zero-shot text to bounding box techniques to decouple image elements
  • 3. Developed PatchViT, a novel technique that selects relevant image patches for enhanced feature extraction in Vision Transformer (ViT) models, outperforming standard ViT and CNN-based methods
  • 4. Analysed metric trends across regions, enhancing understanding of model performance under various prompt settings
  • 5. Applied CLIP Zero Shot classification to real \& generated datasets, assessing potential biases from regional information
Geographical bias evaluationText-to-image modelsFeature extractionCLIP Zero Shot classificationMachine LearningNatural Language Processing

Indian institute of technology, bombay

Machine Learning Intern

Apr 2022Jun 2023 · 1 yr 2 mos · Mumbai, Maharashtra, India · Hybrid

  • 1. Developed a Vision Transformer model using PyTorch and HuggingFace for multilingual handwritten document recognition, resulting in a 40% improvement in recognition rates.
  • 2. Engineered a synthetic dataset generation algorithm to enhance domain adaptation and model generalization, leading to a 50% performance boost across six benchmark OCR datasets.
  • 3. Integrated and fine-tuned LayoutLM and TableNet models to optimize document understanding and table detection, achieving a 30% increase in layout and text recognition efficiency.
  • 4. Streamlined a semi-supervised data programming pipeline, allowing programmatically labeled training data using user-defined labeling functions and rules, eliminating the need for manual labeling.
  • 5. Deployed end-to-end APIs for benchmarking and frontend integration using FastAPI, Postman, and Flask.
  • 6. Authored a paper introducing a novel Priority-Based Suppression technique on a combination of layout models for digitization of multilingual scanned documents.
  • 7. Instructed a Udemy course titled "A Hands-On Introduction to OCR" with over 800 enrollments.
Vision TransformerMultilingual document recognitionSynthetic dataset generationLayoutLMTableNetFastAPI+2

Dr. ninad mehendale research lab

Research Intern

Jan 2022Jun 2022 · 5 mos · Mumbai, Maharashtra, India

  • 1. Created an Enhanced EfficientNet (EEF-Net) model for Breast tumor detection with 97% accuracy and 98% F1-Score on mammography scans.
  • 2. Devised a pre-processing pipeline to remove the pectoral muscle from the mammography scans using Contrast Limited Adaptive Histogram Equalization and K-Means Clustering, boosting the model accuracy by 17%.
  • 3. Conducted a comprehensive analysis and comparison of the current state-of-the-art models with EEF-Net, resulting in the publication of a paper in Elsevier SSRN.
EfficientNetBreast tumor detectionPre-processing pipelineComputer Vision

Vasundharaa geo technologies pvt ltd

Machine Learning Intern

Jun 2021Dec 2021 · 6 mos · Pune, Maharashtra, India · On-site

  • 1. Headed a state government project focused on automating license plate detection of non-helmeted motorcycle riders, which resulted in a 25% reduction in traffic law violations at over 10 signal junctions in Pune.
  • 2. Implemented a three-stage system incorporating the YOLO model for motorcycle, helmet, and license plate detection, achieving a Mean Average Precision score of 99%.
  • 3. Optimized and integrated the model into the hardware system using TensorFlow Lite and Google Cloud, yielding a 70% reduction in real-time response time.
YOLO modelLicense plate detectionTensorFlow LiteComputer Vision

Education

University of Massachusetts Amherst

Master of Science - MS — Computer Science

Sep 2023May 2025

University of Mumbai

Bachelor of Technology - BTech — Electronics and Telecommunication Engineering

Jan 2019Jan 2023

Pace Junior Science College

Jun 2017Jun 2019

P.G.Garodia School ICSE

May 2007May 2017

Stackforce found 100+ more professionals with Machine Learning & Natural Language Processing

Explore similar profiles based on matching skills and experience