Dishant Padalia — Co-Founder

🚀 AI Enthusiast | Software Engineer | Graduate Student Researcher at Meta Platforms, Inc. | MSc Computer Science at UMass Amherst Welcome to my LinkedIn page! I'm Dishant Padalia, currently pursuing a Master’s degree Computer Science at the University of Massachusetts Amherst. With a focus on Machine Learning, AI, and Natural Language Processing, I am dedicated to harnessing the power of AI to address and solve complex challenges in the real world 🌟 Professional Journey: - Meta Platforms, Inc. (Graduate Student Researcher): Led the enhancement of text-to-image models like Stable Diffusion and DALLE by developing novel evaluation methods for geographical biases and innovative feature extraction techniques. - Outmarket AI (Software Engineer Intern): Implemented a Retrieval Augmented Generation (RAG) Text2SQL model, significantly improving database querying efficiency with advanced prompt engineering and integration of large language models including GPT-4. - IIT Bombay (Machine Learning Intern): Developed a Vision Transformer model that significantly improved multilingual document recognition and streamlined data labeling processes, contributing to published research and boosting OCR performance. 🛠️ Technical Toolkit: Proficient in Python, Java, C++, TensorFlow, PyTorch, and more. I am always on the lookout for new technologies and methodologies that can push the boundaries of AI and machine learning. 💡 I thrive on innovation and collaboration. Whether it's leading a critical project or participating in groundbreaking research, I bring a blend of technical acumen and creative problem-solving to achieve outstanding results. 🔗 Let's Connect! I’m keen on connecting with like-minded professionals and leaders in tech who are passionate about the future of AI and technology. Feel free to reach out if you want to collaborate or just share insights and ideas!

Stackforce AI infers this person is a Machine Learning Engineer specializing in AI and Computer Vision applications.

Location: Amherst, Massachusetts, United States

Experience: 3 mos

Skills

Machine Learning
Natural Language Processing
Computer Vision

Career Highlights

Innovated evaluation methods for AI models.
Developed advanced document recognition systems.
Led impactful government projects in computer vision.

Work Experience

Outmarket AI

Founding Engineer (2 yrs 4 mos)

Indian Institute of Technology, Bombay

Machine Learning Intern (1 yr 2 mos)

Dr. Ninad Mehendale Research Lab

Research Intern (5 mos)

Vasundharaa Geo Technologies Pvt Ltd

Machine Learning Intern (6 mos)

Education

Master of Science - MS at University of Massachusetts Amherst

Bachelor of Technology - BTech at University of Mumbai

at Pace Junior Science College

at P.G.Garodia School ICSE

Dishant Padalia

Co-Founder

Amherst, Massachusetts, United States3 mos experience

Key Highlights

Innovated evaluation methods for AI models.
Developed advanced document recognition systems.
Led impactful government projects in computer vision.

Stackforce AI infers this person is a Machine Learning Engineer specializing in AI and Computer Vision applications.

Contact

dishant.padalia01@gmail.com LinkedIn

Skills

Core Skills

Machine LearningNatural Language ProcessingComputer Vision

Other Skills

Geographical bias evaluationText-to-image modelsFeature extractionCLIP Zero Shot classificationVision TransformerMultilingual document recognitionSynthetic dataset generationLayoutLMTableNetFastAPIEfficientNetBreast tumor detectionPre-processing pipelineYOLO modelLicense plate detection

About

Experience

3 mos

Total Experience

3 mos

Average Tenure

Current Experience

Outmarket ai

Founding Engineer

Feb 2024 – Present · 2 yrs 4 mos · San Francisco Bay Area · Hybrid

Indian institute of technology, bombay

Machine Learning Intern

Apr 2022 – Jun 2023 · 1 yr 2 mos · Mumbai, Maharashtra, India · Hybrid

1. Developed a Vision Transformer model using PyTorch and HuggingFace for multilingual handwritten document recognition, resulting in a 40% improvement in recognition rates.
2. Engineered a synthetic dataset generation algorithm to enhance domain adaptation and model generalization, leading to a 50% performance boost across six benchmark OCR datasets.
3. Integrated and fine-tuned LayoutLM and TableNet models to optimize document understanding and table detection, achieving a 30% increase in layout and text recognition efficiency.
4. Streamlined a semi-supervised data programming pipeline, allowing programmatically labeled training data using user-defined labeling functions and rules, eliminating the need for manual labeling.
5. Deployed end-to-end APIs for benchmarking and frontend integration using FastAPI, Postman, and Flask.
6. Authored a paper introducing a novel Priority-Based Suppression technique on a combination of layout models for digitization of multilingual scanned documents.
7. Instructed a Udemy course titled "A Hands-On Introduction to OCR" with over 800 enrollments.

Vision TransformerMultilingual document recognitionSynthetic dataset generationLayoutLMTableNetFastAPI+2

Dr. ninad mehendale research lab

Research Intern

Jan 2022 – Jun 2022 · 5 mos · Mumbai, Maharashtra, India

1. Created an Enhanced EfficientNet (EEF-Net) model for Breast tumor detection with 97% accuracy and 98% F1-Score on mammography scans.
2. Devised a pre-processing pipeline to remove the pectoral muscle from the mammography scans using Contrast Limited Adaptive Histogram Equalization and K-Means Clustering, boosting the model accuracy by 17%.
3. Conducted a comprehensive analysis and comparison of the current state-of-the-art models with EEF-Net, resulting in the publication of a paper in Elsevier SSRN.

EfficientNetBreast tumor detectionPre-processing pipelineComputer Vision

Vasundharaa geo technologies pvt ltd

Machine Learning Intern

Jun 2021 – Dec 2021 · 6 mos · Pune, Maharashtra, India · On-site

1. Headed a state government project focused on automating license plate detection of non-helmeted motorcycle riders, which resulted in a 25% reduction in traffic law violations at over 10 signal junctions in Pune.
2. Implemented a three-stage system incorporating the YOLO model for motorcycle, helmet, and license plate detection, achieving a Mean Average Precision score of 99%.
3. Optimized and integrated the model into the hardware system using TensorFlow Lite and Google Cloud, yielding a 70% reduction in real-time response time.

YOLO modelLicense plate detectionTensorFlow LiteComputer Vision