O

Omkar Dhekane

Software Engineer

Chicago, Illinois, United States2 yrs experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Reduced manual operations by 50% through automation.
  • Developed scalable data pipelines processing millions of records.
  • Led full-stack development for IoT-enabled applications.
Stackforce AI infers this person is a Full-Stack Developer with expertise in Data Engineering and Machine Learning for IoT applications.

Contact

Skills

Core Skills

PythonData PipelinesFull-stack DevelopmentIot SystemsData ScienceMachine LearningNatural Language Processing (nlp)

Other Skills

GitGithubSQLRelational Models & unstructured schemasDockerAutomationDistributed SystemsCloud ToolsCourseraTechnical MentorshipLeadershipCommunicationProblem-solvingPatienceTeamwork

About

I am a Master of Computer Science candidate at UIUC with 1+ year of experience delivering production-ready software, scalable data pipelines, and full-stack applications across academia, research, and industry. I specialise in Python, SQL, React, AWS, Spark, and distributed systems, with a proven record of cutting manual operations by 50%, scaling data pipelines to millions of records, and deploying cloud-native IoT solutions. Software & Data Engineering: I bring professional experience as a Junior Data Engineer & Scientist at Findability Sciences, where I designed and deployed a multithreaded ETL + ML pipeline in Python to process 5M+ SAP HANA records end-to-end. I automated reporting workflows using Jenkins CI/CD and Docker, reducing manual operations by 50% while strengthening internal decision-support systems. Full-Stack & IoT Development: Spearheaded a real-time web app (React, Python, MQTT) for an IoT-enabled cement testing device in Prof. Garg’s lab, integrating image analysis (OpenCV) to automate feedback loops. Built end-to-end, responsive and scalable systems like RecruitMe (Django, MySQL) and FoodForward (React + Spring Boot, MySQL, GCP) during graduate Course Projects. AI/ML & Research Contributions: I have a deep interest in applying machine learning, vision & NLP to real-world challenges. I developed a multi-label wheat crop disease classification pipeline using TensorFlow, Docker, and Streamlit, achieving a 71.38% F1-score with explainable AI (Grad-CAM). At L3Cube Pune, I contributed to MahaNLP, an open-source NLP library powered by transformer models, optimising tokenisers and classifiers for 100K+ text records. I also engineered domain-adaptive data selection methods to improve data quality in low-resource settings, published through Springer. Cloud & DevOps: Experienced in AWS (Lambda, EC2, S3, DynamoDB, API Gateway, Cognito, Lex V2), CI/CD (GitHub Actions, Jenkins), and container orchestration (Docker, Kubernetes). Proficient in building resilient data pipelines, cloud services, and distributed systems. Leadership & Teaching: As a Graduate Teaching Assistant at UIUC, I revamped grading pipelines (Python, SQL, Docker) for 300+ submissions and led weekly office hours to support students. These experiences not only deepened my technical expertise but also honed my communication, mentoring, and leadership skills. I am actively seeking Early Career Software Engineering and Data Engineering roles for 2026. My passion lies in building production-ready, scalable, and impactful systems at the intersection of software, data, and AI.

Experience

2 yrs
Total Experience
8 mos
Average Tenure
10 mos
Current Experience

Siebel school of computing and data science

Teaching Assistant

Aug 2025Present · 10 mos · Champaign, Illinois, United States · On-site

  • As a Graduate TA for CS598 (Foundations of Data Curation), I'm assisting Prof. Willis Craig in both technical infrastructure development and student support, applying principles of software engineering and data engineering in an academic setting.
  • Responsibilities:
  • 1) Engineer, debug, and automate course grader pipeline (Python, SQL, Docker) for 300+ biweekly submissions on PrairieLearn, improving grading reliability and reducing manual effort.
  • 2) Develop and update programming labs, quizzes, and assignments on PrairieLearn & Coursera, ensuring scalable delivery and seamless integration with backend systems.
  • 3) Optimise course analytics and monitoring by compiling FAQs, analysing student engagement trends, and building structured feedback loops to drive continuous improvements.
  • 4) Maintain course infrastructure, including websites, autograders, and internal tools, ensuring system uptime and smooth operations.
  • 5) Collaborate in weekly staff meetings to design and refine technical course content, aligning learning outcomes with industry-relevant engineering skills.
  • 6) Provide technical mentorship through office hours, debugging complex assignments, and guiding students on database design, data pipelines, and scalable software practices.
  • 7) Weekly attend TA Training sessions to better serve as a TA.
  • Skills: Python, Git, Github, SQL, Relational Models & unstructured schemas, Docker, Data Pipelines, Automation, Distributed Systems, Cloud Tools, Coursera, Technical Mentorship, Leadership, Communication, Problem-solving, Patience, Teamwork, Time Management.
PythonGitGithubSQLRelational Models & unstructured schemasDocker+12

University of illinois urbana-champaign

3 roles

Software Vision and Automation Assistant

Promoted

May 2025Aug 2025 · 3 mos

  • Contributed to the Garg Research Group by leading the software architecture and full-stack development of an IoT-enabled cement strength testing device, working with a cross-functional team of engineers and researchers to modernise a critical operation in the construction materials industry. The goal is to design scalable, automated, and data-driven systems that seamlessly connect embedded hardware with cloud applications.
  • Responsibilities:
  • 1) Lead the end-to-end design and engineering of a full-stack web application using React, Node.js, Express, and PostgreSQL, applying modern SDLC practices to ensure scalability, maintainability, and production readiness. Also contributed to the PyQt5 legacy desktop app, maintaining compatibility across platforms.
  • 2) Enable real-time device-to-cloud connectivity by leveraging the MQTT protocol (HiveMQ Cloud broker), creating a reliable, low-latency communication pipeline between embedded hardware and the web dashboard accessible globally.
  • 3) Implement networking infrastructure on Raspberry Pi via RaspAP Auto Access Point mode, allowing seamless plug-and-play device setup and wireless connectivity without requiring external configuration.
  • 4) Develop multi-stage execution pipelines that coordinate low-level hardware instructions on Raspberry Pi with the web application, ensuring synchronisation between device tasks, user inputs, and backend services.
  • 5) Integrate computer vision pipelines using OpenCV to automatically extract regions of interest (ROI) from cuvette images, calculate concentration and dissolution indices, and deliver real-time analytics to researchers through the dashboard.
  • I collaborated in a cross-disciplinary environment (Electronics, Mechanical, and Researchers), translating research needs into software requirements and helping interview, onboard, and mentor new contributors joining the project.
  • Skills: Full-Stack Development, Automation, Cross-Functional Collaboration, Technical Mentorship, Leadership.
Full-Stack DevelopmentAutomationCross-Functional CollaborationTechnical MentorshipLeadershipWeb Development+8

Course Assistant

Promoted

Feb 2025May 2025 · 3 mos

  • Assisted Prof. Matthew Caesar in CS437: Internet of Things, focusing on evaluating and improving student projects in embedded systems, networking, and IoT application development.
  • Responsibilities:
  • 1) Grade projects/machine problems related to IoT protocols, distributed systems, and edge-device integration, ensuring technical accuracy and adherence to best practices.
  • 2) Provide detailed technical feedback to students on design, implementation, and debugging, strengthening their skills in embedded programming, data pipelines, and real-time systems.
  • 3) Collaborate with TAs and faculty to refine course content, ensuring assignments reflect industry-relevant technologies.
  • Skills: IoT Systems, Code Review, Software Systems, Technical Communication, Mentorship, Collaboration, Teamwork, Excel.
IoT SystemsCode ReviewSoftware SystemsTechnical CommunicationMentorshipCollaboration+5

Student Dining Assistant

Aug 2024Dec 2024 · 4 mos · Champaign, Illinois, United States · On-site

  • A side hustle that ultimately strengthened time-management, teamwork, multicultural and communication skills in a fast-paced environment.
  • Delivered friendly and efficient customer service to hundreds of students daily during high-traffic dining hours.
  • Collaborated with a diverse team to support meal preparation, serving, and inventory tasks.
  • Received commendation from the manager for going above and beyond duties to better serve people.
LeadershipIntercultural CommunicationTeam workingCollaborative Problem Solving

Findability sciences

Junior Data Scientist

Aug 2023Jul 2024 · 11 mos · Mumbai, Maharashtra, India · On-site

  • Engineered a Python-based, multithreaded ETL + forecasting tool, processing up to 8 million SAP HANA records monthly for an electronic components manufacturer and supplier client, achieving a 10% reduction in MAPE and 3× faster SKU-level time-series modelling.
  • Automated and orchestrated production ETL workflows with modern CI/CD and scheduling tools (GitHub-like SVN, Jenkins, Cron,), ensuring reliable monthly runs and reducing manual effort by 50%.
  • Built and integrated data ingestion and validation modules for external regressors from the FRED API, uncovering key demand drivers and improving forecast reliability by 20%.
  • Documented the pipeline architecture and data flow to support scalability, onboarding, and long-term maintainability.
  • Continuous experimentation and benchmarking of the state-of-the-art models with respect to the deployed model.
  • Collaborated with the Data Science team and Domain experts to investigate the gaps and opportunities in the purchase trends for the client and provide appropriate advice to the stakeholders.
LightGBMPython (Programming Language)Regression ModelsStatsmodelsCross-functional Team LeadershipTime Series Forecasting+6

L3cube pune

Software Developer and Research Intern | Open source NLP

Aug 2022Jun 2023 · 10 mos · Pune, Maharashtra, India · Hybrid

  • Co-authored 3 publications on domain-adaptive pretraining, datasets, and transformer-based NLP libraries (Springer/AACL)
  • Research Motivation:
  • High-quality data is crucial for effective machine learning models. For LLMs, pretraining on large unstructured data can be tedious. Our research explored multiple data selection strategies to extract in-domain corpus from generic-domain data, enhancing model alignment with domain knowledge.
  • Key Contributions:
  • Developed - mahaNLP, an open-source Marathi NLP toolkit built on MahaBERT-based transformer models, significantly improving Marathi text analysis. Enabled researchers and developers to process 100,000+ Marathi text datasets with state-of-the-art accuracy, advancing NLP for low-resource Indian languages. Published in IJCNLP 2023
  • Curated L3Cube-MahaNews, the largest supervised Marathi text classification dataset (108,643 records across 12 categories), facilitating document-length analysis with advanced BERT models. Published in SPELLL 2023
  • Designed 'TextGram', a rank-based domain-adaptive data selection strategy for LLM pretraining, achieving a 1% increase in F1-score on IMDb classification and outperforming existing techniques on the RealNews dataset. Published in SPELLL 2023
  • Conducted extensive literature reviews on advanced NLP techniques, driving innovation in low-resource language processing.
Natural Language Processing (NLP)Python (Programming Language)Extract, Transform, Load (ETL)Word EmbeddingsData PipelinesPyPI+2

Persistent systems

Machine Learning Intern

Jun 2022Aug 2022 · 2 mos · Pune, Maharashtra, India · Remote

  • Key Contributions:
  • Developed an end-to-end retrieval pipeline with intelligent contextual search, leveraging the Haystack QA system to efficiently extract information from policy documents.
  • Conducted Vulnerability Assessment and Penetration Testing (VAPT) to identify and mitigate security issues, enhancing application security and robustness.
Natural Language Processing (NLP)Python (Programming Language)MLOpsTeam workingLeadershipFlask+5

Pune institute of computer technology

Research Assistant Intern

Mar 2022May 2022 · 2 mos · Pune

  • Worked on the task of Optical Handwritten Charater Recongition under Dr. Geetanjali Kale to automate the task of checking the answer sheets written by students during examination which is generally done by human examiner.
  • The dataset was manually curated with in-college exam sheets database.
Regression Models

Education

University of Illinois Urbana-Champaign

Master's degree — Computer Science

Aug 2024Present

Pune Institute of Computer Technology

Bachelor of Engineering - BE — Computer Engineering

Jan 2019Jan 2023

Fergusson College

12th HSC Junior College — Science

Jun 2017May 2019

New English School, Ramanbaug, Pune

10th SSC

Jan 2011Jan 2017

Stackforce found 100+ more professionals with Python & Data Pipelines

Explore similar profiles based on matching skills and experience