Gayatri Sharma Kurmatey

CEO

San Francisco, California, United States4 yrs experience
Most Likely To Switch

Key Highlights

  • 5+ years of experience in data engineering and machine learning.
  • Research collaboration with NASA on protein structure analysis.
  • Expertise in building scalable data systems and pipelines.
Stackforce AI infers this person is a Data Engineering and Bioinformatics expert with a strong focus on machine learning applications.

Contact

Skills

Core Skills

Project ManagementData EngineeringBusiness IntelligenceResearchBioinformaticsData AnalysisData ScienceMachine LearningLogisticsData ManagementDatabase DesignBusiness Analysis

Other Skills

Technical Project LeadershipPython (Programming Language)Data PipelinesERP ImplementationsOdooSupply Chain OperationsData LoadingData ServicesDigital MarketingMicrosoft Power PagesBusiness Intelligence (BI)PresentationsNatural Language Processing (NLP)Unsupervised Machine Learning ModelsGraph Neural Networks

About

I am a data professional and aspiring researcher with 5+ years of experience at the intersection of machine learning, data engineering, and computational modeling. My work focuses on developing structure-aware and scalable learning frameworks for complex systems, with a particular interest in biological and networked data. I am currently engaged in research collaborations with NASA Ames Research Center, where I am building graph-based representations of protein structures using geometric deep learning and unsupervised learning techniques. My work explores how structural representations can capture functional variation, with broader implications for computational biology, genomics, and multi-scale system modeling. My research interests include graph neural networks, representation learning, biological networks, and scalable graph algorithms, with a long-term goal of bridging sequence, structure, and function through machine learning. I am particularly interested in designing models that are both computationally efficient and scientifically interpretable. Alongside my research, I bring strong industry experience from organizations such as Tesla, PG&E, and public sector institutions, where I have built end-to-end data systems, automated pipelines, and decision-support tools. These experiences inform my approach to research, grounding theoretical work in real-world applicability and scalability. Technically, I work with Python, deep learning frameworks, and cloud platforms, with hands-on experience in GNNs, large-scale data pipelines, and high-performance data systems.

Experience

4 yrs
Total Experience
1 yr 3 mos
Average Tenure
1 yr 4 mos
Current Experience

Belong automotive technology, llc usa

2 roles

Technical Project Lead

Promoted

Apr 2026Present · 2 mos

  • Leading a data engineering project with a team of 2 interns, managing task allocation, code reviews, and delivery timelines
  • Designing and implementing end-to-end data pipelines and system architecture for internal ERP and dashboard systems
  • Collaborating with leadership to translate business requirements into scalable technical solutions
Project ManagementTechnical Project LeadershipPython (Programming Language)Data PipelinesERP ImplementationsOdoo+2

Data Engineer

Oct 2025Present · 8 mos

  • Developed and maintained internal software and data systems to enhance service tracking for Belong Automotive Technology.
  • Collaborated closely with the CEO to align data initiatives with overarching business goals.
  • Focused on driving operational efficiency through innovative data solutions.
Data LoadingData ServicesDigital MarketingMicrosoft Power PagesBusiness Intelligence (BI)Data Engineering+1

Nasa ames research center

2 roles

Open-Source Researcher

Feb 2025Present · 1 yr 4 mos · Remote

  • Research collaboration with NASA Ames and the University of Idaho, focused on structural bioinformatics and AI/ML applications:
  • Developed Conducted bioinformatics-driven analysis of atomic structures of amino acids within protein chains using Chimera, PyMol and BioPython to extract 3D spatial and chemical features
  • Engineered atomic-level graph embeddings from PDB kinase structures using Graph2Vec and Graph Neural Networks (GNNs) as part of a structural bioinformatics pipeline
  • Constructed molecular graphs with NetworkX and Karateclub, testing use-cases like Graph2Vec, MEGNet and SE3 Transformers, to encode structural similarities for unsupervised Geometric Graph Neural Network based clustering of active/inactive protein conformations
  • Applied statistical modeling and dimensionality reduction (PCA/UMAP/t-SNE) to validate embedding quality and optimize clustering in bioinformatics workflows
PresentationsNatural Language Processing (NLP)Unsupervised Machine Learning ModelsGraph Neural NetworksDeep LearningStructural Bioinformatics+2

Open-Source Collaborator

Feb 2024Present · 2 yrs 4 mos · Remote

  • • Collaborated on an open-source protein research project at NASA Ames Research Center using AlphaFold and machine learning for protein structure analysis
Computational BiologyBioinformaticsNatural Language Processing (NLP)Unsupervised Machine Learning ModelsPython (Programming Language)Graph Neural Networks+1

Skytech services inc

Data Engineer

Oct 2024Oct 2025 · 1 yr · Piscataway, New Jersey, United States · Remote

  • Gosalus Project:
  • Led a multi-phase HR–provider data integration project, building MSSQL backend, data models, and API integrations improving system interoperability by 30%
  • Built Power BI dashboards and SQL analytics that identified data integrity issues and improved reporting accuracy by 25%.
  • Implemented Azure Synapse RBAC, compliance workflows, and automation, reducing manual effort by 15+ hrs/week and ensuring secure deployments

Robert half

Data Science Engineer

Mar 2024Jan 2025 · 10 mos · Fremont, California, United States · On-site

  • Client: Fremont Unified School District
  • (via SkyTech Services Inc from Oct 2024 on Corp-to-Corp)
  • Led a team of 6 to deploy scalable ETL pipelines Python, SQL, Azure Synapse, and Power Platform, cutting runtime by 25 hours/week and boosting data reliability
  • Boosted cross-platform data exchange efficiency by 30% through strategic API integrations and schema mapping, streamlining legacy- to-cloud migration
  • Designed ERP-integrated dashboards and automated pipelines with Power Platform and SharePoint, using A/B testing to align KPIs and boost finance reporting transparency
  • Built automated software for 2,500+ laptop allocations and HR data via automated Google Sheets, Power Platform and SharePoint, saving $100K+ annually and adopted as a lighthouse project statewide
  • Mentored team members on Power Platform, saving 2 hours daily through improved workflows and scalable BI practices
Cross-team CollaborationDatabasesMicrosoft ExcelAutomotive EngineeringVBA ExcelDatabase Consulting+9

Pacific gas and electric company

Business Intelligence Analyst

May 2023Aug 2023 · 3 mos · San Ramon, California, United States · On-site

  • Analyzed Pole and Tree inspection data using Power BI for PG&E, achieving a 70% reduction in dashboard data load time.
  • Integrated SQL, Python, and Power Automate for an ~80% weekly performance boost in workflow automation.
  • Administered Power BI, managing access and controls based on role and designation.
  • Assisted fellow teammates in debugging errors and resolving DAX issues to ensure efficient and optimized functioning of dashboards.
Data VisualizationMicrosoft ExcelData WarehousingMicrosoft Power PlatformTeamworkBusiness Intelligence (BI)+4

Tesla

Data Science Intern

May 2022Dec 2022 · 7 mos · Fremont, California, United States · On-site

  • Monitored infrastructure project invoices using a MySQL database and an interactive Tableau dashboard, delivering weekly email updates on payment dues to the teams.
  • Crafted a visually captivating Power BI dashboard, enhancing the re-entry process by providing executives with valuable insights for office space planning and capacity management across a workforce of 110,000 employees.
  • Engineered an optimized data model to streamline the collection of procurement and project data from diverse databases, significantly reducing Power BI data load time.
  • Utilized SQL and Python to analyze procurement data from multiple databases, presenting insights through Power Query, DAX, and Power BI to Tesla leadership, identifying profit opportunities.
  • Researched and analyzed 100+ construction projects, applying statistical analysis. Developed an efficient financial automation tool with PowerApps and Power BI, saving 4 hours monthly.
  • Integrated Power Apps and Power Automate Flows with Power BI to track metrics, leading the rollout of a notification workflow for efficient team updates.
  • Streamlined user access and maintenance of Power BI and Power Apps using Azure Active Directory.
  • Mentored and managed new hire interns and teammates, overseeing tasks, providing query support, and assisting with debugging to ensure a smooth learning curve.
  • Collaborated with engineering and supply chain teams to enhance a construction project's bill of materials (BoM), resulting in a 25% improvement in cost, quality, and delivery efficiency for optimized vendor quotes.
Data VisualizationDatabasesConfluenceMicrosoft ExcelBusiness AnalysisPresentations+4

Tango

Data Scientist Intern

Feb 2022May 2022 · 3 mos · Dallas, Texas, United States · Remote

  • Conducted data preprocessing on floor plan images using Computer Vision Annotation Tool.
  • Utilized OpenCV Python for precise image processing and annotation of floor plan elements.
  • Developed a Mask R-CNN machine learning model for automated blueprint labeling, achieving a 90% improvement in entity identification.
Image ProcessingPresentationsLinuxNatural Language Processing (NLP)Machine LearningData Preparation+1

William b. hanson center for space sciences, utd

Graduate Student Researcher

Jan 2022Sep 2022 · 8 mos · Dallas, Texas, United States

  • Machine Learning Super-Resolution for Remote Sensing and OpenDataCubes
  • Working in the Lary Research Group:
  • https://davidlary.info/
  • Developed a containerized instance of the Open Data Cube and associated data ingestion scripts to enable collection and processing of imagery with machine learning from NASA satellites like Landsat, Modis, and Sentinel.
  • Performed data ingestion in Docker Container and indexed the images on longitudes and latitudes using Jupyter Notebook and Ubuntu WSL, starting from Richardson, TX.
LinuxNatural Language Processing (NLP)DockerMachine LearningPython (Programming Language)Research Skills+1

Naveen jindal school of management, ut dallas

Research Assistant - Machine Learning

Sep 2021Dec 2021 · 3 mos · Dallas, Texas, United States

  • Effects of COVID-19 on Accident Severity
  • Focused on the research developments of risk factors associated with road accident severity, feature selection methods in the field of traffic safety, and classification algorithms for the prediction of accident severity.
  • Verified machine learning models like Support Vector Machine, Neural Networks, and Logistic Regression using feature selection algorithms on factors that can contribute to predicting road accident severity.
  • Performed research on stacking model in machine learning, to improve the accuracy of prediction of road accident severity by at least 70% to 80%.
Machine LearningResearch SkillsResearch

Amazon

Logistics Data Analyst Associate

Aug 2018Aug 2019 · 1 yr · Hyderabad, Telangana, India

  • Contributed to an 80% improvement in Flex onboarding efficiency while overseeing daily reporting for diverse businesses, leveraging Advanced Excel Functions for metric crafting.
  • Enhanced team efficiency by 75%, skillfully resolving on-site issues during customer deliveries.
  • Implemented Microsoft Power Automate and Python, streamlining processes and enhancing metrics visibility, significantly contributing to organizational growth and operational efficiency by 40%.
Microsoft ExcelMicrosoft Power AutomateExcel PivotMicrosoft Power QueryPython (Programming Language)Data Analysis+1

Ursc - u r rao satellite centre

2 roles

Project Trainee

Jan 2018Mar 2018 · 2 mos · Bengaluru, Karnataka, India

  • Designed and implemented conference hall booking system using Sybase, MySQL, HTML and Java technologies for ISRO employees that eliminated conflicting booking for conference rooms and improved the process by 99%.
DatabasesPresentationsDatabase DesignData WarehousingData LoadingData Analysis+2

Intern

May 2017Jun 2017 · 1 mo · Bengaluru, Karnataka, India

  • Created a materialized view project using multidimensional data modeling using procurement data by extracting data from Sybase and used interface with COWAA, by writing MySQL queries in SQLdbx, and designing a user interface HTML, CSS, and JAVA helping ISRO to make efficient business decisions by analyzing the data generated in the result.
DatabasesBusiness AnalysisInformation Technology InfrastructurePresentationsDatabase DesignData Warehousing+4

Education

The University of Texas at Dallas

Master's degree — Information Technology and Management - Data Science

Jan 2021Dec 2022

Jawaharlal Nehru Technological University

Bachelor of Technology - BTech — Computer Science & Engineering

Jan 2014Jan 2018

Indian Institute of Remote Sensing (IIRS), Indian Space Research Organization (ISRO)

Certification Course — Satellite Photogrammetry and Its Applications

Jan 2020Jan 2020

Bharatiya Vidya Bhavan's

High School

Jun 2012May 2014

Kendriya Vidyalaya (KV)

10th

Mar 2011Mar 2012

Stackforce found 100+ more professionals with Project Management & Data Engineering

Explore similar profiles based on matching skills and experience