S

Satvik Praveen

Data Engineer

College Station, Texas, United States2 yrs 10 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in machine learning and data science applications.
  • Led impactful research in agricultural informatics.
  • Proven track record in optimizing data-driven systems.
Stackforce AI infers this person is a Data Science professional with expertise in healthcare, agriculture, and transportation sectors.

Contact

Skills

Core Skills

Data ScienceMachine LearningComputer VisionMobile Application DevelopmentCloud ComputingDatabase ManagementSystem OptimizationWeb DevelopmentData ManagementData AnalysisTechnical Coordination

Other Skills

API DevelopmentAggregation PipelinesAgricultural InformaticsAmazon EC2Amazon Web Services (AWS)Artificial Neural NetworksBayesian statisticsCRUDData VisualizationDatabase Management System (DBMS)Deep LearningFront-End DevelopmentGitJavaScriptLarge Language Models (LLM)

About

🚀 Actively Seeking Full-Time Opportunities! As a Data Science graduate from Texas A&M University, I bring a passion for solving impactful, real-world problems at the intersection of AI, biomedical science, and systems engineering. My expertise spans machine learning, data visualization, scientific computing, and database systems, allowing me to drive innovations across healthcare, agriculture, and mobility domains. 🔬 At the Advanced Vision and Learning Lab (AVLL), under the guidance of Prof. Joshua Peeples, I researched Neural Radiance Field (NeRF) and Gaussian Splatting-based 3D reconstruction techniques for plant phenotyping using multispectral and hyperspectral imaging. These methods aim to automate phenotypic analysis, enhancing research on food security and biofuel optimization. 🚗 As a Research Intern at the ENDEAVR Institute, I contributed to an AI-powered autonomous microtransit mobile app for underserved communities. My work involved designing route optimization algorithms, implementing real-time location tracking in Flutter, and configuring cloud-based backend systems using AWS and PostGIS. 🧬 At the Texas A&M Institute of Data Science (TAMIDS), I led a Biomedical Data Science project under Dr. Jian Tao, analyzing 1,200+ drug molecules to model glioblastoma drug responses. I improved prediction accuracy using Graph Convolutional Networks (GCNs) and cheminformatics tools such as MCS and one-class SVMs, while identifying actionable pathways like EGFR and VEGFR. 🌍 In my role as a Graduate Assistant at ISSS, I spearheaded portal enhancements that reduced student processing time by 50%, contributed to the Sunapsis–Via system migration, and authored a department-wide Prompt Engineering guide to integrate AI into workflow optimization. 🧑‍💻 As a Student Technician, I supported web content design, created 10+ promotional assets, and improved accessibility for 10,000+ international students. 🔧 I thrive in interdisciplinary teams and fast-paced environments, where I can bridge data science and domain knowledge to build reliable, intelligent systems. Let’s connect if you're looking for someone who delivers data-driven impact with clarity, creativity, and technical depth.

Experience

2 yrs 10 mos
Total Experience
6 mos
Average Tenure
8 mos
Current Experience

Dreamstudio

Data Scientist

Oct 2025 – Present · 8 mos · Atlanta, Georgia, United States · Remote

  • Contributing to Model.Earth, an open-source data-visualization framework integrating AI insights with environmental trade-flow and sustainability datasets (Exiobase, Google Data Commons, BuildingTransparency.org).
  • Developing data pipelines using Python, SQL, and JavaScript to process and visualize multi-region input-output (MRIO) data for Sustainable Material Management (SMM).
  • Building interactive machine-learning dashboards and forecasting models leveraging Census APIs and RealityStream ML frameworks.
  • Assisting in full-stack web development using Next.js (JAM Stack), Rust-based REST APIs, and Azure PostgreSQL databases.
  • Implementing data preprocessing, ML analytics, and web-based visualization pipelines for open-access sustainability research.
PythonSQLJavaScriptData VisualizationMachine LearningData Science

Department of electrical and computer engineering at texas a&m university

Graduate Student Researcher

Jan 2025 – May 2025 · 4 mos · College Station, Texas, United States · On-site

  • Contributed to AI-driven agricultural research focused on plant phenotyping to address food security and renewable energy demands, with applications in biofuel production and yield optimization.
  • Developed and implemented 3D reconstruction algorithms inspired by Neural Radiance Fields (NeRFs) and Gaussian Splatting, achieving reconstruction accuracy improvements of up to 8% PSNR on selected plant datasets.
  • Explored novel applications of NeRF and Gaussian Splatting techniques for underutilized imaging modalities, including multispectral and hyperspectral imaging, enabling richer phenotype analysis across 4+ spectral channels.
  • Leveraged Texas A&M’s High-Performance Research Computing (HPRC) infrastructure, deploying models on systems with 256+ GB RAM and NVIDIA A100 GPUs, reducing training time by 40% compared to local GPU environments.
  • Improved phenotyping workflow efficiency by reducing manual annotation time by an estimated 5–10%, while maintaining high fidelity in 3D reconstructions across diverse crop scenes.
Data ScienceDeep LearningResearch SkillsAgricultural InformaticsComputer Vision

Endeavr institute

Research Intern

Nov 2024 – May 2025 · 6 mos · College Station, Texas, United States · On-site

  • Collaborated with a multidisciplinary team under the mentorship of Dr. Wei Li and Program Manager Anthony Chen to develop a mobile-based platform enabling autonomous microtransit services for underserved communities.
  • Designed and refined route optimization algorithms, improving path efficiency by ~23% based on simulated transport models and geospatial constraints.
  • Integrated real-time geolocation functionality into a Flutter-based mobile application, using the GeoLocator plugin and conducting feasibility evaluations across 3+ alternative plugin solutions.
  • Led research into real-time location update mechanisms, comparing HTTP vs. WebSocket architectures and advising on communication strategy for improved performance and reliability.
  • Proposed and helped implement backend support using AWS EC2 and PostGIS, optimizing server-side infrastructure to support scalable and secure tracking workflows across multiple device types.
  • Contributed to interface compatibility testing across iOS and Android platforms, ensuring seamless user experience in dynamic transit environments.
  • Participated in multiple technical review meetings, conducted codebase optimizations, and maintained CI/CD pipelines to ensure milestone delivery and cross-team alignment.
Amazon Web Services (AWS)SQLDatabase Management System (DBMS)Socket.ioMobile Application DevelopmentCloud Computing

Texas a&m university

2 roles

Graduate Assistant - Non-Teaching

Aug 2024 – May 2025 · 9 mos · College Station, Texas, United States · On-site

  • Spearheaded enhancements to the ISSS database portal, reducing student insurance refund processing time by over 50%, directly benefiting a user base of 10,000+ international students and scholars.
  • Conducted rigorous system testing and UI/data flow debugging, reducing error incidence by ~20% and improving the stability and reliability of student-facing services.
  • Designed, developed, and maintained ISSS website content, ensuring accessibility, compliance, and timely updates across 20+ core pages serving a global student audience.
  • Supported the maintenance and partial redesign of internal databases, optimizing 3+ operational workflows and improving access to student records and services.
  • Authored a comprehensive internal guide on prompt engineering and AI usage, streamlining document generation and enhancing staff productivity across communications.
  • Contributed to institutional projects supporting Texas A&M’s global engagement efforts, including orientation events, compliance updates, and reporting processes.
  • Participated in the full-cycle migration from Sunapsis to Via for case management, validating data integrity and minimizing disruption during the transition phase.
Prompt EngineeringMS ExcelDatabase Management System (DBMS)System TestingFront-End DevelopmentDatabase Management+1

Student Technician

Jun 2024 – Aug 2024 · 2 mos · College Station, Texas, United States · On-site

  • Contributed to improving content and user experience for the ISSS website, supporting ~10,000+ international students and scholars, and enhancing information accessibility.
  • Designed, contributed, and delivered 10+ promotional and informational posters used across campus offices and digital platforms, increasing student engagement in key programs by an estimated 10%.
  • Streamlined data workflows by updating and maintaining student records in internal databases, improving processing efficiency and reducing information retrieval time by ~10%.
MS ExcelFront-End DevelopmentDatabase Management System (DBMS)Web DevelopmentData Management

Texas a&m institute of data science

Capstone Project

Aug 2024 – Dec 2024 · 4 mos · College Station, Texas, United States · On-site

  • Pioneered a cutting-edge biomedical data science project focused on glioblastoma drug response modeling, analyzing 1,200+ small-molecule compounds using advanced cheminformatics approaches.
  • Achieved a 15.3% improvement in predictive accuracy over baseline models by implementing Graph Convolutional Networks (GCNs), Maximum Common Substructure (MCS) analysis, and similarity-based learning techniques.
  • Preprocessed and harmonized over 10,000 experimental datapoints from GDSC and COSMIC databases, mitigating data sparsity and inconsistency using three-tier imputation strategies and cross-dataset alignment algorithms.
  • Identified top 5 high-confidence drug candidates through pathway enrichment and sensitivity analysis, revealing actionable mechanisms linked to EGFR and VEGFR signaling pathways in glioblastoma.
  • Contributed to model innovation by integrating One-Class SVMs and surrogate modeling techniques for molecular property estimation, enabling high-efficiency screening with minimal computational overhead.
  • Participated in literature review and experimental design aligned with surrogate modeling frameworks, contributing to reproducibility and scalability in biomedical simulations.
Data SciencePython (Programming Language)Artificial Neural NetworksData AnalysisMachine Learning

M/s kamal kumar

Data Analyst & Technical Coordinator

Apr 2021 – Apr 2022 · 1 yr · Hazaribagh, Jharkhand, India · On-site

  • Reduced Optical Fiber Cable (OFC) downtime by 12% by analyzing fault logs and streamlining fault rectification procedures using structured data in MS Excel.
  • Led the rollout of FTTH (Fiber-to-the-Home) services across underserved zones, contributing to the connection of 1,500+ new customers in a single month and increasing revenue by ~12.8%.
  • Restored disrupted systems (10G, CPAN, MADM) and reduced average fault rectification time from 3 days to just 6 hours, using real-time diagnostic insights and historical fault data.
  • Coordinated daily splicing operations across 6+ regional teams, improving fault-handling efficiency and ensuring uninterrupted OFC link uptime across multiple districts.
  • Maintained detailed documentation for fiber faults, work logs, and splicing reports, contributing to a 25% faster incident response through standardized reporting workflows.
  • Authored technical letters, meeting minutes, and operational reports that improved intra-team communication and minimized delays in approval cycles.
  • Received 2 Letters of Appreciation for outstanding contributions in the FTTH and optical fiber maintenance domains, including record-setting installation performance.
Data AnalysisData VisualizationMS ExcelTechnical Coordination

Bharat sanchar nigam limited

Engineer Intern

Jul 2020 – Dec 2020 · 5 mos · Hazaribagh, Jharkhand, India · On-site

  • Collaborated within a 6-member technical team to implement FTTH (Fiber-to-the-Home) technology across 10+ residential and commercial zones, enhancing communication bandwidth and improving data speeds for 1,000+ end users.
  • Rectified Optical Fiber Cable (OFC) routes at over 27 locations, diagnosing and resolving fiber faults to restore disrupted connections and reduce signal loss across the network.
  • Assisted in configuring and maintaining transmission equipment, including OTDR, DTA Set, MADM modules, 10G routers, and splicing tools, gaining hands-on experience in high-speed optical data networks.
  • Prepared weekly reports and presentations detailing field progress, fault patterns, and resolution steps, contributing to better coordination between field engineers and supervisors.
  • Compiled and structured work implementation data using Excel and PowerPoint, improving clarity in documentation and communication with project leads.

Education

Texas A&M University

Master of Science - MS — Data Science

Aug 2023 – May 2025

International Institute of Information Technology Bangalore

Advanced Certificate Programme in Data Science — Data Science

May 2022 – Mar 2023

Indian Institute of Information Technology Senapati, Manipur

Bachelor of Technology - BTech — Electronics and Communications

Jul 2016 – Jun 2020

Stackforce found 100+ more professionals with Data Science & Machine Learning

Explore similar profiles based on matching skills and experience