Mahima T.

Data Engineer

Bangalore Urban, Karnataka, India · 4 yrs 4 mos experience

Key Highlights

  • Optimized data pipelines, reducing processing times significantly.
  • Published five peer-reviewed papers on predictive modeling.
  • Proficient in AWS and Azure data engineering tools.
Stackforce AI infers this person is a Data Engineering specialist with a focus on SaaS and Healthcare analytics.

Skills

Core Skills

Data Engineering, AWS, Data Analysis, Machine Learning

Other Skills

AWS CodeBuild, AWS CodePipeline, AWS Command Line Interface (CLI), AWS EMR, AWS Glue, AWS LakeFormation, AWS Lambda, AWS S3, AWS SageMaker, Amazon DynamoDB, Amazon EC2, Amazon Redshift, Amazon Relational Database Service (RDS), Amazon S3, Amazon Web Services (AWS)

About

TL;DR: Data Engineer (3 yrs) · Big Data 💙 · ☁️ AWS, Databricks, Snowflake layers. Making data pipelines faster and smarter. Predictive modeling researcher. Open to impactful data projects! Curious? Let's chat 🚀

Versatile, driven Data Engineer with years of exploring raw data and converting it into meaningful insights. I'm an ever-ready experimenter and collaborative architect of impactful data solutions, constantly exploring what's possible. Seeking a team that nurtures growth and values fresh perspectives in data innovation 🔎.

🧰 Tech Toolkit Highlights:

⚡ Code: Python (PySpark, pandas), SQL - clean, performant code for data manipulation and insights.

☁️ Cloud Platforms (AWS & Azure): Proficient in AWS (Glue, Redshift, EMR, Kinesis) and Azure (Data Factory, Synapse).

🔎 Data Engineering Suite: Databricks, Apache Spark ✨, Snowflake ❄️, Delta Lake, Apache Iceberg, Airflow (orchestration ⚙️), data warehousing, and lakehouse architectures: building and managing high-performance data infrastructure and pipelines for both streaming and batch data workflows.

🔥 Key Impacts:

💡 Performance Enhancement, Accelerating Data Pipeline Efficiency ⏱️: Optimized critical data migration processes, slashing processing times and significantly reducing resource consumption for both batch and near real-time data flows. 👩‍💻 ⚡

🎓 Research Quest: Authored 5 peer-reviewed research papers 🤓 on predictive modeling and machine learning applications in healthcare analytics, showcasing a blend of practical data science skills and research-backed insights (some citations, yay!). 🔬

🏗️ Dynamic and Collaborative: Thrive in challenging and innovative team environments. Seeking an environment where I can leverage my skills to build exceptional data solutions that make a real difference. 🤝

Experience

Sigmoid

Data Engineer

Apr 2025 - Present · 11 mos · Hybrid

Decision Foundry

Data Engineer

Mar 2024 - Apr 2025 · 1 yr 1 mo · Hybrid

  • ➡️ Developed a centralized AWS S3 data lake integrated with Redshift Spectrum, optimizing query performance by 30%. Automated data monitoring with CloudWatch, reducing pipeline failures by 80%, and implemented S3 lifecycle management for cost efficiency.
  • ➡️ Streamlined deployments and collaboration with AWS CodeBuild, AWS LakeFormation, Atlassian Jira, Confluence, and Docker, enhancing reliability and scalability. Created, managed, and supported workflows in Apache Airflow.
  • ➡️ Leveraged Databricks and PySpark for advanced data transformations, optimizing Redshift SQL queries to enable real-time business intelligence. Developed ETL workflows from raw to curated data layers for enhanced analytics.
AWS S3, Redshift Spectrum, CloudWatch, AWS CodeBuild, AWS LakeFormation, Atlassian Jira, +8

PCGI Consulting

Data Analyst

Jan 2022 - Jan 2024 · 2 yrs · Hybrid

  • ➡️ Refactored PySpark code on AWS EMR, cutting cost and time by 50%. Built and managed PySpark applications for large-scale data processing, enhancing Redshift and Snowflake ETL workflows.
  • ➡️ Led a PostgreSQL to Snowflake migration, achieving a 60% reduction in processing time for a critical long-running job.
  • ➡️ Developed Python-based scripts for anomaly detection and data validation, ensuring 99.9% data accuracy.
  • ➡️ Designed and deployed Tableau dashboards for KPI visualization, improving decision-making for stakeholders.
PySpark, AWS EMR, Redshift, Snowflake, PostgreSQL, Python, +3

SRM IST Chennai

Research Assistant

Feb 2021 - Jun 2021 · 4 mos

  • ➡️ Published five peer-reviewed papers on predictive modeling, leveraging TensorFlow, Keras, Random Forest, and DenseNet to build and optimize ML pipelines with 95% accuracy for unstructured healthcare data analysis.
  • 🎓 https://scholar.google.com/citations?user=y_lhPdUAAAAJ&hl=en
TensorFlow, Keras, Random Forest, DenseNet, Machine Learning

Education

SRM IST Chennai

Bachelor of Technology (BTech), ECE

Jan 2018 - Jan 2022

Chinmaya Vidyalaya

High School/Secondary Certificate Programs

Jan 2016 - Jan 2018
