Mahima T. — Data Engineer

TLDR; Data Engineer (3yrs) Big Data💙 ☁️AWS, Databricks, Snowflake layers. Making data pipelines faster, smarter.Predictive modeling researcher. Open to impactful data projects! curious? Let's chat 🚀 Versatile ,driven Data Engineer with years of exploring , converting raw data into meaningful insights. I’m an ever-ready experimenter and collaborative architect of impactful data solutions, constantly exploring what’s possible. Seeking a team that nurtures growth and values fresh perspectives in data innovation 🔎 . 🧰Tech Toolkit Highlights: ⚡Code: Python (pyspark, pandas), SQL - Clean, performant code for data manipulation and insights. ☁️ Cloud Platforms (AWS & Azure): Proficient in AWS (Glue, Redshift, EMR, Kinesis) and Azure (Data Factory, Synapse) 🔎Data Engineering Suite: Databricks , Apache Spark ✨, Snowflake ❄️, Delta Lake , Apache Iceberg , Airflow - Orchestration ⚙️ , data warehousing , and lakehouse architectures :building and managing high-performance data infrastructure and pipelines for both streaming and batch data workflows. 🔥 Key Impacts : 💡Performance Enhancement: Accelerating Data Pipeline Efficiency ⏱️ : Optimized critical data migration processes, slashing processing times and significantly reducing resource consumption. Fueling for both batch and near real-time data flows. 👩‍💻 ⚡ 🎓Research Quest : Authored 5 peer-reviewed research papers 🤓 on predictive modeling and machine learning applications in healthcare analytics, that showcase a blend of practical data science skills with some research-backed insights . ( some citations say , yay !). 🔬 🏗️ Dynamic and Collaborative : Thrive in challenging and innovative team environments, Seeking an environment where I can leverage my skills to build exceptional data solutions that make a real difference. 🤝

Stackforce AI infers this person is a Data Engineering specialist with a focus on SaaS and Healthcare analytics.

Location: Bangalore Urban, Karnataka, India

Experience: 4 yrs 4 mos

Skills

Data Engineering
Aws
Data Analysis
Machine Learning

Career Highlights

Optimized data pipelines, reducing processing times significantly.
Published five peer-reviewed papers on predictive modeling.
Proficient in AWS and Azure data engineering tools.

Work Experience

Sigmoid

Data Engineer (11 mos)

Decision Foundry

Data Engineer (1 yr 1 mo)

PCGI Consulting

Data Analyst (2 yrs)

SRM IST Chennai

Research Assistant (4 mos)

Education

Bachelor of Technology - BTech at SRM IST Chennai

High School/Secondary Certificate Programs at Chinmaya Vidyalaya

Mahima T.

Data Engineer

Bangalore Urban, Karnataka, India4 yrs 4 mos experience

Key Highlights

Optimized data pipelines, reducing processing times significantly.
Published five peer-reviewed papers on predictive modeling.
Proficient in AWS and Azure data engineering tools.

Stackforce AI infers this person is a Data Engineering specialist with a focus on SaaS and Healthcare analytics.

Contact

Skills

Core Skills

Data EngineeringAwsData AnalysisMachine Learning

Other Skills

AWS CodeBuildAWS CodePipelineAWS Command Line Interface (CLI)AWS EMRAWS GlueAWS LakeFormationAWS LambdaAWS S3AWS SageMakerAmazon DynamodbAmazon EC2Amazon RedshiftAmazon Relational Database Service (RDS)Amazon S3Amazon Web Services (AWS)

About

Experience

Sigmoid

Data Engineer

Apr 2025 – Present · 11 mos · Hybrid

Decision foundry

Data Engineer

Mar 2024 – Apr 2025 · 1 yr 1 mo · Hybrid

➡️ Developed a centralized AWS S3 data lake integrated with Redshift Spectrum,
optimizing query performance by 30%. Automated data monitoring with
CloudWatch, reducing pipeline failures by 80%, and implemented S3 lifecycle
management for cost efficiency.
➡️ Streamlined deployments and collaboration with AWS CodeBuild, AWS
LakeFormation, Atlassian Jira, Confluence, and Docker, enhancing reliability
and scalability. Created, managed, and supported workflows in Apache Airflow.
➡️ Leveraged Databricks and PySpark for advanced data transformations,
optimizing Redshift SQL queries to enable real-time business intelligence.
Developed ETL workflows from raw to curated data layers for enhanced
analytics.

AWS S3Redshift SpectrumCloudWatchAWS CodeBuildAWS LakeFormationAtlassian Jira+8

Pcgi consulting

Data Analyst

Jan 2022 – Jan 2024 · 2 yrs · Hybrid

➡️ Refactored PySpark code on AWS EMR, cutting cost and time by 50%. Built and
managed PySpark applications for large-scale data processing, enhancing
Redshift and Snowflake ETL workflows
➡️ Led Postgres SQL to Snowflake migration, achieving a 60% reduction in
processing time for a critical long-running job.
➡️ Developed Python-based scripts for anomaly detection and data validation,
ensuring 99.9% data accuracy.
➡️ Designed and deployed Tableau dashboards for KPI visualization, improving
decision-making for stakeholders.

PySparkAWS EMRRedshiftSnowflakePostgres SQLPython+3

Srm ist chennai

Research Assistant

Feb 2021 – Jun 2021 · 4 mos

➡️Published five peer-reviewed papers on predictive modeling,
leveraging TensorFlow, Keras, Random Forest, and DenseNet to build and optimize ML
pipelines with 95% accuracy for unstructured healthcare data analysis.
🎓https://scholar.google.com/citations?user=y_lhPdUAAAAJ&hl=en

TensorFlowKerasRandom ForestDenseNetMachine Learning