๐‘๐š๐ฃ๐ž๐ฌ๐ก ๐๐š๐ฆ๐ฎ๐ฃ๐ฎ๐ฅ๐š

Data Engineer

Bengaluru, Karnataka, India13 yrs experience
Most Likely To SwitchAI Enabled

Key Highlights

  • Reduced MarTech pipeline runtime by 60%.
  • Saved over $1M annually through DaaS platforms.
  • Pioneered AI Agent monitoring system for data quality.
Stackforce AI infers this person is a Big Data and Cloud Engineering expert specializing in AI-driven data solutions.

Contact

Skills

Core Skills

Big Data EngineeringCloud Data EngineeringBig Data SolutionsData EngineeringData Pipeline DevelopmentData ManagementSoftware DevelopmentApi Development

Other Skills

ADLSADLS Gen2AlgorithmsAnalyticsApache KafkaApache SparkAutomationAzure Data FactoryAzure DatabricksBusiness RequirementsClean CodingCommunicationDSAData CollectionData Governance

About

Lead Data Engineer | Cloud & Big Data Leader | Driving GenAI & AI Agent Innovation in Data Platforms Seasoned Lead Data Engineer with 12+ years of experience architecting and scaling Big Data platforms, real-time streaming pipelines, and cloud-native ETL systems. Expert in transforming raw, high-volume data into trusted, actionable insights that accelerate decision-making and business growth while delivering millions in cost savings through optimized pipelines and cloud efficiencies. Passionate about embedding Generative AI and AI Agents into the modern data stackโ€”leveraging LLMs and autonomous agents for: Data quality validation & anomaly detection Schema evolution & automated ETL optimization Intelligent metadata management & AI-driven analytics What I Bring Big Data & Python Expertise: PySpark (SQL, Structured Streaming, ML), Python (Pandas, NumPy, Scikit-learn) Cloud & Data Stack: Azure Data Factory, Databricks, Delta Lake, Snowflake, Azure Functions Modern Engineering: FastAPI, MongoDB, GitHub, Splunk, CI/CD pipelines, Data-as-a-Service (DaaS) ML, Analytics & AI Agents: EDA, Statistics, ML models, AI-driven automation agents for monitoring, governance, and pipeline optimization BI & Visualization: Power BI (DAX, data modeling, interactive dashboards) Key Achievements Optimized MarTech clickstream pipeline โ†’ reduced runtime from 12+ hrs to <2 hrs, cutting cloud compute costs by 60% while boosting performance Built enterprise-grade DaaS platforms and SCD Type 2 pipelines, enabling multi-source integration and saving $1M+ annually in manual processing & rework Delivered production-grade global data solutions, reducing technical debt by 40% and cutting operational costs by 25% Pioneered AI Agentโ€“based monitoring system โ†’ proactive data quality checks & automated remediation, preventing ~$500K/year in downstream data errors What Drives Me Building AI-augmented, future-ready data ecosystems, mentoring high-performing teams, and solving complex data challenges that unlock innovation, efficiency, and measurable business value.

Experience

Lululemon

Data Engineer III

Mar 2022 โ€“ Present ยท 4 yrs ยท Bengaluru, Karnataka, India ยท Hybrid

  • Developed and optimized data pipelines using Pyspark and Snowflake to enhance data processing efficiency.
  • Implemented machine learning applications on Azure Data Factory and Databricks for advanced analytics.
  • Utilized Delta tables and ADLS Gen2 to improve data storage and retrieval processes.
PysparkSnowflakeAzure Data FactoryDatabricksDelta tablesADLS Gen2+2

Mphasis

Module Lead Engineer - Data

Apr 2019 โ€“ Mar 2022 ยท 2 yrs 11 mos ยท Bangalore ยท Hybrid

  • Developed and implemented Big Data solutions using Spark, Kafka, HDFS, YARN, and Databricks.
  • Expertise in developing Apache Spark jobs with Spark structured streaming, Spark SQL, Spark ML, Python, and Scala.
  • Analyzed complex, high-volume data from various sources and big data sources.
SparkKafkaHDFSYARNDatabricksPython+3

Accenture

Senior Software Engineer

Jan 2018 โ€“ Apr 2019 ยท 1 yr 3 mos ยท Bangalore ยท On-site

  • Developed and monitored data pipelines using Hadoop, Spark, and Hive for efficient data processing.
  • Utilized Scala and Python programming languages to optimize data processing and analysis.
  • Collaborated with cross-functional teams to implement innovative solutions for data management and analysis.
HadoopSparkHiveScalaPythonData Pipeline Development+1

Maveric systems limited

Software Engineer

Apr 2013 โ€“ Jan 2018 ยท 4 yrs 9 mos ยท Chennai, Tamil Nadu, India ยท On-site

  • I have extensive experience developing Rest API, UI, and Mobile Automation Frameworks using Python and Java. In addition, I have strong hands-on experience with API Test and ETL test automation using open source tools.
REST APIsPythonJavaETL test automationSoftware DevelopmentAPI Development

Education

TRR

Bachelor of Engineering (B.E.) โ€” Electrical and Electronics Engineering

Jan 2008 โ€“ Jan 2012

iNeuron

Master's degree โ€” Machine learning

Oct 2021 โ€“ May 2022

Stackforce found 45 more professionals with Big Data Engineering & Cloud Data Engineering

Explore similar profiles based on matching skills and experience