Maxil Dourave

Product Manager

Kochi, Kerala, India9 yrs 11 mos experience
Most Likely To SwitchAI Enabled

Key Highlights

  • Architected enterprise-grade ETL pipelines across multi-cloud environments.
  • Migrated over 40 SAS workflows to PySpark, enhancing performance.
  • Designed AI-ready data pipelines processing millions of events daily.
Stackforce AI infers this person is a Data Engineering expert specializing in cloud-native solutions and machine learning applications.

Contact

Skills

Core Skills

Data EngineeringCloud ArchitectureEtl ModernizationData Pipeline DesignData MigrationData Lake ArchitectureEtl DevelopmentData PreparationPipeline DesignMachine LearningData Pipeline DevelopmentMicroservices DevelopmentErp DevelopmentWeb Development

Other Skills

ADBADFAJAXAWS GlueAWS SQS/SNSAWS SageMakerAgile MethodologiesAgile Project ManagementAmazon DynamodbAmazon RedshiftApache SparkArtificial Intelligence (AI)AthenaAtlassian BambooBigQuery APIs

About

I build modern, cloud-native data platforms that turn raw data into fast, reliable, production-grade insights. With 7+ years of experience across AWS, Azure, Databricks, PySpark, and large-scale ETL modernization, I specialize in designing end-to-end data pipelines that improve performance, scalability, and data quality. My journey started when I re-engineered a slow, repetitive ETL job and cut its runtime from 4 hours to under 1 hour. That project sparked my focus on building efficient, automated data systems. Since then, I’ve migrated 40+ SAS workflows to PySpark, delivered multi-layer data lakes, optimized Glue/EMR pipelines, and built AI-ready pipelines processing millions of events daily. I work across the full data lifecycle—architecture, ingestion, transformation, orchestration, governance, and optimization—ensuring systems are reliable, cost-efficient, and easy to scale. I collaborate closely with product owners, data scientists, and engineering teams to align solutions with business goals and deliver measurable impact. If you’re building next-generation cloud data platforms or accelerating AI and automation initiatives, let’s connect. I’m open to global opportunities and open to relocate.

Experience

9 yrs 11 mos
Total Experience
2 yrs
Average Tenure
5 yrs 1 mo
Current Experience

Tata consultancy services

Lead Technical Specialist — Data Engineering

Aug 2025Present · 10 mos · Kochi, Kerala, India · Hybrid

  • Architected enterprise-grade ETL pipelines using AWS Glue, Lambda, S3, Redshift and GCP (Databricks + BigQuery APIs), enabling multi-cloud batch and real-time processing.
  • Implemented Lakehouse architecture on Databricks, improving data reliability, lineage tracking, and analytical scalability.
  • Designed event-driven streaming pipelines using Kafka, AWS SQS/SNS, ensuring low-latency ingestion and guaranteed event delivery.
  • Established data governance frameworks with RBAC, metadata management, and audit controls across all layers.
  • Led on-prem-to-cloud migrations with zero downtime, coordinating with product owners and external vendors.
AWS GlueLambdaS3RedshiftGCPDatabricks+5

Tata consultancy services

5 roles

Lead Technical Specialist

Nov 2024Jul 2025 · 8 mos

  • Designed scalable ETL/ELT pipelines using Glue, S3, Athena, and PySpark for CRM modernization.
  • Migrated 50+ EMR batch jobs to Glue, improving performance by 20% and reducing infra overhead.
  • Migrated 10+ Boomi batch jobs and 10+ Boomi CDC workflows to Lambda & Glue for improved maintainability.
  • Built Splunk dashboards for ETL performance monitoring (job duration, throughput, error trends).
  • Implemented IAM, CloudFormation, and secured provisioning across all data workloads.
GlueS3AthenaPySparkIAMCloudFormation+3

Technical lead - Data Engineer

Promoted

Jun 2023Apr 2025 · 1 yr 10 mos

  • Modernized legacy SAS systems into AWS using PySpark, EMR, Hive, and S3-based data lake architecture.
  • Designed ingestion, transformation, and curation layers with Glue, Lambda, and Redshift.
  • Reduced ETL execution time by implementing partitioning, caching, and Spark optimization techniques.
  • Managed data lineage, schema evolution, governance, and CI/CD across environments.
  • Worked with Agile, Bitbucket, Bamboo, and Jira for delivery and release management.
PySparkEMRHiveS3GlueLambda+3

Big data Engineer

Aug 2021Oct 2023 · 2 yrs 2 mos

  • Developed PoC solutions for client use cases using PySpark, AWS Glue, EMR, and Lambda.
  • Built ETL pipelines to prepare data for Data Science and BI teams, improving downstream accuracy and reliability.
  • Implemented Spark optimization (broadcast joins, caching, bucketing) for 30%+ faster pipelines.
PySparkAWS GlueEMRLambdaSpark optimizationETL Development+1

Data Engineer (Azure + Databricks)

Promoted

Apr 2021Dec 2021 · 8 mos

  • Designed Lakehouse architecture (Bronze/Silver/Gold) improving data traceability and reusability.
  • Built scalable pipelines using ADB, ADF, PySpark, Delta Lake, integrating data from CRM, claims, and finance.
  • Improved processing speed by 30% using Spark optimization strategies.
  • Created ingestion framework for full-load & incremental loads, reducing latency by 40%.
ADBADFPySparkDelta LakeData Lake ArchitecturePipeline Design

Jr Machine learning Enginner

Apr 2021Jul 2021 · 3 mos

  • Developed end-to-end ML data pipelines including data collection, preprocessing, feature extraction, and model inference using Python, Pandas, and R.
  • Built reusable data preparation workflows that fed both Machine Learning and Data Engineering pipelines, ensuring clean, structured datasets for downstream analytics.
  • Implemented predictive model prototypes and statistical analysis workflows, enabling early insights for business use cases.
  • Integrated model outputs into lightweight microservices and dashboards, supporting real-time decision workflows.
  • Worked with Azure, AWS, and data lake storage systems to prepare and manage training data and experimentation datasets.
PythonPandasRFlaskMachine LearningData Pipeline Development

Apps team technologies, pvt. ltd

2 roles

Machine Learning Developer

Promoted

Jan 2021Mar 2021 · 2 mos

PythonOCRFlaskMachine LearningMicroservices Development

Python Odoo Developer

Mar 2020Mar 2021 · 1 yr

  • Odoo ERP Customization , developed modules for solving various business problems.
  • Experience in Sales,Purchase,Inventory,Accounting Modules.
PythonOCRMatplotlibMachine Learning

Freelance

Associate Machine Learning Engineer

Oct 2020Feb 2021 · 4 mos

  • OCR with Google vision using Python 3.6 and deployed on as microservice.
OdooPythonJavaERP Development

Indian servers - software development company

Machine Learning Intern

Jun 2020Aug 2020 · 2 mos · India

Self-employed

Software Engineer

Mar 2018Feb 2020 · 1 yr 11 mos · India

  • Based on the client requirement Developing API Web development and web application using Python Odoo, Django, Flask frameworks. And ETL and data analysis sing python and Odoo 13.
PythonOdooDjangoWeb Development

Other

Computer Hardware Engineer

May 2016Sep 2017 · 1 yr 4 mos · Kochi, Kerala, India · On-site

  • Trouble desktop and laptop

Island fabrication

Metal fabrication

Feb 2014Jan 2016 · 1 yr 11 mos

Education

MGUniversity

Master of Computer Applications - MCA with specialized cyber security — Computer Science

Jan 2016Jan 2020

Mahatma Gandhi University

Bachelor's degree — Electrical and Electronics Engineering

Jan 2013Jan 2016

Stackforce found 100+ more professionals with Data Engineering & Cloud Architecture

Explore similar profiles based on matching skills and experience