Maxil Dourave — Product Manager

I build modern, cloud-native data platforms that turn raw data into fast, reliable, production-grade insights. With 7+ years of experience across AWS, Azure, Databricks, PySpark, and large-scale ETL modernization, I specialize in designing end-to-end data pipelines that improve performance, scalability, and data quality. My journey started when I re-engineered a slow, repetitive ETL job and cut its runtime from 4 hours to under 1 hour. That project sparked my focus on building efficient, automated data systems. Since then, I’ve migrated 40+ SAS workflows to PySpark, delivered multi-layer data lakes, optimized Glue/EMR pipelines, and built AI-ready pipelines processing millions of events daily. I work across the full data lifecycle—architecture, ingestion, transformation, orchestration, governance, and optimization—ensuring systems are reliable, cost-efficient, and easy to scale. I collaborate closely with product owners, data scientists, and engineering teams to align solutions with business goals and deliver measurable impact. If you’re building next-generation cloud data platforms or accelerating AI and automation initiatives, let’s connect. I’m open to global opportunities and open to relocate.

Stackforce AI infers this person is a Data Engineering expert specializing in cloud-native solutions and machine learning applications.

Location: Kochi, Kerala, India

Experience: 9 yrs 11 mos

Skills

Data Engineering
Cloud Architecture
Etl Modernization
Data Pipeline Design
Data Migration
Data Lake Architecture
Etl Development
Data Preparation
Pipeline Design
Machine Learning
Data Pipeline Development
Microservices Development
Erp Development
Web Development

Career Highlights

Architected enterprise-grade ETL pipelines across multi-cloud environments.
Migrated over 40 SAS workflows to PySpark, enhancing performance.
Designed AI-ready data pipelines processing millions of events daily.

Work Experience

Tata Consultancy Services

Lead Technical Specialist — Data Engineering (10 mos)

Tata Consultancy Services

Lead Technical Specialist (8 mos)

Technical lead - Data Engineer (1 yr 10 mos)

Big data Engineer (2 yrs 2 mos)

Data Engineer (Azure + Databricks) (8 mos)

Jr Machine learning Enginner (3 mos)

Apps Team Technologies, Pvt. Ltd

Machine Learning Developer (2 mos)

Python Odoo Developer (1 yr)

Freelance

Associate Machine Learning Engineer (4 mos)

Indian Servers - Software Development Company

Machine Learning Intern (2 mos)

Self-employed

Software Engineer (1 yr 11 mos)

Other

Computer Hardware Engineer (1 yr 4 mos)

Island fabrication

Metal fabrication (1 yr 11 mos)

Education

Master of Computer Applications - MCA with specialized cyber security at MGUniversity

Bachelor's degree at Mahatma Gandhi University

Maxil Dourave

Product Manager

Kochi, Kerala, India9 yrs 11 mos experience

Most Likely To SwitchAI Enabled

Key Highlights

Architected enterprise-grade ETL pipelines across multi-cloud environments.
Migrated over 40 SAS workflows to PySpark, enhancing performance.
Designed AI-ready data pipelines processing millions of events daily.

Stackforce AI infers this person is a Data Engineering expert specializing in cloud-native solutions and machine learning applications.

Contact

Skills

Core Skills

Data EngineeringCloud ArchitectureEtl ModernizationData Pipeline DesignData MigrationData Lake ArchitectureEtl DevelopmentData PreparationPipeline DesignMachine LearningData Pipeline DevelopmentMicroservices DevelopmentErp DevelopmentWeb Development

Other Skills

ADBADFAJAXAWS GlueAWS SQS/SNSAWS SageMakerAgile MethodologiesAgile Project ManagementAmazon DynamodbAmazon RedshiftApache SparkArtificial Intelligence (AI)AthenaAtlassian BambooBigQuery APIs

About

Experience

9 yrs 11 mos

Total Experience

2 yrs

Average Tenure

5 yrs 1 mo

Current Experience

Tata consultancy services

Lead Technical Specialist — Data Engineering

Aug 2025 – Present · 10 mos · Kochi, Kerala, India · Hybrid

Architected enterprise-grade ETL pipelines using AWS Glue, Lambda, S3, Redshift and GCP (Databricks + BigQuery APIs), enabling multi-cloud batch and real-time processing.
Implemented Lakehouse architecture on Databricks, improving data reliability, lineage tracking, and analytical scalability.
Designed event-driven streaming pipelines using Kafka, AWS SQS/SNS, ensuring low-latency ingestion and guaranteed event delivery.
Established data governance frameworks with RBAC, metadata management, and audit controls across all layers.
Led on-prem-to-cloud migrations with zero downtime, coordinating with product owners and external vendors.

AWS GlueLambdaS3RedshiftGCPDatabricks+5

Tata consultancy services

5 roles

Lead Technical Specialist

Nov 2024 – Jul 2025 · 8 mos

Designed scalable ETL/ELT pipelines using Glue, S3, Athena, and PySpark for CRM modernization.
Migrated 50+ EMR batch jobs to Glue, improving performance by 20% and reducing infra overhead.
Migrated 10+ Boomi batch jobs and 10+ Boomi CDC workflows to Lambda & Glue for improved maintainability.
Built Splunk dashboards for ETL performance monitoring (job duration, throughput, error trends).
Implemented IAM, CloudFormation, and secured provisioning across all data workloads.

GlueS3AthenaPySparkIAMCloudFormation+3

Technical lead - Data Engineer

Promoted

Jun 2023 – Apr 2025 · 1 yr 10 mos

Modernized legacy SAS systems into AWS using PySpark, EMR, Hive, and S3-based data lake architecture.
Designed ingestion, transformation, and curation layers with Glue, Lambda, and Redshift.
Reduced ETL execution time by implementing partitioning, caching, and Spark optimization techniques.
Managed data lineage, schema evolution, governance, and CI/CD across environments.
Worked with Agile, Bitbucket, Bamboo, and Jira for delivery and release management.

PySparkEMRHiveS3GlueLambda+3

Big data Engineer

Aug 2021 – Oct 2023 · 2 yrs 2 mos

Developed PoC solutions for client use cases using PySpark, AWS Glue, EMR, and Lambda.
Built ETL pipelines to prepare data for Data Science and BI teams, improving downstream accuracy and reliability.
Implemented Spark optimization (broadcast joins, caching, bucketing) for 30%+ faster pipelines.

PySparkAWS GlueEMRLambdaSpark optimizationETL Development+1

Data Engineer (Azure + Databricks)

Promoted

Apr 2021 – Dec 2021 · 8 mos

Designed Lakehouse architecture (Bronze/Silver/Gold) improving data traceability and reusability.
Built scalable pipelines using ADB, ADF, PySpark, Delta Lake, integrating data from CRM, claims, and finance.
Improved processing speed by 30% using Spark optimization strategies.
Created ingestion framework for full-load & incremental loads, reducing latency by 40%.

ADBADFPySparkDelta LakeData Lake ArchitecturePipeline Design

Jr Machine learning Enginner

Apr 2021 – Jul 2021 · 3 mos

Developed end-to-end ML data pipelines including data collection, preprocessing, feature extraction, and model inference using Python, Pandas, and R.
Built reusable data preparation workflows that fed both Machine Learning and Data Engineering pipelines, ensuring clean, structured datasets for downstream analytics.
Implemented predictive model prototypes and statistical analysis workflows, enabling early insights for business use cases.
Integrated model outputs into lightweight microservices and dashboards, supporting real-time decision workflows.
Worked with Azure, AWS, and data lake storage systems to prepare and manage training data and experimentation datasets.

PythonPandasRFlaskMachine LearningData Pipeline Development

Apps team technologies, pvt. ltd

2 roles

Machine Learning Developer

Promoted

Jan 2021 – Mar 2021 · 2 mos

PythonOCRFlaskMachine LearningMicroservices Development

Python Odoo Developer

Mar 2020 – Mar 2021 · 1 yr

Odoo ERP Customization , developed modules for solving various business problems.
Experience in Sales,Purchase,Inventory,Accounting Modules.

PythonOCRMatplotlibMachine Learning

Freelance

Associate Machine Learning Engineer

Oct 2020 – Feb 2021 · 4 mos

OCR with Google vision using Python 3.6 and deployed on as microservice.

OdooPythonJavaERP Development

Indian servers - software development company

Machine Learning Intern

Jun 2020 – Aug 2020 · 2 mos · India

Self-employed

Software Engineer

Mar 2018 – Feb 2020 · 1 yr 11 mos · India

Based on the client requirement Developing API Web development and web application using Python Odoo, Django, Flask frameworks. And ETL and data analysis sing python and Odoo 13.

PythonOdooDjangoWeb Development