Piyush Trivedi

Data Engineer

Noida, Uttar Pradesh, India3 yrs 9 mos experience
Highly StableAI Enabled

Key Highlights

  • Expert in building scalable ETL pipelines on GCP.
  • Proven track record in data migration projects.
  • Strong automation skills reducing manual efforts significantly.
Stackforce AI infers this person is a Cloud Data Engineer specializing in scalable ETL solutions and data migration.

Contact

Skills

Core Skills

Data EngineeringEtlPython DevelopmentAutomationData MigrationCloud Engineering

Other Skills

MySQLData ManipulationApache AirflowPower BIPython (Programming Language)PuttyPythonStorage SystemsMicrosoft AzureBigQueryDataflowCloud StoragePub/SubAdvance pythonTkinter

About

🚀 GCP & Azure Certified | Python Developer • Data Engineer | BigQuery • Dataproc • Dataflow • Airflow • PySpark • SQL • Big Data I'm Piyush Trivedi, a Python Developer & Data Engineer with 3.5+ years of experience building cloud-native ETL pipelines, automation frameworks, and performance-optimized backend systems across enterprise-scale environments. At TCS, I have contributed to two major enterprise projects—NetApp (Python automation & Data Engineering) and HSBC Migrata (GCP Data Migration & ETL Engineering). My work spans automation, data validation, large-scale migrations, distributed processing, and storage optimization. 🔹 What I Do Build scalable ETL/ELT pipelines using BigQuery, Dataproc (PySpark), Dataflow, Pub/Sub, GCS, Cloud Composer Design Python automation frameworks for log parsing, debugging, workflow execution, and system optimization Migrate large enterprise datasets (70+ TB) from Teradata/on-prem to BigQuery with full validation & reconciliation Optimize storage performance across WAFL, NAS, SAN, improving storage utilization by 20% Reduce manual triaging time significantly (up to 86%) using Python automation Improve system reliability through CI/CD, Docker, Kubernetes, Terraform, Jenkins, and Git-based workflows 🔹 Tech Skills Python, SQL, PySpark, Airflow, Dataproc, Dataflow, BigQuery, Pub/Sub, GCS, Docker, Kubernetes, Terraform, Jenkins, Linux, Hadoop, Shell, REST APIs, Storage Systems (WAFL/NAS/SAN) 🔹 Projects I Delivered NetApp – Cloud & Automation Python automation • PySpark jobs • Storage optimization • CI/CD integration • Debugging & performance tuning HSBC – Migrata Teradata → BigQuery migration • Dataproc pipelines • Composer orchestration • Data validation • Query optimization Intelligent Ticket Categorization (ML) Python NLP • GCP Cloud Functions • BigQuery • Automated triage routing 🔹 Certifications Google Cloud Professional Data Engineer AWS Certified Data Engineer – Associate Microsoft Certified Data Engineer Associate 💡 Let’s connect if you’re hiring or building: Cloud Data Engineering solutions (GCP preferred) Python automation & backend systems Big Data & distributed data pipelines High-performance, scalable data platforms 📬 Email: piyushtrivedi461@gmail.com

Experience

3 yrs 9 mos
Total Experience
3 yrs 9 mos
Average Tenure
3 yrs 9 mos
Current Experience

Tata consultancy services

3 roles

System Engineer

Promoted

Apr 2024 – Present · 2 yrs 1 mo

  • Designed and implemented a data pipeline that extracted data from MySQL, performed data manipulation and transformation using SQL and Python, and loaded the refined data into a centralized warehouse. Utilized Apache Airflow for orchestration and scheduling of ETL workflows under Agile methodologies. Conducted exploratory data analysis (EDA) and created interactive dashboards in Power BI to deliver actionable insights. Applied data storytelling techniques to present findings to business stakeholders, driving data-informed decision-making. Worked alongside data architects to align pipelines with organizational data models and ensure data consistency across systems.
MySQLData ManipulationApache AirflowPower BIData EngineeringETL

Assistant System Engineer

Jul 2023 – Mar 2024 · 8 mos

  • Served as a core Python Developer in the NetApp storage domain, focusing on scalable backend module development and automation tools to improve system performance and operational efficiency.
  • Engineered Python-based solutions for test case triaging, log parsing, and storage diagnostics, minimizing manual intervention significantly.
  • Designed automation frameworks supporting NAS/FAS volumes, CIFS/NFS protocols, and CloudOps use cases.
  • Leveraged expertise in UNIX systems to integrate tools like Jira, Docker, and Terraform, ensuring the creation and maintenance of production-grade Python utilities.
Python (Programming Language)PuttyPython DevelopmentAutomation

Assistant System Engineer Trainee

Jul 2022 – Jul 2023 · 1 yr

  • GCP Data Engineer – Data Migration Project:
  • Led comprehensive data migration from Teradata to BigQuery, leveraging GCP tools including BigQuery, Dataflow, Cloud Storage, and Pub/Sub to boost data processing efficiency and reduce migration time by 25%.
  • Designed and managed scalable ETL pipelines orchestrated with Airflow, ensuring seamless and reliable data transfers with complete auditability.
  • Collaborated with cross-functional teams to refine ingestion workflows and optimize SQL queries, achieving improved performance and cost efficiency.
  • Developed robust Python-based automation scripts for data validation, schema alignment, and transformation logic, significantly reducing manual errors and rework.
  • Played a pivotal role in architectural planning, pre-production validations, and production cutovers, delivering timely and high-quality outcomes.
  • Enhanced expertise in GCP IAM, Terraform, and cloud-native DevOps practices, contributing to the success of a large-scale cloud transformation initiative.
Python (Programming Language)Microsoft AzureData MigrationCloud Engineering

Ethical edufabrica pvt. ltd

Python Developer

May 2021 – Oct 2021 · 5 mos · Remote

  • Completed a 6-month internship as a Python Developer with Ethical Edufabrica Pvt. Ltd. in association with SpringFest, IIT Kharagpur. Developed a GUI-based banking system using Tkinter with voice-enabled features using NLP. Integrated file handling for data management and enhanced performance using Pandas, NumPy, and OpenCV.
Python (Programming Language)Advance pythonPython Development

Education

Dr. A.P.J. Abdul Kalam Technical University

Bachelor of Technology - BTech — Computer Science

Aug 2018 – Jul 2022

ALLENHOUSE INSTITUTE OF TECHNOLOGY, KANPUR

B.Tech — Computer Science

Jan 2018 – Jan 2022

Vidya Niketan Inter College,Kanpur

Higher Secondary Education — Science

Jan 2015 – Jan 2017

Stackforce found 100+ more professionals with Data Engineering & Etl

Explore similar profiles based on matching skills and experience