Anudeep-Anu-Reddy P

Product Manager

San Antonio, Texas, United States8 yrs 3 mos experience

Key Highlights

  • Led migration of 120+ legacy pipelines to Snowflake.
  • Achieved 25% cost savings and 60% faster processing.
  • Designed GDPR-compliant data governance frameworks.
Stackforce AI infers this person is a Data Engineering expert in Fintech and SaaS industries.

Contact

Skills

Core Skills

Data ArchitectureSnowflakeData GovernanceMachine LearningData Engineering

Other Skills

PySparkAmazon Elastic MapReduce (EMR)Python (Programming Language)Data Governance & Quality FrameworksAmazon Web Services (AWS)SQLGitHub ActionsAirflowJiraSnowflake CloudAWSSnowParkCI/CDMicrosoft ExcelMATLAB

About

I am a Senior Data Engineer and Cloud Data Architect with over ten years of experience building, modernizing, and optimizing large-scale data ecosystems across AWS, GCP, and Databricks. My expertise lies in Snowflake migrations, PySpark orchestration, and data platform modernization helping enterprises improve scalability, performance, and governance while reducing operational costs. At PenFed Credit Union, I led the migration of 120+ legacy pipelines from Hadoop, SQL Server, and SAS to Snowflake, achieving 25% cost savings and 60% faster processing. I have built and optimized PySpark and SnowPark pipelines handling 2TB+ of weekly data and designed GDPR-compliant, audit-ready data governance frameworks for 500+ Snowflake tables. I am passionate about designing reliable, secure, and high-performance data platforms with strong observability, lineage tracking, and CI/CD automation. My focus is always on creating systems that enable analytics, AI, and data-driven decision making at scale. Core Expertise: Snowflake | PySpark | AWS | Databricks | Airflow | Data Architecture | ETL Modernization | Data Governance | Streaming Pipelines | Cloud Migrations | DataOps | CI/CD | SQL | Python

Experience

8 yrs 3 mos
Total Experience
2 yrs
Average Tenure
--
Current Experience

Penfed credit union

Colud Solution Architect / Senior Data Engineer ( HCL America)

Apr 2023Present · 3 yrs 1 mo · San Antonio, TX · On-site

  • ● Led migration of 120+ Hadoop, SQL Server, and SAS pipelines to Snowflake on AWS using PySpark orchestration; achieved 25% cost savings and 60% faster processing.
  • ● Built 50+ SnowPark and PySpark pipelines handling 2TB+/week ingestion with schema enforcement and observability.
  • ● Designed a GDPR-compliant governance layer for 500+ Snowflake tables, embedding lineage and CI/CD controls.
  • ● Refactored SSIS and SAS workloads into Airflow + SnowSQL DAGs, reducing runtimes from 8h to 10min.
  • ● Vintage Deposit Pipeline: Automated cohort creation with Snowflake Tasks + JS procs, cutting cycle time from 1 week to 2 hours.
  • ● Black Knight SCD2 Pipeline: Built PySpark + SnowSQL orchestration for 417+ tables, reducing runtimes from 6h to 2h.
  • ● Mentored 9 engineers and introduced SAFe Agile, reducing deployment errors by 40%.
PySparkAmazon Elastic MapReduce (EMR)Data ArchitecturePython (Programming Language)Data GovernanceData Governance & Quality Frameworks+7

Iconnections

Data Engineer

Jul 2022Aug 2022 · 1 mo · Philadelphia, Pennsylvania, United States · Remote

Microsoft SQL ServerPandasSeabornPython (Programming Language)Data ModelingData Governance+5

Michigan technological university

Research Analyst

Sep 2021Dec 2023 · 2 yrs 3 mos · Houghton, Michigan, United States · Remote

  • ● Built U-Net models for NASA landslide detection, improving IoU accuracy from 64% to 72%.
  • ● Automated grading pipelines for 30+ students (3 days to 5 minutes).
  • ● Designed a Genetic Algorithm–based DAG generator improving model accuracy across 30 datasets.
Microsoft ExcelPython (Programming Language)Machine LearningMATLABComputer VisionTensorFlow+1

Siteminder

Data Engineer III

Jan 2021Aug 2021 · 7 mos · Bangalore Urban, Karnataka, India · Remote

  • ● Refactored Redshift + Airflow ETLs, cutting query runtimes by 70%.
  • ● Optimized 1,500+ SQL lines, improving efficiency by 20%.
  • ● Developed K-Means segmentation for 2,500+ users, increasing campaign ROI by 10%.
Business AnalysisPySparkAmazon RedshiftTableauPredictive ModelingData Architecture+6

Rapido

Senior Data Engineer

Jul 2018Dec 2020 · 2 yrs 5 mos · Bengaluru, Karnataka, India · Hybrid

  • ● Scaled BigQuery + Presto pipelines across 108 cities, improving retrieval speed by 20%.
  • ● Automated 8 ETL workflows integrating Parquet to Metabase, reducing manual work by 90%.
  • ● Built real-time dashboards improving supply-demand balance and reducing churn by 15%.
  • ● Created SQL-based alerts, reducing unclaimed cashback by 30%.
A/B TestingPySparkMetabasePython (Programming Language)Data GovernanceGoogle Cloud Platform (GCP)+3

Tata consultancy services

Data Engineer

Jun 2015Jun 2018 · 3 yrs · Bengaluru Area, India · On-site

  • ● Developed HiveSQL + Redshift pipelines, improving performance by 20%.
  • ● Built predictive models (Random Forest) for healthcare analytics.
  • ● Delivered SQL and Tableau training to 40+ engineers.
Azure DatabricksTableauMicrosoft ExcelHiveSQLPython (Programming Language)Machine Learning+3

Education

Michigan Technological University

Master of Science - MS — Data Science

Jan 2021Jan 2022

Sree

Bachelor of Technology - B.Tech — Electronics and Instrumentation Engineering

Jan 2011Jan 2015

Board of Intermediate Education, A. P., Hyderabad

Intermediate

Jan 2009Jan 2011

Board of Secondary Education Rajasthan

Secondary School Certificate

Jan 2008Jan 2009

Stackforce found 100+ more professionals with Data Architecture & Snowflake

Explore similar profiles based on matching skills and experience