Avantika Penumarty

Co-Founder

San Francisco, California, United States
7 yrs 10 mos experience
AI Enabled · Highly Stable

Key Highlights

  • Built analytics infrastructure for 2 billion users at Meta.
  • Reduced data pipeline latency by 70% at Tredence.
  • Founded a community-driven platform for aspiring Data Engineers.
Stackforce AI infers this person is a Data Engineering expert with a strong focus on analytics infrastructure and education.

Skills

Core Skills

Data Engineering, Education And Training, Analytics Infrastructure, Data Analysis, Quality Assurance, Test Automation

Other Skills

SQL, Python, ETL, Cloud Data Engineering, Big Data, AWS, GCP, DBT, Tableau, Spark, Presto, Hive, Airflow, Unidash, Manual Testing

About

I'm a Senior Data Engineer with 7+ years of experience building analytics infrastructure. I spent five years at Meta working across Reality Labs, Facebook Marketplace, and Consumer Connectivity. At Tredence, I worked on consumer tech analytics for Walmart and Marriott.

I founded ZERO2DATAENGINEER in August 2024. I love teaching and sharing the struggles from my own journey to help others build a smoother path. I talk about #dataengineering, #AI, and career growth as an immigrant woman in tech. You can find me by looking up @avantika_penumarty on all social networks.

Join my Data Learning Platform: https://zero2dataengineer.substack.com

WHAT I BRING:
→ Product analytics at scale (Reality Labs, Marketplace, Consumer Connectivity)
→ Marketing measurement (campaign analytics, attribution modeling, ROI tracking)
→ Experimentation & funnel optimization
→ Data quality & reliability (99%+ uptime)

TECH STACK: SQL, Python | Spark, Presto, Hive, Airflow, DBT | Snowflake, BigQuery | AWS, GCP | Tableau, Looker

WHAT I'M LOOKING FOR: Senior Data Engineer roles building analytics infrastructure at scale. Looking to partner with Product, ML, and Growth teams at top companies.

📧 apenumarty93@gmail.com

→ FOLLOW ME ON SOCIALS FOR UPDATES
LinkedIn: @AvantikaPenumarty
Instagram: https://www.instagram.com/avantika.data/

Experience

7 yrs 10 mos
Total Experience
2 yrs 7 mos
Average Tenure
--
Current Experience

Zero2dataengineer

Founder

Dec 2024 – Present · 1 yr 5 mos · United States · Remote

  • I love teaching and helping others grow.
  • Founded an education and career acceleration platform for aspiring and mid-level Data Engineers.
  • Built a 15K+ community across LinkedIn and Substack by publishing technical deep-dives, challenges, and career growth content.
  • Designed and delivered bootcamps and structured learning tracks in SQL, Python, ETL, Cloud Data Engineering, and Big Data.
  • Partnered with DataExpert.io to contribute to training programs, mentorship sessions, and practical bootcamps.
  • Represented the platform at hackathons, guest lectures, and industry conferences, sharing insights on scalable data pipelines, migration strategies, and real-world workflows.
  • Created a content-driven funnel (newsletters + LinkedIn content + events) blending technical depth with career readiness, helping learners land roles and accelerate their growth.
SQL, Python, ETL, Cloud Data Engineering, Big Data, Data Engineering, +1

Tredence Inc.

Senior Data Engineer

May 2023 – Sep 2024 · 1 yr 4 mos · San Francisco Bay Area · Remote

  • Worked with enterprise clients like Walmart and Marriott, delivering data infrastructure solutions that drove measurable business outcomes.
  • Reduced property onboarding time from 6 months to 6 weeks, enabling $50M+ incremental annual revenue
  • Implemented distributed ETL pipelines processing 5TB+ daily across 4,500+ stores, powering $500M+ inventory decisions
  • Reduced data pipeline latency from 4 hours to 45 minutes (70% improvement), enabling same-day decision-making
  • Architected cloud migration from legacy systems to AWS/GCP, reducing infrastructure costs by 30% ($2.4M annual savings)
  • Developed SQL optimizations cutting query execution times by 60%, delivering insights 3x faster to analysts
  • Built anomaly detection models reducing data incidents by 70%, preventing $2M+ in potential losses
  • Established DBT-based transformation framework improving data accuracy to 99.9%
  • Mentored 5 junior engineers resulting in 3 promotions and 40% improvement in team delivery speed
  • Technologies: Spark, Airflow, Python, SQL, Snowflake, AWS, GCP, DBT, Terraform, Tableau
Python, SQL, ETL, AWS, GCP, DBT, +3

Meta

2 roles

Data Engineer

Promoted

May 2020 – Mar 2023 · 2 yrs 10 mos · San Francisco Bay Area

  • Built analytics infrastructure across Reality Labs, Facebook Marketplace, and Consumer Connectivity serving 2+ billion users.
  • Designed petabyte-scale ETL pipelines processing 10B+ daily events across Reality Labs VR/AR platforms (Meta Quest)
  • Built real-time analytics enabling product teams to measure VR session quality, feature adoption, and content performance
  • Partnered with ML teams deploying recommendation pipelines improving content discovery by 25%, increasing VR session time by 18 minutes per user
  • Architected funnel analytics tracking 1B+ monthly Marketplace users across search, discovery, messaging, and transactions
  • Built datasets powering 100+ concurrent A/B experiments testing listing quality, search ranking, and buyer-seller matching
  • Developed audience segmentation models for Consumer Connectivity powering re-engagement campaigns that increased DAU by 3% (60M+ additional daily users)
  • Reduced compute costs by 50% ($4M+ annual savings) through intelligent query optimization and resource management
  • Optimized dashboard latency by 5x (30 min to 6 min), enabling same-day strategic decisions for product leadership
  • Built real-time monitoring detecting anomalies across transaction pipelines, preventing $10M+ in potential GMV loss
  • Created automated validation frameworks reducing data quality incidents by 90%, ensuring reliability for 500+ internal teams
  • Orchestrated 5,000+ daily Airflow jobs with 99.9% reliability and zero-downtime deployments
  • Pioneered metadata management improving data discoverability for 200+ teams, reducing time-to-insight by 60%
  • Automated 80% of manual validation work, freeing 40+ analyst hours weekly
  • Technologies: Spark, Presto, Hive, Airflow, Dataswarm, Python, SQL, Druid, Tableau, Looker
Spark, Presto, Hive, Airflow, Python, SQL, +3

Data Analyst

Sep 2018 – Aug 2020 · 1 yr 11 mos · San Francisco Bay Area

  • Built analytics foundations and experimentation frameworks supporting product decisions across Meta's platforms.
  • Built Tableau/Unidash dashboards surfacing insights that drove product decisions affecting 100M+ users across Reality Labs and Marketplace
  • Conducted SQL analysis identifying $5M+ revenue optimization opportunities in ad serving and monetization systems
  • Designed A/B testing frameworks measuring impact across 50+ product features, establishing experimentation methodology adopted across Product and Marketing teams
  • Optimized database structures improving query efficiency by 5x (30 min to 6 min), enabling analysts to deliver insights same-day instead of next-week
  • Automated 90% of manual reporting pipelines, freeing 20+ hours weekly for strategic analysis instead of data extraction
  • Led Python-based automation eliminating repetitive tasks, increasing team productivity by 40%
  • Built robust data validation pipelines ensuring 99%+ accuracy across 200+ business-critical metrics used by leadership
  • Presented data-driven insights to VP-level leadership influencing quarterly product roadmaps and resource allocation decisions
  • Designed funnel analytics tracking user journeys from acquisition through retention, identifying drop-off points that informed product improvements increasing conversion by 12%
  • Created statistical frameworks for trend analysis and anomaly detection, catching data quality issues before they impacted business decisions
  • Technologies: SQL, Python, Tableau, Unidash, Hive, Presto, Airflow, Jupyter (Bento)
SQL, Python, Tableau, Unidash, Airflow, Data Analysis, +1

Tech Mahindra

Test Automation Engineer

May 2014 – May 2016 · 2 yrs · India · Hybrid

  • Built automation frameworks and quality assurance systems for enterprise applications.
  • Developed full-stack test automation frameworks reducing regression testing time by 70%, accelerating release cycles
  • Built CI/CD integrations streamlining deployment processes and improving release efficiency by 50%
  • Led API testing strategies ensuring system robustness across microservices architecture handling millions of transactions
  • Collaborated with developers catching 90% more critical bugs before production release, reducing post-deployment incidents
  • Optimized performance testing workflows enabling systems to scale and handle millions of concurrent transactions
  • Trained QA teams on automation best practices improving overall software quality and team productivity by 40%
  • Automated test case generation reducing manual testing effort by 60% and accelerating testing cycles
  • Enhanced bug tracking workflows reducing resolution times by 35% and improving software reliability
Python, Manual Testing, Automation Testing, Quality Assurance, Test Automation

Education

California State University, Northridge

Master of Science - MS — Engineering Management

Jawaharlal Nehru Technological University

Master of Technology - MTech — Computer Science

Jawaharlal Nehru Technological University

Bachelor of Technology - BTech — Bio Medical Engineering

Jawaharlal Nehru Technological University Kakinada (JNTUK)

Master of Technology - MTech — Computer Science

May 2014 – May 2016
