Ashutosh Pandey

AI Researcher

San Ramon, California, United States22 yrs 1 mo experience
AI EnabledAI ML Practitioner

Key Highlights

  • Over a decade of experience in data engineering.
  • Expert in advanced data architecture and cloud technologies.
  • Proven track record of driving business transformations.
Stackforce AI infers this person is a Data Engineering expert with a focus on SaaS and cloud-based solutions.

Contact

Skills

Core Skills

Data EngineeringCloud ComputingData ArchitectureMachine LearningData ManagementErp ImplementationUser Training

Other Skills

Data EngeeringLarge Language Models (LLM)Big DataApache SparkPython (Programming Language)FastAPISQLNoSQLData LakesData ModelingData QualityMaster Data ManagementREST APIApache KafkaDocker

About

Dynamic Principal Data Engineer with over a decade of leading advanced data architecture deployments that drive business transformations. Expert in Apache Spark, Kafka, Databricks, and Snowflake, with proficiency in Python, SQL, and modern cloud technologies. Renowned for creating scalable data infrastructures that enhance operational efficiency. Combines innovative solution development with strategic data governance to optimize analytics, boost customer engagement, and maximize revenue growth. Passionate about leveraging cutting-edge data technologies to solve complex business challenges and continually improve data-driven decision-making processes.

Experience

22 yrs 1 mo
Total Experience
3 yrs 4 mos
Average Tenure
1 yr 9 mos
Current Experience

Comcast

Principal Data Engineer

Sep 2024Present · 1 yr 9 mos · California, United States · Remote

Caffeine

Principal Data Engineer

Oct 2022Aug 2024 · 1 yr 10 mos · Redwood City, California, United States · Remote

  • Improved processing efficiency by 50% with 80% reduced operational costs by leading the design and deployment of advanced video streaming data pipelines using Kafka, PySpark, Delta Lake, and Unity Catalog.
  • Enhanced data insight extraction and streamlined complex data modeling and transformation processes by architecting a Data Lakehouse on Databricks and DBT.
  • Increased user engagement by 30% and improved retention rates by developing a real-time video recommendation engine using machine learning techniques.
  • Enhanced content discoverability and retention through the integration of LLM models for video transcript analysis, creating automated video chapters and summaries.
  • Optimized media asset management by engineering and deploying high-performance video catalog syndication microservices.
Data EngeeringLarge Language Models (LLM)Big DataCloud ComputingApache SparkPython (Programming Language)+37

Meta

Lead Data Engineer

Nov 2021Oct 2022 · 11 mos · Menlo Park, California, United States · On-site

  • Reduced execution times by 40% and streamlined operations by optimizing the Central Integrity Decision data pipeline.
  • Enhanced data accuracy and boosted operational efficiency by developing and deploying a maturity metrics dashboard for Integrity Automation services.
  • Identified 20% redundancy and saved approximately $30 million annually by thoroughly analyzing the content moderation queue and engineering a strategic solution.
JavaScriptAgile MethodologiesArchitectureData ManagementRepresentational State Transfer (REST)Hadoop+10

Freedom financial network

Senior Staff Data Engineer

Jan 2021Nov 2021 · 10 mos · United States · Hybrid

  • Led the creation of a large-scale, distributed, event-driven data platform using GCP, BigQuery, Kafka, and Python, serving as critical infrastructure for all organizational offerings.
Google Cloud Platform (GCP)JavaScriptAgile MethodologiesArchitectureData ManagementRepresentational State Transfer (REST)+6

Netapp

Principal Data Engineer

Oct 2011Jan 2021 · 9 yrs 3 mos · Sunnyvale

  • Significantly enhanced business scalability and agility by engineering a sophisticated enterprise data platform from the ground up, integrating Apache Spark, cloud infrastructure, Delta Lake, Kafka, and Snowflake.
  • Enhanced customer retention by 20% and boosted subscription revenue by 15% through cross-sell/up-sell strategies and developing a telemetry analytics platform using Apache Spark, Airflow, and AWS S3.
  • Boosted on-time renewal rates by 40% by employing Apache Spark, Delta Lake, Airflow, and microservices to derive and serve detailed install-based as-is config and metrics from system logs.
  • Enhanced lead generation, sales strategies, and customer support by building an Enterprise Contact Master application using MongoDB, Node.js, and React, integrating Google Maps and LinkedIn APIs for data enrichment.
  • Improved data consistency and operational efficiency organization-wide by establishing and executing global data management and governance strategies.
ElasticsearchJavaScriptAgile MethodologiesArchitectureData ManagementRepresentational State Transfer (REST)+28

Infogain

Solutions Architect (Data and Analytics)

Sep 2006Sep 2011 · 5 yrs · Sunnyvale, CA

  • Improved data management and analytics capabilities by masterminding and implementing multi-domain Master Data and dimensional data models.
  • Elevated data quality and handling efficiency by designing and launching a data workbench to optimize data acquisition and management processes.
  • Oversaw ERP systems across various domains with a strategic focus on Master Data Management.
JavaScriptData ManagementMaster Data ManagementEnterprise Resource Planning (ERP)InformaticaSQL+1

Lg electronics

Solution Architect (ERP)

Mar 2004Sep 2006 · 2 yrs 6 mos · Noida, Uttar Pradesh, India

  • Led a team to successfully implement the Oracle EBS 11 Order Management module, optimizing order processing and management for increased operational efficiency.
  • Architected and developed robust data migration processes, seamlessly transferring critical data from legacy Sales and Inventory systems to the new Oracle EBS platform, ensuring data integrity and continuity.
  • Conducted comprehensive training sessions for business users and operational teams, empowering them with the skills and knowledge to effectively utilize the Oracle EBS system, resulting in improved adoption and user competency.
  • Oversaw the seamless transition of support responsibilities from the implementation team to the operational team, ensuring continuous maintenance and support for the Oracle EBS system, and establishing a sustainable support framework.
Data ManagementOracle ERP ImplementationsERP Implementation

Education

Georgia Institute of Technology

Master of Science - MS — Analytics

Jan 2021Jan 2023

Indira Gandhi National Open University

Master of Computer Applications - MCA — Computer Science

Jan 1998Jan 2001

University of Allahabad

Bachelor of Arts - BA — Political Science and Government

Jan 1991Jan 1994

Stackforce found 100+ more professionals with Data Engineering & Cloud Computing

Explore similar profiles based on matching skills and experience