A

Ankit D.

Data Engineer

United States14 yrs 6 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Built scalable data systems for cross-functional teams.
  • Led data engineering initiatives at top tech companies.
  • Expert in data-driven decision-making and analytics.
Stackforce AI infers this person is a Data Engineer with expertise in SaaS and Data Analytics.

Contact

Skills

Core Skills

Data EngineeringEtlData ScienceData AnalysisConsulting

Other Skills

AirflowAWSS3RedshiftData ValidationData ModelingData PipelinesSQLCollaborationDashboard DevelopmentModelingPythonTableauText AnalyticsCollaborative Filtering

About

I am a full stack data professional working on the end-to-end data lifecycle involving logging, building data models and ETL, developing metrics and dashboards. My blended experience in Software Engineering, Data Science and Data Engineering helps me build tools, dashboards and data narratives that help cross functional partners make data driven decisions. I am passionate about learning about and building scalable data systems. I love to learn about how technology is helping shape our future.

Experience

14 yrs 6 mos
Total Experience
2 yrs
Average Tenure
2 yrs 3 mos
Current Experience

Netflix

Senior Data Engineer

Feb 2024Present · 2 yrs 3 mos

Airtable

Software Engineer, Data

Jul 2020Jan 2024 · 3 yrs 6 mos · San Francisco Bay Area

  • As the first Data Engineer in the company, I built core datasets that represent users, enterprises, marketing, revenue and logging events
  • Worked with cross functional stakeholders from data science, sales, account managers to identify and prioritize data requirements for building foundational datasets that help measure the topline revenue metric, ARR and other related revenue metrics
  • Built patterns and abstractions for ingesting and consuming data from external data sources like Salesforce, Clearbit, Google Ads, etc. into the data warehouse and rich information about users, enterprise accounts ,etc.
  • Use Airflow and AWS stack (S3, Redshift, Redshift Spectrum) to build data pipelines that process 10MM+ rows of raw data daily.
  • Contributed to the in-home ETL testing framework to setup data validation rules constraints like null checks, primary keys, value limits, etc.
  • Optimized ETL queries to bring down the ETL run-time to help with clear cluster resources during business hours.
AirflowAWSS3RedshiftETLData Validation+3

Airbnb

Data Scientist

Feb 2019Jul 2020 · 1 yr 5 mos · San Francisco Bay Area

  • Led the migration of the Homes Checkout data foundation, partnering with Engineering to define and validate the core logging schema, updating the checkout data model & ETL pipelines and defining conversion & user behavior metrics for identifying key frictions in the checkout flow
  • Unblocked experimentation for product teams by identifying root causes of experiment imbalance, finding bugs in the product & experimentation setup, using ad-hoc analyses for informing high stakes iterations
  • Collaborated with Data Scientists across Payments, Membership and Trust teams on an executive dashboard to surface trends in key partner team metrics and assess the impact of checkout sub-flows on conversion
  • Conducted company-wide SQL training to empower cross functional partners to independently answer their data questions
SQLData ModelingData AnalysisCollaborationDashboard DevelopmentData Science

Facebook

Data Scientist

Feb 2017Feb 2019 · 2 yrs · San Francisco Bay Area

  • Identified limitations of a complex weighted top line metric used by Branded Content team and moved the team to a simpler metric aligned with the team’s goals.
  • Empowered business operations team with a dashboard for creator leads powered by an ETL pipeline using page similarity data for sourcing new creators similar to the existing creators
  • Flagged policy violating partnership posts using a pipeline to look for fraudulent creators based on posting patterns
  • Built a propensity scoring model to identify users that are likely to pay for creators
  • Provided opportunity sizing for starting a new Creator marketplace program and awarding ad credit coupons to creators
  • Managed a Data Science intern to scope out the work for identifying key factors impacting retention and churn for subscriptions
Data AnalysisETLDashboard DevelopmentModelingData Science

Deloitte

Consultant - Advanced Analytics and Modeling

Feb 2016Feb 2017 · 1 yr · Greater Boston

  • Implemented an article recommendation engine using Text Analytics and Collaborative Filtering using Python.
  • Built a Tableau dashboard for visualizing sales attribution calculated through an R-based marketing mix model.
  • Provided consultation to the data platform team on requirements for setting up the AWS based data analytics platform for a client.
  • Built a RShiny based tool for a healthcare client
PythonTableauText AnalyticsCollaborative FilteringData AnalysisConsulting

Carnegie mellon university

Graduate Research Assistant

Sep 2015Dec 2015 · 3 mos · Greater Pittsburgh Region

  • Worked with Prof Rema Padman on aggregating and analyzing data to identify relationship between patients’ diagnosis, lab readings, medications, demographics, pulse, temperature and sleeping patterns with severity of the chronic kidney disease.

Linkedin

Data Scientist Intern

May 2015Aug 2015 · 3 mos · San Francisco Bay Area

  • Enhanced, on-boarded and productionized key metrics related to LinkedIn’s News Feed consumption.
  • Performed exploratory & segmentation analysis to show how the feed consumption varied across different segments.
  • Identified several data irregularities and worked closely with the engineering team to get the issues rectified.
  • Built daily reports that reported how metrics were changing over time.
  • Gathered data related to video sharing on the Feed to help understand the extent to which spam videos were being shared.
  • Helped uncover the reason behind a critical drop in consumption metrics using data segmentation and visualization.

Carnegie mellon university

Graduate Research Assistant

Jan 2015Apr 2015 · 3 mos · Greater Pittsburgh Region

  • Performing univariate and multivariate analysis on PNC bank's customer' financial transactions data to find out anomalous patterns that indicate deteriorating financial health of a customer.

Thoughtworks

Senior Application Developer

Jun 2010Jul 2014 · 4 yrs 1 mo · Pune/Gurgaon/Bangalore

  • Led a team of engineers to build a D3.JS based frontend application to help match employee skills with client project requirements
  • Used Scala on Play Framework with AngularJS to build a Single Page Web application for User Registration flow
  • Built a Java based multithreaded asynchronous Invoice Generation system using custom parsers for JSON and XML input
  • Trained 30+ developers on Unit Testing, Functional Testing, Design Patterns, Test Driven Development, Refactoring, Pair Programming, Object Oriented Design, Continuous Integration and Continuous Deployment

Education

Carnegie Mellon University

Masters in Information Systems Management — Business Intelligence and Data Analytics Concentration

Jan 2014Jan 2015

National Institute of Technology Nagpur

Bachelor of Technology (B.Tech.) — Computer Science and Engineering

Jan 2006Jan 2010

Jai Hindu jobs

Vocational Computer Science

Jan 2004Jan 2006

Stackforce found 100+ more professionals with Data Engineering & Etl

Explore similar profiles based on matching skills and experience