Aniket Deshpande

Data Engineer

Vancouver, British Columbia, Canada10 yrs 3 mos experience
Highly Stable

Key Highlights

  • 10 years of experience in data engineering.
  • Expert in building scalable data pipelines.
  • Proven track record in optimizing data architectures.
Stackforce AI infers this person is a Data Engineer with expertise in building and optimizing data pipelines in the Fintech and SaaS industries.

Contact

Skills

Core Skills

Apache SparkPython (programming Language)Databricks

Other Skills

PrestoApache AirflowSQLAWS AthenaSQL ServerPostgresMapRDCOSApache FlinkStreamsetsChefTerraformJavaHBaseKafka

About

I am a Data Engineer with Python, Java, SQL, Spark and Airflow as my core skills.

Experience

10 yrs 3 mos
Total Experience
2 yrs 6 mos
Average Tenure
--
Current Experience

Meta

2 roles

Data Engineer

Nov 2024Present · 1 yr 6 mos

Data Engineer

Jul 2020Nov 2024 · 4 yrs 4 mos

  • Supporting Data Architect efforts Human Ops Platform inside the Community Integrity stream to design and build data infra to monitor efficiency and effectiveness of the ecosystem
  • Led the Data architecture efforts for new products launched to support the world's largest and most complex human review platform, collaborating across multiple cross functional teams
  • Designed the analytics layer for effectively tracking the progress of new products with tools ranging from Dataswarm (Airflow), Unidash, Presto and Spark
  • Led efforts to optimize the performance of data pipelines using Dataswarm, Presto and Spark to effectively manage the compute resources and to improve SLAs for core datasets
  • Launched org wide initiative for improving and standardizing logging of critical data. (Nucleator)
Python (Programming Language)PrestoApache AirflowApache SparkSQL

Kabbage, inc

Advanced Data Platform Engineer

Nov 2016May 2020 · 3 yrs 6 mos · Atlanta Metropolitan Area

  • Built the Kabbage data lake using Databricks (Spark), Databricks Delta and AWS Athena
  • Designed and built multiple data pipelines via Airflow and Databricks (using Spark) to ingest and process data from multiple sources into the Kabbage Data Lake
  • Designed and built pipelines leveraging Spark and Airflow to unload data from SQL Server and Postgres into the Kabbage Data lake on a daily basis (300 Gb+ data per day)
  • Designed and built pipelines leveraging Spark via AWS EMR and Airflow to ingest and process data from 3rd part REST APIs
  • Designed and built a new stream-first data platform based on the Kappa Architecture, using MapR ecosystem (MaprFS, Mapr streams, MapR DB), DCOS (Mesos), Apache Flink, Apache Ignite, Druid, Streamsets etc
  • Designed, built and automated the entire infrastructure using Chef and Terraform on AWS
  • Designed a RESTful micro-service in Clojure for storing data in Apache Ignite
  • Designed and Developed a Spark (Java) batch process to transfer data between S3 and MapR automated it via Python, Boto3 and AWS EMR
  • Set up a highly available secure Apache Flink and Apache Spark standalone cluster on DCOS using Marathon
  • Designed and setup monitoring and log aggregation using fluentd, ElasticSearch, DataDog and OpsGenie
Python (Programming Language)PrestoApache AirflowApache SparkDatabricksAWS Athena+8

Magnetic

Data Engineer

Apr 2016Oct 2016 · 6 mos · New York City Metropolitan Area

  • Worked with the Real-Time Events and Media team
  • Refined the real time bidding pipeline by developing and testing clients using Python and Java
  • Implemented and managed an HBase to Kafka pipeline using Python (with HappyBase and kafka-python) . Monitored the pipeline metrics through DataDog
  • Developed an end-to-end integration test suite for the bidder pipeline using Python, Behave and Docker
  • Designed a Spark job (Python) to analyze the performance of Avro vs Parquet
  • Performed on-call duties as a Data Engineer including analyzing and solving bugs in the Data pipeline. This typically involved monitoring and fixing YARN and Luigi jobs. Also, DataDog was used for monitoring and tracking these pipelines
Python (Programming Language)Apache Spark

Sas

Graduate Intern

Jun 2015Aug 2015 · 2 mos · Cary, North Carolina

  • Building a robust model for an intelligent semantic search server
Python (Programming Language)

Geometric ltd.

Software Developer

Jul 2012Jun 2014 · 1 yr 11 mos · Pune/Pimpri-Chinchwad Area

  • Responsible for Teamcenter customization through Integrated Teamcenter Kit(Server side C,C++ coding) and Teamcenter Rich Application Client(Client Side Java UI coding) for resolving issues related to Virtual Process Planning for Caterpillar Inc, the world's leading manufacturer of construction and mining equipment, diesel and natural gas engines, industrial gas turbines and diesel-electric locomotives.

Intouchrewards.com

Intern

Apr 2010Jun 2010 · 2 mos · Pune/Pimpri-Chinchwad Area

  • Responsible for designing a mobile Java application using J2ME to track Loss Sales in Retail industry.

Education

University at Buffalo

Master of Science (M.S.) — Computer Science

Jan 2014Jan 2016

Visvesvaraya National Institute of Technology

Bachelor of Technology (BTech) — Computer Science

Jan 2008Jan 2012

Stackforce found 100+ more professionals with Apache Spark & Python (programming Language)

Explore similar profiles based on matching skills and experience