Aniketh Jain

Senior Software Engineer

Bengaluru, Karnataka, India9 yrs 8 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Expert in designing ETL and analytics frameworks.
  • Proficient in Apache Spark and distributed systems.
  • Significant experience in AWS cloud technologies.
Stackforce AI infers this person is a Big Data Engineer specializing in SaaS and AdTech solutions.

Contact

Skills

Core Skills

Apache SparkDistributed SystemsAwsApache IgniteHbase

Other Skills

AWS EMRAlgorithm DesignAlgorithmsApache KafkaApache ZooKeeperCompute layerCore JavaDAG of dependent queriesData StructuresError Management SystemExtract, Transform, Load (ETL)HDFSHLD and LLDHadoopHive

About

Experienced Senior Software Engineer with a demonstrated history in designing and development of etl and analytics frameworks using technologies like Apache Kafka, HDFS, ZooKeeper, Apache Spark, HBase, and Apache Ignite. Experienced in programming using Java with production-grade performance and memory tuning of Java-based applications. Very good understanding of creating Distributed systems for processing and extracting relevant insights from Big data at rest and streaming. Experienced in Spark processing on AWS cloud using S3, EMR and Lambda.

Experience

9 yrs 8 mos
Total Experience
2 yrs 5 mos
Average Tenure
3 yrs 11 mos
Current Experience

Amazon

SDE 2

Jul 2022Present · 3 yrs 11 mos · Bengaluru, Karnataka, India

Myntra jabong

Senior Software Engineer

Dec 2019Jul 2022 · 2 yrs 7 mos · Bangalore

  • Revamped the in-house data serving product ( DDP ) by implementing the features of query queue time
  • prediction, cost based query queues and priority and quota/throttling management system for the users.
  • Proposed, designed and developed Step Query Engine used for executing complex queries by breaking them into a DAG of dependent queries.
  • Involved in HLD and LLD for application based proxy for HA of Hive clusters to minimize query failures in Ingestion pipelines.
  • Proposed and designed Error Management System for converting Database and Application specific verbose errors into easy to understand cause and resolutions for end users. This reduced the creation of on-call tickets for several services by 60%.
  • Designed Compute layer to enable better resource management of clusters for all Open source technologies like Spark, Presto/Trino, Azure DW under analytics department.
query queue time predictioncost based query queuespriority and quota managementStep Query EngineHLD and LLDError Management System+3

Neustar, inc.

Big Data Engineer

Mar 2019Dec 2019 · 9 mos · Bengaluru, Karnataka, India

  • Worked in the development of Identity graphs for targeted advertisement by segment creation. Technically, this involved the use of Apache Spark, AWS EMR and Lambda. Also, owned and developed a common service for monitoring and alerting all involved Spark jobs in pipeline with fine grained metrics for analysing performance over multiple runs.
Identity graphstargeted advertisementsegment creationApache SparkAWS EMRLambda+2

Subex

3 roles

Technical Anchor

Feb 2019Mar 2019 · 1 mo

Senior Software Engineer

Promoted

Jul 2018Jan 2019 · 6 mos

  • Collaborated with the technical architect to develop the major components of online duplicate XDR ( Call Record or Subscription Record ) detection using custom bloom filter implementation in Apache Ignite embedded in Apache Spark and also in stand-alone mode along with HBase to store the data for recovery from abrupt failures.
  • Designed and developed an implementation of a common entity management framework to replace the logs, errors, exceptions and metrics generated by our components into an abstraction which will provide a deeper in-sight on the state and lifecycle of various components within the pipeline. This makes the tasks of debugging issues and monitoring the components automated with little manual intervention.
duplicate XDR detectionbloom filter implementationApache IgniteHBaseentity management framework

Software Engineer

Jul 2016Jun 2018 · 1 yr 11 mos

  • Designed and implemented a cluster chaos generation tool responsible for simulating abrupt failure scenario's in a big-data cluster installation specifically, HDP distribution using Ambari REST Api.
  • Extensively involved in the design and development of the caching layer for internal parsing engine using Apache Ignite.
chaos generation toolbig-data cluster installationcaching layerApache Ignite

Education

Jain (Deemed-to-be University)

Bachelor of Engineering - BE — Computer Science

Jan 2012Jan 2016

Stackforce found 100+ more professionals with Apache Spark & Distributed Systems

Explore similar profiles based on matching skills and experience