ANKIT KUMAR SINGH

Engineering Manager

Bengaluru, Karnataka, India12 yrs 1 mo experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Over 11 years of experience in data engineering.
  • Expert in Hadoop ecosystem and big data technologies.
  • Proficient in developing end-to-end ETL pipelines.
Stackforce AI infers this person is a Big Data Engineer with extensive experience in healthcare and e-commerce sectors.

Contact

Skills

Other Skills

Apache AirflowApache SupersetData WarehousingETLInformaticaMinION Object Storage SuiteNetezzaPL/SQLTesting

About

• 11+ years of experience in Data warehouse application Development, Testing and Production Support on Healthcare, Insurance, Retail and E-commerce Domain. • 8 +years of experience in providing analytics-based business solutions using Hadoop Ecosystem (Hadoop, Cloudera, Impala, Hortonworks). • Expertise in Big Data Technologies Hadoop Ecosystem, HDFS, Sqoop, Scala, Python, Spark, Kafka, Spark Streaming, Informatica PowerCenter, Oracle, DB2, HBase,, Unix, JIRA, Autosys, Airflow • Experience in importing and exporting data from RDBMS to HDFS, Hive tables and HBase by using Sqoop. • Deep knowledge in incremental imports, partitioning and bucketing concepts in Hive and Spark SQL needed for optimization. • Experience in processing large set of Structured, Semi-structured, Unstructured datasets and Streaming data using Spark. • Experience working on Spark in performing ETL using Spark Core, Spark-SQL and Real-time data processing using Spark Streaming. • Worked with different file formats like JSON, XML, Avro data files and text files. • Ability to troubleshoot and tune relevant programming languages like SQL, Python, Scala, PIG, Hive, RDDs, Data Frames. • Experience in NoSQL databases such as HBase and Cassandra. • Developed end to end ETL Pipeline using Spark, Scala and Kafka API’s • Good Understanding of Big query and Migration Agent in GCP • Knowledge on Cloud Computing with Amazon Web Services like EMR, EC2, S3 • Optimized Hive QL/ pig scripts by using execution engine like Tez, Spark. • Good Understanding of Different RDBMS databases like Oracle, Teradata, Netezza. • Experience in Writing Complex SQL queries and Optimization. • Good understanding of Datawarehouse Concepts and RDBMS databases such as SQL Server, Oracle & Teradata • Good understanding of Software Development Life Cycle and development methodologies (Agile/Waterfall). • Experienced in handling production support which includes monitoring online, production batch runs, problem determination and Providing resolution. • Excellent verbal and written communication skills, with ability to lead a project team through entire project lifecycle.

Experience

Commonwealth bank

2 roles

Engineering Manager

Promoted

Aug 2024Present · 1 yr 7 mos · Bengaluru, Karnataka, India

Lead Data Engineer

Nov 2022Aug 2024 · 1 yr 9 mos · Bengaluru, Karnataka, India

Rakuten

3 roles

Senior Data Engineer-II

Jan 2022Oct 2022 · 9 mos

Senior Data Engineer-I

Promoted

Jan 2020Feb 2022 · 2 yrs 1 mo

Data Engineer

Apr 2018Jan 2020 · 1 yr 9 mos

Logitech

Data Engineer

Oct 2017Apr 2018 · 6 mos · Chennai, Tamil Nadu, India

Astrazeneca

ETL Developer

Sep 2016Oct 2017 · 1 yr 1 mo · Chennai Area, India

  • The Next Generation Warehouse will provide the foundation to enable the transformational business capabilities and support existing information needs.
  • NGW enhance efficiency and effectiveness and business decisions and insight generation by providing a commercial information management foundation with the appropriate breadth and depth of data.
  • The SIMON is the application system developed to support Headquarters Commercial Analytics and Field Sales Reporting tool in AstraZeneca North America.
  • Simon utilizes Micro Strategy Business Intelligence Tool as the platform for delivering the needed data. The current reporting solution leverages Netezza and Informatica as the technical platform.

Anthem, inc.

Software Developer

Jan 2014Sep 2016 · 2 yrs 8 mos · Bengaluru Area, India

  • Regular meetings with Onsite coordinator on the Project Status.
  • Analyzing the technical specifications.
  • Understanding existing business model and Customer Requirements.
  • Involved in Performance Tuning of Informatica ETL mappings and database.
  • Creating Mapping Document based on BRD.
  • Creating RTM (Requirement Traceability Matrix) based on BRD.
  • Fixing invalid Mappings, testing of Stored Procedures and Functions, Unit and Integration Testing of Informatica Sessions, Batches and Target Data
  • Extensively used Transformations like Router, Aggregator, Source Qualifier, Joiner, Expression, Aggregator and Sequence generator.
  • Worked with different Sources such as Oracle and flat files.
  • Prepared documentation to describe program development, logic, coding, testing, changes and corrections.

Education

Raj Kumar Goel Institute of Technology, Ghaziabad

Bachelor of Technology (B.Tech.) — Electronics and Communications Engineering

Jan 2009Jan 2013

Stackforce found 100+ more professionals with Apache Airflow & Apache Superset

Explore similar profiles based on matching skills and experience