Mohd Sohail Khan

Data Engineer

Faridabad, Haryana, India9 yrs 6 mos experience

Key Highlights

  • Expert in building scalable ETL pipelines.
  • Proficient in Big Data technologies like Apache Spark.
  • Strong background in data engineering and processing.
Stackforce AI infers this person is a Data Engineer specializing in Big Data solutions for the Fintech and SaaS industries.

Contact

Skills

Core Skills

Data EngineeringEtl

Other Skills

AirflowAmazon Web Services (AWS)Android DevelopmentApache AirflowApache OozieApache SparkApache SqoopBig DataCC++Data PipelinesDjangoExtract, Transform, Load (ETL)GitHadoop

About

Experienced Data Engineer with a demonstrated history of working in Big Data. Skilled in Scala, Python, Apache Spark, and Pyspark. Built end-to-end ETL pipelines using Apache Spark, Hadoop, MySQL, MongoDB, Scala, AIrflow, and AWS to process TBs of data. Strong engineering professional with a B.TECH focused in Computer Engineering from YMCA University of Science and Technology.

Experience

Exl

Senior Data Engineer

Apr 2025Present · 11 mos · Pune · Hybrid

Tiger analytics

Data Engineer

Aug 2022Apr 2025 · 2 yrs 8 mos · Hyderabad, Telangana, India · Hybrid

  • Worked with one of the major insurance companies in the US helping them in developing pyspark scripts for ETL and processing data. Developed scripts for parsing JSON data to hive tables. Used SnowSQL to write parsing logic for JSON data to get required data. Tech stack used: PySpark, Python, Hive, SnowFlake
Data EngineeringExtract, Transform, Load (ETL)AirflowData PipelinesApache SparkAmazon Web Services (AWS)+15

Sigmoid

Data Engineer

May 2021Aug 2022 · 1 yr 3 mos · Bengaluru, Karnataka, India

  • Built end to end ETL pipelines using Apache Spark, Hadoop, MySQL, MongoDB, Scala to process TBs of data.
  • Provided Big Data solutions to various customers with minimal query serve time.
  • Optimize the data processing cost and no. of partitions to save the data for indexing.
Data EngineeringExtract, Transform, Load (ETL)Data PipelinesApache SparkScalaPySpark+10

Samsung electronics

Software Engineer

Oct 2018May 2021 · 2 yrs 7 mos · Noida Area, India

Data EngineeringExtract, Transform, Load (ETL)Apache SparkPySparkScriptingSQL+3

St microelectronics

Software Engineering Intern

Jan 2018Jul 2018 · 6 mos · Greater Noida

Hackerearth

Campus Ambassador

Apr 2016May 2018 · 2 yrs 1 mo · Faridabad,Haryana

Education

J.C. Bose University of Science and Technology, YMCA

B.TECH — Computer Engineering

Jan 2014Jan 2018

Rawal Convent School

Senior Secondary — Non Medical

Jan 2013Jan 2014

Rawal Convent School

Secondary — Science

Jan 2011Jan 2012

Stackforce found 100+ more professionals with Data Engineering & Etl

Explore similar profiles based on matching skills and experience