S

Sandeep B

Data Engineer

Wichita, Kansas, United States0 mo experience

Key Highlights

  • Expert in building scalable ETL pipelines.
  • Proven track record of optimizing data workflows.
  • Skilled in cloud migration and data analytics.
Stackforce AI infers this person is a Data Engineering expert in SaaS environments, specializing in cloud-native data solutions.

Contact

Skills

Core Skills

Data EngineeringEtl

Other Skills

AWS EMRAWS GlueAWS S3AirflowAmazon EMRAmazon RedshiftApache AirflowApache FlumeApache KafkaApache OozieApache SparkApache Spark StreamingApache ZooKeeperAzure Data FactoryAzure Data Lake

About

Data Engineer with 4+ years of experience building scalable, cloud-native data systems using Python, SQL, PySpark, and AWS. Proven ability to design efficient ETL pipelines, automate data workflows using Apache Airflow, and improve performance in distributed data environments. Skilled in migrating legacy systems to cloud platforms and delivering accurate insights with Power BI. Passionate about transforming raw data into actionable business value.

Experience

Carbon cell

Data Engineer

Jan 2024Present · 2 yrs 2 mos · Remote

  • Developed scalable ETL pipelines using Python, SQL, and PySpark
  • Automated ingestion from 6+ sources, improving data reliability and availability
  • Managed Apache Airflow DAGs for batch processing, reducing failures by 25%
  • Migrated legacy systems to AWS S3 and Redshift for faster analytics
  • Collaborated with analysts, engineers, and PMs to define pipeline requirements
  • Improved report accuracy and reduced query time by 20% through validation and tuning
PythonSQLPySparkApache AirflowAWS S3Redshift+2

Prox technologies pvt. ltd.

Data Engineer

Jun 2019Dec 2021 · 2 yrs 6 mos · Hyderabad, Telangana, India · On-site

  • Built scalable ELT pipelines with PySpark and Amazon EMR
  • Automated ingestion from on-prem DBs to S3, improving processing efficiency by 30%
  • Optimized Amazon Redshift using SQL tuning, distribution key selection, and partition pruning
  • Implemented SCD Type 1 & 2 logic for historical compliance in Redshift
  • Led UAT and QA collaboration, accelerating reconciliation sign-off by 40%
  • Owned end-to-end DAG development and deployment using Apache Airflow on AWS MWAA
  • Created Power BI dashboards and validated data for business reporting
PySparkAmazon EMRSQLAmazon RedshiftApache AirflowPower BI+2

Education

Wichita State University

Master's degree — Computer Science

Jan 2022Jan 2024

KL University

Bachelor of Technology - BTech — Computer Science

Jan 2017Jan 2021

Stackforce found 100+ more professionals with Data Engineering & Etl

Explore similar profiles based on matching skills and experience