Sharayu Shinde

Data Engineer

India5 yrs 11 mos experience
Highly Stable

Key Highlights

  • Expert in building scalable data processing systems.
  • Proficient in optimizing data pipelines with PySpark.
  • Experience in real-time and batch data processing.
Stackforce AI infers this person is a Data Engineer with expertise in Fintech and Big Data technologies.

Contact

Skills

Core Skills

PysparkData PipelinesApi DevelopmentPythonEtlAmazon Web Services (aws)

Other Skills

SQLHiveHDFSApache IcebergDelta Lakedistributed systemsHarnessLightspeedAWSGitCI/CDMicrosoft OfficeJavaKafkaPower BI

About

Data Engineer at Citi with strong expertise in Big Data technologies and building scalable data processing systems for enterprise-level applications. I specialize in designing and optimizing data pipelines using distributed systems and modern data platforms. My experience includes working with large-scale datasets, ensuring efficient data processing, and enabling analytics for business-critical use cases. At Citi, I have: • Built and optimized data pipelines using PySpark for large-scale data processing • Worked extensively with Hive and HDFS for distributed data storage and querying • Leveraged Apache Iceberg and Delta Lake for reliable and scalable data lake architectures • Integrated real-time and batch workflows to support analytics and reporting • Developed APIs and automated workflows to improve system efficiency Technical expertise: • PySpark, SQL, Python • Hive, HDFS • Apache Iceberg, Delta Lake • Data pipelines, ETL, distributed systems • Harness, Lightspeed • AWS, Git, CI/CD I am passionate about solving complex data engineering challenges, optimizing performance, and building robust, scalable data platforms. Currently focused on advancing in: • Databricks and Lakehouse architecture • Advanced Spark optimization techniques • Real-time data processing systems Open to Data Engineering opportunities where I can contribute to building high-performance data platforms at scale.

Experience

5 yrs 11 mos
Total Experience
5 yrs 11 mos
Average Tenure
5 yrs 11 mos
Current Experience

Citi

Software Developer

Sep 2023Present · 2 yrs 9 mos

PySparkSQLPythonHiveHDFSApache Iceberg+9

The university of manchester

Student Representative

Oct 2022Sep 2023 · 11 mos · Manchester Area, United Kingdom

  • Student Representative for MSC in Advanced Computer Science Post Graduate program for all pathways.
Microsoft Office

Citi

2 roles

Software Engineer at Citi, Pune

Promoted

Dec 2020Sep 2022 · 1 yr 9 mos

  • The role involved developing and deploying a portfolio management and analysis application using Python and the Agile development cycle.
  • The application performance was enhanced by implementing a multiprocessing environment, which reduced processing time by 40%
  • Kafka was integrated to eliminate manual intervention in the system and automate asset processing to calculate Value at Risk and P&L
  • Autosys was used to automate the application, process the asset and generate portfolio reports used by high-value clients and downstream systems.
  • Developed a service for sending emails, and developed a Django website to process ad-hoc asset processing requests
  • Developed APIs for portal authentication and authorization using IBM API connect framework and developer’s tool.
  • A Power BI report was developed to give risk insights over distinct time periods.
JavaAPI DevelopmentPython

Software Analyst

Jul 2019Dec 2020 · 1 yr 5 mos

  • Conducted research on various proofs of concepts to assess the suitability of the technology for project requirements.
  • Evaluated technologies including Hive, Impala, Spark, Sqoop, and Java.
  • Designed an ETL flow to extract data from HDFS using Impala.
  • Conducted aggregation of data.
  • Loaded the data back into HDFS.
JavaSqoop

Ibm

Project Intern

Aug 2018May 2019 · 9 mos

  • Smart Infrastructure Alert
  • Contributed to the design and development of an alert system for generating infrastructure alerts.
  • Configured AWS CloudWatch service to monitor CPU and memory space utilization and generate events when a threshold is reached.
  • Configured AWS SNS service to capture events sent by the CloudWatch service and send email notifications to the DevOps team.
WebSphereAmazon Web Services (AWS)

Education

The University of Manchester

Master's degree — Advanced Computer Science

Sep 2022Sep 2023

MKSSS Cummins College of Engineering for Women

Bachelor's degree — Computer Science

Jan 2015Jan 2019

Stackforce found 100+ more professionals with Pyspark & Data Pipelines

Explore similar profiles based on matching skills and experience