Siddharth Chauhan

Data Engineer

Gautam Buddha Nagar, Uttar Pradesh, India7 yrs 9 mos experience
Most Likely To Switch

Key Highlights

  • Designed critical in-memory caching solution for data extraction.
  • Reduced data segregation time by 30% for global datasets.
  • Built automated validation pipeline ensuring data integrity.
Stackforce AI infers this person is a Data Engineering expert in Fintech with strong capabilities in data architecture and automation.

Contact

Skills

Core Skills

Data Warehouse ArchitectureData ArchitectureData EngineeringData ValidationSoftware DevelopmentTesting

Other Skills

PolarsAws S3Agile MethodologiesC++Apache AirflowJAVADockerData MaintenanceMySQLPandasDebuggingVersion ControlSoftware Development Life Cycle (SDLC)Test Automation FrameworksSelenium

About

As a Senior Data Engineer at PharVision Capital, I specialize in building robust, scalable data platforms and architecting end-to-end pipelines that enable fast, reliable, and accessible data delivery. With over 7 years of experience, I bring deep expertise in Python, C++, and distributed systems, combined with a strong focus on performance optimization and data strategy. ๐Ÿ”น Key Achievements: Designed & implemented a critical in-memory caching solution ("Table Service") that enables seamless raw data extraction, removing the dependency on C++ for downstream teams and significantly accelerating delivery. Developed a region- and exchange-aware data segregation system, reducing time for execution-level partitioning by 30%, streamlining analysis for global trading datasets. Built and deployed an automated Apache Airflow pipeline to perform exhaustive validation and sanity checks on both new and existing datasets, ensuring data integrity across all product lines. ๐ŸŽ“ Education & Certifications: B.Tech in Computer Science โ€“ Jaypee University of Information Technology Currently pursuing: Data Structures & Algorithms, LLD/HLD โ€“ Scaler Academy Certifications: Advanced C++, Python, Problem Solving ๐Ÿš€ Iโ€™m a proactive, detail-oriented engineer who thrives on collaboration, continuous learning, and solving hard problems at scale. Iโ€™m immediately available and actively exploring opportunities where I can contribute to building data-driven products and systems that scale.

Experience

7 yrs 9 mos
Total Experience
1 yr 11 mos
Average Tenure
2 yrs 5 mos
Current Experience

Pharvision capital

Senior Data Engineer

Jan 2024 โ€“ Present ยท 2 yrs 5 mos ยท Gurugram, Haryana, India ยท On-site

  • Integrating Data lake on AWS S3 cloud, Upgraded to Polars rather than Pandas data frame to get time and space efficiency.
PolarsAws S3Data Warehouse ArchitectureData Architecture

Worldquant

2 roles

Data Engineer

Promoted

May 2022 โ€“ Dec 2023 ยท 1 yr 7 mos ยท India

  • Engineered a functionality segregating raw data into region-specific and exchange-specific time zones, slashing execution-level data segregation time by an impressive 30%.
  • I detected a critical flaw in the legacy codebase that impacted more than 40% of the live dataset. We devised manual fixes for each dataset and orchestrated a streamlined migration plan to a contemporary code library.
  • I established an Apache Airflow pipeline that automates essential testing processes for validating every new and existing dataset addition, ensuring rigorous sanity checks in our productized datasets.
Agile MethodologiesC++Data EngineeringData Validation

Contractor

Jan 2021 โ€“ Apr 2022 ยท 1 yr 3 mos ยท India

  • As a contractor in WorldQuant via Cians Analytics, My work goals were closely aligned with creating tools to convert raw data into company proprietary outputs. Datasets were huge and varied greatly.
  • Running compatibility tests and coverage plots using AirFlows and Jenkins.
Agile MethodologiesC++

Monotype solutions india

Software Engineer I

Aug 2020 โ€“ Jan 2021 ยท 5 mos ยท Noida, Uttar Pradesh, India

  • Crafted C++ unit test cases, ensuring comprehensive code coverage. Conducted an in-depth analysis and documented benchmarks for Glyph rendering across diverse embedded devices.
  • Formulated demonstrative challenges utilizing proprietary library functions, aiding clients in mastering their utilization.
  • Employing Valgrind, pinpointed and rectified memory leaks without disrupting the overall system behavior.
  • Initiated and upheld Python-based helper scripts, automating myriad manual test cases across various devices through Unix-based automation.
C++JAVASoftware DevelopmentTesting

Thales

2 roles

Software Engineer

Jun 2018 โ€“ Jul 2020 ยท 2 yrs 1 mo

  • Conceptualized and executed an advanced automated testing framework utilizing Java Selenium for the pivotal Configuration Backup and Restore Functionality, a significant deliverable.
  • Engineered numerous essential host functions in C++11, addressing encryption, decryption, PIN management, and cryptographic key management.
  • Conducted rigorous unit, system, and functional testing on critical features of the Hardware Security Module (HSM).
  • Established a Regression pipeline on Jenkins for the Continuous Integration/Continuous Deployment (CI/CD) process, systematically testing new builds with every code change
Agile MethodologiesC++Software DevelopmentTesting

Internship Trainee

Feb 2018 โ€“ May 2018 ยท 3 mos

  • Worked closely with the R&D team to find legacy bugs and code coverage using tools like Valgrind and Qualys. In an Agile-based workflow, worked on many stories and related tasks. Hands-on experience in web automation testing using Java Selenium and Python Selenium. Console automation testing using pytest.
Agile MethodologiesJAVA

Education

Scaler

Data structure and Algorithms โ€” Low Level Design High Level design

Aug 2021 โ€“ Aug 2022

Jaypee University of Information Technology

Bachelor of Technology - BTech โ€” Computer Science

Jan 2014 โ€“ Jan 2018

Stackforce found 100+ more professionals with Data Warehouse Architecture & Data Architecture

Explore similar profiles based on matching skills and experience