Sujay Kumar

Data Engineer

Gurugram, Haryana, India7 yrs 6 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Expert in building scalable data pipelines.
  • Led a team of 10+ engineers at American Express.
  • Strong background in cloud technologies and data engineering.
Stackforce AI infers this person is a Data Engineer specializing in Fintech and cloud-based data solutions.

Contact

Skills

Core Skills

Data EngineeringCloud TechnologiesBusiness IntelligenceData Migration

Other Skills

Agile MethodologiesAirflowAnalytical SkillsBI PublisherBig DataBigQueryBusiness Intelligence (BI)C (Programming Language)CST Microwave StudioCommunicationConfluenceData ConversionData LakesData ModelingData Pipelines

About

As a Data Engineer with about 7 years of experience, I specialize in designing and building scalable data pipelines and delivering high-impact solutions using modern big data and cloud technologies. At American Express, I work on transforming large datasets into actionable insights that drive strategic business decisions. My technical toolkit includes Python, SQL, PL/SQL, PySpark, Hive, Hadoop, HBase, Airflow, and GCP services such as BigQuery, Cloud Storage, Dataproc, and Pub/Sub. I bring deep experience in architecting and optimizing data workflows in both on-prem and cloud environments, enabling advanced analytics and machine learning capabilities at scale. I’m passionate about continuous learning and problem-solving, and I thrive in fast-paced environments where innovation and collaboration are key. Whether it's improving pipeline performance, automating complex data processes, or mentoring junior engineers, I’m driven by the opportunity to make data work smarter. Let’s connect if you’re interested in data engineering, big data ecosystems, or building intelligent systems on the cloud.

Experience

7 yrs 6 mos
Total Experience
2 yrs 6 mos
Average Tenure
3 yrs 8 mos
Current Experience

American express

2 roles

Data Engineer 3

Promoted

Aug 2024Present · 1 yr 10 mos

  • As the Lead Data Engineer for a mission-critical Risk and Compliance application at American Express, I drive the design, development, and modernization of systems that process massive volumes of data to detect anomalies and potential misuse of Amex products. Our application uses metadata-driven rules to scan big data for suspicious patterns, triggering alerts for risk events.
  • I lead a high-performing team consisting of 10+ Big Data and Full Stack engineers, overseeing big data backend processing, and an interactive web UI that enables dynamic rule configuration. I'm currently spearheading our cloud modernization journey to GCP, leveraging tools like BigQuery, Dataproc, GCS, Cloud Composer (Airflow), and PySpark to scale our analytical processing capabilities.
  • Key responsibilities include:
  • 1. Leading architecture and solution design for data pipelines with an enterprise-wide perspective.
  • 2. Driving development of a custom rule engine to allow business users to write flexible, Bigquery/PySpark-compatible rules on GCP data infrastructure.
  • 3. Writing rule business logic using Pyspark, Bigquery, Hive that identifies potential anomalies in product usage pattern.
  • 4. Building real-time monitoring dashboards for rule execution and system health.
  • 5. Interfacing with business and product stakeholders to refine requirements and convert them into actionable engineering tasks.
  • 6. Providing technical leadership through code reviews, task scoping, and mentoring team members to foster a strong engineering culture.
  • My role balances hands-on engineering, strategic planning, and cross-functional collaboration to ensure robust, scalable solutions that protect the integrity of American Express products and data.
PythonBigQueryPySparkAirflowData EngineeringData Pipelines+2

Data Engineer 2

Oct 2022Aug 2024 · 1 yr 10 mos

  • Technical Lead for Anti Money Laundering Transaction Monitoring Application.

Oracle

2 roles

Cloud Consultant

Promoted

Sep 2022Oct 2022 · 1 mo

  • As a Data Engineer at Oracle, I contributed to the design and development of large-scale data processing solutions and business intelligence systems for global financial clients.
  • My work focused on transforming enterprise data into actionable insights through a custom in-house ETL platform, driving critical reporting and operational efficiency.
  • I worked hands-on with a range of technologies including SQL, PL/SQL, Python, Hive, Spark, Scala, and HDFS within the Hadoop ecosystem, delivering solutions that handled high data volumes and supported regulatory and business reporting.
  • Key responsibilities and contributions:
  • 1. Developed robust SQL scripts and PL/SQL objects (packages, functions, procedures) to extract and transform data from legacy systems for enterprise reporting.
  • 2. Built transformation logic, filters, and lookups to apply complex business rules and prepare data for visualization and analytics use cases using Hive and Pyspark.
  • 3. Played a core role in developing and testing a high-revenue-generating in-house ETL application, improving data accuracy and pipeline efficiency.
  • 4. Delivered custom BI Publisher and OTBI reports on Oracle Cloud, tailored to evolving business requirements and compliance needs.
  • 5. Collaborated within an Agile Scrum team, actively participating in sprints, backlog grooming, and cross-functional development to ensure timely delivery of features.
  • This experience gave me a strong foundation in enterprise data engineering, combining big data technologies with domain knowledge in financial systems and cloud-based business intelligence.
SQLPL/SQLPythonHiveSparkScala+2

Senior Cloud Analyst

Jul 2021Sep 2022 · 1 yr 2 mos

Infosys

3 roles

Senior Systems Engineer

Promoted

Jan 2021Jul 2021 · 6 mos

  • At Infosys, I played a key role in developing and implementing data-driven solutions for global clients in the finance and retail sectors. My responsibilities included designing and building scalable ETL pipelines, data migration solutions, and automating critical data workflows to support business intelligence and operational efficiency.
  • I utilized a diverse tech stack including Spark, Hive, Sqoop, SQL, PL/SQL, Python, and ETL to ensure seamless data transformation, migration, and integration across systems. I was also deeply involved in automating processes to streamline data operations and enhance data accessibility for end-users.
  • Key responsibilities and contributions:
  • 1. Developed and optimized ETL pipelines for large-scale data integration, using Spark, Hive, and Python to process, clean, and transform data efficiently.
  • 2. Worked on data migration projects, ensuring the smooth transfer of data across legacy and modern systems using SQL, PL/SQL, and Sqoop.
  • 3. Designed and implemented business logic transformations, including lookup, filter, and aggregation operations, to ensure accurate data for business intelligence.
  • 4. Collaborated with clients to gather requirements and translated them into scalable data solutions that aligned with business objectives.
  • 5. Participated in Agile Scrum methodology, ensuring timely delivery of project milestones and delivering high-quality solutions within sprint cycles.
  • 6. Enhanced data reporting and analysis capabilities by developing custom tools and scripts for data extraction and reporting in SQL and PL/SQL.
  • This role allowed me to develop my expertise in data engineering, working with a variety of big data tools and solutions while gaining invaluable experience in data migration, ETL development, and business logic implementation.
SparkHiveSQLPL/SQLPythonETL+2

Systems Engineer

Mar 2019Jan 2021 · 1 yr 10 mos

  • Infosys Certified Agile Developer.

Systems Engineer Trainee

Nov 2018Mar 2019 · 4 mos

  • Completed the Foundation Training program with a speciality in Python and SQL with a 91% overall score. Conferred "High Performer" tag for performance in training.

Samsung electronics

Intern

Aug 2018Oct 2018 · 2 mos · Noida Area, India

  • Operated as part of "Smart TV Europe Running Change" team at Samsung Research and Development Institute - Delhi.

Education

Galgotias College of Engineering and Technology

Bachelor of Technology (B.Tech.) (Hons.) — Electronics and Communications Engineering

Jan 2014Jan 2018

Stackforce found 100+ more professionals with Data Engineering & Cloud Technologies

Explore similar profiles based on matching skills and experience