Supriya P — Data Engineer
Data Engineer with over 9 years of extensive experience in designing, implementing, and maintaining data infrastructure solutions. experience in developing, implementing, and optimizing data pipelines on AWS cloud platform. Proficient in AWS data services and passionate about leveraging cloud technologies to drive business insights and decision-making. Adept at collaborating with cross-functional teams to deliver scalable and efficient data pipelines. Implemented ETL processes on AWS using AWS Glue to extract, transform, and load data from various sources into Amazon S3 and data warehouses like Amazon Redshift. Developed and maintained data pipelines for real-time data streaming and processing using Amazon Kinesis and AWS Lambda. Implemented and optimized data processing tasks on Amazon EMR using Apache Spark for distributed computing. Utilized Python and Scala for scripting and automation tasks, optimizing data processing. Developed Spark applications to perform large-scale data processing and analysis, leveraging Spark SQL for complex transformations and aggregations. Designed and optimized data models to ensure efficient storage, retrieval, and querying of data in Snowflake and AWS Redshift. Implemented data quality checks and monitoring processes to ensure the integrity and reliability of the data pipeline. Designed and implemented scalable data pipelines using PySpark to process and transform large datasets. Integrated data from various sources, such as databases, APIs, and file systems, into the data processing pipeline Scheduled and managed PySpark jobs using tools like Apache Airflow, AWS Glue, or other orchestration frameworks. Leveraged AWS S3 for data storage and AWS Redshift for scalable data warehousing solutions, optimizing performance and cost-effectiveness. Implemented end-to-end data pipelines using Informatica Intelligent Cloud Services (IICS) for seamless integration of disparate data sources into Snowflake. Designed and developed ETL workflows using Informatica PowerCenter to extract, transform, and load data from various on-premises and cloud-based sources. Designed, implemented, and maintained relational databases such as PostgreSQL, MySQL, or SQL Server to store structured data for enterprise applications. Developed PL/SQL scripts for data manipulation, transformation, and validation in Oracle databases.
Stackforce AI infers this person is a Fintech Data Engineer specializing in fraud detection and data integration.
Location: Irving, Texas, United States
Experience: 9 yrs 9 mos
Skills
- Extract, Transform, Load (etl)
- Snowflake
- Data Integration
- Etl Processes
Career Highlights
- Over 9 years of experience in data engineering.
- Expert in designing scalable data pipelines on AWS.
- Proven track record in fraud detection and data integration.
Work Experience
Mastercard
Data Engineer (2 yrs 4 mos)
BMW Group
Senior Associate (1 yr 5 mos)
Bank of America
Senior Software Engineer (6 yrs)
Education
Bachelor of Technology - BTech at Jawaharlal Nehru Technological University Hyderabad (JNTUH)