Sujith Reddy Pelleti

CTO

Bengaluru, Karnataka, India · 13 yrs 7 mos experience

Highly Stable

Key Highlights

Expert in building scalable data lakes and warehouses.
Proven track record in data-driven decision-making.
Strong leadership in engineering and data integration.

Stackforce AI infers this person is a Data Engineering expert with extensive experience in E-commerce and Supply Chain industries.

Skills

Core Skills

Data Engineering · Data Science · Integration · ETL · BI Solutions

Other Skills

Amazon Web Services (AWS) · Apache Airflow · Apache Pig · Apache Spark · Big Data · Business Intelligence · Business Intelligence Tools · C · C++ · Celery · CircleCI · Communication · Data Migration · Data Modeling · Data Quality

About

Data and Business Intelligence Engineer with around 11.5 years of experience in creating and maintaining multi-petabyte-scale data lakes, data warehousing solutions, and data pipelines.

Areas of expertise:
• Requirements gathering
• Client interaction
• Data modeling
• Data visualization
• ETL
• Self-service reporting and BI
• Data analysis
• Technical design
• Development and implementation
• Process adherence

Technical proficiencies:
Platforms: Qubole, Hadoop, Hive, Spark, AWS, EMR, MSBI
Databases: Redshift, Teradata, SQL Server, Snowflake, MySQL
Languages: SQL, T-SQL, Python, Teradata BTEQ, Loaders (TPT)
BI Tools: SSIS (ETL), Looker, SSRS, Tableau, MicroStrategy, Power BI, InetSoft
OLAP cubes: SSAS
Orchestration: Azkaban, Airflow

Experience

Total Experience: 13 yrs 7 mos

Average Tenure: 2 yrs 4 mos

Current Experience: 1 yr 9 mos

Caring

Engineering leader

Sep 2024 – Present · 1 yr 9 mos · India · Remote

Building and leading the Engineering team in India, with a focus on:
Engineering Leadership
Data Architecture and Data Quality
Enterprise Integrations

Thrasio

2 roles

Engineering Manager

Apr 2024 – Sep 2024 · 5 mos · Remote

Led a team to develop an automated stock replenishment system ensuring optimal inventory levels in Amazon FBA warehouses.
Minimized stockouts by implementing data-driven forecasting models to predict replenishment needs.
Reduced storage costs by strategically managing inventory between FBA and cost-effective 3PL warehouses.
Automated the creation of Purchase Orders (POs) & Transfer Orders (TOs) in ERP and 3PL systems, improving operational efficiency.
Leveraged Data Science models for demand forecasting, optimizing stock availability and fulfilment efficiency.

Data Engineering · Data Science · Inventory Management · Forecasting Models · Operational Efficiency

Staff Data Engineer

Jul 2021 – Sep 2024 · 3 yrs 2 mos · Remote

Led a team of Data & Software Engineers to automate ERP to 3PL data reconciliation, enhancing supply chain efficiency.
Managed the entire Supply Chain Integrations & Engineering team, optimizing data pipelines and system architectures.
Reduced MuleSoft licensing costs from $1M to $400K by rearchitecting integration logic.
Enhanced integration reliability with proactive monitoring, alerting, and an on-call process for failures.
Built Python-based data connectors & reusable templates to streamline third-party API ingestion into the Data Lake.
Optimized Airflow performance by evaluating Kubernetes pod startup delays and migrating resource-intensive jobs to Celery.
Led POCs for data catalog & lineage tools, identifying optimal solutions for enterprise data governance.
Designed a rule-based Python engine for Data Quality, Monitoring, and Availability across DWH and Data Lake (AWS S3).
Developed a Docker-based DBT development environment, eliminating the need for DBT Cloud licenses and cutting costs.
Implemented Snowflake zero-copy cloning so developers can seamlessly test in DBT environments.
Created real-time monitoring solutions for DBT pipelines to enable faster failure resolutions.
Designed an aggregated error alerting system with user tagging, improving accountability for ERP business errors.
Improved the SLA for critical datasets from 36 hours to 12 hours by leveraging Snowflake external tables and Snowpipe.
Built a one-click CI/CD pipeline for DBT and Spark-based workflows, improving deployment efficiency.
Led SOX-compliant Snowflake, AWS, and DBT implementations using Terragrunt for IaC.

Data Engineering · Integration · Data Quality · Monitoring · Python

Amazon

Data Engineer 2

May 2020 – Jul 2021 · 1 yr 2 mos · Bengaluru, Karnataka

● Revamped the Amazon Advertisements data warehouse by creating a dimensional data model for all advertisement entities in petabyte-scale Amazon Redshift.
● Improved the SLA for critical datasets from 36 hours to 12 hours by identifying bottlenecks in the existing pipelines, moving the pipelines closer to the source, and optimizing them.
● Implemented a rule-based Data Quality framework in Python to automate quality checks.
● Created Spark-based ETL pipelines to handle complex data types and optimized the storage formats.
● Improved the Advertiser Success team's process by creating a Recommendation Impact Analysis data model and data pipelines that support better decisions on campaigns and keyword-related data.
● Migrated the Salesforce-to-Redshift data pipelines from Informatica to the AWS-native service AppFlow, saving up to 150k USD per year in licensing.

Data Engineering · ETL · Data Quality · Data Modeling · Python

Expedia Group

Data Engineer at Hotels.com

Apr 2018 – May 2020 · 2 yrs 1 mo · Bengaluru, Karnataka, India

Development and maintenance of petabyte-scale data lake and data warehouse solutions by creating scalable data pipelines
Creating Data pipelines and ETL solutions for Bookings and clickstream data on on-premise infrastructure using Teradata, MSBI, Hadoop, Hive, Sqoop, TPT, TDCH, and Azkaban
Data migration from legacy systems like Teradata, SQL Server and Hadoop to AWS cloud systems using Qubole, PySpark, Kinesis, Firehose, Hive and Snowflake, and building self-service BI solutions using Looker
Migration of ETL and Data pipelines process from on-premise to AWS cloud using EMR, Spark, Hive, S3, AWS Boto3 and Airflow
Creation of unit and integration tests to contribute to the CI/CD pipeline setup

Data Engineering · ETL · Data Migration · Self-service BI · Communication

Altisource

Software Engineer

Feb 2016 – Mar 2018 · 2 yrs 1 mo

Developed data pipelines using Cloudera Hadoop, Sqoop, Hive, Pig and MySQL for landing data into the enterprise data lake for products serving mortgage and real estate customers
Developed an enterprise-wide Reporting as a Service framework using InetSoft, SSRS, Tableau and MicroStrategy for visualizations and business-critical reports
Created a self-service BI framework for business users to create their own reports and visualizations and to automate delivery schedules
Migration of data from relational databases to object-oriented databases by converting structured data to complex nested JSON documents using Python
Created ETL pipelines to convert complex JSON data to relational data

Data Engineering · ETL · Reporting · Data Migration · Communication

UnitedHealth Group

Associate Application Developer

Sep 2012 – Jan 2016 · 3 yrs 4 mos · Bengaluru Area, India

Created BI and reporting solutions for healthcare data using the MSBI stack (SQL Server and SSRS)
Developed ETL solutions using SSIS and T-SQL stored procedures, and OLAP cubes using SSAS

BI Solutions · ETL · Reporting · Communication