Sujith Reddy Pelleti

CTO

Bengaluru, Karnataka, India13 yrs 7 mos experience
Highly Stable

Key Highlights

  • Expert in building scalable data lakes and warehouses.
  • Proven track record in data-driven decision-making.
  • Strong leadership in engineering and data integration.
Stackforce AI infers this person is a Data Engineering expert with extensive experience in E-commerce and Supply Chain industries.

Contact

Skills

Core Skills

Data EngineeringData ScienceIntegrationEtlBi Solutions

Other Skills

Amazon Web Services (AWS)Apache AirflowApache PigApache SparkBig DataBusiness IntelligenceBusiness Intelligence ToolsCC++CeleryCircleCICommunicationData MigrationData ModelingData Quality

About

Data and Business Intelligence Engineer with around 11.5 years of experience in creating and maintaining Multi petabyte scale Data lakes and Data warehousing solutions and Data Pipelines. Areas of expertise: • Requirements gathering. • Client interaction • Data Modeling • Data Visualizations • ETL • Self-service Reporting and BI • Data analysis. • Technical design. • Development and implementation • Process Adherence Technical proficiencies: Platforms: Qubole, Hadoop, HIVE, Spark, AWS, EMR, MSBI Databases: Redshift, Teradata, SQL Server, Snowflake, Mysql Languages: SQL, TSQL, Python, Teradata bteq, Loaders (TPT) BI Tools: SSIS (ETL), Looker, SSRS, Tableau, Microstrategy, Power BI, Inetsoft OLAP cubes: SSAS Orchestration: Azkaban, Airflow

Experience

13 yrs 7 mos
Total Experience
2 yrs 4 mos
Average Tenure
1 yr 9 mos
Current Experience

Caring

Engineering leader

Sep 2024Present · 1 yr 9 mos · India · Remote

  • Building and leading the Engineering team in India, with a focus on:
  • Engineering Leadership
  • Data Architecture and Data Quality
  • Enterprise Integrations

Thrasio

2 roles

Engineering Manager

Apr 2024Sep 2024 · 5 mos · Remote · Remote

  • Led a team to develop an automated stock replenishment system ensuring optimal inventory levels in Amazon FBA warehouses.
  • Minimized stockouts by implementing data-driven forecasting models to predict replenishment needs.
  • Reduced storage costs by strategically managing inventory between FBA and cost-effective 3PL warehouses.
  • Automated the creation of Purchase Orders (POs) & Transfer Orders (TOs) in ERP and 3PL systems, improving operational efficiency.
  • Leveraged Data Science models for demand forecasting, optimizing stock availability and fulfilment efficiency.
Data EngineeringData ScienceInventory ManagementForecasting ModelsOperational Efficiency

Staff Data Engineer

Jul 2021Sep 2024 · 3 yrs 2 mos · Remote · Remote

  • Led a team of Data & Software Engineers to automate ERP to 3PL data reconciliation, enhancing supply chain efficiency.
  • Managed the entire Supply Chain Integrations & Engineering team, optimizing data pipelines and system architectures.
  • Reduced Mulesoft licensing costs from $1M to $400K by rearchitecting integration logic.
  • Enhanced integration reliability with proactive monitoring, alerting, and an on-call process for failures.
  • Built Python-based data connectors & reusable templates to streamline third-party API ingestion into the Data Lake.
  • Optimized Airflow performance by evaluating Kubernetes pod startup delays and migrating resource-intensive jobs to Celery.
  • Led POCs for data catalog & lineage tools, identifying optimal solutions for enterprise data governance.
  • Designed a rule-based Python engine for Data Quality, Monitoring, and Availability across DWH and Data Lake (AWS S3).
  • Developed a Docker-based DBT development environment, eliminating the need for DBT Cloud licenses and cutting costs.
  • Implemented Snowflake Zero Clone Copy for developers to seamlessly test DBT environments.
  • Created real-time monitoring solutions for DBT pipelines to enable faster failure resolutions.
  • Designed an aggregated error alerting system with user tagging, improving accountability for ERP business errors.
  • Enhanced SLA for critical datasets from 36 hours to 12 hours by leveraging Snowflake external tables & Snowpipes.
  • Built a one-click CI/CD pipeline for DBT and Spark-based workflows, improving deployment efficiency.
  • Led SOX-compliant Snowflake, AWS, and DBT implementations using Terragrunt for IaC.
Data EngineeringIntegrationData QualityMonitoringPython

Amazon

Data Engineer 2

May 2020Jul 2021 · 1 yr 2 mos · Bengaluru, Karnataka

  • ● Revamping the Amazon Advertisements Datawarehouse by creating a Dimensional data
  • model for all the advertisement entities in Petabyte scale Amazon Redshift.
  • ● Improved the SLA for critical datasets from 36 hours to 12 hours by moving the data
  • pipelines by identifying the bottlenecks in current pipelines and moving them closer to the
  • source and optimizing the data pipelines.
  • ● Implement a Rule-based Data Quality Framework in Python to automate the quality checks
  • in Python.
  • ● Created Spark Based ETL pipelines to handle complex data types and optimised the
  • storage formats.
  • ● Improved the process of the Advertiser success team by creating Recommendation Impact
  • Analysis Data model and Data pipelines to help them make better decisions related to
  • campaigns and keyword-related data.
  • ● Migrated the Salesforce to Redshift Data pipelines from Informatica to AWS Native service
  • AppFlow, resulting in savings of up to 150k USD per year in licensing.
Data EngineeringETLData QualityData ModelingPython

Expedia group

Data Engineer at Hotels.com

Apr 2018May 2020 · 2 yrs 1 mo · Bengaluru, Karnataka, India

  • Development and maintenance of Petabyte scale Date lake and Data warehouse solutions by creating scalable data pipelines
  • Creating Data pipelines and ETL solutions for Bookings and clickstream data on on-premise infrastructure using Teradata, MSBI, Hadoop, Hive, Sqoop, TPT, TDCH, and Azkaban
  • Data Migration from Legacy systems like Teradata, SQL Server and Hadoop to AWS cloud systems using Qubole, Pyspark, Kinesis, Firehose, Hive and Snowflake and building Self-service BI solutions using Looker
  • Migration of ETL and Data pipelines process from on-premise to AWS cloud using EMR, Spark, Hive, S3, AWS Boto3 and Airflow
  • Creation of Unit tests, Integration tests to contribute to CICD pipeline setup
Data EngineeringETLData MigrationSelf-service BICommunication

Altisource

Software Engineer

Feb 2016Mar 2018 · 2 yrs 1 mo

  • Developed Data Pipelines using Cloudera Hadoop, Sqoop, Hive, Pig and Mysql for landing data into Enterprise Data lake for products serving Mortgage and Real Estate customers
  • Developed Enterprise-wide Reporting as a Service framework using Inetsoft, SSRS, Tableau and Microstrategy for visualizations and Business critical reports
  • Created Self-service BI framework for business users to create their own reports and visualizations and automated delivery schedules
  • Migration of data from Relational databases to Object-oriented databases by converting structured data to complex nested JSON documents using python
  • Created ETL pipelines to convert complex JSON data to relational data
Data EngineeringETLReportingData MigrationCommunication

Unitedhealth group

Associate Application Developer

Sep 2012Jan 2016 · 3 yrs 4 mos · Bengaluru Area, India

  • Created BI and Reporting solutions for Health care data using SQL Server and SSRS using MSBI
  • Developed ETL solutions using SSIS, TSQL Stored procedures and OLAP cubes using SSAS
BI SolutionsETLReportingCommunication

Education

Mahatma Gandhi Institute of technology

Bachelor of Technology (B.Tech) — Computer Science

Jan 2008Jan 2012

narayana junior college

Jan 2006Jan 2008

Stackforce found 100+ more professionals with Data Engineering & Data Science

Explore similar profiles based on matching skills and experience