Sai Krishna G.

Data Engineer

Hyderabad, Telangana, India · 9 yrs experience
Most Likely To Switch · AI Enabled

Key Highlights

  • Led migrations from AWS Glue to EMR.
  • Architected end-to-end pipeline solutions processing 10TB+ daily data.
  • Implemented dbt frameworks increasing code reusability by 70%.
Stackforce AI infers this person is a Data Engineering expert with a focus on AWS and Machine Learning in the SaaS and Fintech sectors.

Skills

Core Skills

Data Engineering · AWS · Machine Learning · Data Analysis · Web Development · Data Visualization

Other Skills

Snowflake · Apache Spark · Amazon Web Services (AWS) · Amazon Athena · AWS Glue · AWS Identity and Access Management (AWS IAM) · Apache Airflow · SQL · Looker · Tableau · Big Data · Amazon Elastic MapReduce (EMR) · Data Build Tool (dbt) · Databricks · Snowflake Cloud

About

I'm a Senior Data Engineer with 9 years of experience specializing in building robust data pipelines and architectures across AWS, Snowflake, and various big data technologies. I've worked with notable organizations including Capital One, Meta (Facebook), and currently Victory Live, where I've led migrations from AWS Glue to EMR, architected end-to-end pipeline solutions processing 10TB+ daily data, and implemented dbt frameworks that increased code reusability by 70%. My background combines technical expertise in Python, Spark, SQL optimization, and cloud technologies with business analytics knowledge from my Master's degree from UT Dallas.

Experience

Total Experience: 9 yrs
Average Tenure: 1 yr 6 mos
Current Experience: 2 yrs 2 mos

Victory Live

Senior Data Engineer

Mar 2024 – Present · 2 yrs 2 mos · Hyderabad, Telangana, India · Hybrid

Snowflake · Apache Spark · Amazon Web Services (AWS) · Amazon Athena · AWS Glue · AWS Identity and Access Management (AWS IAM) · +10

Capital One

Senior Data Engineer

Jun 2022 – Oct 2023 · 1 yr 4 mos · Dallas, Texas, United States

Snowflake Cloud · Pandas (Software) · Testing · Databricks Products · AWS Command Line Interface (CLI) · Amazon Elastic MapReduce (EMR) · +5

Meta

Data Engineer

Oct 2019 – Jun 2022 · 2 yrs 8 mos · San Francisco Bay Area

SQL · Hive · Apache Spark · Python · Tableau · Machine Learning · +6

American Heart Association

Data Scientist (Consulting Practicum)

Jan 2019 – May 2019 · 4 mos · Dallas-Fort Worth Metroplex

  • shopheart.org (American Heart Association)
  • Segmented the customer base using K-means clustering (Python) and RFM analysis to understand consumer behavior across channels, and developed an XGBoost model to predict the probability of customer purchase with 96% accuracy for targeted marketing.
  • Developed predictive models (GLM, SVM, decision trees) to predict customer churn and identify the factors that contribute to it. Estimated customer lifetime value for different segments using survival analysis to find optimal spend.
  • Built and implemented a random forest model to predict zero-dollar walkers (ZDW) to reduce costs associated with fundraising by 2.1 million USD, and evaluated the important factors that help decrease ZDW.

The University of Texas at Dallas

Graduate Student

Aug 2017 – May 2019 · 1 yr 9 mos · Dallas-Fort Worth Metroplex

  • Master's in Business Analytics
  • Activities and Societies: Member of the Data Science and Big Data Club
  • Projects:
  • Recommendation Engine – MovieLens (Scala, Spark MLlib, Apache Zeppelin, AWS EMR)
  • Built a movie recommendation engine using collaborative filtering on the MovieLens dataset with 10 million ratings on an Amazon EMR cluster using Spark MLlib. Used Alternating Least Squares (ALS) and matrix factorization (MF) to make personalized recommendations for the users in the dataset.
  • Credit Card Fraud Detection – Kaggle Competition (Pandas, NumPy, Seaborn, Matplotlib, Keras, TensorFlow)
  • Built a deep learning model to detect and classify incoming credit card transactions as fraudulent or genuine with 99% accuracy. Used sampling techniques such as SMOTE to increase recall, since the data has highly imbalanced classes (500 fraud transactions out of 285,000 transactions).
  • Document Classification Web Application – NLP (Python, Flask, AWS Cloud, API, NLP, NLTK)
  • Designed and developed an end-to-end machine learning application for text-based document classification and deployed it to AWS as an API endpoint using Flask. Built a 3-layer neural network for text classification with a log loss of 0.34 and an F1 score of 0.86.
  • Data Visualization – World Bank (Excel, SQLite, Tableau Prep, Tableau Desktop)
  • Developed visualizations, dashboards, stories, and animations in Tableau; analyzed the reasons behind trends, patterns, and correlations; evaluated linear and panel regression; and documented findings as a video on the World Development Indicators dataset with 2 million records.

Tech Mahindra

Software Engineer (Data)

Jul 2015 – Aug 2017 · 2 yrs 1 mo · Hyderabad Area, India

Defence Research and Development Organisation (DRDO)

Intern

Jan 2014 – May 2014 · 4 mos · Hyderabad, Telangana, India

  • Developed a machine learning model using an anomaly detection algorithm to detect erroneous behavior.
  • Incorporated all the specified functional requirements and rigorously evaluated the model.

Education

The University of Texas at Dallas

Master's degree — Business Analytics

Jan 2017 – Jan 2019

Rajiv Gandhi University of Knowledge Technologies

Bachelor of Technology (Major) — Computer Science & Engineering
