Venkatesh Sami

Data Engineer

Hyderabad, Telangana, India9 yrs 6 mos experience

Highly Stable

Key Highlights

8+ years of experience in data engineering across multiple industries.
Expert in building scalable data pipelines and dashboards.
Proven leadership in mentoring junior engineers and optimizing processes.

Stackforce AI infers this person is a Data Engineering expert in Analytics and Big Data solutions.

Contact

Skills

Core Skills

Data EngineeringBig Data Analytics

Other Skills

AirflowAmazon Elastic MapReduce (EMR)Amazon S3Apache AirflowApache LivyApache Spark StreamingApache SqoopClouderaDatabricks ProductsDockerGithubHQLHiveKubernetesLogi Composer

About

•Proficient data engineer with 8+ years of experience in the airline, finance, mobility, and media industry improving business and operational processes by developing useful metrics and benchmarks for tracking • Good understanding of Spark Architecture including Spark Core, Spark Streaming, NIFI, Kafka, Livy, Airflow, Scala, Python, AWS EMR, S3, and EC2 • Creation of NRT streaming pipeline handles 8000TPS stores 2TB of raw data daily using NIFI, Kafka, Spark, and Hive • Led a team of 4 people and provided technical mentorship to junior teammates, code reviews as well as enforced best coding practices • Integrated Hadoop and Spark into Traditional ETL, accelerating the extraction, transformation, and loading of structured and Unstructured data • Responsible for creating conceptual data models and data flows • Experience in working Agile/SCRUM, Lead daily stand-ups, and scrum ceremonies

Experience

9 yrs 6 mos

Total Experience

2 yrs 5 mos

Average Tenure

2 yrs 1 mo

Current Experience

Thoughtworks

Senior Data Engineer

Apr 2024 – Present · 2 yrs 1 mo · Hyderabad, Telangana, India

Jio

Senior Data Engineer

Mar 2021 – Apr 2024 · 3 yrs 1 mo · Bengaluru, Karnataka, India

COE Analytics team enables businesses to make data-driven decisions and to bring forth the projects that we have successfully implemented with our partners. This team has been pivotal in growing various internal businesses like Connectivity, Energy, Retail, Media, and Hospital
Responsibilities:
Analyse and translate business need into data models by evaluating existing data systems
Design and Implemented spark jobs using Scala/Python and Spark SQL for interactive queries, faster processing
Implemented performance optimization techniques to handle large volumes of data in HQL and Spark jobs
Job scheduling in Rundeck/Airflow through CI/CD pipeline
Executed Notification campaign (SMS/ WhatsApp) using Spark-integrated Rest API framework
Design and Implemented several KPI jobs which helped the business to recognize patterns and adjust in their strategy to reach objectives
Built several KPI dashboards using Zoomdata and Logi Composer for business monitoring
Responsible for Requirement gathering and understanding, Design, Coding, Unit Testing, and Deployment

SparkScalaPythonHQLRundeckAirflow+4

Oracle financial services software limited

Application Developer

Oct 2019 – Feb 2021 · 1 yr 4 mos · Bangalore

Project Description:
Oracle Funds Transfer Pricing (FTP) is the industry-standard software application for implementing a matched rate transfer pricing system. Recognizing the value of matched rate transfer pricing, financial institutions are increasingly incorporating it into their performance measurement systems, Funds Transfer Pricing calculates transfer rates at the lowest possible level of detail in your institution's balance sheet, the instrument record level
Responsibilities:
Load data from RDBMS to spark using spark load operations and used dialect for data conversions
Implemented spark using Scala and Spark SQL for interactive queries, faster processing and testing of data
Used Dataframe and Dataset operations for data processing and manipulation
Implemented Apache Livy REST interface for interacting with other Spark applications and perform tasks
Implemented performance optimization techniques to handle large volumes of data
Responsible for Deploying and maintain the Cloudera 5.1 cluster
Responsible for Requirement gathering and understanding, Design, Coding and Unit Testing

SparkScalaSpark SQLApache LivyClouderaData Engineering+1

Hewlett packard enterprise

2 roles

Big Data Developer

Promoted

Jul 2017 – Oct 2019 · 2 yrs 3 mos · Bengaluru Area, India

Responsibilities :
 Used Sqoop to efficiently transfer data between databases and HDFS
 Involved in creating Hive tables, loading and analysing data using hive queries
 Involved in storing data into HBase, which will be used for analysis
 Created partitions, buckets in Hive to handle structured data
 Implemented spark using Scala and Spark SQL for faster testing and processing of data
 Used Spark to load data and create schema RDD and loaded data into hive tables
 Involved in converting Hive/SQL queries into spark transformations using Spark RDD’s and Scala
 Used Spark for interactive queries, processing of streaming data
 Responsible for troubleshooting and resolving the issues

SqoopHiveSparkScalaData EngineeringBig Data Analytics

Application Developer

Sep 2016 – Jun 2017 · 9 mos · Bengaluru Area, India

Responsibilities
 Understanding the user stories and implementing the solution on time with zero acceptance defects
 Design, Development and Testing during project
 Delivering products on time and with zero acceptance defects
 Onsite-Offshore coordination