Venkatesh Sami

Data Engineer

Hyderabad, Telangana, India · 9 yrs 5 mos experience
Highly Stable

Key Highlights

  • 8+ years of experience in data engineering across multiple industries.
  • Expert in building scalable data pipelines and dashboards.
  • Proven leadership in mentoring junior engineers and optimizing processes.
Stackforce AI infers this person is a Data Engineering expert in Analytics and Big Data solutions.

Skills

Core Skills

Data Engineering, Big Data Analytics

Other Skills

Airflow, Amazon Elastic MapReduce (EMR), Amazon S3, Apache Airflow, Apache Livy, Apache Spark Streaming, Apache Sqoop, Cloudera, Databricks Products, Docker, GitHub, HQL, Hive, Kubernetes, Logi Composer

About

  • Proficient data engineer with 8+ years of experience in the airline, finance, mobility, and media industries, improving business and operational processes by developing useful metrics and benchmarks for tracking
  • Good understanding of Spark architecture, including Spark Core and Spark Streaming, along with NiFi, Kafka, Livy, Airflow, Scala, Python, AWS EMR, S3, and EC2
  • Built a near-real-time (NRT) streaming pipeline that handles 8,000 TPS and stores 2 TB of raw data daily using NiFi, Kafka, Spark, and Hive
  • Led a team of 4, providing technical mentorship to junior teammates, conducting code reviews, and enforcing coding best practices
  • Integrated Hadoop and Spark into traditional ETL, accelerating the extraction, transformation, and loading of structured and unstructured data
  • Responsible for creating conceptual data models and data flows
  • Experienced in Agile/Scrum; led daily stand-ups and Scrum ceremonies

Experience

Thoughtworks

Senior Data Engineer

Apr 2024 - Present · 1 yr 11 mos · Hyderabad, Telangana, India

Jio

Senior Data Engineer

Mar 2021 - Apr 2024 · 3 yrs 1 mo · Bengaluru, Karnataka, India

  • The COE Analytics team enables businesses to make data-driven decisions and showcases the projects it has successfully implemented with partners. The team has been pivotal in growing internal businesses such as Connectivity, Energy, Retail, Media, and Hospital
  • Responsibilities:
  • Analysed and translated business needs into data models by evaluating existing data systems
  • Designed and implemented Spark jobs using Scala/Python and Spark SQL for interactive queries and faster processing
  • Implemented performance optimization techniques to handle large volumes of data in HQL and Spark jobs
  • Scheduled jobs in Rundeck/Airflow through a CI/CD pipeline
  • Executed notification campaigns (SMS/WhatsApp) using a Spark-integrated REST API framework
  • Designed and implemented several KPI jobs that helped the business recognize patterns and adjust its strategy to reach objectives
  • Built several KPI dashboards using Zoomdata and Logi Composer for business monitoring
  • Responsible for requirement gathering and understanding, design, coding, unit testing, and deployment
Spark, Scala, Python, HQL, Rundeck, Airflow (+4 more)

Oracle Financial Services Software Limited

Application Developer

Oct 2019 - Feb 2021 · 1 yr 4 mos · Bangalore

  • Project Description:
  • Oracle Funds Transfer Pricing (FTP) is the industry-standard software application for implementing a matched rate transfer pricing system. Recognizing the value of matched rate transfer pricing, financial institutions are increasingly incorporating it into their performance measurement systems. Funds Transfer Pricing calculates transfer rates at the lowest possible level of detail in an institution's balance sheet: the instrument record level
  • Responsibilities:
  • Loaded data from an RDBMS into Spark using Spark load operations, with a dialect for data conversions
  • Implemented Spark jobs using Scala and Spark SQL for interactive queries, faster processing, and testing of data
  • Used DataFrame and Dataset operations for data processing and manipulation
  • Implemented the Apache Livy REST interface to interact with other Spark applications and perform tasks
  • Implemented performance optimization techniques to handle large volumes of data
  • Responsible for deploying and maintaining the Cloudera 5.1 cluster
  • Responsible for requirement gathering and understanding, design, coding, and unit testing
Spark, Scala, Spark SQL, Apache Livy, Cloudera, Data Engineering (+1 more)

Hewlett Packard Enterprise

2 roles

Big Data Developer

Promoted

Jul 2017 - Oct 2019 · 2 yrs 3 mos · Bengaluru Area, India

  • Responsibilities:
  • Used Sqoop to efficiently transfer data between databases and HDFS
  • Created Hive tables, then loaded and analysed data using Hive queries
  • Stored data in HBase for later analysis
  • Created partitions and buckets in Hive to handle structured data
  • Implemented Spark jobs using Scala and Spark SQL for faster testing and processing of data
  • Used Spark to load data, create schema RDDs, and load data into Hive tables
  • Converted Hive/SQL queries into Spark transformations using Spark RDDs and Scala
  • Used Spark for interactive queries and processing of streaming data
  • Responsible for troubleshooting and resolving issues
Sqoop, Hive, Spark, Scala, Data Engineering, Big Data Analytics

Application Developer

Sep 2016 - Jun 2017 · 9 mos · Bengaluru Area, India

  • Responsibilities:
  • Understood user stories and implemented solutions on time with zero acceptance defects
  • Design, development, and testing during the project
  • Delivered products on time and with zero acceptance defects
  • Onsite-offshore coordination

Education

Saveetha School of Engineering

Bachelor of Technology (B.Tech.), Electronics and Communication Engineering

Jan 2012 - Jan 2016
