Venkata Balijepalli

DevOps Engineer

Hyderabad, Telangana, India · 7 yrs 5 mos experience

Key Highlights

Expert in designing scalable data ecosystems.
Proven track record in cloud-based data architecture.
Strong background in big data analytics and ETL processes.

Stackforce AI infers this person is a Data Architect specializing in scalable cloud-based data solutions for the healthcare industry.

Contact

akhilvenkata01@gmail.com LinkedIn

Skills

Core Skills

Data Architecture, Cloud Computing, Data Engineering, Big Data Analytics, API Development

Other Skills

AWS CloudFormation, AWS Data Pipeline, AWS Identity and Access Management (AWS IAM), AWS Lambda, Accounting, Adobe Spark, Amazon EC2, Amazon Elastic MapReduce (EMR), Amazon Redshift, Amazon Relational Database Service (RDS), Amazon S3, Amazon Web Services (AWS), Apache Airflow, Apache Oozie, Apache Spark

About

As a Data Architect, I specialize in designing and implementing enterprise-grade data ecosystems that are scalable, secure, and built to power advanced analytics and AI-driven insights. At the Insurance Information Bureau of India, I architect cloud-based data platforms and secure pipelines that serve as the foundation for intelligent decision-making across the organization. My work focuses on creating resilient, future-ready data architectures that align with business strategy and enable seamless scalability.

Previously at Guardant Health, I led initiatives to develop modular data architectures, API-driven integrations, and automated data flows, optimizing data accessibility, interoperability, and governance across systems. These efforts accelerated insight delivery and enhanced the organization’s ability to act on real-time data.

My technical expertise includes PySpark, Azure Fabric, and Databricks, where I design and optimize distributed data processing frameworks, orchestration workflows, and performance-tuned pipelines. I’m deeply passionate about translating complex business and technical requirements into robust, scalable architectural blueprints that enable data-driven transformation.

Experience

Total Experience: 7 yrs 5 mos
Average Tenure: 1 yr
Current Experience: 1 yr

Insurance Information Bureau of India

Lead Data Engineer/Architect

Jun 2025 – Present · 1 yr · Hyderabad, Telangana, India · On-site

Designed and implemented scalable ETL pipelines using Azure Data Factory, orchestrating data ingestion from diverse structured and unstructured sources into centralized data lakes and data warehouses.
Hands-on experience working with Microsoft Fabric to build end-to-end modern data platforms integrating Lakehouse architecture with enterprise-grade governance and security.
Developed efficient and reusable PySpark scripts for large-scale data processing and transformation across distributed computing environments using Apache Spark on Azure Synapse and Databricks.

Azure Data Factory, Azure Data Lake, Azure Data Studio, Azure Databricks, Azure Functions, Azure SQL, +17

Guardant Health

Lead Data Engineer

Aug 2024 – May 2025 · 9 mos · Palo Alto, California, United States · On-site

Wrote data normalization jobs for new data ingested into Redshift.
Wrote scripts and an indexing strategy for a migration to Redshift from SQL Server and MySQL databases.
Ingested data into the application using Hadoop technologies such as Pig and Hive.
Configured data loads from S3 into Redshift with AWS Data Pipeline.
Used JSON schemas to define table and column mappings from S3 data to Redshift.
Created EBS volumes to store application files and mounted them to EC2 instances as needed.
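
For illustration only, the JSON-based column mapping mentioned above could resemble the sketch below. It assumes the mapping is expressed as a Redshift JSONPaths file consumed by a COPY command; the table, column names, S3 paths, and role ARN are hypothetical and not taken from the original project.

```python
import json


def build_jsonpaths(column_to_path):
    """Return a JSONPaths document for Redshift COPY.

    The order of the paths must match the order of the COPY column list.
    """
    return json.dumps({"jsonpaths": list(column_to_path.values())}, indent=2)


def build_copy_statement(table, column_to_path, data_uri, jsonpaths_uri, iam_role):
    """Build a COPY statement that loads JSON from S3 via a JSONPaths file."""
    columns = ", ".join(column_to_path)
    return (
        f"COPY {table} ({columns}) "
        f"FROM '{data_uri}' "
        f"IAM_ROLE '{iam_role}' "
        f"FORMAT AS JSON '{jsonpaths_uri}';"
    )


# Hypothetical mapping: Redshift column -> JSONPath into each S3 record.
mapping = {
    "record_id": "$.id",
    "source": "$.meta.source",
    "created_at": "$.created_at",
}

print(build_jsonpaths(mapping))
print(
    build_copy_statement(
        "events",
        mapping,
        "s3://example-bucket/incoming/",
        "s3://example-bucket/config/events_jsonpaths.json",
        "arn:aws:iam::123456789012:role/example-redshift-load",
    )
)
```

Keeping the mapping in one dictionary means the column list and the JSONPaths file cannot drift out of order.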

AWS CloudFormation, Apache Spark Streaming, Apache Oozie, Apache Airflow, AWS Lambda, AWS Identity and Access Management (AWS IAM), +12

Fifth Third Bank

Senior Data Engineer

Nov 2023 – Jul 2024 · 8 mos · Cincinnati metropolitan area, Ohio, United States · On-site

Responsible for the design, development, and support of data solutions, APIs, tools, and processes to enable rapid delivery of business capabilities.
Acted as a technical expert on problems related to system and application design, performance, integration, and security.
Conducted research and development based on current trends and technologies in the banking industry, data engineering and architecture, data security, and related topics.

Google Cloud Platform (GCP), Google+, Hadoop, Google Docs, Hive, Azure Data Studio, +39

CommonSpirit Health

Senior Data Engineer

Sep 2022 – Oct 2023 · 1 yr 1 mo · Chicago metropolitan area, Illinois, United States · On-site

Used various AWS services including S3, EC2, Glue, Athena, Redshift, EMR, SNS, SQS, DMS, and Kinesis.
Strong understanding of Extract, Transform, Load (ETL) and Extract, Load, Transform (ELT) processes. Experience in integrating data from various sources into Snowflake.
Created and configured IAM roles for cross-account access, allowing secure and efficient collaboration between different AWS accounts.
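
As a rough sketch of the cross-account IAM setup described above, a role in one account can carry a trust policy that lets principals in another account assume it through `sts:AssumeRole`. The account ID, the external ID, and the helper name are hypothetical; the actual roles and policies used in this work are not given in the profile.

```python
import json


def cross_account_trust_policy(trusted_account_id, external_id=None):
    """Build a trust policy allowing another AWS account to assume a role."""
    statement = {
        "Effect": "Allow",
        # The ":root" principal delegates the decision to the trusted account's own IAM policies.
        "Principal": {"AWS": f"arn:aws:iam::{trusted_account_id}:root"},
        "Action": "sts:AssumeRole",
    }
    if external_id is not None:
        # Optional guard: callers must present this external ID when assuming the role.
        statement["Condition"] = {"StringEquals": {"sts:ExternalId": external_id}}
    return {"Version": "2012-10-17", "Statement": [statement]}


# Hypothetical trusted account ID.
print(json.dumps(cross_account_trust_policy("111122223333"), indent=2))
```

The resulting document would be attached as the role's trust relationship; permissions on the role itself are defined separately.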

Google Cloud Platform (GCP), Hadoop, Hadoop Administration, Hive, Data Mining, AWS CloudFormation, +28

Credit Suisse

Senior Big Data Engineer

Apr 2021 – Aug 2022 · 1 yr 4 mos · Raleigh, North Carolina, United States · On-site

Created and maintained database objects such as schemas, tables, views, and stored procedures in Snowflake.
Designed Data Quality Framework to perform schema validation and data profiling on Spark (PySpark).
Automated ingestion and consumption using Pig, HQL, shell scripts, and Python.
Used tools such as Datameer to validate HBase tables.
Developed custom ETL solutions, batch processing, and real-time data ingestion pipelines to move data in and out of Hadoop using PySpark and shell scripting.
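
The schema-validation part of a data quality framework like the one mentioned above can be sketched in plain Python. This is an illustrative simplification, not the framework itself: it assumes the actual schema arrives as `(column, type)` pairs, which is the shape PySpark exposes through `DataFrame.dtypes`, and the column names are hypothetical.

```python
def validate_schema(actual_dtypes, expected):
    """Compare (column, type) pairs against an expected {column: type} mapping.

    Returns a list of human-readable problems; an empty list means the schema matches.
    """
    actual = dict(actual_dtypes)
    problems = []
    for name, expected_type in expected.items():
        if name not in actual:
            problems.append(f"missing column: {name}")
        elif actual[name] != expected_type:
            problems.append(
                f"type mismatch for {name}: expected {expected_type}, got {actual[name]}"
            )
    for name in actual:
        if name not in expected:
            problems.append(f"unexpected column: {name}")
    return problems


# Hypothetical expected schema, using Spark SQL type names.
expected_schema = {"id": "bigint", "amount": "double", "ts": "timestamp"}

# With PySpark this would be called as validate_schema(df.dtypes, expected_schema).
print(validate_schema([("id", "bigint"), ("amount", "string")], expected_schema))
```

Returning a list of problems instead of raising on the first one lets a pipeline log every discrepancy in a single run.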

Google Cloud Platform (GCP), Hadoop, Azure Data Studio, AutoCAD, Data Mining, Apache Oozie, +32

BNSF Railway

Data Engineer

Nov 2019 – Mar 2021 · 1 yr 4 mos · Dallas-Fort Worth Metroplex · On-site

Followed agile methodology and used Rally to maintain user stories.
Developed Scala APIs that run on top of Spark and the Hadoop ecosystem.
Ran programs through PySpark, the Python API for Apache Spark.
Created a Lambda deployment function and configured it to receive events from an S3 bucket.
Responsible for provisioning key AWS Cloud services and configuring them for scalability, flexibility, and cost optimization.
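
A Lambda function that receives S3 events, as described above, typically reads the bucket and object key from each record in the event payload. The handler below is a minimal sketch under that assumption; what the original function did with each object is not stated in the profile, so this version only collects the references.

```python
from urllib.parse import unquote_plus


def handler(event, context):
    """Collect the bucket and key of every object referenced in an S3 event."""
    objects = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        # Object keys arrive URL-encoded in S3 event notifications.
        key = unquote_plus(record["s3"]["object"]["key"])
        objects.append({"bucket": bucket, "key": key})
    return {"processed": len(objects), "objects": objects}


# Hypothetical event, trimmed to the fields the handler reads.
sample_event = {
    "Records": [
        {"s3": {"bucket": {"name": "example-bucket"},
                "object": {"key": "incoming/my+file%281%29.csv"}}}
    ]
}
print(handler(sample_event, None))
```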

Google Cloud Platform (GCP), PyCharm, Hadoop, Hadoop Administration, Hive, Oracle Database, +24

T-Mobile

Data Engineer

Jul 2018 – Oct 2019 · 1 yr 3 mos · Washington DC-Baltimore Area · On-site

Monitored and maintained data pipelines, troubleshooting and resolving issues to minimize downtime and ensure seamless data flow.
Conducted performance tuning and optimization of StreamSets pipelines to improve efficiency and reduce latency.
Wrote Python utilities and scripts to automate AWS tasks using boto3 (the AWS SDK for Python). Automated backups by transferring data into S3 buckets with boto3.
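
A backup automation of the kind described above might look like the following sketch. It assumes a boto3 S3 client is passed in and relies only on the client's `upload_file(Filename, Bucket, Key)` method; the directory, bucket, and prefix are hypothetical, and the original scripts are not reproduced here.

```python
from pathlib import Path


def backup_directory(s3_client, source_dir, bucket, prefix=""):
    """Upload every file under source_dir to the bucket and return the keys written."""
    source = Path(source_dir)
    uploaded = []
    for path in sorted(source.rglob("*")):
        if path.is_file():
            # Preserve the relative directory layout inside the bucket.
            key = f"{prefix}{path.relative_to(source).as_posix()}"
            s3_client.upload_file(str(path), bucket, key)
            uploaded.append(key)
    return uploaded


# Usage (requires boto3 and AWS credentials; names are hypothetical):
#   import boto3
#   backup_directory(boto3.client("s3"), "/var/backups/app", "example-backup-bucket", "daily/")
```

Passing the client in as a parameter keeps the function easy to exercise with a stub, without touching a real bucket.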

Hadoop, Azure Data Studio, AutoCAD, Apache Oozie, Azure Data Factory, Apache Spark, +18