Vaibhav Rai

Software Engineer

Gurugram, Haryana, India8 yrs 3 mos experience

Key Highlights

  • Expert in automating ETL processes using Meltano.
  • Proficient in cloud technologies like AWS and GCP.
  • Strong background in data analysis and database management.
Stackforce AI infers this person is a Big Data Engineer with expertise in SaaS and cloud technologies.

Contact

Skills

Core Skills

Big Data EngineeringEtlData AnalysisDatabase Management

Other Skills

PythonMeltanoDockerApache AirflowGoogle Cloud StorageDatabricksBigQueryCloud RunCobolPySparkPostgreSQLAWSJavaApache SparkShell Script

About

SKILLS Programming Languages: Python, Bash, SQL, Cobol Databases: MySQL, Oracle, PostgreSQL, Redshift, BigQuery Libraries: PySpark, pandas, NumPy, matplotlib Project Management Tools: Confluence, Git, Jira Cloud: AWS, GCP ETL Tools: Meltano Additional Technologies: Docker Databricks (Delta Live Tables ETL Framework, Databricks Utilities (Widgets, File system, Mounts)) AWS Services: AWS EC2, AWS S3, AWS Lambda GCP Services: Cloud Run, BigQuery, Pub/Sub, Storage, Composer, Dataform EXPERIENCE Big Data Engineer Oct 2022 - Present Infosys, Gurgaon Automated ETL processes using Meltano ELT tool for data extraction from MSSQL and loading into BigQuery, ensuring efficient and accurate data transfer. Developed and containerized the ETL pipeline using Docker, storing images in Artifact Registry and deploying via Cloud Run for serverless execution. Created Python scripts to generate Apache Airflow DAG files, managed via Google Composer, facilitating automated scheduling and orchestration of ETL tasks. Uploaded DAG files to Google Cloud Storage and integrated with Apache Airflow, enhancing workflow automation and task management. Led migration initiative, orchestrating the transition from legacy Mainframe Cobol technologies to cutting-edge ETL solutions within Databricks Medallion Architecture. Developed robust PySpark scripts to automate the execution of the Databricks DLT pipeline, facilitating the seamless transformation of Cobol and Easytrieve files into PySpark and DLT SQL formats. Big Data Engineer Aug 2021 - Oct 2022 NITS Solutions Spearheaded migration projects, transitioning ETL pipelines from AWS S3-Oracle-React to AWS S3-Python with PostgreSQL-React. Analyzed summary tables using PySpark for enhanced insights. Developed an API in Java and Apache Spark technologies to accumulate data from different sources provided by the client. Created ETL processes in Python (Pandas), PostgreSQL, and Shell Script to process records on a scheduled basis through cron job. Worked on Agile Methodology. Software Developer Dec 2018 - Aug 2021 Indian Agriculture Statistics Research Institute (IASRI) Created Python pandas scripts to analyze a huge agricultural dataset. Worked on ETL pipeline from ingesting data into MySQL database and analyzing data with the help of Python pandas and matplotlib library. Worked on Agile Methodology. Used Git for version control.

Experience

8 yrs 3 mos
Total Experience
1 yr 8 mos
Average Tenure
1 yr 6 mos
Current Experience

Ibm

Senior Data Engineer

Nov 2024Present · 1 yr 6 mos · Gurugram, Haryana, India · Hybrid

PythonMeltanoDockerApache AirflowGoogle Cloud StorageDatabricks+2

Infosys

Technical Analyst

Oct 2022Oct 2024 · 2 yrs · India

PythonPostgreSQLPySparkBig Data EngineeringETL

Nits solutions

Assosiate big data engineer

Jul 2021Oct 2022 · 1 yr 3 mos · India

PythonPostgreSQLpandasmatplotlibData AnalysisETL

Indian agricultural research institute

Senior Research Fellow

Dec 2018Jul 2021 · 2 yrs 7 mos · New Delhi, Delhi, India

  • Working as Senior Research Fellow, job role is to design, develop and maintain database on MySql and python django Orm for Portal.
PythonPostgreSQLDatabase Management

Infozech software private limited

Software Developer

Oct 2017Sep 2018 · 11 mos · New Delhi Area, India

  • Worked as Service Delivery engineer, Job role was to run SQL commands to Load, Transform and Extract data in Oracle database. Created procedures, functions and triggers to automate daily transactions Query.
SQLOracleDatabase Management

Education

Ivy Professional School

Certification — Big Data and Analytics

Jan 2020Jan 2021

Bhai Parmanand Institute of Business Studies

Master of Computer Applications — Computer Software Engineering

Jan 2014Jan 2017

Stackforce found 100+ more professionals with Big Data Engineering & Etl

Explore similar profiles based on matching skills and experience