Harshitha A — Data Engineer
About: Passionate Data Engineer with 8+ years of experience in building and managing Data Pipelines, creating Data workflows and leveraging cloud services. Specialized in handling large datasets, migrating data, transforming data and utilized Hadoop related technologies to build a data driven eco-system. Deep understanding of distributed system architecture and the principles of parallel computing. Extensive experience with Kafka for real-time data processing. Hands-on with various Hadoop distributions such as Cloudera, Hortonworks, and AWS EMR. Skilled in AWS Cloud services including EMR, Redshift, S3, Athena, SNS, EC2 and Glue for big data analytics. Experienced in analyzing large datasets using PySpark scripts and Hive queries. Familiar with deployment automation tools like Jenkins and containerization concepts including Docker and Airflow. Extensive SQL query expertise for backend database analysis. Strong knowledge of NoSQL column-oriented databases like HBase, Cassandra, DynamoDB (AWS), and MongoDB, and their integration with Hadoop. Hands-on experience with SQL databases such as SQL Server, Hive, Oracle, MySQL, DB2, and PostgreSQL. Experienced with Sqoop for importing and exporting data between HDFS and RDBMS. Proficient in Azure Cloud services including ADLS, Azure Databricks, Azure Functions, Azure SQL Data Warehouse, Azure Synapse Analytics, and Azure Data Factory. Led data analysis and integration projects involving Hadoop and ETL processes. Transferred large data sets from Teradata RDBMS to HDFS using Sqoop. Experienced with visualization tools such as Tableau, Looker, and Power BI. Strong understanding of version control tools like Git and GitHub. Involved in various testing methodologies including unit, integration, and acceptance testing to ensure data quality and functionality. Skills: Data Modeling, Data Engineering, Big Data Analytics, Object Oriented Programming (OOPS), Data Warehousing. Programming Skills: Python, SQL, Scala, Hadoop, PySpark. Cloud Services: AWS S3, EMR, EC2, AWS Glue, Lambda services, AWS Redshift, Azure Data Factory, Azure Databricks, ADLS, Synapse, Snowflake.
Stackforce AI infers this person is a Big Data Engineer with expertise in cloud-based data solutions.
Experience: 5 yrs 11 mos
Skills
- Data Engineering
- Big Data Analytics
- Cloud Services
Career Highlights
- 8+ years of experience in Data Engineering.
- Expert in building and managing data pipelines.
- Proficient in both AWS and Azure cloud services.
Work Experience
Availity
Data Engineer (1 yr 4 mos)
Cloudflare
Sr Data Engineer (1 yr 6 mos)
Verizon
Big Data Engineer (1 yr 1 mo)
City of Hope
Data Engineer (1 yr 2 mos)
DataFactZ
Hadoop/Spark Developer (1 yr 10 mos)
Extended Web AppTech
Java Developer (1 yr 5 mos)