PIYUSH SHRIVASTAV

CEO

Gurgaon, Haryana, India0 mo experience

Key Highlights

Proficient in designing large-scale data processing pipelines.
Hands-on experience with Hadoop, Spark, and Kafka.
Specializes in optimizing data flows for real-time analytics.

Stackforce AI infers this person is a Big Data Engineer with expertise in distributed systems and real-time analytics.

Contact

Skills

Other Skills

Amazon Elastic MapReduce (EMR)Amazon Web Services (AWS)Apache AirflowCPPCascading Style Sheets (CSS)Data WarehousingDatabasesHTMLJavaScriptMapReduceMongoDBOperating SystemsSnowflake

About

Focus on Big Data engineering, proficient in designing and implementing large-scale distributed data processing pipelines. With hands-on experience in Apache Hadoop, Spark, and Kafka, I specialize in optimizing data flows and enabling real-time analytics across distributed environments. I have a keen interest in scaling infrastructure to handle petabyte-scale datasets and applying stream processing and batch processing techniques to ensure high availability and fault tolerance. Technical Proficiencies: • Big Data Ecosystem: Apache Hadoop (HDFS, MapReduce), Spark (RDDs, DataFrames, Spark Streaming), Apache Kafka • Data Storage: HDFS, HBase, Cassandra, S3 • Programming & Query Languages: Java, Python, SQL, HiveQL, Pig Latin • Data Ingestion & Processing: Apache Flume, Sqoop, Kafka Streams • Cloud & Distributed Systems: AWS EMR, Google Cloud Dataflow, Kubernetes I am actively involved in coursework that includes data modeling, ETL processes, and database management. I am also working on practical projects and collaborating with peers to apply theoretical knowledge to real-world scenarios.