PIYUSH SHRIVASTAV — CEO
Focus on Big Data engineering, proficient in designing and implementing large-scale distributed data processing pipelines. With hands-on experience in Apache Hadoop, Spark, and Kafka, I specialize in optimizing data flows and enabling real-time analytics across distributed environments. I have a keen interest in scaling infrastructure to handle petabyte-scale datasets and applying stream processing and batch processing techniques to ensure high availability and fault tolerance. Technical Proficiencies: • Big Data Ecosystem: Apache Hadoop (HDFS, MapReduce), Spark (RDDs, DataFrames, Spark Streaming), Apache Kafka • Data Storage: HDFS, HBase, Cassandra, S3 • Programming & Query Languages: Java, Python, SQL, HiveQL, Pig Latin • Data Ingestion & Processing: Apache Flume, Sqoop, Kafka Streams • Cloud & Distributed Systems: AWS EMR, Google Cloud Dataflow, Kubernetes I am actively involved in coursework that includes data modeling, ETL processes, and database management. I am also working on practical projects and collaborating with peers to apply theoretical knowledge to real-world scenarios.
Stackforce AI infers this person is a Big Data Engineer with expertise in distributed systems and real-time analytics.
Location: Gurgaon, Haryana, India
Experience: 0 mo
Career Highlights
- Proficient in designing large-scale data processing pipelines.
- Hands-on experience with Hadoop, Spark, and Kafka.
- Specializes in optimizing data flows for real-time analytics.
Education
Bachelor of Technology - BTech at UIET - Kurukshetra University
12th CBSE Board at MM Public Sr. Sec School
10th CBSE Board at MM Public Sr. Sec. School