Raghavendra Desai

Product Manager

Bengaluru, Karnataka, India12 yrs 10 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • 7+ years of experience in Big Data technologies.
  • Expert in developing data ingestion pipelines.
  • Published multiple research papers in IoT domain.
Stackforce AI infers this person is a Big Data Engineer with expertise in data ingestion and analytics for manufacturing and supply chain.

Contact

Skills

Core Skills

Big DataAwsEtlSparkData IngestionData Processing

Other Skills

JavaIcebergApache AirflowHadoopHBaseHiveKafkaPySparkSpark SQLPower BIElasticsearchREST APIC#Web DevelopmentExtract, Transform, Load (ETL)

About

Introduction: I am a passionate Senior big data/Hadoop/python developer with having 7+ years of total IT Experience, including 5 years of extensive experience in Big-data/Hadoop/Spark Technologies. I am equally excited about problem solving, being creative and developing strategic business solutions. I was associated with manufacturing and supply chain optimisation programs within Intel IT. I am currently working with marketing cloud in Salesforce Current Work: I am Currently working on developing table maintenance jobs for lake-house project which involves manipulating data of iceberg tables in AWS through running spark jobs run with EMR instance Previous work: I am Big Data Developer working with Enterprise Data Analytics team in Intel. It delivers analytical solutions for manufacturing and supply chain orgs/data leveraging Hadoop and related technologies along with ML And my primary responsibility is to implement continuous ingestion pipelines to load data (JSON, CSV) from multiple sources such as API, Streaming (Active MQ, Kafka), Shared Drive, RDBMS into HDFS, HBase, Hive, Elastic, Azure. Previous project: As part of manufacturing org, each and every wafer is tested in numerous aspects such as performance, failure, validation, quality and reliability, power/voltage consumption and so on and as a result lot of data is generated in various formats (JSON, CSV, Excels, HTTP Response) from multiple sources including SMB Share, SQL Server, API, Streaming Message Queues. Project vision is to write reusable framework/pipelines to ingest these data to specific sink such as HBase, HDFS, Hadoop, HIVE, Impala, Elastic and so on. This Data is used to perform analytics and develop dashboards/reports in Cube/Power BI Technical Skills & Knowledge Set Languages: C, Core Java, C#, Python, SQL, HQL, Shell Scripting, Linux, VI, Object Oriented Programming Databases: MYSQL, SQL SERVER, Tera Data, HANA Basics Big Data: Hadoop, HDFS, Hive, Impala, HBase, Elastic Search, Logstash, Kibana, Flume,Sqoop, Azure, Spark Core, Spark SQL, PySpark, Spark Streaming and Apache Kafka, Basic AWS ML Libraries: NumPy, Pandas, Scikit-Learn, NLTK, Google Maps APIs and Tools: REST API, Google, TFS, Power BI, Gitlab, AutoSys, Team City, Moba X term, Putty, Orange Methodologies: Agile, SAFE, CI/CD I have 3 Scopus indexed journal publications in (Internet of things) domain and 1 IEEE conference publications. I have data science knowledge and worked on couple of POC's in text analytics and ML

Experience

12 yrs 10 mos
Total Experience
3 yrs 2 mos
Average Tenure
5 yrs 1 mo
Current Experience

Salesforce

Senior Member Of Technical Staff

May 2021Present · 5 yrs 1 mo · Bangalore Urban, Karnataka, India

  • I am Currently working on developing table maintenance jobs for lake-house project which involves manipulating data of iceberg tables in AWS through running spark jobs run with EMR instance
  • Technology : Java, Iceberg, AWS, Spark
JavaIcebergAWSSparkApache AirflowBig Data

Intel corporation

Senior Big Data Developer

May 2016May 2021 · 5 yrs · Bangalore

  • I am a passionate Big data/Hadoop/Python developer with excellent understanding/knowledge
  • of Hadoop architecture and various big data technologies such as Spark, HBase, Hive/Impala, Elastic, Azure,Kafka. I am part of Enterprise Data analytics (Supply Chain and Manufacturing) projects at Intel. My responsibility includes implementing continuous data ingestion pipelines
  • Experience
  • NSG Manufacturing
  • 1. Implemented complex calculations as PySpark Transformations using window functions in Spark SQL to add 100+ calculated/derived columns to MCD data
  • 2.Experience in tuning spark config parameters to improve spark performance
  • 3. Created hive external tables with partitioning to store the processed data from spark transformations
  • 4. Used Apache Kafka as a messaging system to load log data into HDFS and HBase system
  • NSG Customer Dashboard
  • 1. Experience in design and developing data ingestion pipeline to perform ETL of JIRA API (Rest) Data to Hadoop and expose HDFS data as hive/impala views for Cube and Power BI
  • 2. Created tasks for incremental load into staging tables, and schedule them to run.
  • 3. Designed appropriate partitioning/bucketing schema to allow faster data retrieval during data analysis
  • 4. Involved in creating Hive tables, enabling dynamic partitioning to capture daily snapshots
  • 5. Handled delta processing or incremental updates using hive and processed the data in hive table
  • NSG Conval
  • 1. Design and implement data ingestion pipeline to index streaming conval drive info jsons (wafer test failure logs) from AXON (Rest API) queues (Active MQ) to Hadoop and elasticsearch
  • 2. Experience in developing Azure data pipeline to load csv from samba share to ADLS
  • 3. Created script to backup hive partitions and easy recover in case of accidental delete
  • QNR
  • 1. loaded 2TB of QNR (Quality and Reliability) JSON files from wafer test drives into Hadoop - batch processing
  • 2. Used Hive JSON Serde library to flatten raw json data as hive tables - lateral view explodes
HadoopSparkHBaseHiveKafkaETL+1

Intel technologies

Graduate Intern Technical, C# Developer

Jul 2015May 2016 · 10 mos · Bengaluru Area, India

  • 1. I have designed automation tools by developing the web applications leveraging C# and related cutting edge web technologies.
  • 2. I have designed 5 process tracking tools mainly for helping employee attrition process, safeguard and training management process and Intellectual Property tracking processes.
  • 3. I hold responsibility in designing both the front end development as well as the server side development.
  • 4. I am successful in deriving reports, knowledge discovery out of these tools as in compare of data with the help of chart plugins
C#Web Development

Centurylink

Software Engineer

Aug 2012Jul 2014 · 1 yr 11 mos · Bengaluru Area, India

  • DDT - core java developer
  • 1. worked on search engine project to discover internal document data from multiple locations in network
  • CONSTRUCTION MARKET INTELLIGENCE SYSTEMS (CMIS) - Hadoop Developer
  • 1. I have worked as a Hadoop developer in ingestion of Construction Data in Excel files into Hadoop
  • 2. Involved in Parsing Excel/CSV data to integrate/join and transform data using Spark Data Frames
  • 3. Optimised data ingestion using spark to integrate, process millions of records from many sources
  • 4. Implemented search capability – Real time data pull and flatten results to fit UI needs
  • 5. loading data and writing complex hive joins/queries
  • 6. Exposed HDFS data as dynamically partitioned hive tables on version number (maintained in a text file)
JavaHadoopSparkBig Data

Education

VIT University

Master’s Degree — Information Technology (Computer Networking)

Jan 2014Jan 2016

SJCE Mysore

Bachelor’s Degree — Computer Science

Jan 2008Jan 2012

Stackforce found 100+ more professionals with Big Data & Aws

Explore similar profiles based on matching skills and experience