Raghavendra Desai — Product Manager

Introduction: I am a passionate Senior big data/Hadoop/python developer with having 7+ years of total IT Experience, including 5 years of extensive experience in Big-data/Hadoop/Spark Technologies. I am equally excited about problem solving, being creative and developing strategic business solutions. I was associated with manufacturing and supply chain optimisation programs within Intel IT. I am currently working with marketing cloud in Salesforce Current Work: I am Currently working on developing table maintenance jobs for lake-house project which involves manipulating data of iceberg tables in AWS through running spark jobs run with EMR instance Previous work: I am Big Data Developer working with Enterprise Data Analytics team in Intel. It delivers analytical solutions for manufacturing and supply chain orgs/data leveraging Hadoop and related technologies along with ML And my primary responsibility is to implement continuous ingestion pipelines to load data (JSON, CSV) from multiple sources such as API, Streaming (Active MQ, Kafka), Shared Drive, RDBMS into HDFS, HBase, Hive, Elastic, Azure. Previous project: As part of manufacturing org, each and every wafer is tested in numerous aspects such as performance, failure, validation, quality and reliability, power/voltage consumption and so on and as a result lot of data is generated in various formats (JSON, CSV, Excels, HTTP Response) from multiple sources including SMB Share, SQL Server, API, Streaming Message Queues. Project vision is to write reusable framework/pipelines to ingest these data to specific sink such as HBase, HDFS, Hadoop, HIVE, Impala, Elastic and so on. This Data is used to perform analytics and develop dashboards/reports in Cube/Power BI Technical Skills & Knowledge Set Languages: C, Core Java, C#, Python, SQL, HQL, Shell Scripting, Linux, VI, Object Oriented Programming Databases: MYSQL, SQL SERVER, Tera Data, HANA Basics Big Data: Hadoop, HDFS, Hive, Impala, HBase, Elastic Search, Logstash, Kibana, Flume,Sqoop, Azure, Spark Core, Spark SQL, PySpark, Spark Streaming and Apache Kafka, Basic AWS ML Libraries: NumPy, Pandas, Scikit-Learn, NLTK, Google Maps APIs and Tools: REST API, Google, TFS, Power BI, Gitlab, AutoSys, Team City, Moba X term, Putty, Orange Methodologies: Agile, SAFE, CI/CD I have 3 Scopus indexed journal publications in (Internet of things) domain and 1 IEEE conference publications. I have data science knowledge and worked on couple of POC's in text analytics and ML

Stackforce AI infers this person is a Big Data Engineer with expertise in data ingestion and analytics for manufacturing and supply chain.

Location: Bengaluru, Karnataka, India

Experience: 12 yrs 10 mos

Skills

Big Data
Aws
Etl
Spark
Data Ingestion
Data Processing

Career Highlights

7+ years of experience in Big Data technologies.
Expert in developing data ingestion pipelines.
Published multiple research papers in IoT domain.

Work Experience

Salesforce

Senior Member Of Technical Staff (5 yrs 1 mo)

Intel Corporation

Senior Big Data Developer (5 yrs)

Intel Technologies

Graduate Intern Technical, C# Developer (10 mos)

CenturyLink

Software Engineer (1 yr 11 mos)

Education

Master’s Degree at VIT University

Bachelor’s Degree at SJCE Mysore

Raghavendra Desai

Product Manager

Bengaluru, Karnataka, India12 yrs 10 mos experience

Most Likely To SwitchHighly Stable

Key Highlights

7+ years of experience in Big Data technologies.
Expert in developing data ingestion pipelines.
Published multiple research papers in IoT domain.

Stackforce AI infers this person is a Big Data Engineer with expertise in data ingestion and analytics for manufacturing and supply chain.

Contact

Skills

Core Skills

Big DataAwsEtlSparkData IngestionData Processing

Other Skills

JavaIcebergApache AirflowHadoopHBaseHiveKafkaPySparkSpark SQLPower BIElasticsearchREST APIC#Web DevelopmentExtract, Transform, Load (ETL)

About

Experience

12 yrs 10 mos

Total Experience

3 yrs 2 mos

Average Tenure

5 yrs 1 mo

Current Experience

Salesforce

Senior Member Of Technical Staff

May 2021 – Present · 5 yrs 1 mo · Bangalore Urban, Karnataka, India

I am Currently working on developing table maintenance jobs for lake-house project which involves manipulating data of iceberg tables in AWS through running spark jobs run with EMR instance
Technology : Java, Iceberg, AWS, Spark

JavaIcebergAWSSparkApache AirflowBig Data

Intel corporation

Senior Big Data Developer

May 2016 – May 2021 · 5 yrs · Bangalore

I am a passionate Big data/Hadoop/Python developer with excellent understanding/knowledge
of Hadoop architecture and various big data technologies such as Spark, HBase, Hive/Impala, Elastic, Azure,Kafka. I am part of Enterprise Data analytics (Supply Chain and Manufacturing) projects at Intel. My responsibility includes implementing continuous data ingestion pipelines
Experience
NSG Manufacturing
1. Implemented complex calculations as PySpark Transformations using window functions in Spark SQL to add 100+ calculated/derived columns to MCD data
2.Experience in tuning spark config parameters to improve spark performance
3. Created hive external tables with partitioning to store the processed data from spark transformations
4. Used Apache Kafka as a messaging system to load log data into HDFS and HBase system
NSG Customer Dashboard
1. Experience in design and developing data ingestion pipeline to perform ETL of JIRA API (Rest) Data to Hadoop and expose HDFS data as hive/impala views for Cube and Power BI
2. Created tasks for incremental load into staging tables, and schedule them to run.
3. Designed appropriate partitioning/bucketing schema to allow faster data retrieval during data analysis
4. Involved in creating Hive tables, enabling dynamic partitioning to capture daily snapshots
5. Handled delta processing or incremental updates using hive and processed the data in hive table
NSG Conval
1. Design and implement data ingestion pipeline to index streaming conval drive info jsons (wafer test failure logs) from AXON (Rest API) queues (Active MQ) to Hadoop and elasticsearch
2. Experience in developing Azure data pipeline to load csv from samba share to ADLS
3. Created script to backup hive partitions and easy recover in case of accidental delete
QNR
1. loaded 2TB of QNR (Quality and Reliability) JSON files from wafer test drives into Hadoop - batch processing
2. Used Hive JSON Serde library to flatten raw json data as hive tables - lateral view explodes

HadoopSparkHBaseHiveKafkaETL+1

Intel technologies

Graduate Intern Technical, C# Developer

Jul 2015 – May 2016 · 10 mos · Bengaluru Area, India

1. I have designed automation tools by developing the web applications leveraging C# and related cutting edge web technologies.
2. I have designed 5 process tracking tools mainly for helping employee attrition process, safeguard and training management process and Intellectual Property tracking processes.
3. I hold responsibility in designing both the front end development as well as the server side development.
4. I am successful in deriving reports, knowledge discovery out of these tools as in compare of data with the help of chart plugins

C#Web Development

Centurylink

Software Engineer

Aug 2012 – Jul 2014 · 1 yr 11 mos · Bengaluru Area, India

DDT - core java developer
1. worked on search engine project to discover internal document data from multiple locations in network
CONSTRUCTION MARKET INTELLIGENCE SYSTEMS (CMIS) - Hadoop Developer
1. I have worked as a Hadoop developer in ingestion of Construction Data in Excel files into Hadoop
2. Involved in Parsing Excel/CSV data to integrate/join and transform data using Spark Data Frames
3. Optimised data ingestion using spark to integrate, process millions of records from many sources
4. Implemented search capability – Real time data pull and flatten results to fit UI needs
5. loading data and writing complex hive joins/queries
6. Exposed HDFS data as dynamically partitioned hive tables on version number (maintained in a text file)

JavaHadoopSparkBig Data