Raja Sabarish PV

CEO

Bengaluru, Karnataka, India · 11 yrs 9 mos experience

Most Likely To Switch · Highly Stable

Key Highlights

10+ years in Data Engineering and Cloud Platforms
Expert in Generative AI and Agentic AI solutions
Proven leadership in Agile team environments

Stackforce AI infers this person is a Data Engineering and Cloud Architecture expert with a focus on Generative AI solutions.

Skills

Core Skills

Data Engineering · Cloud Architecture

Other Skills

AJAX · API Gateway · ActiveMQ · Actor · Agile Methodologies · Airflow · Airpal · Akka · AkkaHttp · Amazon EC2 · Amazon Lambda · Amazon RDS · Amazon S3 · AngularJS · Apache Airflow

About

With 10+ years of experience in Data Engineering and Cloud Platforms, I specialize in building scalable data ecosystems and driving the enterprise adoption of Generative AI and Agentic AI. Currently, I serve as a Lead Consultant at ITC Infotech, where I lead the Lighthouse Project, transforming traditional data engineering with GenAI-driven insights, NLP-powered reporting, and autonomous AI agents for workflow automation and operational efficiency.

My expertise spans Azure Data Factory, Databricks, Delta Lake, Unity Catalog, Lakehouse architectures, Delta Sharing, and AWS cloud services, enabling end-to-end data engineering solutions that are secure, efficient, and business-focused. I have also driven impactful initiatives with Databricks Genie, LangChain, and Azure OpenAI, creating enterprise-ready GenAI solutions that reduce reporting turnaround time and accelerate decision-making.

Beyond technology, I thrive in collaborative, Agile environments, managing teams, engaging with stakeholders, and ensuring business alignment with technical delivery. I am passionate about emerging technologies, Generative AI, and Responsible AI practices, and continuously explore how they can reshape the future of data and analytics.

Focus Areas:
Data Engineering & Cloud Architecture
Generative AI & Agentic AI Adoption
Prompt Engineering & NLP-based analytics
Data Governance, Security, & AI Automation
Driving cost optimization & business value from data

Experience

11 yrs 9 mos

Total Experience

1 yr 3 mos

Average Tenure

4 yrs 6 mos

Current Experience

ITC Infotech

2 roles

Technical Architect

Promoted

Nov 2024 – Present · 1 yr 7 mos · Bengaluru, Karnataka, India · Hybrid

Spearheaded the migration to Unity Catalog in Databricks, enhancing data governance and accessibility.
Led a team of 7-9 data engineers, ensuring seamless collaboration with data scientists for standardized data delivery.
Acted as the primary point of contact for clients, fostering strong relationships and understanding their data needs.

Data Architects · Engineering Data Management · Domain Architecture · Python (Programming Language) · Data Loading · Azure Data Factory · +5

Lead Consultant

Nov 2021 – Oct 2024 · 2 yrs 11 mos · Bengaluru, Karnataka, India · Hybrid

Led the optimization of consumer goods data through diverse Azure Data Factory (ADF) pipelines.
Managed data preprocessing stages, including data cleansing and schema validation.
Collaborated with Data Scientists to deliver curated data and enhance model runs.
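
The cleansing and schema validation stages mentioned above could look roughly like the following. This is a minimal, standard-library sketch and not the project's actual ADF or Databricks code; the column names (`sku`, `units_sold`, `price`) and the helper names are hypothetical, chosen only for illustration.

```python
# Hypothetical schema: column name -> expected Python type.
EXPECTED_SCHEMA = {"sku": str, "units_sold": int, "price": float}


def cleanse(record):
    """Trim surrounding whitespace from string values; leave other values as-is."""
    return {k: v.strip() if isinstance(v, str) else v for k, v in record.items()}


def validate(record, schema):
    """Return a list of schema problems; an empty list means the record is valid."""
    problems = []
    for column, expected in schema.items():
        if column not in record:
            problems.append(f"missing column: {column}")
        elif not isinstance(record[column], expected):
            problems.append(f"{column}: expected {expected.__name__}")
    return problems


def preprocess(records, schema=EXPECTED_SCHEMA):
    """Cleanse every record, then split the results into (valid, rejected)."""
    valid, rejected = [], []
    for raw in records:
        record = cleanse(raw)
        problems = validate(record, schema)
        if problems:
            rejected.append((record, problems))
        else:
            valid.append(record)
    return valid, rejected
```

Keeping rejected records together with the reasons they failed, rather than silently dropping them, makes it easier to report data quality issues back to the source teams.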

Azure Databricks · Azure Data Lake · Apache Spark · Extract, Transform, Load (ETL) · Azure Data Factory · Python (Programming Language) · +5

NTT Data Services

Information Technology Management Consultant

Jun 2020 – Oct 2021 · 1 yr 4 mos · Bengaluru, Karnataka, India

Client: Sony India Software Centre
Managed large-scale data processing tasks using Azure Blob Storage for ingestion.
Integrated Spark transformation logic within Azure Synapse.
Implemented a separate serverless Azure Function for each application module to manage incoming datasets. Each function is triggered when new data arrives and processes it before passing it on to Spark jobs.
Converted the data processed through Apache Spark into insights and stored them in the corresponding Blob Storage locations.
Provided access to this data through secured APIs built using Scala, Akka, and the Play framework.
Logged and reported all activities to internal analytics dashboards for monitoring and analysis.

AkkaHttp · Project Management · Data Warehousing · Data Loading · PySpark · MySQL · +6

Troondx

Principal Technical Architect

Apr 2019 – Jun 2020 · 1 yr 2 mos · Chennai Area, India

The primary goal of this project is to identify duplicate records based on specific unique columns within the dataset, supporting the provision of loans to farmers. The project focuses on establishing an end-to-end data pipeline, starting with data acquisition from various source systems such as MySQL and Postgres, among others. Once the data is transferred to the distributed file system (HDFS), it is stored as ORC files, and external tables are created to access it.
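
As an illustration of identifying duplicates on specific key columns, here is a small standard-library sketch. It is not the project's pipeline code (which would operate on ORC data in HDFS); the column name `national_id`, the normalization rule, and the function names are assumptions made for the example.

```python
from collections import defaultdict


def normalize(value):
    """Make key values comparable: trim and lowercase strings, keep others as-is."""
    return value.strip().lower() if isinstance(value, str) else value


def find_duplicates(records, key_columns):
    """Group records by the normalized values of key_columns.

    Returns only the groups that contain more than one record.
    """
    groups = defaultdict(list)
    for record in records:
        key = tuple(normalize(record.get(col)) for col in key_columns)
        groups[key].append(record)
    return {key: rows for key, rows in groups.items() if len(rows) > 1}
```

For example, calling `find_duplicates(applicants, ["national_id"])` on a list of applicant dictionaries would return every set of applicants that share the same identifier after trimming and lowercasing.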

Python (Programming Language) · Project Management · Data Engineering · Azure Data Factory · Azure Databricks · PySpark · +1

Emids

Senior Software Engineer

Sep 2018 – Mar 2019 · 6 mos · Bengaluru Area, India

Medidata is a technology firm specializing in the development and promotion of software as a service tailored for clinical trials. Our platform facilitates the execution of comprehensive clinical trials, managing every step of the process. The application is built using Scala and Akka for REST services, ensuring robust functionality and performance.

Software Development · Project Management

Sony India Software Centre

Senior Software Engineer

Jan 2018 – Aug 2018 · 7 mos · Bengaluru Area, India

GWN is an e-commerce platform dedicated to Sony products. The application is served in multiple languages across continents and has transitioned from multiple domains to a single domain with corresponding locales. Its primary functions are selling products and keeping customers updated on a range of Sony offerings such as PS3, Sony Pictures, Sony Mobiles, Sony Televisions, and more.

Extract, Transform, Load (ETL) · Engineering Data Management · Data Architects · Python (Programming Language) · Data Loading · Data Engineering

TVS Next

Senior Software Engineer

Jul 2017 – Dec 2017 · 5 mos · Chennai Area, India

Developed the Vpower Energy web application to display the daily power usage of various energy clients through a dashboard. Constructed a Data Lake to ingest data from various APIs of vpowertools, utilizing a generic Python model to transform the data into different formats such as DataFrame and Parquet (columnar data format). Implemented a Scala REST API to communicate with various microservices, enabling the Front End to consume these services and display UI graphs on the dashboard.

AkkaHttp · Akka · Python (Programming Language) · PySpark · Scala · Data Engineering

Monsanto Company

Data Specialist

Apr 2016 – Apr 2017 · 1 yr · Bengaluru, Karnataka, India

Created various types of DAGs involving multiple sub-tasks and ran them on a recurring schedule as worker tasks within Airflow. All DAG functions are implemented in Python. Managed large datasets stored in Hive tables in Parquet format to retrieve and process model data. Data streaming is handled with Apache Spark, with data streamed through Kafka brokers.
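
To show what a DAG of dependent sub-tasks means in practice, the sketch below runs tasks in dependency order using Python's standard-library `graphlib` (Python 3.9+). It deliberately does not use the Airflow API; the task names and the `run_dag` helper are hypothetical stand-ins for what an Airflow scheduler and its workers would do.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def run_dag(tasks, dependencies):
    """Run each task once, after all of its upstream tasks have finished.

    tasks: task name -> zero-argument callable
    dependencies: task name -> set of upstream task names
    Returns the order in which the tasks ran.
    """
    # Map every task to its upstream tasks (an empty set if it has none).
    graph = {name: dependencies.get(name, set()) for name in tasks}
    # static_order() yields upstream tasks before the tasks that depend on them.
    order = list(TopologicalSorter(graph).static_order())
    for name in order:
        tasks[name]()
    return order
```

A real scheduler would additionally run independent tasks in parallel, retry failures, and trigger the whole graph on a schedule; this sketch only captures the ordering guarantee.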

Engineering Data Management · Data Architects · Python (Programming Language) · Apache Airflow · Data Engineering

UST Global

Software Engineer

Apr 2015 – Apr 2016 · 1 yr · Bengaluru Area, India

1-Page (Hiring) employs a Microservice Architecture, facilitating communication between each component through HAProxy. This architecture is implemented within a job portal application. Recruiters can utilize the platform to search for specific skill sets by posting challenges to candidates. Candidates are then invited to participate in these challenges, proposing their ideas in response.

Software Development

Kenla Systems Pvt Ltd

Software Engineer

Aug 2013 – Nov 2014 · 1 yr 3 mos · Chennai Area, India

XcellTRACKER is a user-friendly tracking application offering a comprehensive range of features. Apart from managing warranties, insurance, and key documents, it includes an Asset Master List function, enabling users to track various types of assets, including loans and leases. Assets can be easily assigned or reassigned to users along with corresponding policies and warranties, accommodating changes such as policy updates, carrier switches, or warranty extensions.
The Asset List feature aids in tracking purchases, manufacturers, warranties, and returns/exchanges. Users can upload attachments such as photos, videos, audio, and documents related to their assets, insurance policies, and warranties; these are stored in Amazon S3 and can be retrieved on request. Additionally, all details and contents are accessible through the Android and iPhone apps.

Amazon EC2 · AJAX · JavaScript · Amazon S3 · MySQL · Scala