Raja Sabarish PV

CEO

Bengaluru, Karnataka, India · 11 yrs 9 mos experience
Most Likely To Switch · Highly Stable

Key Highlights

  • 10+ years in Data Engineering and Cloud Platforms
  • Expert in Generative AI and Agentic AI solutions
  • Proven leadership in Agile team environments
Stackforce AI infers this person is a Data Engineering and Cloud Architecture expert with a focus on Generative AI solutions.

Skills

Core Skills

Data Engineering · Cloud Architecture

Other Skills

AJAX · API Gateway · ActiveMQ · Actor · Agile Methodologies · Airflow · Airpal · Akka · AkkaHttp · Amazon EC2 · Amazon Lambda · Amazon RDS · Amazon S3 · AngularJS · Apache Airflow

About

With 10+ years of experience in Data Engineering and Cloud Platforms, I specialize in building scalable data ecosystems and driving the enterprise adoption of Generative AI and Agentic AI. Currently, I serve as a Lead Consultant at ITC Infotech, where I lead the Lighthouse Project, transforming traditional data engineering with GenAI-driven insights, NLP-powered reporting, and autonomous AI agents for workflow automation and operational efficiency.

My expertise spans Azure Data Factory, Databricks, Delta Lake, Unity Catalog, Lakehouse architectures, Delta Sharing, and AWS cloud services, enabling end-to-end data engineering solutions that are secure, efficient, and business-focused. I have also driven impactful initiatives with Databricks Genie, LangChain, and Azure OpenAI, creating enterprise-ready GenAI solutions that reduce reporting turnaround time and accelerate decision-making.

Beyond technology, I thrive in collaborative, Agile environments, managing teams, engaging with stakeholders, and ensuring business alignment with technical delivery. I am passionate about emerging technologies, Generative AI, and Responsible AI practices, and continuously explore how they can reshape the future of data and analytics.

Focus Areas:
  • Data Engineering & Cloud Architecture
  • Generative AI & Agentic AI Adoption
  • Prompt Engineering & NLP-based analytics
  • Data Governance, Security, & AI Automation
  • Driving cost optimization & business value from data

Experience

Total Experience: 11 yrs 9 mos
Average Tenure: 1 yr 3 mos
Current Experience: 4 yrs 6 mos

ITC Infotech

2 roles

Technical Architect

Promoted

Nov 2024 - Present · 1 yr 7 mos · Bengaluru, Karnataka, India · Hybrid

  • Spearheaded the migration to Unity Catalog in Databricks, enhancing data governance and accessibility.
  • Led a team of 7-9 data engineers, ensuring seamless collaboration with data scientists for standardized data delivery.
  • Acted as the primary point of contact for clients, fostering strong relationships and understanding their data needs.
Data Architects · Engineering Data Management · Domain Architecture · Python (Programming Language) · Data Loading · Azure Data Factory · +5

Lead Consultant

Nov 2021 - Oct 2024 · 2 yrs 11 mos · Bengaluru, Karnataka, India · Hybrid

  • Led the optimization of consumer goods data through diverse ADF pipelines, managing data preprocessing stages, and collaborating with Data Scientists to enhance model runs.
  • Optimized consumer goods data through diverse ADF pipelines.
  • Managed data preprocessing stages, including data cleansing and schema validation.
  • Collaborated with Data Scientists to deliver curated data for model runs.
Azure Databricks · Azure Data Lake · Apache Spark · Extract, Transform, Load (ETL) · Azure Data Factory · Python (Programming Language) · +5

NTT DATA Services

Information Technology Management Consultant

Jun 2020 - Oct 2021 · 1 yr 4 mos · Bengaluru, Karnataka, India

  • Client: Sony India Software Centre
  • Managed large-scale data processing tasks using Azure Blob Storage for ingestion.
  • Integrated Spark transformation logic within Azure Synapse.
  • Implemented a separate serverless Azure Function for each application module to manage incoming datasets. Each function is triggered when new data arrives and processes it before passing it on to Spark jobs.
  • Converted the data processed through Apache Spark into valuable insights and stored them in the corresponding Blob Storage locations.
  • Facilitated access to this data through secured APIs built using Scala, Akka, and the Play framework.
  • Logged and reported all the activities to internal analytics dashboards for monitoring and analysis purposes.
AkkaHttp · Project Management · Data Warehousing · Data Loading · PySpark · MySQL · +6

Troondx

Principal Technical Architect

Apr 2019 - Jun 2020 · 1 yr 2 mos · Chennai Area, India

  • The primary goal of this project is to identify duplicate records based on specific unique columns within the dataset, enabling the provision of loans to farmers. The project's focus lies in establishing an end-to-end data pipelining process, starting with data acquisition from various source systems such as MySQL, Postgres, and others. Once the data is transferred to the distributed system (HDFS), it is stored as ORC files, and external tables are created to access this data.
Python (Programming Language) · Project Management · Data Engineering · Azure Data Factory · Azure Databricks · PySpark · +1
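
The duplicate-identification step described above can be illustrated with a minimal sketch. This is plain Python over in-memory records, not the project's actual HDFS/ORC pipeline, and the column names (`national_id`, `village`), the sample rows, and the helper `find_duplicates` are all hypothetical, chosen only for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Sequence, Tuple

Record = Dict[str, object]


def find_duplicates(
    records: Iterable[Record], key_columns: Sequence[str]
) -> Dict[Tuple[object, ...], List[Record]]:
    """Group records by the values of key_columns and return only
    the groups that contain more than one record."""
    groups: Dict[Tuple[object, ...], List[Record]] = defaultdict(list)
    for record in records:
        # Build the composite key from the chosen "unique" columns.
        key = tuple(record.get(column) for column in key_columns)
        groups[key].append(record)
    return {key: rows for key, rows in groups.items() if len(rows) > 1}


# Hypothetical sample data: two applications share the same key columns.
applicants = [
    {"id": 1, "national_id": "A100", "village": "Kovil", "name": "R. Kumar"},
    {"id": 2, "national_id": "B200", "village": "Kovil", "name": "S. Devi"},
    {"id": 3, "national_id": "A100", "village": "Kovil", "name": "Ravi Kumar"},
]

duplicates = find_duplicates(applicants, ["national_id", "village"])
for key, rows in duplicates.items():
    print(key, [row["id"] for row in rows])
```

At the scale the role describes, the same grouping would more plausibly run as a distributed job over the ORC-backed tables; the sketch only shows the matching logic.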

Emids

Senior Software Engineer

Sep 2018 - Mar 2019 · 6 mos · Bengaluru Area, India

  • Medidata is a technology firm specializing in the development and promotion of software as a service tailored for clinical trials. Our platform facilitates the execution of comprehensive clinical trials, managing every step of the process. The application is built with Scala and Akka for REST services, ensuring robust functionality and performance.
Software Development · Project Management

Sony India Software Centre

Senior Software Engineer

Jan 2018 - Aug 2018 · 7 mos · Bengaluru Area, India

  • GWN serves as an e-commerce platform dedicated to Sony products. The application is available in various continental languages and has transitioned from multiple domains to a single domain with corresponding locales. Its primary functions include selling products and providing updates to customers on a range of Sony offerings such as PS3, Sony Pictures, Sony Mobiles, Sony Televisions, and more.
Extract, Transform, Load (ETL) · Engineering Data Management · Data Architects · Python (Programming Language) · Data Loading · Data Engineering

TVS Next

Senior Software Engineer

Jul 2017 - Dec 2017 · 5 mos · Chennai Area, India

  • Developed the Vpower Energy web application to display the daily power usage of various energy clients through a dashboard. Constructed a Data Lake to ingest data from various APIs of vpowertools, utilizing a generic Python model to transform the data into different formats such as DataFrame and Parquet (columnar data format). Implemented a Scala REST API to communicate with various microservices, enabling the Front End to consume these services and display UI graphs on the dashboard.
AkkaHttp · Akka · Python (Programming Language) · PySpark · Scala · Data Engineering

Monsanto Company

Data Specialist

Apr 2016 - Apr 2017 · 1 yr · Bengaluru, Karnataka, India

  • Created various types of DAGs involving multiple sub-tasks, executing these functions cyclically as worker threads within Airflow. All DAG functions were implemented in Python. Managed large datasets stored in Hive tables in Parquet format to retrieve and process model data. Data streaming was conducted using Apache Spark, with data streamed through Kafka brokers.
Engineering Data Management · Data Architects · Python (Programming Language) · Apache Airflow · Data Engineering

UST Global

Software Engineer

Apr 2015 - Apr 2016 · 1 yr · Bengaluru Area, India

  • 1-Page (Hiring) employs a Microservice Architecture, facilitating communication between each component through HAProxy. This architecture is implemented within a job portal application. Recruiters can utilize the platform to search for specific skill sets by posting challenges to candidates. Candidates are then invited to participate in these challenges, proposing their ideas in response.
Software Development

Kenla Systems Pvt Ltd

Software Engineer

Aug 2013 - Nov 2014 · 1 yr 3 mos · Chennai Area, India

  • XcellTRACKER is a user-friendly tracking application offering a comprehensive range of features. Apart from managing warranties, insurance, and key documents, it includes an Asset Master List function, enabling users to track various types of assets, including loans and leases. Assets can be easily assigned or reassigned to users along with corresponding policies and warranties, accommodating changes such as policy updates, carrier switches, or warranty extensions.
  • The Asset list feature aids in tracking purchases, manufacturers, warranties, and returns/exchanges. Users can upload various attachments like photos, videos, audios, and documents related to their assets, insurance, warranties, and documents, which are stored in Amazon S3 and can be accessed later upon request. Additionally, all details and contents are accessible through the Android and iPhone apps.
Amazon EC2 · AJAX · JavaScript · Amazon S3 · MySQL · Scala

Education

SPVMHSS

Bachelor of Engineering - BE

Jun 2008 - Jul 2012
