Ravi Varshney

CTO

Noida, Uttar Pradesh, India20 yrs 6 mos experience
Highly StableAI Enabled

Key Highlights

  • 19+ years of experience in data and AI leadership.
  • Expert in building scalable data platforms and solutions.
  • Proven track record in managing large engineering teams.
Stackforce AI infers this person is a SaaS expert with a strong focus on data engineering and AI solutions.

Contact

Skills

Core Skills

Data EngineeringAi/mlGenerative AiInformation RetrievalBig DataData PlatformsSoftware Development

Other Skills

AerospikeAmazon S3Amazon Web Services (AWS)Anomaly DetectionApache AirflowApache KafkaApache SparkApache ZeppelinAutomatic SchedulingClassificationData DemocratizationData LakeData LakesData MiningData Pipelines

About

Unlocking the Power of Data & AI :) An accomplished engineering and Data leader with 19+ years of rich experience in strategising, architecting and building scalable and innovative high volume - high traffic solutions. Enterprise Data Platforms, Data democratization, Personalization & Contextualisation, Search & Discovery, Data Pipelines, Workflow Automation, Data-as-a-product, Backend platforms, Recommender systems, scraping platforms, AI / ML, GenAI, LLMs, openai, prompt engineering. - Seasoned in building and managing highly skilled engineering teams, stakeholder management, and managing projects. Can manage multiple large projects and teams simultaneously. - End-to-end execution of big technical / business initiatives from inception to making those successful. Currently as Vice President – Engineering, leading the complete Big data engineering platform (Data platform) for naukri.com and multiple related businesses like naukrigulf.com, firstnaukri.com, AmbitionBox, RMS, IIMJOBs etc. The Data platform I have built is a generalized and multi-tenant platform, consists of centralized data lake, Ingestion and Change data capture (CDC), Data pipeline, Self service discovery, query and access, Data processing clusters, Generalized reusable data models (for Insights and Transformations), Real time analytics, Data and pipeline observability, Anomaly detection, Data products (Multiple insight products for end users, user segmentation, on/off-platform branding and campaigns, internal analytical products, predictive algorithms, Dashboards / reporting, Multiple ML models etc.) - Built a centralized Search & Discovery engineering team. - Developed a generalized and unified information retrieval (Search) platform for structured and semi-structured datasets based on various IR technologies (ElasticSearch, Solr, Sphinx, Lucene). - Have done extensive customization to the lucene and Sphinx code base for various matching / scoring / relevance / sorting / grouping requirements of the result set. - Search platform comprises - text processing, keywords, NER, query expansion, personalization, semantic, taxonomy, feedback loop and AI based search implementation to improve matching and relevance. Generic pipeline to incorporate various indexing and search use cases and algorithm impls. Built a multi agent generalized web-crawling (scraper) platform for Hidden and dynamic web, information extraction and parsing, and scaled it to millions of pages. Built generic services platform, configure and deploy no-code model for any service use case.

Experience

20 yrs 6 mos
Total Experience
4 yrs 7 mos
Average Tenure
4 yrs 2 mos
Current Experience

Findem

Head of Data Engineering and Science || Director of Data Engineering and Science

Apr 2022Present · 4 yrs 2 mos

  • As the Head of Data Engineering and Science at Findem.ai (https://www.findem.ai/), a US-based People Intelligence Platform Startup operating at a global level with data and AI at its core,
  • I am at the forefront of leading the global data team and establishing it from the ground up, building data strategy of the company.
  • My responsibilities encompass overseeing the core data systems, which include backend services, Big data, data science, AI/ML, generative AI, OpenAI (ChatGPT), prompt engineering, data acquisition, data engineering, workflows, and data pipelines, insights, predictive analytics, data democratization etc.
  • Additionally, I manage aspects such as analytics, data quality and observability, cleansing, and data-as-a-products solutions.
  • My role involves effective vendor management, collaborating with data providers, technology
  • solution providers, and contractual employee providers.
  • I am dedicated to driving Data Democratisation, overseeing data lake implementation, managing data pipelines, enabling self-service data query and access, optimising workflow management, ensuring data freshness, and maintaining high standards of data quality and coverage.
  • Building Generative AI and Agentic AI-based Framework for Workflow Agents in Various
  • Recruitment Flows. Tools: PydanticAI, OpenAI, Weaviate (Vector Database) etc..
  • Designed and implemented RAG-based semantic search pipelines leveraging vector databases and LLMs, improving information retrieval accuracy and contextual relevance..
  • Industry’s first Copilot for Sourcing, built on 3D data.
  • Furthermore, I specialise in attribute classification, semantic search, Knowledge graphs, NER (Named Entity Recognition), NER (Named Entity Resolution), Information retrieval (Search), and other advanced data techniques.
Data EngineeringAI/MLGenerative AIData DemocratizationData PipelinesData Quality+1

Info edge india ltd

4 roles

Vice President - Engineering

Promoted

Apr 2018Apr 2022 · 4 yrs

  • Infoedge (Naukri.com) is a trailblazer in India's internet industry, biggest jobsite in India.
  • Currently, leading the complete Big data engineering platform (Data platform) team and Insights Products (Data-as-a-product like Naukri Talent Pulse, Naukri Insights etc.) for naukri.com and related businesses like naukrigulf.com, firstnaukri.com, AmbitionBox, RMS, IIMJOBs etc.
  • <->
  • The Data platform I have built is a generalized and multi-tenant platform, consists of
  • Centralized data lake.
  • Data capture - Real time / batch Ingestion, Hundreds of different data sources,
  • Change Data Capture (CDC), Transactional | Behavioral | off / on platform events / entities
  • capture and processing, ETL, data lake, Data and pipeline observability with real time
  • actionable alerts, Anomaly detection etc.
  • Data Query and Access – Centralized Data access layer, Distributed Data processing
  • and query engine, compute cluster, self service adhoc and scheduled queries / jobs,
  • data Catalog, data security, access control, data integration pipelines for different business use
  • cases etc.
  • Workflow management.
  • Data models – reusable and generic data models (aggregations, transformations and
  • enrichment) to serve multiple internal, end user facing insight use cases, dashboards / reports etc.
  • Real time analytics – Real time traffic analytics, reporting / dashboards, generalized real time
  • query platform, pipeline for various business use cases.
  • Dashboards / reporting / DWH.
  • Data products - Multiple insight products for end users, personalization use cases, user
  • segmentation, on/off-platform branding and campaign management systems, internal
  • analytical products, predictive algorithms etc. Multiple ML model deployments.
  • Migrated complete Search stack to elastic Search.
  • Build search-as-a-service.
  • Implemented various Named entity recognition (NER), semantic search algorithms to improve relevance and NDCG metric.
Big DataData PlatformsData ProductsReal-time AnalyticsETLData Lakes

Associate Vice President Engineering

Promoted

Apr 2014Mar 2018 · 3 yrs 11 mos

  • Built a centralized Search & Discovery engineering team.
  • Developed a generalized and unified information retrieval (Search) platform for structured and semi-structured datasets based on various IR technologies (ElasticSearch, Solr, Sphinx, Lucene).
  • Have done in-depth and extensive customization to the lucene and Sphinx code base for various matching / scoring / relevance / sorting / grouping requirements of the result set.
  • Search platform comprises - text processing, keywords, NER, query expansion, personalization, semantic, taxonomy, feedback loop and AI based search implementation to improve matching and relevance. Generic pipeline to incorporate various indexing and search use cases and algorithm implementations.
Search EngineeringInformation RetrievalElasticSearchSphinxLucene

Sr. Search Architect

Promoted

Apr 2012Mar 2014 · 1 yr 11 mos

Sr. S/W Engineer

Sep 2007Oct 2009 · 2 yrs 1 mo

  • Working as a Senior member of search engineering team building the unified search platform for Info Edge (Naukri.com tech division). Designing and implementing a scalable and extensible information retrieval architecture for current and future needs of the company's most critical offerings.
  • tools : C, C++, perl, php, MySql, Linux System calls
Information RetrievalSearch EngineeringSphinxLucene

Naukri.com

2 roles

Search Architect

Promoted

Oct 2009Mar 2012 · 2 yrs 5 mos

  • Have worked in developing information retrieval (Search) platform for structured and semistructured datasets based on various IR technologies (Sphinx, Lucene).
  • Done in-depth customizations in Sphinx and Lucene for various matching / scoring / relevance / sorting / grouping requirements of result set.
  • Designed and implemented mutiagent web-crawler, information extraction and parsing platform, capable of scaling to millions of pages.
  • Various AI/ML, nlp and collaborative filtering based recommnadation systems like ‘view similar jobs’, ‘view similar candidates’, ‘Similar companies’, skill and job title based recommendations, computer vision based recommendations etc.
Information RetrievalWeb CrawlingAI/MLRecommendation Systems

Senior Software Engineer

Sep 2007Oct 2009 · 2 yrs 1 mo

  • Working as a Senior member of search engineering team building the unified search platform for Info Edge (Naukri.com tech division). Designing and implementing a scalable and extensible information retrieval architecture for current and future needs of the company's most critical offerings.
  • tools : C, C++, perl, php, MySql, Linux System calls
Information RetrievalSearch EngineeringSphinxLucene

Thomson digital

Software Developer

Sep 2005Sep 2007 · 2 yrs

  • As a Team Member of SSDC (Special Software Development Cell) Group, involved in Design and Development of a Production Planning and Automatic Scheduling package for Manufacturing companies.
  • development tools used : Borland c++ builder, Mysql
Production PlanningAutomatic SchedulingSoftware Development

Education

B.Tech. (Computer Science & Engineering)

B.Tech. — Computer Science and Engineering

Stackforce found 100+ more professionals with Data Engineering & Ai/ml

Explore similar profiles based on matching skills and experience