Kareem Shaik

AI Researcher

Bengaluru, Karnataka, India · 19 yrs 11 mos experience
Most Likely To Switch · AI ML Practitioner

Key Highlights

  • Expert in Generative AI and Computer Vision.
  • Pioneered multimodal search and AI-powered retrieval.
  • Renowned for transitioning research into commercial success.
Stackforce AI infers this person is a leading expert in eCommerce and multimedia technology.

Skills

Core Skills

Computer Vision · Generative AI · AI-powered Retrieval · Machine Learning · Biometrics · Touch Interfaces · Algorithm Development · Video Processing · Hardware Architecture · Software Development · Image Processing · Audio Processing

Other Skills

ARM · Adaptive Filtering · Agentic AI · Algorithms · Audio Codecs · AutoML · C · C-Modeling · Cataloging · Coaching & Mentoring · Debugging · Deep Learning · Digital Image Processing · Digital Signal Processing

About

With nearly two decades of expertise, I am deeply passionate about merging cutting-edge technology with tangible business outcomes, specialising in eCommerce and mobile applications to transform user experiences and drive market impact.

  • Expert in Computer Vision & Generative AI: Leading advancements in image/video synthesis, including product image enhancement and aspirational image generation for Home page and PCA/PLA advertising, driving compelling product engagement and conversion.
  • Multimodal Search & AI-Powered Retrieval: Pioneering visual search, composed image retrieval, and AI chat assistants with a foundation in image classification, detection and image quality assessment for efficient cataloging and product enrichment.
  • Business-Centric Innovation: Focused on building scalable, innovative solutions that align with business objectives, delivering impactful tools that accelerate digital transformation for eCommerce platforms.
  • Passionate About the Future of Technology: Committed to open innovation and staying ahead of technological advancements, always seeking to leverage forward-thinking strategies to push the boundaries of what’s possible.
  • Research to Real-World Impact: Renowned for seamlessly transitioning research into commercial success, particularly in computer vision, multimedia, and touch sensing, creating measurable business value.

Experience

Flipkart

3 roles

Senior Principal Data Scientist

Promoted

Sep 2024 - Present · 1 yr 6 mos

  • Building Generative AI Computer Vision Foundation models and multimodal models designed to power advanced synthesis applications. These models focus on creating rich and aspirational product creatives, elevating the quality and appeal of visual content for eCommerce platforms.
  • Building multimodal representation learning frameworks that enhance retrieval applications, including visual search and interactive multimodal search, specifically tailored for eCommerce. These technologies aim to improve the overall user experience by providing more accurate, dynamic, and contextually relevant search results, thereby driving innovation in product discovery and consumer engagement.
Generative AI · Computer Vision · Multimodal Models · Image Synthesis

Principal Data Scientist

Promoted

Apr 2021 - Sep 2024 · 3 yrs 5 mos

  • Applied GenAI image synthesis to product image enrichment and ads, aspirational image generation, and generated-image quality assessment; built multimodal solutions for visual search and composed image retrieval.
Image Synthesis · GenAI · Multimodal Solutions · Generative AI · Computer Vision

Senior Data Scientist

Jul 2019 - Mar 2021 · 1 yr 8 mos

  • Developed an Auto Quality Check and Auto Enrichment solution for seamless cataloging, utilizing image classification, detection, and segmentation in computer vision to evaluate listing guidelines.
  • Built an AutoML platform equipped with base models optimized for eCommerce, enabling various downstream tasks critical to eCommerce operations.
AutoML · Image Classification · Computer Vision · Machine Learning

Samsung R&D Institute India - Bangalore Private Limited

3 roles

Principal Engineer

Mar 2015 - May 2019 · 4 yrs 2 mos · On-site

  • Led on-device computer vision solutions for Home IoT applications, including human fall detection and human detection using dynamic vision sensors.
  • Spearheaded biometric innovations such as fingerprint verification for fingerprint-on-display devices, fingerprint liveness detection, and Aadhaar fingerprint and iris authentication.
Computer Vision · Biometrics · Dynamic Vision Sensors

Senior Chief Engineer

Mar 2013 - Feb 2015 · 1 yr 11 mos · On-site

  • Led the development of algorithms for jitter suppression, elliptical feature extraction, proximity sensing, touch coordinate extraction, wet touch detection, multi-finger tracking, and touch interactive modes classification.
Algorithm Development · Touch Sensing · Multi-Finger Tracking · Touch Interfaces

Chief Engineer

Jan 2010 - Mar 2013 · 3 yrs 2 mos · On-site

  • Developed C-modeling for Real Media Video Codec (RMV) as part of the Multi-Format Codec hardware architecture.
  • Executed C-modeling for H.264 FRExt (4:2:2, 4:4:4, 10-bit) for integration into the Multi-Format Codec hardware architecture.
  • Designed and developed C-modeling of the scalable video codec (SVC-H264) for the Multi-Format Codec hardware architecture.
C-Modeling · Video Codec Development · Multi-Format Codec · Video Processing · Hardware Architecture

Texas Instruments

Senior Software Engineer

Jan 2008 - Jan 2010 · 2 yrs · Bangalore · On-site

  • Developed and optimised a 1080p H.264 encoder for the DM368 architecture.
  • Developed and optimised a 720p H.264 encoder for the DM365 architecture.
H.264 Encoder Optimization · Video Processing · Software Development

AllGo Embedded Systems Pvt Ltd

Software Engineer

Jan 2006 - Jan 2008 · 2 yrs · Bengaluru Area, India · On-site

  • Designed and developed a wavelet-based image denoising algorithm for the ARM11 processor, a sample rate converter for MPEG Layer 1/2/3 frequencies on ARM9/ARM11, and an IMDCT (Inverse Modified Discrete Cosine Transform) for WMA decoding on ARM processors.
  • Designed and developed an H.264 decoder for the Silicon Hive 2-Tile architecture.
  • Designed and implemented a stereo widening algorithm, enhancing audio quality across various platforms.
Image Processing · Audio Processing · Wavelet Algorithms

Education

Udacity

Nanodegree — Deep Learning

Jan 2018 - Jan 2018

Udacity

Nanodegree — Computer Vision

Jan 2018 - Jan 2018

Indian Institute of Technology, Madras

Master's degree

Jan 2003 - Jan 2005

Bapatla Engineering College

Bachelor of Technology (B.Tech.)

Jan 1999 - Jan 2003
