A

Arijit Pramanik

Software Engineer

Sunnyvale, California, United States8 yrs 3 mos experience
Most Likely To SwitchHighly Stable

Key Highlights

  • Co-author of significant research on KV/prefix caching.
  • Expertise in advancing diffusion models for creator effects.
  • Strong background in Machine Learning and Distributed Systems.
Stackforce AI infers this person is a Machine Learning Engineer with a focus on Cloud Computing and Distributed Systems.

Contact

Skills

Core Skills

Machine LearningVideo ProcessingCloud ComputingDistributed SystemsDatabase ManagementSoftware DevelopmentResearch

Other Skills

PythonReinforcement LearningPrompt CachingE2E EncryptionGCPVector SearchStorage EnginesRustC++PostgreSQLKey-Value StoresDatabase OptimizationGithubMicrosoft OfficeMarket Research

About

I was part of the Gemini 2.5 Core team and co-author of https://arxiv.org/abs/2507.06261 for KV/prefix cachingI currently work on advancing diffusion models for creator effects and adapting LLMs for recommendation models at YouTube Shorts. I previously worked in the Vertex AI LLM efficiency team and Analytical Storage team in Core Data.I completed my MS at UW-Madison pursuing research in "ML for Systems" and experimenting with key-value stores and distributed databases. I had previously worked on Networked Systems including Programmable Switches (P4), Hardware acceleration (SmartNICs), although I'm equally interested in Computer Vision and Natural Language Processing, all intertwined with Machine Learning.

Experience

8 yrs 3 mos
Total Experience
1 yr
Average Tenure
4 yrs 1 mo
Current Experience

Google

4 roles

Software Engineer

Jul 2025Present · 10 mos

  • Bringing the best of Veo 3 to YouTube Shorts' Advanced Capabilities and Effects : https://blog.youtube/news-and-events/generative-ai-creation-tools-made-on-youtube-2025 like image-to-video, video stylization and pose control.
  • Improving the efficiency of state-of-the-art transformer-based Shorts recommendation models along with significant quality improvements using RL-based post-training policy optimization techniques
PythonMachine LearningReinforcement LearningVideo Processing

Software Engineer

Jul 2024Jun 2025 · 11 mos

  • Working on Gemini inference efficiency initiatives including prompt caching similar to OpenAI : https://ai.google.dev/gemini-api/docs/caching as well as allowing explicit control over the prefix to be cached like Anthropic : https://cloud.google.com/vertex-ai/generative-ai/docs/context-cache/context-cache-overview
  • Designed and implemented E2E encryption of all cached content with customer-provided encryption keys on GCP for prompt/context caching
Prompt CachingE2E EncryptionGCPCloud ComputingMachine Learning

Software Engineer

Jan 2024Jun 2024 · 5 mos

  • Vertex AI Vector Search (https://cloud.google.com/vertex-ai/docs/vector-search/overview) is a massively scalable, low-latency, and cost-efficient vector similarity search engine. Embedding vector search is one of the most critical steps in most Gen AI applications, notably Retrieval Augmented Generation (RAG). Currently building the enterprise-ready vector similarity search and ranking models that enable personalization, recommendation, and monetization at scale by providing high quality approximate nearest neighbors (ANN) as a Google Cloud Platform (GCP) service
Vector SearchGCPMachine LearningCloud Computing

Software Engineer

Feb 2022Jan 2024 · 1 yr 11 mos

  • Napa (https://research.google/pubs/pub50617/) is Google's new peta-scale, multi-homed, distributed storage engine optimized for high-update throughput of metrics data. It is the next generation of Mesa (https://research.google/pubs/pub42851/), but built from scratch. Like Mesa, Napa provides materialized views, supports low-latency point queries and high-throughput table scans; updates are consistently applied to materialized views across data centers.
Distributed SystemsStorage EnginesDatabase Management

Amazon web services (aws)

2 roles

Software Engineer

Jun 2021Feb 2022 · 8 mos · East Palo Alto, California, United States

  • Developed a Rust/C++ SDK for sandboxed, serverless execution of customer third-party extensions in PostgreSQL, without need for manual deployment verification by DB administrators. Responsible for :
  • Provisioning multi-AZ databases across a variety of EC2 instance types and EBS volume types
  • Automated backups and cross-region streaming with multiple read replicas
  • Auto engine version upgrades, patching and failovers aided by disaster recovery using S3
  • Support for custom DB parameters for better query workload performance
RustC++PostgreSQLDatabase ManagementSoftware Development

Software Engineer Intern

May 2020Aug 2020 · 3 mos · East Palo Alto, California, United States

  • Integrated an open-source extension PL/Container into PostgreSQL 12 and 13 to facilitate sandboxed execution of untrusted functions & stored procedures inside Docker containers to prevent customer programs from crashing the database server and enforce finer access privileges
  • Performed extensive benchmarking with a study of shared memory segments, Unix sockets and grpc channels to identify bottlenecks for maintaining average latency at par with extensions running locally
  • Developed a new prototype leveraging a single container across all customer sessions, leading to a 63% reduction in memory usage while scaling to thousands of customer connections
  • Added runtime support for Go language, and created 3 separate extensions for each of R, Python and Go utilizing separate containers for better isolation

University of wisconsin-madison

3 roles

Graduate Research Assistant

Aug 2020May 2021 · 9 mos · Madison, Wisconsin, United States

  • Log-Structured Merge (LSM) based key-value stores have become so popular today that they are used as backend for relational database abstractions like TiDB. They use indexes for faster data lookup whose memory overhead increases with database size, leaving lesser memory for caching data blocks. Recent studies also show that application throughput can be compromised by internal LSM tree operations that periodically write data to disk.
  • Observed 2.24-2.34X improvement in client write throughput for industrial write bursty workloads by scheduling background memtable flushes and compactions during idle periods or periods of very few writes to the database
  • Employed Learned Indexes to train a model to learn offsets from last keys of data blocks inside SSTables to reduce lookup time from O(logn) to O(1)
  • Obtained a 53% reduction in indexing memory footprint over traditional indexes with <5% increase in read latency using both Fuzzy and Greedy Piecewise Linear Regression in RocksDB
  • Guides : Prof. Remzi and Andrea Arpaci-Dusseau
Key-Value StoresDatabase OptimizationDatabase ManagementResearch

Graduate Teaching Assistant

Jan 2020May 2020 · 4 mos · Madison, Wisconsin Area

  • CS 559 : Computer Graphics
  • Instructor : Prof. Michael Gleicher

Graduate Teaching Assistant

Aug 2019Dec 2019 · 4 mos · Madison, Wisconsin Area

  • CS 559 : Computer Graphics
  • Instructor : Florian Heimerl

University of washington

Research Intern

May 2019Aug 2019 · 3 mos · Greater Seattle Area

  • Worked with different L4 and L7 proxies to demarcate functionalities for offloading to the SmartNICs. Carried out benchmarking experiments across Nginx, HAproxy and Envoy with SSL to determine feasibility of checksum offloading and scalability. Performed a detailed study of Envoy source code to determine scope for distribution of core functionalities between host and hardware for offloading
  • Guide : Prof. Arvind Krishnamurthy

Indian institute of technology, bombay

3 roles

Teaching Assistant

Jan 2019May 2019 · 4 mos

  • Served as a Teaching Assistant for a Freshman year course, Computer Programming and Utilization.
  • Instructor : Prof. Ganesh Ramakrishnan
  • Responsible for evaluation of labs for a batch of 450+ students
  • Set up weekly lab statements on bodhitree : IITB's online classroom learning initiative

Teaching Assistant

Jul 2018Nov 2018 · 4 mos

  • Served as a Teaching Assistant for a Third Year Computer Science Core Course, Computer Architecture
  • Instructor : Prof. Bernard Menezes
  • Entrusted with evaluating and grading labs for a batch of 120+ students
  • Assisted in setting up weekly lab statements and semester project proposals

Department Placement Coordinator

Jun 2018Jun 2019 · 1 yr

  • Represented the department of Computer Science and Engineering in the institute placement team
  • Assisted in the 2019 annual placements by conducting preparatory tests and personal interviews
  • Verified and scrutinized resumes of over 50+ students across all departments

Adobe

Research Intern

May 2018Jul 2018 · 2 mos · Bengaluru Area, India

  • Adapted Facebook AI Research’s Convolutional seq2seq model for translation to characteristic-driven text generation on pytorch, using CNNs for computing encoder and decoder states
  • Modified beam search paradigm for enhancing formality, descriptiveness and vocabulary tailored generation; attention
  • layers for topic-tuned summaries and token-based learning for length based summarization
  • Incorporated a Reinforcement Learning term in loss function and achieved a 6.4% increase in ROUGE scores
  • Guide : Dr. Balaji Vasan Srinivasan

Getfocus - contextual marketing through retail analytics

Data Analyst Intern

Nov 2017Dec 2017 · 1 mo · Mumbai

  • Involved in the assessment of retail customer database for appographic targeting through contextual marketing and developing focused strategies for the same
  • Leveraged Natural Language Processing techniques for customer segmentation and retail-category-affinity estimation in python
  • Implemented probabilistic graphical model based recommendation engine with contributions to the pgmpy github repository
  • Created a new query language for internal database query system on neo4j utilising parse trees

Philips innovation campus, bangalore

Summer Research Intern

May 2017Jul 2017 · 2 mos · Bengaluru Area, India

  • Designed a chatbot for customer care services to resolve queries pertaining to various consumer products
  • Facilitated the parsing of user manuals for construction of chatbot knowledge base with the addition of an AIML engine
  • Built the Semantic Engine for contextual semantic matching to find the most relevant topic leveraging Latent Semantic Indexing an Latent Dirichlet Allocation in python gensim with enhanced synonym recognition
  • Built a Sentiment Classifier for tracking customer satisfaction in live chat sessions based on a multiclass, multilabel emotion classifier with a backend sentiment score plotter
  • Facilitated the building of knowledge representations using ontologies from XML, CSV sources using Stanford’s Protege Library with an interface to fetch data responses from the same using SPARQL queries

Indian institute of technology, bombay

Teaching Assistant

Jan 2017May 2017 · 4 mos · Mumbai Area, India

  • Served as a Teaching Assistant for a Freshman Year Course, Introduction to Biology
  • Instructors : Prof. Ambarish Kunwar, Prof. Rahul Purwar, Prof. Sanjeev Srivastava and Prof. Prakriti Tayalia
  • Entrusted with evaluating and grading papers for a batch of 450+ students
  • Conducted weekly tutorials and helping sessions for clearing doubts on a one-on-one basis

Olivesync private limited

Algorithm Architect

Dec 2016Jan 2017 · 1 mo · Mumbai Area, India

  • Worked on the automated timetable generation algorithm from university slotting data :-
  • Designed the backend algorithm for generating timetable for institutes based on a set of constraints encompassing room compatibility, professor availability and course working hours
  • Implemented genetic algorithms in Java for obtaining the best-fit optimal timetable, mimicking biological evolution
  • Added sync to MySQL database and displayed the same via PHP, hosted on server for tracking availability and managing ad-hoc modifications to the timetable schedule

Mood indigo iit bombay

Marketing Coordinator

Apr 2016Aug 2016 · 4 mos

  • Working in a team of 24 towards conceptualization and successfully organizing Mood Indigo 2016.
  • Pursuing the marketing budget required for Mood Indigo 2016 through corporate sponsorship.
  • This includes marketing activities and integration(s) tailor-made for each brand during the event so as to mutually add value to both Mood Indigo and our partner.
  • Conceptualizing and Executing brand integration strategies having Indian youth as the target group for global and national corporations.
  • Primary focus on promoting brand Mood Indigo by developing synergic relations with corporates through Relationship Marketing.

Education

University of Wisconsin-Madison

Master's degree — Computer Science

Jan 2019Jan 2021

Indian Institute of Technology, Bombay

Bachelor of Technology (B.Tech.) — Computer Science and Engineering

Jan 2015Jan 2019

National University of Singapore

Semester Exchange — Computer Science

Jan 2018Jun 2018

Bhavan's Gangabux Kanoria Vidyamandir

Science

Jan 2001Jan 2015

Stackforce found 100+ more professionals with Machine Learning & Video Processing

Explore similar profiles based on matching skills and experience