Abhishek Agarwal

Software Engineer

Bengaluru, Karnataka, India10 yrs 10 mos experience

Key Highlights

Ph.D. in Information and Coding Theory from University of Minnesota.
Published in prestigious conferences like NeurIPS.
Developed AI systems with real-world impact across multiple industries.

Stackforce AI infers this person is a highly skilled AI Engineer with expertise in Deep Learning and Large Language Models across Healthcare and Software industries.

Contact

abhiag.719@gmail.com LinkedIn

Skills

Core Skills

Large Language Models (llm)Deep LearningComputer VisionRetrieval-augmented Generation (rag)GitProduct ManagementTransformersAlgorithmsMathematics

Other Skills

Research and Development (R&D)Model Context ProtocolLLMNeural NetworksCUDAGPUDocument SecurityCode AnalysisEmbedding-based ChunkingQWEN CoderBioinformaticsData ProcessingNatural Language ProcessingSQLOCR

About

AI Engineer and Researcher with 13+ years of experience across academia and industry. My background spans information-theoretic security, LLMs, and applied machine learning. During my Ph.D. at the University of Minnesota, I solved a 75-year-old open problem in statistical group testing. I have published extensively in IEEE journals and conferences in the fields of Information Theory and Machine Learning, including the prestigious NeurIPS conference in the field of AI. Since then, I’ve bridged research with real-world impact—developing AI-powered systems for code analysis, OCR, biomedical data pipelines, and secure document retrieval. I thrive at the intersection of theory and deployment, with a focus on building elegant, scalable solutions. Let’s connect if you’re working on ambitious problems in AI systems, or research-driven engineering.

Experience

10 yrs 10 mos

Total Experience

1 yr 3 mos

Average Tenure

9 mos

Current Experience

Turiyam ai

Senior Software Engineer

Aug 2025 – Present · 9 mos · Bengaluru, Karnataka, India

enabling AI acceleration for large scale systems.

Research and Development (R&D)Retrieval-Augmented Generation (RAG)GitDeep LearningModel Context ProtocolProduct Management+1

Samsung semiconductor

Staff Engineer

Dec 2023 – Sep 2025 · 1 yr 9 mos · Bangalore Urban, Karnataka, India · Hybrid

🔶 LLM & Computer Vision NeuralNet Pipeline Optimization for Mobile SoCs
Enabled Computer Vision and NLP NeuralNets for Samsung chipsets (Exynos), using chipset specific
optimizations, while maintaining accuracy and latency constraints
Independently refactored the legacy codebase for CUDA and GPU compatibility in 1 week, a task initially estimated to take 2–3 months with a full team
Reduced model support lifecycle by streamlining test workflows, resolving integration bottlenecks, and implementing development process optimizations
🔶 Secure LLM Chatbot for Technical Documentation
Architected a fully on-premise RAG system for internal documentation retrieval (Confluence)
Enabled role-based access control to ensure document-level security
Improved latency via LLM Quantization and smart document chunking
Integrated with internal workflow system for seamless information retrieval
🔶 LLM Copilot for Codebase Analysis and Vulnerability Remediation
Built a code-generation and ramp-up assistant using with custom system prompts and
in-context learning
Enabled code analysis support via embedding-based chunking of internal Git repos
Embedded integration with internal mirrors of NVD’s CVE database to cross-reference OSS dependencies.
🔶 AI-powered Solution to Automatically Resolve Code Merge conflicts
Used QWEN Coder to develop a custom merge driver for git, improving team productivity and minimizing integration bottlenecks
The solution was adopted across the team

Research and Development (R&D)Retrieval-Augmented Generation (RAG)GitDeep LearningMathematicsLarge Language Models (LLM)

2046 llc

Co-Founder and CSO

Mar 2023 – Nov 2023 · 8 mos · India · Hybrid

🔶 Led technology development, R&D, and client engagement across bioinformatics initiatives
Designed and deployed three bioinformatics pipelines for processing terabyte-scale biological assay data on client-facing servers
Integrated LLMs and Deep Learning models into production pipelines for enhanced interpretability and automation
Managed team of five, overseeing daily tasks, reviews, and client-facing deployments
🔶 LLM-Powered Chatbot for Multi-Omics BioTech Datasets
Led the team to develop a natural language chatbot using LLMs to interact with large-scale BioTech
datasets
Integrated chatGPT API to generate SQL queries to answer user prompts
Enabled cross-modal querying across RNA-seq, ATAC-seq, and clinical metadata to support exploratory research

Research and Development (R&D)GitTransformersDeep LearningMathematicsAlgorithms+1

Aimonk labs private ltd

Senior Machine Learning Consultant

Sep 2022 – Feb 2023 · 5 mos · India

🔶 Image to Latex Optical Character Recognition:
Built a novel transformers based OCR system for mathematical equations in PyTorch
Achieved 87% BLEU score on image data with noisy background
Researched and designed improvements to reduce inference time of the NeuralNet by a factor of 2-10 using linearized attention
🔶 Traffic Surveillance for Railway Crossing
Built a vehicle detection and tracking system using ResNet-50 (QAT + mixed precision) and LoRA-adapted DETR for CCTV traffic analysis
Reduced model size by 4× while improving latency by 6×, still achieved 82% MOTA for vehicle tracking

Research and Development (R&D)GitTensorFlowDeep LearningMathematicsAlgorithms+1

The jackson laboratory

Postdoctoral Researcher

Feb 2020 – Jun 2022 · 2 yrs 4 mos · United States

🔶 Attention Mechanism for Biological Assays:
Lead the effort to build a nextflow pipeline to create the first dataset of matching mouse and Human DNA features (enhancers) using Deep Learning.
Achieved 83% accuracy in predicting enhancer-promoter links. The model used LSTM attention and 1-D convolution to predict expensive ground truth assay outputs from inexpensive assays like ATAC-seq
🔶 Deep Learning Model for COVID:
Designed the architecture for using 2D CNN model on patient CT scan images
Reduced the data requirement for CT scans by a factor of 5, by using transfer learning from 2D CNNs to 3D models
Reduced false positives by 20% using the 3D architecture

Research and Development (R&D)GitTensorFlowDeep LearningMathematicsAlgorithms