A

Akshat Gupta

AI Researcher

Stuttgart, Baden-Württemberg, Germany4 yrs 3 mos experience
Most Likely To SwitchAI ML Practitioner

Key Highlights

  • Expert in machine learning and AI with a focus on NLP.
  • Proven track record in developing advanced AI systems.
  • Recognized Kaggle Expert and Intel Software Innovator.
Stackforce AI infers this person is a Machine Learning and AI expert specializing in NLP and deep learning solutions.

Contact

Skills

Core Skills

Machine LearningArtificial IntelligenceNatural Language Processing (nlp)Deep LearningSoftware Development

Other Skills

AlgorithmsArtificial Neural NetworksCSSClassificationClusteringComputer VisionCreative SolutionsData ScienceData StructuresData VisualisationDocker ProductsGenerative AIGenerative Adversarial Networks (GANs)GitGithub

About

Hello and welcome to my profile! A seasoned machine learning researcher and engineer with a focus on advancing AI through deep learning and natural language processing. Specializing in computer vision, speech analysis, and generative AI models, with a strong track record in research, genai, diffusion models, llms, cloud deployment, and developing robust, ethical, and fair AI systems. Key expertise includes LLM performance optimization, zero-shot/few-shot learning, prompt engineering, adversarial training, guided diffusion, multimodal emotion recognition, speaker diarization, OCR for medical records, recommendation engines, and attention-based models for language translation. Proven ability to lead teams, manage the full ML lifecycle, and contribute to cutting-edge research publications. Passionate about pushing the boundaries of AI capabilities while ensuring ethical practices. Committed to continuous learning, staying up-to-date with the latest advancements, and exploring novel techniques like diffusion models, transformers, and generative adversarial networks (GANs). Skilled in Python, PyTorch, TensorFlow, AWS, and various deep learning frameworks. Recognized as a Kaggle Expert, Intel Software Innovator, and author of machine learning books and online courses. Website: https://akshat4112.github.io/ GitHub: https://github.com/akshat4112/ Publications: Glyphnet: Homoglyph domains dataset and detection using attention-based Convolutional Neural Networks, https://arxiv.org/pdf/2306.10392.pdf

Experience

4 yrs 3 mos
Total Experience
1 yr
Average Tenure
2 yrs 1 mo
Current Experience

Additiv

AI Engineer

May 2024Present · 2 yrs 1 mo · Zurich, Switzerland · Remote

  • Developed Doc-Parser, a core part of additiv’s document automation stack, enabling accurate extraction of structured data from PDFs, scans, and semi-structured inputs.
  • Designed a scenario-driven, multi-language pipeline using GPT-4.1 and layout-aware OCR, supporting dynamic parsing based on context and user-defined schemas.
  • Built a modular agentic system with Agno for prompt engineering, field extraction, formatting, validation, and post-processing with memory-aware logic and fallbacks.
  • Implemented customizable, schema-driven JSON outputs and REST APIs, reducing manual document processing by over 80% and improving speed, accuracy, and scalability.
  • Developed ClaimFlow, a multi-agent AI system automating end-to-end claims analysis and decision-making for household and liability insurance.
  • Implemented specialised agents for scenario interpretation, damage cause and age estimation, audit validation, and structured reporting, using a shared-memory architecture for effective collaboration and contextual reasoning.
  • Integrated advanced image/video analysis, document parsing, geolocation data, user inputs, and LLM-powered reasoning to emulate human insurance auditor workflows.
  • Produced comprehensive, audit-ready PDF reports detailing damage segmentation, policy information, reasoning steps, clause matching, and final claim decisions, ensuring full transparency and auditability.
  • Retrieval-Augmented Generation (RAG): developed a hybrid retrieval using contextual and BM25 embeddings, used Faiss vector db, re-ranking on PDFs, JSONs, and Excel sheets and used langchain data loaders.
  • Implemented query translation to extend context in use.
  • Orchestrated end-to-end workflows with LangChain, incorporating contextual reranking models to surface the most relevant passages.
  • Integrated Chainlit for an interactive internal UI and deployed RAG with Phoenix for real-time monitoring and observability.
Large Language Models (LLM)Machine LearningDeep LearningArtificial IntelligenceGenerative AINatural Language Processing (NLP)+1

Validaitor

Machine Learning Engineer

Apr 2023Mar 2024 · 11 mos · Karlsruhe, Baden-Württemberg, Germany · Remote

  • Conducted research and implemented Adversarial Machine Learning techniques (FGSM, PGD) to evaluate and strengthen model robustness against adversarial attacks.
  • Fine-tuned machine learning models using adversarially generated datasets to improve resilience and generalisation under attack scenarios.
  • Executed model-stealing attacks (Copycat CNN) to identify vulnerabilities; developed and implemented protective algorithms such as model watermarking and fingerprinting.
  • Performed validation and evaluation of fairness, toxicity, and bias in Large Language Models (LLMs), building frameworks to ensure ethical AI deployments.
Deep LearningPython (Programming Language)Large Language Models (LLM)Machine Learning

Institut für parallele und verteilte systeme

Research Assistant

May 2022Oct 2022 · 5 mos · Stuttgart, Baden-Württemberg, Germany · Hybrid

  • at the Institut für Parallele und Verteilte Systeme (IPVS)
  • Conducted research and implemented spatio-temporal word embedding models, enhancing Skip-Gram architectures to incorporate temporal and geographic context.
  • Developed Python-based embedding pipelines and evaluation frameworks for temporal semantic shifts, regional dialect detection, and time-sensitive NLP tasks.
Deep LearningNatural Language Processing (NLP)Python (Programming Language)Knowledge Engineering

Quantiphi

Machine Learning Engineer

Jan 2021Oct 2021 · 9 mos · Bengaluru, Karnataka, India · Remote

  • Developed a multimodal emotion recognition system, integrating BERT (text) and wav2vec (speech) embeddings into a unified transformer-based neural network architecture trained with a joint loss function.
  • Implemented a Voice Activity Detection (VAD) pipeline using binary Convolutional Neural Networks and applied K-means clustering for effective Speaker Diarization.
  • Achieved strong performance metrics with an Emotion Recognition F1-score of 0.88 and a Speaker Diarization Error Rate (DER) of 17.35%.
Deep LearningData ScienceNatural Language Processing (NLP)Python (Programming Language)Machine Learning

Scanta inc.

AI/ML Team Lead

Mar 2019Mar 2020 · 1 yr · Gurgaon, India · On-site

  • Designed and deployed end-to-end NLP pipelines using AWS container services for low-latency text processing and inference.
  • Developed style transfer, language correction, data augmentation (21 modules), and personality detection models, boosting overall accuracy by 4.73%.
  • Integrated classical NLP methods with BERT-based deep learning to enable context-aware rephrasing, achieving up to 0.93 semantic similarity.
  • Presented strategic AI insights and emerging NLP innovations to leadership, directly shaping the product roadmap.
Computer VisionDeep LearningArtificial IntelligenceMachine LearningLeadership

Mobile programming llc.

Machine Learning Engineer

Jul 2018Dec 2018 · 5 mos · Gurgaon, India · On-site

  • Architected and implemented a 15-layer attention-based encoder–decoder model for high-precision language translation, achieving a BLEU score of 37.13.
  • Developed a domain-tailored BiLSTM-CRF model for pharmacological named-entity recognition in NIH protocol documents, reaching an F1 score of 0.83.
  • Built ML pipelines with Keras, SpaCy, NumPy, Pandas, and Anaconda for reproducible, rapid integration of translation and NER into apps.
Deep LearningData ScienceNatural Language Processing (NLP)Python (Programming Language)Machine Learning

Msme-technology development centre (ppdcagra)

2 roles

Software Developer Internship

Jun 2017Jul 2017 · 1 mo · Agra Area, India

  • This is an application fully developed in Codeigniter(PHP MVC). It was really a whopping useful application for MSME (Government of India) as it generates certificates automatically and have functions like direct CSV import and automatic QR code embedding on certificates for verification.
PHPJavaScriptSoftware Development

Application Developer Internship

Jun 2016Jul 2016 · 1 mo · Agra Area, India

  • In this internship I worked on ERP software for MSME(Govt. of India).In ERP software i worked on attendance module and courses module. It is fully based on Drupal, a PHP framework.
PHPSoftware Development

Education

University of Stuttgart

Master of Science - MS — Computational Linguistics

Dr. A.P.J. Abdul Kalam Technical University

Bachelor of Technology (B.Tech) — Computer Science

Simpkins School

I through XII High School

Stackforce found 100+ more professionals with Machine Learning & Artificial Intelligence

Explore similar profiles based on matching skills and experience