Anmol Gautam

CTO

Bengaluru, Karnataka, India · 3 yrs 8 mos experience
Most Likely To Switch · AI Enabled

Key Highlights

  • Expert in scaling LLM inferencing pipelines.
  • Developed autonomous software systems.
  • Proficient in model fine-tuning techniques.
Stackforce AI infers this person is a skilled AI/ML engineer specializing in generative AI and model optimization.

Skills

Core Skills

Large Language Models (LLM) · Deep Learning · Generative AI · Software Development · Model Fine-tuning · AI Research · AI/ML Development · NLP

Other Skills

Analytical Skills · Computer Vision · DPO · Document Understanding · Engineering · English · Falcon series · GPU clusters · Hugging Face · Instruct FT · LLM inferencing · Machine Learning · Mistral 7B · Mixtral · Natural Language Processing (NLP)

About

At 8bit.ai, they focus on scaling and optimizing LLM inferencing pipelines for open-source models on GPU clusters. Before that, at SuperAGI, they worked on SuperCoder 2.0, an autonomous software development system, and on fine-tuning techniques such as Instruct FT and DPO for instruct models. Studies at the National Institute of Technology Meghalaya laid the foundation for core competencies in AI and computer science, complemented by practical experience in multi-agent systems, model fine-tuning, and generative AI. Dedicated to advancing scalable AI solutions, they aim to bridge research and real-world applications.

Experience

Total Experience: 3 yrs 8 mos
Average Tenure: 1 yr 2 mos
Current Experience: 1 yr 7 mos

8bit.ai

Lead Applied Scientist AI/ML

Oct 2024 - Present · 1 yr 7 mos · Bengaluru, Karnataka, India · Hybrid

  • Scaling and optimising LLM inferencing pipelines for Open Source Models on GPU clusters.
Python · LLM inferencing · GPU clusters · Large Language Models (LLM) · Deep Learning

SuperAGI

2 roles

Applied Scientist AI/ML

May 2024 - Oct 2024 · 5 mos · Bengaluru, Karnataka, India · Hybrid

  • Building SuperCoder 2.0, an open-source autonomous software development system.
SuperCoder 2.0 · autonomous software development · Generative AI · Software Development

AI Product Engineer

Nov 2023 - May 2024 · 6 mos · Bengaluru, Karnataka, India · Hybrid

  • 1) Development of SAM-v1, a small agentic model based on Mistral 7B.
  • Developed explanation traces to fine-tune Mistral 7B.
  • Used Yi-34B-Chat, the Falcon series, and Mixtral to generate rationales.
  • The aim was to fine-tune Mistral 7B on explanation/reasoning traces from stronger models to improve its problem-solving capability.
  • SAM achieved performance comparable to GPT-3.5 and outperformed Orca on GSM8K.
  • Blog: https://superagi.com/introducing-sam-small-agentic-model/
  • Hugging Face: https://huggingface.co/SuperAGI/SAM
  • 2) Research: part of the core research team at SuperAGI.
  • Worked on model fine-tuning techniques such as Instruct FT and DPO.
  • Dataset creation for chat and instruct models.
  • Agentic frameworks facilitating multi-agent collaborative chat applications.
Instruct FT · DPO · dataset creation · multi-agent systems · Model Fine-tuning · AI Research

Oracle

Associate Consultant

Aug 2022 - Oct 2023 · 1 yr 2 mos · Bengaluru, Karnataka, India · On-site

  • 1) Worked with the client to analyze functional requirements and migrate SOA integrations to OIC Gen2. Analyzed SOA composites for ERP- and CRM-based systems.
  • 2) Worked in AI/ML to develop POCs and implement client automation requirements using Python, with frameworks such as TensorFlow and PyTorch. Developed a semantic search system for closed-domain question answering using LLMs. Used open-source tools to quickly build NLP and document understanding applications, and worked with OCI AI Services to meet client needs.
Python · TensorFlow · PyTorch · NLP · Document Understanding · AI/ML Development

Nvidia

Research Intern

May 2021 - Apr 2022 · 11 mos · Bangalore Urban, Karnataka, India

  • Worked on deep learning algorithms using frameworks such as NVIDIA NeMo and NVIDIA TAO.
Deep Learning · NVIDIA NeMo · NVIDIA TAO

Education

National Institute of Technology Meghalaya

Master of Technology - MTech — Computer Science

Jan 2020 - Jun 2022

Vellore Institute of Technology

Bachelor of Technology - BTech — Information Technology

Jan 2010 - Jan 2014

Delhi Public School - India

High School Diploma — Computer Science

Jan 2005 - Jan 2009
