Anmol Gautam

CTO

Bengaluru, Karnataka, India3 yrs 8 mos experience

Most Likely To SwitchAI Enabled

Key Highlights

Expert in scaling LLM inferencing pipelines.
Developed autonomous software systems.
Proficient in model fine-tuning techniques.

Stackforce AI infers this person is a skilled AI/ML engineer specializing in generative AI and model optimization.

Contact

Skills

Core Skills

Large Language Models (llm)Deep LearningGenerative AiSoftware DevelopmentModel Fine-tuningAi ResearchAi/ml DevelopmentNlp

Other Skills

Analytical SkillsComputer VisionDPODocument UnderstandingEngineeringEnglishFalcon seriesGPU clustersHuggingfaceInstruct FTLLM inferencingMachine LearningMistral 7BMixtralNatural Language Processing (NLP)

About

8bit.ai focuses on scaling and optimizing LLM inferencing pipelines for open-source models on GPU clusters, contributing to cutting-edge AI advancements. Prior to this, SuperAGI enabled the development of SuperCoder 2.0, an autonomous software development system, and spearheaded fine-tuning techniques like Instruct FT and DPO for instruct models. The National Institute of Technology Meghalaya laid the foundation for core competencies in AI and computer science, complemented by practical experience in multi-agent systems, model fine-tuning, and generative AI. Dedicated to advancing scalable AI solutions, they aim to bridge research and real-world applications.

Experience

3 yrs 8 mos

Total Experience

1 yr 2 mos

Average Tenure

1 yr 7 mos

Current Experience

8bit.ai

Lead Applied Scientist AI/ML

Oct 2024 – Present · 1 yr 7 mos · Bengaluru, Karnataka, India · Hybrid

Scaling and optimising LLM inferencing pipelines for Open Source Models on GPU clusters.

PythonLLM inferencingGPU clustersLarge Language Models (LLM)Deep Learning

Superagi

2 roles

Applied Scientist AI/ML

May 2024 – Oct 2024 · 5 mos · Bengaluru, Karnataka, India · Hybrid

Building SuperCoder 2.0, an Open Source autonomous Software Development System

SuperCoder 2.0autonomous software developmentGenerative AISoftware Development

AI Product Engineer

Nov 2023 – May 2024 · 6 mos · Bengaluru, Karnataka, India · Hybrid

2) Research - Part of the core research team at SuperAGI.
working on model Finetuning techniques - Instruct FT, DPO, etc.
Dataset creation for chat and instruct models.
Agentic Frameworks facilitating multi agent collaborative chat applications.
1) Development of SAM-v1, small agentic model based on Mistral 7B.
Development of Explanation traces to fine tune mistral 7b
Used Yi-34B-Chat, Falcon - series, and Mixtral to generate Rationales
SAM achieved performance comparable to GPT 3.5 and outperformed Orca on GSM 8k.
The aim was to fine tune Mistral 7b with explanation/reasoning traces of better models to achieve better problem solving capability.
Blog : https://superagi.com/introducing-sam-small-agentic-model/
HuggingFace : https://huggingface.co/SuperAGI/SAM

Instruct FTDPOdataset creationmulti-agent systemsModel Fine-tuningAI Research

Oracle

Associate Consultant

Aug 2022 – Oct 2023 · 1 yr 2 mos · Bengaluru, Karnataka, India · On-site

2) Working in AI/ML to develop POCs and implementing client requirements for automation using Python. Working with frameworks like TensorFlow and PyTorch. Developing Semantic search system for closed domain Question answering using LLMs. Using different open source tools to quickly develop NLP and Document Understanding application. Along with working on OCI AI Services to deliver to client needs.
1) Worked with client to analyze the functional requirements and migrate SOA integrations to OIC Gen2. Analyzed SOA composites for ERP and CRM based systems.

PythonTensorFlowPyTorchNLPDocument UnderstandingAI/ML Development