Kshitij Sarve

Software Engineer

Mumbai, Maharashtra, India6 mos experience
AI EnabledAI ML Practitioner

Key Highlights

  • Expert in AI-driven backend systems and MLOps.
  • Proven track record in reducing processing costs and time.
  • Hands-on experience with FastAPI and cloud platforms.
Stackforce AI infers this person is a Backend-focused AI Engineer with expertise in SaaS and MLOps.

Contact

Skills

Core Skills

Backend DevelopmentMlopsAi Search

Other Skills

API DevelopmentAWS LambdaAgentic AIAgile Application DevelopmentAmazon BedrockAmazon EC2Amazon Relational Database Service (RDS)Amazon S3Amazon Web Services (AWS)Artificial Intelligence (AI)Automated TradingAzure DevOps ServicesBack-End Web DevelopmentBrowserStackCelery

About

I specialize in AI-driven backend systems, MLOps, and Generative AI to build scalable and intelligent solutions. With hands-on experience in FastAPI, Django, and cloud platforms like AWS & GCP, I develop and deploy AI-powered applications that optimize performance and enhance user experiences. 🔹 Key Skills: MLOps, LLM Integration, AI Search, Financial AI, Backend Development, NLP 🔹 Tech Stack: Python, FastAPI, TensorFlow, Lang Chain, MongoDB, Redis, Docker, Kubernetes 🔹 Projects: AI-powered chatbots, financial analytics platforms, and AI-driven recommendation systems Always open to exciting AI and backend projects—let’s connect and build the future of AI together! 🚀

Experience

6 mos
Total Experience
6 mos
Average Tenure
--
Current Experience

Inagiffy

Artificial Intelligence Engineer

Apr 2025 – Oct 2025 · 6 mos

  • Built and scaled multiple FastAPI endpoints for a video generation product, using FFmpeg to slash processing time from 5 minutes to just 30 seconds.
  • Reduced video processing costs by 83% (from $3 to $0.50 per video) by implementing and optimizing open-source video models.
  • Designed and implemented a robust backend for a Reddit ORM, which improved post-fetching accuracy by 3x and cut sentiment analysis costs from $2.50 to $0.004 per query.
  • Fine-tuned a Llama 3.1 (7B) model on a 500,000-post dataset to generate highly contextual, human-like recommendations.
Python (Programming Language)FastAPIAPI DevelopmentPostgreSQLRetrieval-Augmented Generation (RAG)Agentic AI+7

Arkham archives

AI Intern

Mar 2025 – Apr 2025 · 1 mo · Remote

  • Developed an AI-powered search engine integrating LLMs and AI agents to generate highly relevant search results.
  • Built and optimized a FastAPI backend to enhance search capabilities and query response efficiency.
  • Implemented retrieval-augmented generation (RAG) to improve information accuracy and response relevance.
  • Engineered scalable AI-driven tools to streamline data retrieval and decision-making processes.
Python (Programming Language)Agentic AIGoogle APIGenerative AIFastAPIDeepseek+5

Quikscribe.in

Freelance Software Engineer

Sep 2024 – Mar 2025 · 6 mos · Remote

  • • Developed and optimized FastAPI APIs to power LLM features for meeting transcriptions, boosting model accuracy from 75% to 95%.
Python (Programming Language)API DevelopmentFastAPIPostgreSQLGenerative AIRetrieval-Augmented Generation (RAG)+2

Aldrich research services

Artificial Intelligence Engineer

Feb 2024 – May 2024 · 3 mos · On-site

  • Developed an AI-powered Candidate Recommendation System that parsed resumes and provided automated recommendations, reducing screening time for the HR team by 50%.
  • Created a Retrieval-Augmented Generation (RAG) chatbot using OpenAI, Groq, and Pinecone to allow the finance team to instantly query internal company documents.
Python (Programming Language)StreamlitMySQLOpenAI APIRetrieval-Augmented Generation (RAG)AI Search+1

Suven consultants and technology pvt.ltd.

Python Developer

Aug 2023 – Oct 2023 · 2 mos · Aurangabad, Maharashtra, India · Remote

  • Engineered a sophisticated web application facilitating seamless loan application processes. Achieved a 25% reduction in application processing time, enhancing overall efficiency and user experience.
  • Enhanced efficiency, reducing processing time by 30%.
FlaskPython (Programming Language)Pandas (Software)DjangoBackend Development

Forage

Analysis Specialist

Jan 2023 – Feb 2023 · 1 mo · Remote

  • Leveraged advanced techniques to boost efficiency by 20% in extracting meaningful patterns for strategic decision-making.
  • Created compelling presentations, improving stakeholder understanding by 30% through data-driven insights and impactful visuals.

Education

Narayana Junior College - India

12th — Science

Maharishi Vidya Mandir Senior Secondary School

10th — Science

Deogiri Institute of Engineering and Management Studies

B.Tech in Computer Science & Engineering (AI & Machine Learning)

Jan 2020 – Jan 2024

Stackforce found 100+ more professionals with Backend Development & Mlops

Explore similar profiles based on matching skills and experience