Samyak Jain

Software Engineer

Haryana, India · 2 yrs 10 mos experience
AI ML Practitioner · AI Enabled

Key Highlights

  • Expert in Information Retrieval and Generative AI.
  • Achieved 90% accuracy in answer generation.
  • Developed innovative features for legal tech applications.
Stackforce AI infers this person is a SaaS-focused Software Engineer with expertise in Information Retrieval and Generative AI.

Skills

Core Skills

Information Retrieval, Generative AI, Python (Programming Language), Front-End Development

Other Skills

Document Chunking, Data Structuring, Search Mechanisms, RAG Pipeline, GPT-4, File Upload Feature, Text Highlighting, Retrieval Augmented Generation (RAG), Back-End Web Development, Redux, Node.js, React.js, Selenium WebDriver, MongoDB, HTML5

About

Passionate about making apps and tools that are fun to use and make people think "woahhh". Not fixated on one technology, and always excited to change the tech stack to fit the needs of the project rather than looking for projects that fit me. Currently working with Information Retrieval.
GitHub - https://github.com/samyak112
My Blogs - https://dev.to/samyak112

Experience

Total Experience: 2 yrs 10 mos
Average Tenure: 2 yrs 10 mos
Current Experience: 2 yrs 10 mos

Legalgraph

2 roles

Software Engineer II

Jun 2024 - Present · 1 yr 11 mos · Remote

  • Responsible for designing and maintaining the company’s information retrieval (IR) pipeline, along with downstream Generative AI (GenAI) workflows.
  • Contributed to key improvements in the IR pipeline, including:
      • Developing optimal document chunking strategies tailored to different data types.
      • Structuring data for efficient indexing and retrieval.
      • Enhancing search mechanisms to improve precision and recall.
Information Retrieval, Generative AI, Document Chunking, Data Structuring, Search Mechanisms

Software Engineer

Jul 2023 - Jun 2024 · 11 mos · Remote

  • Owned answer generation on the backend, achieving 90 percent accuracy in answers with location indicators. The location feature was implemented without relying solely on ChatGPT, which helped in getting our first customer.
  • Enhanced GPT-4’s capabilities with a custom RAG pipeline, built without LangChain and using FAISS for embedding search. This method allowed for handling long legal texts (e.g., 200-page contracts) by sending only relevant chunks to GPT-4.
  • Implemented a comprehensive feature that allows users to upload files and accurately identify contract types without relying on ChatGPT. This solution surpasses basic regex matching, effectively recognizing various contract types. By avoiding ChatGPT, costs were minimized to nearly zero, and processing speed was enhanced, as everything runs locally on the CPU without requiring API calls.
  • Solely responsible for developing the front end from scratch.
  • Implemented a feature to highlight text in PDFs. Used React PDF as a base and added custom highlighting algorithms built on a custom Boyer-Moore search. Built this feature from scratch due to React PDF’s limitations, such as the inability to highlight multiple lines and lag during searches.
Python (Programming Language), Front-End Development, RAG Pipeline, GPT-4, File Upload Feature, Text Highlighting
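
To illustrate the kind of search the text highlighting bullet refers to, here is a minimal Python sketch of Boyer-Moore-Horspool, a simplified Boyer-Moore variant that uses only the bad-character rule. It is not the implementation described above, which ran in the React front end; the function name `find_all` is hypothetical and chosen only for this example.

```python
def find_all(text: str, pattern: str) -> list[int]:
    """Return the start offset of every occurrence of pattern in text.

    Simplified Boyer-Moore (Horspool variant): only the bad-character
    shift table is used, not the good-suffix rule.
    """
    m, n = len(pattern), len(text)
    if m == 0 or m > n:
        return []

    # For each character in the pattern (except the last position), store
    # the distance from its last occurrence to the end of the pattern.
    shift = {ch: m - 1 - i for i, ch in enumerate(pattern[:-1])}

    hits = []
    pos = 0
    while pos <= n - m:
        # Compare the pattern against the current window, right to left.
        j = m - 1
        while j >= 0 and text[pos + j] == pattern[j]:
            j -= 1
        if j < 0:
            hits.append(pos)
        # Slide the window based on the text character aligned with the
        # pattern's last position; unknown characters skip a full length.
        pos += shift.get(text[pos + m - 1], m)
    return hits


if __name__ == "__main__":
    print(find_all("abracadabra", "abra"))
```

Returning every offset, rather than only the first, is what a highlighter would need to mark all matches on a page; mapping offsets back to PDF text spans is outside the scope of this sketch.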

Education

Amity University, Noida

Master of Computer Applications - MCA — Computer Science

Aug 2024 - Jul 2026

AMITY University Gurgaon

BCA — Computer Science

Jan 2020 - Jan 2023
