-
Learning Steps - Akash / Srijan - Take Vihang’s Help to Find Best Courses
- Extraction Courses - 2
- Chunking Courses - 2
- Fine Tuning Models - 2
-
Breaking Down Question - Srijan - 3
- Understand what questions and understand what are keywords in questions
- Determine Dynamic Chunk size based on query
- Fetch Number of Chunks Dynamically based on Keywords Found
-
Retrieval - Srijan - 4
- Find Best 3 Documents that Can Answer the Question
- Find Best Pages Which May Have the Answer - 3 Pages From Each Document
- Find Best Chunks Based on Keywords - Expand Context if Needed
- Find Best Chunks Based on Suggested Questions - Expand Context If Needed
-
Extractions - Akash - 4
- Multimodal Extractions of Pages - Try Extraction in a structured way
- Try LLMShrepa For Extraction - Test Quality of Extraction
- Heading and Sub Headings and Maping Paragraphs with it.
- Maping Paragraphs with keywords
-
Augmentation - Srijan - 2
- Trying Nvidia Embed Model for Task Based Ranking of Information
- Trying Task Based Models for Reranking Content or By Passing through round of LLM
-
Generation - Akash - 2
- Generate Answer Based on Page from Multimodal (Try Multi Page Generation)
- Use a LLM to Combine and generate a Final Answer
-
Building Testing Setup - Srijan
-
Find 20 Documents of Different Kinds
-
100 Questions
-
Make a clean new Environment and Populate it with it.
-
Steps →
- Creating Clean Environment and Setup of Test Data - Srijan
- Query Understanding - Srijan
- Trying Retrieval Using Suggested Questions - Srijan
- Trying Retrieval Using Keywords - Srijan
- Dynamic Chunking and Dynamic Chunk Size Retrieval Based on Query - Srijan
- Multimodal Extraction - Akash
- LLM Sherpa Extraction - Akash
- Multimodal Answering - Akash
- Trying Task Based Embeddings Models - Srijan