Popular repositories
- speculative-rag (Public)
  A production-ready Retrieval-Augmented Generation (RAG) system with Speculative Decoding powered by Groq API. Get faster, more accurate responses without downloading massive language models!
  Python 2
- rag-chatbot-groq (Public)
  A Retrieval-Augmented Generation (RAG) based chatbot that reads and understands PDF documents. It uses the Groq API for lightning-fast LLM inference and Chroma as the vector database for context re…
  Python
- jwt-tokens (Public)
  This document provides a comprehensive explanation of JWT (JSON Web Token) authentication, breaking down how tokens work, the structure of JWTs, and detailed walkthroughs of each component in a Fas…
  Python
- spectral-rag (Public)
  This repo demonstrates a new Retrieval-Augmented Generation technique that improves retrieval quality by using spectral graph theory.
  Python
- memory-management-system (Public)
  A streamlined Memory Management System for conversational AI. It analyzes user history, generates concise context notes, rewrites queries for clarity, and classifies user intent with confidence sco…
  Python
- rag-latency-optimization (Public)
  A high-performance Speculative RAG pipeline designed to reduce latency by combining fast draft generation and accurate verification using Groq Llama models, local HuggingFace embeddings, ChromaDB v…
  Python