# 🧠 Know et al. — Local Knowledge Base Chatbot
## Motivation
Large language models are remarkable generalists — trained on massive corpora of internet-scale data. But they tend to hallucinate or "guess" when asked questions outside the bounds of their general training. This makes them unreliable for domain-specific tasks or queries tied to precise internal knowledge.
Know et al. is a lightweight, local-first chatbot that demonstrates:
- How to restrict a language model to a specific knowledge base (Retrieval-Augmented Generation).
- The ability to run on your own hardware without sending data to the cloud.
- A foundation for plugging in local language models, respecting privacy and control.
## Architecture Overview
```
Know et al/
├── main_gradio.py     # Main UI + knowledge base loading
├── chonkit.py         # Chunking logic
├── embedding.py       # Sentence-transformer model loader
├── index_chunks.py    # Indexing and embedding ingestion
├── query_me.py        # Semantic search + Mistral formatting
├── sync_docs.py       # Track updated PDFs
├── save_cache.py      # Save/load cached embeddings
├── SearchPdfs.py      # PDF text extraction
├── chroma_store/      # Directory for saved vector DBs (one per KB)
├── requirements.txt
├── Dockerfile
└── .gitignore
```
## How It Works
### Knowledge Base Setup

- Users can load an existing knowledge base (KB) or create a new one.
- On creation:
  - PDFs are loaded from a selected folder.
  - Text is chunked and embedded.
  - Embeddings are stored in a dedicated ChromaDB instance per KB.
  - Hashes of the PDFs are tracked to avoid recomputation (see the sketch below).
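A minimal sketch of this ingestion path. The helpers `chunk_text` and `file_sha256` below are illustrative stand-ins (the real logic lives in `chonkit.py`, `index_chunks.py`, and `sync_docs.py`), and the embedding model name is an assumption:

```python
import hashlib
from pathlib import Path

import chromadb
from pypdf import PdfReader
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model

def file_sha256(path: Path) -> str:
    """Hash a PDF; change detection can skip files whose hash is unchanged."""
    return hashlib.sha256(path.read_bytes()).hexdigest()

def chunk_text(text: str, size: int = 500) -> list[str]:
    """Naive fixed-size chunking; stand-in for whatever chonkit.py does."""
    return [text[i:i + size] for i in range(0, len(text), size)]

def ingest_kb(pdf_dir: str, kb_name: str) -> None:
    """Embed every PDF in pdf_dir into a dedicated per-KB ChromaDB collection."""
    client = chromadb.PersistentClient(path=f"chroma_store/{kb_name}")
    collection = client.get_or_create_collection(kb_name)
    for pdf in Path(pdf_dir).glob("*.pdf"):
        text = "".join(page.extract_text() or "" for page in PdfReader(pdf).pages)
        chunks = chunk_text(text)
        collection.add(
            ids=[f"{file_sha256(pdf)}-{i}" for i in range(len(chunks))],
            documents=chunks,
            embeddings=model.encode(chunks).tolist(),
        )
```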
### Chat Interface

- Powered by Gradio's `ChatInterface`.
- When a user asks a question:
  - Relevant chunks are retrieved using semantic similarity.
  - The question and the retrieved results are passed to a `generate_response_mistral()` function.
  - The answer is streamed back to the user (see the sketch below).
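A minimal sketch of how that wiring could look on the Gradio side, assuming `generate_response_mistral()` yields partial answer text (its real signature in `query_me.py` may differ, and `retrieve_chunks` here is a stand-in):

```python
import gradio as gr

def retrieve_chunks(question: str, k: int = 5) -> list[str]:
    """Stand-in for the Chroma semantic search (see the RAG sketch below)."""
    return ["...retrieved chunk..."] * k

def generate_response_mistral(question: str, context: str):
    """Stand-in for the model call, assumed to yield the answer in pieces."""
    yield f"Answer to {question!r}, grounded in {len(context)} chars of context."

def chat_fn(message, history):
    context = "\n\n".join(retrieve_chunks(message))
    partial = ""
    for token in generate_response_mistral(message, context):
        partial += token
        yield partial  # yielding growing strings is how Gradio streams replies

demo = gr.ChatInterface(fn=chat_fn, title="Know et al.")
demo.launch(server_port=7860)
```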
### RAG Loop Summary

User query → retrieve top-k matching chunks from Chroma → compose a prompt from the query + retrieved docs → local/OpenAI model generates the answer.
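Concretely, the retrieval step of that loop might look like this, reusing the persisted per-KB store from the ingestion sketch above (the real `query_me.py` may differ):

```python
import chromadb
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # must match the ingest model

def retrieve_chunks(question: str, kb_name: str, k: int = 5) -> list[str]:
    """Return the top-k chunks most semantically similar to the question."""
    client = chromadb.PersistentClient(path=f"chroma_store/{kb_name}")
    collection = client.get_or_create_collection(kb_name)
    results = collection.query(
        query_embeddings=model.encode([question]).tolist(),
        n_results=k,
    )
    return results["documents"][0]  # documents for the first (and only) query
```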
### Dockerized Deployment

To build and run the app in Docker:

```bash
docker build -t know-et-al .
docker run -p 7860:7860 know-et-al
```

Ensure PDFs are accessible within the container if needed (e.g., via bind mounts).
## Current Features

- Multiple KB support via a dropdown
- Per-KB local Chroma vector DBs
- Automatic PDF sync and change detection
- Embedding caching for performance
- OpenAI-compatible chat formatting ("messages")
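For reference, a minimal sketch of that OpenAI-style "messages" structure; the system prompt shown here is illustrative, not the project's actual wording:

```python
def build_messages(question: str, context: str) -> list[dict]:
    """Compose an OpenAI-compatible message list from the retrieved context."""
    return [
        {"role": "system",
         "content": "Answer using only the provided context. "
                    "If the context is insufficient, say so."},
        {"role": "user",
         "content": f"Context:\n{context}\n\nQuestion: {question}"},
    ]
```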
## Coming Soon

- Chat history persistence
- Tabbed UI per KB (like the ChatGPT sidebar)
- FastAPI interface for programmatic access
- Integration with truly local models (e.g., Ollama, GGUF)
## Local-first, Privacy-respecting
This project is built around the principle that your knowledge should remain your own.
By enabling local vector search and retrieval, and by giving you control over exactly what the model sees, Know et al. makes LLMs safer, more focused, and far less prone to hallucination.
No more guessing. Just answers from your docs.
## 👨‍💻 Author
Built by @prashanth-prakash — explore, fork, contribute!