Skip to content

End-to-end Retrieval Augmented Generation (RAG) pipeline using Notion, Qdrant, Sentence Transformers, and Streamlit for interactive question answering on private Notion workspaces.

Notifications You must be signed in to change notification settings

Yashraj-Muthyapwar/NotionAtlas-AI-Semantic-Search-And-RAG-Assistant-for-Notion

Repository files navigation

🚀 NotionAtlas — AI Semantic Search & RAG Assistant for Notion

Transform your Notion workspace into an interactive, intelligent knowledge assistant featuring Retrieval-Augmented Generation (RAG), semantic search, and real-time answers from powerful AI models.


⚡️ Tech Stack

Streamlit Python Qdrant Llama Notion API Hugging Face


📖 Overview

NotionAtlas is an end-to-end AI toolkit that turns your Notion workspace into a smart, conversational, and context-aware knowledge base. It combines best-in-class semantic search with state-of-the-art LLMs (like Llama 4) to deliver accurate, grounded answers — all through a modern Streamlit interface.

  • End-to-End RAG Workflow: Automates extraction, chunking, embedding, retrieval, and generation over your Notion content.
  • AI-Powered Q&A: Seamlessly blends semantic search and large language models for instant, context-rich answers.
  • User-Friendly Interface: Engage with your Notion knowledge through an intuitive chat app, accessible from any browser.

✨ Features

  • 🍀 Automatic Notion Extraction: Pulls and structures content directly from Notion using its API.
  • 🔍 Semantic Search & Smart Chunking: Indexes your workspace into fine-grained, searchable chunks (flashcards, paragraphs, Q&A).
  • 🤖 RAG (Retrieval-Augmented Generation): Retrieves the most relevant Notion context and augments LLM answers for accuracy and trust.
  • 🧠 Real-Time Conversational Q&A: Ask anything, get instant, context-aware responses. Supports flashcard drill-down and deep search.
  • Production-Ready & Configurable: Modular design, secure API key handling, and one-click deploy.
  • 🛡️ Privacy First: Only relevant Notion snippets are sent to the AI for answering; your data stays safe.

🚀 Usage Instructions

1. Notion Data Extraction

python DatabaseExtractor.py

Extracts structured Notion content to a local JSON/text file.

2. Generate Embeddings & Upload to Qdrant

Run the notebook:

NotionRAG.ipynb

Creates sentence-level embeddings and stores them in Qdrant.

3. Launch Streamlit Demo

python StreamlitApp.py

Access your real-time chat interface with Qdrant-powered retrieval and LLaMA or OpenAI responses.

🛠 Dependencies

Install dependencies with:

pip install -r requirements.txt

🌐 Notes

  • Ensure your Notion API token and database/page IDs are properly configured.
  • Update Qdrant host, collection name, and API keys as needed.
  • LLaMA credentials required for chatbot responses.
  • Huggingface token for sentence-transformers
  • Update the credentials in your environment or config files before running.

Contributions welcome. Built with ❤️ for modern knowledge management and AI innovation.

About

End-to-end Retrieval Augmented Generation (RAG) pipeline using Notion, Qdrant, Sentence Transformers, and Streamlit for interactive question answering on private Notion workspaces.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published