InquireAI - Intelligent Document Chatbot

An AI-powered chatbot that processes documents, syncs with Notion, crawls websites, and answers questions from your knowledge base using RAG (Retrieval-Augmented Generation).

🚀 Features

📄 Document Processing: Upload and process PDF, DOCX, and text files
🔗 Notion Integration: Sync with Notion workspaces automatically
🕷️ Website Crawling: Extract content from websites and sitemaps
💬 Intelligent Chat: AI-powered responses using your document knowledge base
🔍 Vector Search: Advanced semantic search using Qdrant vector database
⚡ Real-time Processing: Background job processing with Redis queues
🔐 Secure Authentication: JWT-based user authentication
📊 Multi-tenant: Support for multiple users with data isolation

🛠️ Tech Stack

Backend

Node.js with Express.js
MongoDB for document and user data storage
Qdrant vector database for semantic search
Redis for job queues and caching
Bull for background job processing
OpenAI/Gemini for AI responses
JWT for authentication

Frontend

React with modern hooks
Tailwind CSS for styling
Socket.io for real-time chat
Axios for API communication

📋 Prerequisites

Node.js (v16 or higher)
MongoDB
Redis
Qdrant vector database
OpenAI API key or Google Gemini API key

🔧 Installation

Clone the repository

```
git clone https://github.com/4mJ0k3r/InquireAI.git
cd InquireAI/chatbot
```

Install dependencies

```
npm install
cd apps/backend && npm install
cd ../frontend && npm install
```

Set up environment variables

Copy the example environment file:

```
cp apps/backend/.env.example apps/backend/.env
```

Configure your .env file with:

```
# Database
MONGODB_URI=mongodb://localhost:27017/inquireai
REDIS_URL=redis://localhost:6379
QDRANT_URL=http://localhost:6333

# AI Services
OPENAI_API_KEY=your_openai_api_key
GEMINI_API_KEY=your_gemini_api_key

# Authentication
JWT_SECRET=your_jwt_secret_key

# Server
PORT=4000
NODE_ENV=development
```
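
To catch a missing variable before the server starts, the values above could be validated at boot. This is a minimal sketch, not the project's actual startup code: `loadConfig` and the returned field names are hypothetical, and the rule that at least one AI key must be present is inferred from the prerequisites.

```javascript
// Hypothetical helper: validates the variables from the .env file above.
const REQUIRED_KEYS = ['MONGODB_URI', 'REDIS_URL', 'QDRANT_URL', 'JWT_SECRET'];

function loadConfig(env = process.env) {
  const missing = REQUIRED_KEYS.filter((key) => !env[key]);
  if (missing.length > 0) {
    throw new Error(`Missing environment variables: ${missing.join(', ')}`);
  }
  // Assumption: one AI provider key is enough (OpenAI or Gemini).
  if (!env.OPENAI_API_KEY && !env.GEMINI_API_KEY) {
    throw new Error('Set OPENAI_API_KEY or GEMINI_API_KEY');
  }
  const port = Number(env.PORT || 4000);
  if (!Number.isInteger(port) || port <= 0) {
    throw new Error(`Invalid PORT: ${env.PORT}`);
  }
  return {
    mongoUri: env.MONGODB_URI,
    redisUrl: env.REDIS_URL,
    qdrantUrl: env.QDRANT_URL,
    jwtSecret: env.JWT_SECRET,
    openaiApiKey: env.OPENAI_API_KEY || null,
    geminiApiKey: env.GEMINI_API_KEY || null,
    port,
    nodeEnv: env.NODE_ENV || 'development',
  };
}
```

Passing the environment in as an argument keeps the check easy to exercise without touching `process.env`.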

Start the services

Make sure you have the required services running:
- MongoDB
- Redis
- Qdrant
You can use Docker for easy setup:
```
# Start MongoDB
docker run -p 27017:27017 mongo

# Start Qdrant
docker run -p 6333:6333 qdrant/qdrant

# Start Redis
docker run -p 6379:6379 redis:alpine
```

Run the application

```
# Start backend (from chatbot directory)
npm run dev

# Start frontend (in another terminal)
cd apps/frontend
npm start
```

🎯 Usage

1. Authentication

Register a new account or log in
JWTs are used for secure API access
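
For readers unfamiliar with how a JWT protects API access, here is a sketch of HS256 signing and verification using only Node's built-in `crypto` module. It is an illustration of the token format, not the project's code: the backend may well use a library instead, and `signToken` and `verifyToken` are hypothetical names.

```javascript
const crypto = require('crypto');

// Encode a string or Buffer as base64url, the encoding JWT segments use.
const b64url = (input) => Buffer.from(input).toString('base64url');

// HMAC-SHA256 over "header.payload", returned as a raw Buffer.
const hmac = (data, secret) =>
  crypto.createHmac('sha256', secret).update(data).digest();

function signToken(payload, secret, ttlSeconds = 3600, now = Date.now()) {
  const iat = Math.floor(now / 1000);
  const header = b64url(JSON.stringify({ alg: 'HS256', typ: 'JWT' }));
  const body = b64url(JSON.stringify({ ...payload, iat, exp: iat + ttlSeconds }));
  const signature = b64url(hmac(`${header}.${body}`, secret));
  return `${header}.${body}.${signature}`;
}

// Returns the claims when the signature matches and the token has not expired,
// otherwise null.
function verifyToken(token, secret, now = Date.now()) {
  const parts = token.split('.');
  if (parts.length !== 3) return null;
  const [header, body, signature] = parts;
  const expected = hmac(`${header}.${body}`, secret);
  const given = Buffer.from(signature, 'base64url');
  if (given.length !== expected.length || !crypto.timingSafeEqual(given, expected)) {
    return null;
  }
  const claims = JSON.parse(Buffer.from(body, 'base64url').toString('utf8'));
  if (claims.exp <= Math.floor(now / 1000)) return null;
  return claims;
}
```

The signing secret would come from `JWT_SECRET`; the comparison uses `crypto.timingSafeEqual` so that verification time does not leak how much of a forged signature was correct.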

2. Document Upload

Upload PDF, DOCX, or text files
Files are automatically processed and embedded into the vector database
Concurrent uploads are supported with intelligent job queue management
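
Before a file is queued for processing, its type can be checked against the supported formats. The sketch below assumes the check is done by file extension and that "text files" means `.txt`; `classifyUpload` is a hypothetical helper, and the real upload route may inspect MIME types or file contents instead.

```javascript
const path = require('path');

// Assumption: extensions standing in for "PDF, DOCX, or text files".
const SUPPORTED_TYPES = new Map([
  ['.pdf', 'application/pdf'],
  ['.docx', 'application/vnd.openxmlformats-officedocument.wordprocessingml.document'],
  ['.txt', 'text/plain'],
]);

function classifyUpload(filename) {
  const ext = path.extname(filename).toLowerCase();
  const mimeType = SUPPORTED_TYPES.get(ext);
  if (!mimeType) {
    return { ok: false, ext, reason: `Unsupported file type: ${ext || '(none)'}` };
  }
  return { ok: true, ext, mimeType };
}
```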

3. Notion Integration

Connect your Notion workspace
Automatic syncing every 2 hours
Manual sync available
Pages are processed and made searchable
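
The two-hour sync interval can be expressed as a simple due check. This sketch only shows the timing arithmetic; `isSyncDue` and `nextSyncAt` are hypothetical names, and the actual scheduling presumably runs through the job queue rather than a helper like this.

```javascript
// "Automatic syncing every 2 hours", in milliseconds.
const SYNC_INTERVAL_MS = 2 * 60 * 60 * 1000;

// True when a workspace has never synced or its last sync is at least 2 hours old.
function isSyncDue(lastSyncedAt, now = new Date()) {
  if (!lastSyncedAt) return true;
  return now.getTime() - new Date(lastSyncedAt).getTime() >= SYNC_INTERVAL_MS;
}

// The earliest time the next automatic sync should run.
function nextSyncAt(lastSyncedAt) {
  return new Date(new Date(lastSyncedAt).getTime() + SYNC_INTERVAL_MS);
}
```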

4. Website Crawling

Enter a website URL to crawl
Automatic sitemap detection
Content extraction and processing
Configurable crawl limits
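
Sitemap detection and crawl limits can be sketched as follows. The example assumes the sitemap lives at the conventional `/sitemap.xml` path and uses a regular expression rather than a full XML parser; `sitemapUrlFor`, `extractSitemapUrls`, and the `limit` and `origin` options are hypothetical, not the crawler's real interface.

```javascript
// Assumption: look for the sitemap at the site root.
function sitemapUrlFor(siteUrl) {
  return new URL('/sitemap.xml', siteUrl).href;
}

// Returns the origin of a URL, or null when the URL cannot be parsed.
function originOf(url) {
  try {
    return new URL(url).origin;
  } catch {
    return null;
  }
}

// Collects unique <loc> entries, optionally restricted to one origin,
// and stops once `limit` URLs have been gathered.
function extractSitemapUrls(xml, { limit = 50, origin } = {}) {
  const urls = [];
  const seen = new Set();
  const locPattern = /<loc>\s*([^<\s]+)\s*<\/loc>/gi;
  for (const match of xml.matchAll(locPattern)) {
    const url = match[1].replace(/&amp;/g, '&');
    const urlOrigin = originOf(url);
    if (urlOrigin === null) continue; // skip malformed entries
    if (origin && urlOrigin !== origin) continue; // stay on the requested site
    if (seen.has(url)) continue;
    seen.add(url);
    urls.push(url);
    if (urls.length >= limit) break;
  }
  return urls;
}
```

Restricting results to the starting origin keeps a crawl from wandering onto third-party sites listed in a sitemap.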

5. Chat Interface

Ask questions about your documents
Real-time streaming responses
Source citations included
Context-aware answers using RAG
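
A rough picture of how retrieved chunks turn into a cited answer: the best-scoring chunks are numbered, placed in the prompt, and the same numbering is returned as the citation list. The prompt wording, the chunk shape (`text`, `source`, `score`), and `buildRagPrompt` are all assumptions made for illustration.

```javascript
// Hypothetical prompt builder: picks the top chunks and numbers them for citation.
function buildRagPrompt(question, chunks, maxChunks = 4) {
  const selected = [...chunks]
    .sort((a, b) => b.score - a.score) // highest similarity first
    .slice(0, maxChunks);

  const context = selected
    .map((chunk, i) => `[${i + 1}] (${chunk.source}) ${chunk.text}`)
    .join('\n');

  const prompt = [
    'Answer the question using only the numbered context.',
    'Cite sources with their bracketed numbers.',
    '',
    'Context:',
    context,
    '',
    `Question: ${question}`,
  ].join('\n');

  const citations = selected.map((chunk, i) => ({ index: i + 1, source: chunk.source }));
  return { prompt, citations };
}
```

The returned `prompt` would be sent to the configured model, and `citations` lets the UI map a `[1]` in the answer back to its source document.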

🏗️ Architecture

Job Queue System

The application runs background work through Bull job queues backed by Redis, with a dedicated worker for each job type:

Upload Worker: Processes file uploads and text extraction
Chat Worker: Handles AI chat responses with streaming
Notion Worker: Syncs Notion workspace content
Crawler Worker: Processes website crawling jobs
Google Docs Worker: Handles Google Docs imports
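
The worker model can be illustrated with a small in-memory stand-in. This is deliberately not Bull's API: `createQueue` is a hypothetical helper that processes jobs one at a time in FIFO order, and the queue names and handlers are placeholders for the workers listed above.

```javascript
// In-memory stand-in for a job queue: one handler per queue, jobs run in order.
function createQueue(name, handler) {
  let tail = Promise.resolve();
  let nextId = 1;
  return {
    name,
    // Enqueue a job and get a promise for its result.
    add(data) {
      const job = { id: nextId++, queue: name, data };
      const result = tail.then(() => handler(job));
      tail = result.catch(() => {}); // a failed job must not block later ones
      return result;
    },
  };
}

// Placeholder handlers, one per worker type (names are assumptions).
const queues = {
  upload: createQueue('upload', async (job) => `extracted text from ${job.data.filename}`),
  crawl: createQueue('crawl', async (job) => `crawled ${job.data.url}`),
};
```

For example, `queues.upload.add({ filename: 'report.pdf' })` resolves once the upload handler has finished that job, independently of whatever the crawl queue is doing.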

Garbage Job Management

A unique feature that ensures job queue stability during concurrent operations:

Garbage jobs are inserted at even positions to maintain odd positioning for real jobs
Prevents race conditions during high-concurrency uploads
Designed so that every file in a concurrent batch is processed successfully
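
The positioning rule can be pictured as interleaving placeholder jobs with real ones. This sketch shows only the ordering, counting positions from 1; it does not reproduce the mechanism by which the project avoids race conditions, which is not described here. `interleaveGarbageJobs` and the `garbage` flag are hypothetical.

```javascript
// Places each real job at an odd position (1, 3, 5, ...) and follows it
// with a placeholder "garbage" job at the next even position.
function interleaveGarbageJobs(realJobs) {
  const queue = [];
  realJobs.forEach((job, i) => {
    queue.push({ ...job, garbage: false });
    queue.push({ id: `garbage-${i + 1}`, garbage: true });
  });
  return queue;
}

// A worker would skip placeholders and process only the real jobs.
function realJobsOnly(queue) {
  return queue.filter((job) => !job.garbage);
}
```

Because every real job is followed by a placeholder, the next job appended to the queue also lands on an odd position.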

Vector Search Pipeline

Text Extraction: PDF/DOCX → Raw text
Chunking: Split into semantic chunks
Embedding: Generate vector embeddings
Storage: Store in Qdrant with metadata
Search: Semantic similarity search for chat responses
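
The chunking and search steps can be sketched in plain JavaScript. Two simplifications are worth naming: the chunker below uses fixed-size character windows with overlap rather than semantic boundaries, and the search is an in-memory cosine-similarity ranking standing in for what Qdrant does. Function names, chunk sizes, and the point shape are assumptions.

```javascript
// Fixed-size sliding window; a simplification of "semantic chunks".
function chunkText(text, size = 500, overlap = 50) {
  if (overlap >= size) throw new Error('overlap must be smaller than size');
  const chunks = [];
  for (let start = 0; start < text.length; start += size - overlap) {
    chunks.push(text.slice(start, start + size));
    if (start + size >= text.length) break; // last window reached the end
  }
  return chunks;
}

// Cosine similarity between two equal-length vectors.
function cosineSimilarity(a, b) {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Ranks stored points against a query vector and keeps the top K.
function search(queryVector, points, topK = 3) {
  return points
    .map((point) => ({ ...point, score: cosineSimilarity(queryVector, point.vector) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, topK);
}
```

The overlap between neighboring chunks reduces the chance that a sentence split across a boundary is lost to retrieval.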

📁 Project Structure

```
chatbot/
├── apps/
│   ├── backend/
│   │   ├── src/
│   │   │   ├── models/          # MongoDB schemas
│   │   │   ├── routes/          # API endpoints
│   │   │   ├── services/        # Business logic
│   │   │   ├── workers/         # Background job processors
│   │   │   ├── middlewares/     # Express middlewares
│   │   │   └── utils/           # Utility functions
│   │   ├── uploads/             # File upload directory
│   │   └── server.js            # Express server
│   └── frontend/
│       ├── src/
│       │   ├── components/      # React components
│       │   ├── pages/           # Page components
│       │   ├── services/        # API services
│       │   └── utils/           # Utility functions
│       └── public/              # Static assets
├── package.json                 # Root package.json
└── README.md                    # This file
```

🔌 API Endpoints

Authentication

POST /auth/register - Register new user
POST /auth/login - User login
POST /auth/logout - User logout

Documents

POST /docs/upload - Upload files
GET /docs - List user documents
DELETE /docs/:id - Delete document
GET /docs/search - Search documents

Sources

POST /sources/notion/connect - Connect Notion
POST /sources/notion/sync - Manual Notion sync
POST /sources/crawl - Crawl website
GET /sources - List connected sources

Chat

POST /chat - Start new chat
GET /chat/stream/:chatId - Stream chat responses
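
A client call against these endpoints might be assembled as below. The sketch assumes the JWT is sent as a `Bearer` token in the `Authorization` header and that request bodies are JSON; neither is confirmed here, and `buildRequest` and the `question` field are hypothetical.

```javascript
// Builds the URL and fetch options for one API call.
function buildRequest(baseUrl, method, path, { token, body } = {}) {
  const headers = { Accept: 'application/json' };
  if (token) {
    headers.Authorization = `Bearer ${token}`; // assumption: Bearer scheme
  }
  const init = { method, headers };
  if (body !== undefined) {
    headers['Content-Type'] = 'application/json';
    init.body = JSON.stringify(body);
  }
  return { url: new URL(path, baseUrl).href, init };
}

// Example: prepare a chat request against a local backend on PORT=4000.
const chatRequest = buildRequest('http://localhost:4000', 'POST', '/chat', {
  token: 'your_jwt_here',
  body: { question: 'What does the onboarding guide say about VPN access?' },
});
```

The result can be passed straight to `fetch(chatRequest.url, chatRequest.init)` or adapted to Axios.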

🚀 Deployment

Using Vercel (Recommended)

Connect your GitHub repository to Vercel
Configure environment variables in Vercel dashboard
Deploy automatically on push to main branch

Using Docker

```
# Build and run with Docker Compose
docker-compose up --build
```

Manual Deployment

Set up production environment variables
Build the frontend: npm run build
Start the backend: npm start
Configure reverse proxy (nginx/Apache)

🤝 Contributing

Fork the repository
Create a feature branch: git checkout -b feature/amazing-feature
Commit your changes: git commit -m 'Add amazing feature'
Push to the branch: git push origin feature/amazing-feature
Open a Pull Request

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

OpenAI for GPT models
Google for Gemini AI
Qdrant for vector database
The open-source community for amazing tools and libraries

📞 Support

If you encounter any issues or have questions:

Check the Issues page
Create a new issue with detailed description
Join our community discussions

Built with ❤️ for the AI Agent Hackathon
