An AI-powered hybrid search and labeling assistant that fuses Elastic Cloud BM25 + kNN retrieval with Google Vertex AI Gemini 2.0 reasoning — built to revolutionize enterprise knowledge search and evaluation.
## Table of Contents

- Overview
- Features
- Architecture
- Tech Stack
- Installation
- Environment Variables
- Running Locally
- Deployment
- Usage
- Project Structure
- Evaluation Metrics
- Future Upgrades
- Contributors
- License
## Overview

SearchSphere Agent is a full-stack AI search platform that integrates Elastic Cloud hybrid retrieval (BM25 + vector search) with Google Vertex AI Gemini 2.0 for contextual reasoning, evaluation, and dataset labeling.
It helps teams and enterprises find smarter, label faster, and evaluate efficiently — a complete foundation for AI-powered RAG systems.
## Features

- 🔍 Hybrid Search — Combines Elastic BM25 (lexical) + kNN (semantic) retrieval for deep understanding.
- 🤖 Gemini Reasoning — Uses Vertex AI Gemini-2.0-Flash for summaries and responses.
- 🧩 Label Assist — Create ground-truth JSONs interactively for model evaluation.
- 📊 Metrics Dashboard — Live precision@K and latency stats (p50/p95).
- 💬 Conversational Refinement — Natural chat-style interface for query reasoning.
- 🔐 Cloud Ready — Dockerized backend, deployed via Google Cloud Run + Netlify.
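As a sketch of the hybrid retrieval step: Elasticsearch accepts a BM25 `query` clause alongside a top-level `knn` clause in a single search request and combines the scores. The field names here (`content`, `text_vector`) are illustrative assumptions, not this repo's actual index mapping:

```python
# Illustrative Elasticsearch hybrid request body: a lexical BM25 "query"
# clause plus a semantic top-level "knn" clause in one request.
def hybrid_search_body(query_text, query_vector, k=10, num_candidates=120):
    return {
        "query": {"match": {"content": {"query": query_text}}},  # BM25 (lexical)
        "knn": {
            "field": "text_vector",            # dense-vector field (assumed name)
            "query_vector": query_vector,      # embedding of the query text
            "k": k,
            "num_candidates": num_candidates,  # mirrors ES_KNN_NUM_CANDIDATES
        },
        "size": k,
    }

body = hybrid_search_body("what is hybrid search?", [0.1] * 768)
```

The body would be passed to the Elasticsearch client's `search()` call; tuning `num_candidates` trades recall against latency on the kNN side.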
## Architecture

```
Frontend (Next.js 14, TypeScript, Tailwind)
        │
        ▼
Next.js API routes (proxy)
        │
        ▼
Backend (FastAPI)
 ├── Elastic Cloud (BM25 + kNN)
 ├── Vertex AI (Gemini + Embeddings)
 ├── Evaluation Engine
 └── Label Assist Service
```
## Tech Stack

| Layer | Technology |
|---|---|
| Frontend | Next.js 14, React 18, Tailwind CSS, Recharts |
| Backend | FastAPI (Python 3.11), Elastic Cloud, Vertex AI |
| Deployment | Google Cloud Run (backend), Netlify (frontend) |
| Database | Elastic Cloud index (searchsphere_docs) |
| CI/CD | GitHub Actions, Docker |
## Installation

Clone the repository:

```bash
git clone https://github.com/MrigankJaiswal-hub/SearchSphere-Agent.git
cd SearchSphere-Agent
```

Backend:

```bash
cd backend
python -m venv .venv
.venv\Scripts\activate          # Windows
# source .venv/bin/activate     # macOS/Linux
pip install -r requirements.txt
```

Frontend:

```bash
cd ../web
npm install
```

## Environment Variables

Backend:

```env
ELASTIC_CLOUD_ID=your_elastic_cloud_id
ELASTIC_API_KEY=your_elastic_api_key
ELASTIC_INDEX=searchsphere_docs
VERTEX_LOCATION=us-central1
GCP_PROJECT_ID=searchsphere-ai
VERTEX_EMBED_MODEL=text-embedding-005
VERTEX_CHAT_MODEL=gemini-2.0-flash-001
ES_KNN_NUM_CANDIDATES=120
CORS_ORIGIN=*
```

Frontend:

```env
NEXT_PUBLIC_API_BASE=http://127.0.0.1:8080
```

## Running Locally

Backend:

```bash
cd backend
uvicorn app:app --reload --port 8080
```

Frontend:

```bash
cd web
npm run dev
```

Visit 👉 http://localhost:3000
## Deployment

Backend (Google Cloud Run):

```bash
gcloud run deploy searchsphere-backend \
  --source . \
  --region us-central1 \
  --set-env-vars "CORS_ORIGIN=https://your-frontend-url.netlify.app"
```

Frontend (Netlify):

1. Import the `/web` directory from GitHub.
2. Add the environment variable `NEXT_PUBLIC_API_BASE=https://<your-cloudrun-url>`.
3. Deploy the site.
## Usage

1. Visit your deployed frontend.
2. Enter a natural-language query (e.g., “What is hybrid search?”).
3. Observe the fused semantic + lexical search results.
4. View real-time precision and latency metrics.
5. Go to Label Assist, input a query, and export `groundtruth.json`.
6. Upload `groundtruth.json` in Run Evaluation to compute P@K.
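The exact schema of `groundtruth.json` isn't documented here; a minimal sketch, assuming it maps each query to its relevant document IDs, together with the Precision@K computation it would feed:

```python
import json

# Hypothetical groundtruth.json shape: each query maps to relevant doc IDs.
groundtruth = json.loads('{"hybrid search": ["doc-3", "doc-7", "doc-9"]}')

def precision_at_k(retrieved_ids, relevant_ids, k=10):
    """Fraction of the top-k retrieved documents that are relevant."""
    top_k = retrieved_ids[:k]
    if not top_k:
        return 0.0
    relevant = set(relevant_ids)
    hits = sum(1 for doc_id in top_k if doc_id in relevant)
    return hits / len(top_k)

retrieved = ["doc-3", "doc-1", "doc-7", "doc-2"]  # system output, best first
score = precision_at_k(retrieved, groundtruth["hybrid search"], k=4)
# 2 of the top 4 are relevant -> 0.5
```

One convention note: this divides by the number of results actually returned (≤ k); some toolkits divide by k itself, which penalizes short result lists.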
## Project Structure

| Folder/File | Purpose | Status |
|---|---|---|
| `/backend` | FastAPI backend | ✅ |
| `/web` | Next.js frontend | ✅ |
| `/docs` | Documentation (README, credits, architecture, eval_matrix.xlsx) | ✅ |
| `/docs/credits.txt` | Acknowledgments | ✅ |
| `/docs/evaluation_matrix.xlsx` | Metrics template | ✅ |
| `/docs/diagram.png` | Architecture diagram (export from draw.io / Lucidchart) | ✅ |
| `.github/workflows/` | Optional CI/CD | Optional ✅ |
| `.env.example` | Example env vars | ✅ |
| `LICENSE` | MIT License | ✅ |
| `README.md` | Overview | ✅ |
Key frontend routes:

| Route | Purpose |
|---|---|
| `/` | Main chat & hybrid search page |
| `/metrics` | Live latency & precision dashboard |
| `/label` | Label Assist tool for dataset generation |

To test evaluation manually:

```bash
curl -X POST "$URL/api/eval/precision" \
  -H "Content-Type: application/json" \
  -d '{"query":"hybrid search","k":10}'
```

Directory layout:

```
SearchSphere-Agent/
├── backend/
│   ├── app.py
│   ├── routes/
│   ├── utils/
│   ├── tests/
│   └── requirements.txt
│
├── web/
│   ├── app/
│   ├── components/
│   ├── lib/
│   ├── public/
│   ├── package.json
│   └── tailwind.config.ts
│
├── scripts/
├── assets/
└── README.md
```
## Evaluation Metrics

- Precision@K: Evaluates top-K relevance.
- Latency Tracking: p50/p95 in milliseconds.
- Label Assist: Exports `groundtruth.json` for retraining.

Example output:

```
p50: 730 ms | p95: 1100 ms | Precision@10: 0.86
```
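The p50/p95 figures can be reproduced from raw per-request timings. A minimal nearest-rank sketch (the sample latencies are made up; production monitoring stacks often interpolate between ranks instead):

```python
import math

def percentile(samples, pct):
    """Nearest-rank percentile: smallest sample >= pct% of the data."""
    ordered = sorted(samples)
    if pct <= 0:
        return ordered[0]
    rank = math.ceil(pct / 100 * len(ordered))  # 1-based nearest rank
    return ordered[min(rank, len(ordered)) - 1]

latencies_ms = [640, 700, 730, 760, 810, 950, 1100, 1240]  # per-request timings
p50 = percentile(latencies_ms, 50)   # median latency
p95 = percentile(latencies_ms, 95)   # tail latency
```

Tracking p95 alongside p50 matters because a healthy median can hide a slow tail caused by cold starts or large kNN candidate pools.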
## Future Upgrades

- Multi-modal retrieval (text, images, audio, video).
- Auth & multi-tenant support (Firebase/Cognito).
- Feedback-driven fine-tuning of hybrid fusion weights.
- Enhanced real-time dashboards and analytics.
## Contributors

**Mrigank Jaiswal**, B.Tech
🖥️ Built the full-stack architecture, Elastic-Vertex integration, frontend UI, and deployment automation.
## License

This project is licensed under the MIT License. © 2025 Mrigank Jaiswal