Conversational AI for E-commerce Support
- ✅ Order status and review lookup (by order ID)
- ✅ Comprehensive order details for any order
- ✅ FAQ semantic search (vector DB)
- ✅ Refund/payment check
- ✅ Easily extensible agent workflow (LangGraph/LangChain + OpenAI)
- ✅ SQLAlchemy ORM, Chroma vector DB, HuggingFace embeddings
- Natural language support for order queries and FAQ
- Semantic search: users get best-match answers from your FAQ corpus
- Integrated SQL database from Olist public dataset
- Vector similarity search with Chroma
- Production-ready modular Python codebase
```
ecommerce_agent/
├── agent/
│   ├── state.py
│   └── workflow.py
├── db/
│   ├── db_setup.py
│   └── models.py
├── tools/
│   └── business_tools.py
├── vectorstore/
│   └── faq_vectorstore.py
├── llm/
│   └── llm.py
├── data/
│   └── ... (CSV files: see below)
├── deploy/
│   └── conda/
│       └── env.yml
├── main.py
├── config.py
├── requirements.txt
├── environment.yml
├── gradio_app.py
├── docker-compose.yml
├── Dockerfile
├── LICENSE
├── README.md
├── .env
├── agent_interactions.log
└── faq_vectorstore/
```
Clone the repository and set up the environment:

```bash
git clone <your-repo-url>
cd ecommerce-llm-agent
conda env create -f deploy/conda/env.yml
conda activate ecommerce_agent

# Alternatively, use pip:
# pip install -r requirements.txt
```
Place all Olist CSV files in the `data/` directory. Example files:

```
olist_customers_dataset.csv
olist_geolocation_dataset.csv
olist_orders_dataset.csv
olist_order_items_dataset.csv
olist_order_payments_dataset.csv
olist_order_reviews_dataset.csv
olist_products_dataset.csv
olist_sellers_dataset.csv
product_category_name_translation.csv
```
FAQ will be loaded automatically from HuggingFace on first run.
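For reference, building such a store with LangChain's Chroma integration looks roughly like the sketch below. The dataset name and column names are placeholder assumptions; the real logic lives in `vectorstore/faq_vectorstore.py`.

```python
# Sketch only: "your-org/customer-support-faq" and the question/answer
# column names are placeholders, not the project's actual dataset.
from datasets import load_dataset
from langchain_chroma import Chroma
from langchain_huggingface import HuggingFaceEmbeddings

def build_faq_vectorstore(persist_dir: str = "faq_vectorstore") -> Chroma:
    # Download the FAQ corpus from the HuggingFace Hub on first run.
    faq = load_dataset("your-org/customer-support-faq", split="train")
    texts = [f"Q: {row['question']}\nA: {row['answer']}" for row in faq]

    # Embed each Q/A pair and persist the collection to disk.
    embeddings = HuggingFaceEmbeddings(
        model_name="sentence-transformers/all-MiniLM-L6-v2"
    )
    return Chroma.from_texts(texts, embeddings, persist_directory=persist_dir)
```

Once persisted, `store.similarity_search(query, k=3)` returns the best-matching FAQ entries.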
Create a `.env` file in your project root with your API keys. At minimum you need the OpenAI key if you want to use OpenAI for intent detection:

```bash
OPENAI_API_KEY=sk-xxxxxx

# Optional: override the chat model (default: gpt-4o-mini)
# OPENAI_MODEL_NAME=gpt-4o-mini

# Optional: use HuggingFace instead of OpenAI for intent classification
# INTENT_CLASSIFIER=huggingface
# HUGGINGFACE_API_TOKEN=hf_xxxx
# HF_INTENT_MODEL=facebook/bart-large-mnli
```
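If you set `INTENT_CLASSIFIER=huggingface`, intent detection can run as zero-shot classification with the configured model. A minimal sketch of that approach (the intent labels below are illustrative, not the project's canonical list):

```python
from transformers import pipeline

# Zero-shot classifier backed by the model from HF_INTENT_MODEL.
classifier = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")

# Illustrative intent labels; the agent's real set may differ.
INTENTS = ["order_status", "order_details", "refund_status", "order_review", "faq"]

def classify_intent(query: str) -> str:
    result = classifier(query, candidate_labels=INTENTS)
    return result["labels"][0]  # labels are sorted by descending score

print(classify_intent("Has my order been refunded?"))  # likely "refund_status"
```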
With the environment configured, start the agent from the command line:

```bash
python main.py
```

The first execution automatically builds `olist.db` and the FAQ vector store by running the setup modules.
Example conversation:
```
You: What is your refund policy?
AI: [semantic FAQ answer]

You: Check order status for ID 000229ec398224ef6ca0657da4fc703e
AI: Order 000229ec398224ef6ca0657da4fc703e status: delivered
```
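The CLI loop behind this conversation could be as simple as the following sketch (the `build_workflow` entry point and state keys are assumptions about `main.py`):

```python
# Hypothetical REPL; the project's actual main.py may structure this differently.
from agent.workflow import build_workflow  # assumed builder function

def main():
    app = build_workflow()
    while True:
        query = input("You: ").strip()
        if query.lower() in {"exit", "quit"}:
            break
        # Invoke the LangGraph workflow with the user's query as initial state.
        state = app.invoke({"user_query": query})
        print(f"AI: {state['answer']}")

if __name__ == "__main__":
    main()
```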
To chat via a web UI instead, launch the Gradio app:

```bash
python gradio_app.py
```

This starts a simple web UI so you can chat with the agent in your browser.
Try these example prompts to explore the agent's capabilities (a sketch of the kind of tool that backs the order queries follows the list):

- General FAQ – "What is your refund policy?" or "How do I return an item?"
- Order Status – "Where is my order `<order_id>`?"
- Order Details – "Give me all details for order `<order_id>` (items, customer city, payments, review)."
- Refund/Payment – "Has order `<order_id>` been refunded?"
- Order Review – "What review did I leave for order `<order_id>`?"
- Item Breakdown – "Which products were in order `<order_id>` and what categories are they in?"
- Seller Info – "Who were the sellers for the items in order `<order_id>`?"
- Payment Method – "How much was paid for order `<order_id>` and what payment method was used?"
- Customer Location – "Which city/state is the customer from for order `<order_id>`?"
- Additional FAQs – "How long does shipping usually take?" or "What is your return policy for defective items?"
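The order-related prompts above are answered by tool functions in `tools/business_tools.py`. As a sketch, an order-status lookup might look like this, assuming an `Order` model in `db/models.py` with `order_id` and `order_status` columns:

```python
from sqlalchemy import create_engine
from sqlalchemy.orm import Session

from db.models import Order  # assumed ORM model mapped from the Olist CSVs

engine = create_engine("sqlite:///olist.db")

def get_order_status(order_id: str) -> str:
    with Session(engine) as session:
        order = session.get(Order, order_id)  # primary-key lookup
        if order is None:
            return f"No order found with ID {order_id}."
        return f"Order {order_id} status: {order.order_status}"
```

This matches the example output shown earlier ("Order ... status: delivered").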
How it works:

- Startup:
  - Creates and populates the SQLite DB from the CSVs (ORM: all relationships preserved)
  - Builds the FAQ vector DB (Chroma + HuggingFace sentence transformers)
- Agent Loop (sketched in code below):
  - Classifies intent via LLM
  - Routes the query to the correct tool (order lookup, refund, review, FAQ search)
  - Logs every interaction for analytics/continual improvement
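A condensed sketch of this loop as a LangGraph state graph; the node names mirror the diagrams below, while `dispatch_tool` and the import path are assumptions (the real helpers live in the project's modules):

```python
from typing import TypedDict

from langgraph.graph import StateGraph, START, END

# classify_intent, rephrase_text, and log_interaction are the project's own
# functions; dispatch_tool is a hypothetical router over the business tools,
# and this import path is illustrative.
from agent.helpers import classify_intent, dispatch_tool, rephrase_text, log_interaction

class AgentState(TypedDict):
    user_query: str
    intent: str
    tool_result: str
    answer: str

def perception(state: AgentState) -> dict:
    return {"intent": classify_intent(state["user_query"])}  # LLM call

def tool(state: AgentState) -> dict:
    return {"tool_result": dispatch_tool(state["intent"], state["user_query"])}

def answer(state: AgentState) -> dict:
    return {"answer": rephrase_text(state["tool_result"])}  # LLM call

def learning(state: AgentState) -> dict:
    log_interaction(state["user_query"], state["answer"])  # logging node
    return {}

graph = StateGraph(AgentState)
for name, fn in [("perception", perception), ("tool", tool),
                 ("answer", answer), ("learning", learning)]:
    graph.add_node(name, fn)
graph.add_edge(START, "perception")
graph.add_edge("perception", "tool")
graph.add_edge("tool", "answer")
graph.add_edge("answer", "learning")
graph.add_edge("learning", END)
app = graph.compile()
```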
Architecture overview:

```mermaid
graph LR
    subgraph Setup
        A[CSV Data] -->|populate| B[SQLite DB]
        C[FAQ Dataset] -->|embed & store| D[Chroma Vector Store]
    end
    E["User Interface<br/>(CLI or Gradio Web UI)"] --> F["LangGraph Workflow"]
    F --> G["Intent Classifier<br/>LLM"]
    G --> H["Tool Dispatcher"]
    H --> B
    H --> D
    H --> I["Answer Rephrasing<br/>LLM"]
    I --> J["Final Response"]
```
A more detailed view of the runtime flow:

```mermaid
---
config:
  look: neo
  theme: neo
  layout: dagre
---
flowchart TD
    U["User"] -- 1: User Query --> Web["Gradio Web App"]
    Web -- 2 --> Workflow["LangGraph Workflow"]
    Workflow -- "2.1: Start" --> Perception["Perception Node - intent classification"]
    Perception -- Request --> LLM["LLM API"]
    LLM -- Response --> Perception & Answer["Answer Node - rephrase"]
    Perception -- "2.2" --> ToolNode["Tool Node"]
    ToolNode -- Internal --> Tools["Business Tools"]
    Tools -- Querying Relational Database --> DB[("SQLite DB")]
    DB -- Returning Data --> Tools
    Tools -- Searching Vector Database --> Vector[("Chroma Vector Store")]
    Vector -- Returning Chunks --> Tools
    Tools -- Internal --> ToolNode
    ToolNode -- "2.3" --> Answer
    Answer -- Request --> LLM
    Answer -- "2.4: End" --> Learning["Learning Node - logging"]
    Learning -- Storing Query and Response --> Logs["agent_interactions.log"]
    Answer -- 3: Response to User --> Web
```
The agent logs every question and answer to `agent_interactions.log`. The path is configured via the `LOG_FILE` constant in `config.py`:

```python
LOG_FILE = os.getenv("LOG_FILE", "agent_interactions.log")
```

The actual logging happens inside `agent/workflow.py`:
```python
import datetime

from config import LOG_FILE

def log_interaction(user_query: str, agent_answer: str):
    """Logs Q&A for learning, retraining, or analytics."""
    with open(LOG_FILE, "a", encoding="utf-8") as f:
        f.write(
            f"{datetime.datetime.now().isoformat()} | Q: {user_query} | A: {agent_answer}\n"
        )
```

To evaluate the agent:
- Run some of the sample questions above (or your own prompts).
- Review the entries in `agent_interactions.log` and compare them with your expected answers.
- Note: only the user query and final response are currently logged, but responses from intermediate steps can also be logged as needed for the evaluation plan outlined below.
Suggested metrics for analyzing these logs include:
- Intent classification accuracy – compare detected intents with a labeled dataset.
- Retrieval precision/recall – check FAQ results against ground-truth answers.
- Database query correctness – verify order status and other details against your database.
- Response quality – measure rephrased answers (e.g., with BLEU or ROUGE) or perform manual review.
- User satisfaction – track explicit user ratings or feedback over time.
| Step | Goal | Metric | Use Case |
|---|---|---|---|
| 1 | Intent classification | Accuracy / F1 | Evaluate `classify_intent` predictions |
| 2 | Matching correct FAQs | Precision@k / Recall@k | Evaluate FAQ retrieval (`search_faq`) |
| 3 | Database query results | Exact match accuracy | Evaluate order tools (`get_order_status`, `get_refund_status`, `get_review`, `get_order_details`) |
| 4 | Quality of rephrased text | BLEU / ROUGE | Evaluate rephrasing (`rephrase_text`) |
| 5 | Overall user satisfaction | Rating / manual review | Human evaluation across full interaction |
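Most of these metrics can be computed offline from the log. A small sketch that parses `agent_interactions.log` (using the format written by `log_interaction` above) and scores exact-match accuracy against a labeled set:

```python
def load_interactions(path: str = "agent_interactions.log"):
    """Parse log lines of the form '<iso-timestamp> | Q: ... | A: ...'."""
    records = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            ts, q, a = line.rstrip("\n").split(" | ", 2)
            records.append((ts, q.removeprefix("Q: "), a.removeprefix("A: ")))
    return records

def exact_match_accuracy(records, expected: dict) -> float:
    """Step 3 above: fraction of logged answers matching labeled ground truth."""
    hits = [a == expected[q] for _, q, a in records if q in expected]
    return sum(hits) / len(hits) if hits else 0.0
```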
Edit `config.py` to set:

- Model names (vector embeddings)
- DB and vectorstore paths
- HuggingFace or OpenAI model config (OpenAI chat model: `gpt-4o-mini`, HF intent model: `facebook/bart-large-mnli`)
- `INTENT_CLASSIFIER` to choose between OpenAI and HuggingFace for intent detection
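For orientation, a plausible `config.py` layout is sketched below. The names documented above (`OPENAI_MODEL_NAME`, `INTENT_CLASSIFIER`, `HF_INTENT_MODEL`, and `LOG_FILE`) come from this README; the remaining constants and defaults are assumptions:

```python
import os

from dotenv import load_dotenv

load_dotenv()  # read API keys and overrides from .env in the project root

OPENAI_MODEL_NAME = os.getenv("OPENAI_MODEL_NAME", "gpt-4o-mini")
INTENT_CLASSIFIER = os.getenv("INTENT_CLASSIFIER", "openai")
HF_INTENT_MODEL = os.getenv("HF_INTENT_MODEL", "facebook/bart-large-mnli")
LOG_FILE = os.getenv("LOG_FILE", "agent_interactions.log")

# Assumed paths and embedding model; adjust to match the repository.
DB_PATH = os.getenv("DB_PATH", "olist.db")
VECTORSTORE_DIR = os.getenv("VECTORSTORE_DIR", "faq_vectorstore")
EMBEDDING_MODEL = os.getenv(
    "EMBEDDING_MODEL", "sentence-transformers/all-MiniLM-L6-v2"
)
```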
To extend the agent:

- Add new tools to `tools/business_tools.py`
- Add new intent types to the agent workflow
- Swap out the LLM in `llm/llm.py` for Claude, Gemini, etc.
- Plug into a web API (Flask, FastAPI) with a few lines, as sketched below
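For example, a minimal FastAPI wrapper could look like the sketch below; the `build_workflow` import and state keys are the same assumptions used in the CLI sketch earlier:

```python
from fastapi import FastAPI
from pydantic import BaseModel

from agent.workflow import build_workflow  # assumed builder function

api = FastAPI()
agent = build_workflow()

class ChatRequest(BaseModel):
    query: str

@api.post("/chat")
def chat(req: ChatRequest) -> dict:
    # Run one pass of the LangGraph workflow and return the final answer.
    state = agent.invoke({"user_query": req.query})
    return {"answer": state["answer"]}

# Run with: uvicorn api:api --reload  (assuming this file is saved as api.py)
```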
For production use:

- Integrate tracing and evaluation with LangSmith or Langfuse for observability
- Add user authorization layers (e.g., API keys, OAuth)
- Implement user management to track individual accounts
- Introduce session management to maintain conversation context across requests
- Support multi-tenancy if serving multiple stores or clients
- Apply security best practices (rate limiting, input validation, and safe LLM defaults)
Troubleshooting:

- Make sure all CSVs are in `data/` and `.env` contains your API key.
- For vectorstore/embedding errors: check internet access for the model download on first run.
- If you change the DB schema or CSVs, delete `olist.db` and rerun `main.py`.
The project includes a Dockerfile and docker-compose.yml so you can run everything inside containers. Follow these steps:

- Create a `.env` file with your OpenAI (and optional HuggingFace) keys.

- Build the image (only required the first time):

  ```bash
  docker compose build
  ```

  The first build copies the contents of `data/` into the image (Step 6 in the `Dockerfile`). Once that data lives in the volume you can comment out that step for subsequent builds.

- Create the external volumes (only required once). The volumes defined in `docker-compose.yml` have `external: true`, so Docker will not create them automatically. Ensure they exist before running `docker compose up`:

  ```bash
  docker volume create ecommerce_data
  docker volume create ecommerce_database
  docker volume create ecommerce_vectorstore
  docker volume create ecommerce_logs
  ```

- Start the application and open http://localhost:7860 in your browser:

  ```bash
  docker compose up       # foreground
  docker compose up -d    # detached mode
  ```

  For an interactive shell inside the container:

  ```bash
  docker compose run --entrypoint bash -it app
  ```

- Stop or restart the containers:

  ```bash
  docker compose stop      # stop
  docker compose restart   # restart
  ```

- Remove everything when you're done:

  ```bash
  docker compose down
  ```

Useful commands for inspecting the setup:

- Volumes:

  ```bash
  docker volume ls
  docker volume inspect ecommerce_data
  ```

- Networks:

  ```bash
  docker network ls
  docker network inspect <network_name>
  ```

- Services and containers:

  ```bash
  docker compose ps
  ```

- Images:

  ```bash
  docker images
  ```
MIT License