GitHub - qreater/Profiteer-IO: Maximize Profits. Minimize Guesswork.

ProfiteerIO is a full-stack, end-to-end e-commerce intelligence platform that empowers users to predict optimal product pricing and boost purchasing behavior using real-time data pipelines, machine learning, and a seamless UI experience.

Demo

For a TLDR; demo, check out the YouTube video showcasing ProfiteerIO in action.

Introduction

Setting the right price for your products can be a challenge for e-commerce businesses. Price too high, and you risk losing customers; price too low, and you sacrifice profits. So, how do you find the sweet spot in real-time, adapting to shifting market dynamics, product trends, and customer behavior?

Introducing ProfiteerIO- a powerful tool designed to help e-commerce businesses maximize profits by predicting sales under different pricing scenarios.

With ProfiteerIO, you can:

Seamlessly sync catalog sales data from diverse sources (CSV, JSON, and more) into a robust database or data warehouse using Airbyte.
Harness the power of AI by training models to predict purchase volumes based on product attributes and pricing strategies with MindsDB.
Visualize product performance and forecast future sales trends through an intuitive React dashboard.
Deploy your entire analytics stack on Kubernetes to any cloud provider (AWS, GCP, Azure) with just a few simple configurations using Helm.

Hackathon Win

ProfiteerIO was the 1st place winner of the Airbyte+MindsDB Hackathon. The project was recognized for its innovative use of Airbyte and MindsDB to create a full-stack e-commerce intelligence platform.

Screenshots

Expand for Dashboard, Catalog and Prediction Visuals

Architecture

The entire stack is deployed via a Helm chart with modular subcharts for Airbyte, MindsDB, Postgres, and the app components, enabling One-Click Deployment on Kubernetes.

Below is a comprehensive breakdown of how the platform works as well as an architecture diagram, its key components, and how everything ties together in a single seamless deployment.

Core Services & Responsibilities

STACK	ROLE
FastAPI	Synthesizes realistic e-commerce sales data based on price, rating, and time dynamics. Also exposes REST APIs for analytics and prediction, powering the frontend and Airbyte source.
PostgreSQL	Central datastore for all synthesized sales data, analytics results, and ML inputs/outputs. Tuned for fast filtering and aggregation.
Airbyte	Periodically extracts generated data from the FastAPI source and loads it into PostgreSQL. Supports scheduled and manual syncs.
MindsDB	Connects to PostgreSQL to train and serve machine learning models that forecast purchase volume based on key product features.
React	A user-friendly UI that visualizes dashboards, enables catalog interactions, and supports live predictions using slider inputs.
NGINX	Routes incoming requests (via Nginx or other reverse proxy) to backend services and enforces secure access and CORS handling.

Technical Workflow

Synthetic Dataset Generation: FastAPI uses configurable rules to simulate sales events. Parameters like price sensitivity, time-of-day behavior, and product ratings are baked into the generation algorithm.
Ingestion & Storage (EL): Airbyte extracts data from FastAPI and loads it into PostgreSQL. You can trigger syncs manually or schedule them hourly/daily.
ML Model Training: MindsDB connects directly to PostgreSQL, continuously retraining or updating its prediction models. These models consider fields like price, rating, and hour-of-day to estimate expected purchases.
API & Analytics: FastAPI exposes analytics endpoints to serve dashboard stats and product insights. For predictions, it acts as a bridge between the React frontend and MindsDB’s trained models.
User Interaction: React queries the backend to render the Dashboard (metrics), Catalog (product management), and Prediction page (interactive sliders for pricing simulation).

Dataset

The E-Commerce Dynamics Dataset is a meticulously crafted synthetic dataset designed to emulate real-world e-commerce sales behavior. The dataset simulates sales activities across a product catalog over a user-specified time period (in hours).

Each record encapsulates a snapshot of product performance at a given timestamp, factoring in dynamic pricing, consumer engagement (views, cart additions, purchases), and contextual influences like time-of-day demand fluctuations. The synthesis logic leverages probabilistic models and domain-inspired heuristics to ensure realism and variability.

Key Features

Consumer Behavior Simulation: Views, cart additions, and purchases are modeled using a multi-stage funnel, influenced by product ratings, category popularity, and time-of-day demand patterns.
Category Popularity Integration: Products are assigned to categories with predefined popularity scores (derived from CategoryPopularity and ratings), impacting visibility and engagement.
Rating Evolution: Product ratings (ranging from 2.0 to 5.0) evolve stochastically over time, simulating shifts in consumer sentiment.
Time-of-Day Sensitivity: Demand fluctuates across four time buckets—Overnight (22:00–06:00), Morning (06:00–12:00), Afternoon (12:00–18:00), and Evening (18:00–22:00)—mimicking realistic shopping patterns.
Granular Temporal Resolution: Sales data is generated hourly, with timestamps in ISO format, enabling fine-grained analysis of temporal trends.
Dynamic Pricing Mechanisms: Products follow one of three pricing strategies—aggressive, moderate, or standard—bounded by minimum and maximum price constraints, reflecting real-world pricing variability.

Data Structure

Generation Methodology

Usage Notes

The dataset is generated using the generate_sales_data function, which accepts a SalesRequest object containing the number of hours and product metadata.
The dataset is synthetic, offering flexibility for experimentation without privacy or proprietary data concerns.

Installation

Tip

The recommended way to run the project is to use the provided Helm chart for deployment on Kubernetes. However, if you prefer to run it locally, there are instructions at the end of this section.

Note

If you are using the Helm chart, please ensure you have Docker, Kubectl, Minikube, Helm, Git installed and running on your machine.

Clone, Build Docker Images

Clone the repository to your local machine using the following command

git clone https://www.github.com/qreater/profiteer-io.git

Navigate to the directory, and then follow the instructions below to build the Docker images for each component. (Frontend, Backend) Make sure you are in the minikube context for docker.

cd devops/

docker build -t profiteer-io-backend .. -f ./Dockerfile.backend
docker build -t profiteer-io-frontend .. -f ./Dockerfile.frontend

Deploying with Helm

The Helm chart includes all the necessary components, including Airbyte, MindsDB, PostgreSQL, and the FastAPI backend, as well as an NGINX reverse proxy for routing requests to the appropriate services.

Warning

Ensure you configure the values.yaml file in the helm chart according to your requirements. The default configuration is set up for local development, but you can modify it for production use. The deployment uses about 7Gi of memory and 2 CPUs for the overall stack, with MindsDB demanding the most resources.

cd devops/helm/

kubectl create namespace profiteer
helm install profiteer-io . --namespace profiteer

Accessing the Services

Once the deployment is complete, you can port-forward the services individually, or the NGINX service to access the frontend and backend. K9s is a great tool to visualize the services and their ports.

After this, the stack will be up and running. You can proceed to follow with the flow of the application, starting with the Airbyte connection to the FastAPI source.

Running ProfiteerIO Locally

Prediction

MindsDB is used to predict the number of purchases based on the features provided in the dataset. The prediction model is trained using the purchases column as the target variable and the selected features as input variables. The model uses a time window of 24 hours and a horizon of 1 hour to make predictions.

Tip

The prediction model is designed to be un-biased and robust, ensuring that the predictions are not influenced by any specific product or category.

Field Name	Description
`views`	The views received by products
`cart_adds`	The number of times products were added to the cart
`popularity_factor`	The average popularity of the product categories, calculated with current price and ratings
`price_ratio`	The average ratio of current price to base price, rounded to the nearest whole number

UI Dashboard

The UI dashboard is built using React and provides a user-friendly interface to visualize the predictions made by MindsDB. The dashboard includes the following features:

Product Catalog: A list of products with their details, including product name, image, category, and current price.
Product Details and Prediction: A detailed view of each product, including its predicted purchases based on the price set.
Overall Statistics: A summary of the overall statistics, including the total purchases, top products and total revenue.

Features

Scheduled Data Syncs: Seamlessly sync your catalog sales data from various sources into a robust database or data warehouse using Airbyte.
AI-Powered Predictions: Leverage MindsDB to train models that predict purchase volumes based on product attributes and pricing strategies.
Interactive Dashboard: Visualize product performance and forecast future sales trends through an intuitive React dashboard.
One-Click Deployment: Deploy your entire analytics stack on Kubernetes to any cloud provider (AWS, GCP, Azure) with just a few simple configurations using Helm.
Modular Architecture: The architecture is designed to be modular, allowing for easy integration of new components and services as needed.

Future Work

Stock Management: Integrate stock management features to track inventory levels and optimize stock replenishment based on sales predictions.
Broader Category Support: Expand the dataset to include a wider range of product categories and attributes, enhancing the model's predictive capabilities.
Real-Time Data Ingestion: Implement near-real-time data ingestion capabilities to ensure that the dataset is always up-to-date with the latest sales data.
World Events: Integrate world events and trends into the dataset to better understand their impact on sales and consumer behavior.

Contributing

We welcome contributions to this project! If you have any suggestions or improvements, please feel free to open an issue or submit a pull request. Here are the tools and utilities we used to build this project, and we encourage you to use them as well:

Tools & Utilities

CATEGORY	TOOL
Dev Cycle	GitHub Issues + Pull Requests
CI/CD	GitHub Actions for PR test runs
Design	Figma for UI/UX mockups, logo
Frontend Style	`prettier` & `eslint` for linting and formatting
Backend Style	`black` for Python code formatting
Deployments	Docker, Kubernetes, Helm
Asset Generation	DALL·E for product imagery and creative assets

Tip

This project adheres to modern developer workflows and automation principles. It includes CI pipelines, standardized formatting tools, and a collaborative GitHub-based review process.

License

This project is licensed under the MIT License. See the LICENSE file for more details.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github/workflows		.github/workflows
backend		backend
devops		devops
doc_assets		doc_assets
frontend		frontend
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Table of Contents

Demo

Introduction

Hackathon Win

Screenshots

Architecture

Core Services & Responsibilities

Technical Workflow

Dataset

Key Features

Data Structure

Generation Methodology

Usage Notes

Installation

Clone, Build Docker Images

Deploying with Helm

Accessing the Services

Running ProfiteerIO Locally

Prediction

UI Dashboard

Features

Future Work

Contributing

Tools & Utilities

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Table of Contents

Demo

Introduction

Hackathon Win

Screenshots

Architecture

Core Services & Responsibilities

Technical Workflow

Dataset

Key Features

Data Structure

Generation Methodology

Usage Notes

Installation

Clone, Build Docker Images

Deploying with Helm

Accessing the Services

Running ProfiteerIO Locally

Prediction

UI Dashboard

Features

Future Work

Contributing

Tools & Utilities

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages