NEU Solution

LLMOps Pipeline: NEU_SOLUTION

About The Project

This project implements an end-to-end LLMOps pipeline to manage the lifecycle of a large language model (LLM) deployment. It encompasses data collection, data curation, synthetic data generation, model training, evaluation, and production serving using multiple tools and services. The pipeline is designed to automate data processing, model evaluation, and deployment, ensuring a scalable and maintainable infrastructure for large language model applications.

ARCHITECTURE

The pipeline consists of the following components:

Data Collection

Web Scraping: Web data is collected using BeautifulSoup to crawl news and other relevant data.

Synthetics Generation: Gemini is employed to generate synthetic data for conversational use cases.

See crawl

Data Curation

Data collected from scraping and synthetic generation is aggregated and stored in Google BigQuery for further processing and analysis.

Monitoring and Orchestration

Apache Airflow orchestrates the entire data collection, processing, and training workflow.

Prometheus and Grafana are used for monitoring metrics and visualizing data.

Model Training

LLaMA-Factory is used to fine-tune the LLM using curated data and synthetic conversations. Training data is sourced from BigQuery and fed into the training pipeline. MLflow manages model tracking, logging, and storing training weights and metrics. See training_cluster