Skip to content

LLM benchmark for media: rate and visualize Verso Playground outputs

Notifications You must be signed in to change notification settings

Verso-Lab/journobench

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 

Repository files navigation

JournoBench

A Streamlit dashboard for evaluating AI models' performance on news writing tasks based on user feedback data stored in Firestore.

Setup

  1. Make sure you have Python 3.7+ installed
  2. Install required packages:
    pip install -r requirements.txt
    
  3. Make sure your Firebase credentials are properly set up in .streamlit/secrets.toml

Running the App

Run the app with:

streamlit run app.py

The app will be available at http://localhost:8501

Features

  • View and filter user evaluations of AI model outputs
  • Compare performance across different news writing tasks
  • Filter by task type (Headlines, Newsletter Writing, etc.)

Project Structure

  • app.py - Main Streamlit application
  • .streamlit/secrets.toml - Firebase credentials (not tracked in git)
  • requirements.txt - Python dependencies

About

LLM benchmark for media: rate and visualize Verso Playground outputs

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages