Skip to content
Bryan Robbins edited this page Feb 28, 2016 · 2 revisions

The Baseball Workbench UI should allow users to describe, execute, and export statistical analysis. To support this goal, there should be a flow of the following activities:

  • Select Initial Dataset, from a list of available Public datasets. For example: Lahman.Hitters
  • Define one or more new columns, in terms of columns from the Initial Dataset and basic arithmetic (add, subtract, multiply, divide). For example: RC = (H + BB) * TB / (AB + BB)
  • Define one or more row filters, in terms of column names, values, and comparators, to be applied to the updated data set (Initial + New Columns) prior to export. For example: Year > 1955
  • Define Exported Artifact, from a list of available export types and their options. For example: Histogram of RC
  • Click "Generate"
  • Receive temporary link to exported files.

The available datasets are:

  • Individual tables from the Lahman database (e.g., Hitters, Pitchers, Teams, etc.)
  • Retrosheet Gamelogs database (Regular Season, Postseason, or All-Star)

The supported Export Types are:

  • Table ordered by X (ASC or DSC)
  • Histogram of X
  • Scatter Plot of X vs. Y

Clone this wiki locally