[More data] Extract data from character/staff descriptions #210

@Koopzington

Description

Character and staff descriptions often contain additional data like height, hometown and other extra bits, usually formatted in a more or less standardized way. That data could be extracted and stored in separate columns, and also be used as a base for more filters.
Examples:

__Race:__ Human
Age: 26
**Hometown:**: Tokyo
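
A minimal sketch of how lines like the ones above could be parsed, assuming each data point sits on its own line as `Key: Value` with optional `__` or `**` emphasis around the key. The names `FIELD_RE` and `extract_fields` are made up for illustration and are not part of any existing code. The pattern also tolerates the stray second colon seen in the Hometown example.

```python
import re

# Hypothetical pattern: optional markdown emphasis (__ or **) around "Key:",
# followed by an optional stray colon, then the value up to the end of the line.
FIELD_RE = re.compile(
    r"^\s*(?:__|\*\*)?\s*(?P<key>[^:*_\n]+?)\s*:\s*(?:__|\*\*)?\s*:?\s*(?P<value>.+?)\s*$"
)


def extract_fields(description: str) -> dict[str, str]:
    """Collect lowercase keys and raw string values from a description."""
    fields: dict[str, str] = {}
    for line in description.splitlines():
        match = FIELD_RE.match(line)
        if match:
            fields[match.group("key").lower()] = match.group("value")
    return fields


sample = "__Race:__ Human\nAge: 26\n**Hometown:**: Tokyo"
print(extract_fields(sample))
```

Note that this would also match ordinary prose lines containing a colon, so a real implementation would likely restrict the result to a list of known keys.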

Data tuples that should be considered for extraction:

  • Gender
    • Although there is now an official dedicated field for this, extracting it could still help people with data submissions
  • Birthday
    • See above
  • Blood type
    • See above
  • Hometown / Home town
    • See above
  • Height
  • Age
    • On the staff side, I'd say we don't need it, since we can calculate the age from their birthdays. Unless we're talking about vtubers that come from the future >_>
    • On the character side, the years in their birthdays are useless, since only their age at the time the story takes place is relevant. For the sake of keeping our sanity, all values like "20-30", "mid 30s" or "approx. 300" will be disregarded. It's worth mentioning that a dedicated field for this already exists, so this will also be helpful for data submissions.
  • Nationality
    • We might want to disregard any values that are fictional
  • Occupation
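
The two normalization rules in the list above (treating "Hometown" and "Home town" as the same key, and keeping only exact ages) could be sketched like this. `KEY_ALIASES`, `normalize_key` and `parse_character_age` are hypothetical names, and the alias table only covers the one spelling variant mentioned here.

```python
import re
from typing import Optional

# Assumed alias table; only the "Home town" variant comes from the list above.
KEY_ALIASES = {"home town": "hometown"}


def normalize_key(raw_key: str) -> str:
    """Lowercase the key, collapse whitespace, and map known spelling variants."""
    key = " ".join(raw_key.lower().split())
    return KEY_ALIASES.get(key, key)


def parse_character_age(raw_value: str) -> Optional[int]:
    """Return the age only when the value is a single plain integer."""
    value = raw_value.strip()
    if re.fullmatch(r"[0-9]+", value):
        return int(value)
    # Ranges ("20-30"), vague values ("mid 30s") and estimates ("approx. 300")
    # are disregarded rather than guessed at.
    return None
```

Returning `None` instead of guessing a number keeps imprecise values out of the age column, at the cost of skipping some entries a human could interpret.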

Data tuples that should not be considered for extraction:

  • Weight
  • Zodiac sign
  • Three Sizes
  • Hair/Eye Color

Although some entries may still feature this data, these fields are subject to removal according to the data submission guidelines of AL (as of 2025-02-24).
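
One way to honor both lists is an allow-list, sketched below under the assumption that extracted keys are already lowercased and normalized. `EXTRACT_KEYS` and `select_fields` are illustrative names only. With an allow-list, the fields listed as not to be extracted are dropped without needing a separate deny-list.

```python
# Keys taken from the "should be considered" list, in assumed normalized form.
EXTRACT_KEYS = {
    "gender",
    "birthday",
    "blood type",
    "hometown",
    "height",
    "age",
    "nationality",
    "occupation",
}


def select_fields(fields: dict[str, str]) -> dict[str, str]:
    """Keep only allow-listed keys; anything else is ignored."""
    return {key: value for key, value in fields.items() if key in EXTRACT_KEYS}


print(select_fields({"height": "170 cm", "weight": "60 kg", "zodiac sign": "Leo"}))
```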
