Skip to content

Organizing Audio Data for Speaker Recognition: Folders or Single Folder? #31

@lpscr

Description

@lpscr

first thank you very must for your work it's amazing !

I have 10 speakers in my dataset. Should I organize each speaker's audio files into separate folders like this

├── spk1
│   ├── 1.wav
│   ├── 2.wav
│   └── ...
├── spk2
│   ├── 3.wav
│   ├── 4.wav
│   └── ...
└── spk3
    ├── 5.wav
    ├── 6.wav
    └── 

Or is it okay to put all the audio files in one folder like this:

> ├── 1.wav
> ├── 2.wav
> ├── 3.wav
> ├── 4.wav
> ├── 5.wav
> ├── 6.wav
> └── 

Does separating each speaker into a folder help with better recognition during training? Also, I only have about 1 hour of audio data for each of the 10 speakers. Is that enough for effective training?"

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions