Skip to content

5-Fold CV split Implementation #11

@adiv5

Description

@adiv5

Hi,
As mentioned in the paper

Each split was stratified according to the sample site to mitigate potential batch artifacts

In the code provided, I'm unable to understand how was the split done. How were the site information sourced for each sample?

It would be really great if you could point to a snippet that you used for making the splits

Note: I was going through this and here the tissue source site is supposed to be the component in the barcode just after "TCGA". Please confirm if this is so?

Thanks in Advance

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions