Hi,
As mentioned in the paper
Each split was stratified according to the sample site to mitigate potential batch artifacts
In the code provided, I'm unable to understand how was the split done. How were the site information sourced for each sample?
It would be really great if you could point to a snippet that you used for making the splits
Note: I was going through this and here the tissue source site is supposed to be the component in the barcode just after "TCGA". Please confirm if this is so?
Thanks in Advance
Hi,
As mentioned in the paper
In the code provided, I'm unable to understand how was the split done. How were the site information sourced for each sample?
It would be really great if you could point to a snippet that you used for making the splits
Note: I was going through this and here the tissue source site is supposed to be the component in the barcode just after "TCGA". Please confirm if this is so?
Thanks in Advance