Open
Conversation
…kflows/modules into module/fingerprintparser
…ix pool processing
…TIENTS; remove ifEmpty pass-through fallback
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Adding modules and subworkflows for creating a fingerprint vcf using gbcms and then transforming the vcf into a tab-delimited file using a custom script. The tab delimited file is based on the format used in the clinical pipeline. modules also compute contamination, and samples are combined and correlated for confirming that samples match expectations.
fingerprint/vcfparser — Parses a fingerprint VCF file produced by GBCMS into a TSV of locus-level allele counts, genotypes, and VAFs per sample.
fingerprint/combine — Merges per-sample fingerprint TSVs into a single wide table, applying a depth filter and lifting over loci coordinates between hg19 and hg38.
fingerprint/contamination — Calculates major and minor contamination from a pair of tumor/normal fingerprint TSVs.
fingerprint/correlation — Computes pairwise Pearson correlations across all samples in a combined fingerprint table and outputs a heatmap (PDF + interactive HTML), a correlations matrix, and a loci-overlap observations matrix.
fingerprint/mislabels — Flags unexpected sample matches (unrelated samples with suspiciously high correlation) and unexpected mismatches (same-patient samples with suspiciously low correlation) based on patient labels from a sample sheet, using validated thresholds.
PR checklist
feature/<module_name>for modules, orfeature/<subworkflow_name>for subworkflows. For modules, if there is a subcommand use:feature/<module_name>/<module_subcommand>.versions.ymlfile.label.nf-core modules --git-remote https://github.com/mskcc-omics-workflows/modules.git -b <module_branch> test <MODULE> --profile dockernf-core modules --git-remote https://github.com/mskcc-omics-workflows/modules.git -b <module_branch> test <MODULE> --profile singularitynf-core modules --git-remote https://github.com/mskcc-omics-workflows/modules.git -b <module_branch> test <MODULE> --profile condanf-core subworkflows --git-remote https://github.com/mskcc-omics-workflows/modules.git -b <subworkflow_branch> test <SUBWORKFLOW> --profile dockernf-core subworkflows --git-remote https://github.com/mskcc-omics-workflows/modules.git -b <subworkflow_branch> test <SUBWORKFLOW> --profile singularitynf-core subworkflows --git-remote https://github.com/mskcc-omics-workflows/modules.git -b <subworkflow_branch> test <SUBWORKFLOW> --profile conda