CLL to Richter Syndrome: integrating network-based strategies with experimental readouts elucidating disease-drivers and personalized therapies
Simulation, analysis and code for reproducing the results of the draft can be found in CLL_FigureScripts.R. The script is ordered according to the figure numbers in the draft. All additional custom functions required to run the script are provided within the Function folder, and loaded automatically within the main script.
All additional file sources are provided in the repository and listed below:
- The modeled Boolean network is given in the text-based BoolNet-format (CLL_Model.txt, CLL_Model_Microenvironment.txt)
- Experimental data on the anonymised patient cohort can be found in BMI1_Expression_Anonymous.csv and CD5_Expression_Anonymous.csv
- The input output analysis readouts are provided as CLL_IO-Notch_TP53.RData, InputOutputAnalysisHeatmap.RData
The network predictions were validated using a single cell RNA Seq dataset of human CLL patients by Nadeu et al., Nature Medicine, 2022 (https://doi.org/10.1038/s41591-022-01927-8). For the analysis, we used the publicly available Seurat-objects (https://zenodo.org/records/6631966). Code for the validation of the Boolean network prediction based on single cell RNA sequencing data can be found in the RMarkdown-file SingleCellAnalysis.Rmd.
Additional files :
- Intermediary results are given as RData-objects (binarizedDataset.RData, toBinarizeRevised.RData).
- To compare the scRNA seq data with the network predictions, we used the attractors given in the files RS_AKT_KI_attrmatrix.Rdata, RS_CDKN2AB_TP53_KO_attrmatrix.Rdata, RS_NFAT_KO_attrmatrix.Rdata, RS_NOTCH_KI_attrmatrix.Rdata, CLL_attrmatrix.Rdata
- A mapping of network gene names to ENSEMBL-IDs can be found in Attrgene_plus_EnsID.Rdata
- SingleCellAnalysis.nb.html is a compiled html-file showing the results as generated within SingleCellAnalysis.Rmd
- All figures showing the different results generated in the SingleCellAnalysis.Rmd file can be found in the folder "output"
All main simulations were performed on a MacBook Pro with Apple Silicon Chip and OS-Version 14.3 (Sonoma). Binarization and input/output analysis were run massively parallel on a compute node running with Ubuntu 20.04 LTS. Besides the latter two, the script can be ran within minutes. Depending on the avaiable resources binarization and input/output analysis might take up to days. Consequently, the intermediary results are given withing the repository (binarizedDataset.RData, CLL_IO-Notch_TP53.RData, InputOutputAnalysisHeatmap.RData).
Simulations were run on R Version 4.2.2.
Required packages for simulation are :
- BoolNet, version 2.1.7
- qgraph, version 1.9.3
- igraph, version 1.3.5
- pROC, version 1.18.0
- ggplot2, version 3.4.1
- ggpubr, version 0.6.0
- poweRlaw, version 0.70.6
- readr, version 2.1.5
- viridis, version 0.6.4
- tidyverse, version 2.0.0
- Seurat, version 5.0.0
- SeuratObject, version 5.0.0
- reshape2, version 1.4.4
- harmony, version 1.1.0
- BiTriNa, version 1.3.1