Crafted Experiment

This repository contains the code for generating crafted datasets based on real scRNA-seq data. In our study, we generated 24 crafted datasets by perturbing genes selected from different levels of sparsity in the previously published 3 cell line mixture dataset (GEO: GSE136148).

Please cite:

Using the 3 cell line mixture dataset as the background, we created a fourth luminal group by randomly sampling 30% of the luminal cells (N=162) and perturbing a set of genes in the sampled cells. Instead of perturbing genes chosen at random across the whole genome, we focused on gene regions defined by the relationship between the total count per gene and the proportion of zeros per gene, both computed across the luminal cells only. First, we split the genes into three major regions:

Sparse (S) - genes with proportion of zeros greater than 0.9 (N=9,223)
Medium (M) - genes with proportion of zeros between 0.45 and 0.55 (N=732)
Dense (D) - genes with proportion of zeros less than 0.1 (N=613)
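
The region split above can be illustrated with a short sketch. This is not the repository's R code: it is a minimal Python illustration on toy data, and the function names (zero_proportion, classify_region), the toy gene names, and the choice to treat the Medium bounds as inclusive are all assumptions made for the example.

```python
def zero_proportion(counts_per_cell):
    """Fraction of cells in which a gene has a zero count."""
    return sum(1 for c in counts_per_cell if c == 0) / len(counts_per_cell)


def classify_region(p_zero):
    """Map a proportion of zeros to a gene region.

    Returns "S", "M", or "D", or None when the gene falls outside all
    three regions. Inclusive Medium bounds are an assumption.
    """
    if p_zero > 0.9:
        return "S"   # Sparse
    if 0.45 <= p_zero <= 0.55:
        return "M"   # Medium
    if p_zero < 0.1:
        return "D"   # Dense
    return None


# Hypothetical toy data: gene -> counts across 20 luminal cells.
toy_counts = {
    "geneA": [0] * 19 + [3],       # 95% zeros
    "geneB": [0] * 10 + [2] * 10,  # 50% zeros
    "geneC": [5] * 20,             # no zeros
    "geneD": [0] * 6 + [1] * 14,   # 30% zeros, outside every region
}

regions = {g: classify_region(zero_proportion(c)) for g, c in toy_counts.items()}
print(regions)
```

Genes whose proportion of zeros falls between the regions (such as geneD here) are not assigned to any region and so are never candidates for perturbation.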

From each gene region, we randomly selected 600, 300, 100, and 50 genes to control the strength of the added signal. To allow cross comparisons, these gene sets are nested within each other: the 50-gene set is a subset of the 100-gene set, which is a subset of the 300-gene set, and so on.
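
One way to obtain nested random gene sets is to draw the largest set once and take prefixes of it. The sketch below is a Python illustration under that assumption, not the procedure implemented in the repository's R scripts; nested_gene_sets, the seed, and the placeholder gene names are hypothetical.

```python
import random


def nested_gene_sets(genes, sizes=(600, 300, 100, 50), seed=0):
    """Draw nested random gene sets: each smaller set is a subset of the larger ones.

    The largest set is sampled without replacement, and every smaller set
    is a prefix of it. random.sample returns items in selection order, so
    each prefix is itself a valid random sample.
    """
    rng = random.Random(seed)
    ordered_sizes = sorted(sizes, reverse=True)
    largest = rng.sample(list(genes), ordered_sizes[0])
    return {n: largest[:n] for n in ordered_sizes}


# Placeholder names standing in for the 613 genes of the Dense region.
dense_genes = [f"gene{i}" for i in range(613)]
gene_sets = nested_gene_sets(dense_genes)

for n, selected in gene_sets.items():
    print(n, len(selected))
```

Because the smaller sets are contained in the larger ones, differences between crafted datasets from the same region reflect the number of perturbed genes rather than a change in which genes were drawn.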

Description

The 3 cell line mixture dataset used in this analysis is in the data folder. The R scripts used to generate the crafted datasets are in the code folder. Crafted datasets generated by running the code are included in the crafted_datasets folder, and the 24 crafted datasets used in the paper are included in the crafted_datasets_paper folder.

Example

1. Run 1.GenerateCraftedGenes.R to randomly select genes for crafting. From each gene region (sparse, medium, or dense), 600, 300, 100, and 50 genes are randomly selected. To allow cross comparisons, these gene sets are nested within each other.
2. Depending on which type of genes was selected for crafting in step 1, run the matching GenerateCraftedExperiments_***.R script to generate the crafted datasets.

