Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
56 commits
Select commit Hold shift + click to select a range
f60a61f
Use stand. inchis for deduplication
Sep 3, 2025
0571c20
rerun
Sep 3, 2025
729e4fe
regenerate plots as pdfs
PaulaKramer Sep 23, 2025
244521a
rerun notebook
PaulaKramer Sep 23, 2025
7572fd4
fixes for chembl update
PaulaKramer Sep 24, 2025
4564162
rerun custom notebooks ci
PaulaKramer Sep 24, 2025
563742d
new enamine substructure search
PaulaKramer Sep 26, 2025
f3dc227
analyze and adapt qed threshold
PaulaKramer Sep 26, 2025
0a293fe
changed parameters and explanation
PaulaKramer Sep 26, 2025
e2ae722
change qed parameter
PaulaKramer Sep 26, 2025
6b85b84
Merge pull request #89 from volkamerlab/inchi_deduplication
PaulaKramer Sep 26, 2025
0d937ee
update pkidb in kinfraglib notebook
PaulaKramer Sep 30, 2025
70ab077
kinfraglib inchi deduplication
PaulaKramer Sep 30, 2025
e2e862a
bug fix exclude single filters
PaulaKramer Oct 1, 2025
a25605f
rerun with new combinatorial library
PaulaKramer Oct 15, 2025
055072f
adapt chembl version
PaulaKramer Oct 15, 2025
297f923
change askcos flag
PaulaKramer Oct 15, 2025
b9fd88d
new combinatorial library rerun
PaulaKramer Oct 15, 2025
1cf87e6
Merge pull request #94 from volkamerlab/custom_bug_fixes
PaulaKramer Oct 15, 2025
ab7971d
adapted readme
PaulaKramer Oct 16, 2025
d109fde
ci add new zenodo
PaulaKramer Oct 17, 2025
b556021
adapt path in ci
PaulaKramer Oct 17, 2025
e1ec089
Merge pull request #93 from volkamerlab/kinfraglib_deduplication
PaulaKramer Oct 17, 2025
a85fb3a
rerun all fragments
kabu00002 Nov 3, 2025
68d10f8
Merge branch 'qed_threshold' of github.com:volkamerlab/KinFragLib int…
kabu00002 Nov 3, 2025
e0c541a
rerun askcos
kabu00002 Nov 12, 2025
44cb472
rerun notebook
Nov 12, 2025
400c916
rerun custom analysis
PaulaKramer Nov 12, 2025
b8d6883
adapt askcos documentation
PaulaKramer Nov 12, 2025
854b166
resolve merge conflicts
PaulaKramer Nov 12, 2025
bf226dc
Merge pull request #91 from volkamerlab/qed_threshold
PaulaKramer Nov 12, 2025
68fd0bc
Kinase Map Notebook
Aug 30, 2025
ce8f04b
added tSNE plot
Sep 3, 2025
2589a10
rebase branch
PaulaKramer Nov 12, 2025
3ff03e6
adapt for renamed notebooks
PaulaKramer Sep 29, 2025
37fcb7d
add kinase numbers
PaulaKramer Sep 29, 2025
8b9bf75
rerun kinase coverage with updated custom library
PaulaKramer Nov 12, 2025
e5e57de
rerun enamine most similar
PaulaKramer Nov 13, 2025
e55bc77
more documentation in coverage notebook
PaulaKramer Nov 13, 2025
5f2f09f
Merge pull request #88 from volkamerlab/kinase_mapping_custom_kfl
PaulaKramer Nov 13, 2025
207de31
adapt readmes
PaulaKramer Nov 13, 2025
b28bdda
add askcos installation infos
PaulaKramer Nov 13, 2025
6de8290
Merge pull request #95 from volkamerlab/readme_updates
PaulaKramer Nov 13, 2025
55e6153
reduce cpu cores retro
PaulaKramer Nov 14, 2025
990e51d
nbval checks
PaulaKramer Nov 14, 2025
6f240a2
ci rerun custom notebooks
PaulaKramer Nov 14, 2025
342bb2d
reduce cpu count
PaulaKramer Nov 14, 2025
5dc9504
ci cpu split
PaulaKramer Nov 17, 2025
ad2ddd9
ci nbval, timeout, cpu cores
PaulaKramer Nov 17, 2025
18f544e
reduce cpu, rerun notebooks nbval
PaulaKramer Nov 17, 2025
011f29b
exclude retro notebooks ci
PaulaKramer Nov 18, 2025
6c171f9
rerun notebooks ci
PaulaKramer Nov 18, 2025
84e3884
ci rerun
PaulaKramer Nov 19, 2025
840bd32
reduce timeout
PaulaKramer Nov 19, 2025
7b6124d
Merge pull request #96 from volkamerlab/ci_fixing
PaulaKramer Nov 19, 2025
6d65cae
rerun paper plots
PaulaKramer Nov 19, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
10 changes: 5 additions & 5 deletions .github/workflows/ci.yml
Original file line number Diff line number Diff line change
Expand Up @@ -62,18 +62,18 @@ jobs:
shell: bash -l {0}
run: |
echo "Download combinatorial library from zenodo..."
wget -q -O data/combinatorial_library/combinatorial_library.tar.bz2 https://zenodo.org/record/10843763/files/combinatorial_library.tar.bz2?download=1
wget -q -O data/combinatorial_library/combinatorial_library.tar.bz2 https://zenodo.org/record/17368450/files/combinatorial_library.tar.bz2?download=1
ls -l data/combinatorial_library/
echo "Decompress selected files..."
tar -xvf data/combinatorial_library/combinatorial_library.tar.bz2 combinatorial_library_deduplicated.json chembl_standardized_inchi.csv
mv combinatorial_library_deduplicated.json data/combinatorial_library/
mv chembl_standardized_inchi.csv data/combinatorial_library/
tar -xvf data/combinatorial_library/combinatorial_library.tar.bz2 combinatorial_library/combinatorial_library_deduplicated.json combinatorial_library/chembl_standardized_inchi.csv
mv combinatorial_library/combinatorial_library_deduplicated.json data/combinatorial_library/
mv combinatorial_library/chembl_standardized_inchi.csv data/combinatorial_library/
ls -l data/combinatorial_library/

- name: Run tests
shell: bash -l {0}
run: |
PYTEST_ARGS="--nbval-lax --nbval-current-env --nbval-cell-timeout=3600"
PYTEST_IGNORE="--ignore=notebooks/custom_kinfraglib/2_3_custom_filters_paper.ipynb"
PYTEST_IGNORE="--ignore=notebooks/custom_kinfraglib/2_4_custom_filters_paper.ipynb --ignore=notebooks/custom_kinfraglib/1_4_custom_filters_pairwise_retrosynthesizability.ipynb --ignore=notebooks/custom_kinfraglib/2_1_custom_filters_pipeline.ipynb"

pytest $PYTEST_ARGS $PYTEST_IGNORE
2 changes: 1 addition & 1 deletion data/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,5 +10,5 @@ Overview of data content:
- `combinatorial_library/`: Combinatorial library based on the reduced fragment library.
- `external/`: Data from external resources.
- `filters/`: Data used for custom filters.
- `fragment_library_custom_filtered/`: Custom filtered fragment library: Pre-filtered (remove pool X, deduplicate per subpocket, remove unfragmented ligands, remove all fragments that connect only to pool X), and filtered for unwanted substructures (PAINS and Brenk), drug-likeness (Ro3 and QED), synthesizability (buyable building blocks and SYBA) and pairwise retrosynthesizability (using ASKCOS).
- `fragment_library_custom_filtered/`: Custom filtered fragment library: Pre-filtered (remove pool X, deduplicate per subpocket, remove unfragmented ligands, remove all fragments that connect only to pool X), and filtered for unwanted substructures (PAINS and Brenk), drug-likeness (Ro3 and QED), synthesizability (buyable building blocks and SYBA) and pairwise retrosynthesizability (using ASKCOS).
- `fragment_library_old/`: Full fragment library v1.1.0 which was described in the KinFragLib paper.
6 changes: 3 additions & 3 deletions data/combinatorial_library/README.md
Original file line number Diff line number Diff line change
@@ -1,17 +1,17 @@
# KinFragLib: Combinatorial library

[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.15844170.svg)](https://doi.org/10.5281/zenodo.15844170)
[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.17368450.svg)](https://doi.org/10.5281/zenodo.17368450)

This folder is meant for the metadata and properties of the KinFragLib combinatorial library, which is based on the KinFragLib fragment library at https://github.com/volkamerlab/KinFragLib. This dataset is used for the analysis of the combinatorial library.

**Note**: Since this dataset contains large files, we provide it outside this repository at https://zenodo.org/records/15844170 (DOI: 10.5281/zenodo.15844170, v2.0.1).
**Note**: Since this dataset contains large files, we provide it outside this repository at https://zenodo.org/records/17368450 (DOI: 10.5281/zenodo.17368450, v2.0.2).
In order to run the analysis notebooks, please download this dataset to this folder.

## Raw data

- `combinatorial_library.json`: Full combinatorial library, please refer to `notebooks/kinfraglib/4_1_combinatorial_library_data_preparation.ipynb` at https://github.com/volkamerlab/KinFragLib for detailed information about this data format
- `combinatorial_library_deduplicated.json`: Deduplicated combinatorial library (based on InChIs)
- `chembl_standardized_inchi.csv`: Standardized ChEMBL 33 molecules in the form of InChI strings.
- `chembl_standardized_inchi.csv`: Standardized ChEMBL 36 molecules in the form of InChI strings.
- `KLIFS_download_summary.csv`: PDB codes of all KLIFS structures used to generate the KinFragLib fragmentation library.

## Processed data
Expand Down
2 changes: 1 addition & 1 deletion data/external/README.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
# External data

- `pkidb_2020-07-15.sdf`: PKIDB dataset from 2020-07-15 downloaded from https://www.icoa.fr/pkidb/ (publications: DOI:10.3390/molecules23040908 and DOI: 10.3390/molecules25143226).
- `pkidb_2025-04-15.sdf`: PKIDB dataset from 2025-04-15 downloaded from https://www.icoa.fr/pkidb/ (publications: DOI:10.3390/molecules23040908 and DOI: 10.3390/molecules25143226).
Loading
Loading