The `checkForDuplicates()` method is called before and after the ChEBI Update. https://github.com/reactome/data-release-pipeline/blob/develop/chebi-update/src/main/java/org/reactome/release/chebiupdate/ChebiUpdateStep.java#L20 https://github.com/reactome/data-release-pipeline/blob/develop/chebi-update/src/main/java/org/reactome/release/chebiupdate/ChebiUpdateStep.java#L23 This method writes to the same output file "ChEBI_Update_DuplicateMoleculeIdentifiers.tsv" and so two headers are printed from this line: https://github.com/reactome/data-release-pipeline/blob/develop/chebi-update/src/main/java/org/reactome/release/chebiupdate/ChebiUpdater.java#L478 Could the duplicate report be separated into two files - one pre-update and one post-update?
The
checkForDuplicates()method is called before and after the ChEBI Update.https://github.com/reactome/data-release-pipeline/blob/develop/chebi-update/src/main/java/org/reactome/release/chebiupdate/ChebiUpdateStep.java#L20
https://github.com/reactome/data-release-pipeline/blob/develop/chebi-update/src/main/java/org/reactome/release/chebiupdate/ChebiUpdateStep.java#L23
This method writes to the same output file "ChEBI_Update_DuplicateMoleculeIdentifiers.tsv" and so two headers are printed from this line:
https://github.com/reactome/data-release-pipeline/blob/develop/chebi-update/src/main/java/org/reactome/release/chebiupdate/ChebiUpdater.java#L478
Could the duplicate report be separated into two files - one pre-update and one post-update?