Skip to content

Current Z-score outlier removal method is resulting in increased number of false positives. #57

Description

@miraep8

The current method for outlier removal (eg removing the top x% in an ordered list) seems to be resulting in a large number of false positives. Recommendation to replace this method with a numpy optimized version of the original mean/sd based approach and/or to to make the number of reads excluded a tunable parameter. Thanks to @jxmavs for raising this issue and for suggestions!

Examples of false positives (at z score 14) (false z score in center of frame). Thanks to James for collecting examples:

image image

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions