-
Notifications
You must be signed in to change notification settings - Fork 20
Implement some more similarity metrics #181
Copy link
Copy link
Open
1 / 41 of 4 issues completedLabels
enhancementNew feature or requestNew feature or requestexperimentationExperimenting on thingsExperimenting on thingshelp wantedExtra attention is neededExtra attention is needed
Description
deven96
opened on Jan 19, 2025
Issue body actions
- Hamming distance: Linear algo. The number of bits that need to be changed to convert one vector into the other. Fast and useful for binary vectors
- Minkowski: Linear algo. It is a generalisation of Euclidean and some other distances like Manhattan i.e. by tuning some parameters, you get Euclidean/Manhattan or other distances
- Locality Sensitive Hashing: Nonlinear algo. Works by grouping vectors into buckets by processing each vector through a hash function that maximizes hashing collision as opposed to minimizing as is usual with hashing functions. Not suitable for large dimensionality vectors
- Hierarchical navigable small world: Nonlinear algo. An adaptation of navigable small world (NSW) graphs where an NSW graph is a graph structure containing vertices connected by edges to their nearest neighbors.Good for high dimensionality data
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or requestexperimentationExperimenting on thingsExperimenting on thingshelp wantedExtra attention is neededExtra attention is needed