[feat] add binary quantization support vector search #687

Open
jonathanpv opened this issue Apr 8, 2024 · 1 comment
Comments

@jonathanpv

Problem Description

Currently, vector search appears to use cosine similarity only. It would be good to support other embedding types (binary or int8) via other distance functions such as hamming_distance.

Proposed Solution

We should either integrate SimSIMD or implement the vanilla JS function discussed here:

xenova/transformers.js#681
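For illustration, here is a minimal sketch of what a vanilla Hamming distance function over bit-packed embeddings could look like. It is not the function from the linked thread and not part of Orama's API; the names `POPCOUNT_TABLE` and `hammingDistance` are hypothetical, and it assumes each embedding is stored as a `Uint8Array` with 8 dimensions packed per byte.

```typescript
// Hypothetical sketch: Hamming distance between two bit-packed binary embeddings.
// Assumption: both vectors are Uint8Array values holding 8 dimensions per byte.

// Lookup table with the number of set bits for every possible byte value.
const POPCOUNT_TABLE = new Uint8Array(256);
for (let i = 1; i < 256; i++) {
  POPCOUNT_TABLE[i] = (i & 1) + POPCOUNT_TABLE[i >> 1];
}

function hammingDistance(a: Uint8Array, b: Uint8Array): number {
  if (a.length !== b.length) {
    throw new Error("Vectors must have the same length");
  }
  let distance = 0;
  for (let i = 0; i < a.length; i++) {
    // XOR leaves a 1 bit wherever the two vectors differ; count those bits.
    distance += POPCOUNT_TABLE[a[i] ^ b[i]];
  }
  return distance;
}
```

A lower distance means more similar vectors, so results would be sorted ascending, the opposite of cosine similarity.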

Alternatives

No response

Additional Context

Local-first applications would benefit from binary quantized embeddings, which reduce memory load while still supporting vector search. It would be interesting to see a stress test of this.
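To make the memory argument concrete, here is a rough sketch of binary quantization. It assumes the common convention of thresholding each dimension at zero (positive values become 1, everything else 0); the function name `quantizeToBinary` is hypothetical, and a given embedding model may recommend a different scheme.

```typescript
// Hypothetical sketch: pack a float embedding into bits, 8 dimensions per byte.
// Assumption: a dimension maps to 1 if its value is > 0, otherwise to 0.
function quantizeToBinary(embedding: Float32Array): Uint8Array {
  const packed = new Uint8Array(Math.ceil(embedding.length / 8));
  for (let i = 0; i < embedding.length; i++) {
    if (embedding[i] > 0) {
      // Most significant bit first: dimension 0 lands in bit 7 of byte 0.
      packed[i >> 3] |= 1 << (7 - (i & 7));
    }
  }
  return packed;
}
```

Under these assumptions, a 1024-dimensional float32 embedding takes 4096 bytes, while its binary form takes 128 bytes, a 32x reduction per vector.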

@jonathanpv
Author

I've looked at some of the Orama source code, and it seems this could be supported via an extension, so I could perhaps tackle it.

However, given that it's a single function plus a proposal to add types to the API, it may be a simple 30-line change.
