[feat] add binary quantization support vector search #687

jonathanpv · 2024-04-08T04:09:44Z

Problem Description

Currently, vector search appears to use cosine similarity only. It would be good to support other embedding types (binary or int8) via other distance functions such as Hamming distance.

Proposed Solution

We should either integrate SimSIMD or implement the vanilla JS function discussed here:

xenova/transformers.js#681
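For illustration, a Hamming distance function over bit-packed binary embeddings could look roughly like the sketch below. This is not the function from the linked thread and not part of Orama's API: the name `hammingDistance`, the `BIT_COUNTS` table, and the choice of `Uint8Array` as the storage format are all assumptions made for the example.

```typescript
// Lookup table: BIT_COUNTS[b] is the number of set bits in the byte b.
const BIT_COUNTS = new Uint8Array(256);
for (let i = 1; i < 256; i++) {
  BIT_COUNTS[i] = (i & 1) + BIT_COUNTS[i >> 1];
}

// Hypothetical helper (not an Orama API): counts the bits that differ
// between two bit-packed binary embeddings of equal byte length.
function hammingDistance(a: Uint8Array, b: Uint8Array): number {
  if (a.length !== b.length) {
    throw new Error("Vectors must have the same length");
  }
  let distance = 0;
  for (let i = 0; i < a.length; i++) {
    // XOR leaves a 1 wherever the two bytes disagree.
    distance += BIT_COUNTS[a[i] ^ b[i]];
  }
  return distance;
}
```

A lower distance means more similar vectors, so results would be sorted ascending, the opposite of a cosine similarity score.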

Alternatives

No response

Additional Context

Local-first applications would benefit from binary-quantized embeddings, since they reduce memory usage while still supporting vector search. It would be interesting to see a stress test of this.
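To give a rough sense of the memory savings, here is a minimal sketch of one common binary quantization scheme: keep only the sign of each dimension and pack eight dimensions per byte. The function name `quantizeToBinary` and the zero threshold are assumptions for the example, not something Orama or the linked thread prescribes.

```typescript
// Hypothetical helper (not an Orama API): maps each dimension to one bit
// (1 if the value is > 0, else 0) and packs the bits most-significant first.
function quantizeToBinary(embedding: Float32Array | number[]): Uint8Array {
  const packed = new Uint8Array(Math.ceil(embedding.length / 8));
  for (let i = 0; i < embedding.length; i++) {
    if (embedding[i] > 0) {
      // Byte index is i / 8; bit position within the byte is 7 - (i % 8).
      packed[i >> 3] |= 1 << (7 - (i & 7));
    }
  }
  return packed;
}

// A 384-dimensional float32 embedding takes 384 * 4 = 1536 bytes;
// its packed binary form takes 384 / 8 = 48 bytes, a 32x reduction.
const example = quantizeToBinary(new Float32Array(384).fill(0.1));
console.log(example.byteLength);
```

The trade-off is some loss of retrieval accuracy, which is why a stress test on real data would be useful.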

jonathanpv · 2024-04-08T04:10:50Z

I've looked at some of the Orama source code, and it seems this could be supported via an extension, so I could perhaps tackle this.

However, given that it's a single function plus a proposal to add types to the API, it may be a simple 30-line change.
