Float16 integration and API #4234

IvanPleshkov · 2024-05-14T09:55:33Z

Adds Float16 support across the REST and gRPC APIs, storages, scorers, and constructors.

All Submissions:

Contributions should target the dev branch. Did you create your branch from dev?
Have you followed the guidelines in our Contributing document?
Have you checked to ensure there aren't other open Pull Requests for the same update/change?

New Feature Submissions:

Does your submission pass tests?
Have you formatted your code locally using the cargo +nightly fmt --all command prior to submission?
Have you checked your code using the cargo clippy --all --all-features command?

Changes to Core Features:

Have you added an explanation of what your changes do and why you'd like us to include them?
Have you written new tests for your core changes, as applicable?
Have you successfully run tests with your changes locally?

IvanPleshkov · 2024-05-14T13:20:22Z

lib/segment/src/spaces/metric_f16/simple_cosine.rs

The cosine metric was missing; added it.

Does preprocess work for non-float32 types? I can add tests or SIMD implementations for the cosine metric if helpful.

IvanPleshkov · 2024-05-14T13:21:57Z

lib/segment/src/data_types/primitive.rs

-            Distance::Manhattan => <ManhattanMetric as Metric<VectorElementType>>::preprocess(v),
-        };
-        Cow::from(preprocessed_vector)
+        Cow::Owned(vector.iter().map(|&x| f16::to_f32(x)).collect_vec())


The previous implementation was incorrect. quantization_preprocess is a trick for the byte type, where we need to apply a -127 shift for binary quantization. Here we only need to convert to f32.
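To illustrate the difference described above, here is a minimal, dependency-free sketch. It is not the code from this PR: the real change calls `f16::to_f32` on each element, while `f16_bits_to_f32`, `f16_slice_to_f32`, and `shift_bytes` are hypothetical helpers that work on raw IEEE 754 half-precision bit patterns so the example compiles without external crates. The byte shift is assumed to be a plain subtraction of 127, as the comment suggests.

```rust
/// Hypothetical stand-in for `f16::to_f32`: decodes an IEEE 754
/// half-precision bit pattern into an f32 without changing its value.
fn f16_bits_to_f32(bits: u16) -> f32 {
    let sign = u32::from(bits >> 15) << 31;
    let exp = u32::from((bits >> 10) & 0x1f);
    let frac = u32::from(bits & 0x03ff);
    match (exp, frac) {
        // Signed zero.
        (0, 0) => f32::from_bits(sign),
        // Subnormal half: value is frac * 2^-24.
        (0, _) => {
            let magnitude = frac as f32 / 16_777_216.0;
            if sign == 0 { magnitude } else { -magnitude }
        }
        // Infinity.
        (0x1f, 0) => f32::from_bits(sign | 0x7f80_0000),
        // NaN (payload bits carried over).
        (0x1f, _) => f32::from_bits(sign | 0x7fc0_0000 | (frac << 13)),
        // Normal number: rebias the exponent from 15 to 127.
        _ => f32::from_bits(sign | ((exp + 112) << 23) | (frac << 13)),
    }
}

/// Float16 case: a plain element-wise widening, with no metric-specific
/// preprocessing and no offset.
fn f16_slice_to_f32(vector: &[u16]) -> Vec<f32> {
    vector.iter().map(|&x| f16_bits_to_f32(x)).collect()
}

/// Byte case (assumed form of the "-127" trick): shift u8 values so they
/// are spread around zero before binary quantization.
fn shift_bytes(vector: &[u8]) -> Vec<f32> {
    vector.iter().map(|&x| f32::from(x) - 127.0).collect()
}
```

The Float16 path keeps every value as-is, whereas the byte path changes the values so that the result is centered near zero.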

IvanPleshkov · 2024-05-14T13:23:44Z

lib/segment/tests/integration/byte_storage_hnsw_test.rs

+    VectorStorageDatatype::Float16,
+    64,
+    20
+)]
 fn test_byte_storage_hnsw(


I extended the existing byte-vector test to cover HNSW with Float16.

IvanPleshkov · 2024-05-14T13:24:01Z

lib/segment/tests/integration/byte_storage_quantization_test.rs

@@ -166,6 +194,7 @@ fn sames_count(a: &[Vec<ScoredPointOffset>], b: &[Vec<ScoredPointOffset>]) -> us
 )]
 fn test_byte_storage_binary_quantization_hnsw(
    #[case] query_variant: QueryVariant,
+    #[case] storage_data_type: VectorStorageDatatype,


I extended the existing byte-vector test to cover quantization with Float16.

IvanPleshkov · 2024-05-14T14:08:52Z

lib/segment/tests/integration/byte_storage_quantization_test.rs

@@ -46,40 +48,68 @@ enum QuantizationVariant {
    Binary,
 }

-fn random_discovery_query<R: Rng + ?Sized>(rnd: &mut R, dim: usize) -> QueryVector {
+fn random_vector<R>(rnd_gen: &mut R, dim: usize, data_type: VectorStorageDatatype) -> DenseVector


Because we test binary quantization, the generated vectors must actually exercise the binary condition. For u8 we generate vectors in the range [0; 255]; for f16 we generate vectors in the range [-0.5; 0.5].
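A rough sketch of the per-datatype generation described above, under stated assumptions. The test in this PR uses a generic `R` random generator and returns a `DenseVector`; here `Lcg`, `Datatype`, and `center` are hypothetical stand-ins so the example has no external dependencies, and values are returned as plain `f32`. The sketch also assumes that the byte values are centered by the -127 shift mentioned earlier in the thread.

```rust
/// Hypothetical stand-in for `VectorStorageDatatype`.
#[derive(Clone, Copy)]
enum Datatype {
    Uint8,
    Float16,
}

/// Tiny linear congruential generator, used only to avoid an external
/// dependency in this sketch.
struct Lcg(u64);

impl Lcg {
    /// Returns a value in [0, 1) built from the top 24 bits of the state.
    fn next_unit(&mut self) -> f32 {
        self.0 = self
            .0
            .wrapping_mul(6364136223846793005)
            .wrapping_add(1442695040888963407);
        (self.0 >> 40) as f32 / 16_777_216.0
    }
}

/// Generates a vector whose value range matches the storage datatype:
/// integers in [0, 255] for u8, values in [-0.5, 0.5) for f16.
fn random_vector(rng: &mut Lcg, dim: usize, datatype: Datatype) -> Vec<f32> {
    (0..dim)
        .map(|_| {
            let u = rng.next_unit();
            match datatype {
                Datatype::Uint8 => (u * 256.0).floor(),
                Datatype::Float16 => u - 0.5,
            }
        })
        .collect()
}

/// Assumed centering step: bytes are shifted by -127, f16 values are
/// already spread around zero and stay unchanged.
fn center(vector: &[f32], datatype: Datatype) -> Vec<f32> {
    match datatype {
        Datatype::Uint8 => vector.iter().map(|x| x - 127.0).collect(),
        Datatype::Float16 => vector.to_vec(),
    }
}
```

With both ranges, the centered values can fall on either side of zero, which is what a binary encoding needs to produce a mix of bits.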


IvanPleshkov marked this pull request as draft May 14, 2024 10:12

github-actions bot mentioned this pull request May 14, 2024

Flaky test index::hnsw_index::tests::test_graph_connectivity::test_graph_connectivity #2875

Open

IvanPleshkov changed the title from "f16 integration" to "Float16 integration and API" May 14, 2024

IvanPleshkov commented May 14, 2024


IvanPleshkov marked this pull request as ready for review May 14, 2024 14:56

IvanPleshkov requested a review from generall May 14, 2024 14:57

IvanPleshkov added 2 commits May 14, 2024 20:30

f16 integration

9686470

tests api fix test are you happy clippy

fix build

b014ed2

IvanPleshkov force-pushed the f16-integration branch from c129559 to b014ed2 May 14, 2024 18:33

github-actions bot mentioned this pull request May 14, 2024

Flaky test segment_builder_test::test_building_cancellation #2723

Open

generall approved these changes May 15, 2024


IvanPleshkov merged commit e808449 into dev May 15, 2024
17 checks passed

IvanPleshkov deleted the f16-integration branch May 15, 2024 08:36

generall pushed a commit that referenced this pull request May 26, 2024

Float16 integration and API (#4234)

78b16a1

* f16 integration: tests, api, fix test, are you happy clippy
* fix build


IvanPleshkov May 14, 2024

TheQuantumFractal May 20, 2024

IvanPleshkov May 14, 2024

generall May 15, 2024

IvanPleshkov May 14, 2024

IvanPleshkov May 14, 2024

IvanPleshkov May 14, 2024 (edited)
