New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

feat: Integration of VLM embedding model #446

Open

FUYICC wants to merge 53 commits into master from CLIP_model

Contributor

FUYICC commented Feb 1, 2024 •

edited by coderabbitai bot

Description

Issue #445

Summary by CodeRabbit

New Features
- Introduced CLIPEmbedding for image and text embedding functionalities.
Bug Fixes
- Improved file encoding handling in license updates.
Tests
- Added tests for the new CLIPEmbedding functionality, covering initialization, embedding processes, and output dimension retrieval.

Wendong-Fan and others added 18 commits

November 24, 2023 01:01


          add e5 embedding

d300035


          fix typo in toml file

524bfd4


          allow user to switch embeeding model from SentenceTransformer

0f13021


          Move the import to __init__

9ddc871


          polish docstring

e9c3135


          remove # type: ignore

aeae92d


          change embed_list return type and polish docstring

b431e67


          use Union[List[List[float]], ndarray] instead of List[List[float]] | …

884f190

…ndarray


          change return of embed_list from ndarray to list

9cce263


          change name from SentenceTransformerEmbedding into SentenceTransforme…

4c7b67c

…rEncoder


          update poetry

939808e


          update poetry

e8ce692


          update poetry

1bf7320


          update poetry

653b381


          remove ndarry and union in embedding base file

93e795e


          Merge branch 'master' into feature/open_source_embedding_model

a50b478


          sentence-transformer

4d5ba2d


          integration of clip embedding and update of license

692a670

FUYICC self-assigned this

FUYICC linked an issue

that may be closed by this pull request

[Feature Request] Multi-modal RAG(Retrieval-Augmented Generation) #445

Open

4 tasks

FUYICC added the Embeddings label


          Limit embed_list input type

20654fd

FUYICC closed this


          revert changes of sentence embedding

9e0de62

FUYICC reopened this

FUYICC added 2 commits

February 5, 2024 14:18


          poetry change of pillow

b3ea26c


          change of docstring of functions

f1adf18

FUYICC requested review from lightaime and Wendong-Fan

February 9, 2024 03:05

FUYICC marked this pull request as ready for review

February 9, 2024 03:11

Member

Appointat commented Mar 22, 2024

@FUYICC Hi, is the pr still in progress? Let me know if you have any difficulties.

Contributor Author

FUYICC commented Mar 27, 2024

@FUYICC Hi, is the pr still in progress? Let me know if you have any difficulties.

Thank you for your kind help! Sorry I've been mostly working on my dissertation for the past 3 weeks so I haven't had time to move forward, I'll be up and running starting next week, we'll discuss any questions anytime!

FUYICC and others added 5 commits

April 9, 2024 23:58


          Change to general visual language model class and use lazy initializa…

f0a1573

…tion


          Merge branch 'master' into CLIP_model

0fc220d


          test for inconsistancy of inputs with different types

1fa0c0f


          update of poetry

71d48a2


          usage of **kwargs

4de4fad

FUYICC changed the title ~~Integration of CLIP embedding model~~ Integration of VLM embedding model

FUYICC and others added 5 commits

May 2, 2024 15:28


          debug for pytest

1517d52


          Merge branch 'master' into CLIP_model


          poetry dependency

ed54edf


          ruff

a667614


          poetry

8aab43d

FUYICC requested review from Appointat, zechengz and dandansamax

May 3, 2024 05:58

Wendong-Fan reviewed

View reviewed changes

camel/embeddings/vlm_embedding.py Show resolved Hide resolved

Wendong-Fan requested changes

View reviewed changes

camel/embeddings/vlm_embedding.py Show resolved Hide resolved

FUYICC added 2 commits

May 5, 2024 21:32


          return list of float

8c1f086


          change of tests

b8bd94e

FUYICC requested a review from Wendong-Fan

May 5, 2024 17:14

Wendong-Fan added this to the Sprint 4 milestone

Wendong-Fan reviewed

View reviewed changes

Member

Wendong-Fan left a comment

Thanks for the contribution and sorry for the late review, left some comments

camel/embeddings/vlm_embedding.py Outdated Show resolved Hide resolved

camel/embeddings/vlm_embedding.py Outdated

Comment on lines 71 to 74

+                                  images=obj, return_tensors="pt", padding=True, **kwargs
+                              )
+                              image_feature = (
+                                  self.model.get_image_features(**input, **kwargs)

Member

Wendong-Fan May 26, 2024

redundant kwargs could lead to unexpected behaviors if kwargs contains overlapping keys

Member

Wendong-Fan May 28, 2024

how about separate kwargs into 2 dict?

def embed_list(
    self,
    objs: List[Union[Image.Image, str]],
    processor_kwargs: dict ={},
    model_kwargs: dict = {},
) -> List[List[float]]:
            text_input = self.processor(
                text=obj,
                return_tensors="pt",
                padding=True,
                **processor_kwargs,
            )
            text_feature = (
                self.model.get_text_features(**text_input, **model_kwargs)
                .squeeze(dim=0)
                .tolist()
            )

camel/embeddings/vlm_embedding.py Outdated

Comment on lines 82 to 85

+                                  text=obj, return_tensors="pt", padding=True, **kwargs
+                              )
+                              text_feature = (
+                                  self.model.get_text_features(**input, **kwargs)

Member

Wendong-Fan May 26, 2024

same as above

camel/embeddings/vlm_embedding.py Outdated Show resolved Hide resolved

camel/embeddings/vlm_embedding.py Outdated Show resolved Hide resolved

camel/embeddings/vlm_embedding.py Outdated Show resolved Hide resolved

camel/embeddings/vlm_embedding.py Outdated Show resolved Hide resolved

camel/embeddings/vlm_embedding.py Outdated Show resolved Hide resolved

pyproject.toml

@@ @@ -59,7 +59,7 @@ pyowm = { version = "^3.3.0", optional = true } @@
               googlemaps = { version = "^4.10.0", optional = true }
               requests_oauthlib = { version = "^1.3.1", optional = true }
               unstructured = { extras = ["all-docs"], version = "^0.10.30", optional = true }
+              pillow = { version = "^10.2.0", optional = true }

Member

Wendong-Fan May 26, 2024

why we need this library?

Contributor Author

FUYICC May 27, 2024

Because we need to determine if the input is an image or not in vlm embedding class.

Member

Wendong-Fan May 28, 2024

also add it under [tool.poetry.extras] tools and all part, as well as [[tool.mypy.overrides]]

FUYICC and others added 6 commits

May 27, 2024 20:57


          Update camel/embeddings/vlm_embedding.py

de718ce

Co-authored-by: Wendong-Fan <133094783+Wendong-Fan@users.noreply.github.com>


          Update camel/embeddings/vlm_embedding.py

6ebf5cd

Co-authored-by: Wendong-Fan <133094783+Wendong-Fan@users.noreply.github.com>


          Update camel/embeddings/vlm_embedding.py

c969597

Co-authored-by: Wendong-Fan <133094783+Wendong-Fan@users.noreply.github.com>


          Update camel/embeddings/vlm_embedding.py

6b2c48e

Co-authored-by: Wendong-Fan <133094783+Wendong-Fan@users.noreply.github.com>


          Update camel/embeddings/vlm_embedding.py

b0cadb0

Co-authored-by: Wendong-Fan <133094783+Wendong-Fan@users.noreply.github.com>


          one method for **kwargs

e2c7824

Wendong-Fan changed the title ~~Integration of VLM embedding model~~ feat: Integration of VLM embedding model

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Reviewers

Wendong-Fan Wendong-Fan requested changes

coderabbitai[bot] coderabbitai left review comments

lightaime Awaiting requested review from lightaime

Appointat Awaiting requested review from Appointat

zechengz Awaiting requested review from zechengz

dandansamax Awaiting requested review from dandansamax

Requested changes must be addressed to merge this pull request.