Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

There is a long gap between the validation accuracy of the dataset of vlmevalkit and the model paper #94

Open
YongLD opened this issue Feb 23, 2024 · 6 comments

Comments

@YongLD
Copy link

YongLD commented Feb 23, 2024

On the TextVQA dataset, the paper in Instructblip 13b indicates that its precision is 50.7, and the paper in Qwen VL Chat shows an accuracy of 63.75.
In terms of the accuracy measured by the vlmevalkit official, the accuracy of Instructblip 13b is about 30, and the accuracy of QWEN VL Chat is 10.5, what do you think is the problem?
Also, I tested the accuracy of Instructblip 13b on textVQA and found that I ran with an accuracy of 16.7, what went wrong? These are all the results of prefech, and GPT is not used.

@kennymckormick
Copy link
Member

Hi, @YongLD ,
Actually, the support of VQA datasets is still in progress (we only share some preliminary results for now). We still cannot obtain the corresponding accuracies reported by the VLM papers. Potential reasons might be different prompt used or different inference hyper parameters.

@John-Ge
Copy link

John-Ge commented Feb 25, 2024

Hello, I find that for TextVQA dataset, LLaVA evaluation with with reference token like:
What kind of beer is this?\nReference OCR token: NINK, NK, BOWING, CC, STON, SUE, ED, Sublimely, SELF, ELF-RICHEE, swAaVd, KGy, ALE\nAnswer the question using a single word or phrase.
in VLMEvalKit does not apply that token, would you like to add an option for users to choose to add the reference tokens or not?

@YongLD
Copy link
Author

YongLD commented Feb 28, 2024

@kennymckormick Can we use the azure openai key in VlmEvalKit? How can I change the base_url of azure?

@kennymckormick
Copy link
Member

@kennymckormick Can we use the azure openai key in VlmEvalKit? How can I change the base_url of azure?

Currently, VLMEvalKit does not support openai api key (cuz I do not have azure api access can cannot debug). You can follow the azure doc to support an azure openai wrapper in VLMEvalKit.

@geknow
Copy link

geknow commented Mar 13, 2024

I've noticed that most of the results are dissimilar compared to those in the research paper. I believe that the framework that's in use should be rectified, making it more corresponding with the original results.

@kennymckormick
Copy link
Member

I've noticed that most of the results are dissimilar compared to those in the research paper. I believe that the framework that's in use should be rectified, making it more corresponding with the original results.

Hi, @geknow ,
We understand the known issues with the preliminary results on VQA tasks might be misleading. We have know removed these results and will re-upload them once ready.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants