NEED HELP: Action Classification low performance #127

Open
JayMay1994 opened this issue May 17, 2024 · 4 comments

@JayMay1994

Problem: I used the demo to test action classification on the Kinetics-700 validation set, but the results are very poor.

Experiment:

  1. Pretrained model: https://huggingface.co/OpenGVLab/InternVideo2-Stage2_1B-224p-f4/tree/main
  2. Text candidates: the class names from the K700 dataset annotations
  3. Dataset: Kinetics-700 validation set
  4. Code: demo.ipynb

Example:
input: carving ice/nTnAoTQ41Nc_000011_000021.mp4
output:
text: coloring in ~ prob: 0.0085
text: acting in play ~ prob: 0.0053
text: smashing ~ prob: 0.0043
text: cracking knuckles ~ prob: 0.0041
text: tasting food ~ prob: 0.0040
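
For context on reading these numbers: with 700 candidate classes, a uniform guess gives each class a probability of 1/700, about 0.0014, so a top score of 0.0085 is only about six times chance. The sketch below shows one common way such probabilities are produced from embeddings (cosine similarity, temperature scaling, softmax). It is not the code in demo.ipynb; the function name `zero_shot_topk`, its parameters, and the temperature value are assumptions made purely for illustration.

```python
import numpy as np


def zero_shot_topk(video_feat, text_feats, labels, k=5, temperature=0.01):
    """Rank candidate labels for one video by cosine similarity.

    Hypothetical helper, not part of InternVideo2. `temperature` is an
    illustrative value; a real model supplies its own logit scale.
    """
    # L2-normalize so the dot product equals cosine similarity.
    v = video_feat / np.linalg.norm(video_feat)
    t = text_feats / np.linalg.norm(text_feats, axis=1, keepdims=True)

    # Scale similarities, then apply a numerically stable softmax.
    logits = (t @ v) / temperature
    logits = logits - logits.max()
    probs = np.exp(logits) / np.exp(logits).sum()

    # Indices of the k largest probabilities, highest first.
    top = np.argsort(-probs)[:k]
    return [(labels[i], float(probs[i])) for i in top]


# Chance level for a 700-way problem, for comparison with the scores above.
num_classes = 700
chance = 1.0 / num_classes
```

If the scores for every clip stay this close to `chance`, the similarities are nearly flat across classes. One possible cause (a guess, not something confirmed in this thread) is that the similarities are not scaled before the softmax or that the video and text features do not come from matching encoders.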

@Andy1621
Collaborator

Can you try the VideoCLIP model?

@JayMay1994
Author

JayMay1994 commented May 23, 2024

Can you try the VideoCLIP model?

Do you mean InternVideo2_CLIP in InternVideo2/multi_modality/scripts/evaluation/clip/zero_shot/1B/config_k400.py?
I'm a newcomer to this field :PP

@JayMay1994
Author

Can you try the VideoCLIP model?

Do you mean I should abandon InternVideo2 and switch to VideoCLIP for action recognition? Your work seems very promising.

@Andy1621
Collaborator

Yes. I'm not sure whether there is a bug in the demo, but I have tested VideoCLIP on Kinetics and it runs normally.
