Refactor MLLM #529

Open

hhaAndroid wants to merge 119 commits into InternLM:main from hhaAndroid:refactor_llava

+17,550 −121

Collaborator

hhaAndroid commented Mar 29, 2024

To make the multimodal model more flexible and easier to use, the existing LLaVA code needs to be refactored.

hhaAndroid and others added 14 commits

March 29, 2024 15:04


          refactor llava

90222e0

fix

4dd223e

fix

d2428af


          update

39add6b


          update

f154bb9


          fix ddp

2feb0e3


          add config

cd0a01b


          add config

36053f6


          add config

211c33a


          fix disp

b38f453


          fix test

a868151


          add dataset

7f70c56


          fix eval dataset

04f0ac1


          Merge branch 'main' into refactor_llava

43c2bab

LZHgrla reviewed

...va/phi2_2_7b_siglip_so400m_p14_384/llava_phi2_2_7b_siglip_so400m_p14_384_e1_gpu8_pretrain.py Outdated

+              val_dataset = [
+                  dict(
+                      type=MMELLaVADataset,
+                      data_file='/mnt/petrelfs/huanghaian/code/xtuner/LMUData/MME.tsv',

Collaborator

LZHgrla Apr 2, 2024

For paths like this, should users set them by editing the config, or should the data be downloaded automatically internally, as VLMEvalKit does?

Collaborator Author

hhaAndroid Apr 3, 2024

I don't think it needs to be that smart. Just require users to download the data in advance, which avoids strange problems.
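If the data must be downloaded in advance, an early existence check would let a wrong path fail fast with a clear message. A minimal sketch, assuming a hypothetical helper `require_data_file` that is not part of xtuner:

```python
import os.path as osp


def require_data_file(data_file: str) -> str:
    """Return data_file unchanged if it exists, else raise a clear error.

    Hypothetical helper for illustration only; not part of xtuner.
    """
    if not osp.isfile(data_file):
        raise FileNotFoundError(
            f'Evaluation data not found: {data_file}. '
            'Download it in advance and update the path in the config.')
    return data_file
```

A dataset class could call this on `data_file` in its constructor, so a missing file is reported before training starts rather than at the first evaluation.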

LZHgrla reviewed

...va/phi2_2_7b_siglip_so400m_p14_384/llava_phi2_2_7b_siglip_so400m_p14_384_e1_gpu8_pretrain.py Outdated

+              # set log processor
+              log_processor = dict(by_epoch=False)
+              # ==================== val and test cfg =======================

Collaborator

LZHgrla Apr 2, 2024

Should this part go into PART 3 Dataset & Dataloader above, or stay separate below?

Collaborator Author

hhaAndroid Apr 3, 2024

I think keeping it separate below would be more convenient for users?

LZHgrla reviewed

xtuner/dataset/evaluation/hallusion_llava_dataset.py Outdated

+                      print_log('============================================', 'current')
+                      print_log(score, 'current')
+                      print_log('============================================', 'current')
+                      print_log(f'YOrN_eval successfully finished evaluating', 'current')

Collaborator

LZHgrla Apr 2, 2024

Suggested change

-                    print_log(f'YOrN_eval successfully finished evaluating', 'current')
+                    print_log('Hallusion successfully finished evaluating', 'current')

LZHgrla reviewed

xtuner/dataset/evaluation/hallusion_llava_dataset.py Outdated

+                      with pd.ExcelWriter(osp.join(work_dir, self.results_xlsx_path), engine='openpyxl') as writer:
+                          results_df.to_excel(writer, index=False)
+                      score = Hallusion_rating(data)

Collaborator

LZHgrla Apr 2, 2024

Move the Hallusion_rating function into this file? It should only be used here, so that would make it easier to read.

Collaborator Author

hhaAndroid Apr 3, 2024

Sure.

LZHgrla reviewed

xtuner/dataset/evaluation/mme_llava_dataset.py Outdated

+                      with pd.ExcelWriter(osp.join(work_dir, self.results_xlsx_path), engine='openpyxl') as writer:
+                          results_df.to_excel(writer, index=False)
+                      score = MME_rating(data)

Collaborator

LZHgrla Apr 2, 2024

Move MME_rating into this file as well?

LZHgrla reviewed

xtuner/dataset/evaluation/hallusion_llava_dataset.py Outdated

Comment on lines 29 to 31

		template = prompt_template
		self.template = template

Collaborator

LZHgrla Apr 2, 2024

Suggested change

-                    template = prompt_template
-                    self.template = template
+                    self.template = prompt_template

LZHgrla reviewed

xtuner/dataset/evaluation/mme_llava_dataset.py Outdated

Comment on lines 32 to 34

		template = prompt_template
		self.template = template

Collaborator

LZHgrla Apr 2, 2024

Suggested change

-                    template = prompt_template
-                    self.template = template
+                    self.template = prompt_template

LZHgrla reviewed

xtuner/dataset/evaluation/multiple_choice_llava_dataset.py Outdated

Comment on lines 33 to 35

		template = prompt_template
		self.template = template

Collaborator

LZHgrla Apr 2, 2024

Suggested change

-                    template = prompt_template
-                    self.template = template
+                    self.template = prompt_template

LZHgrla reviewed

xtuner/dataset/evaluation/pope_llava_dataset.py Outdated

Comment on lines 65 to 67

		template = prompt_template
		self.template = template

Collaborator

LZHgrla Apr 2, 2024

Suggested change

-                    template = prompt_template
-                    self.template = template
+                    self.template = prompt_template

LZHgrla reviewed

xtuner/dataset/evaluation/textvqa_llava_dataset.py Outdated

Comment on lines 45 to 47

		template = prompt_template
		self.template = template

Collaborator

LZHgrla Apr 2, 2024

Suggested change

-                    template = prompt_template
-                    self.template = template
+                    self.template = prompt_template

LZHgrla reviewed

xtuner/engine/runner/loops.py

+                          results = collect_results(results, len(dataset))
+                          self.runner.logger.info('========= Starting the evaluation of a data ===========')
+                          if is_main_process():
+                              metric = dataset.evaluate(results, self.runner.work_dir)

Collaborator

LZHgrla Apr 2, 2024

Should self.runner.work_dir carry the iter information, similar to here:

xtuner/xtuner/engine/hooks/evaluate_chat_hook.py

Lines 93 to 98 in 0b5708c

    def _save_eval_output(self, runner, eval_outputs):
        save_path = os.path.join(runner.log_dir, 'vis_data',
                                 f'eval_outputs_iter_{runner.iter}.txt')
        with open(save_path, 'w', encoding='utf-8') as f:
            for i, output in enumerate(eval_outputs):
                f.write(f'Eval output {i + 1}:\n{output}\n\n')
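One way to carry the iteration in the evaluation output path would be to derive a per-iteration subdirectory from the work dir. This is only a sketch: the helper `iter_work_dir` and the `eval_iter_` directory naming are assumptions for illustration, not code from this PR.

```python
import os


def iter_work_dir(work_dir: str, cur_iter: int) -> str:
    """Build and create an iteration-specific output directory.

    Hypothetical helper; the 'eval_iter_<n>' naming is an assumption.
    """
    path = os.path.join(work_dir, f'eval_iter_{cur_iter}')
    os.makedirs(path, exist_ok=True)
    return path


# Possible use inside the loop (not verified against the PR code):
# metric = dataset.evaluate(
#     results, iter_work_dir(self.runner.work_dir, self.runner.iter))
```

With this layout, results written at different iterations would not overwrite each other.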

LZHgrla reviewed

xtuner/dataset/evaluation/textvqa_llava_dataset.py Outdated

+                      print_log('Samples: {}, Accuracy: {:.2f}%'.format(len(pred_list), acc), 'current')
+                      print_log('============================================', 'current')
+                      print_log(f'TextVQA successfully finished evaluating', 'current')
+                      return {'acc': acc}

Collaborator

LZHgrla Apr 2, 2024

return acc?

Collaborator Author

hhaAndroid Apr 3, 2024

As I recall, it is required to return a dict, because it is read later when saving the best checkpoint.
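The point about returning a dict can be illustrated with a small sketch of best-score tracking keyed by metric name. The function `update_best` is hypothetical and assumes higher is better; it is not the actual checkpoint hook logic.

```python
from typing import Dict, Optional, Tuple


def update_best(metrics: Dict[str, float], key: str,
                best: Optional[float]) -> Tuple[float, bool]:
    """Return (best score so far, whether `metrics` improved on it).

    Hypothetical illustration; assumes a higher score is better.
    """
    score = metrics[key]  # keyed lookup requires a dict, not a bare float
    if best is None or score > best:
        return score, True
    return best, False
```

For example, `update_best({'acc': 71.5}, 'acc', None)` returns `(71.5, True)`, whereas a bare `acc` value would give the caller no key to select on.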

LZHgrla reviewed

xtuner/tools/test.py

		@@ -96,7 +97,11 @@ def main():
		runner = RUNNERS.build(cfg)

		state_dict = guess_load_checkpoint(args.checkpoint)

Collaborator

LZHgrla Apr 3, 2024

checkpoint is currently an optional argument, which looks like a leftover bug from before; it would be better to make it a positional argument.

Collaborator Author

hhaAndroid Apr 3, 2024

ok
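A minimal `argparse` sketch of the suggested change follows. The parser description, the `config` argument, and the help strings are assumptions for illustration; only the idea of making `checkpoint` positional comes from the review comment.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser; argument names other than `checkpoint` are assumed."""
    parser = argparse.ArgumentParser(description='Test a model')
    parser.add_argument('config', help='config file name or path')
    # No leading dashes, so argparse treats this as a required positional
    # argument and exits with a usage error when it is missing.
    parser.add_argument('checkpoint', help='checkpoint file to load')
    return parser
```

With a positional `checkpoint`, a call that omits it fails at argument parsing instead of passing `None` on to `guess_load_checkpoint`.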

hhaAndroid added 3 commits

April 3, 2024 15:06


          update config

8b44a9e


          Merge branch 'refactor_llava' of github.com:hhaAndroid/xtuner into re…

3a994dd

…factor_llava

fix

05534c9

hhaAndroid added 30 commits

April 26, 2024 15:31

fix

d4ef310


          add config

7f62009


          update

1bd9be1


          add 70b finetune

f6abf85


          add internvl 1.5 pretrain

d732b58


          add internvl 1.5 finetune

6f8d2fb


          update

ab0b003


          update

f47d06d


          add layer-wise learning rate (LLDR)

323dfbb


          update config

e605a73

fix

ed1a836


          update

f5a1922

fix

cfd8d4d


          update config

98e6ac9


          Merge branch 'main' of github.com:InternLM/xtuner into refactor_llava

8bf0f3e


          update config


          add test

38c8c27

fix

55f01aa


          add allava

1c5de9d

fix

c361adc


          add finetune


          add finetune1

43b27d0


          add config

28796c2


          updata

aae7b46


          updata

85a62f5


          update

f998ae5


          add patch select

fix

98f1f58


          update

bd4bf22


          fix bug

32723b1
