Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue

This repository contains the code for the following paper:

Zipeng Xu, Fangxiang Feng, Xiaojie Wang, Yushu Yang, Huixing Jiang, Zhongyuan Wang, Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue. In ACM MM, 2020. (https://dl.acm.org/doi/10.1145/3394171.3413668)

@inproceedings{10.1145/3394171.3413668,
author = {Xu, Zipeng and Feng, Fangxiang and Wang, Xiaojie and Yang, Yushu and Jiang, Huixing and Wang, Zhongyuan},
title = {Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue},
year = {2020},
booktitle = {Proceedings of the 28th ACM International Conference on Multimedia},
pages = {4271--4279}
}

This code is a reimplementation built as a fork of GuessWhatGame/guesswhat, which uses TensorFlow.

Data

GuessWhat?! relies on two datasets:

The GuessWhat?! dataset, which contains the dialogue inputs
The MS COCO dataset, which contains the image inputs
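
To inspect the dialogue inputs before training, a small reader can help. The following is a minimal sketch, assuming the dialogue files are gzip-compressed JSON Lines (one game per line), as in the upstream GuessWhatGame/guesswhat release. The function name `iter_games` is hypothetical and is not part of this repository.

```python
import gzip
import json


def iter_games(path):
    """Yield one dialogue record (a dict) per non-empty line of a .jsonl.gz file.

    Assumption: the file is gzip-compressed JSON Lines. Adjust if your
    copy of the dataset is stored differently.
    """
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield json.loads(line)
```

Calling `sum(1 for _ in iter_games(path))` on a dialogue file then gives the number of games it contains, which is a quick sanity check that the download is complete.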

Supervised Pretraining

Train Oracle:

python src/guesswhat/train/train_oracle.py \
   -config config/oracle/config.v1.json \
   -exp_dir out/oracle

Train ADVSE-QGen:

python src/guesswhat/train/train_qgen_supervised.py \
   -config config/qgen/config.advse.json \
   -exp_dir out/qgen

Train ADVSE-Guesser:

python src/guesswhat/train/train_guesser.py \
   -config config/guesser/config.advse.json \
   -exp_dir out/guesser
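
The three pretraining commands above can also be launched from a single script. This is a minimal sketch that only reuses the script paths, config files, and flags listed above; the helper names `build_command` and `run_pretraining` are hypothetical, and the sketch assumes it is run from the repository root.

```python
import subprocess
import sys

# (script, config, experiment directory) per stage, copied from the commands above.
STAGES = [
    ("src/guesswhat/train/train_oracle.py", "config/oracle/config.v1.json", "out/oracle"),
    ("src/guesswhat/train/train_qgen_supervised.py", "config/qgen/config.advse.json", "out/qgen"),
    ("src/guesswhat/train/train_guesser.py", "config/guesser/config.advse.json", "out/guesser"),
]


def build_command(script, config, exp_dir, python=sys.executable):
    """Return the argument list for one training stage."""
    return [python, script, "-config", config, "-exp_dir", exp_dir]


def run_pretraining(dry_run=True):
    """Print (dry run) or execute each stage in turn; stop on the first failure."""
    commands = [build_command(*stage) for stage in STAGES]
    for command in commands:
        if dry_run:
            print(" ".join(command))
        else:
            subprocess.run(command, check=True)
    return commands
```

With `dry_run=True` the function only prints the commands, so the paths can be checked before any training starts.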

Reinforcement Learning

Starting from the supervised pretrained models, we fine-tune the QGen model with REINFORCE.

python src/guesswhat/train/train_qgen_reinforce.py \
    -exp_dir out/loop/ \
    -config config/looper/config.advse8g.json \
    -networks_dir out/ \
    -oracle_identifier <oracle_identifier> \
    -qgen_identifier <qgen_identifier> \
    -guesser_identifier <guesser_identifier> \
    -evaluate_all false \
    -store_games true
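
For readers unfamiliar with REINFORCE, the update rule can be illustrated on a toy softmax policy. This is a sketch in plain Python, not the TensorFlow code used by `train_qgen_reinforce.py`. The function names, the learning rate, and the baseline value are illustrative assumptions. The reward here is also an assumption: 1 for a successful game and 0 otherwise.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce_step(logits, action, reward, baseline=0.0, lr=0.1):
    """One REINFORCE update for a softmax policy over discrete actions.

    The gradient of log pi(action) with respect to the logits is
    one_hot(action) - probs, scaled here by the advantage (reward - baseline).
    """
    probs = softmax(logits)
    advantage = reward - baseline
    return [
        logit + lr * advantage * ((1.0 if i == action else 0.0) - p)
        for i, (logit, p) in enumerate(zip(logits, probs))
    ]
```

A sampled action that earns more than the baseline becomes more likely after the step, and one that earns less becomes less likely. Subtracting a baseline leaves the expected gradient unchanged while reducing its variance.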
