Skip to content

This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).

License

Notifications You must be signed in to change notification settings

chenhaoxing/DiffUTE

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DiffUTE

This repository is the code of our NeurIPS'23 paper "DiffUTE: Universal Text Editing Diffusion Model". Unfortunately, pre-trained models are not allowed to be made public due to the lisence of AntGroup. You can easily reproduce our method using diffusers and transformers.

Getting Started with DiffUTE

Installation

The codebases are built on top of diffusers. Thanks very much.

Requirements

  • Linux or macOS with Python ≥ 3.8
  • PyTorch ≥ 1.10.0 and torchvision that matches the PyTorch installation. You can install them together at pytorch.org to make sure of this
  • OpenCV
  • transformers

Steps

  1. Install diffusers following https://github.com/huggingface/diffusers.

  2. Prepare datasets. Due to data sensitivity issues, our data will not be publicly available now, you can reproduce it on your own data, and all images with text are available for model training. Because our data is present on Ali-Yun oss, we have chosen pcache to read the data we have stored. You can change the data reading method according to the way you store the data.

  3. Train VAE

  4. Train DiffUTE

Experimental results

Citing DiffUTE

If you use DiffUTE in your research or wish to refer to the baseline results published here, please use the following BibTeX entry.

@inproceedings{DiffUTE,
      title={DiffUTE: Universal Text Editing Diffusion Model},
      author={Chen, Haoxing and Xu, Zhuoer and Gu, Zhangxuan and Lan, Jun and Zheng, Xing and Li, Yaohui and Meng, Changhua and Zhu, Huijia and Wang, Weiqiang},
      booktitle={Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS)},
      year={2023}
}

Contacts

Please feel free to contact us if you have any problems.

Email: hx.chen@hotmail.com or zhuoerxu.xzr@antgroup.com

About

This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages