Skip to content

Commit 1a7844c

Browse files
committed
Third-party Usage and Research
1 parent 1dfa6a7 commit 1a7844c

File tree

1 file changed

+52
-0
lines changed

1 file changed

+52
-0
lines changed

README.md

+52
Original file line numberDiff line numberDiff line change
@@ -156,6 +156,46 @@ Then use the [OpenAI's FID evaluation toolkit](https://github.com/openai/guided-
156156
Note a relatively small `cfg=1.5` is used for trade-off between image quality and diversity. You can adjust it to `cfg=5.0`, or sample with `autoregressive_infer_cfg(..., more_smooth=True)` for **better visual quality**.
157157
We'll provide the sampling script later.
158158

159+
160+
## Third-party Usage and Research
161+
162+
***In this pargraph, we cross link third-party repositories or research which use VAR and report results. You can let us know by raising an issue***
163+
164+
(`Note please report accuracy numbers and provide trained models in your new repository to facilitate others to get sense of correctness and model behavior`)
165+
166+
[12/30/2024] Varformer: Adapting VAR’s Generative Prior for Image Restoration: https://github.com/siywang541/Varformer
167+
168+
[12/19/2024] FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching: https://github.com/OliverRensu/FlowAR
169+
170+
[12/13/2024] 3D representation in 512-Byte: Variational tokenizer is the key for autoregressive 3D generation: https://github.com/sparse-mvs-2/VAT
171+
172+
[12/19/2024] FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching: https://github.com/OliverRensu/FlowAR
173+
174+
[12/9/2024] CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction: https://carp-robot.github.io/
175+
176+
[12/5/2024] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis: https://github.com/FoundationVision/Infinity
177+
178+
[12/5/2024] Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis: https://github.com/yandex-research/switti
179+
180+
[12/3/2024] XQ-GAN🚀: An Open-source Image Tokenization Framework for Autoregressive Generation: https://github.com/lxa9867/ImageFolder
181+
182+
[11/28/2024] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient: https://github.com/czg1225/CoDe
183+
184+
[11/27/2024] SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE: https://github.com/cyw-3d/SAR3D
185+
186+
[11/15/2024] M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation: https://github.com/OliverRensu/MVAR
187+
188+
[10/14/2024] HART: Efficient Visual Generation with Hybrid Autoregressive Transformer: https://github.com/mit-han-lab/hart
189+
190+
[10/3/2024] ImageFolder🚀: Autoregressive Image Generation with Folded Tokens: https://github.com/lxa9867/ImageFolder
191+
192+
[07/25/2024] ControlVAR: Exploring Controllable Visual Autoregressive Modeling: https://github.com/lxa9867/ControlVAR
193+
194+
[07/3/2024] VAR-CLIP: Text-to-Image Generator with Visual Auto-Regressive Modeling: https://github.com/daixiangzi/VAR-CLIP
195+
196+
[06/16/2024] STAR: Scale-wise Text-to-image generation via Auto-Regressive representations: https://arxiv.org/abs/2406.10797
197+
198+
159199
## License
160200
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
161201

@@ -172,3 +212,15 @@ If our work assists your research, feel free to give us a star ⭐ or cite us us
172212
primaryClass={cs.CV}
173213
}
174214
```
215+
216+
```
217+
@misc{Infinity,
218+
title={Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis},
219+
author={Jian Han and Jinlai Liu and Yi Jiang and Bin Yan and Yuqi Zhang and Zehuan Yuan and Bingyue Peng and Xiaobing Liu},
220+
year={2024},
221+
eprint={2412.04431},
222+
archivePrefix={arXiv},
223+
primaryClass={cs.CV},
224+
url={https://arxiv.org/abs/2412.04431},
225+
}
226+
```

0 commit comments

Comments
 (0)