Update README.md
README.md
CHANGED
@@ -61,7 +61,7 @@ Source code is available at https://github.com/NVlabs/Sana.
 - **Model Description:** This is a model that can be used to generate and modify images based on text prompts.
 It is a Linear Diffusion Transformer that uses one fixed, pretrained text encoders ([Gemma2-2B-IT](https://huggingface.co/google/gemma-2-2b-it))
 and one 32x spatial-compressed latent feature encoder ([DC-AE](https://hanlab.mit.edu/projects/dc-ae)).
-- **Resources for more information:** Check out our [GitHub Repository](https://github.com/NVlabs/Sana) and the [
+- **Resources for more information:** Check out our [GitHub Repository](https://github.com/NVlabs/Sana) and the [SANA-1.5 report on arXiv](https://arxiv.org/abs/2501.18427).
 
 ### Model Sources
 