LaVie: high-quality video generation with cascaded latent diffusion models
This work aims to learn a high-quality text-to-video (T2V) generative model by leveraging a pre-trained text-to-image (T2I) model as a basis. It is a highly desirable yet challenging task to simultaneously (a) accomplish the synthesis of visually realistic and temporally coherent videos while (b) pr...
Saved in:
Main Authors: | Wang, Yaohui, Chen, Xinyuan, Ma, Xin, Zhou, Shangchen, Huang, Ziqi, Wang, Yi, Yang, Ceyuan, He, Yinan, Yu, Jiashuo, Yang, Peiqing, Guo, Yuwei, Wu, Tianxing, Si, Chenyang, Jiang, Yuming, Chen, Cunjian, Loy, Chen Change, Dai, Bo, Lin, Dahua, Qiao, Yu, Liu, Ziwei |
---|---|
其他作者: | College of Computing and Data Science |
格式: | Article |
語言: | English |
出版: |
2025
|
主題: | |
在線閱讀: | https://hdl.handle.net/10356/183061 |
標簽: |
添加標簽
沒有標簽, 成為第一個標記此記錄!
|
機構: | Nanyang Technological University |
語言: | English |
相似書籍
-
Statistical Study of Permutation Test for Quantitative Trait LOCI Detection
由: CHEN YUMING
出版: (2010) -
VToonify: controllable high-resolution portrait video style transfer
由: Yang, Shuai, et al.
出版: (2023) -
Exploiting diffusion prior for real-world image super-resolution
由: Wang, Jianyi, et al.
出版: (2024) -
Modular Modeling and Control for Autonomous Underwater Vehicle (AUV)
由: CHEN YANG
出版: (2010) -
Law schools vie in moot court
由: Liggayu, Margaret Mary B.
出版: (2012)