Text this: LaVie: high-quality video generation with cascaded latent diffusion models

 _____     _    _     _____    _    _    _____    
|  __ \\  | || | ||  / ____|| | || | || |  __ \\  
| |  \ || | || | || / //---`' | || | || | |  \ || 
| |__/ || | \\_/ || \ \\___   | \\_/ || | |__/ || 
|_____//   \____//   \_____||  \____//  |_____//  
 -----`     `---`     `----`    `---`    -----`