Text this: Building a large-scale dataset for audio-conditioned dance motion synthesis

  _____     _____      _____   __   __    ____    
 /  ___||  |  ___||   / ___//  \ \\/ //  |  _ \\  
| // __    | ||__     \___ \\   \ ` //   | |_| || 
| \\_\ ||  | ||__     /    //    | ||    | .  //  
 \____//   |_____||  /____//     |_||    |_|\_\\  
  `---`    `-----`  `-----`      `-`'    `-` --`