Text this: Automating dataset updates towards reliable and timely evaluation of Large Language Models

  _____     ______    ______     ___     __   __  
 |__  //   /_   _//  /_   _//   / _ \\   \ \\/ // 
   / //     -| ||-     | ||    | / \ ||   \   //  
  / //__    _| ||_    _| ||    | \_/ ||   / . \\  
 /_____||  /_____//  /__//      \___//   /_//\_\\ 
 `-----`   `-----`   `--`       `---`    `-`  --`