Text this: Automating dataset updates towards reliable and timely evaluation of Large Language Models

  ____      _____     ______     ___     _    _   
 |  _ \\   |  ___||  /_   _//   / _ \\  | \  / || 
 | |_| ||  | ||__    `-| |,-   | / \ || |  \/  || 
 | .  //   | ||__      | ||    | \_/ || | .  . || 
 |_|\_\\   |_____||    |_||     \___//  |_|\/|_|| 
 `-` --`   `-----`     `-`'     `---`   `-`  `-`