Text this: Developing a statistical formula for identifying four-character words in chinese text.

   _____     ___     ______     _____     _____   
  / ___//   / _ \\  |      \\  |  ___||  |__  //  
  \___ \\  | / \ || |  --  //  | ||__      / //   
  /    //  | \_/ || |  --  \\  | ||__     / //__  
 /____//    \___//  |______//  |_____||  /_____|| 
`-----`     `---`   `------`   `-----`   `-----`