Legal area classification: A comparative study of text classifiers on Singapore Supreme Court judgments

This paper conducts a comparative study on the performance of various machine learning(“ML”) approaches for classifying judgments into legal areas. Using a novel dataset of 6,227 Singapore Supreme Court judgments, we investigate how state-of-the-art NLP methods compare against traditional statistica...

Full description

Saved in:
Bibliographic Details
Main Authors: SOH, Jerrold, LIM, How Khang, CHAI, Ian Ernst
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2019
Subjects:
Online Access:https://ink.library.smu.edu.sg/sol_research/2956
https://ink.library.smu.edu.sg/context/sol_research/article/4914/viewcontent/A_Comparative_Study_of_Text_Classifiers_on_Singapore_Supreme_Court_Judgments.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
Description
Summary:This paper conducts a comparative study on the performance of various machine learning(“ML”) approaches for classifying judgments into legal areas. Using a novel dataset of 6,227 Singapore Supreme Court judgments, we investigate how state-of-the-art NLP methods compare against traditional statistical models when applied to a legal corpus that comprised few but lengthy documents. All approaches tested, including topic model, word embedding, and language model-based classifiers, performed well with as little as a few hundred judgments. However, more work needs to be done to optimize state-of-the-art methods for the legal domain.