Discovering significant topics from legal decisions with selective inference

We propose and evaluate an automated pipeline for discovering significant topics from legal decision texts by passing features synthesized with topic models through penalized regressions and post-selection significance tests. The method identifies case topics significantly correlated with outcomes,...

Full description

Saved in:
Bibliographic Details
Main Author: SOH, Jerrold Tsin Howe
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2024
Subjects:
Online Access:https://ink.library.smu.edu.sg/sol_research/4433
https://ink.library.smu.edu.sg/context/sol_research/article/6391/viewcontent/Discovering_significant_topics_from_legal_decisions_with_selective_inference_pvoa_cc_by.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sol_research-6391
record_format dspace
spelling sg-smu-ink.sol_research-63912024-04-04T07:18:23Z Discovering significant topics from legal decisions with selective inference SOH, Jerrold Tsin Howe We propose and evaluate an automated pipeline for discovering significant topics from legal decision texts by passing features synthesized with topic models through penalized regressions and post-selection significance tests. The method identifies case topics significantly correlated with outcomes, topic-word distributions which can be manually interpreted to gain insights about significant topics, and case-topic weights which can be used to identify representative cases for each topic. We demonstrate the method on a new dataset of domain name disputes and a canonical dataset of European Court of Human Rights violation cases. Topic models based on latent semantic analysis as well as language model embeddings are evaluated. We show that topics derived by the pipeline are consistent with legal doctrines in both areas and can be useful in other related legal analysis tasks. This article is part of the theme issue 'A complexity science approach to law and governance'. 2024-04-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sol_research/4433 info:doi/10.1098/rsta.2023.0147 https://ink.library.smu.edu.sg/context/sol_research/article/6391/viewcontent/Discovering_significant_topics_from_legal_decisions_with_selective_inference_pvoa_cc_by.pdf http://creativecommons.org/licenses/by/3.0/ Research Collection Yong Pung How School Of Law eng Institutional Knowledge at Singapore Management University domain name disputes European Court of Human Rights legal language processing text-as-data topic models Courts Legal Studies Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic domain name disputes
European Court of Human Rights
legal language processing
text-as-data
topic models
Courts
Legal Studies
Numerical Analysis and Scientific Computing
spellingShingle domain name disputes
European Court of Human Rights
legal language processing
text-as-data
topic models
Courts
Legal Studies
Numerical Analysis and Scientific Computing
SOH, Jerrold Tsin Howe
Discovering significant topics from legal decisions with selective inference
description We propose and evaluate an automated pipeline for discovering significant topics from legal decision texts by passing features synthesized with topic models through penalized regressions and post-selection significance tests. The method identifies case topics significantly correlated with outcomes, topic-word distributions which can be manually interpreted to gain insights about significant topics, and case-topic weights which can be used to identify representative cases for each topic. We demonstrate the method on a new dataset of domain name disputes and a canonical dataset of European Court of Human Rights violation cases. Topic models based on latent semantic analysis as well as language model embeddings are evaluated. We show that topics derived by the pipeline are consistent with legal doctrines in both areas and can be useful in other related legal analysis tasks. This article is part of the theme issue 'A complexity science approach to law and governance'.
format text
author SOH, Jerrold Tsin Howe
author_facet SOH, Jerrold Tsin Howe
author_sort SOH, Jerrold Tsin Howe
title Discovering significant topics from legal decisions with selective inference
title_short Discovering significant topics from legal decisions with selective inference
title_full Discovering significant topics from legal decisions with selective inference
title_fullStr Discovering significant topics from legal decisions with selective inference
title_full_unstemmed Discovering significant topics from legal decisions with selective inference
title_sort discovering significant topics from legal decisions with selective inference
publisher Institutional Knowledge at Singapore Management University
publishDate 2024
url https://ink.library.smu.edu.sg/sol_research/4433
https://ink.library.smu.edu.sg/context/sol_research/article/6391/viewcontent/Discovering_significant_topics_from_legal_decisions_with_selective_inference_pvoa_cc_by.pdf
_version_ 1814047478557704192