Duties for datasets

Machine learning (ML) systems are increasingly being deployed in contexts, such as law, medicine and finance, where system errors present serious and foreseeable risks. As ML system behaviour is largely determined by their training inputs, should dataset providers owe duties of care to victims? Usin...

Full description

Saved in:
Bibliographic Details
Main Author: SOH, Jerrold Tsin Howe
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sol_research/4443
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sol_research-6401
record_format dspace
spelling sg-smu-ink.sol_research-64012024-05-23T02:24:02Z Duties for datasets SOH, Jerrold Tsin Howe Machine learning (ML) systems are increasingly being deployed in contexts, such as law, medicine and finance, where system errors present serious and foreseeable risks. As ML system behaviour is largely determined by their training inputs, should dataset providers owe duties of care to victims? Using the ImageNet dataset and the Generative Pre-trained Transformer (GPT) models as case studies, this chapter argues that the conventional approach of centralising duties on system providers alone yields insufficient safeguards. Dataset-specific duties should also be considered to incentivise precaution in the preparation of crucial ML input. The chapter analyses how dataset duties may be encompassed in existing tort law, surfacing situations where duties are more appropriate. For instance, where a dataset is intended to be used in a risky context, the dataset provider actively influences system outputs, and the dataset is published without safety restrictions or warnings. 2023-12-01T08:00:00Z text https://ink.library.smu.edu.sg/sol_research/4443 info:doi/10.5040/9781509966059.ch-013 Research Collection Yong Pung How School Of Law eng Institutional Knowledge at Singapore Management University Datasets machine learning tort law Artificial Intelligence and Robotics Science and Technology Law
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Datasets
machine learning
tort law
Artificial Intelligence and Robotics
Science and Technology Law
spellingShingle Datasets
machine learning
tort law
Artificial Intelligence and Robotics
Science and Technology Law
SOH, Jerrold Tsin Howe
Duties for datasets
description Machine learning (ML) systems are increasingly being deployed in contexts, such as law, medicine and finance, where system errors present serious and foreseeable risks. As ML system behaviour is largely determined by their training inputs, should dataset providers owe duties of care to victims? Using the ImageNet dataset and the Generative Pre-trained Transformer (GPT) models as case studies, this chapter argues that the conventional approach of centralising duties on system providers alone yields insufficient safeguards. Dataset-specific duties should also be considered to incentivise precaution in the preparation of crucial ML input. The chapter analyses how dataset duties may be encompassed in existing tort law, surfacing situations where duties are more appropriate. For instance, where a dataset is intended to be used in a risky context, the dataset provider actively influences system outputs, and the dataset is published without safety restrictions or warnings.
format text
author SOH, Jerrold Tsin Howe
author_facet SOH, Jerrold Tsin Howe
author_sort SOH, Jerrold Tsin Howe
title Duties for datasets
title_short Duties for datasets
title_full Duties for datasets
title_fullStr Duties for datasets
title_full_unstemmed Duties for datasets
title_sort duties for datasets
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sol_research/4443
_version_ 1814047539370917888