Duties for datasets

Machine learning (ML) systems are increasingly being deployed in contexts, such as law, medicine and finance, where system errors present serious and foreseeable risks. As ML system behaviour is largely determined by their training inputs, should dataset providers owe duties of care to victims? Usin...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: SOH, Jerrold Tsin Howe
التنسيق: text
اللغة:English
منشور في: Institutional Knowledge at Singapore Management University 2023
الموضوعات:
الوصول للمادة أونلاين:https://ink.library.smu.edu.sg/sol_research/4443
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
الوصف
الملخص:Machine learning (ML) systems are increasingly being deployed in contexts, such as law, medicine and finance, where system errors present serious and foreseeable risks. As ML system behaviour is largely determined by their training inputs, should dataset providers owe duties of care to victims? Using the ImageNet dataset and the Generative Pre-trained Transformer (GPT) models as case studies, this chapter argues that the conventional approach of centralising duties on system providers alone yields insufficient safeguards. Dataset-specific duties should also be considered to incentivise precaution in the preparation of crucial ML input. The chapter analyses how dataset duties may be encompassed in existing tort law, surfacing situations where duties are more appropriate. For instance, where a dataset is intended to be used in a risky context, the dataset provider actively influences system outputs, and the dataset is published without safety restrictions or warnings.