Embedding watermarks into deep neural networks of audio classification

In recent years, there is an increasing trend of developing high performance neural network to tackle various real-world problems. This has led to momentous progress areas such as image recognition, speech emotion analysis and natural language processing. Significant amount of training data, compute...

وصف كامل

محفوظ في:
التفاصيل البيبلوغرافية
المؤلف الرئيسي: Chin, Jun Ying
مؤلفون آخرون: Zhang Tianwei
التنسيق: Final Year Project
اللغة:English
منشور في: Nanyang Technological University 2021
الموضوعات:
الوصول للمادة أونلاين:https://hdl.handle.net/10356/147918
الوسوم: إضافة وسم
لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة: Nanyang Technological University
اللغة: English
الوصف
الملخص:In recent years, there is an increasing trend of developing high performance neural network to tackle various real-world problems. This has led to momentous progress areas such as image recognition, speech emotion analysis and natural language processing. Significant amount of training data, computer resources and human resources are required to produce a service-grade neural network. Hence, it is important to regard and protect neural networks as intellectual property, owned by the creators. Various digital watermarking techniques have been proposed to identify violation of intellection property of such networks, primarily neural networks build for image classification problems. This project focuses on investigating the effectiveness of backdoor-based watermarking techniques on neural networks dealing with audio classification, then investigates the effectiveness of three different watermark generation algorithms. Additional techniques that enhance the robustness of watermarks embeddings are also explored. These include making the watermark embedding resistant against typical transformations of data in the audio domain, pruning, and fine-tuning of the trained model. This project ultimately aims to identify an effective method of watermarking of neural networks in the audio domain.