STPrivacy: Spatio-temporal privacy-preserving action recognition

Existing methods of privacy-preserving action recognition (PPAR) mainly focus on frame-level (spatial) privacy removal through 2D CNNs. Unfortunately, they have two major drawbacks. First, they may compromise temporal dynamics in input videos, which are critical for accurate action recognition. Seco...

Full description

Saved in:
Bibliographic Details
Main Authors: LI, Ming, XU, Xiangyu, FAN, Hehe, ZHOU, Pan, LIU, Jun, LIU, Jia-Wei, LI, Jiahe, KEPPO, Jussi, SHOU, Mike Zheng, YAN, Shuicheng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8985
https://ink.library.smu.edu.sg/context/sis_research/article/9988/viewcontent/2023_ICCV_STPrivacy.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-9988
record_format dspace
spelling sg-smu-ink.sis_research-99882024-07-25T08:30:07Z STPrivacy: Spatio-temporal privacy-preserving action recognition LI, Ming XU, Xiangyu FAN, Hehe ZHOU, Pan LIU, Jun LIU, Jia-Wei LI, Jiahe KEPPO, Jussi SHOU, Mike Zheng YAN, Shuicheng Existing methods of privacy-preserving action recognition (PPAR) mainly focus on frame-level (spatial) privacy removal through 2D CNNs. Unfortunately, they have two major drawbacks. First, they may compromise temporal dynamics in input videos, which are critical for accurate action recognition. Second, they are vulnerable to practical attacking scenarios where attackers probe for privacy from an entire video rather than individual frames. To address these issues, we propose a novel framework STPrivacy to perform video-level PPAR. For the first time, we introduce vision Transformers into PPAR by treating a video as a tubelet sequence, and accordingly design two complementary mechanisms, i.e., sparsification and anonymization, to remove privacy from a spatio-temporal perspective. In specific, our privacy sparsification mechanism applies adaptive token selection to abandon action-irrelevant tubelets. Then, our anonymization mechanism implicitly manipulates the remaining action-tubelets to erase privacy in the embedding space through adversarial learning. These mechanisms provide significant advantages in terms of privacy preservation for human eyes and action-privacy trade-off adjustment during deployment. We additionally contribute the first two large-scale PPAR benchmarks, VP-HMDB51 and VP-UCF101, to the community. Extensive evaluations on them, as well as two other tasks, validate the effectiveness and generalization capability of our framework. 2023-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8985 info:doi/10.1109/ICCV51070.2023.00471 https://ink.library.smu.edu.sg/context/sis_research/article/9988/viewcontent/2023_ICCV_STPrivacy.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Privacy Data privacy Computer vision Benchmark testing Transformers Information filtering Task analysis Graphics and Human Computer Interfaces
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Privacy
Data privacy
Computer vision
Benchmark testing
Transformers
Information filtering
Task analysis
Graphics and Human Computer Interfaces
spellingShingle Privacy
Data privacy
Computer vision
Benchmark testing
Transformers
Information filtering
Task analysis
Graphics and Human Computer Interfaces
LI, Ming
XU, Xiangyu
FAN, Hehe
ZHOU, Pan
LIU, Jun
LIU, Jia-Wei
LI, Jiahe
KEPPO, Jussi
SHOU, Mike Zheng
YAN, Shuicheng
STPrivacy: Spatio-temporal privacy-preserving action recognition
description Existing methods of privacy-preserving action recognition (PPAR) mainly focus on frame-level (spatial) privacy removal through 2D CNNs. Unfortunately, they have two major drawbacks. First, they may compromise temporal dynamics in input videos, which are critical for accurate action recognition. Second, they are vulnerable to practical attacking scenarios where attackers probe for privacy from an entire video rather than individual frames. To address these issues, we propose a novel framework STPrivacy to perform video-level PPAR. For the first time, we introduce vision Transformers into PPAR by treating a video as a tubelet sequence, and accordingly design two complementary mechanisms, i.e., sparsification and anonymization, to remove privacy from a spatio-temporal perspective. In specific, our privacy sparsification mechanism applies adaptive token selection to abandon action-irrelevant tubelets. Then, our anonymization mechanism implicitly manipulates the remaining action-tubelets to erase privacy in the embedding space through adversarial learning. These mechanisms provide significant advantages in terms of privacy preservation for human eyes and action-privacy trade-off adjustment during deployment. We additionally contribute the first two large-scale PPAR benchmarks, VP-HMDB51 and VP-UCF101, to the community. Extensive evaluations on them, as well as two other tasks, validate the effectiveness and generalization capability of our framework.
format text
author LI, Ming
XU, Xiangyu
FAN, Hehe
ZHOU, Pan
LIU, Jun
LIU, Jia-Wei
LI, Jiahe
KEPPO, Jussi
SHOU, Mike Zheng
YAN, Shuicheng
author_facet LI, Ming
XU, Xiangyu
FAN, Hehe
ZHOU, Pan
LIU, Jun
LIU, Jia-Wei
LI, Jiahe
KEPPO, Jussi
SHOU, Mike Zheng
YAN, Shuicheng
author_sort LI, Ming
title STPrivacy: Spatio-temporal privacy-preserving action recognition
title_short STPrivacy: Spatio-temporal privacy-preserving action recognition
title_full STPrivacy: Spatio-temporal privacy-preserving action recognition
title_fullStr STPrivacy: Spatio-temporal privacy-preserving action recognition
title_full_unstemmed STPrivacy: Spatio-temporal privacy-preserving action recognition
title_sort stprivacy: spatio-temporal privacy-preserving action recognition
publisher Institutional Knowledge at Singapore Management University
publishDate 2023
url https://ink.library.smu.edu.sg/sis_research/8985
https://ink.library.smu.edu.sg/context/sis_research/article/9988/viewcontent/2023_ICCV_STPrivacy.pdf
_version_ 1814047700844281856