Surgical activity triplet recognition via triplet disentanglement

Including context-aware decision support in the operating room has the potential to improve surgical safety and efficiency by utilizing real-time feedback obtained from surgical workflow analysis. In this task, recognizing each surgical activity in the endoscopic video as a triplet instrument, verb,...

Full description

Saved in:

Bibliographic Details
Main Authors:	CHEN, Yiliang, HE, Shengfeng, JIN, Yueming, QIN, Jing
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2023
Subjects:	Triplet disentanglement Surgical activity recognition Endoscopic videos Pattern recognition Analytical, Diagnostic and Therapeutic Techniques and Equipment Artificial Intelligence and Robotics Graphics and Human Computer Interfaces
Online Access:	https://ink.library.smu.edu.sg/sis_research/8498
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-9501
record_format	dspace
spelling	sg-smu-ink.sis_research-95012024-01-04T04:18:03Z Surgical activity triplet recognition via triplet disentanglement CHEN, Yiliang HE, Shengfeng JIN, Yueming QIN, Jing Including context-aware decision support in the operating room has the potential to improve surgical safety and efficiency by utilizing real-time feedback obtained from surgical workflow analysis. In this task, recognizing each surgical activity in the endoscopic video as a triplet instrument, verb, target> is crucial, as it helps to ensure actions occur only after an instrument is present. However, recognizing the states of these three components in one shot poses extra learning ambiguities, as the triplet supervision is highly imbalanced (positive when all components are correct). To remedy this issue, we introduce a triplet disentanglement framework for surgical action triplet recognition, which decomposes the learning objectives to reduce learning difficulties. Particularly, our network decomposes the recognition of triplet into five complementary and simplified sub-networks. While the first sub-network converts the detection into a numerical supplementary task predicting the existence/number of three components only, the second focuses on the association between them, and the other three predict the components individually. In this way, triplet recognition is decoupled in a progressive, easy-to-difficult manner. In addition, we propose a hierarchical training schedule as a way to decompose the difficulty of the task further. Our model first creates several bridges and then progressively identifies the final key task step by step, rather than explicitly identifying surgical activity. Our proposed method has been demonstrated to surpass current state-of-the-art approaches on the CholecT45 endoscopic video dataset. 2023-10-12T07:00:00Z text https://ink.library.smu.edu.sg/sis_research/8498 info:doi/10.1007/978-3-031-43996-4_43 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Triplet disentanglement Surgical activity recognition Endoscopic videos Pattern recognition Analytical, Diagnostic and Therapeutic Techniques and Equipment Artificial Intelligence and Robotics Graphics and Human Computer Interfaces
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Triplet disentanglement Surgical activity recognition Endoscopic videos Pattern recognition Analytical, Diagnostic and Therapeutic Techniques and Equipment Artificial Intelligence and Robotics Graphics and Human Computer Interfaces
spellingShingle	Triplet disentanglement Surgical activity recognition Endoscopic videos Pattern recognition Analytical, Diagnostic and Therapeutic Techniques and Equipment Artificial Intelligence and Robotics Graphics and Human Computer Interfaces CHEN, Yiliang HE, Shengfeng JIN, Yueming QIN, Jing Surgical activity triplet recognition via triplet disentanglement
description	Including context-aware decision support in the operating room has the potential to improve surgical safety and efficiency by utilizing real-time feedback obtained from surgical workflow analysis. In this task, recognizing each surgical activity in the endoscopic video as a triplet instrument, verb, target> is crucial, as it helps to ensure actions occur only after an instrument is present. However, recognizing the states of these three components in one shot poses extra learning ambiguities, as the triplet supervision is highly imbalanced (positive when all components are correct). To remedy this issue, we introduce a triplet disentanglement framework for surgical action triplet recognition, which decomposes the learning objectives to reduce learning difficulties. Particularly, our network decomposes the recognition of triplet into five complementary and simplified sub-networks. While the first sub-network converts the detection into a numerical supplementary task predicting the existence/number of three components only, the second focuses on the association between them, and the other three predict the components individually. In this way, triplet recognition is decoupled in a progressive, easy-to-difficult manner. In addition, we propose a hierarchical training schedule as a way to decompose the difficulty of the task further. Our model first creates several bridges and then progressively identifies the final key task step by step, rather than explicitly identifying surgical activity. Our proposed method has been demonstrated to surpass current state-of-the-art approaches on the CholecT45 endoscopic video dataset.
format	text
author	CHEN, Yiliang HE, Shengfeng JIN, Yueming QIN, Jing
author_facet	CHEN, Yiliang HE, Shengfeng JIN, Yueming QIN, Jing
author_sort	CHEN, Yiliang
title	Surgical activity triplet recognition via triplet disentanglement
title_short	Surgical activity triplet recognition via triplet disentanglement
title_full	Surgical activity triplet recognition via triplet disentanglement
title_fullStr	Surgical activity triplet recognition via triplet disentanglement
title_full_unstemmed	Surgical activity triplet recognition via triplet disentanglement
title_sort	surgical activity triplet recognition via triplet disentanglement
publisher	Institutional Knowledge at Singapore Management University
publishDate	2023
url	https://ink.library.smu.edu.sg/sis_research/8498
_version_	1787590780836118528

Surgical activity triplet recognition via triplet disentanglement

Similar Items