Towards temporal sentence grounding in videos

Temporal sentence grounding in videos (TSGV), a.k.a., natural language video localization (NLVL) or video moment retrieval (VMR), aims to retrieve a temporal moment (i.e., a fraction of a video) that semantically corresponds to a language query from an untrimmed video. Connecting computer vision and...

Bibliographic Details
Main Author:	Zhang, Hao
Other Authors:	Sun Aixin
Format:	Thesis-Doctor of Philosophy
Language:	English
Published:	Nanyang Technological University 2022
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence; Engineering::Computer science and engineering::Computer applications; Engineering::Computer science and engineering::Computing methodologies::Document and text processing; Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Online Access:	https://hdl.handle.net/10356/163788

Similar Items

Towards humanized open-domain conversational agents
by: Zhong, Peixiang
Published: (2021)

Video-based traffic analysis
by: Fong, Hao Wei
Published: (2021)

Image and video super-resolution in the wild
by: Chan, Kelvin Cheuk Kit
Published: (2022)

Radio-frequency (RF) sensing for deep awareness of human physical status
by: Loe, Daniel Kit Leong
Published: (2022)

Application of machine learning in the forecast of stock index
by: Sugianto, Jason Jonathan
Published: (2022)

Radio-frequency (RF) sensing for deep awareness of human physical status
by: Muhammad Nasran Hamza
Published: (2022)

Deep-learning for conversational speech using semantic textual analysis
by: Suthakar, Shiny Gladdys
Published: (2022)

Hierarchical document representation for summarization
by: Tey, Rui Jie
Published: (2022)

Explainable importance ranking of research paper (NVIDIA)
by: You, Yatao
Published: (2022)

Keyword and named entity recognition on air traffic control text
by: Tay, Nikole Qiwei
Published: (2020)

Training deep network models for accurate recognition of texts in scene images
by: Chen, Pengfei
Published: (2021)

Structured pointing networks for natural language understanding
by: Nguyen, Thanh Tung
Published: (2021)

Keyword and named entity recognition on air traffic control (ATC) data
by: Thia, Jeremy Ming Xuan
Published: (2019)

Improving LSTM price prediction of Bitcoin with sentiment analysis of Twitter post
by: Tu, Xianan
Published: (2023)

Building generalizable models for discourse phenomena evaluation and machine translation
by: Jwalapuram, Prathyusha
Published: (2023)

Improving spam detection on Twitter using deep learning
by: Ng, Yi Rong
Published: (2021)

Aspect-based sentiment analysis for user profiles
by: Ng, Zhiyong
Published: (2021)

Detecting hazardous events from online news and social media
by: Liu, Zinan
Published: (2023)

Deep learning for optical character recognition in online images
by: Lim, Yi Xian
Published: (2023)

Deep learning techniques for text classification
by: Raihan, Diardano
Published: (2021)

Deep learning-based automatic document categorization and organization
by: Foo, Shawn Nicholas Say Yan
Published: (2021)

Human pose estimation and action recognition based on monocular video inputs
by: Leong, Mei Chee
Published: (2020)

Towards interpretable & robust face recognition
by: Pattra, Surya Paryanta
Published: (2022)

Towards interpretable & robust occluded facial recognition
by: Rachita, Agrawal
Published: (2023)

Towards deep neural networks robust to adversarial examples
by: Matyasko, Alexander
Published: (2020)

Towards unbiased visual language reasoning and consistent segmentation
by: Huang, Jianqiang
Published: (2023)

Motion analysis of temporal features in video surveillance
by: Yuan, Kirsten Shaoqing
Published: (2009)

Building SenticNet 7
by: Perh, Zhi Hao
Published: (2021)

Deep learning based car license plate recognition
by: Ngo, Jason Jun Hao
Published: (2021)

Semantic representation learning for natural language understanding
by: Zhang, Yong
Published: (2018)

Deep reinforcement learning for intractable routing & inverse problems
by: Zhang, Rongkai
Published: (2023)

Contrastive knowledge transfer from CLIP for open vocabulary object detection
by: Zhang, Chuhan
Published: (2023)

Exploring versatile neural architectures across modalities and perception tasks
by: Zhang, Wenwei
Published: (2023)

Graph neural networks for questions and answers
by: Mah, Caleb
Published: (2019)

Natural language translation with graph convolutional neural network
by: Zhu, Yimin
Published: (2018)

English exam question answering using a deep learning model
by: Sokhonn, Rainy
Published: (2018)

A hybrid approach for real-time sentiment analysis & visualization of tweets in Singlish
by: Lim, Michelle Shi Hui
Published: (2018)

Event detection from social media on COVID-19
by: Ho, Yin Wee
Published: (2022)

Event detection for biomedical text
by: Pham, Nguyen Minh Thu
Published: (2022)

BERT named entity recognition on emergency response system
by: Chua, Clarita Wyn Kay
Published: (2022)