MLP-3D: A MLP-like 3D architecture with grouped time mixing

MLP-3D: A MLP-like 3D architecture with grouped time mixing

Convolutional Neural Networks (CNNs) have been re-garded as the go-to models for visual recognition. More re-cently, convolution-free networks, based on multi-head self-attention (MSA) or multi-layer perceptrons (MLPs), become more and more popular. Nevertheless, it is not trivial when utilizing the...

Full description

Saved in:

Bibliographic Details
Main Authors:	QIU, Zhaofan, YAO, Ting, NGO, Chong-wah, MEI, Tao
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2022
Subjects:	Artificial Intelligence and Robotics Graphics and Human Computer Interfaces
Online Access:	https://ink.library.smu.edu.sg/sis_research/7505 https://ink.library.smu.edu.sg/context/sis_research/article/8508/viewcontent/Qiu_MLP_3D_A_MLP_Like_3D_Architecture_With_Grouped_Time_Mixing_CVPR_2022_paper.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

Similar Items

PosMLP-Video: Spatial and temporal relative position encoding for efficient video recognition
by: HAO, Yanbin, et al.
Published: (2024)

Dynamic temporal filtering in video models
by: LONG, Fuchen, et al.
Published: (2022)

Architecture analysis of MLP by geometrical interpretation
by: Xiang, C., et al.
Published: (2014)

Geometrical interpretation and architecture selection of MLP
by: Xiang, C., et al.
Published: (2014)

Global context aware convolutions for 3D point cloud understanding
by: ZHANG, Zhiyuan, et al.
Published: (2020)

Test-time augmentation for 3D point cloud classification and segmentation
by: VU, Tuan-Anh, et al.
Published: (2024)

Condensing a sequence to one informative frame for video recognition
by: QIU. Zhaofan,, et al.
Published: (2021)

Hi3D: Pursuing high-resolution image-to-3D generation with video diffusion models
by: YANG, Haibo, et al.
Published: (2024)

Learning spatio-temporal representation with local and global diffusion
by: QIU, Zhaofan, et al.
Published: (2019)

Zero-shot ingredient recognition by multi-relational graph convolutional network
by: CHEN, Jingjing, et al.
Published: (2020)

Transferring and regularizing prediction for semantic segmentation
by: ZHANG, Yiheng, et al.
Published: (2020)

Consistent3D: Towards consistent high-fidelity text-to-3D generation with deterministic sampling prior
by: WU, Zike, et al.
Published: (2024)

D3still : Decoupled differential distillation for asymmetric image retrieval
by: XIE, Yi, et al.
Published: (2024)

Improved spin images for 3D surface matching using signed angles
by: ZHANG, Zhiyuan, et al.
Published: (2012)

Feature selection via sensitivity analysis of MLP probabilistic outputs
by: Yang, J.-B., et al.
Published: (2014)

Self-evolving multi-layer perceptron (seMLP) with its applications in trend reversals & technical trading indicators
by: Seow, Wen Jun
Published: (2019)

Optimization planning for 3D ConvNets
by: QIU, Zhaofan, et al.
Published: (2021)

Diffusion time-step curriculum for one image to 3D generation
by: YI, Xuanyu, et al.
Published: (2024)

Software size estimation in design phase based on MLP neural network
by: Benjamas Panyangam, et al.
Published: (2018)

Decision support for the stocks trading using MLP and data mining techniques
by: Narissara Eiamkanitchat, et al.
Published: (2018)

Decision support for the stocks trading using MLP and data mining techniques
by: Eiamkanitchat N., et al.
Published: (2017)

Software size estimation in design phase based on MLP neural network
by: Panyangam B., et al.
Published: (2017)

ATM TRAFFIC MANAGEMENT USING MLP NEURAL NETWORKS AND FUZZY CONTROLLERS
by: NELSON NG ONN LUM
Published: (2020)

Wave-ViT: Unifying wavelet and transformers for visual representation learning
by: YAO, Ting, et al.
Published: (2022)

Towards improving system performance in large scale multi-agent systems with selfish agents
by: KUMAR, Rajiv Ranjan
Published: (2022)

Visual Commonsense R-CNN
by: WANG, Tan, et al.
Published: (2020)

Knowledge-aware multimodal fashion chatbot
by: LIAO, Lizi, et al.
Published: (2018)

Debiasing NLU models via causal intervention and counterfactual reasoning
by: TIAN, Bing, et al.
Published: (2022)

Gesture enhanced comprehension of ambiguous human-to-robot instructions
by: WEERAKOON MUDIYANSELAGE DULANGA KAVEESHA WEERAKOON,, et al.
Published: (2020)

Self-trained deep ordinal regression for end-to-end video anomaly detection
by: PANG, Guansong, et al.
Published: (2020)

Self-supervised multi-class pre-training for unsupervised anomaly detection and segmentation in medical images
by: TIAN, Yu, et al.
Published: (2021)

Edgeduet: Tiling small object detection for edge assisted autonomous mobile vision
by: WANG, Xu, et al.
Published: (2021)

Symmetry robust descriptor for non-rigid surface matching
by: ZHANG, Zhiyuan, et al.
Published: (2013)

Pixel-wise energy-biased abstention learning for anomaly segmentation on complex urban driving scenes
by: TIAN, Yu, et al.
Published: (2022)

Reducing adaptation latency for multi-concept visual perception in outdoor environments
by: WIGNESS, Maggie, et al.
Published: (2016)

Feature prediction diffusion model for video anomaly detection
by: YAN, Cheng, et al.
Published: (2023)

GDFace: Gated deformation for multi-view face image synthesis
by: XU, Xuemiao, et al.
Published: (2020)

Adversarial meta sampling for multilingual low-resource speech recognition
by: XIAO, Yubei, et al.
Published: (2021)

How important is the train-validation split in meta-learning?
by: BAI, Yu, et al.
Published: (2021)

Outlier-robust tensor PCA
by: ZHOU, Pan, et al.
Published: (2016)