Injecting descriptive meta-information into pre-trained language models with hypernetworks
Pre-trained language models have been widely adopted as backbones in various natural language processing tasks. However, existing pre-trained language models ignore descriptive meta-information in the text, such as the distinction between the title and the main body, leading to over-weighted atten...
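The abstract names hypernetworks as the mechanism for injecting meta-information such as the title/body distinction; the paper's actual method is in the full text linked below. As a rough illustration only, the following PyTorch sketch shows the general hypernetwork idea, a small network that generates per-token adapter weights conditioned on a meta-information embedding. All class and parameter names here are hypothetical and are not taken from the paper.

```python
import torch
import torch.nn as nn

class MetaHyperNetwork(nn.Module):
    """Illustrative sketch (not the paper's implementation): a hypernetwork
    that maps a meta-information embedding (e.g. 0=title, 1=body) to the
    weights of a small per-token linear adapter."""
    def __init__(self, num_meta_types: int, meta_dim: int, hidden_dim: int):
        super().__init__()
        self.meta_embedding = nn.Embedding(num_meta_types, meta_dim)
        # Hypernetwork: generates the flattened weight matrix of a
        # hidden_dim x hidden_dim adapter from the meta embedding.
        self.weight_generator = nn.Linear(meta_dim, hidden_dim * hidden_dim)
        self.hidden_dim = hidden_dim

    def forward(self, token_states: torch.Tensor, meta_ids: torch.Tensor) -> torch.Tensor:
        # token_states: (batch, seq_len, hidden_dim) hidden states from a
        #               pre-trained language model backbone
        # meta_ids:     (batch, seq_len) meta-information type per token
        meta_emb = self.meta_embedding(meta_ids)             # (B, T, meta_dim)
        w = self.weight_generator(meta_emb)                  # (B, T, H*H)
        w = w.view(*meta_ids.shape, self.hidden_dim, self.hidden_dim)
        # Apply the generated per-token adapter: x' = x + W(meta) @ x
        adapted = torch.einsum('bthk,btk->bth', w, token_states)
        return token_states + adapted

# Usage: adapt hidden states with hypothetical title/body meta-information.
hyper = MetaHyperNetwork(num_meta_types=2, meta_dim=32, hidden_dim=64)
states = torch.randn(2, 10, 64)
meta = torch.zeros(2, 10, dtype=torch.long)
meta[:, 3:] = 1                  # first 3 tokens are title, rest are body
out = hyper(states, meta)        # (2, 10, 64)
```

The residual form (adding the adapter output to the original states) is one common way such generated weights are folded back into a frozen backbone without overwriting its pre-trained behavior; the paper may make different design choices.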
Main Authors: DUAN, Wenying; HE, Xiaoxi; ZHOU, Zimu; RAO, Hong; THIELE, Lothar
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University, 2021
Online Access: https://ink.library.smu.edu.sg/sis_research/6237
https://ink.library.smu.edu.sg/context/sis_research/article/7240/viewcontent/Injecting_Descriptive_Meta_Information_into_Pre_Trained_Language_Models_with_Hypernetworks.pdf
Institution: Singapore Management University
Similar Items
- Generative AI art - hypernetworks
  by: Chee, Mei Qi
  Published: (2024)
- Using pre-trained models for vision-language understanding tasks
  by: CAO, Rui
  Published: (2024)
- On the transferability of pre-trained language models for low-resource programming languages
  by: CHEN, Fuxiang, et al.
  Published: (2022)
- p-Meta: Towards on-device deep model adaptation
  by: QU, Zhongnan, et al.
  Published: (2022)
- Pre-training graph transformer with multimodal side information for recommendation
  by: Liu, Yong, et al.
  Published: (2022)