MeLT: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance Detection

Matero, Matthew; Soni, Nikita; Balasubramanian, Niranjan; Schwartz, H. Andrew

Computer Science > Computation and Language

arXiv:2109.08113 (cs)

[Submitted on 16 Sep 2021 (v1), last revised 1 Nov 2021 (this version, v2)]

Title:MeLT: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance Detection

Authors:Matthew Matero, Nikita Soni, Niranjan Balasubramanian, H. Andrew Schwartz

View PDF

Abstract:Much of natural language processing is focused on leveraging large capacity language models, typically trained over single messages with a task of predicting one or more tokens. However, modeling human language at higher-levels of context (i.e., sequences of messages) is under-explored. In stance detection and other social media tasks where the goal is to predict an attribute of a message, we have contextual data that is loosely semantically connected by authorship. Here, we introduce Message-Level Transformer (MeLT) -- a hierarchical message-encoder pre-trained over Twitter and applied to the task of stance prediction. We focus on stance prediction as a task benefiting from knowing the context of the message (i.e., the sequence of previous messages). The model is trained using a variant of masked-language modeling; where instead of predicting tokens, it seeks to generate an entire masked (aggregated) message vector via reconstruction loss. We find that applying this pre-trained masked message-level transformer to the downstream task of stance detection achieves F1 performance of 67%.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2109.08113 [cs.CL]
	(or arXiv:2109.08113v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.08113

Submission history

From: Matthew Matero [view email]
[v1] Thu, 16 Sep 2021 17:07:45 UTC (805 KB)
[v2] Mon, 1 Nov 2021 18:42:07 UTC (994 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2109

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Niranjan Balasubramanian
H. Andrew Schwartz

export BibTeX citation

Computer Science > Computation and Language

Title:MeLT: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:MeLT: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators