End-to-End Multimodal Fact-Checking and Explanation Generation: A Challenging Dataset and Models

dc.contributor.authorYao, Barryen
dc.contributor.authorShah, Adityaen
dc.contributor.authorSun, Lichaoen
dc.contributor.authorCho, Jin-Heeen
dc.contributor.authorHuang, Lifuen
dc.date.accessioned2023-08-02T17:46:48Zen
dc.date.available2023-08-02T17:46:48Zen
dc.date.issued2023-07-19en
dc.date.updated2023-08-01T07:57:47Zen
dc.description.abstractWe propose end-to-end multimodal fact-checking and explanation generation, where the input is a claim and a large collection of web sources, including articles, images, videos, and tweets, and the goal is to assess the truthfulness of the claim by retrieving relevant evidence and predicting a truthfulness label (e.g., support, refute or not enough information), and to generate a statement to summarize and explain the reasoning and ruling process. To support this research, we construct Mocheg, a large-scale dataset consisting of 15,601 claims where each claim is annotated with a truthfulness label and a ruling statement, and 33,880 textual paragraphs and 12,112 images in total as evidence. To establish baseline performances on Mocheg, we experiment with several state-of-the-art neural architectures on the three pipelined subtasks: multimodal evidence retrieval, claim verification, and explanation generation, and demonstrate that the performance of the state-of-the-art end-to-end multimodal factchecking does not provide satisfactory outcomes. To the best of our knowledge, we are the first to build the benchmark dataset and solutions for end-to-end multimodal fact-checking and explanation generation. The dataset, source code and model checkpoints are available at https://github.com/VT-NLP/Mocheg.en
dc.description.versionPublished versionen
dc.format.mimetypeapplication/pdfen
dc.identifier.doihttps://doi.org/10.1145/3539618.3591879en
dc.identifier.urihttp://hdl.handle.net/10919/115965en
dc.language.isoenen
dc.publisherACMen
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.holderThe author(s)en
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.titleEnd-to-End Multimodal Fact-Checking and Explanation Generation: A Challenging Dataset and Modelsen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
3539618.3591879.pdf
Size:
3 MB
Format:
Adobe Portable Document Format
Description:
Published version
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description: