COINS: dynamically generating COntextualized Inference rules for Narrative Story completion

Despite recent successes of large pre-trained language models in solving reasoning tasks, their inference capabilities remain opaque. We posit that such models can be made more interpretable by explicitly generating interim inference rules, and using them to guide the generation of task-specific tex...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Paul, Debjit (VerfasserIn) , Frank, Anette (VerfasserIn)
Dokumenttyp: Kapitel/Artikel Konferenzschrift
Sprache:Englisch
Veröffentlicht: August 2021
In: The 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing - proceedings of the conference ; Vol. 1: Long papers
Year: 2021, Pages: 5086-5099
DOI:10.18653/v1/2021.acl-long.395
Online-Zugang:Verlag, lizenzpflichtig, Volltext: https://doi.org/10.18653/v1/2021.acl-long.395
Verlag, lizenzpflichtig, Volltext: https://aclanthology.org/2021.acl-long.395
Volltext
Verfasserangaben:Debjit Paul, Anette Frank
Beschreibung
Zusammenfassung:Despite recent successes of large pre-trained language models in solving reasoning tasks, their inference capabilities remain opaque. We posit that such models can be made more interpretable by explicitly generating interim inference rules, and using them to guide the generation of task-specific textual outputs. In this paper we present Coins, a recursive inference framework that i) iteratively reads context sentences, ii) dynamically generates contextualized inference rules, encodes them, and iii) uses them to guide task-specific output generation. We apply to a Narrative Story Completion task that asks a model to complete a story with missing sentences, to produce a coherent story with plausible logical connections, causal relationships, and temporal dependencies. By modularizing inference and sentence generation steps in a recurrent model, we aim to make reasoning steps and their effects on next sentence generation transparent. Our automatic and manual evaluations show that the model generates better story sentences than SOTA baselines, especially in terms of coherence. We further demonstrate improved performance over strong pre-trained LMs in generating commonsense inference rules. The recursive nature of holds the potential for controlled generation of longer sequences.
Beschreibung:Gesehen am 10.07.2023
Beschreibung:Online Resource
ISBN:9781954085527
DOI:10.18653/v1/2021.acl-long.395