Design and realization of a modular architecture for textual entailment

A key challenge at the core of many Natural Language Processing (NLP) tasks is the ability to determine which conclusions can be inferred from a given natural language text. This problem, called the Recognition of Textual Entailment (RTE), has initiated the development of a range of algorithms, meth...

Full description

Saved in:
Bibliographic Details
Main Authors: Padó, Sebastian (Author) , Noh, Tae-Gil (Author) , Stern, Asher (Author) , Wang, Rui (Author) , Zanoli, Roberto (Author)
Format: Article (Journal)
Language:English
Published: 2015
In: Natural language engineering
Year: 2013, Volume: 21, Issue: 2, Pages: 167-200
ISSN:1469-8110
DOI:10.1017/S1351324913000351
Online Access:Verlag, lizenzpflichtig, Volltext: https://doi.org/10.1017/S1351324913000351
Verlag, lizenzpflichtig, Volltext: https://www.cambridge.org/core/journals/natural-language-engineering/article/design-and-realization-of-a-modular-architecture-for-textual-entailment/39234E9C6213B3E99C2F0CB3AD57B9BA
Get full text
Author Notes:Sebastian Padó, Tae-Gil Noh, Asher Stern, Rui Wang and Roberto Zanoli
Description
Summary:A key challenge at the core of many Natural Language Processing (NLP) tasks is the ability to determine which conclusions can be inferred from a given natural language text. This problem, called the Recognition of Textual Entailment (RTE), has initiated the development of a range of algorithms, methods, and technologies. Unfortunately, research on Textual Entailment (TE), like semantics research more generally, is fragmented into studies focussing on various aspects of semantics such as world knowledge, lexical and syntactic relations, or more specialized kinds of inference. This fragmentation has problematic practical consequences. Notably, interoperability among the existing RTE systems is poor, and reuse of resources and algorithms is mostly infeasible. This also makes systematic evaluations very difficult to carry out. Finally, textual entailment presents a wide array of approaches to potential end users with little guidance on which to pick. Our contribution to this situation is the novel EXCITEMENT architecture, which was developed to enable and encourage the consolidation of methods and resources in the textual entailment area. It decomposes RTE into components with strongly typed interfaces. We specify (a) a modular linguistic analysis pipeline and (b) a decomposition of the ‘core’ RTE methods into top-level algorithms and subcomponents. We identify four major subcomponent types, including knowledge bases and alignment methods. The architecture was developed with a focus on generality, supporting all major approaches to RTE and encouraging language independence. We illustrate the feasibility of the architecture by constructing mappings of major existing systems onto the architecture. The practical implementation of this architecture forms the EXCITEMENT open platform. It is a suite of textual entailment algorithms and components which contains the three systems named above, including linguistic-analysis pipelines for three languages (English, German, and Italian), and comprises a number of linguistic resources. By addressing the problems outlined above, the platform provides a comprehensive and flexible basis for research and experimentation in textual entailment and is available as open source software under the GNU General Public License.
Item Description:First published online 13 December 2013
Gesehen am 17.07.2020
Physical Description:Online Resource
ISSN:1469-8110
DOI:10.1017/S1351324913000351