A flexible, corpus-driven model of regular and inverse selectional preferences

Bibliographic Details
Main Authors: Erk, Katrin (Author), Padó, Sebastian (Author), Padó, Ulrike (Author)
Format: Article (Journal)
Language: English
Published: December 01, 2010
In: Computational linguistics
Year: 2010, Volume: 36, Issue: 4, Pages: 723-763
ISSN: 1530-9312
DOI: 10.1162/coli_a_00017
Online Access: Publisher, license required, full text: https://doi.org/10.1162/coli_a_00017
Author Notes: Katrin Erk, Sebastian Padó, Ulrike Padó
Description
Summary: We present a vector space-based model for selectional preferences that predicts plausibility scores for argument headwords. It does not require any lexical resources (such as WordNet). It can be trained either on one corpus with syntactic annotation, or on a combination of a small semantically annotated primary corpus and a large, syntactically analyzed generalization corpus. Our model is able to predict inverse selectional preferences, that is, plausibility scores for predicates given argument heads. We evaluate our model on one NLP task (pseudo-disambiguation) and one cognitive task (prediction of human plausibility judgments), gauging the influence of different parameters and comparing our model against other model classes. We obtain consistent benefits from using the disambiguation and semantic role information provided by a semantically tagged primary corpus. As for parameters, we identify settings that yield good performance across a range of experimental conditions. However, frequency remains a major influence on prediction quality, and we also identify more robust parameter settings suitable for applications with many infrequent items.
Item Description: Viewed on 27.03.2023
Physical Description: Online Resource