TV series as disseminators of emerging vocabulary: non-codified expressions in the TV Corpus

This study presents a method for identifying words that appear in corpus data earlier than their first date of attestation in dictionaries. We demonstrate the application of this method based on a large diachronic corpus, the TV Corpus, and the Oxford English Dictionary (OED). Combining automatic ex...

Full description

Saved in:
Bibliographic Details
Main Authors: Landert, Daniela (Author) , Säily, Tanja (Author) , Hämäläinen, Mika (Author)
Format: Article (Journal)
Language:English
Published: May 2023
In: ICAME journal
Year: 2023, Volume: 47, Issue: 1, Pages: 63-79
ISSN:1502-5462
DOI:10.2478/icame-2023-0004
Online Access:Resolving-System, kostenfrei, Volltext: https://doi.org/10.2478/icame-2023-0004
Verlag, kostenfrei, Volltext: https://sciendo.com/article/10.2478/icame-2023-0004
Get full text
Author Notes:Daniela Landert, Tanja Säily, Mika Hämäläinen
Description
Summary:This study presents a method for identifying words that appear in corpus data earlier than their first date of attestation in dictionaries. We demonstrate the application of this method based on a large diachronic corpus, the TV Corpus, and the Oxford English Dictionary (OED). Combining automatic extraction of candidate terms from the TV Corpus with comprehensive manual analysis and verification, the method identifies 32 words that were used in TV series before their first attestation in the OED. We present a detailed discussion of these words, analysing their distribution across decades and genres of the TV Corpus, their origins, semantic domains and word-formation processes. We also present extracts with their first uses in the TV Corpus and analyse how the words were presented to the large and anonymous mass audience. Our study shows that the method we present is suitable for identifying early attestations of words in large corpora, even though in the case of the TV Corpus, a great deal of manual analysis and verification is needed. In addition, we argue that TV series and other types of fictional texts are an important resource for studying the coinage and spread of terms, due to their function and the fact that they address a mass audience.
Item Description:Online veröffentlicht: 30. April 2023
Gesehen am 04.04.2024
Physical Description:Online Resource
ISSN:1502-5462
DOI:10.2478/icame-2023-0004