Tokenizing on scale. Preprocessing large text corpora on the lexical and sentence level

Bibliographic Details
Main Authors: Diewald, Nils (Author); Kupietz, Marc (Author); Lüngen, Harald (Author)
Other Authors: Klosa-Kückelhaus, Annette (Editor); Engelberg, Stefan (Editor); Möhrs, Christine (Editor); Storjohann, Petra (Editor)
Format: Conference Paper
Language: English
Published: Mannheim: IDS-Verlag, 2022
Mannheim: Leibniz-Institut für Deutsche Sprache (IDS), 2022
DOI: 10.14618/ids-pub-11146
Online Access: Resolving system, free of charge: https://doi.org/10.14618/ids-pub-11146
Resolving system, free of charge: https://nbn-resolving.org/urn:nbn:de:bsz:mh39-111464
Long-term archive, German National Library, free of charge: https://d-nb.info/127705052X/34
Publisher, free of charge: https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/11146
Author Notes: Nils Diewald, Marc Kupietz, Harald Lüngen; editors: Annette Klosa-Kückelhaus, Stefan Engelberg, Christine Möhrs, Petra Storjohann
Description
Item Description: In: Dictionaries and Society. Proceedings of the XX EURALEX International Congress, 12-16 July 2022, Mannheim, Germany. - Mannheim: IDS-Verlag, 2022, pp. 208-221. - ISBN 978-3-937241-87-6
Physical Description: Online Resource