Tokenizing on scale. Preprocessing large text corpora on the lexical and sentence level
| Main Authors: | Nils Diewald, Marc Kupietz, Harald Lüngen |
|---|---|
| Other Authors: | Annette Klosa-Kückelhaus, Stefan Engelberg, Christine Möhrs, Petra Storjohann |
| Format: | Conference Paper |
| Language: | English |
| Published: | Mannheim: IDS-Verlag, 2022; Mannheim: Leibniz-Institut für Deutsche Sprache (IDS), 2022 |
| DOI: | 10.14618/ids-pub-11146 |
| Online Access: | Resolving system, free of charge: https://doi.org/10.14618/ids-pub-11146; Resolving system, free of charge: https://nbn-resolving.org/urn:nbn:de:bsz:mh39-111464; Long-term archive of the German National Library, free of charge: https://d-nb.info/127705052X/34; Publisher, free of charge: https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/11146 |
| Author Notes: | Nils Diewald, Marc Kupietz, Harald Lüngen; Editors: Annette Klosa-Kückelhaus, Stefan Engelberg, Christine Möhrs, Petra Storjohann |
| Item Description: | In: Dictionaries and Society. Proceedings of the XX EURALEX International Congress, 12-16 July 2022, Mannheim, Germany. Mannheim: IDS-Verlag, 2022, pp. 208-221. ISBN 978-3-937241-87-6 |
| Physical Description: | Online Resource |