Tokenizing on scale. Preprocessing large text corpora on the lexical and sentence level
Gespeichert in:
| Hauptverfasser: | , , |
|---|---|
| Weitere Verfasser: | , , , |
| Dokumenttyp: | Konferenzschrift |
| Sprache: | Englisch |
| Veröffentlicht: |
Mannheim
IDS-Verlag
2022
Mannheim Leibniz-Institut für Deutsche Sprache (IDS) 2022 |
| DOI: | 10.14618/ids-pub-11146 |
| Online-Zugang: | Resolving-System, kostenfrei: https://doi.org/10.14618/ids-pub-11146 Resolving-System, kostenfrei: https://nbn-resolving.org/urn:nbn:de:bsz:mh39-111464 Langzeitarchivierung Nationalbibliothek, kostenfrei: https://d-nb.info/127705052X/34 Verlag, kostenfrei: https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/11146 |
| Verfasserangaben: | Nils Diewald, Marc Kupietz, Harald Lüngen ; Herausgeber: Annette Klosa-Kückelhaus, Stefan Engelberg, Christine Möhrs, Petra Storjohann |
MARC
| LEADER | 00000cam a2200000 c 4500 | ||
|---|---|---|---|
| 001 | 1835135269 | ||
| 003 | DE-627 | ||
| 005 | 20231120151911.0 | ||
| 007 | cr uuu---uuuuu | ||
| 008 | 230214s2022 gw |||||o 00| ||eng c | ||
| 015 | |a 23,O02 |2 dnb | ||
| 016 | 7 | |a 127705052X |2 DE-101 | |
| 024 | 7 | |a urn:nbn:de:bsz:mh39-111464 |2 urn | |
| 024 | 7 | |a 10.14618/ids-pub-11146 |2 doi | |
| 035 | |a (DE-627)1835135269 | ||
| 035 | |a (DE-599)DNB127705052X | ||
| 035 | |a (OCoLC)1369163488 | ||
| 040 | |a DE-627 |b ger |c DE-627 |e rakwb | ||
| 041 | |a eng | ||
| 044 | |c XA-DE-BW | ||
| 082 | 0 | 4 | |a 420 |q DE-101 |
| 084 | |a 28 |2 sdnb | ||
| 084 | |a 52 |2 sdnb | ||
| 100 | 1 | |a Diewald, Nils |d 1981- |e VerfasserIn |0 (DE-588)1043078355 |0 (DE-627)770104843 |0 (DE-576)394444566 |4 aut | |
| 245 | 1 | 0 | |a Tokenizing on scale. Preprocessing large text corpora on the lexical and sentence level |c Nils Diewald, Marc Kupietz, Harald Lüngen ; Herausgeber: Annette Klosa-Kückelhaus, Stefan Engelberg, Christine Möhrs, Petra Storjohann |
| 264 | 1 | |a Mannheim |b IDS-Verlag |c 2022 | |
| 264 | 1 | |a Mannheim |b Leibniz-Institut für Deutsche Sprache (IDS) |c 2022 | |
| 300 | |a 1 Online-Ressource | ||
| 336 | |a Text |b txt |2 rdacontent | ||
| 337 | |a Computermedien |b c |2 rdamedia | ||
| 338 | |a Online-Ressource |b cr |2 rdacarrier | ||
| 500 | |a In: Dictionaries and Society. Proceedings of the XX EURALEX International Congress, 12-16 July 2022, Mannheim, Germany. - Mannheim : IDS-Verlag, 2022, S. 208-221. - ISBN 978-3-937241-87-6 | ||
| 583 | 1 | |a Archivierung/Langzeitarchivierung gewährleistet |2 pdager |5 DE-101 | |
| 700 | 1 | |a Kupietz, Marc |d 1971- |e VerfasserIn |0 (DE-588)1023035693 |0 (DE-627)717351920 |0 (DE-576)308136020 |4 aut | |
| 700 | 1 | |a Lüngen, Harald |e VerfasserIn |0 (DE-588)1058599186 |0 (DE-627)797211527 |0 (DE-576)180662341 |4 aut | |
| 700 | 1 | |a Klosa-Kückelhaus, Annette |e HerausgeberIn |4 edt | |
| 700 | 1 | |a Engelberg, Stefan |e HerausgeberIn |4 edt | |
| 700 | 1 | |a Möhrs, Christine |e HerausgeberIn |4 edt | |
| 700 | 1 | |a Storjohann, Petra |e HerausgeberIn |4 edt | |
| 856 | 4 | 0 | |u https://doi.org/10.14618/ids-pub-11146 |x Resolving-System |z kostenfrei |
| 856 | 4 | 0 | |u https://nbn-resolving.org/urn:nbn:de:bsz:mh39-111464 |x Resolving-System |z kostenfrei |
| 856 | 4 | 0 | |u https://d-nb.info/127705052X/34 |x Langzeitarchivierung Nationalbibliothek |z kostenfrei |
| 856 | 4 | 0 | |u https://ids-pub.bsz-bw.de/frontdoor/index/index/docId/11146 |q application/pdf |x Verlag |z kostenfrei |
| 951 | |a BO | ||
| 992 | |a 20231120 | ||
| 993 | |a ConferencePaper | ||
| 994 | |a 2022 | ||
| 998 | |g 1023035693 |a Kupietz, Marc |m 1023035693:Kupietz, Marc |d 90000 |d 90500 |e 90000PK1023035693 |e 90500PK1023035693 |k 0/90000/ |k 1/90000/90500/ |p 2 | ||
| 999 | |a KXP-PPN1835135269 |e 4414013216 | ||
| BIB | |a Y | ||
| JSO | |a {"note":["In: Dictionaries and Society. Proceedings of the XX EURALEX International Congress, 12-16 July 2022, Mannheim, Germany. - Mannheim : IDS-Verlag, 2022, S. 208-221. - ISBN 978-3-937241-87-6"],"physDesc":[{"extent":"1 Online-Ressource"}],"id":{"doi":["10.14618/ids-pub-11146"],"uri":["urn:nbn:de:bsz:mh39-111464"],"eki":["1835135269"]},"type":{"bibl":"book","media":"Online-Ressource"},"title":[{"title_sort":"Tokenizing on scale. Preprocessing large text corpora on the lexical and sentence level","title":"Tokenizing on scale. Preprocessing large text corpora on the lexical and sentence level"}],"language":["eng"],"person":[{"family":"Diewald","given":"Nils","display":"Diewald, Nils","role":"aut"},{"family":"Kupietz","display":"Kupietz, Marc","given":"Marc","role":"aut"},{"role":"aut","given":"Harald","display":"Lüngen, Harald","family":"Lüngen"},{"role":"edt","family":"Klosa-Kückelhaus","given":"Annette","display":"Klosa-Kückelhaus, Annette"},{"role":"edt","family":"Engelberg","display":"Engelberg, Stefan","given":"Stefan"},{"family":"Möhrs","display":"Möhrs, Christine","given":"Christine","role":"edt"},{"family":"Storjohann","display":"Storjohann, Petra","given":"Petra","role":"edt"}],"origin":[{"publisher":"IDS-Verlag ; Leibniz-Institut für Deutsche Sprache (IDS)","dateIssuedDisp":"2022","dateIssuedKey":"2022","publisherPlace":"Mannheim ; Mannheim"}],"name":{"displayForm":["Nils Diewald, Marc Kupietz, Harald Lüngen ; Herausgeber: Annette Klosa-Kückelhaus, Stefan Engelberg, Christine Möhrs, Petra Storjohann"]},"recId":"1835135269"} | ||
| SRT | |a DIEWALDNILTOKENIZING2022 | ||