Using comparable corpora for under-resourced areas of machine translation

This book provides an overview of how comparable corpora can be used to overcome the lack of parallel resources when building machine translation systems for under-resourced languages and domains. It presents a wealth of methods and open tools for building comparable corpora from the Web, evaluating...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Weitere Verfasser: Skadiņa, Inguna (HerausgeberIn) , Gaizauskas, Robert (HerausgeberIn) , Babych, Bogdan (HerausgeberIn) , Ljubešić, Nikola (HerausgeberIn) , Tufiş, Dan (HerausgeberIn) , Vasiļjevs, Andrejs (HerausgeberIn)
Dokumenttyp: Konferenzschrift
Sprache:Englisch
Veröffentlicht: Cham Springer International Publishing 2019
Schriftenreihe:Theory and Applications of Natural Language Processing
SpringerLink Bücher
DOI:10.1007/978-3-319-99004-0
Schlagworte:
Online-Zugang:Verlag, lizenzpflichtig, Volltext: https://doi.org/10.1007/978-3-319-99004-0
Resolving-System, Volltext: http://dx.doi.org/10.1007/978-3-319-99004-0
Volltext
Verfasserangaben:edited by Inguna Skadiņa, Robert Gaizauskas, Bogdan Babych, Nikola Ljubešić, Dan Tufiş, Andrejs Vasiļjevs

MARC

LEADER 00000cam a2200000 c 4500
001 1067369147
003 DE-627
005 20250426182358.0
007 cr uuu---uuuuu
008 190301s2019 gw |||||o 00| ||eng c
020 |a 9783319990040  |9 978-3-319-99004-0 
020 |a 9783319990040  |9 978-3-319-99004-0 
024 7 |a 10.1007/978-3-319-99004-0  |2 doi 
035 |a (DE-627)1067369147 
035 |a (DE-576)518432343 
035 |a (DE-599)GBV1067369147 
035 |a (DE-He213)978-3-319-99004-0 
035 |a (DE-627-1)061088323 
040 |a DE-627  |b ger  |c DE-627  |e rda 
041 |a eng 
044 |c XA-DE 
050 0 |a QA76.9.N38 
072 7 |a UYQL  |2 thema 
072 7 |a UYQL  |2 bicssc 
072 7 |a COM073000  |2 bisacsh 
082 0 |a 006.35 
084 |a 51  |2 sdnb 
084 |a 17.45  |2 bkl 
084 |a 18.00  |2 bkl 
245 0 0 |a Using comparable corpora for under-resourced areas of machine translation  |c edited by Inguna Skadiņa, Robert Gaizauskas, Bogdan Babych, Nikola Ljubešić, Dan Tufiş, Andrejs Vasiļjevs 
264 1 |a Cham  |b Springer International Publishing  |c 2019 
300 |a Online-Ressource (VI, 323 p. 63 illus., 39 illus. in color, online resource) 
336 |a Text  |b txt  |2 rdacontent 
337 |a Computermedien  |b c  |2 rdamedia 
338 |a Online-Ressource  |b cr  |2 rdacarrier 
490 0 |a Theory and Applications of Natural Language Processing 
490 0 |a SpringerLink  |a Bücher 
520 |a This book provides an overview of how comparable corpora can be used to overcome the lack of parallel resources when building machine translation systems for under-resourced languages and domains. It presents a wealth of methods and open tools for building comparable corpora from the Web, evaluating comparability and extracting parallel data that can be used for the machine translation task. It is divided into several sections, each covering a specific task such as building, processing, and using comparable corpora, focusing particularly on under-resourced language pairs and domains. The book is intended for anyone interested in data-driven machine translation for under-resourced languages and domains, especially for developers of machine translation systems, computational linguists and language workers. It offers a valuable resource for specialists and students in natural language processing, machine translation, corpus linguistics and computer-assisted translation, and promotes the broader use of comparable corpora in natural language processing and computational linguistics 
520 |a Introduction -- Cross-language comparability and its Applications for MT (Bogdan Babych, Fangzhong Su, Anthony Hartley, Ahmet Aker, Monica Lestari Paramita, Paul Clough, Robert Gaizauskas) -- Collecting comparable corpora (Monica Lestari Paramita, Ahmet Aker, Paul Clough, Robert Gaizauskas, Nikos Glaros, Nikos Mastropavlos, Olga Yannoutsou, Radu Ion, Dan Ștefănescu, Alexandru Ceauşu, Dan Tufiș and Judita Preiss) -- Extracting data from comparable corpora (Mārcis Pinnis, Nikola Ljubešić, Dan Ştefănescu, Inguna Skadiņa, Marko Tadić, Tatjana Gornostaja, Špela Vintar, Darja Fišer) -- Mapping and aligning units from comparable corpora (Ahmet Aker, Alexandru Ceaușu, Yang Feng, Robert Gaizauskas, Sabine Hunsicker, Radu Ion, Elena Irimia, Dan Ștefănescu, Dan Tufiș) -- Training, enhancing, evaluating and using MT-Systems with comparable data (Bogdan Babych, Yu Chen, Andreas Eisele, Sabine Hunsicker, Mārcis Pinnis, Inguna Skadiņa, Raivis Skadiņš, Gregor Thurmair, Andrejs Vasiļjevs, Mateja Verlic, Xiaojun Zhang) -- New areas of application of comparable corpora (Reinhard Rapp, Vivian Xu, Michael Zock, Serge Sharoff, Richard Forsyth, Bogdan Babych, Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi) -- Appendices (Ahmet Aker, Radu Ion, Nikos Mastropavlos, Monica Paramita, Mārcis Pinnis, Dan Ştefănescu, Fangzhong Su, Gregor Thurmair,Elena Irimia, Nikola Ljubešić, Evangelos Kanoulas, Judita Preiss, Rob Gaizauskas, Paul Clough, Emma Barker, Nikos Glaros, Tiberiu Boroș, Inguna Skadiņa, Andrejs Vasiļjevs) 
533 |a Reproduktion  |f Springer eBook Collection. Computer Science 
650 0 |a Natural Language Processing (NLP) 
650 0 |a Natural language processing (Computer science) 
650 0 |a Computational linguistics 
650 0 |a Data mining 
689 0 0 |d s  |0 (DE-588)4003966-3  |0 (DE-627)106388185  |0 (DE-576)208853340  |a Maschinelle Übersetzung  |2 gnd 
689 0 1 |d s  |0 (DE-588)4165338-5  |0 (DE-627)104584068  |0 (DE-576)209894733  |a Korpus  |g Linguistik  |2 gnd 
689 0 2 |d s  |0 (DE-588)4202994-6  |0 (DE-627)105154229  |0 (DE-576)210152222  |a Ähnlichkeit  |2 gnd 
689 0 |5 (DE-627) 
700 1 |a Skadiņa, Inguna  |e HerausgeberIn  |0 (DE-588)1180399854  |0 (DE-627)1667025481  |0 (DE-576)520138791  |4 edt 
700 1 |a Gaizauskas, Robert  |e HerausgeberIn  |0 (DE-588)1180400240  |0 (DE-627)1667025694  |0 (DE-576)520138996  |4 edt 
700 1 |a Babych, Bogdan  |e HerausgeberIn  |0 (DE-588)116804572X  |0 (DE-627)1031743782  |0 (DE-576)51140851X  |4 edt 
700 1 |a Ljubešić, Nikola  |e HerausgeberIn  |0 (DE-588)1180400054  |0 (DE-627)166702549X  |0 (DE-576)520138899  |4 edt 
700 1 |a Tufiş, Dan  |e HerausgeberIn  |0 (DE-588)1180400445  |0 (DE-627)1667025627  |0 (DE-576)520139119  |4 edt 
700 1 |a Vasiļjevs, Andrejs  |e HerausgeberIn  |0 (DE-588)1180400631  |0 (DE-627)1667025503  |0 (DE-576)520139224  |4 edt 
776 1 |z 9783319990033 
776 1 |z 9783319990033 
776 1 |z 9783319990057 
776 0 8 |i Erscheint auch als  |n Druck-Ausgabe  |t Using comparable corpora for under-resourced areas of machine translation  |d Cham, Switzerland : Springer, 2019  |h vi, 323 Seiten  |w (DE-627)1644911515  |w (DE-576)514692901  |z 9783319990033 
856 4 0 |u https://doi.org/10.1007/978-3-319-99004-0  |m X:SPRINGER  |x Verlag  |z lizenzpflichtig  |3 Volltext 
856 4 0 |u http://dx.doi.org/10.1007/978-3-319-99004-0  |x Resolving-System  |3 Volltext 
912 |a ZDB-2-SCS  |b 2019 
912 |a ZDB-2-SEB 
912 |a ZDB-2-SXCS  |b 2019 
936 b k |a 17.45  |j Übersetzungswissenschaft  |q SEPA  |0 (DE-627)106416987 
936 b k |a 18.00  |j Einzelne Sprachen und Literaturen allgemein  |q SEPA  |0 (DE-627)106405403 
951 |a BO 
990 |a Ähnlichkeit 
990 |a Korpus 
990 |a Maschinelle Übersetzung 
992 |a 20240404 
993 |a ConferencePaper 
994 |a 2019 
998 |g 116804572X  |a Babych, Bogdan  |m 116804572X:Babych, Bogdan  |p 3 
999 |a KXP-PPN1067369147  |e 4507194423 
BIB |a Y 
JSO |a {"name":{"displayForm":["edited by Inguna Skadiņa, Robert Gaizauskas, Bogdan Babych, Nikola Ljubešić, Dan Tufiş, Andrejs Vasiļjevs"]},"person":[{"given":"Inguna","family":"Skadiņa","role":"edt","display":"Skadiņa, Inguna","roleDisplay":"HerausgeberIn"},{"role":"edt","roleDisplay":"HerausgeberIn","display":"Gaizauskas, Robert","given":"Robert","family":"Gaizauskas"},{"roleDisplay":"HerausgeberIn","display":"Babych, Bogdan","role":"edt","family":"Babych","given":"Bogdan"},{"given":"Nikola","family":"Ljubešić","role":"edt","roleDisplay":"HerausgeberIn","display":"Ljubešić, Nikola"},{"role":"edt","roleDisplay":"HerausgeberIn","display":"Tufiş, Dan","given":"Dan","family":"Tufiş"},{"given":"Andrejs","family":"Vasiļjevs","role":"edt","roleDisplay":"HerausgeberIn","display":"Vasiļjevs, Andrejs"}],"title":[{"title_sort":"Using comparable corpora for under-resourced areas of machine translation","title":"Using comparable corpora for under-resourced areas of machine translation"}],"origin":[{"publisher":"Springer International Publishing","dateIssuedKey":"2019","dateIssuedDisp":"2019","publisherPlace":"Cham"}],"id":{"doi":["10.1007/978-3-319-99004-0"],"eki":["1067369147"],"isbn":["9783319990040","9783319990040"]},"type":{"media":"Online-Ressource","bibl":"edited-book"},"physDesc":[{"extent":"Online-Ressource (VI, 323 p. 63 illus., 39 illus. in color, online resource)"}],"recId":"1067369147","language":["eng"]} 
SRT |a USINGCOMPA2019