This book provides a comprehensive overview of methods for building comparable corpora and of their applications, including machine translation, cross-lingual transfer, and various kinds of multilingual natural language processing. The authors begin with a brief history of the topic, followed by a comparison with parallel resources and an explanation of why comparable corpora have become more widely used. In particular, comparable corpora provide the basis for the multilingual capabilities of pre-trained models such as BERT or GPT. The book then focuses on building comparable corpora, aligning their sentences to create a database of suitable translations, and using these sentence translations to produce dictionaries and term banks. Finally, it explains how comparable corpora can be used to build machine translation engines and to develop a wide variety of multilingual applications.
Authors: Serge Sharoff, Reinhard Rapp, Pierre Zweigenbaum
Binding Type: Paperback
Publisher: Springer
Published: 08/24/2024
Series: Synthesis Lectures on Human Language Technologies
Pages: 133
ISBN: 9783031313868
2023 Edition
Ezra's Archive does not ship outside of the United States.
Delivery Options:
1. Economy:
Estimated Delivery Time - 5 to 8 Business Days
Shipping Cost - $4.15
2. USPS Priority:
Estimated Delivery Time - 1 to 3 Business Days
Shipping Cost - $8.85
3. Free Economy Shipping: Only Applicable to Orders over $60
Returns and Refunds:
Purchased items are not eligible to be returned. However, a refund or item replacement may be granted should an item be damaged or misplaced during shipping. To make a refund or replacement claim, please contact us via email at Ezra'sArchive@outlook.com