A Frequency Dictionary of Russian is an invaluable tool for all learners of Russian, providing a list of the 5,000 most frequently used words in the language and the 300 most frequent multiword constructions. The dictionary is based on data from a 150-million-word internet corpus taken from more than 75,000 webpages and covering a range of text types from news and journalistic articles, research papers, administrative texts and fiction. All entries in the rank frequency list feature the English equivalent, a sample sentence with English translation, a part of speech indication, indication of stress for polysyllabic words and information on inflection for irregular forms. The dictionary also contains twenty-six thematically organised and frequency-ranked lists of words on a variety of topics, such as food and drink, travel, and sports and leisure. A Frequency Dictionary of Russian enables students of all levels to get the most out of their study of vocabulary in an engaging and efficient way. It is also a rich resource for language teaching, research, curriculum design, and materials development. A CD version is available to purchase separately. Designed for use by corpus and computational linguists it provides the full text in a format that researchers can process and turn into suitable lists for their own research purposes.
This book provides a comprehensive overview of methods to build comparable corpora and of their applications, including machine translation, cross-lingual transfer, and various kinds of multilingual natural language processing. The authors begin with a brief history on the topic followed by a comparison to parallel resources and an explanation of why comparable corpora have become more widely used. In particular, they provide the basis for the multilingual capabilities of pre-trained models, such as BERT or GPT. The book then focuses on building comparable corpora, aligning their sentences to create a database of suitable translations, and using these sentence translations to produce dictionaries and term banks. Then, it is explained how comparable corpora can be used to build machine translation engines and to develop a wide variety of multilingual applications.
This book provides a comprehensive overview of methods to build comparable corpora and of their applications, including machine translation, cross-lingual transfer, and various kinds of multilingual natural language processing. The authors begin with a brief history on the topic followed by a comparison to parallel resources and an explanation of why comparable corpora have become more widely used. In particular, they provide the basis for the multilingual capabilities of pre-trained models, such as BERT or GPT. The book then focuses on building comparable corpora, aligning their sentences to create a database of suitable translations, and using these sentence translations to produce dictionaries and term banks. Then, it is explained how comparable corpora can be used to build machine translation engines and to develop a wide variety of multilingual applications.
A Frequency Dictionary of Russian is an invaluable tool for all learners of Russian, providing a list of the 5,000 most frequently used words in the language and the 300 most frequent multiword constructions. The dictionary is based on data from a 150-million-word internet corpus taken from more than 75,000 webpages and covering a range of text types from news and journalistic articles, research papers, administrative texts and fiction. All entries in the rank frequency list feature the English equivalent, a sample sentence with English translation, a part of speech indication, indication of stress for polysyllabic words and information on inflection for irregular forms. The dictionary also contains twenty-six thematically organised and frequency-ranked lists of words on a variety of topics, such as food and drink, travel, and sports and leisure. A Frequency Dictionary of Russian enables students of all levels to get the most out of their study of vocabulary in an engaging and efficient way. It is also a rich resource for language teaching, research, curriculum design, and materials development. Former CD content is now available to access at www.routledge.com/9780415521420 as support material. Designed for use by corpus and computational linguists it provides the full text in a format that researchers can process and turn into suitable lists for their own research purposes.
This will help us customize your experience to showcase the most relevant content to your age group
Please select from below
Login
Not registered?
Sign up
Already registered?
Success – Your message will goes here
We'd love to hear from you!
Thank you for visiting our website. Would you like to provide feedback on how we could improve your experience?
This site does not use any third party cookies with one exception — it uses cookies from Google to deliver its services and to analyze traffic.Learn More.