Are balanced corpus and sample corpus really the same?

While reading McEnery's book Corpus Linguistics, I came across this passage: “Balanced corpus, also known as sample corpus, tries to represent a particular type of language over a specific span of time. In doing so, it tries to be balanced and representative within a particular sampling frame which defines the type of language, the population that we would like to characterize.”
The glossary at the back of the book gives this definition: balanced corpus: A corpus that contains texts from a wide range of different language genres and text domains, so that, for example, it may include both spoken and written, and public and private texts. Balanced corpus is sometimes referred to as reference, general or core corpora. Corpus which seeks balance and representativeness within a given sampling frame is a balanced corpus.
My question: can balanced corpus and sample corpus really be treated as fully equivalent?
 
Re: Are balanced corpus and sample corpus really the same?

Strictly speaking, any corpus is supposed to be a sample of a language or of a specific variety of a language. However, strict sampling and representativeness are not stressed when building specific corpora. They become crucial when you are designing a general corpus like the BNC.
 
Re: Are balanced corpus and sample corpus really the same?

Thank you very much :)
 