Search results

  1. F

    Web-Based TAGGER

    Re: Web-Based TAGGER It is a good tool for those who do not have a standalone POS tagger.
  2. F

    LOGICAL APPROACH TO CORPUS LINGUISTICS

    Re: LOGICAL APPROACH TO CORPUS LINGUISTICS I find the section on rationalism and empiricism very interesting. It compares the corpus linguist with the theoretical linguist. [This post was edited by the author on 2005-09-28 at 02:15:35]
  3. F

    LOGICAL APPROACH TO CORPUS LINGUISTICS

    PhD thesis: A LOGICAL APPROACH TO COMPUTATIONAL CORPUS LINGUISTICS http://www.ling.gu.se/~lager/Thesis.pdf
  4. F

    Vocabulary study bibliography

    http://www1.harenet.ne.jp/~waring/vocab/vocrefs/biblio.txt
  5. F

    About the Corpus Linguistics Summer Institute

    Re: About the Corpus Linguistics Summer Institute Don't get discouraged. If we wait a little longer, more people will come, and it will then be more likely that we find a sponsor.
  6. F

    Learner corpus bibliography downloadable

    Learner corpus bibliography downloadable: a long list http://forum.corpus4u.org/upload/forum/2005092311054316.pdf
  7. F

    [Discussion] CLEC-Based error analysis

    Re: [Discussion] CLEC-Based error analysis WANTED!!! Collocations in a Learner Corpus by Nadja Nesselhauf University of Heidelberg Studies in Corpus Linguistics 14 2005. xii, 332 pp. Table of contents Abbreviations ix Acknowledgements xi Collocations in native and non-native...
  8. F

    Is the POS-tagged CLEC reliable?

    Re: Is the POS-tagged CLEC reliable? Thanks, xiaoz. The quality of the transcription probably has a great effect on the accuracy rate. It seems natural that tagging accuracy on learner spoken data is the lowest, while that on native speaker written data is the highest.
  9. F

    Is the POS-tagged CLEC reliable?

    Re: Is the POS-tagged CLEC reliable? Thank you, Dr. xiaoz, for all the information you provided. My pilot study with learner spoken data shows that tagging accuracy is about 88%.
  10. F

    A wonderful tool for analyzing textual coherence

    Re: A wonderful tool I did some work on Latent Semantic Analysis myself. It involves some matrix computations. The results generated by Coh-Metrix are interpreted at the tool's website. Simply click the links to the left of the generated results, and another webpage will pop up. The...
  11. F

    What is POS tagging and how it is done

    Thank you for the info, xusun575.
  12. F

    Some papers by Granger for download

    Starter Bibliography for Learner Corpus Analysis Cobb, T. (2003). Analyzing late interlanguage with learner corpora: Quebec replications of three European studies. The Canadian Modern Language Review/La Revue canadienne des langues vivantes, 59(3), 393-423. Granger, S. (1993). The...
  13. F

    Is the POS-tagged CLEC reliable?

    Re: Is the POS-tagged CLEC reliable? I conducted a pilot study, in which I tagged a set of 16 learner-written essays with both Brill and CLAWS. The result surprised me. The transformation-based POS tagger yielded an accuracy of something like 93%, while the probability-based POS tagger turned out an accuracy...
  14. F

    What is POS tagging and how it is done

    Re: What is POS tagging and how it is done This was taken from the web. It is one chapter of a book, which, unfortunately, I do not have, so I cannot provide any copyright information or anything of the sort. Consequently, it can only be taken as an introduction to POS tagging. Quoting or citing can...
  15. F

    Investigating learner vocabulary

    Learners' vocabulary can be investigated in terms of depth, breadth, variety, richness, sophistication, etc. Visit Paul Nation's website, and you should find more information there. Vocabulary analysis does not simply involve counting.
  16. F

    Archive of Research Papers in Computational Lingui

    A Digital Archive of Research Papers in Computational Linguistics http://acl.ldc.upenn.edu/
  17. F

    Investigating learner vocabulary

    Paper to download: Investigating learner vocabulary http://forum.corpus4u.org/upload/forum/2005092000051795.pdf [This post was edited by the author on 2005-09-20 at 00:06:03]
  18. F

    [Help] lexical density tools needed

    Re: [Help] lexical density tools needed Quoting Xiaoz (posted 2005-9-19 19:08:47): Not exactly just the first thousand words of each text. Here is what Mike says about STTR: "The standardised type/token ratio (STTR) is computed every n words as Wordlist goes through each text file. By default, n =...
  19. F

    [Help] lexical density tools needed

    Re: [Help] lexical density tools needed Dr. Xiaoz, while I agree that standardized TTR is better than TTR, I am more inclined to accept the view that standardized TTR is not really a good measure either, as the measure only takes into account part (say, the first 1000 words) of texts. In so doing, a good...
  20. F

    Corpora searchable online

    Corpora searchable online BROWN & LOB CORPUS http://www.edict.com.hk/concordance/WWWConcappE.htm BRITISH NATIONAL CORPUS (BNC) http://sara.natcorp.ox.ac.uk/lookup.html The British National Corpus (BNC) is a one hundred million word corpus of British English, both spoken and written...
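
The standardised type/token ratio quoted in the lexical density thread (items 18 and 19) can be illustrated with a small sketch. It follows the quoted description only loosely: a TTR is computed for every consecutive chunk of n words and the chunk values are averaged. The chunk size is left as a required parameter because the default is cut off in the snippet above. The whitespace tokenisation, the dropping of a trailing partial chunk, and the function names are assumptions of this sketch, not a description of how Wordlist itself works.

```python
def type_token_ratio(tokens):
    """Plain TTR: number of distinct word forms divided by number of running words."""
    if not tokens:
        raise ValueError("need at least one token")
    return len(set(tokens)) / len(tokens)


def standardised_ttr(tokens, n):
    """Mean TTR over consecutive, non-overlapping chunks of n tokens.

    Assumption of this sketch: a trailing chunk shorter than n is ignored.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    # Build only the full chunks: [0:n], [n:2n], ...
    chunks = [tokens[i:i + n] for i in range(0, len(tokens) - n + 1, n)]
    if not chunks:
        raise ValueError("text is shorter than one chunk")
    return sum(type_token_ratio(chunk) for chunk in chunks) / len(chunks)


if __name__ == "__main__":
    # Hypothetical toy text; real use would need a proper tokeniser.
    text = "the cat sat on the mat the dog sat on the log"
    tokens = text.lower().split()
    print("TTR :", type_token_ratio(tokens))
    print("STTR:", standardised_ttr(tokens, 4))
```

Because every full chunk contributes to the average, the measure in this sketch is not limited to the first chunk of a text, which seems to be the point of the "Not exactly" reply in item 18.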