I have two folders: ICECUP3 and ICE-GB-S, the programs are installed in ICECUP3, the corpus in ICE-GB-S. But in ICE-GB-S, only those subdirectories, i.e. DATA, INDEX, LEXICAL, ... , VARS are installed. I don't see any DEF file nor INI file.
One thing I must make clear. Somehow it is really difficult for me to mail this large file, and still so even if I split it up and deliver chunk by chunk. So far, the only way practicable on my part is transfer by QQ, and trust me, it is really fast.
回复:[TO SHARE] NEW YORK TIMES 1995 (pos tagged)
In monoconcpro:
Use menu File>Tag settings>part of speech tags, check "embedded in word" and set up "delimiter character".
Yeah, true, different. I quoted this Z-score formula from what I read, but I am just confused about the variety of those formulae, though I've learned some statistics.
Can we seriously take the following as Chinglish?
--------------------------------------------------------------------
干货计价处:fuck the certain price of goods
熟食计价处:the familiar make sures the price
怎么是你:how are you
怎么老是你:how old are you
一位中国学生在美国加州目睹了一起交通事故,警察问他经过,he said,one car come,one...
回复:Parallel image text corpus of Chinglish
Can we be sure they are nothing significant but jokes?
I found something like this, but I don't believe they are real.
[本贴已被 作者 于 2005年08月22日 17时25分17秒 编辑过]
回复:为Google检索重做的表单页面
GOOGLE SEARCH FORM REVISED
我把之前做的Google搜索表单完善了一下,并且加入了 Google Scholar 和 Google Print。
http://www.corpus4u.org/upload/forum/2005082214363671.rar
T-test is used to solve two types of collocation discovery problems. It seems that T-test is used in "investigations of how pairs of words are used differently, rather then the association between two words" (Biber, 1998), and in this case the statistical approach is something of Student's...