xiaoz
2005-10-01, 10:20 AM
A PowerPoint I prepared today:
http://forum.corpus4u.org/upload/forum/2005100110193664.ppt
[This post was edited by xujiajin on 2005-10-01 at 10:24:35]
xujiajin
2005-10-01, 10:30 AM
Images help a lot.
Before concordancing, computing collocations, etc., the corpus has to be preprocessed:
Mark up the corpus in XML
1. Markup can be very complex or very simple
2. If your corpus is not marked up in XML, use the Index Tool (Tools > Preprocess in the Index Toolkit) to add simple XML markup
3. For a corpus in a non-alphabetic language, convert it to Unicode (e.g. UTF-8, UTF-16)
Use the Index Tool (Tools > Index Wizard in the Index Toolkit) to index your corpus.
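For anyone who wants to see roughly what the preprocessing steps above amount to outside the toolkit, here is a minimal Python sketch. It is not what the Index Toolkit does internally; the function name `to_simple_xml`, the `<text>`/`<p>` element names, and the `gb18030` source encoding are all assumptions chosen for illustration. It decodes a legacy-encoded text, wraps each non-empty line in a simple element, and emits UTF-8.

```python
import sys
from xml.sax.saxutils import escape, quoteattr


def to_simple_xml(raw: bytes, source_encoding: str, doc_id: str) -> str:
    """Decode legacy-encoded text and wrap each non-empty line in a <p> element.

    The <text>/<p> element names are hypothetical; use whatever markup
    scheme your indexer expects.
    """
    text = raw.decode(source_encoding)
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    # escape() protects &, < and > so the output stays well-formed XML
    body = "\n".join(f"  <p>{escape(line)}</p>" for line in lines)
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        f"<text id={quoteattr(doc_id)}>\n{body}\n</text>\n"
    )


if __name__ == "__main__":
    # Sample input standing in for a file saved in a legacy Chinese encoding
    sample = "语料库 & 检索\n\n第二行".encode("gb18030")
    xml_text = to_simple_xml(sample, "gb18030", "sample01")
    # Write the result as UTF-8 bytes, matching the XML declaration
    sys.stdout.buffer.write(xml_text.encode("utf-8"))
```

The output of a step like this would then be handed to the Index Wizard for indexing. If your source files are in a different encoding, change the `source_encoding` argument accordingly.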