搜寻结果

  1. ArthurW

    求一个语料抓取全面的法语词频表,可以使用antconc和自己生成的语料库作对比

    显然最关键的是要有一个足够大的法语语料库,之后就容易了。这样的库似乎很难找,附件是我根据Notre-Dame de Paris小说全文提取的词频表(仅lemma,也包括标点符号)
  2. ArthurW

    北大的CCL语料库不能访问了?

    北大的库主要是内部使用,外部访问要仰北大鼻息
  3. ArthurW

    早已潜伏Windows的十大语料神器

    Thanks, Dr. Li. Most of those low level tools are good enough for advanced purposes, if used creatively enough. Let's hope our corpling drivers are interested in what's under the hood ;-)
  4. ArthurW

    求助,怎么利用AntConc检索BNC里的虚拟语气的倒装句?正则表达式什么啊?

    兰卡的CQPweb也可以用BNC,只要注册不需付费。个人认为比byu的检索功能更强。https://cqpweb.lancs.ac.uk/
  5. ArthurW

    AntCorGen: 新发布的语料采集工具

    AntCorGen (Windows) Build 1.0.1 (Released June 5, 2017) Introduction AntCorGen is a freeware corpus generation tool. AntCorGen lets you search for documents in the PLOS ONE research database via search queries and/or subject category browsing and decide which sections (e.g. title, abstract...
  6. ArthurW

    CLiC: 语料库文体学新工具

    http://www.nottingham.ac.uk/research/groups/cral/projects/clic.aspx Characterisation in the representation of speech and body language from a corpus linguistic perspective The CLiC Dickens project demonstrates through corpus stylistics how computer-assisted methods can be used to study literary...
  7. ArthurW

    求wordsmith 注册码或破解版

    Hi This request is not appropriate here... For most of what you need you could use other software instead - there are loads of them
  8. ArthurW

    从熟语料到生语料-除了手动

    正则表达式确实是更好的选择。先仔细分析熟语料,然后备份语料,再用Notepad++、Editplus等等专业文本编辑器中的“在文件中替换”这类功能,一条正则都搞定
  9. ArthurW

    一个很好的CQPweb搭建与数据格式的tutorial

    http://chozelinek.github.io/sacoco/cqpwebsetup.html Andrew Hardie本人也说这个tutorial很不错,甚至想联系作者本人问是否可以把其中一部分“借”到CQPweb tutorial里去。
  10. ArthurW

    跪求LOCNESS中英语本族语大学生议论文语料

    Google for LOCNESS and her email address isn't hard to find.
  11. ArthurW

    关于专门用途语料库的文本清理问题

    Formulas are annoying for corpus building indeed. Since they lose their meaning in pure text, deleting them from the corpus text will be fine. However, this is not true if the theses are to be retrieved in more advanced, or multimodal ways. The answer to your question depends on the nature of...
  12. ArthurW

    怎样消除段首半角空格缩进?

    Windows Notepad isn't a good tool for such purposes. Consider Notepad++ or other similar powerful text editors instead, which have built-in support for regular expressions. To remove leading spaces, replace call instances of ^\s+ with nothing. However, it isn't so easy to to connect two lines...
  13. ArthurW

    怎样消除段首半角空格缩进?

    Please describe your problem more precisely. Your title doesn't match your message body.
  14. ArthurW

    推荐两个网上的语料资源

    Corpus Finder: http://www.helsinki.fi/varieng/CoRD/corpora/corpusfinder/ 很多经典的语料库可下载 Opus: the open parallel corpus http://opus.lingfil.uu.se/ 海量的平行语料库可下载
  15. ArthurW

    Free Corpus Linguistics course from Tony McEnery

    今年的是九月下旬开始的,共8周,现在已经结束了。大概要等明年吧
  16. ArthurW

    CLAWS tag和Tree tagger标注

    There are a lot of tagging schemes, but there is no single standard for annotation. You may want to have a look at the tagsets in order to get an idea of what the tags mean.
  17. ArthurW

    推荐使用网络语料自动下载工具BootCat

    BootCat是Bootstrap Corpora and Terms的缩写,是WaCky (Web-As-Corpus Kool Yinitiative)兴趣小组开发的软件,用于从互联网自动采集语料。可按用户指定的搜索词(称为seeds)借助Bing.com搜索引擎从网络上搜索网页,然后下载到本地硬盘构成语料库。这个软件被SketchEngine用作采集网络语料的后台工具(称为WebBootCaT)。由于SketchEngine是收费的,而且对试用用户有不少限制,而BootCat则是自由软件,因此强烈推荐有兴趣者下载安装BootCat。 说明: 1. 需要先在电脑系统中安装JRE。...
  18. ArthurW

    2015年新出炉中国英语学习者语料库The TECCL corpus

    Hi, you could try the "Restricted Query" menu item.
Back
顶部