BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

本文由 xujiajin2010-04-08 发表於 "编程与工具开发" 讨论区

  1. xujiajin

    xujiajin 管理员 Staff Member

    BFSU Sentence Collector 1.0 was programmed by Mr. Yunlong Jia, and designed by Dr. Jiajin Xu.

    The earliest idea of developing such a tool was to encourage and enable the use of corpus data and method in actual teaching of English. Data driven learning to date has largely been a toy method for only a few curious and motivated research-minded people. Sentence Collector therefore was meant to turn corpus analysis from a toy method to a handy learning aid.

    The main functionality of Sentence Collector is primarily KWIC search, or concordancing, in technical terms. To serve pedagogic purposes, however, we choose the ‘sentence mode’ for result display. Another feature of the tool is the selection of sentences in the result by some restrictions, for instance, the new words in the sentence context, or sentence length, or simply a default of 30 random sentences. It has long been vexing with past concordancers, when brought to the classroom, that loads of examples tend to drown and confuse even the most ambitious learners.

    The sentences in the search result can be sorted according to their lexical difficulty based on some pre-loaded base word list. The selected sentences can eventually be used for pedagogic purposes after post-editing, if necessary. This idea was in the first place inspired by the lexical difficulty filter in an online learning system called IWILL in Taiwai.

    Sentence Collector 1.0 supports regular expression based search. This version of the program only accepts raw texts.

    Please cite the program as:

    Xu, Jiajin & Yunlong Jia. (2010). BFSU Sentence Collector 1.0. Beijing: National Research Center for Foreign Language Education, Beijing Foreign Studies University.

    BFSU Sentence Collector 1.0 is freeware. The software comes on an “as is” basis, and the authors will accept no liability for any damage that results from using the software.

    Bug reports will be highly appreciated and should be sent to WilliamJia@OpenCorpus.org.

    【Save】保存结果时,可以选择DOC Word格式,还可以选择Exercise File格式,同样为Word格式,但检索词会被替换为____________,经编辑后可做填空练习。
     

    附件文件:

    1 个人点赞。
  2. 回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    感谢许博士、贾博士分享!先下为快!
     
  3. armstrong

    armstrong 高级会员

    回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    我将用BFSU NewWords Marker 1.0生成的.IDX文件载入BFSU Sentence Collector 1.0 后就不能运行,可能是缺少对应的.DAT文件吧,不知如何解决,请教一下许博士,谢谢!
     
  4. xujiajin

    xujiajin 管理员 Staff Member

    回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    BFSU Sentence Collector 1.0 evaluation版不支持加载数据和导入index文件功能。
     
  5. iCasino

    iCasino 普通会员

    回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    impressive,thanks
     
  6. armstrong

    armstrong 高级会员

    回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    知道了,谢谢!
     
  7. 回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    哦,难怪!我还以为系统不支持呢。
    不知道是否支持汉语?要是能分别检索英、汉、英汉对照就更完美啦!
     
  8. xujiajin

    xujiajin 管理员 Staff Member

    回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    没有计划支持汉语和英汉对照。这个工具是专门针对英语学习和教学的。
     
  9. 回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    好的,明白,谢谢啦!
     
  10. maggieq58

    maggieq58 语料人生

    回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    此版本不知道怎么用?几个功能键都是灰色的。。。晚上我脑子也是有点不太灵活。一时没看懂怎么用啊。。。我知道有几个功能暂时不支持。
     
  11. xujiajin

    xujiajin 管理员 Staff Member

    回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    输入单词、短语或正则表达式,获得10个例句,可用于课堂教学或编制练习。
    例句筛选可通过限定句长和语境中的生词数调节。

    结果可以Save成所见即所得的形式,也可以Save成挖词填空(Exercise File)形式的。
     
  12. 回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    好像下载的时候,卡巴会误报为病毒。结果我要暂停卡巴,才下完整了。不知道其他C 友是否一样如此?
     
  13. 清风出袖

    清风出袖 高级会员

    回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    谢谢贾博士和许博士!
     
  14. 回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    谢谢贾博士和许博士!Excellent Tools, it will help the EFL/ESL teachers in classroom teaching and contribute to the implementing corpus into teaching pratice. My question is how can we evaluate the validity of the sentences? or what corpus is the sentences based on?
     
  15. maggieq58

    maggieq58 语料人生

    回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    的确,很好用,看到效果了。感谢。有个问题,这个软件可以标记出新单词,它也是根据new words maker那个软件的原理标记的吗?但这个软件未看到有选择词表的地方。
    再有,此软件在indexes下有colen这个dat文件,貌似搜录了colen这个语料库
     
  16. xujiajin

    xujiajin 管理员 Staff Member

    回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    http://www.corpus4u.org/showthread.php?t=3217
    The preloaded corpus is COLEN. The corpus has been indexed and marked up with information of sentence length and new words.
     
  17. williamJia

    williamJia 开放语料库项目

    回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    内置COLEN语料库,标注词表为四级词汇。
     
  18. 回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    谢谢!!
     
  19. 回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    谢谢分享!
    要是能导入自己的语料库和标注词就太好了。
     
  20. xujiajin

    xujiajin 管理员 Staff Member

    回复: BFSU Sentence Collector 1.0 基于语料库的英语例句提取工具

    检索说明:
    缺省检索方式为lemma search,即输入原形词make,会得到包含making、makes、make、made等词的例句;如果输入making,则只会得到含making的例句。
    缺省支持正则表达式比如输入a \S+ of可以得到含a lot of和a number of这样短语的句子。

    输入, however(逗号+空格+however)可以得到在句中出现的however句子。