PDA

查看完整版本 : 求助:How to remove the tags in BNC in Wordsmith?


zlgtony
2007-03-20, 06:41 PM
How to remove the tags in BNC in Wordsmith? Many thanks in advance

oscar3
2007-03-20, 08:22 PM
你是想hide tags,还是将strip tags?

laohong
2007-03-20, 09:39 PM
If you want to work on a plain text format of BNC (without any tags), you may want to try the slim version of BNC.

Have a look at this tool:

BET-MBNC - Canare Slim BNC Extraction Tool
http://www.sjmediasystem.com/bet-mbnc.html

xiaoz
2007-03-21, 12:47 AM
What kind of tool is that? Can it be used for the British National Corpus?

Product Description

Easy Access to Rear Rack
Fits Canare ''Slim'' BNC
Long 300 mm Heavy Duty Metal Probe Shaft
Tapered Channel Socket
Clear Plastic Handle



If you want to work on a plain text format of BNC (without any tags), you may want to try the slim version of BNC.

Have a look at this tool:

BET-MBNC - Canare Slim BNC Extraction Tool
http://www.sjmediasystem.com/bet-mbnc.html

laohong
2007-03-21, 10:05 AM
Just kidding! I know it's not for BNC when I posted it yesterday.

Actually I remembered that one registered user here in this forum has a slim version of BNC and he said it's ok to share with us. Search the forum and contact him.

BTW, the Canare Slim BNC Extraction Tool is one of the search results, not exactly relevant, though.

zlgtony
2007-03-22, 09:18 AM
I want to hide the tags. I tried my best to ignore them in WS by typing <*> in TAGS TO IGNORE, but the generated wordlist also included a lot of tags. Do you know how to ingore them in WS? Thanks!

xiaoz
2007-03-22, 09:46 AM
Even if you successfully ignores everything in <>, the resulting wordlist is still unrebliable because many parts in the corpus header are not brackets. You will need to use/cut "part of file" to cut the corpus header in each corpus file and then tries to ignore tags.

There are of course free copies of BNC wordlists out there at this forum, in both WST3 and 4 formats. Just search for it.

xujiajin
2007-03-23, 05:18 PM
Detagging Tool
http://forum.corpus4u.org/showthread.php?t=2059&highlight=remove

How to remove the error tags in CLEC?
http://forum.corpus4u.org/showthread.php?t=1827&highlight=remove

How to remove tags at one go?
http://forum.corpus4u.org/showthread.php?t=784&highlight=remove

How to ignore all the tags in CLEC?
http://forum.corpus4u.org/showthread.php?t=875&highlight=remove

如何去掉标记部分的内容?Tag removal : remove tags
http://forum.corpus4u.org/showthread.php?t=910&highlight=remove

如何将BNC语料中POS tag去掉并存为.txt格式?有程序下载哦!
http://forum.corpus4u.org/showthread.php?t=1368&highlight=remove

如何去除中英混合文本中的中文或英文?
http://forum.corpus4u.org/showthread.php?t=908&highlight=remove

如何删去语料库中的标注信息和附码信息
http://forum.corpus4u.org/showthread.php?t=1088&highlight=remove

Detagging小工具
http://forum.corpus4u.org/showthread.php?t=2055&highlight=detagging