查看完整版本 : [SHARE] NEW YORK TIMES 1995 (pos tagged)
dzhigner
2005-05-15, 10:49 PM
[SHARE] NEW YORK TIMES 1995 (pos tagged)
LDC used to provide access to this data resource, but no longer now. I downloaded it (9434 html files). If you need it, contact me.
Haiyang
2005-05-15, 10:59 PM
Hi Dzhigner, I'd like to have a look.
dzhigner
2005-08-14, 10:53 AM
NOW IT IS ONLY ONE TXT FILE, 64MB. IF YOU ARE INTERESTED, CONTACT ME.
xujiajin
2005-08-14, 02:36 PM
This looks a great source of American English. But it is too big to transfer via email.
xujiajin
2005-08-15, 02:46 PM
Is that possible that you put it somewhere on your local school server for download?
lizzawood
2005-08-15, 03:40 PM
You can also use online storage to keep this big size data of yours.
dzhigner
2005-08-15, 07:34 PM
If only I could ... Those people don't give me that, because I am just a nobody. But I am a warm-hearted nobody. My QQ: 33386349.
xujiajin
2005-08-15, 08:25 PM
Thanks a lot for the warm-hearted dzhigner.
xiaoz
2005-08-15, 08:25 PM
One month's data from the NY Times (July 2002) is included in ANC first release.
chrics
2005-08-17, 09:32 PM
Send it to me through if u are using education network, i will provide a server to let others to download the file. my qq:16124885
xujiajin
2005-08-17, 10:29 PM
Thank u, chrics, for your contribution to Corpus4U.
xusun575
2005-08-22, 04:31 PM
What tool is recommended for processing this tagged NYT95? Thanks!
dzhigner
2005-08-22, 05:38 PM
以下是引用 chrics 在 2005-8-17 21:32:53 的发言:
Send it to me through if u are using education network, i will provide a server to let others to download the file. my qq:16124885
Great news. I will contact you as soon as possible.
dzhigner
2005-08-22, 05:40 PM
Wordsmith or MonoconcPro
xujiajin
2005-08-22, 05:41 PM
以下是引用 xusun575 在 2005-8-22 16:31:33 的发言:
What tool is recommended for processing this tagged NYT95? Thanks!
Most concordancers can be used, but i guess your problem might be the the large size of the file.
xusun575
2005-08-22, 06:51 PM
i tried it with monopro and wsmith and found no problem with the zise, but i could not ignore the tags while concordancing.
dzhigner
2005-08-23, 01:39 AM
以下是引用 xusun575 在 2005-8-22 18:51:25 的发言:
i tried it with monopro and wsmith and found no problem with the zise, but i could not ignore the tags while concordancing.
It should be easy ....
http://forum.corpus4u.org/upload/forum/2005082301384282.jpg
http://forum.corpus4u.org/upload/forum/2005082301385532.jpg
xusun575
2005-08-23, 02:27 AM
以下是引用 dzhigner 在 2005-8-23 1:39:00 的发言:
[quote]以下是引用 xusun575 在 2005-8-22 18:51:25 的发言:
i tried it with monopro and wsmith and found no problem with the zise, but i could not ignore the tags while concordancing.
It should be easy ....
************
But it's easy said than done. I had done what u displayed here but failed. Probably NYT95 is tagged in a way different from what WST or Monopro would accept.
以下是引用 xusun575 在 2005-8-23 2:27:26 的发言:
以下是引用 dzhigner 在 2005-8-23 1:39:00 的发言:
[quote]以下是引用 xusun575 在 2005-8-22 18:51:25 的发言:
i tried it with monopro and wsmith and found no problem with the zise, but i could not ignore the tags while concordancing.
It should be easy ....
************
But it's easy said than done. I had done what u displayed here but failed. Probably NYT95 is tagged in a way different from what WST or Monopro would accept.
How did you fail? More info would be needed for trouble shooting:
Which program are you using?
What does the result look like?
etc.
Reason: different programs have different ways to ignore tags. In many
programs you can specify what kind of tag to ignore (or include).
xusun575
2005-08-23, 07:08 AM
Thank u ! I used both monopro and WST3 for NYT95 concordancing. next are the results produced by Monopro.http://forum.corpus4u.org/upload/forum/2005082307034897.jpg
With tags compressed , I got the following:
http://forum.corpus4u.org/upload/forum/2005082307040622.jpg
dzhigner
2005-08-24, 01:36 AM
In monoconcpro:
Use menu File>Tag settings>part of speech tags, check "embedded in word" and set up "delimiter character".
http://forum.corpus4u.org/upload/forum/2005082401335996.jpg
dzhigner
2005-08-24, 01:46 AM
One thing I must make clear. Somehow it is really difficult for me to mail this large file, and still so even if I split it up and deliver chunk by chunk. So far, the only way practicable on my part is transfer by QQ, and trust me, it is really fast.
xusun575
2005-08-25, 07:26 AM
thank u! i got it.
xusun575
2005-08-25, 07:29 AM
以下是引用 dzhigner 在 2005-8-24 1:46:26 的发言:
One thing I must make clear. Somehow it is really difficult for me to mail this large file, and still so even if I split it up and deliver chunk by chunk. So far, the only way practicable on my part is transfer by QQ, and trust me, it is really fast.
I got a copy from dzzhigner by QQ transfer. it's now your turn!
xujiajin
2005-08-25, 12:24 PM
Thanks, dzhigner. MonoConc works very well with big files.
oscar3
2005-08-25, 01:03 PM
I am using MonoCon Pro( a demo version only) for the first time.
http://forum.corpus4u.org/upload/forum/2005082513053536.jpg
[本贴已被 作者 于 2005年08月25日 13时05分46秒 编辑过]
[本贴已被 作者 于 2005年08月25日 13时07分07秒 编辑过]
[本贴已被 作者 于 2005年08月25日 13时21分24秒 编辑过]
oscar3
2005-08-25, 03:56 PM
I see, it is a window to display more context of a certain keyword.
以下是引用 oscar3 在 2005-8-25 13:03:57 的发言:
I am using MonoCon Pro( a demo version only) for the first time.
http://forum.corpus4u.org/upload/forum/2005082513053536.jpg
[本贴已被 作者 于 2005年08月25日 13时05分46秒 编辑过]
[本贴已被 作者 于 2005年08月25日 13时07分07秒 编辑过]
[本贴已被 作者 于 2005年08月25日 13时21分24秒 编辑过]
vBulletin® v3.7.4,版权所有 ©2000-2009,Jelsoft Enterprises Ltd.