PDA

查看完整版本 : [SHARE] NEW YORK TIMES 1995 (pos tagged)


dzhigner
2005-05-15, 10:49 PM
[SHARE] NEW YORK TIMES 1995 (pos tagged)
LDC used to provide access to this data resource, but no longer now. I downloaded it (9434 html files). If you need it, contact me.

Haiyang
2005-05-15, 10:59 PM
Hi Dzhigner, I'd like to have a look.

dzhigner
2005-08-14, 10:53 AM
NOW IT IS ONLY ONE TXT FILE, 64MB. IF YOU ARE INTERESTED, CONTACT ME.

xujiajin
2005-08-14, 02:36 PM
This looks a great source of American English. But it is too big to transfer via email.

xujiajin
2005-08-15, 02:46 PM
Is that possible that you put it somewhere on your local school server for download?

lizzawood
2005-08-15, 03:40 PM
You can also use online storage to keep this big size data of yours.

dzhigner
2005-08-15, 07:34 PM
If only I could ... Those people don't give me that, because I am just a nobody. But I am a warm-hearted nobody. My QQ: 33386349.

xujiajin
2005-08-15, 08:25 PM
Thanks a lot for the warm-hearted dzhigner.

xiaoz
2005-08-15, 08:25 PM
One month's data from the NY Times (July 2002) is included in ANC first release.

chrics
2005-08-17, 09:32 PM
Send it to me through if u are using education network, i will provide a server to let others to download the file. my qq:16124885

xujiajin
2005-08-17, 10:29 PM
Thank u, chrics, for your contribution to Corpus4U.

xusun575
2005-08-22, 04:31 PM
What tool is recommended for processing this tagged NYT95? Thanks!

dzhigner
2005-08-22, 05:38 PM
以下是引用 chrics 在 2005-8-17 21:32:53 的发言:
Send it to me through if u are using education network, i will provide a server to let others to download the file. my qq:16124885


Great news. I will contact you as soon as possible.

dzhigner
2005-08-22, 05:40 PM
Wordsmith or MonoconcPro

xujiajin
2005-08-22, 05:41 PM
以下是引用 xusun575 在 2005-8-22 16:31:33 的发言:
What tool is recommended for processing this tagged NYT95? Thanks!


Most concordancers can be used, but i guess your problem might be the the large size of the file.

xusun575
2005-08-22, 06:51 PM
i tried it with monopro and wsmith and found no problem with the zise, but i could not ignore the tags while concordancing.

dzhigner
2005-08-23, 01:39 AM
以下是引用 xusun575 在 2005-8-22 18:51:25 的发言:
i tried it with monopro and wsmith and found no problem with the zise, but i could not ignore the tags while concordancing.


It should be easy ....
http://forum.corpus4u.org/upload/forum/2005082301384282.jpg
http://forum.corpus4u.org/upload/forum/2005082301385532.jpg

xusun575
2005-08-23, 02:27 AM
以下是引用 dzhigner 在 2005-8-23 1:39:00 的发言:
[quote]以下是引用 xusun575 在 2005-8-22 18:51:25 的发言:
i tried it with monopro and wsmith and found no problem with the zise, but i could not ignore the tags while concordancing.


It should be easy ....

************
But it's easy said than done. I had done what u displayed here but failed. Probably NYT95 is tagged in a way different from what WST or Monopro would accept.

动态语法
2005-08-23, 03:52 AM
以下是引用 xusun575 在 2005-8-23 2:27:26 的发言:
以下是引用 dzhigner 在 2005-8-23 1:39:00 的发言:
[quote]以下是引用 xusun575 在 2005-8-22 18:51:25 的发言:
i tried it with monopro and wsmith and found no problem with the zise, but i could not ignore the tags while concordancing.


It should be easy ....

************
But it's easy said than done. I had done what u displayed here but failed. Probably NYT95 is tagged in a way different from what WST or Monopro would accept.


How did you fail? More info would be needed for trouble shooting:

Which program are you using?
What does the result look like?

etc.

Reason: different programs have different ways to ignore tags. In many
programs you can specify what kind of tag to ignore (or include).

xusun575
2005-08-23, 07:08 AM
Thank u ! I used both monopro and WST3 for NYT95 concordancing. next are the results produced by Monopro.http://forum.corpus4u.org/upload/forum/2005082307034897.jpg

With tags compressed , I got the following:
http://forum.corpus4u.org/upload/forum/2005082307040622.jpg

dzhigner
2005-08-24, 01:36 AM
In monoconcpro:
Use menu File>Tag settings>part of speech tags, check "embedded in word" and set up "delimiter character".
http://forum.corpus4u.org/upload/forum/2005082401335996.jpg

dzhigner
2005-08-24, 01:46 AM
One thing I must make clear. Somehow it is really difficult for me to mail this large file, and still so even if I split it up and deliver chunk by chunk. So far, the only way practicable on my part is transfer by QQ, and trust me, it is really fast.

xusun575
2005-08-25, 07:26 AM
thank u! i got it.

xusun575
2005-08-25, 07:29 AM
以下是引用 dzhigner 在 2005-8-24 1:46:26 的发言:
One thing I must make clear. Somehow it is really difficult for me to mail this large file, and still so even if I split it up and deliver chunk by chunk. So far, the only way practicable on my part is transfer by QQ, and trust me, it is really fast.


I got a copy from dzzhigner by QQ transfer. it's now your turn!

xujiajin
2005-08-25, 12:24 PM
Thanks, dzhigner. MonoConc works very well with big files.

oscar3
2005-08-25, 01:03 PM
I am using MonoCon Pro( a demo version only) for the first time.
http://forum.corpus4u.org/upload/forum/2005082513053536.jpg


[本贴已被 作者 于 2005年08月25日 13时05分46秒 编辑过]

[本贴已被 作者 于 2005年08月25日 13时07分07秒 编辑过]

[本贴已被 作者 于 2005年08月25日 13时21分24秒 编辑过]

oscar3
2005-08-25, 03:56 PM
I see, it is a window to display more context of a certain keyword.


以下是引用 oscar3 在 2005-8-25 13:03:57 的发言:
I am using MonoCon Pro( a demo version only) for the first time.
http://forum.corpus4u.org/upload/forum/2005082513053536.jpg


[本贴已被 作者 于 2005年08月25日 13时05分46秒 编辑过]

[本贴已被 作者 于 2005年08月25日 13时07分07秒 编辑过]

[本贴已被 作者 于 2005年08月25日 13时21分24秒 编辑过]