TextProcessingToolKit

xujiajin

管理员
Staff member
http://sourceforge.net/project/showfiles.php?group_id=114201&package_id=123665&release_id=252579
http://soft.java-virtual-machine.net/java/text-processing-tool-kit.html

This is a java text processing toolkit, which provides classes and
command line interfaces in support of the following text processing goals

- Addressing whole file (not line) based text processing
- Conversion to/from inline/standoff annotation
- Multiplexing lots of different kinds of annotation, especially
with respect to a single goal.
- Standoff/meta-data handling of source data to reduce IO overhead and
confusion about data subsets.
- Validation of annotation against non-tree models.
- Cyclic dependencies between tagging processes.
- Lexical resources.
- Incremental and shared feature handling for statistical taggers.

This toolkit makes use of the java web services development pack to
store some structured information with transparent translation between
xml and java classes. The Java web services development pack is
available for free from http://java.sun.com/webservices.

To build this toolkit, you need a POSIX shell. If you have that,
the classes and command line can be built with

JWSDP_HOME=/path/to/java-webservices-home ./build.sh dist

This toolkit is far from complete, but provides a handful of useful
classes and command line tools addressing the stated goals.
 
Back
顶部