Linux Today: Linux News On Internet Time.

System Text for Information Extraction

Oct 17, 2008, 03:33 (0 Talkback[s])

[ Thanks to jmalasko for this link. ]

"System Text makes the process of writing information extraction code like that of building any other piece of enterprise software. The major components of System Text are the Development Environment, Optimizer, and Run-time environment.

"The Development Environment helps the annotator developer to develop and debug extraction rules. Rules in System Text are written in AQL, a powerful, declarative language. AQL can express complex patterns that previous languages could not describe. In addition, AQL is designed to give System Text Optimizer maximal flexibility in reordering operations for improved performance. The Development Environment provides facilities for managing AQL rules and dictionary files, as well as for testing rules on collections of representative documents."

Complete Story

Related Stories: