You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

1. State of progress since last meeting

Jaspret and Roar have been working on their

2. Production of normalized, common sentence format and numbering. Should also be compatible with output (Brat) standoff format from Christine.

Everyone will look for existing framework for representing documents in Java, e.g. the Document class used in Lucene.
Also, everyone will make a list of required properties of such a representation. This is homework till next meeting.

3. Common dictionary format

4. Other issues

  - Recommended "simstring" (from the people behind Brat...) for phrase matching, used in
    normalization in brat 1.3 (maybe something to help Christine).

  • No labels