1. State of progress since last meeting
Jaspret and Roar have been working on their
2. Production of a normalized, common sentence format and numbering. It should also be compatible with the Brat standoff output format from Christine.
Everyone will look for existing frameworks for representing documents in Java, e.g. the Document class used in Lucene.
Everyone will also make a list of the properties such a representation must have. This is homework for the next meeting.
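As a starting point for the list of required properties, here is a minimal sketch of what such a representation might look like. It is not an existing framework and not a decision from the meeting. The class and method names (SentenceDocument, addSentence, toStandoffLine) are hypothetical. It assumes sentences are numbered from 1 and are identified by character offsets into the unmodified text, so that they can be written out as Brat-style text-bound standoff lines (ID, tab, type, start and end offsets, tab, covered text).

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch: a document as raw text plus numbered sentence spans. */
final class SentenceDocument {

    /** A sentence identified by its number and by character offsets into the raw text. */
    static final class Sentence {
        final int number; // 1-based sentence number (assumption)
        final int start;  // inclusive character offset
        final int end;    // exclusive character offset

        Sentence(int number, int start, int end) {
            this.number = number;
            this.start = start;
            this.end = end;
        }
    }

    private final String text;
    private final List<Sentence> sentences = new ArrayList<>();

    SentenceDocument(String text) {
        this.text = text;
    }

    /** Registers the next sentence; offsets refer to the unmodified document text. */
    Sentence addSentence(int start, int end) {
        if (start < 0 || end > text.length() || start >= end) {
            throw new IllegalArgumentException("bad span: " + start + "-" + end);
        }
        Sentence s = new Sentence(sentences.size() + 1, start, end);
        sentences.add(s);
        return s;
    }

    /** Returns the text covered by the given sentence span. */
    String textOf(Sentence s) {
        return text.substring(s.start, s.end);
    }

    /** Formats a span as a Brat-style text-bound line: ID<TAB>TYPE START END<TAB>TEXT. */
    String toStandoffLine(Sentence s, String type) {
        return "T" + s.number + "\t" + type + " " + s.start + " " + s.end + "\t" + textOf(s);
    }

    public static void main(String[] args) {
        SentenceDocument doc = new SentenceDocument("First sentence. Second one.");
        Sentence first = doc.addSentence(0, 15);
        Sentence second = doc.addSentence(16, 27);
        System.out.println(doc.toStandoffLine(first, "Sentence"));
        System.out.println(doc.toStandoffLine(second, "Sentence"));
    }
}
```

Keeping offsets relative to the original text, rather than storing sentence strings, is one way to let sentence numbering and standoff annotations refer to the same positions. Whether that is sufficient is one of the properties to be listed.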
3. Common dictionary format
4. Other issues
- Recommended "simstring" (from the people behind Brat) for phrase matching; it is used for normalization in Brat 1.3 (maybe something to help Christine).