I ran across "Evaluating Text Extraction Algorithms" this morning and thought it might be of interest.
BTW, I followed the URL Grant posted for the mailing list archives:
and have started at the beginning to try to come up to speed on what has
been discussed so far.
1) The message:
displays "Date: 1969-12-31", but the text of the message says: "On Tue, Sep 21, 2010 at 7:37 AM, Tommaso Teofili"
I was under the impression that this project started in 2009. Shouldn't the email archives start in 2009?
2) The message mentioned above makes reference to the OpenRelevance Viewer:
That returns a page not found message.
3) How complete are the email archives for OpenRelevance? I ask because when I view:
It has nested comments, such as the pointer to
posted by Robert Muir, and I have not found the original of that post.
I feel like I am missing part of the conversation.
Is there some other archive that I should be using?
PS: The archive is displaying messages as both plain text (<pre>) and HTML. Is that a feature?
Chair, V1 - US TAG to JTC 1/SC 34
Convener, JTC 1/SC 34/WG 3 (Topic Maps)
Editor, OpenDocument Format TC (OASIS), Project Editor ISO/IEC 26300
Co-Editor, ISO/IEC 13250-1, 13250-5 (Topic Maps)
Another Word For It (blog): http://tm.durusau.net