add support for hamshahri collection
------------------------------------

Key: ORP-2
URL: https://issues.apache.org/jira/browse/ORP-2
Project: Open Relevance Project
Issue Type: New Feature
Components: Collections, Judgments, Queries
Reporter: Robert Muir
Attachments: ORP-2.patch

this patch adds support for the hamshahri collection, approximately 160,000 persian documents.
There are two sets of corresponding queries and relevance judgements.

Also, I fixed two things in the readme instructions
1) I forgot to specify the class name of QueryDriver (the thing that actually runs the relevance tests)
I also forgot to specify -Dfile.encoding=UTF-8 for this QueryDriver (currently needed when running the command-line)
2) In the example .alg file, i forgot to specify content.source.encoding=UTF-8.


--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Search Discussions

  • Robert Muir (JIRA) at Nov 19, 2009 at 8:46 am
    [ https://issues.apache.org/jira/browse/ORP-2?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Robert Muir updated ORP-2:
    --------------------------

    Attachment: ORP-2.patch

    patch
    add support for hamshahri collection
    ------------------------------------

    Key: ORP-2
    URL: https://issues.apache.org/jira/browse/ORP-2
    Project: Open Relevance Project
    Issue Type: New Feature
    Components: Collections, Judgments, Queries
    Reporter: Robert Muir
    Attachments: ORP-2.patch


    this patch adds support for the hamshahri collection, approximately 160,000 persian documents.
    There are two sets of corresponding queries and relevance judgements.
    Also, I fixed two things in the readme instructions
    1) I forgot to specify the class name of QueryDriver (the thing that actually runs the relevance tests)
    I also forgot to specify -Dfile.encoding=UTF-8 for this QueryDriver (currently needed when running the command-line)
    2) In the example .alg file, i forgot to specify content.source.encoding=UTF-8.
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Simon Willnauer (JIRA) at Nov 19, 2009 at 8:48 am
    [ https://issues.apache.org/jira/browse/ORP-2?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Simon Willnauer reassigned ORP-2:
    ---------------------------------

    Assignee: Simon Willnauer
    add support for hamshahri collection
    ------------------------------------

    Key: ORP-2
    URL: https://issues.apache.org/jira/browse/ORP-2
    Project: Open Relevance Project
    Issue Type: New Feature
    Components: Collections, Judgments, Queries
    Reporter: Robert Muir
    Assignee: Simon Willnauer
    Attachments: ORP-2.patch


    this patch adds support for the hamshahri collection, approximately 160,000 persian documents.
    There are two sets of corresponding queries and relevance judgements.
    Also, I fixed two things in the readme instructions
    1) I forgot to specify the class name of QueryDriver (the thing that actually runs the relevance tests)
    I also forgot to specify -Dfile.encoding=UTF-8 for this QueryDriver (currently needed when running the command-line)
    2) In the example .alg file, i forgot to specify content.source.encoding=UTF-8.
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Simon Willnauer (JIRA) at Nov 19, 2009 at 8:50 am
    [ https://issues.apache.org/jira/browse/ORP-2?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12779880#action_12779880 ]

    Simon Willnauer commented on ORP-2:
    -----------------------------------

    Good stuff robert! Get some sleep I will look at it closer tomorrow.

    simon
    add support for hamshahri collection
    ------------------------------------

    Key: ORP-2
    URL: https://issues.apache.org/jira/browse/ORP-2
    Project: Open Relevance Project
    Issue Type: New Feature
    Components: Collections, Judgments, Queries
    Reporter: Robert Muir
    Assignee: Simon Willnauer
    Attachments: ORP-2.patch


    this patch adds support for the hamshahri collection, approximately 160,000 persian documents.
    There are two sets of corresponding queries and relevance judgements.
    Also, I fixed two things in the readme instructions
    1) I forgot to specify the class name of QueryDriver (the thing that actually runs the relevance tests)
    I also forgot to specify -Dfile.encoding=UTF-8 for this QueryDriver (currently needed when running the command-line)
    2) In the example .alg file, i forgot to specify content.source.encoding=UTF-8.
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Simon Willnauer (JIRA) at Nov 23, 2009 at 8:14 pm
    [ https://issues.apache.org/jira/browse/ORP-2?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781596#action_12781596 ]

    Simon Willnauer commented on ORP-2:
    -----------------------------------

    Beside the build.xml issue I guess that is ok to go for now as we plan to clean up later anyway. There are a couple of things which could be "cleaner" and we could abstract some stuff but lets get collections in there and we fix while we move forward. Thoughts?

    add support for hamshahri collection
    ------------------------------------

    Key: ORP-2
    URL: https://issues.apache.org/jira/browse/ORP-2
    Project: Open Relevance Project
    Issue Type: New Feature
    Components: Collections, Judgments, Queries
    Reporter: Robert Muir
    Assignee: Simon Willnauer
    Attachments: ORP-2.patch


    this patch adds support for the hamshahri collection, approximately 160,000 persian documents.
    There are two sets of corresponding queries and relevance judgements.
    Also, I fixed two things in the readme instructions
    1) I forgot to specify the class name of QueryDriver (the thing that actually runs the relevance tests)
    I also forgot to specify -Dfile.encoding=UTF-8 for this QueryDriver (currently needed when running the command-line)
    2) In the example .alg file, i forgot to specify content.source.encoding=UTF-8.
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.
  • Simon Willnauer (JIRA) at Nov 25, 2009 at 4:00 pm
    [ https://issues.apache.org/jira/browse/ORP-2?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Simon Willnauer resolved ORP-2.
    -------------------------------

    Resolution: Fixed

    Committed revision 884162.

    add support for hamshahri collection
    ------------------------------------

    Key: ORP-2
    URL: https://issues.apache.org/jira/browse/ORP-2
    Project: Open Relevance Project
    Issue Type: New Feature
    Components: Collections, Judgments, Queries
    Reporter: Robert Muir
    Assignee: Simon Willnauer
    Attachments: ORP-2.patch


    this patch adds support for the hamshahri collection, approximately 160,000 persian documents.
    There are two sets of corresponding queries and relevance judgements.
    Also, I fixed two things in the readme instructions
    1) I forgot to specify the class name of QueryDriver (the thing that actually runs the relevance tests)
    I also forgot to specify -Dfile.encoding=UTF-8 for this QueryDriver (currently needed when running the command-line)
    2) In the example .alg file, i forgot to specify content.source.encoding=UTF-8.
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupopenrelevance-dev @
categorieslucene
postedNov 19, '09 at 8:46a
activeNov 25, '09 at 4:00p
posts6
users1
websitelucene.apache.org...

1 user in discussion

Simon Willnauer (JIRA): 6 posts

People

Translate

site design / logo © 2018 Grokbase