FAQ
solr. icu4j for Unicode Normalization
-------------------------------------

Key: SOLR-2334
URL: https://issues.apache.org/jira/browse/SOLR-2334
Project: Solr
Issue Type: Test
Components: clients - java
Affects Versions: 1.4
Environment: debian lenny and squeez , 1386 arch
Reporter: ahmad maher
Fix For: 1.4.2


Dears,
i use icu4j for UnicodeNormalization in schema.xml like that
"
<filter class="schema.UnicodeNormalizationFilterFactory" version="icu4j" composed="false" remove_diacritics="true" remove_modifiers="true" fold="true"/>
"
and if i use any token except English tokens in filter class , it return error, like in using solr.PatternReplaceFilterFactory

how can i use :
transliterate rule and transform rule in solr schema or config file ?
as mentioned here http://userguide.icu-project.org/transforms/general

can any one help me ?

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org

Search Discussions

  • Robert Muir (JIRA) at Jan 24, 2011 at 12:46 pm
    [ https://issues.apache.org/jira/browse/SOLR-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    Robert Muir resolved SOLR-2334.
    -------------------------------

    Resolution: Not A Problem

    Hi Ahmad,

    In the trunk (to be 4.0) and branch_3x (3.1) svn repositories, take a look at the analysis-extras contrib for this.
    For example:
    http://svn.apache.org/repos/asf/lucene/dev/branches/branch_3x/solr/contrib/analysis-extras/src/java/org/apache/solr/analysis/

    In order to filter with ICU transforms, you want to use ICUTransformFilterFactory.

    It takes two parameters:
    * id (mandatory): A Transliterator ID, one from {@link Transliterator#getAvailableIDs()}
    * direction (optional): Either 'forward' or 'reverse'. Default is forward.

    solr. icu4j for Unicode Normalization
    -------------------------------------

    Key: SOLR-2334
    URL: https://issues.apache.org/jira/browse/SOLR-2334
    Project: Solr
    Issue Type: Test
    Components: clients - java
    Affects Versions: 1.4
    Environment: debian lenny and squeez , 1386 arch
    Reporter: ahmad maher
    Fix For: 1.4.2


    Dears,
    i use icu4j for UnicodeNormalization in schema.xml like that
    "
    <filter class="schema.UnicodeNormalizationFilterFactory" version="icu4j" composed="false" remove_diacritics="true" remove_modifiers="true" fold="true"/>
    "
    and if i use any token except English tokens in filter class , it return error, like in using solr.PatternReplaceFilterFactory
    how can i use :
    transliterate rule and transform rule in solr schema or config file ?
    as mentioned here http://userguide.icu-project.org/transforms/general
    can any one help me ?
    --
    This message is automatically generated by JIRA.
    -
    You can reply to this email to add a comment to the issue online.


    ---------------------------------------------------------------------
    To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
    For additional commands, e-mail: dev-help@lucene.apache.org
  • ahmad maher (JIRA) at Feb 9, 2011 at 10:13 am
    [ https://issues.apache.org/jira/browse/SOLR-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12992418#comment-12992418 ]

    ahmad maher commented on SOLR-2334:
    -----------------------------------

    thank you for replay,

    how can use Pattern Replace OR char map or replace in my solr schema file
    using the ICU4j for non English patterns or chars ?




    solr. icu4j for Unicode Normalization
    -------------------------------------

    Key: SOLR-2334
    URL: https://issues.apache.org/jira/browse/SOLR-2334
    Project: Solr
    Issue Type: Test
    Components: clients - java
    Affects Versions: 1.4
    Environment: debian lenny and squeez , 1386 arch
    Reporter: ahmad maher
    Fix For: 1.4.2


    Dears,
    i use icu4j for UnicodeNormalization in schema.xml like that
    "
    <filter class="schema.UnicodeNormalizationFilterFactory" version="icu4j" composed="false" remove_diacritics="true" remove_modifiers="true" fold="true"/>
    "
    and if i use any token except English tokens in filter class , it return error, like in using solr.PatternReplaceFilterFactory
    how can i use :
    transliterate rule and transform rule in solr schema or config file ?
    as mentioned here http://userguide.icu-project.org/transforms/general
    can any one help me ?
    --
    This message is automatically generated by JIRA.
    -
    For more information on JIRA, see: http://www.atlassian.com/software/jira



    ---------------------------------------------------------------------
    To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
    For additional commands, e-mail: dev-help@lucene.apache.org
  • ahmad maher (JIRA) at Feb 14, 2011 at 9:24 am
    [ https://issues.apache.org/jira/browse/SOLR-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12992418#comment-12992418 ]

    ahmad maher edited comment on SOLR-2334 at 2/14/11 9:22 AM:
    ------------------------------------------------------------

    thank you for replay,

    how can use Pattern Replace OR char map or replace in my solr schema file
    using the ICU4j for non English patterns or chars ?
    OR
    adding Arabic Normalization
    <filter class="solr.ArabicNormalizationFilterFactory"/>






    was (Author: amd_maher):
    thank you for replay,

    how can use Pattern Replace OR char map or replace in my solr schema file
    using the ICU4j for non English patterns or chars ?




    solr. icu4j for Unicode Normalization
    -------------------------------------

    Key: SOLR-2334
    URL: https://issues.apache.org/jira/browse/SOLR-2334
    Project: Solr
    Issue Type: Test
    Components: clients - java
    Affects Versions: 1.4
    Environment: debian lenny and squeez , 1386 arch
    Reporter: ahmad maher
    Fix For: 1.4.2


    Dears,
    i use icu4j for UnicodeNormalization in schema.xml like that
    "
    <filter class="schema.UnicodeNormalizationFilterFactory" version="icu4j" composed="false" remove_diacritics="true" remove_modifiers="true" fold="true"/>
    "
    and if i use any token except English tokens in filter class , it return error, like in using solr.PatternReplaceFilterFactory
    how can i use :
    transliterate rule and transform rule in solr schema or config file ?
    as mentioned here http://userguide.icu-project.org/transforms/general
    can any one help me ?
    --
    This message is automatically generated by JIRA.
    -
    For more information on JIRA, see: http://www.atlassian.com/software/jira



    ---------------------------------------------------------------------
    To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
    For additional commands, e-mail: dev-help@lucene.apache.org
  • ahmad maher (JIRA) at Feb 14, 2011 at 9:34 am
    [ https://issues.apache.org/jira/browse/SOLR-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12992418#comment-12992418 ]

    ahmad maher edited comment on SOLR-2334 at 2/14/11 9:32 AM:
    ------------------------------------------------------------

    thank you for replay,

    how can use Pattern Replace OR char map or replace in my solr schema file
    using the ICU4j for non English patterns or chars ?







    was (Author: amd_maher):
    thank you for replay,

    how can use Pattern Replace OR char map or replace in my solr schema file
    using the ICU4j for non English patterns or chars ?
    OR
    adding Arabic Normalization
    <filter class="solr.ArabicNormalizationFilterFactory"/>





    solr. icu4j for Unicode Normalization
    -------------------------------------

    Key: SOLR-2334
    URL: https://issues.apache.org/jira/browse/SOLR-2334
    Project: Solr
    Issue Type: Test
    Components: clients - java
    Affects Versions: 1.4
    Environment: debian lenny and squeez , 1386 arch
    Reporter: ahmad maher
    Fix For: 1.4.2


    Dears,
    i use icu4j for UnicodeNormalization in schema.xml like that
    "
    <filter class="schema.UnicodeNormalizationFilterFactory" version="icu4j" composed="false" remove_diacritics="true" remove_modifiers="true" fold="true"/>
    "
    and if i use any token except English tokens in filter class , it return error, like in using solr.PatternReplaceFilterFactory
    how can i use :
    transliterate rule and transform rule in solr schema or config file ?
    as mentioned here http://userguide.icu-project.org/transforms/general
    can any one help me ?
    --
    This message is automatically generated by JIRA.
    -
    For more information on JIRA, see: http://www.atlassian.com/software/jira



    ---------------------------------------------------------------------
    To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
    For additional commands, e-mail: dev-help@lucene.apache.org
  • ahmad maher (JIRA) at Feb 14, 2011 at 11:44 am
    [ https://issues.apache.org/jira/browse/SOLR-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12994269#comment-12994269 ]

    ahmad maher commented on SOLR-2334:
    -----------------------------------

    can you give an example ?
    solr. icu4j for Unicode Normalization
    -------------------------------------

    Key: SOLR-2334
    URL: https://issues.apache.org/jira/browse/SOLR-2334
    Project: Solr
    Issue Type: Test
    Components: clients - java
    Affects Versions: 1.4
    Environment: debian lenny and squeez , 1386 arch
    Reporter: ahmad maher
    Fix For: 1.4.2


    Dears,
    i use icu4j for UnicodeNormalization in schema.xml like that
    "
    <filter class="schema.UnicodeNormalizationFilterFactory" version="icu4j" composed="false" remove_diacritics="true" remove_modifiers="true" fold="true"/>
    "
    and if i use any token except English tokens in filter class , it return error, like in using solr.PatternReplaceFilterFactory
    how can i use :
    transliterate rule and transform rule in solr schema or config file ?
    as mentioned here http://userguide.icu-project.org/transforms/general
    can any one help me ?
    --
    This message is automatically generated by JIRA.
    -
    For more information on JIRA, see: http://www.atlassian.com/software/jira



    ---------------------------------------------------------------------
    To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
    For additional commands, e-mail: dev-help@lucene.apache.org
  • ahmad maher (JIRA) at Feb 14, 2011 at 11:44 am
    [ https://issues.apache.org/jira/browse/SOLR-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

    ahmad maher updated SOLR-2334:
    ------------------------------

    Comment: was deleted

    (was: thank you for replay,

    how can use Pattern Replace OR char map or replace in my solr schema file
    using the ICU4j for non English patterns or chars ?





    )
    solr. icu4j for Unicode Normalization
    -------------------------------------

    Key: SOLR-2334
    URL: https://issues.apache.org/jira/browse/SOLR-2334
    Project: Solr
    Issue Type: Test
    Components: clients - java
    Affects Versions: 1.4
    Environment: debian lenny and squeez , 1386 arch
    Reporter: ahmad maher
    Fix For: 1.4.2


    Dears,
    i use icu4j for UnicodeNormalization in schema.xml like that
    "
    <filter class="schema.UnicodeNormalizationFilterFactory" version="icu4j" composed="false" remove_diacritics="true" remove_modifiers="true" fold="true"/>
    "
    and if i use any token except English tokens in filter class , it return error, like in using solr.PatternReplaceFilterFactory
    how can i use :
    transliterate rule and transform rule in solr schema or config file ?
    as mentioned here http://userguide.icu-project.org/transforms/general
    can any one help me ?
    --
    This message is automatically generated by JIRA.
    -
    For more information on JIRA, see: http://www.atlassian.com/software/jira



    ---------------------------------------------------------------------
    To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
    For additional commands, e-mail: dev-help@lucene.apache.org
  • ahmad maher (JIRA) at Feb 14, 2011 at 11:50 am
    [ https://issues.apache.org/jira/browse/SOLR-2334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12994269#comment-12994269 ]

    ahmad maher edited comment on SOLR-2334 at 2/14/11 11:48 AM:
    -------------------------------------------------------------

    can you give an example - how can i use it in solr schema file ?

    was (Author: amd_maher):
    can you give an example ?
    solr. icu4j for Unicode Normalization
    -------------------------------------

    Key: SOLR-2334
    URL: https://issues.apache.org/jira/browse/SOLR-2334
    Project: Solr
    Issue Type: Test
    Components: clients - java
    Affects Versions: 1.4
    Environment: debian lenny and squeez , 1386 arch
    Reporter: ahmad maher
    Fix For: 1.4.2


    Dears,
    i use icu4j for UnicodeNormalization in schema.xml like that
    "
    <filter class="schema.UnicodeNormalizationFilterFactory" version="icu4j" composed="false" remove_diacritics="true" remove_modifiers="true" fold="true"/>
    "
    and if i use any token except English tokens in filter class , it return error, like in using solr.PatternReplaceFilterFactory
    how can i use :
    transliterate rule and transform rule in solr schema or config file ?
    as mentioned here http://userguide.icu-project.org/transforms/general
    can any one help me ?
    --
    This message is automatically generated by JIRA.
    -
    For more information on JIRA, see: http://www.atlassian.com/software/jira



    ---------------------------------------------------------------------
    To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
    For additional commands, e-mail: dev-help@lucene.apache.org

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupdev @
categorieslucene
postedJan 24, '11 at 9:41a
activeFeb 14, '11 at 11:50a
posts8
users1
websitelucene.apache.org

1 user in discussion

ahmad maher (JIRA): 8 posts

People

Translate

site design / logo © 2022 Grokbase