FAQ

Search Discussions

37 discussions - 116 posts

  • Hi, The default Lucene core jar contains no the smartcn analyzer. How can I include it into the core jar. Thanks!
    ChengCheng
    Sep 6, 2012 at 2:05 pm
    Sep 6, 2012 at 2:54 pm
  • I have a key field that will only ever have a length of 3 characters. I am using a StandardAnalyzer and a QueryParser to create the Query (parser.parse(string)), and an IndexReader and IndexSearcher ...
    Edward W. RouseEdward W. Rouse
    Sep 26, 2012 at 8:47 pm
    Sep 27, 2012 at 1:37 pm
  • I am updating an analyzer that uses a particular configuration of the PerFieldAnalyzerWrapper to work with Lucene 4.0. A few of the fields use a custom analyzer and StandardTokenizer and the other ...
    Mike O'LearyMike O'Leary
    Sep 25, 2012 at 11:59 pm
    Sep 26, 2012 at 5:56 pm
  • I'm building documentation from the Lucene 4.0.0-BETA source (though this was also an issue with the ALPHA source), and the output has null characters in it. I believe that this is because the source ...
    Mark ParkerMark Parker
    Sep 6, 2012 at 5:54 pm
    Sep 6, 2012 at 8:15 pm
  • Hello, I am currently discussing the possibilities of introducing Hibernate Search (Lucene) into an existing Java Web Project with existing Hibernate Layer. Hibernate Queries are quite complex and ...
    Robert StreitbergerRobert Streitberger
    Sep 12, 2012 at 12:46 pm
    Sep 17, 2012 at 3:10 pm
  • Hi, Imagine you are indexing the following documents (every line is stored in 1 single field, analyzed with the default StandardAnalyzer): - Doc 1: restaurant 't Robbeke fish passoa beer 15 EUR 5 EUR ...
    Jochen HebbrechtJochen Hebbrecht
    Sep 7, 2012 at 7:33 am
    Sep 7, 2012 at 11:23 am
  • Hi All, I have an algorithm by which i measure the importance of a term in a document . While indexing i want to store weight with respect to that term for the document. Any idea how to do it . Will ...
    Parnab kumarParnab kumar
    Sep 29, 2012 at 4:24 pm
    Oct 1, 2012 at 2:39 pm
  • I'm currently giving the user an option to include stop words or not when filtering a body of text for ngram frequencies. Typically, this is done as follows: snowballAnalyzer = new ...
    Martin O'SheaMartin O'Shea
    Sep 19, 2012 at 8:25 am
    Sep 20, 2012 at 9:22 am
  • I am processing a bunch of text coming out of OCR, i.e. it's machine-generated text that contains some errors like garbage characters attached to words, letters replaced with similarly looking ...
    Ilya ZavorinIlya Zavorin
    Sep 17, 2012 at 2:42 pm
    Sep 17, 2012 at 4:40 pm
  • Hey Everyone, I'm building a solr store on version 3.6.1 and I encounter the following error when the system gets to about 1,000,000 documents ...
    Gully BurnsGully Burns
    Sep 10, 2012 at 1:04 am
    Sep 10, 2012 at 6:40 pm
  • I have a web java/jsp application running on Apache Tomcat server. In this web application I have used lucene, to index and calculate similrarity between some PDF documents(PDF documents are in the ...
    Kasun PereraKasun Perera
    Sep 7, 2012 at 5:11 am
    Sep 10, 2012 at 4:49 pm
  • Take a look at this query : -HOSTNAME:ram AND SEVERITY:information The above query isn't giving me the intended results. I know we need to append a * : * to a query that it is totally negative, also ...
    Ramprakash RamamoorthyRamprakash Ramamoorthy
    Sep 5, 2012 at 3:25 pm
    Sep 8, 2012 at 11:14 am
  • Doing Lucene search within a jetty servlet container, the machine has 16gb of memory. Using 64bit JVM and Lucene 3.6 and files are memory mapped so I just allocate a max of 512mb to jetty itself, ...
    Paul TaylorPaul Taylor
    Sep 25, 2012 at 6:47 pm
    Sep 27, 2012 at 9:26 am
  • Hello. my project may require the tree style category info, how to store it so all leaf docs under some category node could be retrieved ? in thought, planing to store the vertical category info in ...
    秋水秋水
    Sep 20, 2012 at 2:02 am
    Sep 20, 2012 at 3:37 am
  • Hello All, I've a issue with respect to the distance measure of SpanNearQuery in Lucene. Let's say I've following two documents: DocID: 6, cotent:"1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 ...
    VempapVempap
    Sep 19, 2012 at 6:37 pm
    Sep 19, 2012 at 9:33 pm
  • Hi all how to disable the field cache? 王惠达 (PHP开发工程师, Sysdev Team) ------------------------- 分机:8836 QQ:429335915 mobile: 13795449454 E-mail: <span class="m_body_email_addr" ...
    惠达 王惠达 王
    Sep 13, 2012 at 7:53 am
    Sep 18, 2012 at 8:15 pm
  • Hello, This is my first time writing to the list. I am a Java developer, writing a personal project using Lucene, and so far have been very happy with the library (v4BETA). However, I have recently ...
    Yann-Erwan PerioYann-Erwan Perio
    Sep 9, 2012 at 9:56 am
    Sep 9, 2012 at 10:25 am
  • If a Lucene ShingleFilter can be used to tokenize a string into shingles, or ngrams, of different sizes, e.g.: "please divide this sentence into shingles" Becomes: shingles "please divide", "divide ...
    Martin O'SheaMartin O'Shea
    Sep 4, 2012 at 4:37 pm
    Sep 6, 2012 at 11:46 pm
  • Hi, I have a ram based index which occasionally needs to be persistent with a disk based index. Every time the size doubles which eats up my disk space quickly. Below is the code. Could someone help ...
    ChengCheng
    Sep 28, 2012 at 12:57 am
    Oct 1, 2012 at 4:34 am
  • Hi, For Lucene core 4.0. BETA, under the search.similarities help page it says the following "To change ...
    Sachin KulkarniSachin Kulkarni
    Sep 5, 2012 at 12:59 pm
    Sep 29, 2012 at 5:02 am
  • Hi, in case someone is interested in an application of the Lucene indexing engine in the field of corpus linguistics rather than information retrieval: we have worked on that subject for some time ...
    Carsten SchnoberCarsten Schnober
    Sep 26, 2012 at 1:59 pm
    Sep 26, 2012 at 2:04 pm
  • Hi, to migrate my lucene application to the new lucene Version I have to change all deprecated function to the recommended functions. Now I am looking for a solution to migrate the ...
    Naber, PeterNaber, Peter
    Sep 24, 2012 at 11:22 am
    Sep 25, 2012 at 11:06 am
  • hi all: I want to know that why transform numeric to string? public static int longToPrefixCoded(final long val, final int shift, final BytesRef bytes) { if (shift 63 || shift<0) throw new ...
    惠达 王惠达 王
    Sep 21, 2012 at 9:54 am
    Sep 24, 2012 at 7:40 am
  • According to the lucene file formats, it is only .tii and couple more files that are read fully in the memory. I am trying to understand why does giving more JVM memory to lucene makes it run ...
    Maneesha JainManeesha Jain
    Sep 20, 2012 at 9:57 pm
    Sep 21, 2012 at 9:16 am
  • Hi, I'm trying to index a big set of plain text files, almost 8,104,467 files, that are all under the same directory /media/MAFALDA/yohasebewp2txt/Archivos and want to get my index under ...
    Reyna MelaraReyna Melara
    Sep 19, 2012 at 5:45 pm
    Sep 19, 2012 at 8:00 pm
  • most sentences around Lucene what I searched out aren't compiled correctly. wondering if we build our local mailing list...
    SdrkyjSdrkyj
    Sep 14, 2012 at 2:13 pm
    Sep 14, 2012 at 2:42 pm
  • Hi, problems with the Fliter instance on version 3.6.1. When add an instance of Filter as a parameter on the IndexSearcher.search() method, like search(query, filter, n)it gets no result, for which ...
    SdrkyjSdrkyj
    Sep 13, 2012 at 7:20 am
    Sep 13, 2012 at 9:00 am
  • Hi! Is there a reason why the Document object is no longer serializable in 4.0 ? Is it because it has become a more complex object as to 3.x version ? This is breaking some 3.x lucene code that ...
    Nagendra NagarajayyaNagendra Nagarajayya
    Sep 11, 2012 at 3:32 am
    Sep 11, 2012 at 6:08 am
  • I want to use the Lucene to implement a Servlet in Tomcat environment. lucene-core-3.5.0.jar is in the webapp/WEB-INF/lib/ However I still got the error as following: Java.lang.NoSuchMethodError ...
    NeoskyNeosky
    Sep 10, 2012 at 5:35 pm
    Sep 10, 2012 at 7:16 pm
  • Hello, I have upgraded my lucene from 2.4.0 to 3.6.0, While i am packaging my application using tg2exe the size get increased. Is there any way to minimize the lucene? Thanks, Antony ...
    Antony JosephAntony Joseph
    Sep 5, 2012 at 2:56 pm
    Sep 7, 2012 at 9:23 am
  • I was looking at IndexWriter.commit(commitUserData) and IndexCommit.getUserData() as possible ways to save metadata about documents in an index, but I realized that the metadata we are looking at ...
    Mike O'LearyMike O'Leary
    Sep 21, 2012 at 8:24 pm
    Sep 21, 2012 at 8:24 pm
  • some analyzers are languages specific and not probable on tagged text analyze, while some analyzers may analyze tagged text mainly in English. I haven't dig much in the tagged text analyze yet, but ...
    秋水秋水
    Sep 20, 2012 at 2:44 am
    Sep 20, 2012 at 2:44 am
  • Hi, there is a specific list for pylucene? I'm having this problem when trying to install pylucene: /bin/sh: 1: ivy-fail: not found /bin/sh: 1: ivy-bootstrap: not found Somebody can help me?
    Juliano Fischer NavesJuliano Fischer Naves
    Sep 20, 2012 at 2:30 am
    Sep 20, 2012 at 2:30 am
  • Hi, I'm facing the following problem to install JCC (to posteriorly install pylucene): sudo python setup.py build */usr/lib/python2.7/distutils/extension.py:133: UserWarning: Unknown Extension ...
    Juliano Fischer NavesJuliano Fischer Naves
    Sep 14, 2012 at 9:09 pm
    Sep 14, 2012 at 9:09 pm
  • Hi to all, I used pruning package with LA Times collection. The initial LA Times index is created by lucene benchmark/conf/*.alg. Luke shows 131896 documents with 635614 terms for initial index. I ...
    Zeynep P.Zeynep P.
    Sep 14, 2012 at 12:27 pm
    Sep 14, 2012 at 12:27 pm
  • thanks for your reply.in fact, just the cacheTermsFilter doesn't work properly. the mail web client is in Chinese, so applys to the reply content, ha. ----- 原始邮件 ----- 发件人:Ian Lea <<span ...
    SdrkyjSdrkyj
    Sep 13, 2012 at 10:04 am
    Sep 13, 2012 at 10:04 am
  • Hello all, We have identified almost 9 Lucene ports in 8 different programming languages. Is anything missed out? Are these projects are active? Refer to the link ...
    AdityaAditya
    Sep 7, 2012 at 9:55 am
    Sep 7, 2012 at 9:55 am
Group Navigation
period‹ prev | Sep 2012 | next ›
Group Overview
groupjava-user @
categorieslucene
discussions37
posts116
users51
websitelucene.apache.org

51 users for September 2012

Jack Krupansky: 8 posts Cheng: 7 posts 齐保元: 7 posts Ian Lea: 6 posts Mike O'Leary: 5 posts Edward W. Rouse: 4 posts Martin O'Shea: 4 posts 秋水: 4 posts Sdrkyj: 3 posts Chris Male: 3 posts Gully Burns: 3 posts Jochen Hebbrecht: 3 posts Robert Muir: 3 posts Uwe Schindler: 3 posts Aditya: 2 posts Chris Hostetter: 2 posts Erick Erickson: 2 posts Ilya Zavorin: 2 posts Juliano Fischer Naves: 2 posts Mark Parker: 2 posts
show more
Archives