71 discussions - 296 posts

  • Hello everyone, I am starting to understand Lucene in Java and I am having a hard time implementing it. I am trying to develop a Java application that can do indexing, searching and whatnot. and ...
    Jul 28, 2010 at 4:06 am
    Aug 13, 2010 at 7:58 am
  • Any hints on making something like an InverseWildcardQuery? We're trying to find all documents that have at least one field that doesn't match the wildcard query. Or is there a way to inverse any ...
    Jul 30, 2010 at 2:30 pm
    Jul 30, 2010 at 8:31 pm
  • Hi, A customer has been indexing a very large collection of documents that has been running over many days using 2.9.0. During the optimisation stage, the following error occurred, and now the index ...
    David Sitsky
    Jul 26, 2010 at 4:22 am
    Jul 29, 2010 at 10:33 am
  • I'm having trouble with the IndexReader class as per below: (using lucene 2.9.1) RAMDirectory dir = new RAMDirectory(); createIndex(dir); IndexReader reader = IndexReader.open(dir); IndexReader ...
    Gregory Tarr
    Jul 30, 2010 at 8:16 am
    Aug 21, 2010 at 12:56 am
  • Hi! We are using lucene in our project to search through information objects which works fine. For indexing we use the StandardAnalyzer. Now, we have to support the Chinese language. I found out that ...
    Kolhoff, Jacqueline - ENCOWAY
    Jul 1, 2010 at 9:20 am
    Jul 2, 2010 at 7:10 am
  • We're getting up there in terms of corpus size for our Lucene indexing application: * 20 million documents * all fields need to be stored * 10 short fields / document * 1 long free text field / ...
    Christopher Condit
    Jul 13, 2010 at 9:54 pm
    Jul 16, 2010 at 8:35 am
  • Hi, Is there a way to get all the fields involved in a query? Thanks Anuj
    Anuj Shah
    Jul 31, 2010 at 1:42 pm
    Jan 16, 2013 at 9:21 pm
  • Hi, I just performed two queries which, in my opinion, should lead to the same document rankings. However, the document ranking differ between these two queries. For better understanding I prepared ...
    Jul 21, 2010 at 9:27 pm
    Jul 27, 2010 at 3:31 pm
  • The index file is ill-formatted because the disk filled up during feeding. Can I roll back to the last version? Is there any way to avoid unexpected errors when indexing? Attached is my segment_N
    Li Li
    Jul 7, 2010 at 2:43 am
    Jul 7, 2010 at 3:34 am
  • We recently upgraded from lucene 2.4.0 to lucene 3.0.2. Our load testing revealed a serious performance drop specific to traversing the list of terms and their associated documents for a given ...
    Nader, John P
    Jul 28, 2010 at 6:40 pm
    Aug 19, 2010 at 9:39 am
  • Hi, I want to rank my results only on parts of my query. E.g my query is "TITLE:Lucene AND AUTHOR:Manning". After this query standard lucene ranking for both fields take place. However, is it ...
    Jul 31, 2010 at 8:05 am
    Aug 1, 2010 at 7:44 am
  • Hello, we are trying to implement a query type for Lucene (with eventual target being Solr) where the query string passed in needs to be "filtered" through a large list of document IDs per user. We ...
    Martin J
    Jul 21, 2010 at 12:38 pm
    Jul 23, 2010 at 5:53 pm
  • Hi all, Consider the following string: "the buffalo buffaloes" [1]. When passed through a stemming analyzer, the resulting token would be "buffalo buffalo" (assuming a good stemmer). To enable exact ...
    Itamar Syn-Hershko
    Jul 16, 2010 at 3:30 pm
    Jul 22, 2010 at 9:45 pm
  • Hi, I would like to continuously iterate over the documents in my lucene index as the index is updated. Kind of like a "stream" of documents. Is there a way I can achieve this? Would something like ...
    Max Lynch
    Jul 13, 2010 at 9:18 pm
    Jul 15, 2010 at 3:40 pm
  • Hello Friends; Recently, I have had a memory problem with Lucene search because the index file is so big. (I have indexed some kinds of information and this index file's size is nearly ...
    Ilkay polat
    Jul 14, 2010 at 10:45 am
    Jul 15, 2010 at 12:52 am
  • I am trying to search for a value which begins with a '$' or even sometimes '$$'. '$' is not listed as a special character and no matter what I try, I cannot get a search for $* to return anything. ...
    Nathaniel Auvil
    Jul 1, 2010 at 6:57 pm
    Jul 9, 2010 at 6:57 pm
  • Hello everybody, I am reading the file format paper and I check it against a created index. The documentation says: TermInfoIndex (.tii)-- TIVersion, IndexTermCount, IndexInterval, SkipInterval, ...
    Alexander vom Berg
    Jul 21, 2010 at 8:52 am
    Jul 27, 2010 at 6:20 pm
  • Hi all! Hmm, I need to find out how important a word is across the entire document collection indexed in the Lucene index. I need to extract some "representable words", let's say concepts that are ...
    Jul 23, 2010 at 2:44 am
    Jul 23, 2010 at 11:44 am
  • Hi, I'm about to write an application that does very simple text analysis, namely dictionary-based entity extraction. The alternative is to do in-memory matching with substring: String text; // could ...
    Geir Gullestad Pettersen
    Jul 22, 2010 at 10:31 pm
    Jul 28, 2010 at 8:54 am
  • Hi, Normally, when I am building my index directory for indexed documents, I used to keep my indexed files simply in a directory called 'filesToIndex'. So in this case, I do not use any standar ...
    Manjula wijewickrema
    Jul 23, 2010 at 5:46 am
    Jul 28, 2010 at 6:43 am
  • Hi all, I have an interesting problem...instead of going from a query to a document collection, is it possible to come up with the best fit query for a given document collection (results)? "Best fit" ...
    Jul 23, 2010 at 6:31 am
    Jul 23, 2010 at 12:33 pm
  • I am using lucene 2.9.3 (via Solr 1.4.1) on windows and am trying to understand ShingleFilter. I wrote the following code and find that if I provide more words than the actual phrase indexed in the ...
    Ethan Collins
    Jul 13, 2010 at 7:43 am
    Jul 14, 2010 at 10:00 am
  • Hi, I run a single programme to see the way of scoring by Lucene for single indexed document. The explain() method gave me the following results. ******************* Searching for 'metaphysics' ...
    Manjula wijewickrema
    Jul 9, 2010 at 7:22 am
    Jul 12, 2010 at 7:49 am
  • Hi, In my application, I input only one index file and enter only a single-term query to check the Lucene score. I used the explain method to see how the results are obtained and the system gave me the result ...
    Manjula wijewickrema
    Jul 8, 2010 at 3:45 am
    Jul 9, 2010 at 10:31 am
  • Hi, For Lucene 3.0.2, issue LUCENE-2421 ( https://issues.apache.org/jira/browse/LUCENE-2421) changed NativeFSLock.release to not raise an exception if a write.lock file could not be deleted since the ...
    Ted McFadden
    Jul 7, 2010 at 5:59 am
    Jul 8, 2010 at 1:25 pm
  • Hi, In my application, I input only a single-term query (one at a time) and get back the corresponding scores for those queries. But I am struggling a little to understand Lucene scoring. I have ...
    Manjula wijewickrema
    Jul 5, 2010 at 9:03 am
    Jul 7, 2010 at 8:36 am
  • Hi all, Is it possible to run a search over top 100,000 (for example) results of a prior search. So if the user first does the search, gets results, if pressing on the search button again, I would ...
    Liat oren
    Jul 6, 2010 at 8:33 am
    Jul 6, 2010 at 9:06 am
  • Hi All, I'm trying to use the patch for testing, provided in the issue. I downloaded the patch and the dependency LUCENE-2453 (https://issues.apache.org/jira/browse/LUCENE-2453). I tested this ...
    Utku Can Topçu
    Jul 23, 2010 at 5:00 pm
    Aug 17, 2010 at 7:26 pm
  • Hi, I heard work is being done on rewriting MultiPassIndexSplitter so that it will be single-pass and work more quickly. I was wondering whether this is already done, or when it is due? Thanks
    Yatir Ben Shlomo
    Jul 22, 2010 at 2:54 pm
    Aug 5, 2010 at 5:15 pm
  • Hi, for some queries I'm only interested in the number of matching documents. Is there a better/faster way to perform such a query, instead of retrieving all TopDocs and counting the number of ...
    Jul 26, 2010 at 1:19 pm
    Jul 26, 2010 at 3:06 pm
  • Hey All, I am using Apache Lucene (2.9.1) and it's fast and it works great! I have a question in connection with Apache PDFBox. The following command creates a Lucene Document from a PDF file: ...
    Joe Hansen
    Jul 19, 2010 at 10:32 pm
    Jul 20, 2010 at 12:08 am
  • Hi, I'm trying to run ant task "generate-maven-artifacts" in lucene-solr build.xml file. But getting this error: /home/chardex/lucene/dev/lucene/common-build.xml:312: Error deploying artifact ...
    Pavel Minchenkov
    Jul 16, 2010 at 3:36 pm
    Jul 19, 2010 at 4:43 pm
  • Hello, I'm a newbie to Lucene and before starting to play with it I would like to know whether it fits my application. I have a collection of XML documents demarcated with respect to a stable XML ...
    Jul 15, 2010 at 3:09 pm
    Jul 16, 2010 at 10:09 am
  • Hi, I have seen that, once the field length of a document goes over a certain limit ( http://lucene.apache.org/java/2_9_3/api/all/org/apache/lucene/index/IndexWriter.html#DEFAULT_MAX_FIELD_LENGTH ...
    Manjula wijewickrema
    Jul 12, 2010 at 8:01 am
    Jul 13, 2010 at 4:43 pm
  • I am extremely impressed with Lucene and would like to thank Naveen and Otis for your kind help. I am not really a Java person, I am a perl and C++ guy and my website is done with mod_perl. So, my ...
    Igor Chudov
    Jul 9, 2010 at 5:18 am
    Jul 9, 2010 at 7:37 am
  • Hi, what would be the fastest way to get all terms for all documents matching a specific query? So far I: 1.) Query the index 2.) Retrieve all scoreDocs 3.) Iterate the scoreDocs and retrieve all ...
    Jul 27, 2010 at 12:51 pm
    Jul 28, 2010 at 1:16 pm
  • Consider the following two documents which I have added to my index: doc.add( new Field("text", "hello world", Field.Store.YES, Using the StandardQueryParser I can retrieve my document with either of ...
    Geir Gullestad Pettersen
    Jul 27, 2010 at 8:19 pm
    Jul 27, 2010 at 9:10 pm
  • Hi, is there a possibility to retrieve the lengthNorm for all (or a specific) fields in a specific document? Regards, Philippe
    Jul 19, 2010 at 1:54 pm
    Jul 19, 2010 at 2:28 pm
  • Hi there, I have been recently trying to build a lucene index out of ngrams and seem to have stumbled on to a number of issues. I first tried to use the NGramTokenizer, but that thing apparently only ...
    Jul 17, 2010 at 8:30 pm
    Jul 17, 2010 at 9:53 pm
  • I'm examining the following search problem. Consider a document with two multi-value fields. Document doc = new Document(); doc.add(new Field("f1", "a1", Field.Store.YES, Field.Index.ANALYZED)); ...
    Hans-Gunther Birken
    Jul 9, 2010 at 12:44 pm
    Jul 9, 2010 at 6:39 pm
  • Hello, My name is Igor and I own a website algebra.com. I just joined. I have a database of answered algebra questions (208,000 and growing). A typical question is here (original spelling): ``who ...
    Igor Chudov
    Jul 8, 2010 at 10:14 pm
    Jul 9, 2010 at 5:12 am
  • I used to store the full text in the Lucene index. But I found that merging is very slow, because merging two segments copies the fdt files into a new one. So I want to only index the full text. But ...
    Li Li
    Jul 7, 2010 at 6:09 am
    Jul 7, 2010 at 6:30 am
  • it is said that "At a few thousand ~160 characters long documents InstantiatedIndex outperforms RAMDirectory some 50x, 15x at 100 documents of 2000 characters length, and is linear to RAMDirectory at ...
    Li Li
    Jul 2, 2010 at 6:34 am
    Jul 7, 2010 at 2:59 am
  • Hello All, Can someone explain to me how fielded queries work with phrases? My first thought is that the phrase is broken down into terms and those terms are then fielded and separated with the AND ...
    Thomas Nguyen
    Jul 6, 2010 at 8:20 pm
    Jul 7, 2010 at 2:46 am
  • Hi all. I've been dealing with a small problem when searching and trying to sort and filter on a NumericField using Lucene 2.9.2; the result never comes back as expected. Here are some snippets of my ...
    Eduardo Pierdant
    Jul 6, 2010 at 5:49 pm
    Jul 6, 2010 at 6:43 pm
  • Hi, I am currently working on a Lucene module that makes use of controlled SKOS vocabularies (http://www.w3.org/TR/skos-primer/) during index and search time. It should work similar to Lucene's ...
    Bernhard Haslhofer
    Jul 6, 2010 at 1:03 pm
    Jul 6, 2010 at 2:37 pm
  • Working on the nightly build of solr and lucene - MultiPhraseQuery throws ArrayIndexOutOfBounds Exception for the words defined as synonyms SEVERE: java.lang.ArrayIndexOutOfBoundsException: 5 at ...
    Jayendra patil
    Jul 30, 2010 at 3:21 pm
    Jul 30, 2010 at 6:11 pm
  • Hi, I'm trying to implement a query for phrases without strict ordering and with missing words. At the moment, I'm trying the Spans infrastructure and this problem just arose. NearSpansOrdered's ...
    Santiago M. Mola
    Jul 29, 2010 at 9:24 am
    Jul 29, 2010 at 10:49 am
  • Hi, can anyone clarify the difference between a Lucene index and a database index? I am just trying to understand how Lucene stores its index, the way databases store indexes as B-trees. Thanks in advance, ...
    Jul 27, 2010 at 2:22 am
    Jul 27, 2010 at 8:21 am
Group Overview
group: java-user @

97 users for July 2010

  • Michael McCandless: 19 posts
  • Ian Lea: 17 posts
  • Manjula wijewickrema: 13 posts
  • Erick Erickson: 11 posts
  • Philippe: 11 posts
  • Uwe Schindler: 11 posts
  • Li Li: 9 posts
  • David Sitsky: 7 posts
  • Justin: 7 posts
  • Grant Ingersoll: 6 posts
  • Jg lin: 6 posts
  • Shai Erera: 6 posts
  • Findbestopensource: 5 posts
  • Itamar Syn-Hershko: 5 posts
  • Steven A Rowe: 5 posts
  • Ahmet Arslan: 4 posts
  • Ethan Collins: 4 posts
  • Geir Gullestad Pettersen: 4 posts
  • Ilkay polat: 4 posts
  • Ivan Provalov: 4 posts