Search Discussions

78 discussions - 309 posts

  • Hi, I'm trying to get the terms that match a certain RegexpQuery. My (naive) approach: 1. Create a RegexpQuery from the queryString (e.g. "abc.*"): Query q = new RegexpQuery(new Term("text", ...
    Carsten SchnoberCarsten Schnober
    Mar 11, 2013 at 10:41 am
    Mar 12, 2013 at 10:26 am
  • Hi all, How can I filter suggestions based on some value from the indexed field? I have a stored 'id' field in my index and I want to use that to examine documents where the suggestion was found, but ...
    Bratislav StojanovicBratislav Stojanovic
    Mar 14, 2013 at 9:36 pm
    Mar 22, 2013 at 11:49 am
  • Hi All, I've just begun to get my feet wet with Lucene and have a few simple questions: 1. Must the index writer read and index files on disk, or can i create documents in memory and ask the writer ...
    Mar 27, 2013 at 3:25 am
    Mar 29, 2013 at 3:34 pm
  • The wiki at http://wiki.apache.org/lucene-java/ has come under attack by spammers more frequently of late, so the PMC has decided to lock it down in an attempt to reduce the work involved in tracking ...
    Steve RoweSteve Rowe
    Mar 25, 2013 at 3:16 am
    Mar 25, 2013 at 8:17 pm
  • 11


    Anybody knows what is happening to Luke? Staying with the latest Lucene has become a risk since Luke seems to stay behind of the Lucene development. For 4.1 it was possible to find a patch on github, ...
    Wouter HeijkeWouter Heijke
    Mar 13, 2013 at 9:11 am
    Mar 18, 2013 at 5:59 am
  • We have a system that is using lucene and the searches are very slow. The number of documents is fairly small (less than 30,000) and each document is typically only 2 to 10 kilo-characters. Yet, ...
    Scott SmithScott Smith
    Mar 15, 2013 at 10:16 pm
    Mar 16, 2013 at 7:28 pm
  • I am new to Lucene and going through the Lucene in Action 2nd edition book. I have a quick question on the best way to add fields to a document now that Field.Index is deprecated. Here is what I am ...
    Mar 22, 2013 at 3:23 pm
    Mar 23, 2013 at 3:39 pm
  • Dear all, We are moving our codebase from Lucene 2.3 to 4.1. Also, we are migrating the older indices from 2.3 to 3.6 and then to 4.1. Would be glad if you can share some do's and dont's during ...
    Ramprakash RamamoorthyRamprakash Ramamoorthy
    Mar 11, 2013 at 7:38 am
    Mar 12, 2013 at 10:49 am
  • Hi Everybody, I have created a single compound index which is of size 250 Gigs. I open a single index reader to search simple boolean queries. The process is consuming lot of memory search painfully ...
    Ash nixAsh nix
    Mar 20, 2013 at 5:12 pm
    Mar 27, 2013 at 12:32 pm
  • Hello, I'm looking for an analyzer that allows performing accent insensitive search in latin languages. I'm currently using the StandardAnalyzer but it doesn't fulfill this need. Could you please ...
    Jerome BlouinJerome Blouin
    Mar 22, 2013 at 4:23 pm
    Mar 24, 2013 at 10:58 pm
  • As part of a project using Lucene I have implemented a trio of Directories roughly corresponding to the FSDirectory implementations in core. These directory implementations use the NIO2 API's in JDK7 ...
    Michael PoindexterMichael Poindexter
    Mar 17, 2013 at 2:29 am
    Mar 17, 2013 at 8:15 pm
  • Hello team, I have a query and I am explaining it as below. Objective : To split index and store, and combine it during query time Approach : Have two index writers, one will write a storedField and ...
    Ramprakash RamamoorthyRamprakash Ramamoorthy
    Mar 1, 2013 at 9:51 am
    Mar 8, 2013 at 7:38 am
  • Hi. I want indexing all documents once a day and after indexing delete old index files that indexed before a day. I think to do this, indexing all documents in new directory and replace IndexSearcher ...
    Mar 5, 2013 at 5:11 am
    Dec 3, 2013 at 1:22 pm
  • I have some custom Payload Functions that worked great up until version 4.1 and now I can't get any of them to work correctly. In fact, the existing ones -- AveragePayloadFunction, ...
    Mar 22, 2013 at 4:49 am
    Mar 28, 2013 at 2:56 am
  • Team, We are in the process of migrating our codebase from lucene 2.3(Yeah, its way older) to lucene 4.1. We had previously used WildCardTermEnum ...
    Ramprakash RamamoorthyRamprakash Ramamoorthy
    Mar 21, 2013 at 10:31 am
    Mar 26, 2013 at 5:32 am
  • I'm using MultiFieldQueryParser to parse search queries. I find that certain query strings (e.g., "/study/" without the quotes) cause MultiFieldQueryParser.parse() to throw an AssertionError, if ...
    Adam RauchAdam Rauch
    Mar 24, 2013 at 2:57 pm
    Mar 25, 2013 at 6:32 pm
  • Hi guys, On my path of migrating from 3.6.x to 4.1, I'm facing the following problem: I create a document with an IntField in it: doc.add new IntField( 'freeSeats', 5, Store.YES ) After adding to the ...
    Konstantyn SmirnovKonstantyn Smirnov
    Mar 4, 2013 at 4:09 pm
    Mar 4, 2013 at 11:25 pm
  • I believe that when Lucene indexes documents, it generates counts for a term by counting how many times the term appears in a particular document. Instead of having Lucene do the counting, I want to ...
    Sharon TamSharon Tam
    Mar 28, 2013 at 7:25 pm
    Apr 9, 2013 at 6:35 pm
  • I could use CountFacetRequest, but I don't need counting. I just want to get the list of child categories. How can I do it?
    Mar 30, 2013 at 9:24 pm
    Apr 2, 2013 at 1:13 am
  • I am getting a really weird problem. I am getting a discrepancy between what my search results are the files that are listed in my index, through their docId's These are the files that the index says ...
    Bushman, LamontBushman, Lamont
    Mar 29, 2013 at 9:23 am
    Mar 31, 2013 at 3:22 am
  • Hi Guys, I have been testing the search time improvement in Lucene 4.0 from Lucene 3.0.2 version for Wildcard Queries (with atleast say 2 chars Eg.ar*). For a 2GB size index with 4000000 docs, the ...
    Arun Kumar KArun Kumar K
    Mar 29, 2013 at 9:38 am
    Mar 29, 2013 at 11:19 am
  • Hi all, I am trying to write simple program to add documents to index. But am unable to do so. The class StandardAnalyzer is not defined. Analyzer analyzer = new ...
    Gaurav redkarGaurav redkar
    Mar 25, 2013 at 7:41 am
    Mar 26, 2013 at 4:46 am
  • Hi, Again, I will ask a Long-Time question, It has been asked for many time, but I googled a lot , but no good solution: I asked one time before, but I think the answer is not very well, SO: The ...
    Mar 14, 2013 at 3:09 am
    Mar 15, 2013 at 1:36 am
  • I'm seeing performance problems when indexing a certain set of data, and I'm looking for pointers on how to improve the situation. I've read the very helpful performance advice on the Wiki and I am ...
    Jan StetteJan Stette
    Mar 7, 2013 at 4:45 pm
    Mar 7, 2013 at 7:50 pm
  • Inspired by this presentation of DocValues: http://www.slideshare.net/lucenerevolution/willnauer-simon-doc-values-column-stride-fields-in-lucene I decided to try them out in 4.2. I created a 1M ...
    Peter KeeganPeter Keegan
    Mar 26, 2013 at 3:00 pm
    Mar 28, 2013 at 12:44 pm
  • Hello, We have documents with many numerical fields. In some search scenario, we would like to create a filter based on the sum of the values of two fields. For example, assume we have fields F1 and ...
    Wei WangWei Wang
    Mar 24, 2013 at 9:47 am
    Mar 27, 2013 at 10:46 am
  • I have a project where I need to index documents using Lucene 4.1.0. One of the fields for the stored Document is the actual text from the document(.pdf, .docx, etc.) I want to be able to highlight ...
    Bushman, LamontBushman, Lamont
    Mar 25, 2013 at 7:14 am
    Mar 26, 2013 at 8:53 am
  • Hello guys, Supposing I have one million documents, and each document has hundreds of features. For a given query, it also has hundreds of features. I want to fetch most relevant top 1000 documents ...
    Lin MaLin Ma
    Mar 15, 2013 at 5:09 pm
    Mar 17, 2013 at 2:36 am
  • Dear all, I have a request to implement an auto-suggest feature for our lucene based product. We have upgraded to Lucene 4.1 and intend to use the AnalyzingSuggester, but we cannot determine the ...
    Nils KnappmeierNils Knappmeier
    Mar 11, 2013 at 10:32 am
    Mar 11, 2013 at 12:53 pm
  • Hi Steve, Thanks for all your help here. I just tried it all again and this time I get "Cannot Open Project /Users/cbamford/projects/lucene_solr_4_1_0 contains no IntelliJ IDEA project" when I do ...
    Chris BamfordChris Bamford
    Mar 5, 2013 at 3:14 pm
    Mar 5, 2013 at 6:02 pm
  • Hello, I understand BlockJoinQuery can be used to index nested documents with some internal structure. And at indexing time, addDocuments is used to create document blocks. In case we would like to ...
    Wei WangWei Wang
    Mar 3, 2013 at 4:35 am
    Mar 5, 2013 at 4:23 pm
  • Hi, Can anyone help me understand the scoring function in the LMDirichletSimilarity class? The scoring function in LMDirichletSimilarity is shown below ...
    Mar 29, 2013 at 9:21 pm
    Apr 4, 2013 at 7:45 pm
  • I'm looking for a consultant for Lucene Solr. Our team of 3 extended OpenBravo (Java ERP) with a built-in Shopping Cart (written in JS). I'm now on the search/faceted-search part of the shopping ...
    Nick HoffmanNick Hoffman
    Mar 28, 2013 at 6:44 pm
    Mar 29, 2013 at 11:47 am
  • Hello everyone, I've seen in a couple of old presentations that DocValues will be updatable (without updating the whole document) but I cannot find anything recent on this. Is this currently possible ...
    Pablo GuerreroPablo Guerrero
    Mar 23, 2013 at 4:53 pm
    Mar 25, 2013 at 7:02 am
  • Hi all, I'm evaluating using Lucene for some data that would not be stored anywhere else, and I'm concerned about reliabilty. Having a database storing the data in addition to Lucene would be a ...
    Pablo GuerreroPablo Guerrero
    Mar 22, 2013 at 1:01 pm
    Mar 23, 2013 at 8:14 am
  • First, I decided I wasn't comfortable doing closes on the IndexReader. So, I did what I hope is better. I create a singleton SearcherManager (out-of-the-box from the 4.1 release) and do ...
    Scott SmithScott Smith
    Mar 20, 2013 at 9:49 pm
    Mar 20, 2013 at 10:19 pm
  • This is an internal code I came across in lucene today and unable to decipher it. FreqProxTermsWriterPerField.java void flush(String fieldName, FieldsConsumer consumer, final SegmentWriteState state) ...
    Ravikumar GovindarajanRavikumar Govindarajan
    Mar 20, 2013 at 12:12 pm
    Mar 20, 2013 at 1:20 pm
  • Hi chaps, Lucene 4.1.0: I notice org.apache.lucene.document.Field(String name, String value, Field.Store store, Field.Index index, Field.TermVector termVector) is marked as deprecated while its ...
    Kiwi cliveKiwi clive
    Mar 18, 2013 at 4:08 pm
    Mar 19, 2013 at 3:56 pm
  • This was intentional actually. The default was a tradeoff of "benefits" (which affect less than 5% of english vocabulary, if you read around the snowball site), versus a much more significant ...
    Robert MuirRobert Muir
    Mar 15, 2013 at 3:30 pm
    Mar 16, 2013 at 12:29 pm
  • Hi everyone, I am trying to port forward to 4.2 some Lucene 3.2-era code that uses the ASCIIFoldingFilter. The token stream handling has changed significantly since them, and I cannot figure out what ...
    Steven SchlanskerSteven Schlansker
    Mar 15, 2013 at 6:18 pm
    Mar 15, 2013 at 6:36 pm
  • Hello, could someone give me an example of how to conduct a search in an already built index with Lucene 4 mode phrase query using a specific analyzer. I tested here with the phrase the search query ...
    Arlei Ferreira Farnetani JuniorArlei Ferreira Farnetani Junior
    Mar 12, 2013 at 12:46 am
    Mar 12, 2013 at 5:34 pm
  • I am having the same experience as issue LUCENE-4713. Without modifying the source code myself, how do I overcome this? I am in the middle of migrating from 3.6 to 4.2 and I am unable to index or ...
    Darren HoffmanDarren Hoffman
    Mar 12, 2013 at 8:06 am
    Mar 12, 2013 at 9:41 am
  • Hello. I'm getting an OOME with a heap size of 512MB while trying to open an IndexReader over a 70GB index (FSDirectory). It didn't happen before when the index was smaller, and it doesn't happen if ...
    Gili NachumGili Nachum
    Mar 11, 2013 at 9:08 am
    Mar 11, 2013 at 3:57 pm
  • DiskDocValues is a codec (or part of a codec, apparenlty) for accessing the DocValues from disk, with minimal RAM usage for things like offsets. Lucene42Codec alternatively puts all of DocValues in ...
    David Smiley (@MITRE.org)David Smiley (@MITRE.org)
    Mar 8, 2013 at 7:02 pm
    Mar 8, 2013 at 11:42 pm
  • Excuse the blatant marketing, though for the benefit of the community... <http://programs.lucidworks.com/Solr4032013_signuppage.html Join me tomorrow/today (March 27) for a webinar on what's new and ...
    Erik HatcherErik Hatcher
    Mar 27, 2013 at 1:24 am
    Mar 30, 2013 at 8:56 pm
  • Hi, I've done a few experiments in Lucene 4.2 with several different query types: TermQuery TermRangeQuery NumericRangeQuery WildcardQuery PrefixQuery MatchAllDocsQuery I think I more or less ...
    Paul BellPaul Bell
    Mar 29, 2013 at 8:06 pm
    Mar 29, 2013 at 8:12 pm
  • This is probably a pretty general inquiry, but I'm just exploring this as an option at the moment. It seems that Lucene 4 adds some freedom to define how data is actually written to underlying ...
    Vitaly FunsteinVitaly Funstein
    Mar 26, 2013 at 7:01 pm
    Mar 26, 2013 at 8:05 pm
  • Hi all, My use case is very simple, given a string I would like to suggest all the possible urls that contain that string (given the limitations of the tokenizer and suggester). So far I have created ...
    Andres GarciaAndres Garcia
    Mar 26, 2013 at 7:45 am
    Mar 26, 2013 at 11:44 am
  • Team, We are migrating from 2.3 to 4.1, and we have implemented a method which does this *BitSet searchTermBits = searchQueryFilter.bits(reader); *searchQueryFilter is of type Filter and reader is an ...
    Ramprakash RamamoorthyRamprakash Ramamoorthy
    Mar 26, 2013 at 9:38 am
    Mar 26, 2013 at 11:07 am
  • Hey there, I am trying to get up some working example with the AnalyzingSuggester and stopwords - like it is done in the corresponding unit test. I thought, I could build the AnalyzingSuggester from ...
    Alexander ReelsenAlexander Reelsen
    Mar 24, 2013 at 7:51 pm
    Mar 24, 2013 at 8:41 pm
Group Navigation
period‹ prev | Mar 2013 | next ›
Group Overview
groupjava-user @

90 users for March 2013

Uwe Schindler: 23 posts Michael McCandless: 22 posts Simon Willnauer: 19 posts Ramprakash Ramamoorthy: 12 posts Ian Lea: 9 posts Scott Smith: 9 posts Steve Rowe: 9 posts Adrien Grand: 8 posts Ash nix: 8 posts Carsten Schnober: 8 posts Bratislav Stojanovic: 7 posts Dizh: 7 posts Erick Erickson: 6 posts Jack Krupansky: 6 posts Paul Bell: 6 posts Wei Wang: 6 posts Bushman, Lamont: 5 posts Robert Muir: 5 posts Arlei Ferreira Farnetani Junior: 4 posts David Smiley (@MITRE.org): 4 posts
show more