48 discussions - 220 posts

  • Hello! I need to perform an experiment of loading the entire index in RAM and seeing how the search performance changes. My index has TermVectors with payload and position info, StoredFields, and ...
    Igor Shalyminov
    Oct 9, 2013 at 10:18 pm
    Oct 25, 2013 at 1:59 pm
  • Hello, I've observed that when using PostingsHighlighter in Lucene 4.4 that some of the responsive documents in TopDocs will have zero matches in the associated array of Passage objects. I.e., in the ...
    Jon Stewart
    Oct 14, 2013 at 9:12 pm
    Oct 15, 2013 at 3:05 pm
  • Hi, I am running solr 4.4 & one of my collections seems to have a corrupted index... I tried doing - java -cp lucene-core-4.4.0.jar -ea:org.apache.lucene... org.apache.lucene.index.CheckIndex ...
    Oct 23, 2013 at 1:01 pm
    Oct 24, 2013 at 2:44 am
  • I have created strings like the below &&searchtext +sampletext and when I try to search the following using *&&** or *+** it does not give any result. I am using QueryParser.escape(String s) method ...
    Oct 20, 2013 at 4:47 pm
    Oct 21, 2013 at 1:00 am
  • I've been running some tests comparing storing large fields (documents, say 100K .. 10M) as files vs. storing them in Lucene as stored fields. Initial results seem to indicate storing them externally ...
    Michael Sokolov
    Oct 11, 2013 at 5:03 pm
    Oct 18, 2013 at 12:01 pm
  • Consider a Lucene index consisting of 10m documents with a total disk footprint of 3G. Consider an application that treats this index as read-only, and runs very complex queries over it. Queries with ...
    Benson Margulies
    Oct 8, 2013 at 9:46 pm
    Oct 10, 2013 at 6:52 pm
  • UNOFFICIAL Hi everyone, I have a question about how to retrieve the values in a NumericDocValuesField. I understand how to do this in situations where you have an AtomicReaderContext available ...
    Stephen GRAY
    Oct 23, 2013 at 7:30 am
    Oct 24, 2013 at 11:21 pm
  • Hi there, I'm looking for pointers, suggestions on how to approach this in Lucene 4.5. Say I am creating an index using a sequence of addDocument() calls and end up with segments that each contain ...
    Arvind Kalyan
    Oct 23, 2013 at 5:16 pm
    Oct 24, 2013 at 9:09 am
  • You have to call updateDocument with the unique key of the document to update. The unique key must be a separate, indexed, not necessarily stored key. addDocument just adds a new instance of the ...
    Uwe Schindler
    Oct 1, 2013 at 7:11 am
    Oct 2, 2013 at 2:36 pm
  • Hi, I rewrote lemmatizer project LemmaGen (http://lemmatise.ijs.si/) to java. Originally it's written in C#. Lemmagen project uses rules to lemmatize word. Algorithm is described here ...
    Michal Hlavac
    Oct 23, 2013 at 3:18 pm
    Dec 2, 2013 at 2:34 pm
  • Hi, I have a problem doing wildcard search on file path fields. I have a field "filePath" where I store the complete path of files. I have used StringField to store the field ("I assume by default ...
    Nischal reddy
    Oct 14, 2013 at 8:13 am
    Oct 14, 2013 at 7:41 pm
  • Hi, I have implemented Lucene to search for a single keyword across multiple fields and it works great. I did this by concatenating all the fields into a "contents" field and searching against this ...
    Raghavendra K Rao
    Oct 10, 2013 at 3:55 pm
    Oct 13, 2013 at 10:34 pm
  • Hi, how can I implement synonym search for all languages? As far as I know, WordNet has only English support. Is there any other resource we can use to get support for all languages? -- Thanks and Regards ...
    Oct 7, 2013 at 9:34 am
    Oct 9, 2013 at 2:07 pm
  • Hello, We would like to index some documents. Each field of a document may have multiple values. And for each (field,value) pair there are some associated values. These associated values are just for ...
    Alice Wong
    Oct 2, 2013 at 11:12 pm
    Oct 4, 2013 at 3:09 pm
  • Hi, I am using the regular QueryParser to form a PhraseQuery. It works fine, but when it consists of a hyphen, it gets removed, hence resulting in unexpected results. Note: I am NOT using the ...
    Raghavendra K Rao
    Oct 14, 2013 at 10:15 pm
    Oct 17, 2013 at 10:08 pm
  • Hi! I'm trying to look at index contents by setting SimpleTextCodec into IndexWriterConfig. The problem is, after indexing there are only .cfs, .cfe, .si, segments.gen and segments_h files, and the ...
    Igor Shalyminov
    Oct 15, 2013 at 1:45 pm
    Oct 15, 2013 at 4:11 pm
  • In our search application, queries like test && usage do not return correct results, but test AND usage works fine. So queries with "&&" don't work but queries with "AND" do. We are using the default queryparser ...
    Devi pulaparti
    Oct 10, 2013 at 8:28 am
    Oct 10, 2013 at 10:47 am
  • Dear, I am indexing an integer field like this ----- Document doc = new Document(); FieldType fieldType = new FieldType(); fieldType.setIndexed(true); fieldType.setStored(true) ...
    Oct 7, 2013 at 3:24 pm
    Oct 7, 2013 at 9:22 pm
  • Hi, I am trying to do a multiphrase query in Lucene 4.3. It works perfectly for all scenarios except the one below. When I try to search for a phrase which is preceded by any punctuation, it is ...
    Oct 3, 2013 at 2:08 pm
    Oct 3, 2013 at 4:07 pm
  • Are there any best practices for constructing Filters to search efficiently? From my non-exhaustive experiments I cannot intuit how to construct my filters to achieve best performance. I have an ...
    James Clarke
    Oct 10, 2013 at 6:02 pm
    Oct 17, 2013 at 9:06 am
  • Hi, I have a search application using Lucene 4.4.0 with various BinaryDocValues and SortedSetDocValues. We use MMapDirectory to help keep the Java heap small / GC pause times short and instead rely ...
    Steven Schlansker
    Oct 2, 2013 at 6:12 pm
    Oct 2, 2013 at 10:42 pm
  • Hi Benson, forceMerge[1], formerly known as optimize, takes a parameter to configure how many segments should remain in the index. Regarding multi-core usage, if your query load is high enough to use ...
    Adrien Grand
    Oct 1, 2013 at 11:10 am
    Oct 1, 2013 at 8:07 pm
  • Hi there, If I put Lucene segments on tmpfs and use MMapDirectory to access them, would the kernel be so dumb to load the files from tmpfs to another copy of file system cache before map it to the ...
    Oct 22, 2013 at 6:27 am
    Oct 22, 2013 at 7:28 am
  • Hello, I am quite confused about the Lucene NRT feature. And there are not many examples out there. My understanding is we can create a DirectoryReader from an IndexWriter. Whenever IndexWriter ...
    Alice Wong
    Oct 17, 2013 at 5:12 am
    Oct 17, 2013 at 6:01 pm
  • Hi All, I'm trying to create an index with payloads using Lucene 4.5 but the payloads don't seem to be getting into the index. I looked into the code and it looks like storePayloads argument to ...
    Kyle Judson
    Oct 15, 2013 at 12:14 pm
    Oct 15, 2013 at 1:37 pm
  • Team, We have our app using lucene 4.1. Docs keep getting indexed and we close the index by 00.00 hrs every day and open a new one for the next calendar day, however in case of an abrupt ...
    Ramprakash Ramamoorthy
    Oct 3, 2013 at 12:40 pm
    Oct 3, 2013 at 1:31 pm
  • Hello, I am trying to weigh some ideas for implementing paged search functionality in our system, which has these basic requirements: - Using Solr is not an option (at the moment). - Any Lucene 4.x ...
    Vitaly Funstein
    Oct 11, 2013 at 2:01 am
    Nov 23, 2013 at 12:10 am
  • Hello, I'm new to lucene and I am having some trouble figuring out the right way to use a SearcherTaxonomyManager for NRT faceted search. Assuming I set up the STM with a reopen thread: // Index ...
    Joe Eckard
    Oct 26, 2013 at 3:45 pm
    Oct 26, 2013 at 5:36 pm
  • Hi, In my current implementation of Lucene 4.3 where there are millions of indexed records, I do a regular search() and get the topDocs.totalHits as the count of results. As part of this, I store all ...
    Raghavendra K Rao
    Oct 17, 2013 at 10:17 pm
    Oct 18, 2013 at 12:49 am
  • Hello, Have a question about default PhraseQuery boost processing. The Query.setBoost() <http://lucene.apache.org/core/4_4_0/core/org/apache/lucene/search/Query.html#setBoost(float)> says: / Sets the ...
    Oct 16, 2013 at 6:04 am
    Oct 17, 2013 at 11:28 am
  • Hi Team, I have a requirement where I have to display the sentences of a valid document if the keyword (input string) is found in that document. I am wondering whether a parent-child relation will work. DocBean ...
    Avni Sompura
    Oct 17, 2013 at 7:40 am
    Oct 17, 2013 at 9:02 am
  • Hi all, I'd like to know in a bit more detail what is happening behind the scenes in the case of prepareCommit vs. commit. Also, I read that if an indexing request comes while a commit is requested, it ...
    Phani Chaitanya
    Oct 16, 2013 at 4:27 pm
    Oct 16, 2013 at 6:18 pm
  • Can anyone tell me if a search based on a ConstantScoreQuery should return the results in the order that the documents were added to the index? I am not sorting the results and this works for one ...
    Darren Hoffman
    Oct 15, 2013 at 1:43 am
    Oct 15, 2013 at 1:49 am
  • UNOFFICIAL Hi everyone, I'd appreciate some help with a problem I'm having. I have a collection of documents in my index. Each doc contains an IntField with a value in it. What I want is to find out ...
    Stephen GRAY
    Oct 14, 2013 at 1:04 am
    Oct 14, 2013 at 11:12 am
  • Using Lucene 3. I know Lucene supports multiple occurrences of a field, and if one searches on that field, all fields are checked for hits. One question I have is if there is a performance difference ...
    Earl Hood
    Oct 8, 2013 at 2:04 am
    Oct 11, 2013 at 11:47 am
  • Is there some advice around about when it's appropriate to create an Analyzer class, as opposed to just Tokenizer and TokenFilter classes? The advantage of the constituent elements is that they allow ...
    Benson Margulies
    Oct 8, 2013 at 2:31 pm
    Oct 8, 2013 at 4:05 pm
  • Is there an optimal way to access many document TermVectors (in the same chunk) consecutively when using the LZ4 termvector compression? I'm curious to know whether all TermVectors in a single ...
    Rose, Stuart J
    Oct 7, 2013 at 7:31 pm
    Oct 8, 2013 at 7:51 am
  • I'm not aware of a lucene rather than Solr or whatever tutorial. A search for something like "lucene sharding" will get hits. Why don't you want to use Solr or Katta or similar? They've already done ...
    Ian Lea
    Oct 1, 2013 at 10:16 am
    Oct 1, 2013 at 11:34 am
  • Hi, Currently we merge 2 indexes using iw.addIndexes(idxReaders), where the same call will be made in batches of 10 readers. Our requirement is to make this addIndexes call consistent. That is, during ...
    Ravikumar Govindarajan
    Oct 25, 2013 at 5:05 pm
    Oct 25, 2013 at 5:05 pm
  • October 2013, Apache Lucene™ 4.5.1 available The Lucene PMC is pleased to announce the release of Apache Lucene 4.5.1 Apache Lucene is a high-performance, full-featured text search engine library ...
    Mark Miller
    Oct 24, 2013 at 4:15 pm
    Oct 24, 2013 at 4:15 pm
  • Hi folks, If anyone reading this is interested in how to use the spatial module in Lucene, you might be interested in a recent two-part blog post by Steven Citron-Pousty on the OpenShift blog ...
    Smiley, David W.
    Oct 23, 2013 at 5:33 pm
    Oct 23, 2013 at 5:33 pm
  • Hello, My lucene index contains 46 segments with a total of 4M docs. Lately, while running queries I started getting seldom exceptions from this index: java.lang.ArrayIndexOutOfBoundsException at ...
    Manuel Le Normand
    Oct 22, 2013 at 11:37 am
    Oct 22, 2013 at 11:37 am
  • Posted something similar some time ago, but didn't get any responses, so I thought I'd try again with more details. We allow end-user queries that have our own proprietary query language, which we ...
    Marcos Juarez Lopez
    Oct 16, 2013 at 5:11 pm
    Oct 16, 2013 at 5:11 pm
  • Hello! I'm trying to realize how I can improve search performance for my task. The index is as follows: - 29 segments, each of about 600 MB; - in the complete setup, there's a thread for each segment ...
    Igor Shalyminov
    Oct 14, 2013 at 4:16 pm
    Oct 14, 2013 at 4:16 pm
  • Hi, if someone removes some of the segments in my Lucene index from the file system, how can I find out which documents were deleted?
    Oct 11, 2013 at 11:20 am
    Oct 11, 2013 at 11:20 am
  • October 2013, Apache Lucene™ 4.5 available The Lucene PMC is pleased to announce the release of Apache Lucene 4.5 Apache Lucene is a high-performance, full-featured text search engine library written ...
    Adrien Grand
    Oct 5, 2013 at 9:46 am
    Oct 5, 2013 at 9:46 am
  • Hello! I need to access token position and payload info during the search result page building. I need to do this for 10 documents max, so retrieving TermVectors is totally OK for me. Say, I retrieve ...
    Igor Shalyminov
    Oct 4, 2013 at 6:14 pm
    Oct 4, 2013 at 6:14 pm
  • Hi folks, Here's a small tool that I'm releasing as open source. I hope some of you might find it useful: https://github.com/joelb-git/lqt Lucene Query Tool (lqt) is a small utility for executing and ...
    Joel Barry
    Oct 3, 2013 at 6:36 pm
    Oct 3, 2013 at 6:36 pm
Group Overview
group: java-user

53 users for October 2013

  • Michael McCandless: 31 posts
  • Ian Lea: 21 posts
  • Igor Shalyminov: 14 posts
  • VIGNESH S: 10 posts
  • Raghavendra K Rao: 9 posts
  • Benson Margulies: 9 posts
  • Michael Sokolov: 9 posts
  • Adrien Grand: 8 posts
  • Desidero: 7 posts
  • Chris: 6 posts
  • Robert Muir: 6 posts
  • Alice Wong: 5 posts
  • Jack Krupansky: 5 posts
  • Stephen GRAY: 5 posts
  • Uwe Schindler: 5 posts
  • Arvind Kalyan: 4 posts
  • Duke DAI: 4 posts
  • Gudiseashok: 4 posts
  • Jon Stewart: 4 posts
  • Saisantoshi: 4 posts