Search Discussions

42 discussions - 161 posts

  • Hi I have a situation whereby the following filter query works with Lucene 3.8.1 ChainedFilter: [QueryWrapperFilter(+receivedate:[201407010000 TO 201407310000] +cat:email) ...
    Jul 16, 2014 at 9:12 am
    Jul 16, 2014 at 10:37 am
  • Hi everybody, I found a problem confused me when I tested the mmap feature in lucene. I tested to read a file size of 800M by mmap method like below: RandomAccessFile raf = new RandomAccessFile(new ...
    Jul 14, 2014 at 9:58 am
    Jul 17, 2014 at 2:15 am
  • The UAX29URLEmailAnalyzer analyzer in Lucene 4.4 is not working as I expected. Is this a bug in the analyzer or is this working as designed? If I use the UAX29URLEmailAnalyzer, it tokenizes the ...
    Jul 23, 2014 at 5:49 pm
    Jul 24, 2014 at 2:55 pm
  • Hello list, We have a fairly large Lucene database for a 30+ million post forum. Users post and search for all kinds of things. To make sure users don't have to type exact matches, we combine a ...
    Arjen van der MeijdenArjen van der Meijden
    Jul 6, 2014 at 6:48 pm
    Jul 10, 2014 at 9:08 pm
  • Hi, I wanted to know of the best approach to follow if a few fields in my indexed documents are changing at run time (after index and before or during search), but a majority of them are created at ...
    Sandeep KhanzodeSandeep Khanzode
    Jul 1, 2014 at 12:32 pm
    Jul 8, 2014 at 4:13 pm
  • We're doing some housekeeping of the moderators of this list, and looking for any new folks that would like to volunteer. (we currently have 3 active moderators, 1-2 additional mods would be helpful ...
    Chris HostetterChris Hostetter
    Jul 23, 2014 at 5:03 pm
    Jul 29, 2014 at 4:29 pm
  • Hi, below is an exception I get from one Solr core. According to https://issues.apache.org/jira/browse/LUCENE-5617 the check that leads to the exception was introduced recently. Two things are worth ...
    Harald KirschHarald Kirsch
    Jul 23, 2014 at 10:04 am
    Jul 23, 2014 at 12:51 pm
  • I'm trying to solve the following problem... I have 3 documents that contain the following contents: 1: "George Washington Carver blah blah blah." 2: "George Washington blah blah blah." 3: "George ...
    Michael RyanMichael Ryan
    Jul 11, 2014 at 2:20 pm
    Jul 15, 2014 at 12:24 pm
  • Currently Drillsideways provides following method: public DrillSidewaysResult search(DrillDownQuery query, Collector hitCollector); Can same class provide following method ? public ...
    Jigar ShahJigar Shah
    Jul 8, 2014 at 11:15 am
    Jul 14, 2014 at 11:43 pm
  • Hi I am using NRT search with the SearcherManager class. When the user elects to delete some documents, writer.deleteDocuments(terms) is called. The problem is that deletes are not immediately ...
    Jul 10, 2014 at 8:56 pm
    Jul 14, 2014 at 3:20 pm
  • Hi, I created an Analyzer with a PorterStemFilter, and I’m searching some test documents. Normal plurals work; searching for “zebra” finds text with “zebras”. But searching for “goose” doesn’t find ...
    Robert NikanderRobert Nikander
    Jul 29, 2014 at 2:04 pm
    Jul 29, 2014 at 2:34 pm
  • Hello I am using more like this query .But size of Score Docs i am getting is 0 I found that it In Query Searchedquery = mlt.like(reader1, "criteria"); query object contain following value boost 1.0 ...
    Rajendra RaoRajendra Rao
    Jul 18, 2014 at 10:51 am
    Jul 21, 2014 at 2:27 pm
  • Hi all, I know IndexSearcher is thread safe. But IndexSearcher.doc is not thread safe maybe... I try to below -------------------------------------------- First, I extract docID at index directory ...
    Jul 9, 2014 at 5:00 am
    Jul 10, 2014 at 7:57 am
  • Hello. How do you get the terms for a TermsEnum of an IntField coded with precisionStc = Integer.Max that you get with MultiFields.getTerms(reader, intField).iterator(null) ? I had mitigated success ...
    Olivier BindaOlivier Binda
    Jul 30, 2014 at 6:21 am
    Jul 30, 2014 at 1:52 pm
  • Hi, I am working on my doctoral dissertation in CS and am trying to use Lucene to do custom similarity measures, namely MMM (mixed, min and Max), Paice and p-norm and then compare those results to ...
    Erin ColvinErin Colvin
    Jul 28, 2014 at 12:56 am
    Jul 28, 2014 at 10:26 pm
  • Hi I have created index with 1 field with simple message like (hello - world) now when I create for search like +body:" \"hello world\"" & I should not get any result because I have wrapped my search ...
    Jul 18, 2014 at 5:52 am
    Jul 18, 2014 at 9:16 pm
  • Hi, In the system which I develop I have to store many query objects in memory. The system also receives documents. For each document MemoryIndex is instantiated. I execute all stored queries on this ...
    Pawel RogPawel Rog
    Jul 2, 2014 at 7:51 pm
    Jul 3, 2014 at 6:06 am
  • We currently migrate one project to Lucene 4 and noticed that the method IndexSearcher.setDefaultFieldSortScoring(..) disappeared in Lucene 4.0. We can't find something about this in the migration ...
    Christian ReuschlingChristian Reuschling
    Jul 18, 2014 at 2:18 pm
    Jul 22, 2014 at 4:18 pm
  • Hi All I am matching and ranking two set of Docs using apache lucene and I passes page hits 1000. But in the result it shows 200 only why? It means that rest 800 are not matched and if so then what ...
    Priyanka TufchiPriyanka Tufchi
    Jul 18, 2014 at 1:22 pm
    Jul 21, 2014 at 9:52 am
  • Hi, I want to implement a query like phrase query with slop 0, but I can allow one term mismatch. For example, the text is "A B C D E" I want to match this text with the query "A B C X E". X ...
    Yonghui ZhaoYonghui Zhao
    Jul 17, 2014 at 7:23 am
    Jul 17, 2014 at 1:05 pm
  • Hi everyone, I created an index with three documents, ran a query, and noted the scores. Then I deleted one of the documents using IndexWriter.tryDeleteDocument, and then re-added the exact same ...
    Jul 16, 2014 at 6:08 pm
    Jul 16, 2014 at 9:12 pm
  • I found my first search on new created IndexReader is slow , but after i made a search , it will be much faster i'd like to do such "warn up" in the back end of my application rather then wait for ...
    Jul 11, 2014 at 4:17 pm
    Jul 14, 2014 at 4:35 pm
  • I'm building a new system where I will have several pdf files. The content you will have to have in my indexes are: 1. Name 2. No. of Pages 3. Data File 4. Archive When I run the search by the ...
    Arlei Ferreira Farnetani JuniorArlei Ferreira Farnetani Junior
    Jul 6, 2014 at 2:28 pm
    Jul 6, 2014 at 10:13 pm
  • Hi, Could please explain me how to determine the tf-idf score for bigrams. My program is able to index and search bigrams correctly, but it does not calculate the tf-idf for bigrams. If someone can, ...
    Manjula WijewickremaManjula Wijewickrema
    Jul 2, 2014 at 7:45 am
    Jul 3, 2014 at 4:06 am
  • I am building my code using Lucene 4.7.1 and Hadoop 2.4.0 . Here is what I am trying to do Create Index 1. Build index in RAMDirectory based on data stored on HDFS . 2. Once built , copy the index ...
    Varun sharmaVarun sharma
    Jul 15, 2014 at 6:34 am
    Aug 20, 2014 at 2:30 pm
  • Hello all, I am using Lucene 4.9 and the index size is 7 GB. Search is faster, it takes 1 second to return the results (50 hits). I loop through the result and fetching the stored data for all and it ...
    Jul 31, 2014 at 1:37 am
    Aug 4, 2014 at 11:09 am
  • Hi We used apache lucene for ranking documents .We have documents with its score .Now our requirements is to set a threshold so that we can show that documents has matched ,no matched ,or partial ...
    Rajendra RaoRajendra Rao
    Jul 12, 2014 at 7:21 am
    Jul 12, 2014 at 5:02 pm
  • Hi, I am using Lucene 4.7.2 and my primary use case for Lucene is to do three things: (a) search, (b) sort by a number of fields for the search results, and (c) facet on probably an equal number of ...
    Sandeep KhanzodeSandeep Khanzode
    Jul 8, 2014 at 6:50 am
    Jul 10, 2014 at 11:47 am
  • nowadays , i've been trying every way to improve the performance of indexing , IndexWriter's close operation is really costly , and the Lucene's doc sugguest to re-use IndexWriter instance , i did it ...
    Jul 8, 2014 at 12:29 pm
    Jul 8, 2014 at 1:48 pm
  • Hi, For Lucene 4.7.2 Facets, once we invoke FacetCollector and get the topNChildren into FacetResult, is there any mechanism that for a particular search result, I could get the docIds corresponding ...
    Sandeep KhanzodeSandeep Khanzode
    Jul 7, 2014 at 11:11 am
    Jul 7, 2014 at 11:34 am
  • Hello team, We are using lucen-core-2.9.1.jar for indexing and searching for text fields We are having problem when our string has digits as prefix or suffix of _/- Ex: I have texts ICMP6-a-b ...
    Smitha Kuldeep (smtt)Smitha Kuldeep (smtt)
    Jul 4, 2014 at 11:53 am
    Jul 6, 2014 at 1:42 pm
  • I am looking to track down an issue in 2.9.2 where during highlighting, certain data may cause rapid memory usage and OOM exception in java: ------- java.lang.OutOfMemoryError: Java heap space at ...
    Baldwin, DavidBaldwin, David
    Jul 30, 2014 at 5:36 pm
    Jul 30, 2014 at 5:36 pm
  • Hi All During search, i find it difficult to handle every multi-valued field with different analyzers. So i believe indexing multi-valued field with hash value may solve the problem of searching with ...
    Kumaran RamasubramanianKumaran Ramasubramanian
    Jul 30, 2014 at 12:55 pm
    Jul 30, 2014 at 12:55 pm
  • Hi list, can anyone give some hints about removing duplicates from a MultiPhraseQuery? I have the list with: List<Term[] termarray = (MultiPhraseQuery) myquery).getTermArrays(); But the lucene ...
    Bernd FehlingBernd Fehling
    Jul 29, 2014 at 7:59 am
    Jul 29, 2014 at 7:59 am
  • Test west
    West suhanicWest suhanic
    Jul 24, 2014 at 6:05 pm
    Jul 24, 2014 at 6:05 pm
  • Which is the efficient way to create faster searches? 1. Single large index (or) 2. Multiple small indexes (opened with SearchManager, multireaders) When creating the IndexSearcher(multireader), how ...
    Umashanker, SrividhyaUmashanker, Srividhya
    Jul 23, 2014 at 4:51 pm
    Jul 23, 2014 at 4:51 pm
  • Can you post an example which demonstrates the problem? It's also Hello: I solved the problem. My facet search code, which is a separate program from the index code, did not call setHierarchical ...
    West suhanicWest suhanic
    Jul 14, 2014 at 10:07 pm
    Jul 14, 2014 at 10:07 pm
  • Hello, Can anyone help me with an example of using PostingsHighlighter in Lucene 4.7. I am trying to modify SearchFiles.java (which can be found in the demo directory) so that it would display not ...
    Natalia ConnollyNatalia Connolly
    Jul 9, 2014 at 4:59 pm
    Jul 9, 2014 at 4:59 pm
  • Hi all, I am trying to figure out how to easily remove or add a keyword from a document's index (or equivalently, decrease/increase that keyword's frequency in the document). I know Lucene allows you ...
    Allen KneserAllen Kneser
    Jul 8, 2014 at 6:17 pm
    Jul 8, 2014 at 6:17 pm
  • Hi, I tried to index bigrams from a documhe system gave and the system gave me the following output with the frequencies of the bigrams(output 1): array size:15 array terms are:{contents: /1, assist ...
    Manjula WijewickremaManjula Wijewickrema
    Jul 8, 2014 at 4:31 am
    Jul 8, 2014 at 4:31 am
  • I am trying to understand why I am seeing very small segment sizes during indexing. I am using elasticsearch and one node sees heavy merge activity. After enabling info stream logs it seems that the ...
    Kireet ReddyKireet Reddy
    Jul 6, 2014 at 6:47 pm
    Jul 6, 2014 at 6:47 pm
  • Hi, Is there any way to pre-build rewritten queries and cache them somewhere. When I have a set of queries which is used very frequently I would get significant boost (10-20% of CPU wasted) when I ...
    Pawel RogPawel Rog
    Jul 1, 2014 at 12:59 pm
    Jul 1, 2014 at 12:59 pm
Group Navigation
period‹ prev | Jul 2014 | next ›
Group Overview
groupjava-user @

59 users for July 2014

Jamie: 11 posts Uwe Schindler: 10 posts Ian Lea: 7 posts Michael McCandless: 7 posts Chris Hostetter: 6 posts Robert Muir: 6 posts Wangzhijiang999: 6 posts Milind: 5 posts Rajendra Rao: 5 posts Steve Rowe: 5 posts Jigar Shah: 4 posts Pawel Rog: 4 posts Sandeep Khanzode: 4 posts Tri Cao: 4 posts Arjen van der Meijden: 3 posts Erick Erickson: 3 posts Ganesh: 3 posts Harald Kirsch: 3 posts Manjula Wijewickrema: 3 posts Michael Ryan: 3 posts
show more