97 discussions - 478 posts

  • Hi, I'm having heap memory issues when I run Lucene queries that sort by a string field. Such queries seem to load a lot of data into the heap. Moreover, Lucene seems to hold on to references ...
    Dec 7, 2009 at 9:27 pm
    Dec 17, 2009 at 10:34 am
  • Hi, guys, I'm implementing a search engine based on Lucene for Chinese, so I want to support pinyin search as Google China does. E.g. “中国” means China in English; this word's pinyin input is ...
    Weiwei Wang
    Dec 15, 2009 at 1:26 pm
    Dec 18, 2009 at 1:51 am
  • Hi, I need to add a query operator '!' such that when it precedes a word or a phrase in the query, that term will contribute twice its weight if it is positioned in an even offset of the document. ...
    Elias Khsheibun
    Dec 19, 2009 at 1:07 pm
    Dec 21, 2009 at 6:52 pm
  • Hi, all, I designed an FTP search engine based on Lucene. I made a few modifications to the StandardTokenizer. My problem is: C++ is tokenized as c by StandardTokenizer and I want to recover it from ...
    Weiwei Wang
    Dec 11, 2009 at 5:31 am
    Dec 13, 2009 at 1:46 pm
  • I have Lucene 2.3.1 code and indexes deployed in production in a distributed system and would like to bring everything up to date with 3.0.0 via 2.9.1. Here's my migration plan: 1. Add an index writer ...
    Rob Staveley (Tom)
    Dec 9, 2009 at 12:14 pm
    Dec 11, 2009 at 8:08 am
  • Hello all, We are facing serious issues related to sorting in our production environment. I know this issue has been discussed in this group. I am using v2.9.1. I have multiple shards and I need to ...
    Dec 17, 2009 at 6:48 am
    Dec 18, 2009 at 1:40 pm
  • Can someone please point me in the right direction? We are creating an application that needs to be able to search on C++ and get back docs that have C++ in them. The StandardAnalyzer does not seem to ...
    Dec 11, 2009 at 5:10 pm
    Dec 25, 2009 at 12:40 am
  • Hello all, I've built a tokenizer which uses the new tokenizer API (the one with the Decorator Pattern). I built a new AttributeReader called GlossAttribute and GlossAttributeImpl. I packaged it into ...
    Ahmed El-dawy
    Dec 27, 2009 at 8:50 pm
    Jan 4, 2010 at 9:44 pm
  • Hello guys, We have a dilemma on a few of our Lucene machines. We have a Tomcat running our servlets for searching and indexing on each of these machines. It's a live index where documents are being ...
    Siraj Haider
    Dec 22, 2009 at 10:20 pm
    Dec 23, 2009 at 2:56 pm
  • I have a situation where I might have 1000 different types of Lucene Documents each with 10 or so fields with different names that get indexed. I am wondering if this is bad to do within Lucene. I ...
    Jason Tesser
    Dec 30, 2009 at 3:19 pm
    Dec 31, 2009 at 11:10 am
  • I want my search to treat 'No. 1' and 'No.1' the same, because in our context it's one token. I want 'No. 1' to become 'No.1'. I need to do this before tokenizing because the tokenizer would split one ...
    Paul Taylor
    Dec 7, 2009 at 8:48 pm
    Dec 15, 2009 at 2:22 pm
  • I'm having a problem with 'searchWithFilter' on Lucene 2.9.1. The Filter wraps a simple BitSet. When doing a 'MatchAllDocs' query with this filter, I get only a subset of the expected results, even ...
    Peter Keegan
    Dec 4, 2009 at 3:33 pm
    Dec 4, 2009 at 6:30 pm
  • Hello all, I'm looking for software that can search a computer (and Windows network drives) for all files and the contents of files, based on an indexing method. I have a few questions about ...
    Dec 2, 2009 at 2:51 pm
    Dec 3, 2009 at 9:49 am
  • Hi Guys, Could you give me advice on how to handle 2.4 indexes that contain compressed data with Lucene 3.0? Our case is the following: we have code like this: Field.Store fieldStored = ...
    Ivan Vasilev
    Dec 28, 2009 at 6:13 pm
    Dec 29, 2009 at 12:16 pm
  • I want to handle the first term of a phrase query as a PayloadTermQuery and give the others just normal handling ... How can I do that? ...
    Elias Khsheibun
    Dec 21, 2009 at 5:52 pm
    Dec 22, 2009 at 2:23 pm
  • Hi, I am trying to expand user queries to figure out potential document categories implied in the query. I wanted to know what was the best way to figure out the document category that is the most ...
    Dec 15, 2009 at 4:24 am
    Dec 21, 2009 at 1:30 pm
  • Hi, all, I need to update my index every day to keep it current. Currently I have two copies of the index: one is used to provide the search service, the other is used for updating. Once the ...
    Weiwei Wang
    Dec 12, 2009 at 12:25 pm
    Dec 13, 2009 at 11:31 am
  • Hi, I have some indices where I need to get results based on a fixed list of numbers (not a range). Let's say I have a field named "CategoryID" and I now need all results where "CategoryID" is 1,3 or ...
    comparis.ch - Roman Baeriswyl
    Dec 10, 2009 at 3:35 pm
    Dec 10, 2009 at 6:51 pm
  • Hi, I'm using Lucene 2.9.1 patched with http://issues.apache.org/jira/browse/LUCENE-1260 For some special reason I need to find all documents which contain at least 1 term in a certain field. This ...
    Benjamin Heilbrunn
    Dec 3, 2009 at 4:46 pm
    Dec 4, 2009 at 1:54 pm
  • Hi, As per title... is it possible to store an image using Lucene? And if it's possible... how can I do that? Thanks ...
    Dec 3, 2009 at 2:32 am
    Dec 3, 2009 at 7:01 am
  • After updating to 2.9.x or 3.0, highlighter doesn't work on wildcard queries like "abc*". I thought that it would be because of scoring, so I also set myIndexSearcher.setDefaultFieldSortScoring(true, ...
    Mohsen Saboorian
    Dec 30, 2009 at 4:47 pm
    Jan 11, 2010 at 10:46 am
  • Hi, there, first of all, thanks for the awesome work! We have migrated to Lucene 3.0. We are very tempted to use SpanQuery, as it seems to favor document ranking based on proximity (e.g., within a ...
    Ben Jiang
    Dec 28, 2009 at 6:42 am
    Dec 29, 2009 at 8:27 pm
  • Hello, I have a performance problem and would need expert advice on how to go about fixing it: I currently have 2 indexes: Daily and Hourly. The Daily index contains about 1,000,000 documents and my ...
    François Eric
    Dec 18, 2009 at 2:09 pm
    Dec 18, 2009 at 7:29 pm
  • Hi, all, I currently need a TokenFilter to break the token season07 into two tokens, season and 07. I tried PatternReplaceCharFilter to replace "season07" with "season 07"; however, the offset is not correct ...
    Weiwei Wang
    Dec 15, 2009 at 9:02 am
    Dec 15, 2009 at 12:48 pm
  • Hi All, I'm upgrading my code from 2.4 to 2.9, and I've hit an issue with deprecations. My old code was: new SnowballAnalyzer("English", StopAnalyzer.ENGLISH_STOP_WORDS); Looking at the JavaDocs, I'd ...
    Nick Burch
    Dec 14, 2009 at 12:55 pm
    Dec 15, 2009 at 11:26 am
  • Howdy, I am wondering if anyone has seen NearSpansUnordered.getPayload() not return payloads that are verifiably accessible via IR.termPositions? It's a bit confusing because most of the time they're ...
    Jason Rutherglen
    Dec 9, 2009 at 12:34 am
    Dec 9, 2009 at 10:26 pm
  • I'm on 2.3.2 and looking to move to 2.9.1 or 3.0.0. In 2.9.1, TopFieldDocCollector is "Deprecated. Please use TopFieldCollector instead." In 3.0.0, TopFieldCollector says NOTE: This API is experimental ...
    Antony Bowesman
    Dec 8, 2009 at 6:45 am
    Dec 8, 2009 at 9:44 pm
  • Hello Group, I am continuously updating an index and at the same time searcher also searches the index, which resulted in multiple .cfs files for each commit by IndexWriter. I am not sure whether ...
    Cool The Breezer
    Dec 8, 2009 at 6:26 am
    Dec 8, 2009 at 1:27 pm
  • Suppose I have a (useful) document stored in a Lucene index, and I have a variant that I'd also like to be able to search. This variant has the exact same data as the original document, but with some ...
    Dec 30, 2009 at 10:09 pm
    Jan 3, 2010 at 5:42 pm
  • I am getting an IOException when I am doing a "Real-time" search, i.e. I am creating an Index using the Index Writer and also opening the Index using Index Reader (writer.getReader()) to make sure the ...
    Kumaravel Kandasami
    Dec 30, 2009 at 8:29 pm
    Jan 1, 2010 at 11:53 am
  • *Problem description* - I have a complex multi-value field, so each value consists of several rows. - Each row consists of several cells/items. I want to be able to match those issues, which have ...
    Leonid M.
    Dec 28, 2009 at 6:09 pm
    Dec 30, 2009 at 5:03 pm
  • Hi, Following are the details of my problem and the possible solutions I can think of. Please suggest which I should choose, or whether there is another approach better than these. I want to index blog posts ...
    Shahid Faiz
    Dec 23, 2009 at 1:36 pm
    Dec 23, 2009 at 7:53 pm
  • Dear fellow Java developers: I am trying to run the XML Query Parser example that comes with Lucene 3.0 source distribution. I have the application structured identical to the one you download, and I ...
    Dec 23, 2009 at 1:04 am
    Dec 23, 2009 at 7:34 pm
  • I have indexed a number of geotagged locations and want the results sorted by relevance. For instance, when searching for "wash*" I want "Washington" (the city) to appear before "Washington Park", so ...
    Dec 19, 2009 at 5:49 pm
    Dec 20, 2009 at 4:24 pm
  • Hi, I have an index in which documents are inserted with different boosts during indexing, e.g. doc1 has boost 5.64, doc2 has boost 5.25, doc3 has boost 5.10, doc4 has boost 4.8, doc5 has boost 4.4, doc6 ...
    Prabin meitei
    Dec 18, 2009 at 5:41 pm
    Dec 19, 2009 at 6:47 am
  • Hi, I just realized that since I upgraded from Lucene 2.x to 3.0.0 (and removed all deprecated things), searches like these don't work anymore: test AND blue; test NOT blue; (test AND blue) OR red; etc. ...
    Michel Nadeau
    Dec 15, 2009 at 8:31 pm
    Dec 15, 2009 at 9:42 pm
  • Hi all, in Lucene 2.3.2 there was a method in TermEnum, skipTo(term). In 3.0.0 it's missing... Is there any other way to skip terms? ...
    Konstantyn Smirnov
    Dec 10, 2009 at 2:13 pm
    Dec 10, 2009 at 3:01 pm
  • Hi, I am a newbie to Lucene. I am using StandardAnalyzer in my Lucene project. I am indexing some fields which may contain only "%" as a field value; it indexes fine and I can view the value ...
    Dec 8, 2009 at 4:05 pm
    Dec 10, 2009 at 10:52 am
  • Hello All, I am a newbie using Lucene. To be brief, I am just wondering whether there is a point where we get access to the org.apache.lucene.document.Document (which is being indexed at the ...
    Phanindra Reva
    Dec 8, 2009 at 3:13 pm
    Dec 8, 2009 at 8:42 pm
  • Hi, just upgraded my code to Lucene 3.0 and on one simple search I get the following stacktrace when I pass Integer.MAX_VALUE to the Searcher.search(Query query,int n) method, if I change the value ...
    Paul Taylor
    Dec 2, 2009 at 1:35 pm
    Dec 6, 2009 at 9:53 am
  • Hi, we've run into problems with LockFactory usage on our system. The problem is that the system can be such that the index is configured on a local file system, or a remote, shared one. If ...
    Shai Erera
    Dec 2, 2009 at 12:36 pm
    Dec 2, 2009 at 1:52 pm
  • Hi all, I'm happy to announce that Luke 1.0.0 was released, and is available for download here: http://code.google.com/p/luke This version of Luke uses Lucene 3.0 jars. There were only minor changes ...
    Andrzej Bialecki
    Dec 26, 2009 at 1:51 pm
    Dec 26, 2009 at 5:46 pm
  • Merry Christmas, guys, I haven't taken part in any open source activities before, and I now want to make my project open source under the Apache Public License. What should I do? Apply for a license? and ...
    Weiwei Wang
    Dec 24, 2009 at 3:42 am
    Dec 24, 2009 at 8:39 am
  • Hi, All! I have some problems with Lucene's search process and its results, so I hope you can help me. First one: how should I split results into pages? Now I get search results this way: TopDocs ...
    Антон Кириллов
    Dec 21, 2009 at 12:26 pm
    Dec 22, 2009 at 12:30 pm
  • Dear everyone, I am a beginner with Java Lucene; please help me with the following research question: I have a SET of text documents that are indexed by Lucene. If I have another text document as a ...
    Phan The Dai
    Dec 21, 2009 at 8:47 am
    Dec 21, 2009 at 10:36 am
  • I want to split this parsed result string: name:"zhong guo" name:friend server: into name:"zhong guo" name:friend server: how can I write a regular pattern to do that? I'm not ...
    Weiwei Wang
    Dec 16, 2009 at 5:46 am
    Dec 16, 2009 at 10:31 am
  • Hi, all, Most of us have some experience of entering an incorrect keyword in Google or some other search engine. When this happens, Google can cleverly suggest another query ...
    Weiwei Wang
    Dec 15, 2009 at 7:09 am
    Dec 15, 2009 at 12:32 pm
  • I'm upgrading from 2.3.1 to 3.0.0. I have 3.0.0 index readers ready to go into production and writers in the process of upgrading to 3.0.0. I think I understand the implications of ...
    Rob Staveley (Tom)
    Dec 11, 2009 at 5:21 pm
    Dec 12, 2009 at 10:44 am
  • I'm trying to upgrade our application from Lucene 2.4.1 to Lucene 2.9.1. I've been using an InstantiatedIndex to do a bunch of unit testing, but am running into some problems with Lucene 2.9.1. In ...
    Jason Fennell
    Dec 9, 2009 at 6:48 pm
    Dec 10, 2009 at 8:40 pm
  • Hi all, I am using Lucene 2.3.2. When I search using the Lucene demo I get all the results that contain the query. But I would like to restrict my results to the relevant matches and not all the ...
    Dec 3, 2009 at 12:43 pm
    Dec 4, 2009 at 2:01 pm
Group Overview
group: java-user @

106 users for December 2009

  • 王巍巍: 51 posts
  • Michael McCandless: 46 posts
  • Uwe Schindler: 43 posts
  • Erick Erickson: 28 posts
  • Ganesh: 14 posts
  • AHMET ARSLAN: 12 posts
  • Elias Khsheibun: 12 posts
  • Rob Staveley (Tom): 9 posts
  • Toke Eskildsen: 9 posts
  • Ian Lea: 8 posts
  • Jason Rutherglen: 8 posts
  • Robert Muir: 7 posts
  • Syedfa: 7 posts
  • Anshum Gupta: 6 posts
  • DHIVYA M: 6 posts
  • Koji Sekiguchi: 6 posts
  • Michel Nadeau: 6 posts
  • Paul Taylor: 6 posts
  • Stefan Trcek: 6 posts
  • Simon Willnauer: 5 posts