116 discussions - 604 posts

  • -- original post was on solr's user list. -- -- i've reposted here as it's centered on the ShingleFilter which comes from lucene -- *ShortVersion* is there a way to make the ShingleFilter perform ...
    Mck
    Sep 9, 2008 at 4:35 pm
    Sep 17, 2008 at 6:51 pm
  • We have an application in which the index will be updated frequently. During development, we found that the index files get corrupted, i.e. more than one cfs file, some files with other extensions, e.g. frq, ...
    Chaula Ganatra
    Sep 26, 2008 at 10:50 am
    Oct 26, 2008 at 5:53 pm
  • Hi. I'm new to Lucene and I would like to get a few answers (they can be lame). I want to index a large number of emails using Lucene (maybe SOLR), not only the contents but also some metadata like ...
    Wojciech Strzałka
    Sep 12, 2008 at 11:59 am
    Sep 17, 2008 at 5:45 pm
  • I have a memory leak in my lucene search code. I am able to run a few queries fine, but I eventually run out of memory. Please note that I do close and set to null the ivIndexSearcher object ...
    Andy33
    Sep 2, 2008 at 7:54 pm
    Sep 8, 2008 at 2:10 am
  • First of all, thanks to all the people who helped me in getting the Lucene replication setup working, and right now it's live in our production :-) Everything is working fine, except that I am seeing some ...
    Rahul_k123
    Sep 26, 2008 at 5:23 am
    Sep 30, 2008 at 1:14 pm
  • Hi! I've a lucene document structured like: Field: Text name: George Bush Sex: Male Occupation: President of USA Now I can have two types of queries: Structured query: name: George Bush AND ...
    Anshul jain
    Sep 21, 2008 at 7:27 pm
    Sep 23, 2008 at 5:57 pm
  • Hello, What is the new preferred way of running a search? I understand Hits will be deprecated. So how do we do it the new way? With a hit collector? Best. ...
    Cam Bazz
    Sep 15, 2008 at 5:54 pm
    Sep 24, 2008 at 9:09 am
  • Hello all, I have indexed more than 4 million documents. My query fetches 300,000 hits. If I sort on any field, Tomcat reports an out-of-memory exception. Sometimes the query results ...
    Ganesh - yahoo
    Sep 17, 2008 at 12:46 pm
    Sep 23, 2008 at 12:40 pm
  • Hi all, How can I implement this scenario in Lucene? Suppose every document has three fields: docid, doctext and simdocid. docid is the id of the document, doctext is the content of the document, ...
    Xh sun
    Sep 19, 2008 at 8:01 am
    Sep 19, 2008 at 1:35 pm
  • Hello All, Has anyone tried this? My UI has a requirement to show the total number of results and then show results in pages. How do I do that with TopDocCollector, without having to run search() twice, ...
    Dino Korah
    Sep 17, 2008 at 11:38 am
    Sep 18, 2008 at 4:04 am
  • Hi, I am seeing odd behavior with SpanNearQuery. The problem is that with multiple fields, all fields beyond the first one 'car' are not seen by the span. I didn't think the span meant to sets of the ...
    Darren Govoni
    Sep 14, 2008 at 5:38 pm
    Sep 15, 2008 at 11:55 am
  • Hi all, I am wondering if Lucene implements the query optimisation that consists of ordering the posting lists based on the term frequency before intersection? If yes, could somebody point me to the ...
    Renaud Delbru
    Sep 17, 2008 at 12:20 pm
    Oct 13, 2008 at 3:53 pm
  • Hi community, In a Tomcat application (a keyword-based search engine), I use Lucene 2.3.2 to index 60 million documents, but each document is small. The total index size is about 60GB. ...
    Peter Cheng
    Sep 14, 2008 at 1:59 pm
    Oct 6, 2008 at 4:00 pm
  • I am baffled by the results of the following queries. Can it be something to do with the boosting factor? All of these queries are performed in the same environment with the same crawled index/data. ...
    Student_t
    Sep 26, 2008 at 7:56 pm
    Sep 29, 2008 at 4:11 pm
  • Hi, I am facing the problem below. Please help me with this. I have integrated CJK Analyzer for Japanese characters. I am able to save Japanese double-byte characters in a MySQL database in UTF-8 format ...
    Anandsarwade
    Sep 17, 2008 at 2:42 pm
    Sep 19, 2008 at 12:54 am
  • Hi all, We stuck a 60 GB OCZ "Core Series" SSD in a Dell T5400 (dual quadcore, 16GB RAM, SATA II 7200 RPM disk) and did some comparisons between running with our index on disk, vs. on SSD. I can't ...
    Eric Bowman
    Sep 16, 2008 at 9:06 am
    Sep 17, 2008 at 4:01 pm
  • Hello, Here is what I am trying to do: dir = FSDirectory.getDirectory("/test"); writer = new IndexWriter(dir, analyzer, true, new IndexWriter.MaxFieldLength(2)); ...
    Cam Bazz
    Sep 15, 2008 at 7:05 pm
    Sep 15, 2008 at 9:24 pm
  • Hello, I see that IndexWriter.flush() is deprecated in 2.4. What do we use? Also I used to make a: try { nodeWriter = new IndexWriter(nodeDir, true, analyzer, false); } catch(FileNotFoundException ...
    Cam Bazz
    Sep 15, 2008 at 5:32 pm
    Sep 15, 2008 at 8:09 pm
  • Hi all, First of all, sorry for my poor English. It's not my native language. I'm trying to use Lucene to index a hierarchical kind of information: I have structured html and pdf/word documents and I ...
    Leonid Maslov
    Sep 1, 2008 at 7:26 am
    Sep 10, 2008 at 11:22 am
  • Hi, We need a feature for time based cleanup of IndexCommits. Would it be possible to add a method to IndexCommit class to get the timestamp of an index commit? Thanks. -- Regards, Akshay Ukey.
    Akshay
    Sep 1, 2008 at 10:07 am
    Sep 4, 2008 at 9:56 am
  • Hi all, I tried to reuse the IndexSearcher among all of the threads that are doing searches as described in (http://wiki.apache.org/lucene-java/LuceneFAQ#head-48921635adf2c968f7936dc07d51dfb40d638b82) ...
    Simon litwan
    Sep 25, 2008 at 8:57 am
    Sep 26, 2008 at 3:03 pm
  • Hi, all! How can I implement front-end matching using Lucene? Say there are two Documents in the index: Document<stored/uncompressed,indexed<field:abc Document<stored/uncompressed,indexed<field:bcd How can I get the ...
    叶双明
    Sep 19, 2008 at 9:06 am
    Sep 22, 2008 at 12:10 am
  • Hi, We're thinking of using Lucene to integrate search in a backup service application. The background is that we have a bunch of users using a backup service, and we want them to be able to search ...
    Tobias Larsson Hult
    Sep 16, 2008 at 2:56 pm
    Sep 19, 2008 at 12:14 pm
  • Hello, What is the difference between flush in <2.4 and commit? Also I have been looking over docs, and they mention commit(long) but there is no commit(long) method but only commit() Best. ...
    Cam Bazz
    Sep 15, 2008 at 5:50 pm
    Sep 15, 2008 at 8:04 pm
  • You need to call rewrite on the query to expand it then give that version to the highlighter - see the package javadocs. ...
    Mark harwood
    Sep 11, 2008 at 9:21 am
    Sep 11, 2008 at 3:14 pm
  • Hi all, Is it possible to do a search with multiple wildcards in one query, for instance "%MANAGE%" AND "CORE%"? Is there a code example available? Thanks a lot, Mirko
    Sertic Mirko, Bedag
    Sep 10, 2008 at 1:07 pm
    Sep 11, 2008 at 8:35 am
  • Hey all, I want to know how similar two Strings are! The thing is: I'm processing an email box and I want to group all messages that have a similar subject. Makes sense? I looked on the ...
    Thiago Moreira
    Sep 3, 2008 at 7:53 pm
    Sep 9, 2008 at 5:36 pm
  • Hello, I have been looking at instantiated index in the trunk. Does this come with a searcher? Are the adds reflected directly to the index? Or is it just an experimental thing only with reader and ...
    Cam Bazz
    Sep 15, 2008 at 4:45 pm
    Oct 29, 2008 at 2:19 pm
  • Hi there, I am using WhitespaceAnalyzer to index documents. I have used this because I need to split tokens based on space only. Also Tokensized=true. While indexing, what does it do with special ...
    Miztaken
    Sep 16, 2008 at 1:05 pm
    Sep 16, 2008 at 4:28 pm
  • Hi there. I am trying to implement sorting in a large index (3 million documents). My sort field is a simple integer with values between 1 and 100. With IndexSearcher's search(Query, Sort) everything ...
    Dragan Jotanovic
    Sep 11, 2008 at 4:32 pm
    Sep 15, 2008 at 4:10 pm
  • Hi, In our application, I want users to be able to search for the updates they make almost immediately. Hence, whenever they update, I spawn a thread immediately to index. However, when the load on ...
    Nobody
    Sep 11, 2008 at 8:43 pm
    Sep 12, 2008 at 11:03 pm
  • hi folks, I'm new to Lucene and I'm looking for a way to search a substring that starts at a fixed position. It isn't a classical substring search because it's a bit weird. I indexed a field that ...
    Luther blisset
    Sep 11, 2008 at 11:35 am
    Sep 11, 2008 at 2:37 pm
  • We have some code that uses lucene which has been working perfectly well for several months. Recently, a QA team in our organization has set up a server with a much larger data set than we have ever ...
    Justin Grunau
    Sep 4, 2008 at 9:21 pm
    Sep 7, 2008 at 1:55 am
  • Hi, I am having an issue when using the PhraseQuery which is best illustrated with this example: I have created 2 documents to emulate URLs. One with a URL of: "http://www.airballoon.com" and title ...
    Yannis Pavlidis
    Sep 4, 2008 at 4:23 pm
    Sep 5, 2008 at 6:50 pm
  • Hello, This came up before but - if we were to make a swear word filter, string edit distances are no good. For example, a word like `shot` is confused with `shit`. There is also a problem with words ...
    Cam Bazz
    Sep 4, 2008 at 12:39 pm
    Sep 4, 2008 at 2:54 pm
  • Is it possible to query for documents that have empty values for a field? Say I need to find documents with an empty category. I tried a negative query: -category:* But it returns 0 documents. I think ...
    Chris Lu
    Sep 2, 2008 at 8:24 am
    Sep 4, 2008 at 1:20 am
  • Hi all, Most of our queries are very simple, of the type: Query query = new PrefixQuery(new Term(LABEL_FIELD, prefix)); Hits hits = searcher.search(query, new Sort(new SortField(LABEL_FIELD))) Which ...
    Andre Rubin
    Sep 2, 2008 at 5:06 pm
    Sep 2, 2008 at 6:20 pm
  • Hello all, is it somehow possible to get all document ids found by a search? Not only 50 or 100... If it is possible and someone knows how, please help me :-) Thanks and best regards, Gregor ...
    Gregor_ewald
    Sep 26, 2008 at 2:32 pm
    Sep 29, 2008 at 2:37 pm
  • According to https://issues.apache.org/jira/browse/LUCENE-1282?focusedCommentId=12596949#action_12596949 (Sun hotspot compiler bug in 1.6.0_04/05 affects Lucene), a workaround for the bug which ...
    Ari Miller
    Sep 26, 2008 at 12:10 am
    Sep 29, 2008 at 9:52 am
  • Hello, If I had a file with the following content: ... object.method(); ... I would like to be able to query for object method object.method My guess is that I should store not only "object.method", ...
    Jean-Claude Antonio
    Sep 22, 2008 at 9:41 pm
    Sep 23, 2008 at 8:48 pm
  • Hi! I'm writing an application that should use Lucene for searching through ~2000000 documents. I'm collecting search criteria using BooleanQuery and ChainedFilter. The average search takes 150 ms, what ...
    Nukie
    Sep 16, 2008 at 1:20 pm
    Sep 23, 2008 at 9:33 am
  • Hi, We have been running Lucene 2.3 for the last few months with our application and all of a sudden we have hit the following exception, java.lang.RuntimeException: java.io.IOException: background merge ...
    Vivek sar
    Sep 17, 2008 at 8:25 pm
    Sep 22, 2008 at 11:03 pm
  • Hi, In my webapp I'm trying to use the lucene api to build queries instead of the QueryParser but I haven't found out where to specify the Analyzer. Any help? Thanks. jean71 ...
    Giannandrea Castaldi
    Sep 22, 2008 at 8:47 am
    Sep 22, 2008 at 7:58 pm
  • Hello, Let's say I have two documents, both containing field F. Document 0 has the string "a b" as F, document 1 has the string "b a" as F. I am trying to make a phrasequery like: PhraseQuery pq = new ...
    Cam Bazz
    Sep 16, 2008 at 5:34 am
    Sep 17, 2008 at 3:56 pm
  • Hello, Could it do any harm if I make a searcher.search(query, Integer.MAX_VALUE)? I just need to make a query to get the number of hits in this case, but I don't know what the max hits will be. Also when I ...
    Cam Bazz
    Sep 16, 2008 at 2:43 am
    Sep 16, 2008 at 3:39 am
  • Hello, To patch for lucene-1314 what must I do? I think I have to cd into one of the directories, then patch -pX < ../../patchfile, but I could not figure it out. Best, -C.B. ...
    Cam Bazz
    Sep 15, 2008 at 11:39 am
    Sep 15, 2008 at 8:48 pm
  • Hi; I am trying to write an application that works on BlackBerry or other Java-enabled phones. This application talks with the server and asks some questions. The server side is C# and I am using ...
    Fatih Soydan
    Sep 15, 2008 at 7:55 am
    Sep 15, 2008 at 2:46 pm
  • Hi~. I have a question about Lucene incremental indexing. I want to incrementally index my goods data. For example, I have 4 product records with "GOOD_ID","NAME","PRICE","CREATEDATE","UPDATEDATE" ...
    장용석
    Sep 9, 2008 at 12:38 am
    Sep 10, 2008 at 1:30 am
  • Hi guys. Glad to announce that I finally managed to move this package out of the company code and released it to the OS community. The package contains some neat classes which we use for instance ...
    Marcus Herou
    Sep 8, 2008 at 4:44 am
    Sep 8, 2008 at 8:09 pm
  • hello, anyone using ramdisks for storage? there is ramsam and there is also fusion io. but they are kinda expensive. any other alternatives I wonder? Best.
    Cam Bazz
    Sep 4, 2008 at 3:58 pm
    Sep 5, 2008 at 12:52 pm
Group Overview
group: java-user @
categories: lucene
discussions: 116
posts: 604
users: 121
website: lucene.apache.org

121 users for September 2008

Michael McCandless: 57 posts
Cam Bazz: 39 posts
Erick Erickson: 37 posts
Chris Hostetter: 23 posts
叶双明: 22 posts
Grant Ingersoll: 16 posts
Jason Rutherglen: 15 posts
Otis Gospodnetic: 14 posts
Dino Korah: 13 posts
Karl Wettin: 13 posts
Mck: 13 posts
Mark Miller: 11 posts
Steven A Rowe: 11 posts
Chris Lu: 10 posts
Leonid Maslov: 8 posts
Ian Lea: 7 posts
Jimi Hullegård: 7 posts
Mark harwood: 7 posts
Noble Paul നോബിള്‍ नोब्ळ्: 7 posts
Andy33: 6 posts