Search Discussions
-
Hi all, I was wondering if it is possible to do boosting by search terms' position in the document. for example: search terms appear in the first 100 words, or first 10% words, or in first two ...
Cedric Ho
Aug 1, 2007 at 4:13 am
Dec 18, 2007 at 5:19 pm -
Is it possible to rename fields in an existing index without having to re-index all documents? thx -- Antoine Baudoux Development Manager ab@taktik.be Tél.: +32 2 333 58 44 GSM: +32 499 534 538 Fax.: ...
Antoine Baudoux
Aug 22, 2007 at 7:55 am
Feb 1, 2008 at 7:30 am -
Hello, I have an application with a 2GB index. A lot of documents (up to 10.000 per day) are added/deleted to this index. My customer would like to have a Maximum of 7 minutes delay between a media ...
Antoine Baudoux
Aug 29, 2007 at 8:04 pm
Aug 31, 2007 at 4:08 pm -
A few questions on custom score queries: [1] I need to rank matches by some combination of keyword match, popularity and recency of the doc. I read the docs about CustomScoreQuery and seems to be a ...
Srinivas.N.
Aug 13, 2007 at 11:35 pm
Aug 20, 2007 at 2:00 pm -
Hi, I would like to keep user search history data and I am looking for some ideas/advices/recommendations. In general I would like to talk about methods of storing such data, its structure and how to ...
Lukas Vlcek
Aug 10, 2007 at 7:28 am
Aug 14, 2007 at 8:53 pm -
Hi, I indexed a large number of large documents, but I did not index the document themselves. Now I am interested in getting the vector (i.e.: the terms indexed and the frequency) of that indexed but ...
Tierecke
Aug 3, 2007 at 9:18 am
Nov 6, 2007 at 11:36 am -
10
out of order
Using Lucene 2.2.0, I still sporadically got doc out of order error. I indexed all of my stuff in one thread. Do you have any idea why it happens? Thanks! -- View this message in context: ...Testn
Aug 15, 2007 at 11:05 pm
Aug 16, 2007 at 7:50 pm -
Hi all, Can I get just a list of document Ids given a search criteria ? To elaborate here is my situation: I store 20000 contracts in the file system index each with some parameterName and Value. ...
Makkhar
Aug 2, 2007 at 8:56 am
Aug 4, 2007 at 3:14 am -
9
Indexing
Hi, I'm new to this list. So first of all Hello to everyone! So right now I have a little issue I would like to discuss with you. Suppose that your are in a really big application where the data in ...Jonathan Ariel
Aug 22, 2007 at 1:15 pm
Aug 26, 2007 at 10:46 am -
Hello while calling IndexReader.deletedoc(int) I am becomming a NPE. java.lang.NullPointerException at org.apache.lucene.index.IndexReader.acquireWriteLock(IndexReader.java:658) at ...
Eric Louvard
Aug 21, 2007 at 8:10 am
Aug 22, 2007 at 10:59 am -
I know how to do english text with POI and PDFBox and so on. Now, I want to start indexing non-english language such as french and spanish. Which extraction libs are available for me? I want to do: ...
Michael Prichard
Aug 1, 2007 at 5:45 am
Sep 12, 2007 at 2:02 pm -
Hi! I have an index containing the following fields "id" (not to be confused with the internal Lucene id) "version" "date" The combination of "id" and "version" is unique, i.e. there may be serveral ...
Per Lindberg
Aug 28, 2007 at 3:49 pm
Aug 29, 2007 at 3:53 pm -
Hi all, My problem is as follows: Our documents each comes from a different publication. And we currently have 5000 different publication sources. Our clients can choose arbitrarily a subset of the ...
Cedric Ho
Aug 13, 2007 at 4:18 am
Aug 15, 2007 at 5:52 am -
I have a searchable index of documents which contain french and spanish diacritics (è, é, À) etc. I would like to make the content searchable so that when a user searches for a word such as ...
Anorman
Aug 27, 2007 at 2:03 pm
Aug 28, 2007 at 11:55 am -
Hi I am using WhitespaceAnalyzer and the query is " icdCode:H* " but there is no result however I know that there are many documents with this field value such as H20, H20.5 etc. this field is ...
Mohammad Norouzi
Aug 15, 2007 at 5:18 am
Aug 19, 2007 at 3:15 pm -
Hi, Folks - Two quick questions - need to size a server to run our new index. If I have an index with 111k articles and 90 million words indexed, how much RAM should I have to get really fast access ...
Lucene user
Aug 12, 2007 at 7:04 am
Aug 13, 2007 at 10:33 pm -
Hi, I have indexed 5 fields and stored 2 of them(field Length is around 10000). My index is growing in nature and it is in GB. I need to get search result based on docID only. Scoring, additional ...
SK R
Aug 6, 2007 at 11:39 am
Aug 7, 2007 at 12:25 pm -
I'm creating a tokenized "content" Field from a plain text file using an InputStreamReader and new Field("content", in); The text file is large, 20 MB, and contains zillions lines, each with the the ...
Per Lindberg
Aug 31, 2007 at 2:17 pm
Sep 11, 2007 at 1:29 pm -
Hello i would like to implement a suggest implementation (like google suggest) using lucene. i actually tried using lucene and it was successfull but i was stuck in some point which is returning a ...
Heba Farouk
Aug 21, 2007 at 10:00 am
Aug 22, 2007 at 12:38 pm -
Hi, Is there a way to delete the results from a query or a filter and not documents specified by Term. I have seen some explanations here but i do not know how to do it: ...
Abu Abdulla alhanbali
Aug 18, 2007 at 6:40 am
Aug 20, 2007 at 6:44 pm -
I'm working on refining my stopwords by looking at the highest scoring document returned for each search, and using the highlighter to show which terms were significant in choosing that document. ...
Donna L Gresh
Aug 15, 2007 at 5:22 pm
Aug 16, 2007 at 4:17 pm -
Hi, could u pl. tell me how to update boost factor of already indexed document using setBoost. Thanks & regards, Rohit -- VANDE - MATRAM
Rohit saini
Aug 10, 2007 at 3:59 am
Aug 14, 2007 at 10:45 am -
Hello, I need to do a search that is capable to also match on substrings, for example: *oo bar the qu* should find a document that contains 'foo bar the quux' and 'foo bar the qux'. Now, should I ...
Ard Schrijvers
Aug 8, 2007 at 8:28 am
Aug 12, 2007 at 3:55 am -
Hi There! I've been working for a while on the implementation of a website oriented to contents that would contain millions of entries, most of them indexable (such as descriptions, texts, names, ...
Antonello Provenzano
Aug 10, 2007 at 9:09 am
Aug 11, 2007 at 1:21 pm -
Hi again, everyone. First of all, I want to thank everyone for their extremely helpful replies so far. Also, I just started reading the book "Lucene in Action" last night. So far it's an awesome ...
Joe Attardi
Aug 1, 2007 at 3:32 pm
Aug 1, 2007 at 9:31 pm -
Hi Lucene gurus, I am newbie and i have a question on transfering index directories across multiple machines. Whenever i update/add any new documents to the existing index, then it is generating new ...
Varma d
Aug 26, 2007 at 1:52 am
Jun 9, 2009 at 9:19 am -
I'm invoking Luke like this: java -jar lukeall-0.7.1.jar I run this query: content:Nyarubuye When I use the StandardAnalyzer I get results but when I use the KeywordAnalyzer I don't get results. Can ...
Kai_testing Middleton
Aug 7, 2007 at 11:22 pm
Dec 2, 2008 at 6:24 pm -
Hi, I have fields which have high multiplicity; for example I have a topic with 1000 names, 500 of which are "USA" and 200 are "United States of America". Previously I was indexing "USA USA .(500x).. ...
Tim Sturge
Aug 28, 2007 at 7:29 pm
Aug 29, 2007 at 7:59 pm -
Hi All, I have the following set up: a) Indexed set of docs. b) Ran 1st query and got tops docs c) Fetched the id's from that and stored in a data structure. d) Ran 2nd query , got top docs , fetched ...
Lokeya
Aug 16, 2007 at 6:20 pm
Aug 20, 2007 at 3:43 pm -
Hello, I have an index with an 'actor' field, for each actor there exists an single field value entry, e.g. stored/compressed,indexed,tokenized,termVector,termVectorOffsets,termVectorPosition ...
Duiduder
Aug 16, 2007 at 9:50 am
Aug 20, 2007 at 2:18 pm -
Hi, John I think you cost too much time in I/O,and if you use RAMDirectory first will better.see http://wiki.apache.org/lucene-java/ImproveIndexingSpeed kai -----邮件原件----- 发件人: Erick Erickson 发送时间: ...
Kai Hu
Aug 13, 2007 at 7:02 am
Aug 16, 2007 at 2:01 pm -
Hi all, Lucene query parser synax page (http://lucene.apache.org/java/docs/queryparsersyntax.html) provides the following two examples of range query: mod_date:[20020101 TO 20030101] and title:{Aida ...
Nilesh Bansal
Aug 11, 2007 at 8:27 pm
Aug 13, 2007 at 1:20 pm -
Hi there, I have my 25 indexes of 1.8GB each read with MultiReader. I try to get the document frequency of all the terms in specific documents and it takes quite a long time - a document with 1000 ...
Tierecke
Aug 5, 2007 at 11:41 pm
Aug 7, 2007 at 12:57 am -
Hi, I got unexpected behavior while testing lucene. To shortly address the problem: Using IndexWriter I add docs with fields named ID with a consecutive order (1,2,3,4, etc) then close that index. I ...
Ridwan Habbal
Aug 1, 2007 at 3:49 pm
Aug 2, 2007 at 1:02 pm -
I want to set documents in my IndexReader as deleted, but I will never commit these deletions. Sort of a filter on a reader rather than on a searcher, and no write-locks. Can I do that out of the ...
Karl wettin
Aug 20, 2007 at 2:46 am
Sep 4, 2007 at 3:50 pm -
I have been fine with my database (discussion forum) to lucene. I am taking the simplest approach, eg; I have a discussion forum which are just text messages, I take those out of the databse and then ...
Bbrown
Aug 31, 2007 at 8:15 pm
Sep 3, 2007 at 8:14 pm -
Hi everyone, I have the following need and I wander what are my options or if anyone run into it and has a solution / suggestion. I'm indexing a SQL database. Each table is a Lucene index. Now, in ...
George Aroush
Aug 30, 2007 at 2:03 am
Aug 30, 2007 at 9:55 am -
I've searched the mailing list archives, the web, read the FAQ, etc and I don't see anything relevant so here it goes… I'm trying to implement a radius based searching based on zip/postal codes. (The ...
Mike
Aug 29, 2007 at 3:36 pm
Aug 29, 2007 at 5:04 pm -
i'm looking at doing some statistical work with lucene searches and the function queries look like a nice starting point. i found the DocValues.getMin/Max/Avg functions already however there doesn't ...
Will Johnson
Aug 24, 2007 at 9:04 pm
Aug 27, 2007 at 1:43 pm -
Hello, I'm indexing 2,5 millions docs. I already have added 1,2 millions docs to the index and the indexing speed becomes quite slow. my index directory is 1GB . Is there a limit to the indexing ...
Antoine Baudoux
Aug 27, 2007 at 6:27 am
Aug 27, 2007 at 12:58 pm -
Hi, I need your help in formalizing this query: (field1:query1 AND field2:query2) OR (field1:query3 AND field2:query4) OR (field1:query5 AND field2:query6) OR (field1:query7 AND field2:query8) ... ...
Abu Abdulla alhanbali
Aug 10, 2007 at 4:21 am
Aug 18, 2007 at 4:30 am -
I've been experimenting with using SpanQuery to perform what is essentially a limited type of database 'join'. Each document in the index contains 1 or more 'rows' of meta data from another 'table'. ...
Peter Keegan
Aug 13, 2007 at 6:34 pm
Aug 15, 2007 at 12:02 am -
Here's a scenario I just ran into, though I don't know how to make Lucene do it (or even if it can). I have two lists; to keep things simply lets assume (A B C D E F G) and (X Y). I want to form a ...
Walt Stoneburner
Aug 13, 2007 at 6:20 pm
Aug 14, 2007 at 8:17 pm -
Antonello, You are right,I think lucene indexsearcher will search the old information if IndexWriter was not closed(I think lucene release the Lock here),so I only add a few documents every time from ...
Kai Hu
Aug 10, 2007 at 10:17 am
Aug 11, 2007 at 5:16 pm -
I was wondering if there is a "search based" method to find the top-k frequent phrases in a set of documents.( I do not have a particular phrase in mind so PhraseQuery can probably be ruled out). I ...
Akanksha Baid
Aug 9, 2007 at 7:35 am
Aug 10, 2007 at 4:41 pm -
Is there a good way to handle the following scenario: I have certain terms with embedded periods for which I want to leave them intact (not split at the periods). For example in my application a ...
Donna L Gresh
Aug 9, 2007 at 2:37 pm
Aug 9, 2007 at 5:29 pm -
Hi, I got stuck with a complex proximity clause - and would be grateful to get your help. Does Lucene allow, and if yes: what is the syntax? * Proximity between two phrases, for instance a within n1 ...
Tierecke
Aug 5, 2007 at 12:27 am
Aug 5, 2007 at 12:09 pm -
Hi, We're considering to use the new IndexWriter.deleteDocuments call rather than the IndexReader.delete call. Are there any performance improvements that this may provide, other than the benefit of ...
Andreas Knecht
Aug 3, 2007 at 4:27 am
Aug 3, 2007 at 7:47 pm -
Hello, I've been asked to devise some way to discover and correct data in Lucene indexes that have been "corrupted." The word "corrupt", in this case, has a few different meanings, some of which ...
Joe R
Aug 2, 2007 at 3:24 pm
Aug 3, 2007 at 1:20 pm -
I understand that only document that has been indexed will be able to search. I already manage to index the document and also search the content of the document. The problem is, why is that there are ...
Masz-wow
Aug 1, 2007 at 4:32 am
Aug 1, 2007 at 9:19 am
Group Overview
group | java-user |
categories | lucene |
discussions | 130 |
posts | 568 |
users | 138 |
website | lucene.apache.org |
138 users for August 2007
Archives
- June 2016 (77)
- May 2016 (96)
- April 2016 (116)
- March 2016 (67)
- February 2016 (76)
- January 2016 (78)
- December 2015 (85)
- November 2015 (114)
- October 2015 (95)
- September 2015 (123)
- August 2015 (98)
- July 2015 (107)
- June 2015 (85)
- May 2015 (70)
- April 2015 (103)
- March 2015 (130)
- February 2015 (183)
- January 2015 (111)
- December 2014 (147)
- November 2014 (117)
- October 2014 (118)
- September 2014 (148)
- August 2014 (206)
- July 2014 (161)
- June 2014 (282)
- May 2014 (162)
- April 2014 (152)
- March 2014 (152)
- February 2014 (219)
- January 2014 (147)
- December 2013 (88)
- November 2013 (176)
- October 2013 (220)
- September 2013 (232)
- August 2013 (257)
- July 2013 (320)
- June 2013 (223)
- May 2013 (228)
- April 2013 (233)
- March 2013 (309)
- February 2013 (224)
- January 2013 (425)
- December 2012 (246)
- November 2012 (301)
- October 2012 (200)
- September 2012 (116)
- August 2012 (229)
- July 2012 (302)
- June 2012 (203)
- May 2012 (253)
- April 2012 (172)
- March 2012 (245)
- February 2012 (347)
- January 2012 (302)
- December 2011 (191)
- November 2011 (246)
- October 2011 (251)
- September 2011 (230)
- August 2011 (197)
- July 2011 (254)
- June 2011 (374)
- May 2011 (310)
- April 2011 (310)
- March 2011 (422)
- February 2011 (227)
- January 2011 (365)
- December 2010 (239)
- November 2010 (322)
- October 2010 (295)
- September 2010 (192)
- August 2010 (295)
- July 2010 (296)
- June 2010 (292)
- May 2010 (299)
- April 2010 (359)
- March 2010 (399)
- February 2010 (448)
- January 2010 (467)
- December 2009 (478)
- November 2009 (699)
- October 2009 (609)
- September 2009 (450)
- August 2009 (465)
- July 2009 (582)
- June 2009 (470)
- May 2009 (513)
- April 2009 (609)
- March 2009 (684)
- February 2009 (389)
- January 2009 (356)
- December 2008 (589)
- November 2008 (480)
- October 2008 (508)
- September 2008 (604)
- August 2008 (582)
- July 2008 (522)
- June 2008 (444)
- May 2008 (424)
- April 2008 (453)
- March 2008 (515)
- February 2008 (560)
- January 2008 (619)
- December 2007 (405)
- November 2007 (471)
- October 2007 (392)
- September 2007 (337)
- August 2007 (568)
- July 2007 (584)
- June 2007 (496)
- May 2007 (623)
- April 2007 (542)
- March 2007 (765)
- February 2007 (669)
- January 2007 (602)
- December 2006 (469)
- November 2006 (498)
- October 2006 (598)
- September 2006 (572)
- August 2006 (668)
- July 2006 (692)
- June 2006 (695)
- May 2006 (609)
- April 2006 (497)
- March 2006 (695)
- February 2006 (541)
- January 2006 (544)
- December 2005 (368)
- November 2005 (529)
- October 2005 (565)
- September 2005 (526)
- August 2005 (493)
- July 2005 (409)
- June 2005 (570)
- May 2005 (363)
- April 2005 (464)
- March 2005 (419)
- February 2005 (600)
- January 2005 (636)
- December 2004 (633)
- November 2004 (597)
- October 2004 (460)
- September 2004 (495)
- August 2004 (450)
- July 2004 (552)
- June 2004 (491)
- May 2004 (355)
- April 2004 (362)
- March 2004 (486)
- February 2004 (375)
- January 2004 (285)
- December 2003 (377)
- November 2003 (452)
- October 2003 (217)
- September 2003 (291)
- August 2003 (186)
- July 2003 (226)
- June 2003 (218)
- May 2003 (334)
- April 2003 (256)
- March 2003 (276)
- February 2003 (228)
- January 2003 (190)
- December 2002 (192)
- November 2002 (365)
- October 2002 (280)
- September 2002 (179)
- August 2002 (117)
- July 2002 (203)
- June 2002 (229)
- May 2002 (248)
- April 2002 (282)
- March 2002 (228)
- February 2002 (252)
- January 2002 (134)
- December 2001 (146)
- November 2001 (327)
- October 2001 (177)
- September 2001 (1)