Search Discussions
-
I have an application using Lucene 1.3 final. In this application, I am loading data where the main text for each document is stored into a "body" field, a couple of other internal fields, and ...
David Sitsky
May 25, 2004 at 1:12 am
Jun 3, 2004 at 6:53 am -
Hi... I have a query screen where most of the fields search a regular database but one field searches for text in the body of the document. You could say the database holds metadata about the ...
Glen Stampoultzis
May 11, 2004 at 3:10 am
May 13, 2004 at 12:04 am -
Hey Guys Found some Highlighter Package on CVS Directory Was Investigating,found some Compile time error.. Please some body tell me what this The Code:- private IndexReader reader=null; private ...
Karthik N S
May 19, 2004 at 8:10 am
Jun 15, 2004 at 8:39 am -
Hi, I'm a newbie to Lucene and heard that it helps in the information retrieval process. However, my problem is not really related to the information retrieval but to the comparison of two texts. I ...
Uddam chukmol
May 31, 2004 at 6:10 pm
Jun 2, 2004 at 6:37 pm -
10
Page ranking
Is there anyone out there that has page ranking implemented on top of Lucene? Just in case anyone may be thinking otherwise, when I say page ranking I'm not referring to the ranking of results from ...Scott Sayles
May 18, 2004 at 5:05 pm
Jun 1, 2004 at 5:24 pm -
10
Memory usage
Hello, I was wondering if anyone has had problems with memory usage and MultiSearcher. My index is composed of two sub-indexes that I search with a MultiSearcher. The total size of the index is about ...James Dunn
May 26, 2004 at 7:02 pm
May 27, 2004 at 4:27 pm -
Hi I noticed that most users have +- 1G of RAM to run Lucene. Does anyone have experiences running it on a 128MB or 256MB machine? http://jakarta.apache.org/lucene/docs/benchmarks.html Advise on the ...
Sebastian Ho
May 13, 2004 at 9:28 am
May 13, 2004 at 9:23 pm -
I've knocked together this tool which automatically discovers Analyzers on the classpath and provides a GUI to allow you to try out different Analyzers and see their effects: ...
Markharw00d
May 27, 2004 at 10:45 pm
Jun 2, 2004 at 1:08 pm -
Hi, I am using CJK Tokenzier for searching the Japanese documents. I am able to search japanese documents which are text files. But I am not able to search from Microsoft word, excel files with ...
Ankur Goel
May 20, 2004 at 5:10 pm
May 24, 2004 at 3:48 pm -
Say I have a query result for the term Linux... now I just want the TITLE of these documents not the BODY. To further this scenario imagine the TITLE is 500 bytes but the BODY is 50M. The current ...
Kevin Burton
May 13, 2004 at 6:19 am
May 22, 2004 at 7:22 pm -
Lucene Users, I'm using the SearchBean contribution from the sandbox to implement a Struts application search with Lucene (taking ACO's advice and going MVC on the demo app.) Right now I'm having a ...
Timothy Stone
May 18, 2004 at 3:43 pm
May 20, 2004 at 3:29 am -
How can I construct a document that has multiple values for one field (ex: locale en_US, de_DE, etc). I've been concatonating the values into one string and storing them in one field, but I think ...
Ryan Sonnek
May 11, 2004 at 4:25 pm
May 17, 2004 at 10:20 am -
Hello all, I am doing some search work using Lucene 1.4 paralmultisearch. If I use MultiSearcher.It works well. But if I use ParallelMultisearcher the code can compile correctly and cann't execute ...
Xuemei li
May 7, 2004 at 7:56 pm
May 10, 2004 at 9:50 am -
Hi all, I want to integrate lucene into my web app. I would like to increase the score of the document when more people click on it. Could I implement that in lucene ? Thanks. Perseus MSN 8 helps ...
Centaur zeus
May 6, 2004 at 2:45 am
May 6, 2004 at 10:23 pm -
Try using Tidy. Creates a Document of the html and allows you to apply xpath. Hope this helps. Kiran. -----Original Message----- From: Karthik N S Sent: 17 May 2004 11:59 To: Lucene Users List ...
Viparthi, Kiran (AFIS)
May 17, 2004 at 9:57 am
May 26, 2004 at 11:28 am -
When performing a search with lucene, is it possible to only return a subset of the results? I need to be able to page through results, and it seems much more efficient if I can tell the searcher, ...
Ryan Sonnek
May 11, 2004 at 1:59 pm
May 15, 2004 at 10:20 am -
Hi, Working now for a few months with this really great search engine, I was wondering where the name "Lucene" comes from? What does it mean? Is there any deeper sense? Thanks, Til ...
Til Schneider
May 5, 2004 at 8:26 am
May 7, 2004 at 7:26 am -
Hi, I have a bunch of digits in a field. When I do this search it returns nothing: myField:0000001085609805100 It returns the correct document when I add a * to the end like this: ...
Reece 1247688
May 26, 2004 at 10:31 pm
May 27, 2004 at 6:59 pm -
I switched to indexing using a text field instead of keyword, then I tried the following based on various pieces of advice: PerFieldAnalyzerWrapper pfaw = new PerFieldAnalyzerWrapper(new ...
Alex Bourne
May 26, 2004 at 12:40 pm
May 27, 2004 at 7:33 am -
I am trying to index a field in a Lucene document with about 90,000 characters. The problem is that it only indexes part of the document. It seems to only index about 65,00 characters. So, if I ...
Gilberto Rodriguez
May 26, 2004 at 8:08 pm
May 26, 2004 at 10:13 pm -
Hi Hannah, Otis I cannot help but I have excatly the same problems with special german charcters. I used snowball analyser but this does not help because the problem (tokenizing) appears before the ...
PEP AD Server Administrator
May 19, 2004 at 4:09 pm
May 21, 2004 at 11:02 am -
Per the discussion the other day about storing content external to Lucene I think we have an opportunity to improve the lucene core and bring a lot of functionality to future developers. Right now ...
Kevin Burton
May 18, 2004 at 6:43 pm
May 19, 2004 at 11:25 pm -
Hi, I hope someone can help! I am using Lucene to make a searching repository of electronic documents. (MS Office, PDF's etc.). Some of these document can contain a large amount of text (about 500K ...
Paul Williams
May 14, 2004 at 3:23 pm
May 14, 2004 at 5:35 pm -
At one point I thought I'd read that a Hits object doesn't actually contain Documents, but rather references to them. However, in that case I wouldn't expect I could save a Hits object past the ...
Dmgoodstein
May 27, 2004 at 8:52 pm
May 28, 2004 at 3:51 am -
I tried this, but no it does not work. I'm concerned that escaping the minus symbol does not appear to work. The field is indexed as a keyword so is not tokenized - I've checked the contents using ...
Alex Bourne
May 24, 2004 at 4:52 pm
May 24, 2004 at 8:18 pm -
Hi all, I want to achieve the following, when I indexing the 'xyz@company.com', I want to index the 'xyz@company.com' token, then the 'xyz' token, the 'company' token and the 'com'token. This way, ...
Albert Vila
May 21, 2004 at 1:44 pm
May 21, 2004 at 11:43 pm -
I am using the lucene 1.4 to index the information. I have lot of HTML tags in the information that i will be indexing ,so let me know if their is any way of removing the HTML tags from being ...
Mahesh
May 20, 2004 at 5:45 am
May 20, 2004 at 10:37 am -
Hi all! I'm currently developing an application in which text searching is a main component. Among other things, a document will contain a field denoting hierarchical information. The information is ...
Fredrik Lindner
May 17, 2004 at 9:08 am
May 17, 2004 at 12:45 pm -
Hi, The documentation for BooleanQuery.add() states : "Adds a clause to a boolean query. Clauses may be: required which means that documents which do not match this sub-query will not match the ...
Leonid Portnoy
May 13, 2004 at 7:38 pm
May 14, 2004 at 12:56 pm -
Hi This is a typical web crawler, indexing and search application development. I have wrote my crawler and planning to add lucene in next. One questions pop to my mind, in terms of performance, do i ...
Sebastian Ho
May 13, 2004 at 1:27 am
May 13, 2004 at 5:57 pm -
Hi, I have no idea where to look for, and I know almost nothing about java :-( We're using lucene quite a while now (about a year I guess) and suddenly I've seen this when trying to optimize the ...
Sascha Ottolski
May 4, 2004 at 5:30 pm
May 12, 2004 at 5:14 pm -
Version 1.4 RC3 of Lucene is available for download from: http://cvs.apache.org/dist/jakarta/lucene/v1.4-rc3/ Changes are described at: ...
Doug Cutting
May 11, 2004 at 8:53 pm
May 12, 2004 at 2:42 pm -
Hi, I am considering a project that would index 315+ million documents. I am comfortable that the indexing will work well in creating an index ~800GB in size, but am concerned about the query ...
Will Allen
May 6, 2004 at 11:48 pm
May 7, 2004 at 4:12 pm -
Hi, What's the best way to store numbers for range searching? If someone has some info about this I'd love to see it. This is my current plan: When I convert the number to a string I will zero pad it ...
Reece 1247688
May 6, 2004 at 3:44 pm
May 6, 2004 at 10:59 pm -
Hi,all, Can we do search and update one index simultaneously?Is someone know sth about it? I had done some experiments.Now the search will be blocked when the index is being updated.The error in ...
Xuemei li
May 18, 2004 at 10:05 pm
Jun 1, 2004 at 5:28 pm -
Hi, I have the following question: Is there an easy way to see which words from a query were found in a resulting document? So if I search for 'cat OR dog' and get a result document with only 'cat' ...
Edvard_eid
May 25, 2004 at 8:52 am
May 26, 2004 at 12:05 pm -
Haven't seen this discussed here. See 7a at the link below: http://www.asktog.com/columns/062top10ReasonsToNotShop.html 7a talks about searching on a camera site for the "Lowepro 100 AW". He says ...
David Spencer
May 21, 2004 at 4:10 pm
May 21, 2004 at 11:49 pm -
Hi All, I'm using Lucene on a site that has split content with a branch containing pages in English and a separate branch in Chinese. Some of the chinese pages include some (untranslatable) English ...
Alex Bourne
May 21, 2004 at 3:36 pm
May 21, 2004 at 3:57 pm -
Hi All, I just upgraded to 1.4 RC 3 and am now unable to open my index. I am getting: java.io.IOException: The system cannot find the path specified at ...
Grant Ingersoll
May 17, 2004 at 6:15 pm
May 18, 2004 at 10:50 am -
Hi, I tried to build lucene 1.4 -rc3 with ant 1.5.3 and java 1.4.1_02. When I type "ant clean", I got an error message: build.xml:11: Unexpected element "tstamp". It seems like ant version problem, ...
Zhang, Lisheng
May 16, 2004 at 3:47 am
May 16, 2004 at 3:57 am -
Hi, I am getting "not a directory" error when doing search after I moved the index from local to a SAN box. FSDirectory does not recognize the index directory as a directory. Any idea? I use JDK142 ...
Hui
May 13, 2004 at 3:14 am
May 14, 2004 at 3:18 am -
We are currently using lucene 1.3 on a production web server. For the most part, it runs great. However, once in a while we see some problems which I suspect are the infamous "running out of file ...
Scott Smith
May 6, 2004 at 6:11 pm
May 14, 2004 at 1:22 am -
Hi, I found following entry within the mail-archives: http://www.mail-archive.com/lucene-user@jakarta.apache.org/msg02129.html Is there now (2 years ago) a possibility to have the index within a ...
Edin Pezerovic
May 7, 2004 at 10:15 am
May 13, 2004 at 1:59 pm -
Hello, I'm using Lucene 1.4 RC2, and I'm having trouble understanding how the scoring relates to document rank. The work I am doing is going to depend very much on knowing exactly how the scoring ...
Matthew W. Bilotti
May 10, 2004 at 3:26 pm
May 10, 2004 at 10:14 pm -
We have a number of small indices and also an uber-index made up of all the smaller indices. We need to get do a search across a number of the sub-indices and get back a hit count from each. ...
David Townsend
May 10, 2004 at 12:12 pm
May 10, 2004 at 5:12 pm -
I've seen this: http://www.jguru.com/faq/view.jsp?EID=538312 I've seen in the code that there is a method to set lowercasing, but I need to remove accentuated chars as well. Any suggestions as to ...
Stephane James Vaucher
May 10, 2004 at 8:05 am
May 10, 2004 at 10:01 am -
Hi all, I have a good working index about 3 GB in one directory for example in c:/index1 now i want to change the computer and directory for example to d:/index2 (is this possible ???) and when i ...
Rosen Marinov
May 3, 2004 at 1:53 pm
May 3, 2004 at 2:09 pm -
Thanks for responding Nader. hmmmm...you've hit the nail on the spot. I do have a cron job which backs up the index. Its run in a batch index scheduled job. The logic is basically backupindex() try { ...
Kelvin Tan
May 3, 2004 at 2:52 am
May 3, 2004 at 4:58 am -
Just thought I'd pass on some info I just discovered. I've been successfully using the CVS head version of Lucene as of about 2 months ago. I then got the formal release (1.4-rc3) and tried it with ...
Terry Steichen
May 31, 2004 at 8:07 pm
Jun 1, 2004 at 9:58 am -
Hi there, I am a newbie to Lucene and I'm considering using it in an upcoming project. I've read through the documentation but I still have a number of questions: 1. SEGMENTING AN INDEX & QUERIES BY ...
Sasha Haghani
May 31, 2004 at 2:34 am
May 31, 2004 at 1:14 pm
Group Overview
group | java-user |
categories | lucene |
discussions | 82 |
posts | 355 |
users | 105 |
website | lucene.apache.org |
105 users for May 2004
Archives
- June 2016 (77)
- May 2016 (96)
- April 2016 (116)
- March 2016 (67)
- February 2016 (76)
- January 2016 (78)
- December 2015 (85)
- November 2015 (114)
- October 2015 (95)
- September 2015 (123)
- August 2015 (98)
- July 2015 (107)
- June 2015 (85)
- May 2015 (70)
- April 2015 (103)
- March 2015 (130)
- February 2015 (183)
- January 2015 (111)
- December 2014 (147)
- November 2014 (117)
- October 2014 (118)
- September 2014 (148)
- August 2014 (206)
- July 2014 (161)
- June 2014 (282)
- May 2014 (162)
- April 2014 (152)
- March 2014 (152)
- February 2014 (219)
- January 2014 (147)
- December 2013 (88)
- November 2013 (176)
- October 2013 (220)
- September 2013 (232)
- August 2013 (257)
- July 2013 (320)
- June 2013 (223)
- May 2013 (228)
- April 2013 (233)
- March 2013 (309)
- February 2013 (224)
- January 2013 (425)
- December 2012 (246)
- November 2012 (301)
- October 2012 (200)
- September 2012 (116)
- August 2012 (229)
- July 2012 (302)
- June 2012 (203)
- May 2012 (253)
- April 2012 (172)
- March 2012 (245)
- February 2012 (347)
- January 2012 (302)
- December 2011 (191)
- November 2011 (246)
- October 2011 (251)
- September 2011 (230)
- August 2011 (197)
- July 2011 (254)
- June 2011 (374)
- May 2011 (310)
- April 2011 (310)
- March 2011 (422)
- February 2011 (227)
- January 2011 (365)
- December 2010 (239)
- November 2010 (322)
- October 2010 (295)
- September 2010 (192)
- August 2010 (295)
- July 2010 (296)
- June 2010 (292)
- May 2010 (299)
- April 2010 (359)
- March 2010 (399)
- February 2010 (448)
- January 2010 (467)
- December 2009 (478)
- November 2009 (699)
- October 2009 (609)
- September 2009 (450)
- August 2009 (465)
- July 2009 (582)
- June 2009 (470)
- May 2009 (513)
- April 2009 (609)
- March 2009 (684)
- February 2009 (389)
- January 2009 (356)
- December 2008 (589)
- November 2008 (480)
- October 2008 (508)
- September 2008 (604)
- August 2008 (582)
- July 2008 (522)
- June 2008 (444)
- May 2008 (424)
- April 2008 (453)
- March 2008 (515)
- February 2008 (560)
- January 2008 (619)
- December 2007 (405)
- November 2007 (471)
- October 2007 (392)
- September 2007 (337)
- August 2007 (568)
- July 2007 (584)
- June 2007 (496)
- May 2007 (623)
- April 2007 (542)
- March 2007 (765)
- February 2007 (669)
- January 2007 (602)
- December 2006 (469)
- November 2006 (498)
- October 2006 (598)
- September 2006 (572)
- August 2006 (668)
- July 2006 (692)
- June 2006 (695)
- May 2006 (609)
- April 2006 (497)
- March 2006 (695)
- February 2006 (541)
- January 2006 (544)
- December 2005 (368)
- November 2005 (529)
- October 2005 (565)
- September 2005 (526)
- August 2005 (493)
- July 2005 (409)
- June 2005 (570)
- May 2005 (363)
- April 2005 (464)
- March 2005 (419)
- February 2005 (600)
- January 2005 (636)
- December 2004 (633)
- November 2004 (597)
- October 2004 (460)
- September 2004 (495)
- August 2004 (450)
- July 2004 (552)
- June 2004 (491)
- May 2004 (355)
- April 2004 (362)
- March 2004 (486)
- February 2004 (375)
- January 2004 (285)
- December 2003 (377)
- November 2003 (452)
- October 2003 (217)
- September 2003 (291)
- August 2003 (186)
- July 2003 (226)
- June 2003 (218)
- May 2003 (334)
- April 2003 (256)
- March 2003 (276)
- February 2003 (228)
- January 2003 (190)
- December 2002 (192)
- November 2002 (365)
- October 2002 (280)
- September 2002 (179)
- August 2002 (117)
- July 2002 (203)
- June 2002 (229)
- May 2002 (248)
- April 2002 (282)
- March 2002 (228)
- February 2002 (252)
- January 2002 (134)
- December 2001 (146)
- November 2001 (327)
- October 2001 (177)
- September 2001 (1)