164 discussions - 695 posts

  • I was thinking it could be nice if the query parser handled aliases by passing a Map to the parser. The data could be compiled from the index. "name", "nam", "na" and "n" all trigger on "name", ...
    Karl wettin
    Jun 4, 2006 at 9:58 pm
    Jun 9, 2006 at 8:33 am
  • Hello, I have a question about the IndexSearcher(). I have a Servlet that has a searchDocument(String theQuery) method. This method instantiates a new IndexSearcher at each query: ...
    Fabrice Robini
    Jun 27, 2006 at 8:25 am
    Jun 28, 2006 at 12:33 pm
  • Does everyone recommend getting this book? I'm just starting out with Lucene and like to have a book beside me as well as the web / this mailing list, but the book looks quite old now, has a 1-2 ...
    Jun 6, 2006 at 10:01 am
    Jun 7, 2006 at 1:19 am
  • Can someone please suggest a HTML text extraction library? In the Lucene book, it recommends Tidy. Seems jtidy is not really being maintained. Otis, what do you guys use at Simpy? Thanks -john
    John Wang
    Jun 21, 2006 at 5:40 am
    Jun 30, 2006 at 5:47 am
  • Hi, I have created an index of 47 million documents. I have 1.28GB RAM. When I search this index, it takes 25 sec on average. Is there a way so that I can get results in part of a ...
    Jun 26, 2006 at 1:58 pm
    Jun 30, 2006 at 3:06 am
  • How can I create a QueryScorer in Lucene 2.0? When I create a QueryScorer using the following code, BooleanQuery booleanQuery = new BooleanQuery(); booleanQuery.add(q1,BooleanClause.Occur.SHOULD); ...
    Ferdinand Chan
    Jun 15, 2006 at 9:26 am
    Jun 15, 2006 at 5:47 pm
  • Hi again, After a lot of debugging and some API doc reading I have come to the conclusion that the static encodeNorm method of the Similarity class will encode my boost value into a single byte ...
    Marcus Falck
    Jun 20, 2006 at 10:03 am
    Jun 22, 2006 at 2:20 pm
  • Hi all, I had five different indexes: 1 having 15469008 documents 2 having 7734504 documents 3 having 7734504 documents 4 having 7734504 documents 5 having 7734504 documents Which sums to 46407024. ...
    Jun 20, 2006 at 12:07 pm
    Jun 22, 2006 at 4:17 pm
  • We wrote ours for NetSearch to handle this specific issue. I suggest you create a holder class to hold the IndexReader and IndexSearcher, this can close them in the finalizer. Clients keep the holder ...
    Mike Streeton
    Jun 6, 2006 at 8:02 am
    Jun 6, 2006 at 11:45 am
  • Hi, my problem is that I am using a MySQL DB that contains one table, and I want to index each row in the table and then search it. Please reply: how can this be done? amit kumar
    Jun 29, 2006 at 9:09 am
    Jun 30, 2006 at 3:31 pm
  • When I create an index with the class IndexModifier in Lucene 1.9.1, there is a lock file created in a temp folder. My question is: is it possible to disable this option? If yes, how do I proceed?
    Jun 29, 2006 at 6:01 pm
    Jun 30, 2006 at 2:12 pm
  • Hi All, Has anyone else out there come across the shortcomings of the new COMMIT_LOCK_TIMEOUT in regards to searching on an actively updated Index? It used to be a settable system property and ...
    Michael Duval
    Jun 9, 2006 at 4:04 pm
    Jun 13, 2006 at 12:24 pm
  • Hi, I've been handed a legacy index containing Documents with two Fields; one is a file ID, the other is contents of the file. The contents field was added using UnStored. Now, we want to add fields. ...
    Bob Arens
    Jun 9, 2006 at 3:43 am
    Jun 9, 2006 at 11:30 pm
  • Hi Folks, I'm working on a project that is going to have a free-text search mechanism. The project is completely based on open source technologies, such as MySQL and PHP. I'm reading about Lucene and ...
    Alexander MASHTAKOV
    Jun 6, 2006 at 1:26 pm
    Jun 6, 2006 at 5:12 pm
  • Problems with Lucene when executing from the Web with JSF. I do not understand why; perhaps Lucene cannot be used in Web applications? This is the instruction that gives the error: IndexSearcher oIndexSearcher ...
    Alberto Marqués
    Jun 1, 2006 at 7:34 am
    Jun 2, 2006 at 12:05 pm
  • Hi, I use Lucene to index a SQL table which contains three fields: an index field, the text to search in, and another field. When adding a Lucene document I let Lucene index the search field and also ...
    Dominik Bruhn
    Jun 30, 2006 at 11:11 pm
    Jul 8, 2006 at 3:06 pm
  • Hi, Do you know another library for indexing RDF? Thanks a lot for your help Teresa -----Original Message----- From: Suba Suresh Sent: Tuesday, June 27, 2006 17:38 To: ...
    Jun 27, 2006 at 4:00 pm
    Jun 30, 2006 at 12:03 pm
  • Hi, I am trying to merge in an index from a different node and probably a different platform. I tried some simple cases by copying an index created on a Windows machine and bringing it to a Linux server. I ...
    Beady Geraghty
    Jun 27, 2006 at 3:23 pm
    Jun 27, 2006 at 7:23 pm
  • Hi, I've looked through the archives and it looks like this question has been asked in one form or another a few times, but without a satisfactory solution. I am trying to get the most frequently ...
    Nader Akhnoukh
    Jun 21, 2006 at 11:33 pm
    Jun 22, 2006 at 7:58 pm
  • I am working on indexing emails and want to have a "to" field. I am currently putting all the emails on one line separated with spaces... example: michael@foo.bar john@boo.com jane@bar.com Then I index ...
    Michael J. Prichard
    Jun 16, 2006 at 7:20 pm
    Jun 19, 2006 at 8:59 pm
  • Hi, I am trying to index a huge collection of documents - several hundreds of gigabytes. Needless to say, I'm trying to squeeze every ounce of performance from my machine, to get this indexing done ...
    Nadav Har'El
    Jun 12, 2006 at 12:23 pm
    Jun 12, 2006 at 7:53 pm
  • Hi All, Will you please give me some clue for searching on more than one field of a document. My document has six fields and I want to search on three fields. Presently I am able to search on only TITLE ...
    Amaresh Kumar Yadav
    Jun 6, 2006 at 9:09 am
    Jun 6, 2006 at 3:21 pm
  • Hi, how can I limit the result count of a query in order to save time? I searched the web but didn't find a solution. Thanks Dominik ...
    Dominik Bruhn
    Jun 29, 2006 at 5:30 pm
    Jul 5, 2006 at 6:07 pm
  • Hi, I started using the EnglishStemmer and noticed that only the stem gets added to the index. I would like to be able to add both to give me a stem search and an exact search capability. My first ...
    Robert Haycock
    Jun 28, 2006 at 12:52 pm
    Jun 28, 2006 at 6:34 pm
  • Hello. I have encountered weird error with CO of Lucene, when I try to build PyLucene: thorn@thornserver /cygdrive/d/workshop/PyLucene $ make svn co -r 417135 ...
    Yura Smolsky
    Jun 27, 2006 at 6:04 pm
    Jun 27, 2006 at 8:49 pm
  • Hi, I have a small query here. How do I do an exact match on the value of a field and also assert that the value must match the entire content of that field? For E.g., I want that only the documents ...
    Shivani Sawhney
    Jun 12, 2006 at 4:26 am
    Jun 12, 2006 at 1:42 pm
  • Hi, I am new to Lucene and I am reading the book "Lucene In Action". As we know, there are two kinds of directory to hold an index: one is File System and the other is RAM. There is a ...
    Flik Shen
    Jun 11, 2006 at 9:38 am
    Jun 12, 2006 at 6:58 am
  • I have a very large corpus that I am storing in many indexes: 200 indexes * ~500MB each, with 10^6 very tiny documents in each. (I could look into optimizing this later, of course, but seems ok for ...
    Benjamin Stein
    Jun 7, 2006 at 7:44 pm
    Jun 12, 2006 at 4:31 am
  • Hi, I am trying to implement an alternative scoring mechanism in Lucene. A query of multiple terms is represented as a BooleanQuery with one or more Occur.SHOULD clauses. The scoring for a document ...
    Trieschnigg, R.B. (Dolf)
    Jun 7, 2006 at 1:46 pm
    Jun 10, 2006 at 8:50 pm
  • Hi...reasonably experienced web search programmer but total Lucene newbie here. After poking through Lucene for a while, I still haven't figured out a decent way to tweak the scoring based on ...
    Scott Davies
    Jun 2, 2006 at 8:15 pm
    Jun 3, 2006 at 8:06 am
  • I want to search for text in the "title" field only. How should I specify it? Regards.. Amaresh -----Original Message----- From: Alexey Sorokin Sent: Wednesday, May 31, 2006 4:21 PM To: ...
    Amaresh Kumar Yadav
    Jun 1, 2006 at 3:56 am
    Jun 1, 2006 at 11:54 pm
  • Is there a way to preload the index into memory when the process starts? Basically I want to warm up the index before processing user queries. What are some recommended ways to do this? Thanks.
    Charles Mi
    Jun 1, 2006 at 3:03 am
    Jun 1, 2006 at 5:17 am
  • As an example - I want to search for the word "interested" without finding docs that have "not" immediately preceding it. I couldn't see anything in SpanQuerys that would help and you can't construct ...
    Mark harwood
    Jun 30, 2006 at 3:47 pm
    Jul 1, 2006 at 8:38 am
  • Hi all, I'm a new user of Lucene and wanted to know if Lucene can search multiple data directories that do not share a parent? e.g. I have a directory named "father" and I have ...
    Shaghayegh Sahebie
    Jun 20, 2006 at 12:22 pm
    Jun 22, 2006 at 12:57 pm
  • Hello, I would like to write an application to browse around and search the log files of Linux machines, like www.splunk.org does. Would Lucene be the right DB to store such text information? Because ...
    Andreas Moroder
    Jun 18, 2006 at 5:00 pm
    Jun 20, 2006 at 11:39 pm
  • I've been playing around with Lucene for a while now. I'm pretty comfortable with creating an index and searching against it. Up until now, I've been using the LuceneIndexAccessor package contributed ...
    Van Nguyen
    Jun 12, 2006 at 10:55 pm
    Jun 14, 2006 at 3:46 pm
  • We keep getting JVM crashes on 1.4.3. I found in the archive that setting a JVM parameter solved the problem for a few users. We've tried that and it has not worked. Here are our JVM parameters: ...
    Ross Rankin
    Jun 13, 2006 at 7:06 pm
    Jun 13, 2006 at 10:02 pm
  • I'm implementing a spellchecker in my search and have a question. After creating the index and spellchecker index, I pass in the word "ducted tape" to search (I am expecting "duct tape" back). I've ...
    Van Nguyen
    Jun 7, 2006 at 12:45 am
    Jun 13, 2006 at 7:57 pm
  • Hi All! I have a problem... When I index text documents in English, there is no problem, but when I index Spanish text documents (and they're big), a lot of information from the document doesn't become ...
    Manu mohedano
    Jun 9, 2006 at 7:32 pm
    Jun 10, 2006 at 7:01 am
  • Hi all, I am using the QueryParser with a StandardAnalyzer. I would like to avoid or auto-correct anything that would lead to a ParseException. For example, I don't think you can get a parse ...
    Chris Nokleberg
    Jun 6, 2006 at 8:30 pm
    Jun 7, 2006 at 4:08 am
  • Hi, I indexed some XML files using Lucene. When I open up the index using Luke, I can see that all the fields are stored correctly in the index. But, when I try to grab the fields from the hits, ...
    Seeta Somagani
    Jun 30, 2006 at 10:36 pm
    Jul 5, 2006 at 6:16 pm
  • Hi, I got a lucene based host application that retrieves content for indexing from fetcher applications. Since I get fresh content all the time I wanted to have full control over the disc write ...
    Marcus Falck
    Jun 28, 2006 at 9:46 am
    Jun 28, 2006 at 9:07 pm
  • Hi, I'm evaluating Lucene right now to use as a base for one open source project. I found some _indexing_ benchmarks on the lucene website (http://lucene.apache.org/java/docs/benchmarks.html), but, ...
    Vladimir Olenin
    Jun 26, 2006 at 2:57 pm
    Jun 28, 2006 at 4:16 am
  • It means that picking high values for both maxBufferedDocs and mergeFactor will improve your indexing performance. But if they are too high, you will get an OutOfMemoryError. And if you set ...
    Flik Shen
    Jun 12, 2006 at 5:56 am
    Jun 27, 2006 at 4:18 pm
  • Hello everyone, If you have 15 seconds to spare, please let us (Lucene developers) know which version of Java you are using with Lucene: 1.4 or 1.5 All it takes is 1 click on one of the two choices: ...
    Otis Gospodnetic
    Jun 16, 2006 at 3:48 pm
    Jun 17, 2006 at 6:47 pm
  • Hi, I have a very simple example. An IndexWriter (Lucene 1.9.0) with CJKAnalyzer (latest version as of today). A Chinese friend of mine has given me a sentence and a word that appears in that ...
    Robert Haycock
    Jun 15, 2006 at 11:30 am
    Jun 17, 2006 at 4:58 am
  • Hello, We have an application dealing with historical books. The books have metadata consisting of event dates, and person names among others. The FullText, Person and Date indexes were split until ...
    Mile Rosu
    Jun 12, 2006 at 9:23 am
    Jun 14, 2006 at 8:50 am
  • Hi, I am having a problem getting the count of distinct values of a field. The reason for getting this value is that each of the documents in the index belongs to one predefined class and I want to get ...
    Vipin sharma
    Jun 12, 2006 at 12:06 pm
    Jun 14, 2006 at 3:33 am
  • Hi, I am searching two fields at a time and using MultiFieldQueryParser to parse my query (method described in In Action book). I noticed that I am getting duplicate results. The document exists only ...
    Varun sood
    Jun 5, 2006 at 3:14 pm
    Jun 7, 2006 at 9:05 pm
  • Hi, it seems I am awestruck. My index is working fine. Now I have got the WordNet synonym index. How do I make use of this index to get synonym support in search results? Do I have to merge these 2 ...
    Ramesh Salla
    Jun 29, 2006 at 1:55 pm
    Jun 29, 2006 at 2:30 pm
Group Overview
group: java-user @

183 users for June 2006

  • Chris Hostetter: 62 posts
  • Vipin sharma: 29 posts
  • Erik Hatcher: 28 posts
  • Rob Staveley (Tom): 28 posts
  • Otis Gospodnetic: 27 posts
  • Erick Erickson: 23 posts
  • Karl wettin: 22 posts
  • Yonik Seeley: 18 posts
  • Aleksander M. Stensby: 11 posts
  • Paul Elschot: 10 posts
  • Grant Ingersoll: 9 posts
  • James Pine: 8 posts
  • Mile Rosu: 8 posts
  • Bruce: 7 posts
  • Flik Shen: 7 posts
  • Marcus Falck: 7 posts
  • Robert Haycock: 7 posts
  • Adam Saeed: 6 posts
  • Daniel Noll: 6 posts
  • Digby: 6 posts