126 discussions - 544 posts

  • Hi friends, what do you think about using Lucene for searching in the Semantic Web? I am trying to use Lucene to search documents with ontological annotations, but I have not found a better model to ...
    Jan 17, 2006 at 2:35 pm
    Jan 20, 2006 at 10:01 am
  • Hi, I want to find the intersection of two Hits objects. Please help me. Hits hits1 = Searcher.search(strQuery1); Hits hits2 = Searcher.search(strQuery2); Now hits1 contains record numbers 3,4 ...
    Jan 18, 2006 at 1:18 pm
    Jan 19, 2006 at 11:04 am
  • Hello, Is it possible to retrieve the top 'n' most often appearing words within a search criteria? I've seen the High Frequency Terms code in the sandbox but it works across the whole index. To put ...
    Chris Brown
    Jan 9, 2006 at 4:28 pm
    Jan 11, 2006 at 6:30 pm
  • I would like to do queries that are negative. I mean a query with only negative terms and phrases. For example, retrieve all documents that do not contain the term "apple". For now, I have a limited ...
    Beady Geraghty
    Jan 6, 2006 at 5:05 pm
    Jan 7, 2006 at 5:46 pm
  • I'm throwing myself at the mercy of the lucene community, I'm a bit brain dead today after looking after a screaming 3 month old baby for 4 hours last night... We have a 'title' field indexed as ...
    Paul Smith
    Jan 5, 2006 at 10:09 pm
    Jan 6, 2006 at 12:11 pm
  • I'm having some trouble coming up with a good search strategy for geographical data. e.g., given: [1] city: London, United Kingdom [2] city: London, Ontario, Canada [3] city: Ontario, California, ...
    Colin Young
    Jan 27, 2006 at 4:09 pm
    Feb 11, 2006 at 9:47 am
  • Hey, I have a bit of a complex problem: I need to group results received in a result set. For example, my result set returns 10,000 results, there are about 10 fields in each result document, and I need to ...
    Zzzzz shalev
    Jan 29, 2006 at 12:56 pm
    Jan 31, 2006 at 9:26 am
  • I'm having a problem with keyword fields and how they're treated by QueryParser. At indexing time, I index my documents, as follows: Content - tokenized, indexed field (the default field) DocType - ...
    Dmitry Goldenberg
    Jan 24, 2006 at 11:47 pm
    Feb 17, 2006 at 9:55 am
  • Does Lucene use a BTree kind of structure for storing the index (at least in memory)? Or is it just a list? Based on the file format in the index directory (wherein the terms are ...
    Shailesh kumar
    Jan 11, 2006 at 12:23 pm
    Jan 13, 2006 at 12:03 am
  • I'm using the .net port of highlighter (1.5) and I notice it doesn't highlight range or prefix queries.. Is this consistent with the java version? Only I note my standard reference of ...
    Gwyn Carwardine
    Jan 24, 2006 at 9:32 pm
    Jan 26, 2006 at 1:53 pm
  • Is there an efficient way to determine if two or more terms frequently appear next to each other sequence? For a query like: a b c one or more of the following suggestions could be generated: "a b c" ...
    Eric Jain
    Jan 10, 2006 at 11:10 pm
    Jan 12, 2006 at 8:11 pm
  • Hi, Does anyone know if it is possible to show related searches with lucene, for example if someone searched for "car insurance" you could bring back the results and related searches like these ...
    Leon Chaddock
    Jan 30, 2006 at 12:05 pm
    Feb 1, 2006 at 11:05 am
  • Hello, I have a problem with data I try to index with Lucene. I browse a directory and index text from different types of files through parsers. For text files, data could be in different languages so ...
    Jan 26, 2006 at 8:56 am
    Jan 28, 2006 at 4:20 pm
  • I am attempting to prune an index by getting each document in turn and then checking/deleting it: IndexReader ir = IndexReader.open(path); for(int i=0;i<ir.numDocs();i++) { Document doc = ...
    Chun Wei Ho
    Jan 26, 2006 at 8:15 am
    Jan 27, 2006 at 8:24 am
  • I am periodically getting "Too many open files" error when searching. Currently there are over 500 files in my Lucene directory. I am attempting to run optimize( ) to reduce the number of files. ...
    Steve Rajavuori
    Jan 23, 2006 at 5:18 pm
    Jan 23, 2006 at 7:16 pm
  • The similarity measure is term based, tf*idf weighted in its simple form. Further enhancement would be a "weighting" of nodes e.g. based on information content (see e.g. Rodriguez, M.A. & Egenhofer, ...
    Mathias Lux
    Jan 19, 2006 at 12:36 pm
    Jan 19, 2006 at 5:15 pm
  • Folks, I'm a Lucene newbie, and I've been searching awhile today to answer this question. Googled, read Lucene FAQ, looked at Javadoc for Document and Hits, etc. How would you implement "snippets" ...
    Dan Frankowski
    Jan 4, 2006 at 11:04 pm
    Jan 8, 2006 at 5:38 pm
  • Hello, I have a couple of instances of Lucene. I just altered one implementation and now it's not keeping a segments file. While indexing occurs, there is a segments file, but once it's done, there isn't. ...
    John Powers
    Jan 27, 2006 at 3:10 am
    Jan 30, 2006 at 6:44 am
  • Hi, Apologies if this question has being asked before on this list. I am working on an application with a Lucene index whose performance (response time for a query) has started degrading as its size ...
    Ori Schnaps
    Jan 24, 2006 at 12:25 am
    Jan 25, 2006 at 7:31 pm
  • Two queries about ranges: 1. field:[a TO z] does not return the same as field:[z TO a] I think it should. The standard QueryParser or even the range query should ascertain the lowest and highest and ...
    Gwyn Carwardine
    Jan 23, 2006 at 3:43 pm
    Jan 25, 2006 at 12:00 pm
  • Hi friends, I have a very basic question: 1] Why do we use Lucene for searching a database like Oracle / Sybase? 2] For that, we first have to convert all records one by one into strings, then build the Lucene ...
    Vikas Khengare
    Jan 17, 2006 at 10:52 am
    Jan 20, 2006 at 5:43 am
  • Hi, I am currently looking for a way to limit the amount of Hits which are returned by a Query. What I am doing is following: Searcher s = ...; Query q = QueryParser.parse("...", "...", new ...
    Daniel Pfeifer
    Jan 19, 2006 at 1:00 pm
    Jan 19, 2006 at 8:34 pm
  • Hi, I have a problem using Lucene. I wrote a SynonymFilter which can add synonyms from WordNet. Meanwhile, I used the SnowballFilter for term stemming. However, I got a problem when ...
    Jan 16, 2006 at 8:54 am
    Jan 17, 2006 at 2:38 pm
  • Hi everybody, Well I will explain you my problem: I am indexing ".txt" files and basically I split each file in paragraphs, I mean, i create a Document for each file and within this Document I add ...
    Jairo Sánchez Menéndez
    Jan 29, 2006 at 6:18 pm
    Jan 30, 2006 at 2:53 pm
  • Hi, I want to know how Lucene normalizes the score. I see the Hits class has a function to get each document's score, but I don't know how Lucene calculates the normalized score, and in the "Lucene ...
    Xing jiang
    Jan 27, 2006 at 7:16 am
    Jan 28, 2006 at 6:04 am
  • Hi, I am trying to find a way to create scores with a custom formula based on the initial score from Lucene and field values from each document, e.g. for each document: finalScore = searchScore * ...
    Nick Vincent
    Jan 24, 2006 at 5:42 pm
    Jan 24, 2006 at 11:14 pm
  • I would like to store large source documents ( 10MB) in the index in their original form, i.e. as text for text documents or as byte[] for binary documents. I have no difficulty adding the source ...
    George Washington
    Jan 20, 2006 at 4:52 am
    Jan 22, 2006 at 6:42 am
  • Hello dear Lucene users! Is there an easy way to check whether an index is optimized or not? Best regards, Max
    Maxim Patramanskij
    Jan 11, 2006 at 1:25 pm
    Jan 12, 2006 at 10:17 am
  • Lucene seems to prefer matches in shorter documents. Is it possible to influence the scoring mechanism to have matches in shorter fields score higher instead? For example, a query for "europe" should ...
    Eric Jain
    Jan 9, 2006 at 9:35 am
    Jan 10, 2006 at 11:06 pm
  • Question: Run attached java class and see the also attached when I ran it. I assumed the second writer should get an IOException in getWriter (in first TESTCASE). However this does not happen! This ...
    Dick de Jong
    Jan 8, 2006 at 4:54 am
    Jan 10, 2006 at 10:05 am
  • Hi, I want to update a document in the lucene index. As mentioned in the documentation, I tried to delete the document using IndexReader.delete method. But even after I delete the document, I am able ...
    Harini Raghavan
    Jan 8, 2006 at 4:11 pm
    Jan 9, 2006 at 3:22 pm
  • Hello, I found in Lucenebook that searching always reads the first 100 documents. I took a look at the Java source code and found that in Hits.java. I cannot find any easy way to forbid this ...
    Leos Literak
    Jan 7, 2006 at 3:11 pm
    Jan 9, 2006 at 8:36 am
  • Hi all. I am a newbie to Lucene. Does Lucene provide any way to do indexing, searching and deleting simultaneously? I hope we could do searching and indexing, which means there can be multiple ...
    K.A.Hussain Ali
    Jan 5, 2006 at 3:00 pm
    Jan 5, 2006 at 6:36 pm
  • Hi, we use Lucene without any problems even for German text, but with Chinese text nothing is found. What is the best way to index and search Chinese text? Zsolt ...
    Jan 28, 2006 at 8:27 am
    Jan 31, 2006 at 11:54 am
  • Hi, does anyone know if there is a SoundEx analyzer implementation available for Lucene? Thanks. Steve
    Steven Pannell
    Jan 18, 2006 at 8:27 am
    Jan 26, 2006 at 2:02 pm
  • I have ~5 million documents that are in categories and subcategories. Let us say that my query is for search terms in one top-level category and it returns a large amount of documents and I want to ...
    Mike Austin
    Jan 25, 2006 at 6:58 pm
    Jan 26, 2006 at 8:28 am
  • Hi A database is used as our primary data store. Our lucene index is then created and updated from this database. We store the value of the database primary key in the lucene index as we need to be ...
    Allan Dewar
    Jan 23, 2006 at 3:48 pm
    Jan 24, 2006 at 11:02 am
  • Hi, there, In "Lucene in action", it mentions in Section 3.2.3 "reading indexes into memory" that, "...RAMDirectory's constructor can be used to read a file system-based index into memory, allowing ...
    Kan Deng
    Jan 12, 2006 at 1:37 am
    Jan 12, 2006 at 7:34 pm
  • So far I've succeeded in grabbing the latest source (and I can build with ant), but now I'm trying to open it in eclipse by creating a new "Java Project from Existing And Buildfile," but when I ...
    Colin Young
    Jan 5, 2006 at 3:08 am
    Jan 5, 2006 at 8:34 pm
  • hello all - i have a problem with a SpanNearQuery returning incorrect (false positive) results. I am creating the context of a field using tokens which have position increment set to either 1 or 0. ...
    Marc Hadfield
    Jan 5, 2006 at 2:39 am
    Jan 5, 2006 at 6:46 pm
  • Hi, I would like to associate information (or labels) with each word or a range of words in a document. Information such as this word is a noun, that word is a verb, this period marks the end of a ...
    Dave Kor
    Jan 4, 2006 at 6:34 am
    Jan 4, 2006 at 2:06 pm
  • Normally I wouldn't post this here, but I haven't been able to find any info about how I would go about downloading the latest source from the SVN repository. I've got a bit of experience with CVS, ...
    Colin Young
    Jan 4, 2006 at 3:45 am
    Jan 4, 2006 at 12:27 pm
  • Hi, I'm currently using the GermanStemmer and it works well. However, today I've found two words which get stemmed to the same stem. "Suche" and "Sucht" both get stemmed to the same "such" it ...
    Markus Fischer
    Jan 31, 2006 at 12:50 pm
    Jan 31, 2006 at 9:33 pm
  • I am curious what the difference would be between searching for a number versus a character. I have a large index consisting of a few fields (So index would look something like: " 123123123 my ...
    Aigner, Thomas
    Jan 30, 2006 at 9:46 pm
    Jan 31, 2006 at 9:29 pm
  • Hi, I'm trying to figure out a way to locate tokens which include special characters. The actual text in the file being indexed is something like "function() { statement1; statement2; }" The query ...
    Dmitry Goldenberg
    Jan 27, 2006 at 10:10 pm
    Jan 30, 2006 at 4:59 pm
  • Hi, I'm trying to delete duplicate documents from my index. The unique identifier is the document's URL (aka field "url"). My initial thought of how to accomplish this is to open the index via a reader ...
    Jan 28, 2006 at 5:00 pm
    Jan 30, 2006 at 12:01 pm
  • Hi, Got more questions regarding Lucene and this time it's about performance ;-) We currently are using RAMDirectories to read our Indexes. This has now become a problem since our index has grown to ...
    Daniel Pfeifer
    Jan 26, 2006 at 3:21 pm
    Jan 27, 2006 at 10:43 am
  • Today I've been alerted by one of my colleagues that our Lucene-based indexing solution no longer refreshes the searchers and thus we never get any new indexed documents. Since I didn't find anything ...
    Daniel Pfeifer
    Jan 24, 2006 at 1:52 pm
    Jan 27, 2006 at 6:46 am
  • Hi, Please forgive me if this comes across as being naïve however Ive bashed my head against it for a while and can’t come up with a solution. Overview: I have the following basic document structure: ...
    Ashley Rajaratnam
    Jan 20, 2006 at 3:09 pm
    Jan 20, 2006 at 10:40 pm
  • Hello lucene people! First of all, i would like to thank all of community participants ( developers, users, Erik and Otis for "Lucene in Action" book ) for their great work. As far as i understand ...
    Aleksey Serba
    Jan 20, 2006 at 1:50 pm
    Jan 20, 2006 at 6:32 pm
Group Overview
group: java-user @

126 users for January 2006

  • Erik Hatcher: 41 posts
  • Chris Hostetter: 39 posts
  • Yonik Seeley: 32 posts
  • Xing jiang: 18 posts
  • Paul Elschot: 17 posts
  • Otis Gospodnetic: 14 posts
  • Peter Keegan: 13 posts
  • Colin Young: 11 posts
  • John Powers: 11 posts
  • Klaus: 11 posts
  • Koji Sekiguchi: 11 posts
  • Kan Deng: 9 posts
  • Mike Streeton: 9 posts
  • Daniel Naber: 8 posts
  • Daniel Pfeifer: 8 posts
  • Doug Cutting: 8 posts
  • Grant Ingersoll: 8 posts
  • Gwyn Carwardine: 8 posts
  • Harini Raghavan: 8 posts
  • Ravi: 8 posts