82 discussions - 377 posts

  • I'm confused about something - what's the point of creating a document for every sentence? -----Original Message----- From: Jochen Frey Sent: Wednesday, December 17, 2003 4:17 PM To: 'Lucene Users ...
    Dan Quaroni
    Dec 17, 2003 at 9:19 pm
    Dec 19, 2003 at 5:00 pm
  • I am indexing a group of items and one field, id, is unique. When the user clicks on a result I want just that one result to show. I index and search using SimpleAnalyzer. Query query_es = ...
    Pleasant, Tracy
    Dec 4, 2003 at 9:53 pm
    Dec 5, 2003 at 10:45 pm
  • Hi, I'm having problems understanding the query parser's handling of AND and OR if there's more than one operator. E.g. a OR b AND c gives the same number of hits as b AND c (only scores are different) ...
    Morus Walter
    Dec 9, 2003 at 9:58 am
    Jan 3, 2004 at 5:18 pm
  • A quick question. Is there any way to disable the - and + modifiers in the QueryParser? I'm trying to use Lucene to provide indexing of COBOL source code, and allow me to highlight matches when the ...
    Iain Young
    Dec 15, 2003 at 5:18 pm
    Dec 16, 2003 at 6:15 pm
  • Hi there, is Damian's patch in the CVS or the latest Lucene release? Does this patch allow receiving a term vector of a document? Thanks! Stefan -- open technology: www.media-style.com open source: ...
    Stefan Groschupf
    Dec 8, 2003 at 3:51 pm
    Dec 8, 2003 at 10:53 pm
  • Hi, I'm trying to use IndexSearcher.explain(Query query, int doc) and am getting a NPE. If I remove the "explain" the search works fine. I poked a little at the TermQuery.java code, but I can't ...
    Dror Matalon
    Dec 4, 2003 at 1:52 am
    Dec 4, 2003 at 8:33 pm
  • SearchBlox is a J2EE search component that enables you to add search functionality to your applications, intranets or portals in a matter of minutes. SearchBlox uses Lucene Search API and features ...
    Robert Selvaraj
    Dec 2, 2003 at 2:42 pm
    Dec 4, 2003 at 12:26 am
  • Hello, I am not understanding the syntax of queries. I search with this string: title: (importar) ^5.0 OR title: (arquivos) returns 6 hits. and with this: title: (arquivos) OR title: (importar) ^5.0 27 ...
    Ernesto De Santis
    Dec 12, 2003 at 9:12 pm
    Dec 20, 2003 at 12:54 am
  • Hi, has anyone tried to implement a counter using Lucene? We currently have a search implemented, searching multiple indexes and returning the results in a Vector of hits objects. In order to get our ...
    Shannon March
    Dec 12, 2003 at 1:30 pm
    Dec 12, 2003 at 5:05 pm
  • is there a limit to the size of an UnIndexed field? i changed my code to increase the maximum string size per document from 300 bytes to 10,000 and although the index run completes without errors, i ...
    Chong, Herb
    Dec 8, 2003 at 4:14 pm
    Dec 11, 2003 at 10:10 pm
  • Hello group, from the very inspiring conversations with Karsten I know that Lucene is based on a Vector Space Model. I am just wondering if it would be possible to turn this into a probabilistic ...
    Dec 3, 2003 at 2:13 pm
    Dec 5, 2003 at 8:38 pm
  • Would there be any performance improvement in query throughput and latency if locking were disabled for read-only indexes? It doesn't seem like it makes sense to worry about locking if you know for ...
    Kevin A. Burton
    Dec 1, 2003 at 9:39 pm
    Dec 2, 2003 at 12:21 pm
  • As a spinoff, I was wondering if anyone has been happy with indexing and searching Word docs. What about reading the contents? Any problems? -----Original Message----- From: Ryan Ackley Sent: Friday, ...
    Pleasant, Tracy
    Dec 15, 2003 at 1:58 pm
    Dec 15, 2003 at 2:49 pm
  • Hi, I have seen the example SAX based XML processing in the Lucene sandbox (thanks to the authors for contributing!) and have successfully adapted this approach for my application. The one thing that ...
    Grant Ingersoll
    Dec 5, 2003 at 2:48 pm
    Dec 7, 2003 at 11:16 pm
  • Hi there, I have not yet got any response about my problem. While debugging into the depth of Lucene (really hard to read deep inside) I discovered that it is possible to disable the Locks using a ...
    Hohwiller, Joerg
    Dec 16, 2003 at 10:37 am
    Dec 16, 2003 at 3:20 pm
  • Hi, I notice something really strange. I just tried the "document to query" thing with term frequencies and term boosting based on the term frequency. The code itself takes maybe 3 minutes, but I ...
    Stefan Groschupf
    Dec 8, 2003 at 7:18 pm
    Dec 11, 2003 at 12:26 am
  • This seems like an almost reasonable request, and easy enough to implement in FSDirectory.create. Lucene has no business deleting other files in that directory that it doesn't use either, although ...
    Erik Hatcher
    Dec 7, 2003 at 11:47 pm
    Dec 8, 2003 at 5:53 pm
  • The FAQ describes implementing a TokenFilter for applying aliases. I am having trouble accomplishing this. This is the code that I have so far for the next method within AliasFilter. After reading some ...
    Allen Atamer
    Dec 4, 2003 at 9:58 pm
    Dec 5, 2003 at 6:47 pm
  • Hi, I have just indexed a lot of news (nntp) postings. I now have an index for each topic (a topic can have many newsgroups) The index sizes are: 2.6G Current Affairs 2.4G Celebs 119M Recreation 3.0M ...
    Jt oob
    Dec 2, 2003 at 1:55 pm
    Dec 4, 2003 at 7:54 pm
  • Hello Lucene Users i use Lucene 1.3rc3 to index several thousand metadata records. these look as follows: <?xml version="1.0" encoding="utf-8"?> <oaidc:dc xmlns="http://purl.org/dc/elements/1.1/" ...
    Thomas Krämer
    Dec 22, 2003 at 12:40 pm
    Dec 29, 2003 at 4:27 pm
  • Hi Everybody, I wish to use a hierarchy of concepts provided by an ontology to refine or expand my query answer with Lucene. May I know if someone has tried it yet? Thanks, Gayo ...
    Gayo Diallo
    Dec 10, 2003 at 10:59 am
    Dec 18, 2003 at 6:38 pm
  • Hi, I'm attempting to optimize a fuzzy search on a big index with ~4.400.000 Documents (in Lucene's meaning) in 600.000 sub-categories (Simple Text.Keyword type a field). My purpose is to limit the ...
    Julien gerard
    Dec 10, 2003 at 3:07 pm
    Dec 10, 2003 at 5:13 pm
  • What about Spindle? Has anybody used it to crawl a JSP-based web site? Do I need to install listlib.jar to do so? I got error message "Jsp Translate:Unable to find setter method for attribue:class" ...
    Zhou, Oliver
    Dec 3, 2003 at 5:14 pm
    Dec 3, 2003 at 8:48 pm
  • Hi, is there a maximum of documents Hits provide or is it unlimited (means limited to heap size of VM)? If there is a maximum, what is the number? Ralf -- +++ GMX - the first address for mail, ...
    Dec 3, 2003 at 2:36 pm
    Dec 3, 2003 at 4:27 pm
  • --------------------------------------------------------------------- To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org For additional commands, e-mail: ...
    Iain Young
    Dec 1, 2003 at 3:45 pm
    Dec 2, 2003 at 9:50 am
  • Hi, I have been using Lucene to index a large directory, indexing HTML and text files. However during the indexing process the entire system stops, that is the IndexWriter no longer adds Document ...
    Niall Gallagher
    Dec 23, 2003 at 10:46 am
    Dec 23, 2003 at 2:11 pm
  • Has anyone thought about or used Lucene to build an indexed, searchable help system? Either Server or Application Based? -M. -- Mark Diggory Software Developer Harvard MIT Data Center ...
    Mark R. Diggory
    Dec 20, 2003 at 3:42 am
    Dec 22, 2003 at 7:45 pm
  • I've seen discussions about using the double metaphone algorithm with Lucene (basically: like soundex, used to find words that sound similar in English at least) but couldn't find an implementation, ...
    David Spencer
    Dec 19, 2003 at 7:52 pm
    Dec 21, 2003 at 10:13 pm
  • I'm implementing Lucene in our Content Management system. A plugin for every type of content fills the Document, so I have no control over the amount and names of the fields. Now I'm trying to do a ...
    Thijs Cadier
    Dec 18, 2003 at 10:41 am
    Dec 18, 2003 at 1:36 pm
  • Hello, I'm new to Lucene. I want users to be able to search text which is stored in a MySQL database. Is there any tutorial on how to implement this kind of search feature? Best regards, Stefan
    Stefan Trcko
    Dec 16, 2003 at 8:31 pm
    Dec 17, 2003 at 3:22 pm
  • http://sourceforge.net/projects/weblucene/ WebLucene: Lucene search engine XML interface, provided sax based indexing, indexing sequence based result sorting and xml output with highlight support. ...
    Che Dong
    Dec 16, 2003 at 5:33 pm
    Dec 17, 2003 at 1:37 am
  • Hello Lucene Users i need a document term matrix to initialize a neural network, that i want to use to integrate user feedback in the retrieval process. until now, i am using a slightly modified ...
    Thomas Krämer
    Dec 11, 2003 at 9:02 pm
    Dec 16, 2003 at 5:38 pm
  • Is there a way to tell if an index is currently optimized? I couldn't find a method in the API to check. Is there a way of telling if an index has been altered since it was last optimized by looking ...
    Jt oob
    Dec 4, 2003 at 3:04 pm
    Dec 5, 2003 at 6:18 pm
  • Hello all, i am using lucene to index simple metadata records, that consist of fields such as creator, description, identifier etc. with the apache commons digester i can read each record into a ...
    Thomas Krämer
    Dec 29, 2003 at 10:35 pm
    Dec 30, 2003 at 12:55 pm
  • I am trying to setup a Lucene installation on a Windows 2000 server. I can not get the IndexWriter to initialize properly. It fails out with an IOException error that says it could not delete backup. ...
    Alex Gadea
    Dec 18, 2003 at 5:23 am
    Dec 18, 2003 at 12:55 pm
  • Hi, I have tried to type the following at Windows command line at weblucene directory: ant build Everything seems to work fine except the following error: java.lang.InstantiationException: ...
    Tun Lin
    Dec 13, 2003 at 4:20 pm
    Dec 13, 2003 at 4:47 pm
  • Hi I am starting to get an error about a write.lock in lucene when creating an index in an empty directory. It used to work fine before but now it started to occur and as far as I know I didn't touch ...
    Aaron Galea
    Dec 5, 2003 at 10:48 pm
    Dec 11, 2003 at 12:08 pm
  • Hello, We use lucene to search menus, there are around 10000 items in index and sometimes I see error like this: (/tmp/index-menu is index directory) java.io.FileNotFoundException: ...
    Igor Semenko
    Dec 10, 2003 at 11:51 am
    Dec 11, 2003 at 8:05 am
  • Hello lucene-users. What is the better way to filter search results by date (which is one of the indexed fields): - use RangeQuery against date field as a required part of boolean query; - use ...
    Maxim Patramanskij
    Dec 4, 2003 at 3:16 pm
    Dec 4, 2003 at 4:34 pm
  • Hi I am indexing a document but for a strange reason the word "Mayo" is never indexed. The thing is that in this large document this term appears only once. However if i remove all text from this ...
    Aaron Galea
    Dec 4, 2003 at 11:32 am
    Dec 4, 2003 at 12:00 pm
  • Hi, I would highly appreciate it if the experts here (especially Karsten or Chong) look at my idea and tell me if this would be possible. Sorry, I have no idea about how to use a probabilistic ...
    Karsten Konrad
    Dec 3, 2003 at 9:51 pm
    Dec 4, 2003 at 11:45 am
  • Hi, I have tried the XMLIndexingDemo. It only supports indexing one xml file at a time and deletes the old one. Also, a customerInfo tag can have only 1 <name>. Is there an open source that supports 1 ...
    Tun Lin
    Dec 4, 2003 at 10:27 am
    Dec 4, 2003 at 11:05 am
  • Hi folks, another newbie question for you. I'm using Lucene to index huge chunks of source code, (cobol, jcl, c, java, text documents etc). In some of these languages (such as cobol) it is valid to ...
    Iain Young
    Dec 2, 2003 at 5:06 pm
    Dec 2, 2003 at 5:21 pm
  • Hello Ralf, According to your description, Lucene basically maps the boolean query into the vector space and measures the cosine similarity towards other documents in the vector space. If I ...
    Karsten Konrad
    Dec 1, 2003 at 2:19 pm
    Dec 1, 2003 at 8:29 pm
  • Hi, Do you have the install.txt for windows XP setup of the WebLucene? It seems that the install.txt is only for UNIX setup. Thanks. -----Original Message----- From: Che Dong Sent: Sunday, November ...
    Tun Lin
    Dec 1, 2003 at 3:35 am
    Dec 1, 2003 at 3:56 pm
  • Hi, I am just testing Lucene 1.3 RC with the Compound index option on. When I come to delete an existing document in the index to re-update a document, I get an unable to obtain lock error. ...
    Paul Williams
    Dec 30, 2003 at 5:07 pm
    Dec 30, 2003 at 5:37 pm
  • I'm attempting to make Lucene the document search solution for the Ariba application suite. I have identified functionality gaps in the areas of refinement, order by and range queries. Below I ...
    Geoffrey Peddle
    Dec 22, 2003 at 8:15 pm
    Dec 24, 2003 at 2:00 am
  • Hi all, I use this code Query query = QueryParser.parse(q, "Contenu", new Analyseur()); String larequet = query.toString(); System.out.println("la requête à traiter est: " + larequet); And I have as ...
    Gayo Diallo
    Dec 17, 2003 at 3:45 pm
    Dec 17, 2003 at 3:50 pm
  • I'm not having a problem. The question is whether I picked a reasonable set of parameters for what I'm doing. I have an application which receives messages. Each message averages around 4k bytes and ...
    Scott Smith
    Dec 12, 2003 at 9:25 pm
    Dec 12, 2003 at 9:48 pm
  • I'm looking for a fast way (execution wise) to get a list of unique values for a field called "partno" for all documents which have a given value for a field called "type". This is for adding values ...
    Karl Penney
    Dec 12, 2003 at 1:40 pm
    Dec 12, 2003 at 6:17 pm
Group Overview
group: java-user

88 users for December 2003

  • Erik Hatcher: 44 posts
  • Dror Matalon: 33 posts
  • Otis Gospodnetic: 24 posts
  • Doug Cutting: 18 posts
  • Tun Lin: 16 posts
  • Pleasant, Tracy: 14 posts
  • Ralph: 14 posts
  • Chong, Herb: 12 posts
  • Iain Young: 12 posts
  • Stefan Groschupf: 11 posts
  • Jochen Frey: 8 posts
  • Thomas Krämer: 8 posts
  • Morus Walter: 7 posts
  • Gregor Heinrich: 6 posts
  • Morus Walter: 6 posts
  • Jt oob: 5 posts
  • Karsten Konrad: 5 posts
  • Shannon March: 5 posts
  • Tatu Saloranta: 5 posts
  • Ype Kingma: 5 posts