124 discussions - 600 posts

  • Hi, When I run an optimize in our production environment, old index files are left in the directory and are not deleted. My understanding is that an optimize will create new index files and all existing ...
    Yahootintin 1247688
    Feb 4, 2005 at 1:36 am
    Feb 8, 2005 at 4:22 pm
  • Three HTML parsers (Lucene web application demo, CyberNeko HTML Parser, JTidy) are mentioned in Lucene FAQ 1.3.27. Which is the best? Can it filter tags that are auto-created by MS-word 'Save As HTML ...
    Jingkang Zhang
    Feb 1, 2005 at 9:14 am
    Feb 4, 2005 at 3:37 pm
  • I am trying to do some filtering and rearrangement of search results. Two possibilities that come to mind are iterating through the Hits or making a custom HitCollector. All documentation invariably warns about ...
    Feb 3, 2005 at 4:43 pm
    Feb 4, 2005 at 10:44 pm
  • I finally had some time to take Doug's advice and reburn our indexes with a larger TermInfosWriter.INDEX_INTERVAL value. The default is 128 but I increased it to 256 and then burned our indexes again ...
    Kevin A. Burton
    Feb 24, 2005 at 8:01 am
    Feb 26, 2005 at 12:24 am
  • What is single handedly the best way to improve search performance? I have an index in the 2G range stored on the local file system of the searcher. Under a load test of 5 simultaneous users my ...
    Michael Celona
    Feb 18, 2005 at 3:54 pm
    Feb 19, 2005 at 2:35 pm
  • First let me say - Awesome tool! Almost too easy to be true, but with that being said.... Hi, I have read several articles and postings that indicate that the Field.Keyword field should be searchable ...
    Mike Miller
    Feb 8, 2005 at 1:53 pm
    Feb 10, 2005 at 6:52 pm
  • Topic: Search performance with large numbers of indexes vs. one large index Hello, we are experiencing a performance problem when using large numbers of indexes. We have an application with about 6 ...
    Jochen Franke
    Feb 25, 2005 at 8:43 pm
    Mar 8, 2005 at 12:23 am
  • It's about time I actually did something real with Lucene.... :) I have been working with the Applied Research in Patacriticism group at the University of Virginia for a few months and finally ready ...
    Erik Hatcher
    Feb 18, 2005 at 7:48 pm
    Feb 23, 2005 at 3:37 am
  • Hello. I'm using Lucene for an application and I want to boost the title of my documents. For that I use the setBoost method that is applied on the title field. However when I look with Luke (1.6) I ...
    Claude Libois
    Feb 28, 2005 at 8:06 am
    Mar 7, 2005 at 3:43 pm
  • Is there a way to eliminate duplicate hits being returned from the index? Jerry Jalenak ...
    Jerry Jalenak
    Feb 1, 2005 at 2:02 pm
    Feb 1, 2005 at 5:30 pm
  • What's the full stack trace? Erik ...
    Erik Hatcher
    Feb 21, 2005 at 3:32 pm
    Feb 21, 2005 at 8:19 pm
  • Hi, I was rambling to some friends about an idea to build a cache-aware JDBC driver wrapper, to make it easier to keep a lucene index of a database up to date. They asked me a question that I have to ...
    Steven J. Owens
    Feb 18, 2005 at 9:19 pm
    Feb 24, 2005 at 8:11 am
  • I have created an Analyzer that I think should just be converting to lower case and add synonyms in the index (it is at the end of the email). The problem is, after running it I get one more result ...
    Luke Shannon
    Feb 18, 2005 at 10:40 pm
    Feb 22, 2005 at 8:58 am
  • Hi, I've read from various sources on the Internet that it is perfectly safe to simultaneously search a Lucene index that is being updated from another Thread, as long as all write access to the ...
    Paul Mellor
    Feb 16, 2005 at 10:21 am
    Feb 18, 2005 at 9:24 am
  • I'm building a lucene project for a client who uses php for their dynamic web pages. It would be possible to add servlets to their environment easily enough (they use apache) but I'd like to have ...
    Owen Densmore
    Feb 6, 2005 at 5:10 pm
    Feb 9, 2005 at 9:32 am
  • In my Clipper days I could build an index on English words using a technique that was called soundex. Searching in that index resulted in hits of words that sounded the same. From what I remember ...
    Aad Nales
    Feb 9, 2005 at 12:24 pm
    Feb 10, 2005 at 11:43 am
  • Many times I've written ad-hoc code that pulls in data from an RDBMS and builds a Lucene index. The use case is a typical database-driven dynamic website which would be a hassle to spider (say, due ...
    David Spencer
    Feb 7, 2005 at 8:07 am
    Feb 9, 2005 at 4:03 pm
  • Hello ALL, It might not be the right place for it but as we are talking about SCM, I have a quick question. First, I haven't used CVS/SVN on any project. I am a ClearCase/PVCS guy. I just would like ...
    Chakra Yadavalli
    Feb 3, 2005 at 12:50 am
    Feb 3, 2005 at 4:49 pm
  • I've been merrily cooking along, thinking I was replacing documents when I haven't. My logic is to go through a batch of documents, get a field called "reference" which is unique build a term from it ...
    Jim Lynch
    Feb 1, 2005 at 8:25 pm
    Feb 2, 2005 at 3:24 pm
  • Hi: Is there a way to find out, given a hit from a search, which fields contributed to the hit? e.g. If I search for: contents1="brown fox" OR contents2="black bear" can the document found ...
    John Wang
    Feb 16, 2005 at 10:40 pm
    Feb 23, 2005 at 7:24 pm
  • Hi guys Apologies........... I am getting this error on ' Every FIRST SEARCH after Startup of the WEBSERVER ' and I have declared the following code only once in the method of execution <%@ page ...
    Karthik N S
    Feb 11, 2005 at 6:45 am
    Feb 11, 2005 at 3:15 pm
  • Hi! Does DateFilter work on fields indexed as UnStored? Can I filter an UnStored field with values like "2004-11-05" ? Regards, Sanyi
    Feb 13, 2005 at 12:09 pm
    Feb 14, 2005 at 2:34 pm
  • Hi, I'm trying to compile Lucene but am encountering the following error on typing ant from the root of Lucene-1.4.3 C:\lucene-1.4.3 ant Buildfile: build.xml init: compile-core: BUILD FAILED ...
    Helen Butler
    Feb 2, 2005 at 7:26 pm
    Feb 2, 2005 at 9:00 pm
  • Hi, I'm new to Lucene and want to know whether Lucene has the capability of displaying the search results based on the user's rights. For example: Suppose there are some resources, like: Resource 1 ...
    Verma Atul (extern)
    Feb 1, 2005 at 3:01 pm
    Feb 2, 2005 at 5:23 am
  • What's the desired pattern for using TermInfosWriter.indexInterval? Do I have to compile my own version of Lucene to change this? The last API was public static final but this is not public nor ...
    Kevin A. Burton
    Feb 24, 2005 at 10:08 pm
    Mar 1, 2005 at 5:46 pm
  • Hello; Why won't this query find the document below? Query: +(type:203) +(name:*home\**) Document (relevant fields): Keyword<type:203 Keyword<name:marcipan + home* I was hoping by escaping the * it ...
    Luke Shannon
    Feb 17, 2005 at 7:40 pm
    Feb 18, 2005 at 2:28 pm
  • Hello. I use PyLucene, the Python port of Lucene. I have a problem using a big index (50 GB) with IndexSearcher from many threads. I use IndexSearcher from PyLucene's PythonThread. It's really a wrapper ...
    Yura Smolsky
    Feb 16, 2005 at 8:05 pm
    Feb 16, 2005 at 8:39 pm
  • Hi, A couple of newbie questions. I've searched the archives and read the Javadoc but I'm still having trouble figuring these out. 1. What's the best way to index and handle queries like the ...
    Paul Jans
    Feb 10, 2005 at 10:01 pm
    Feb 14, 2005 at 8:28 pm
  • Hi All, If I store multiple fields with the same name, for example “Author” with 3 values “bob”, “jane”, “bill”, are the values in the same order once I retrieve the doc? Thanks, Ramon ...
    Ramon Aseniero
    Feb 11, 2005 at 4:49 am
    Feb 11, 2005 at 9:47 pm
  • Hello; Getting squinted with Query Parsing. I have a question: Query query = MultiFieldQueryParser .parse("mario", new String[] { "name", "desc" }, new int[] { MultiFieldQueryParser.NORMAL_FIELD, ...
    Luke Shannon
    Feb 2, 2005 at 11:01 pm
    Feb 2, 2005 at 11:29 pm
  • Hey folks.. thanks in advance to any who respond... I do a good deal of post-search processing and the file io to read the fields I need becomes horribly costly and is definitely a problem. Is there ...
    Chris Fraschetti
    Feb 1, 2005 at 9:40 am
    Feb 1, 2005 at 10:54 pm
  • Hello, lucene-user. I have an index with many documents, more than 40 million. Each document has a DateField (the time stamp of the document). I need the most recent results only. I use a single instance of ...
    Yura Smolsky
    Feb 24, 2005 at 6:02 pm
    Feb 24, 2005 at 6:52 pm
  • Hi all, I have a question about scaling lucene across a cluster, and good ways of breaking up the work. We have a very large index and searches sometimes take more time than they're allowed. What we ...
    Chris D
    Feb 18, 2005 at 4:01 pm
    Feb 19, 2005 at 6:48 pm
  • Is there a simple, efficient way to compute similarity of documents indexed with Lucene? My first, naive idea is to use the entire contents of one document as a query to the second document, and use ...
    Matt Chaput
    Feb 18, 2005 at 9:27 pm
    Feb 18, 2005 at 10:15 pm
  • Hello, I would like to use ParallelMultiSearcher with a few RemoteSearchables. If one of the remote servers is down, can I call close() on the ParallelMultiSearcher and make a new ParallelMultiSearcher with ...
    Youngho Cho
    Feb 17, 2005 at 9:27 am
    Feb 18, 2005 at 6:47 am
  • Hi all, I'm quite a newbie to Lucene, but I bought "Lucene In Action" and I'm trying to customize a few examples taken from there. I have this sample JSP code (bad JSP 'cause I'm also a JSP newbie ...
    Pierre VANNIER
    Feb 15, 2005 at 8:45 am
    Feb 15, 2005 at 4:50 pm
  • I think I found a pretty good way to do a negative match. In this query I am looking for all the Documents that have a kcfileupload field with any value except for jpg. Query negativeMatch = new ...
    Luke Shannon
    Feb 10, 2005 at 9:01 pm
    Feb 11, 2005 at 10:03 pm
  • Hello all, I have heard that Lucene 1.3 Final should run under Java 1.1. (I need that because I want to run a search with a PDA using Java 1.1). However, when I run my code, I get the following ...
    Karl Koch
    Feb 8, 2005 at 3:28 pm
    Feb 8, 2005 at 9:05 pm
  • I have varying length text fields which I am searching on. I would like relevancy to be dictated predominantly by the number of terms in my query that match. Right now I am seeing a high relevancy ...
    Michael Celona
    Feb 7, 2005 at 1:48 pm
    Feb 7, 2005 at 9:08 pm
  • Is there any way to construct a query to locate all documents without a specific field? By this I mean the Document was created without ever having that field added to it. -- Bill Tschumy Otherwise ...
    Bill Tschumy
    Feb 3, 2005 at 7:18 pm
    Feb 4, 2005 at 9:03 pm
  • Hi, I am getting this exception now and then when I am indexing content. It doesn't always happen. But when it happens, I have to delete the index and start over again. This is a serious problem. In ...
    Chris Lu
    Feb 2, 2005 at 6:05 am
    Feb 3, 2005 at 5:58 pm
  • An IndexReader will always see the same set of documents. Even if another process deletes some documents, adds new ones or optimizes the complete index, your IndexReader instance will not see those ...
    Vanlerberghe, Luc
    Feb 24, 2005 at 2:07 pm
    Mar 1, 2005 at 6:19 pm
  • Hi All, How does Lucene handle multi term queries? Does it use short circuiting? So if a user entered: (a OR b) AND c But my program knew testing for "c" is cheaper than testing for "(a OR b)" and I ...
    Runde, Kevin
    Feb 21, 2005 at 6:59 pm
    Feb 21, 2005 at 8:29 pm
  • Hi! Is there any way to store info about the index in the index? (You know, like in .doc files on Windows. You can store title, author, etc...) I need to store the last indexed database UID in the ...
    Feb 17, 2005 at 1:43 pm
    Feb 18, 2005 at 12:21 am
  • I'm getting a bit more serious about the final form of our lucene index. Each document has DocNumber, Authors, Title, Abstract, and Keywords. By Keywords, I mean a comma separated list, each entry ...
    Owen Densmore
    Feb 12, 2005 at 8:08 pm
    Feb 16, 2005 at 8:23 am
  • First I'm getting a The requested URL could not be retrieved ------------------------------------------------------------------------ While trying to retrieve the URL: ...
    Jim Lynch
    Feb 14, 2005 at 3:31 pm
    Feb 14, 2005 at 5:33 pm
  • I'm building an index from a FileMaker database by dumping the data to a tab-separated file. Because the FileMaker output is encoded in MacRoman, and uses Mac line separators, I run a script across ...
    Owen Densmore
    Feb 10, 2005 at 5:32 am
    Feb 11, 2005 at 10:19 pm
  • Hi, I have an index with a field "documentNumber". There are 10 documents. One of the documents has documentNumber A5058970. I want to return all matches where documentNumber != A505*. I should get 9 ...
    Feb 10, 2005 at 6:02 pm
    Feb 10, 2005 at 6:25 pm
  • Idiot question. I've managed to blow away the "segments" file. This is an optimized index, so there's only one segment. Is there an easy way to reconstruct the segments file? I've looked over the ...
    Ian Soboroff
    Feb 4, 2005 at 4:58 pm
    Feb 7, 2005 at 6:38 pm
  • Hi, is it possible to retrieve ALL documents from a Lucene index? This should then actually not be a search... Karl ...
    Karl Koch
    Feb 7, 2005 at 10:41 am
    Feb 7, 2005 at 11:16 am
Group Overview: java-user

124 users for February 2005

Erik Hatcher: 75 posts; Luke Shannon: 47 posts; Karl Koch: 19 posts; Paul Elschot: 17 posts; Sergiu Gordea: 16 posts; David Spencer: 14 posts; Otis Gospodnetic: 14 posts; Andrzej Bialecki: 13 posts; Doug Cutting: 13 posts; Miles Barr: 13 posts; Jim Lynch: 12 posts; Kelvin Tan: 12 posts; Michael Celona: 12 posts; Kevin A. Burton: 11 posts; Yura Smolsky: 11 posts; Owen Densmore: 10 posts; Morus Walter: 9 posts; Aad Nales: 8 posts; Aurora: 8 posts; Chris Hostetter: 8 posts