124 discussions - 460 posts

  • If I use a Sort instance on my searcher, what will have priority? Score or Sort? Assuming I have a pages with .9, .9, and .5 scores, ... if the .5 has a higher 'sort' value, will it return higher ...
    Chris Fraschetti
    Oct 12, 2004 at 8:52 pm
    Oct 15, 2004 at 5:57 am
  • I recently read in regards to my problem that date_field:[0820483200 TO 1104480000] is evaluated into a series of boolean queries ... which has a cap of 1024 ... considering my documents will have ...
    Chris Fraschetti
    Oct 1, 2004 at 1:25 am
    Oct 5, 2004 at 6:14 pm
  • Thanks to something Doug said when I first opened this discussion, I went back and looked at my implementation. He said, "Can't we just do this in getFieldQuery?". Figuring that he probably knew what ...
    Bill Janssen
    Oct 27, 2004 at 12:33 am
    Nov 16, 2004 at 12:45 am
  • Hello all, I need a piece of advice/experience.. What pdf parser (written in java) u'd recommend? I played now with PDFBox-0.6.7a and would not say I was satisfied too much with it On certain pdf's ...
    Iouli Golovatyi
    Oct 22, 2004 at 11:34 am
    Oct 26, 2004 at 11:29 am
  • Hi All I have a question regarding selection of Analyzer's during query parsing i have three field in my index db_id, full_text, subject all three are indexed, however while indexing I specified to ...
    Rupinder Singh Mazara
    Oct 19, 2004 at 3:23 pm
    Oct 21, 2004 at 1:32 pm
  • http://www.peerfear.org/rss/permalink/2004/10/15/GoogleDesktopCouldBeBetter/ -- Use Rojo (RSS/Atom aggregator). Visit http://rojo.com. Ask me for an invite! Also see irc.freenode.net #rojo if you ...
    Kevin A. Burton
    Oct 15, 2004 at 7:57 am
    Oct 21, 2004 at 1:59 pm
  • Can any one give me a demo for indexing XML files ? Mit freundlichen Grüssen - with kind regards Sumathi P Junior Consultant QA GFT Technologies , India 95 , Bharathidasan Salai Cantonment , ...
    Oct 5, 2004 at 1:07 pm
    Oct 11, 2004 at 1:04 pm
  • I am having this same problem, but cannot find any help! I have a keyword field that sometimes includes double quotes, but I am unable to search for that field because the escape for a quote doesn't ...
    Will Allen
    Oct 28, 2004 at 4:04 pm
    Oct 29, 2004 at 6:34 am
  • Hello, I am a Java/Lucene/Tomcat newbie I know that does not bode well as a start to a post but I really am in dire straits as far as Lucene goes so bear with me. I am working on indexing and ...
    James Tyrrell
    Oct 27, 2004 at 10:59 am
    Oct 28, 2004 at 2:59 pm
  • Is anyone aware of an open source (non-GPL; i.e.., free for commercial use) Arabic analyzer for Lucene? Does Arabic really require a stemmer as well (some of the reading I've seen on the web would ...
    Scott Smith
    Oct 6, 2004 at 11:28 pm
    Oct 19, 2004 at 3:01 pm
  • Hi, I'm getting: java.io.IOException: Lock obtain timed out I have a writer service that opens the index to delete and add docs. I have a reader service that opens the index for searching only. This ...
    Yahootintin 1247688
    Oct 27, 2004 at 9:33 pm
    Nov 6, 2004 at 12:40 am
  • Hi, I'm new using lucene. I downloaded lucene 1.4.2 and added the 2 jar files to the classpath. Executing the demos as a bat file (Windows) is working fine, but using lucene as a web 'application' is ...
    Willy De Waele
    Oct 28, 2004 at 1:01 pm
    Nov 1, 2004 at 4:55 pm
  • Hi, Why there is a limit on the number of clauses? and is there any harm in setting MaxClauseCount to Integer.MAX_VALUE? I'm using a Range Query on a field that represents dates and getting ...
    Angelov, Rossen
    Oct 25, 2004 at 10:35 pm
    Oct 26, 2004 at 6:43 pm
  • Hello all, I need a piece of advice/experience again.. What ms Word/Excel/PowerPoint parsers (written in java) u'd recommend? Thanks in advance J. ...
    Iouli Golovatyi
    Oct 25, 2004 at 2:55 pm
    Oct 26, 2004 at 10:39 am
  • hy lucene users i developed a Spell checker for lucene inspired by the David Spencer code see the wiki doc: http://wiki.apache.org/jakarta-lucene/SpellChecker Nicolas Maisonneuve
    Nicolas Maisonneuve
    Oct 11, 2004 at 6:26 pm
    Oct 21, 2004 at 4:06 pm
  • Hi FFI, I am indexing multiple documents like (word,excel,html,ppt,pdf) at the time of indexing there is no problem..... My search results contents(description) comes with small Boxes(this is ...
    Oct 19, 2004 at 8:32 am
    Oct 19, 2004 at 10:05 am
  • Hi everybody, I am thinking about extending the Lucene search with metadata in the following way Field Value --------------------------------------------------------------------------- Title (n1, n2, ...
    Michael Hartmann
    Oct 12, 2004 at 12:16 pm
    Oct 14, 2004 at 6:53 am
  • Thanks to the recent changes (see CVS) in TermFreqVector support we can now make use of term offset information held in the Lucene index rather than incurring the cost of re-analyzing text to ...
    Oct 28, 2004 at 11:16 pm
    Nov 4, 2004 at 11:16 am
  • Hi Guys Apologies......... I have a Field Type "Text" 'ItemPrice' , Using it to Store " Price Factor in numeric " such as 10, 25.25 , 50.00.... If I am suppose to Find the Range factor between 2 ...
    Karthik N S
    Oct 19, 2004 at 6:57 am
    Oct 21, 2004 at 8:37 am
  • Hello guys, I need additions and deletions of documents to the index to be ATOMIC (they either happen to completion or not at all). On top of this, I need updates (which I currently implement with a ...
    Christian Rodriguez
    Oct 15, 2004 at 9:13 pm
    Oct 19, 2004 at 6:36 am
  • Hello, Does anyone do indexing of numeric entities for Japanese characters? I have (non-x)html containing those entities and need to index and search them.
    Daan Hoogland
    Oct 7, 2004 at 6:00 am
    Oct 12, 2004 at 4:21 pm
  • LS, in http://issues.apache.org/eyebrowse/ReadMsg?listId=30&msgNo=8980 Jon Schuster explains how to get a Japanese search system working. I followed his advice and got a index that "luke" shows as ...
    Daan Hoogland
    Oct 8, 2004 at 10:16 am
    Oct 12, 2004 at 12:57 pm
  • Hi, I'm curious about your strategy to backup indexes based on FSDirectory. If I do a file based copy I suspect I will get corrupted data because of concurrent write access. My current favorite is to ...
    Christoph Kiehl
    Oct 27, 2004 at 12:17 pm
    Nov 16, 2004 at 10:31 am
  • Hi Guys Apologies.......... I was Curious to Know the Difference between ParallelMultiSearcher and MultiSearcher , 1) Is the working internal functionality of these are same or different . 2) In ...
    Karthik N S
    Oct 13, 2004 at 7:07 am
    Oct 15, 2004 at 5:42 pm
  • Hi, I want to do filtering on matched results of a query. For example 1. age 18 2. age < 18 3. age 5 and age < 18 4. birthdate = [some date] What can be the best approach? How can it be done with ...
    Sam s
    Oct 14, 2004 at 1:05 am
    Oct 14, 2004 at 7:00 pm
  • Hi , Would there be a problem if one enters space while using wildcards ? say i search for 'abc' . i get 100 hits as results 'man' gives - 200 'abc man' gives 300 but 'ab* man' 'abc ma*' ab* ma*' ab* ...
    Robinson Raju
    Oct 1, 2004 at 4:17 am
    Oct 5, 2004 at 6:39 am
  • http://www.peerfear.org/rss/permalink/2004/10/26/PoorLuceneRankingForShortText/ -- Use Rojo (RSS/Atom aggregator). Visit http://rojo.com. Ask me for an invite! Also see irc.freenode.net #rojo if you ...
    Kevin A. Burton
    Oct 27, 2004 at 6:21 pm
    Dec 24, 2004 at 4:26 pm
  • I have a need to search an index for documents that were taken from particular files in the filesystem. Each document in the index has a field named "url" that is created using: ...
    Bill Tschumy
    Oct 28, 2004 at 10:22 pm
    Oct 29, 2004 at 6:47 am
  • Hello, Do you know about a Hebrew language solution in Lucene? ...
    Alex Kiselevski
    Oct 10, 2004 at 7:19 am
    Oct 21, 2004 at 7:22 am
  • hi all i have a question regarding the QueryParser and Proximity Searches I executed the following piece of code String x = "\"jakarta apache\"~100"; QueryParser parser = new ...
    Rupinder Singh Mazara
    Oct 18, 2004 at 6:40 pm
    Oct 19, 2004 at 9:07 am
  • We need to have index files that can't be reverse engineered, etc. An obvious approach would be to write a 'FSEncryptedDirectory' class, but sounds like a performance killer. Does anyone have ...
    Weir, Michael
    Oct 13, 2004 at 1:20 pm
    Oct 18, 2004 at 8:14 pm
  • Hello, i've got a problem with stopword elimination function. i'm trying to use this function: GermanAnalyzer germanAnalyzer = new GermanAnalyzer(); IndexWriter writer = new IndexWriter("dbind", ...
    Miro Max
    Oct 17, 2004 at 3:23 am
    Oct 18, 2004 at 10:16 am
  • I am a new user of Lucene. I am looking to index over 20 million documents (and a lot more someday) and am looking for ideas on the best indexing/search strategy. Which will optimize the Lucene ...
    Jeff Munson
    Oct 8, 2004 at 12:44 pm
    Oct 12, 2004 at 11:09 am
  • Hi Does anyone know why Lucene returns 0 hits when there are in fact three matches? The attached are two java class that repeat the problem. In the example, I created a Keyword field "type" for each ...
    Fred Yu
    Oct 6, 2004 at 8:29 pm
    Oct 7, 2004 at 7:31 pm
  • Hi, One document in my index contains term 'doom 3' (indexed, tokenized, stored) How can I match term doom3 with that document? I tried following but no luck I have written alias filter which returns ...
    Abhay Saswade
    Oct 26, 2004 at 5:23 pm
    Oct 26, 2004 at 7:39 pm
  • Hi folks, I want to download full copies of web pages and store them locally, as well as the hyperlink structures as local directories. I tried to use Lucene, but I've realized that it doesn't have a ...
    Luciano Barbosa
    Oct 19, 2004 at 9:35 pm
    Oct 20, 2004 at 12:55 pm
  • Hi, I read the "sorting and score ordering - http://www.mail-archive.com/lucene-user@jakarta.apache.org/msg09775.html" thread from the archive and I think, I have a very similar problem but I still ...
    Angelov, Rossen
    Oct 18, 2004 at 7:25 pm
    Oct 18, 2004 at 10:24 pm
  • if i have four threads all trying to call my index function, will lucene do what is necessary for each thread to wait until the writer is available.. or will the threads get an exception? -- Chris ...
    Chris Fraschetti
    Oct 15, 2004 at 11:53 pm
    Oct 16, 2004 at 9:15 am
  • I have used the Lucene factory method Field.Text(String, String) to index, tokenize and store several hundreds short xml files. I stored the entire xml content of these files in a field called ...
    Juan A. Polanco
    Oct 14, 2004 at 6:02 pm
    Oct 15, 2004 at 1:34 pm
  • Hello, I am using the IndexHTML class to index around 30,000 files and it is working fine. Question that I have is, is there a way to add multiple fields to index so that when the actual search is ...
    Hetan Shah
    Oct 13, 2004 at 6:44 pm
    Oct 15, 2004 at 12:39 pm
  • Hi Folks, Is there any place where I can do a better search on the Lucene mailing archives? I tried JGuru and it looks like their search is paid. The Apache-maintained archives lack efficient searching. Thanks ...
    Sam s
    Oct 14, 2004 at 4:59 pm
    Oct 14, 2004 at 8:48 pm
  • Hi, Index side information: No. of indexes: Two (to explain better I call these as index_a and index_b). Fields in index_a: x and y. Fields in index_b: y and z. I have written a multisearch code like ...
    Sreedhar, Dantam
    Oct 12, 2004 at 12:07 pm
    Oct 12, 2004 at 5:47 pm
  • The dataset that I index is pretty dynamic and flexible, and I started to notice an incorrectly displayed character on some of my results... some debugging showed that it was the Unicode character ...
    Chris Fraschetti
    Oct 12, 2004 at 12:51 am
    Oct 12, 2004 at 1:54 am
  • hi all following is some code that i use to index the contents of a table ( there are 18746 records in the table. ) using a database result set , i loop over all the records , creating a document ...
    Rupinder Singh Mazara
    Oct 5, 2004 at 6:31 pm
    Oct 7, 2004 at 9:42 am
  • Hello I have a stand-alone java application. We have a new requirement where there will be around 1000 data files in XML format. Each of them have the same format. Nodes will have value and ...
    Oct 2, 2004 at 12:06 am
    Oct 4, 2004 at 8:46 pm
  • Is there way to include stopwords in an exact phrase search? For example, when I search on "Melbourne IT", Lucene only searches for Melbourne ignoring "IT". Thanks, Ravi. ...
    Oct 27, 2004 at 7:36 pm
    Oct 27, 2004 at 7:47 pm
  • Hello, 1. Is it possible to use Lucene to search PDF contents? 2. Can it search PDF files with Chinese contents? Eric ...
    Eric Chow
    Oct 25, 2004 at 4:10 am
    Oct 25, 2004 at 9:22 am
  • Lucene 1.4 changed the file format for indexes. You can access an old index using Lucene 1.4, but you can't use older versions to access an index that was created with Lucene 1.4. I suggest you rebuild ...
    Oct 21, 2004 at 4:59 pm
    Oct 22, 2004 at 6:01 pm
  • hi, I am new to lucene. I want to build a search engine for myself, and use jboss(bundled with tomcat) as server. I wrote following code to do the index: ---------------------------code ...
    Slide Tao
    Oct 22, 2004 at 9:48 am
    Oct 22, 2004 at 11:03 am
  • Hello, I'm a new user of Lucene, and a would like to use it to create a Thesaurus. Do you have any idea to do this? .... Thanks! kind regards P.Galeas ...
    Patricio Galeas
    Oct 19, 2004 at 4:50 pm
    Oct 19, 2004 at 10:20 pm
Group Overview
Period: Oct 2004
Group: java-user

116 users for October 2004

  • Erik Hatcher: 25 posts
  • Chris Fraschetti: 19 posts
  • Daniel Naber: 18 posts
  • Otis Gospodnetic: 16 posts
  • Sergiu Gordea: 16 posts
  • Karthik N S: 12 posts
  • Morus Walter: 12 posts
  • Iouli Golovatyi: 11 posts
  • Justin Swanhart: 11 posts
  • Nader Henein: 11 posts
  • Sumathi: 11 posts
  • Daan Hoogland: 10 posts
  • Rupinder Singh Mazara: 10 posts
  • Aad Nales: 8 posts
  • Aviran: 8 posts
  • Chuck Williams: 8 posts
  • Bill Janssen: 7 posts
  • Che Dong: 7 posts
  • Kevin A. Burton: 7 posts
  • Angelov, Rossen: 6 posts