128 discussions - 582 posts

  • Hi All, Once I index a bunch of documents with a StandardAnalyzer (and if the effort I need to put in to reindex the documents is not worth the effort), is there a way to search on the index without ...
    Dino Korah
    Aug 13, 2008 at 4:11 pm
    Sep 19, 2008 at 12:22 pm
  • Hi, I wish to know whether two indexes contain the same data. Will all the files be exactly the same if I add the same set of documents to both? --Noble ...
    Noble Paul നോബിള്‍ नोब्ळ्
    Aug 29, 2008 at 9:35 am
    Sep 6, 2008 at 12:53 pm
  • Hi, I followed the following procedure to escape special characters. String escapedKeywords = QueryParser.escape(keywords); Query query = new QueryParser("content", new ...
    Kalani Ruwanpathirana
    Aug 4, 2008 at 10:06 am
    Aug 12, 2008 at 10:08 am
  • Hi all, I am working on implementing a new Query, Weight and Scorer that is expensive to run. I'd like to limit the number of documents I run this query on by first building a candidate set of ...
    Matt Ronge
    Aug 30, 2008 at 1:36 am
    Sep 4, 2008 at 8:22 am
  • Hi, You may want to ask on the java-user list (more subscribers), which I'm CC-ing, so we can continue discussion there. I think you will have to implement your own logic that runs on A and does ...
    Otis Gospodnetic
    Aug 28, 2008 at 1:43 am
    Aug 29, 2008 at 1:52 am
  • Just trying to grasp the concept. I want to search a text file where each line is a separate item to be searched. When text is entered by the user, I want to return all the lines in which that text ...
    Brittany Jacobs
    Aug 1, 2008 at 1:33 pm
    Aug 4, 2008 at 6:35 pm
  • Hi, I'm sometimes receiving FileNotFoundExceptions during indexing. java.io.FileNotFoundException: /tmp/content/3615.0-3618.0/_3p.fnm (No such file or directory) at ...
    Wojtek212
    Aug 1, 2008 at 12:52 am
    Aug 3, 2008 at 9:33 am
  • Hi guys, Fairly new to Lucene, and just finished reading Lucene in Action. My problem is the following: I need to index the documents that only contain the following pattern(s) in a mass of ...
    Raymond Balmès
    Aug 30, 2008 at 7:02 pm
    Sep 9, 2008 at 1:30 pm
  • Hello, I'm using FrenchAnalyzer for indexing: IndexWriter writer = new IndexWriter(pathOfIndex, new FrenchAnalyzer(), true); Document doc = new Document(); doc.add(new ...
    Christophe from paris
    Aug 6, 2008 at 10:36 am
    Oct 15, 2008 at 5:30 am
  • Hi all, Let's say that I have in my index the value "One Two Three" for field 'A'. I'm using a custom analyzer that is described in the forwarded message. My Search query is built like this: ...
    Andre Rubin
    Aug 25, 2008 at 3:34 pm
    Aug 26, 2008 at 6:04 pm
  • Clarification question: If I don't store term vectors, then I: -- won't have information on the position of matching terms -- I don't have the term frequency vector -- but I should still have the ...
    David Lee
    Aug 21, 2008 at 11:21 pm
    Aug 22, 2008 at 9:20 pm
  • Hi all, We are testing Lucene with SSD. No doubt the performance is much better than that of a normal hard disk. However it's still not good enough for our particular case. So I wonder if there are ...
    Cedric Ho
    Aug 19, 2008 at 8:23 am
    Aug 21, 2008 at 3:37 am
  • Hi. I am searching for a Lucene API or function for a query like "FIELD 1000". For example, a user wants to search for a product whose price is greater than the user's input. If the user's input is 10000 then the results are ...
    장용석
    Aug 12, 2008 at 10:01 am
    Aug 19, 2008 at 5:48 pm
  • Hi! I'm using Lucene 2.3.2 to store a relatively-large index of HTML documents. I'm storing ~150 million documents, taking up 150 GB of space. I index the HTML text, but I only store primary key ...
    Mattspitz
    Aug 16, 2008 at 8:08 am
    Aug 18, 2008 at 9:39 pm
  • Hello Users, I'm working on a project which attempts to store data that comes from an OCR process which describes the pixel co-ordinates of each term in the document. It's used for hit highlighting. ...
    Martin Owens
    Aug 5, 2008 at 3:03 pm
    Aug 11, 2008 at 3:37 pm
  • Hello, I am new to all this. I need to read in a text file and have each line in the file be a document. The LineDocMaker seems to be intended for this purpose. But I can't figure out how to read the ...
    Brittany Jacobs
    Aug 6, 2008 at 8:17 pm
    Aug 8, 2008 at 3:25 pm
  • Hi, Say, I have a query with two terms: A + B, I want to return the documents with 50 of A and 50 of B on top of documents with 1000 of A and 1 of B. Is there an existing Query class that can handle this ...
    Shi Hui Liu
    Aug 27, 2008 at 6:01 pm
    Aug 29, 2008 at 5:49 pm
  • Hi All, I am new to Lucene, and I am using it for indexing and searching. Is it possible to search for substrings with it? For example, if a field holds the value "LuceneIndex" and I give the ...
    Venkata Subbarayudu
    Aug 25, 2008 at 11:54 am
    Aug 26, 2008 at 1:42 pm
  • Hi All, I am using IndexWriter for adding the documents. I am re-using the document as well as the fields for improving index speed as per the link ...
    Aditi Goyal
    Aug 19, 2008 at 9:09 am
    Aug 21, 2008 at 12:28 pm
  • I started playing with payloads and have been trying to work out how to get the data into the payload. I have a field where I want to add the following untokenized fields A1 A2 A3 With these fields, I ...
    Antony Bowesman
    Aug 14, 2008 at 3:15 am
    Aug 20, 2008 at 2:33 pm
  • Hi, I am currently using Lucene for indexing. After I index a file, I use Luke to open it and check the index. There is one part that I am curious about. In Luke, under the Document tab, I ...
    Blazingwolf7
    Aug 18, 2008 at 3:00 am
    Jul 10, 2012 at 8:05 pm
  • So from what I understand, is it true that if mergeFactor is 10, then when I index my first 9 documents, I have 9 separate segments, each containing 1 document? And when searching, it will search ...
    David Lee
    Aug 22, 2008 at 11:36 pm
    Sep 14, 2009 at 12:15 pm
  • Hi, I'm trying to get MoreLikeThis working but it just returns no results. I have Lucene working for normal queries and indexing but MoreLikeThis just returns nothing. This is what I'm trying ...
    Davood
    Aug 30, 2008 at 6:06 am
    Sep 1, 2008 at 3:00 pm
  • Hi, Sorry if I missed this somewhere or maybe it's not released yet, but I was anxiously curious about Lucene 3.0's expected features/improvements. Is there a list yet? Thanks! Darren ...
    Darren Govoni
    Aug 26, 2008 at 10:53 pm
    Aug 28, 2008 at 10:47 am
  • Anyone else run on Windows? We have an index around 26 GB in size. It seems the file system cache ends up taking nearly all available RAM (26 GB out of 32 GB on a 64-bit box). The Lucene process is around 5 GB, ...
    Robert Stewart
    Aug 16, 2008 at 11:44 am
    Aug 19, 2008 at 2:27 pm
  • Hi all, When I switched a String field from tokenized to untokenized, some searches started not returning some obvious values. Am I missing something on querying untokenized fields? Another question ...
    Andre Rubin
    Aug 8, 2008 at 12:04 am
    Aug 13, 2008 at 7:54 pm
  • Hi, The indexer can't be opened after about 20 queries on a Linux system, but it is fine if the index is on a Windows system. The indexer is the same on both systems. reader = ...
    Xh sun
    Aug 5, 2008 at 2:34 am
    Aug 6, 2008 at 6:07 pm
  • Hi, I am trying to boost the freshness of some of our documents in the index using the most efficient way (i.e. if 2 news stories have the same score based on the content then I want to promote the ...
    Yannis Pavlidis
    Aug 28, 2008 at 4:13 pm
    Aug 29, 2008 at 10:16 pm
  • I have a custom TopDocsCollector and need to collect a payload from each final document hit. The payload comes from a single term in each hit. When collecting the payload, I don't want to fetch the ...
    Antony Bowesman
    Aug 27, 2008 at 7:08 am
    Aug 28, 2008 at 9:55 am
  • I am new to Lucene. Here is my question. The document has fields. When I add a field to the document I can specify that the field is Indexed, Tokenized, etc. So the same field can be Tokenized in one ...
    DimitriD
    Aug 22, 2008 at 3:17 pm
    Aug 25, 2008 at 9:23 am
  • I have an index in Spanish and I use Snowball to stem and analyze and it works perfectly. However, I am running into trouble storing (not indexing, only storing) words that have special characters. ...
    Juan Pablo Morales
    Aug 21, 2008 at 5:17 pm
    Aug 22, 2008 at 12:16 am
  • Hello, I am creating fields for documents like this: String name = ... String value = ... doc.add(new Field(name, value, Field.Store.NO, Field.Index.UN_TOKENIZED)); On the query side, sometimes I ...
    Bill Chesky
    Aug 18, 2008 at 2:29 pm
    Aug 18, 2008 at 11:37 pm
  • Hi, I need the first 100 documents in sorted order, let's say sorted on the document id, and there are more than 50K documents in the index. My search query is matching all those 50K documents. Is there ...
    Neeraj Gupta
    Aug 5, 2008 at 8:36 pm
    Aug 7, 2008 at 4:38 pm
  • hello, is there any date for the 2.3.3 release? best, -C.B.
    Cam Bazz
    Aug 4, 2008 at 11:29 pm
    Aug 5, 2008 at 9:11 am
  • Hi, How do I list all the fields in an index? Some documents do not contain all fields. Thanks, John -- View this message in context: ...
    John Patterson
    Aug 13, 2008 at 9:03 am
    Feb 10, 2009 at 3:29 pm
  • Hi, I'd appreciate if someone could explain the results I'm getting. I've written a simple custom analyzer that applies the NGramTokenFilter to the token stream during indexing. It's never applied ...
    Gaz77
    Aug 28, 2008 at 2:53 pm
    Sep 1, 2008 at 11:49 am
  • Hi, I know that Lucene uses an inverted index which makes range queries and greater-than/less-than type queries very slow for continuous data types like times, latitude, etc. Last time I looked they ...
    John Patterson
    Aug 27, 2008 at 9:11 am
    Sep 1, 2008 at 8:29 am
  • Can you combine these two queries somehow so that they behave like a PhraseQuery? I have a custom query parser which takes a phrase like "*at sat" and produces a BooleanQuery consisting of a ...
    Chris Bamford
    Aug 26, 2008 at 3:08 pm
    Aug 28, 2008 at 3:27 am
  • I am a developer on the JIRA Issue tracker, and we are considering upgrading our Lucene version from v2.2.0 to v2.3.2. I have been charged with doing the risk analysis, and project work. I have read ...
    Mark Lassau
    Aug 26, 2008 at 7:18 am
    Aug 27, 2008 at 9:50 am
  • Hello, I'm interested in knowing how these tokenizers work together. The API doc for TeeTokenFilter http://lucene.apache.org/java/2_3_1/api/org/apache/lucene/analysis/TeeTokenFilter.html has this ...
    Teruhiko Kurosaka
    Aug 22, 2008 at 7:48 pm
    Aug 26, 2008 at 1:15 pm
  • Hi, I just discovered some strange behaviour with deleted documents. I do a search for documents with a certain query and delete one using IndexWriter.deleteDocuments(Term) using a key for the term. ...
    John Patterson
    Aug 26, 2008 at 7:56 am
    Aug 26, 2008 at 10:55 am
  • Hi, I am using StandardAnalyzer when creating the Lucene index. It indexes the word "wo&rk" as it is but does not index the word "wo*rk" in that manner. Can I index such words (including * and ?) as ...
    Kalani Ruwanpathirana
    Aug 25, 2008 at 7:20 am
    Aug 25, 2008 at 9:27 am
  • Hello, I am new to Lucene and I want to make sure that what I am trying to do will not hurt performance. My scenario is the following: I want to keep user text files indexed separately, I will have about ...
    Cyndy
    Aug 19, 2008 at 2:34 am
    Aug 19, 2008 at 3:08 pm
  • Dear List, I have a rather big index, around 20 GB. My documents have a unique id that I store in an untokenized field. Using an IndexReader I delete documents by term using the id. The applications ...
    Michael Zehrer
    Aug 7, 2008 at 12:47 pm
    Aug 16, 2008 at 8:33 am
  • Dear users, Question on approaches to indexing TEI XML or similar section/subsectioned files. I'm indexing TEI P4 XML files using Lucene 2.x. Currently, each TEI XML file corresponds to a Lucene ...
    Ao1
    Aug 13, 2008 at 8:04 am
    Aug 13, 2008 at 4:31 pm
  • We're trying to perform a query where if our intended search term/phrase is part of a specific larger phrase, we want to ignore that particular match, but not the entire document (unless of course ...
    Jeff French
    Aug 11, 2008 at 11:19 pm
    Aug 12, 2008 at 3:31 pm
  • Hello, what would happen if I modified the class IndexWriter and made the delete-by-id method public? I have two fields in my documents and I have to be able to delete by those two fields (by query ...
    Cam Bazz
    Aug 8, 2008 at 9:40 pm
    Aug 12, 2008 at 10:11 am
  • Hello, I have two machines on the same network, but I want to use one machine to search an index located on the file system of the other machine. Any ideas on how to achieve this? Thanks Dana -- View ...
    DanaWhite
    Aug 7, 2008 at 3:54 pm
    Aug 8, 2008 at 4:19 am
  • Hi there! I'm new to Lucene, so forgive any misconceptions on my part. I created an Index and now I want to search on it based on a field. The field is a String field and Field.Store.YES and ...
    Andre Rubin
    Aug 5, 2008 at 7:15 pm
    Aug 5, 2008 at 8:19 pm
Group Overview
group: java-user @
categories: lucene
discussions: 128
posts: 582
users: 130
website: lucene.apache.org

130 users for August 2008

  • Michael McCandless: 52 posts
  • Otis Gospodnetic: 24 posts
  • Erick Erickson: 22 posts
  • Tom: 18 posts
  • Doron Cohen: 17 posts
  • Grant Ingersoll: 17 posts
  • Mark Miller: 17 posts
  • Andre Rubin: 16 posts
  • Chris Hostetter: 14 posts
  • Karl Wettin: 13 posts
  • Steven A Rowe: 12 posts
  • Brittany Jacobs: 11 posts
  • Dino Korah: 11 posts
  • Antony Bowesman: 10 posts
  • Cam Bazz: 10 posts
  • Ian Lea: 10 posts
  • Daniel Naber: 8 posts
  • Andrzej Bialecki: 7 posts
  • Daniel Noll: 7 posts
  • Darren Govoni: 7 posts