35 discussions - 159 posts

  • Hi, I have a question about the size of the Xapian index. I indexed a set of 200,000 data items with a total size of about 1 GB, and the index created is more than 3 GB! What can explain this ...
    Justine Demeyer
    Nov 24, 2008 at 2:47 pm
    Dec 2, 2008 at 6:30 am
  • Greets, WRT add_posting() and the term's position: presumably it's best to use the actual offset in the source as the position, rather than the line number containing the term, right? I take it this ...
    Nov 18, 2008 at 12:35 pm
    Nov 19, 2008 at 11:58 am
  • Greets, It's been a long day, and my brain is becoming mushy, so forgive my silly questions: I'm coming from another indexing system/paradigm which uses named "fields" which store strings which are ...
    Nov 6, 2008 at 3:43 pm
    Nov 9, 2008 at 2:56 pm
  • Hi all, I'm using Xapian for my mail indexer/searcher[1]; the current version uses Xapian in tandem with SQLite, but I'm making it Xapian-only now, mainly for reasons of simplification of the code. ...
    Nov 10, 2008 at 7:20 pm
    Dec 1, 2008 at 11:04 am
  • Hello all, Our application uses several dozen 'fields' as the basis for searching (migrating from another indexing system). ie, all indexed data is stored in fields. Searching these fields works as ...
    Nov 22, 2008 at 8:02 pm
    Nov 24, 2008 at 11:36 am
  • Greetings all, I'm about to evaluate Xapian for a future project and would appreciate a few comments from those in the know: Indexing 1. Is Xapian similar to Lucene in the sense that you can define ...
    Nov 3, 2008 at 10:09 am
    Nov 6, 2008 at 2:29 pm
  • Hello dear list, I'm trying to index various types of files with Xapian, used in a Python program. Text and HTML work fine via index_text() but I can't find any explanations for indexing other types ...
    Florian Beer
    Nov 5, 2008 at 10:44 am
    Nov 9, 2008 at 2:50 pm
  • Hi, I've decided to use Xapian because the files table in my MySQL database is going to grow very large, and it seems MySQL isn't good at full-text searching. I'm doing this with the PHP wrapper by the ...
    Nov 21, 2008 at 2:29 am
    Nov 30, 2008 at 8:50 pm
  • Hi, I am trying to add some functions to the Xapian library, but when I tried to compile it, the linker gave me the "undefined reference" error. Here is what I did: Under directory matcher, I created ...
    Zhiguo Li
    Nov 24, 2008 at 11:04 am
    Dec 1, 2008 at 8:02 am
  • Hello, I want to develop a simple, lightweight desktop full-text search tool on Windows. After a lot of comparison, I decided to use Xapian as the indexing engine, but I haven't found any ...
    Jinqian Huang
    Nov 20, 2008 at 7:56 pm
    Nov 25, 2008 at 4:14 pm
  • Hi, I am new to this discussion group and I hope my question hasn't been asked before. I was trying to do the following: I have an enquiry and a database. After performing the query I got the MSet. Now ...
    Zhiguo Li
    Nov 12, 2008 at 9:27 pm
    Nov 13, 2008 at 12:31 am
  • Hi there, I've got a semi-production database just a little screwed up (not completely, so I can't really remove everything and start over again), which I am using on a Debian system with the PHP5 ...
    Yannick Warnier
    Nov 3, 2008 at 5:50 am
    Nov 9, 2008 at 5:14 pm
  • Is there an API for extending Xapian's scoring algorithm? I would like to rank documents by a factor of Xapian's score and their age. Thanks, Rob
    Robert Young
    Nov 20, 2008 at 10:08 pm
    Nov 30, 2008 at 8:46 pm
  • Hi, I have a simple question: given an MSet "matches" and a Query "query", how do I get the weight of the terms in the query? Suppose I use the BM25 weighting scheme. Also suppose I have the document ...
    Zhiguo Li
    Nov 17, 2008 at 11:02 pm
    Nov 18, 2008 at 8:24 pm
  • Hi, I would like to have two term iterators to go over the list of terms like the following: for (t1 = query.get_terms_begin(); t1 != query.get_terms_end(); t1++) for (t2=t1+1; t2 != ...
    Zhiguo Li
    Nov 13, 2008 at 5:42 am
    Nov 13, 2008 at 8:15 pm
  • Hello Xapianistas, I am writing an email search tool in Perl and would like users to be able to search for messages by date. I decided that when indexing I would catch dates and convert them to ...
    Amias Channer
    Nov 8, 2008 at 11:47 am
    Nov 8, 2008 at 8:26 pm
  • Greets, I may be mistaken, but it looks like the Perl CPAN module Search::Xapian is missing method WritableDatabase::add_spelling, even though you can set FLAG_SPELLING_CORRECTION in ...
    Nov 6, 2008 at 2:20 pm
    Nov 7, 2008 at 7:35 am
  • Hi, I am a bit confused about query syntax if wildcard queries are allowed. The code looks like: my $query=$qp->parse_query( $qstring, FLAG_WILDCARD ); printf "Parsed query '%s'\n", $query-> ...
    Torsten Foertsch
    Nov 6, 2008 at 12:49 pm
    Nov 6, 2008 at 1:27 pm
  • I've uploaded Xapian 1.0.9 (including Search::Xapian), which as usual you can download from: http://xapian.org/download This release fixes a few bugs, improves documentation in a few places, ...
    Olly Betts
    Nov 1, 2008 at 3:28 am
    Nov 3, 2008 at 1:34 pm
  • The documentation says that initially the PostingSource points to a position *before* the first position. The default skip_to() calls get_docid() before it calls next(). Is this OK? If skip_to() is ...
    Paul Rudin
    Nov 21, 2008 at 7:14 am
    Nov 30, 2008 at 8:15 pm
  • Hi, I'm currently testing Xapian for an e-commerce website and I want to implement a search by navigation like there is on http://www.bbcshop.com. I don't think there is a magic function that would ...
    Yann ROBIN
    Nov 25, 2008 at 1:41 pm
    Nov 25, 2008 at 2:31 pm
  • Hello, For some obscure reason, in one part of our program we want to use TermGenerator only to recover the words of a text. Is this possible with the Python bindings? I tried: tg = ...
    David Versmisse
    Nov 14, 2008 at 4:45 pm
    Nov 15, 2008 at 6:13 pm
  • Hi, just to clarify, what is the difference between a normal prefix and a boolean prefix? If I understand it correctly a normal prefix is a way to give a name to a certain part of the index. When the ...
    Torsten Foertsch
    Nov 7, 2008 at 4:50 pm
    Nov 9, 2008 at 12:21 pm
  • Hi all, I've used Xapian with Python, but now I want to try it with C++. So I tried to compile the examples: g++ simpleindex.cc -o simpleindex But I get some errors: /tmp/ccl7xhUF.o: In function ...
    Justine Demeyer
    Nov 1, 2008 at 9:44 am
    Nov 1, 2008 at 6:17 pm
  • Hi all, I've been asked to prepare a comparison of Lucene/Solr and Xapian and I'm trying to find some differences between the two. I'm not that familiar with Lucene myself but I expect there are lots ...
    Charlie Hull
    Nov 28, 2008 at 10:32 am
    Nov 28, 2008 at 1:23 pm
  • Hi, I've compiled Xapian 1.0.9 (core and bindings) on Windows XP, using Lemur's nmake files[1]. I'm trying to use acts_as_xapian [2] on Ruby on Rails, but I'm facing a problem with the rebuild index ...
    Pietro Giorgianni
    Nov 19, 2008 at 9:16 am
    Nov 19, 2008 at 10:16 am
  • Hi, http://www.xapian.org/docs/queryparser.html states: "The QueryParser can be configured to support range-searching using document values." How to do that using the Perl API? I want to be able to ...
    Torsten Foertsch
    Nov 16, 2008 at 12:26 pm
    Nov 17, 2008 at 10:18 am
  • Hi, all, I just found a weird problem about PositionIterator, and I really hope somebody could help me out. My example code looks like the following: Xapian::MSetIterator im; Xapian::TermIterator ...
    Zhiguo Li
    Nov 13, 2008 at 11:55 am
    Nov 13, 2008 at 12:04 pm
  • Hi all, I've tried to figure out how to work with clustering and categorisation. As I understand it, I have to check out the SVN branches of clustering and matchspy, but I can't understand how to index ...
    Denis Kuzmenok
    Nov 4, 2008 at 7:24 am
    Nov 9, 2008 at 4:04 pm
  • Hi, Do the PHP5 bindings always support all of the methods provided by the Xapian core (http://xapian.org/docs/apidoc/html/), or is there separate documentation for the ones supported in the ...
    Yannick Warnier
    Nov 3, 2008 at 6:02 am
    Nov 9, 2008 at 2:56 pm
  • Is it possible to extract common phrases from an index? Basically, I'd like to index my document set and find words that commonly appear next to each other. For example, if I have a set of recent political ...
    Nov 6, 2008 at 4:47 am
    Nov 6, 2008 at 1:38 pm
  • Hi, is there a Xapian stemmer suitable for the Polish or Czech languages? Thanks, Torsten
    Torsten Foertsch
    Nov 5, 2008 at 1:30 pm
    Nov 5, 2008 at 2:48 pm
  • Hi, I'd like to know if the directions for the apt repository for debian etch, on the project website, http://xapian.org/download.php ------------------------ $ su - enter your root password # wget ...
    Josef Novak
    Nov 2, 2008 at 3:56 am
    Nov 2, 2008 at 3:13 pm
  • Alex, I am writing an indexer using your Search::Xapian module. From what I have learned from xapian-omega/omindex.cc I think the following is correct. Can you please confirm it? my ...
    Torsten Foertsch
    Nov 1, 2008 at 5:49 pm
    Nov 1, 2008 at 7:07 pm
  • There were some noticeable issues with Xapian version 1.0.8 which seem to be fixed in Xapian version 1.0.9, thanks. Kevin Duraj http://myhealthcare.com/search?q=gastrointestinal+virus
    Kevin Duraj
    Nov 4, 2008 at 9:03 pm
    Nov 4, 2008 at 9:03 pm
Group Overview
group: xapian-discuss @

33 users for November 2008

  • Olly Betts: 41 posts
  • Henry: 22 posts
  • Zhiguo Li: 12 posts
  • Charlie Hull: 7 posts
  • James Aylett: 7 posts
  • Justine Demeyer: 7 posts
  • Torsten Foertsch: 7 posts
  • Richard Boulton: 6 posts
  • Jim Lynch: 5 posts
  • Robert Young: 4 posts
  • Yannick Warnier: 4 posts
  • Alex Neth: 3 posts
  • Daniel Ménard: 3 posts
  • Djcb: 3 posts
  • Kevin Duraj: 3 posts
  • Amias Channer: 2 posts
  • David Versmisse: 2 posts
  • Felix Antonius Wilhelm Ostmann: 2 posts
  • Florian Beer: 2 posts
  • Jinqian Huang: 2 posts