On behalf of the Data Intensive Infrastructure unit (DERI) [1], I'm
pleased to announce the first public version of SIREn (Semantic
Information Retrieval Engine).

SIREn, the Information Retrieval system at the core of the Semantic Web
Index Sindice, is now available for download and includes the full
source under Apache License 2.0. SIREn is based on best practices and
our own experience in solving large-scale semi-structured data search.
Our goal is to bring the benefits of state-of-the-art techniques for
semi-structured Information Retrieval into Lucene / Solr, and to provide
a full-featured search engine for semi-structured data.

This is our first release, and by no means you should consider it
feature complete or final. There is still much work to do, such as
improved ranking and new indexing schemes, but we believe it to already
be reasonably stable and useful in its current form.

Some examples of the possibility are:
- indexing plain n-triples documents,
- indexing entity-centric RDF description,
- indexing tabular data (IMDB)
and are available at

Source distributions are available at

Please visit our project site for more information at

Any and all feedback is welcome at

A special thanks to Nickolai Toupikov, Robert Fuller, Michele Catasta,
and Giovanni Tummarello who provided value suggestions and inputs to
make this project happen ... but also to the Data Intensive
Infrastructure Group and DERI.

[1] http://di2.deri.ie/
Renaud Delbru

To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

Search Discussions

Related Discussions

Discussion Navigation
viewthread | post
Discussion Overview
groupjava-user @
postedJul 23, '09 at 9:28a
activeJul 23, '09 at 9:28a

1 user in discussion

Renaud Delbru: 1 post



site design / logo © 2022 Grokbase