What: Katta and a Case Study
When: November 10, 2008 6:30 PM
22 Cortlandt Street
New York, NY 10007
Learn more here and RSVP:
Stefan Groschupf will present two topics:
"Katta - how to distribute Lucene indexes in a grid."
Katta is a young open source project that helps to serve very large
indexes or very heavy loaded indexes.
Stefan will give an overview of the architecture, discuss functionality
and API. Further more Stefan will explain how to use katta as large
distributed data storage with xpath style queries support.
"A Case Study - An experience report and architectural overview of a
Stefan will share his experience building a production system to process
millions of events with a hadoop cluster, generate trend alerts and
reports with hadoop, pig and katta.
About the speaker:
Stefan is an active member of the Open Source Community working on
distributed file system, map reduce and search engine implementation
projects. Stefan also contributed to "Nutch" a set of things like the
nutch plugin system.
Over the past 10 years Stefan is consulting on Internet and database
projects for BMW, Intel, Siemens and Hoffmann La Roche, Krugle many
more. Currently Stefan is CTO at Sproose (a user powered search engine).