Dear all,
Currently I am using Lucene jave 2.3.2 demo to parse Microsoft 2003 and 2007
docs and PDF files.
It is able to parse files with *.pdf, *.doc, *.xls etc.
But it does not search in files of Microsoft 2007 docs.
It shows indexing *.docx and other Microsoft 2007 doc files.
Does Lucene java supports parsing of extensions *.docx, *.pptx, *.mpp i.e.
Microsoft Windows 2007 documents?
If it supports, what should be done in Lucene demo 2.3.2 to search queries
on file with above mentioned extensions?
Thanks
Kumar