Apache Tika
Product: Submitted by BartHanssens on Wed, 2009-08-26 07:16. Last updated on Wed, 2009-08-26 07:17.
Apache Tika - a subproject of Apache Lucene - is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
Support for ODF was added in Tika 0.3.


