The OpenDocument web site is no longer accepting new posts. Information on this page is preserved for legacy purposes only. For current information on ODF, please see the OASIS OpenDocument Technical Committee.

Apache Tika

Apache Tika, a subproject of Apache Lucene, is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.
Support for ODF was added in Tika 0.3.
