Apache Tika

Apache Tika, a subproject of Apache Lucene, is a toolkit for detecting document types and extracting metadata and structured text content from a variety of document formats by using existing parser libraries.
Support for ODF was added in Tika 0.3.
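
To make "extracting metadata and structured text content" concrete for ODF, here is a minimal sketch in Python using only the standard library. It is not Tika's API or implementation; it only illustrates the kind of information a parser can recover from an ODF package (a ZIP archive containing `mimetype`, `meta.xml`, and `content.xml`). The function names `extract_odf` and `make_sample_odt` and the sample document are hypothetical, introduced for illustration.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# XML namespaces used by OpenDocument packages.
NS = {
    "office": "urn:oasis:names:tc:opendocument:xmlns:office:1.0",
    "dc": "http://purl.org/dc/elements/1.1/",
    "text": "urn:oasis:names:tc:opendocument:xmlns:text:1.0",
}


def extract_odf(data: bytes) -> dict:
    """Return the declared MIME type, title, and paragraph text of an ODF package."""
    with zipfile.ZipFile(io.BytesIO(data)) as pkg:
        # Type detection: the package declares its type in the "mimetype" entry.
        mimetype = pkg.read("mimetype").decode("ascii").strip()
        meta = ET.fromstring(pkg.read("meta.xml"))
        content = ET.fromstring(pkg.read("content.xml"))
    # Metadata: Dublin Core title stored in meta.xml.
    title = meta.findtext(".//dc:title", default="", namespaces=NS)
    # Text content: concatenate the text of each paragraph, including nested spans.
    paragraphs = ["".join(p.itertext()) for p in content.iterfind(".//text:p", NS)]
    return {"mimetype": mimetype, "title": title, "text": "\n".join(paragraphs)}


def make_sample_odt() -> bytes:
    """Build a tiny, hypothetical ODF text package in memory for the demo."""
    meta = (
        f'<office:document-meta xmlns:office="{NS["office"]}" xmlns:dc="{NS["dc"]}">'
        "<office:meta><dc:title>Sample</dc:title></office:meta>"
        "</office:document-meta>"
    )
    content = (
        f'<office:document-content xmlns:office="{NS["office"]}" xmlns:text="{NS["text"]}">'
        "<office:body><office:text>"
        "<text:p>Hello</text:p>"
        "<text:p>ODF <text:span>world</text:span></text:p>"
        "</office:text></office:body>"
        "</office:document-content>"
    )
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as pkg:
        pkg.writestr("mimetype", "application/vnd.oasis.opendocument.text")
        pkg.writestr("meta.xml", meta)
        pkg.writestr("content.xml", content)
    return buf.getvalue()


if __name__ == "__main__":
    print(extract_odf(make_sample_odt()))
```

A toolkit such as Tika wraps this kind of format-specific work behind a common interface across many formats; the sketch covers a single format and ignores styles, tables, and error handling.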
