The OpenDocument XML.org web site is not longer accepting new posts. Information on this page is preserved for legacy purposes only. For current information on ODF, please see the OASIS OpenDocument Technical Committee.

Diff for Apache Tika

Wed, 2009-08-26 07:16 by BartHanssensWed, 2009-08-26 07:17 by BartHanssens

small layout changes

Changes to Description
-
Apache Tika - a subproject of Apache Lucene - is a toolkit for detecting and extracting metadata and
+
Apache Tika - a subproject of Apache Lucene - is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries. 
-
structured text content from various documents using existing parser
+
-
libraries. 
+
 
Support for ODF was added in Tika 0.3.
 
Support for ODF was added in Tika 0.3.
 
 
Current revision:

Apache Tika

Apache Tika - a subproject of Apache Lucene - is a toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries. 
Support for ODF was added in Tika 0.3.

XML.org Focus Areas: BPEL | DITA | ebXML | IDtrust | OpenDocument | SAML | UBL | UDDI
OASIS sites: OASIS | Cover Pages | XML.org | AMQP | CGM Open | eGov | Emergency | IDtrust | LegalXML | Open CSA | OSLC | WS-I