Page tree
Skip to end of metadata
Go to start of metadata
The full text of this page is only available to our customers.

Please login or sign up. You may also need to provide your support ID if you have not already done so.

Product Name
Tika
Publisher Page
Apache
Category
Search and Discovery
Release
TKU 2015-Mar-1
More Information
Publisher Link
Apache

Product Description

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). All of these file types can be parsed through a single interface, making Tika useful for search engine indexing, content analysis, translation, and much more.

Apache Tika was born out of the Apache Lucene and Apache Incubator products.

Known Versions

The full text of this page is only available to our customers.

Please login or sign up. You may also need to provide your support ID if you have not already done so.