Maintenance outage for upgrade on Sunday, September 22

This site, docs.bmc.com, will be inaccessible for two hours starting at 8 AM CDT, Sunday, September 22, for a platform upgrade.

    Page tree
    Skip to end of metadata
    Go to start of metadata
    The full text of this page is only available to our customers.

    Please login or sign up. You may also need to provide your support ID if you have not already done so.

    Discover with BMC Discovery
    download

    This product can be discovered by Enterprise version of BMC Discovery, but you can still Download our free Community Edition to discover [other products] !

    What is this?
    This is a product information page, containing details of the information that BMC Discovery gathers about a product and how it is obtained.
    Product Name
    Tika
    Publisher Page
    Apache
    Category
    Search and Discovery
    Release
    TKU 2015-Mar-1
    Change History
    Apache Tika - Change History
    Reports & Attributes
    Apache Tika - Reports & Attributes
    Publisher Link
    Apache

    Product Description

    The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). All of these file types can be parsed through a single interface, making Tika useful for search engine indexing, content analysis, translation, and much more.

    Apache Tika was born out of the Apache Lucene and Apache Incubator products.

    Known Versions

    The full text of this page is only available to our customers.

    Please login or sign up. You may also need to provide your support ID if you have not already done so.