Apache Tika 0.8


License: Apache
Category: Development
Publisher: Apache Software Foundation
Size: 1.6 MB
Last Updated: 2013-10-02
Operating System: Mac OS X
Price: FREE
Publisher's description - Apache Tika 0.8
 

Apache Tika is a free and open source toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

What's New in This Release:

· Language identification is now dynamically configurable, managed via a config file loaded from the classpath. (TIKA-490)
· Tika now supports parsing feeds by wrapping the underlying Rome library. (TIKA-466)
· A quick-start guide for Tika parsing was contributed. (TIKA-464)
· An approach for plumbing through XHTML attributes was added. (TIKA-379)
· Media type hierarchy information is now taken into account when selecting the best parser for a given input document. (TIKA-298)
· Support for parsing common scientific data formats, including netCDF and HDF4/5, was added. (TIKA-400, TIKA-399)
· Unit tests for Windows have been fixed, allowing TestParsers to complete. (TIKA-398)
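
The TIKA-298 item above (taking media type hierarchy into account when selecting a parser) can be illustrated with a small sketch. This is not Tika's implementation or API: the supertype table, the parser registry, and the function name select_parser are all hypothetical, chosen only to show the idea of falling back from a specific media type to its more general ancestors.

```python
# Hypothetical supertype table (an assumption for illustration, not Tika's data):
# each media type maps to its more general parent type.
SUPERTYPES = {
    "application/rss+xml": "application/xml",
    "application/xhtml+xml": "application/xml",
    "application/xml": "text/plain",
    "text/html": "text/plain",
    "text/plain": "application/octet-stream",
}

# Hypothetical parser registry: media type -> parser label.
# The labels are placeholders, not real Tika class names.
PARSERS = {
    "application/rss+xml": "feed-parser",
    "application/xml": "xml-parser",
    "text/plain": "text-parser",
}


def select_parser(media_type, parsers=PARSERS, supertypes=SUPERTYPES):
    """Return the parser registered for media_type or its closest ancestor.

    Walks up the supertype chain until a registered parser is found.
    Returns None when no type in the chain has a parser.
    """
    seen = set()  # guards against accidental cycles in the table
    current = media_type
    while current is not None and current not in seen:
        if current in parsers:
            return parsers[current]
        seen.add(current)
        current = supertypes.get(current)
    return None
```

With these assumed tables, a type that has its own parser gets it directly, while a type without one (here application/xhtml+xml) falls back to the parser of its nearest registered ancestor (application/xml). A type outside the table, or one whose chain ends without a match, yields None.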


 

Also See ...
ojAlgo 30.1
ChessShell Pre-Alpha
Amethyst 2.0.0 Alpha 2.1
Nsound 0.8.1
Chilkat Perl Bounce Library 9.2.0



More
Pin Scheduled
Dictionnaire Français Turc
Templates Pro for MS Office
MarkDrop
Playr



Mac App
Dictionnaire Français Turc
Instabar
German FlashCards BASIC 1.0
TwitKit Plus 1.4.0
BalloonTip 1.0