LINNAEUS is a free and open source, general-purpose dictionary matching software, capable of processing multiple types of document formats in the biomedical domain (MEDLINE, PMC, BMC, OTMI, text, etc.).
LINNAEUS can produce multiple types of output (XML, HTML, tab-separated-value file, or save to a database).
LINNAEUS also contains methods for acting as a server (including load balancing across several servers), allowing clients to request matching over a network.
A package with files for recognizing and identifying species names is available for LINNAEUS, showing 94% recall and 97% precision compared to LINNAEUS-species-corpus.
Here are some key features of "LINNAEUS":
· Supports multiple document input formats (MEDLINE, PMC, BMC, OTMI, text)
· Supports multiple output formats (.tsv, XML, HTML, MySQL)
· Can act in a server/client networking mode
· Performs very time-effective matching through automatons, that can be generated from files containing regular expressions
Requirements:
· Java