Several resources specialized to medical domain are available.
Toolkits and Libraries
- Apache cTakes (clinical Text Analysis Knowledge Extraction System): cTakes is a natural language processing system for extraction of information from electronic medical record clinical free-text.
- MetaMap: An application to map biomedical text to the UMLS Metathesaurus or, equivalently, to discover Metathesaurus concepts referred to in text. MetaMap uses a knowledge-intensive approach based on symbolic, natural-language processing (NLP) and computational-linguistic techniques. Besides being applied for both IR and data-mining applications, MetaMap is one of the foundations of NLM's Medical Text Indexer (MTI) which is being used for both semiautomatic and fully automatic indexing of biomedical literature at NLM.
- CLAMP (Clinical Language Annotation, Modeling and Processing Toolkit) : A comprehensive set of tools combined with a graphical user interface to visually develop natural language processing projects.
General Domain
- Apache Lucene: A search engine library forms the basis of Elasticsearch and Apache Solr Applications
Knowledge Resources
- UMLS (Unified Medical Language System): A comprehensive knowledgebase for medical terminologies.
- MT Samples: 5000 clinical notes for developing and testing NLP models.
No comments:
Post a Comment