Synonym extraction of medical terms from clinical text using combinations of word space models

Aron Henriksson; Hans Moen; Maria Skeppstedt; Ann-Marie Eklund (Department of Swedish Language); Vidas Daudaravicius; Martin Hassel
5th International Symposium on Semantic Mining in Biomedicine (SMBM), 3rd-4th September 2012, Zurich, 2012, pp. 10-17
Conference paper, peer reviewed
In information extraction, it is useful to know if two signifiers have the same or very similar semantic content. Maintaining such information in a controlled vocabulary is, however, costly. Here it is demonstrated how synonyms of medical terms can be extracted automatically from a large corpus of clinical text using distributional semantics. By combining Random Indexing and Random Permutation, different lexical semantic aspects are captured, effectively increasing our ability to identify synonymic relations between terms. 44% of 340 synonym pairs from MeSH are successfully extracted in a list of ten suggestions. The models can also be used to map abbreviations to their full-length forms; simple pattern-based filtering of the suggestions yields substantial improvements.
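The combination of the two word space models described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the dimensionality, window size, use of coordinate rotation as the permutation, and the choice to combine the models by summing their cosine scores are all assumptions made for illustration, and the function names (build_spaces, suggest) are hypothetical.

```python
import math
import random
from collections import defaultdict

DIM = 200      # dimensionality of the word space (illustrative assumption)
NONZERO = 8    # number of +1/-1 entries per index vector (assumption)
WINDOW = 2     # context words considered on each side (assumption)


def index_vector(rng):
    """Sparse ternary index vector: a few random +1/-1 entries, zeros elsewhere."""
    vec = [0] * DIM
    for i, pos in enumerate(rng.sample(range(DIM), NONZERO)):
        vec[pos] = 1 if i % 2 == 0 else -1
    return vec


def rotate(vec, shift):
    """Permute a vector by rotating its coordinates `shift` steps."""
    shift %= DIM
    return vec[-shift:] + vec[:-shift] if shift else vec[:]


def build_spaces(sentences, seed=0):
    """Return (ri, rp): context vectors without and with word-order information."""
    rng = random.Random(seed)
    vocab = sorted({tok for sent in sentences for tok in sent})
    index = {tok: index_vector(rng) for tok in vocab}
    ri = defaultdict(lambda: [0] * DIM)
    rp = defaultdict(lambda: [0] * DIM)
    for sent in sentences:
        for i, target in enumerate(sent):
            lo, hi = max(0, i - WINDOW), min(len(sent), i + WINDOW + 1)
            for j in range(lo, hi):
                if j == i:
                    continue
                iv = index[sent[j]]
                # Random Permutation: rotate by the neighbour's relative position.
                permuted = rotate(iv, j - i)
                for d in range(DIM):
                    ri[target][d] += iv[d]
                    rp[target][d] += permuted[d]
    return dict(ri), dict(rp)


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def suggest(term, ri, rp, k=10):
    """Rank candidate synonyms by the sum of cosine scores from both spaces."""
    scores = {
        other: cosine(ri[term], ri[other]) + cosine(rp[term], rp[other])
        for other in ri
        if other != term
    }
    return sorted(scores, key=scores.get, reverse=True)[:k]


if __name__ == "__main__":
    # Toy corpus (invented): "infarct" and "mi" occur in identical contexts.
    toy = [
        ["patient", "had", "infarct", "yesterday"],
        ["patient", "had", "mi", "yesterday"],
    ]
    ri_space, rp_space = build_spaces(toy)
    print(suggest("infarct", ri_space, rp_space, k=3))
```

In this toy setting the two terms share exactly the same neighbours at the same positions, so both models agree. On real clinical text the two spaces would be expected to rank candidates differently, which is what makes combining them worthwhile.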
Computer and Information Sciences ->
Language Technology (Computational Linguistics)
2012-09-03 16:17
2012-09-04 12:20

