A Comparison of Word Embeddings to Study Complications in Neurosurgery

Stud Health Technol Inform. 2022 Jan 14:289:5-8. doi: 10.3233/SHTI210845.

Abstract

Our study aimed to compare the capability of different word embeddings to capture the semantic similarity of clinical concepts related to complications in neurosurgery at the level of medical experts. Eighty-four sets of word embeddings (based on Word2vec, GloVe, FastText, PMI, and BERT algorithms) were benchmarked in a clustering task. FastText model showed the best close to the medical expertise capability to group medical terms by their meaning (adjusted Rand index = 0.682). Word embedding models can accurately reflect clinical concepts' semantic and linguistic similarities, promising their robust usage in medical domain-specific NLP tasks.

Keywords: NLP; Neurosurgery; clustering; complications; word embeddings.

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Linguistics
  • Neurosurgery*
  • Semantics