m-Networks: Adapting the Triplet Networks for Acronym Disambiguation - UTU Research Portal

A4 Refereed article in a conference publication

m-Networks: Adapting the Triplet Networks for Acronym Disambiguation

Authors: Seneviratne Sandaru, Daskalaki Elena, Lenskiy Artem, Suominen Hanna

Editors: Tristan Naumann, Steven Bethard, Kirk Roberts, Anna Rumshisky

Conference name: Clinical Natural Language Processing Workshop

Publisher: Association for Computational Linguistics (ACL)

Publication year: 2022

Book title : Proceedings of the 4th Clinical Natural Language Processing Workshop

Journal name in source: ClinicalNLP 2022 - 4th Workshop on Clinical Natural Language Processing, Proceedings

First page : 21

Last page: 29

ISBN: 978-1-955917-77-3

Publication's open availability at the time of reporting: Open Access

Publication channel's open availability : Open Access publication channel

Web address : https://aclanthology.org/2022.clinicalnlp-1.3/

Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/176821003

Self-archived copy's licence: CC BY

Self-archived copy's version: Publisher`s PDF

Abstract

Acronym disambiguation (AD) is the process of identifying the correct expansion of the acronyms in text. AD is crucial in natural language understanding of scientific and medical documents due to the high prevalence of technical acronyms and the possible expansions. Given that natural language is often ambiguous with more than one meaning for words, identifying the correct expansion for acronyms requires learning of effective representations for words, phrases, acronyms, and abbreviations based on their context. In this paper, we proposed an approach to leverage the triplet networks and triplet loss which learns better representations of text through distance comparisons of embeddings. We tested both the triplet network-based method and the modified triplet network-based method with m networks on the AD dataset from the SDU@AAAI-21 AD task, CASI dataset, and MeDAL dataset. F scores of 87.31%, 70.67%, and 75.75% were achieved by the m network-based approach for SDU, CASI, and MeDAL datasets respectively indicating that triplet network-based methods have comparable performance but with only 12% of the number of parameters in the baseline method. This effective implementation is available at https://github.com/sandaruSen/m_networks under the MIT license.

Downloadable publication

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.

2022.clinicalnlp-1.3.pdf