Refereed article in conference proceedings (A4)
Combining support vector machines and LSTM networks for chemical-protein relation extraction
List of Authors: Farrokh Mehryary, Jari Björne, Tapio Salakoski, Filip Ginter
Editors: Cecilia Arighi, Qinghua Wang, Cathy Wu
Conference name: BioCreative
Publication year: 2018
Book title *: Proceedings of the BioCreative VI Workshop
Start page: 175
End page: 179
ISBN: 978-84-948397-0-2
URL: http://www.biocreative.org/resources/publications/bcvi-proceedings/
Self-archived copy’s web address: https://research.utu.fi/converis/portal/detail/Publication/28819195
We present the results of our participation in the BioCreative VI: Text mining chemical-protein interactions (CHEMPROT) track. The goal of this task is to promote the development and evaluation of systems capable of extracting relations between chemical compounds/drug and genes/proteins from biomedical literature. We participate with two systems: (1) an SVM system which relies on a rich set of features extracted from the parse graph and (2) an ensemble of neural networks that utilize LSTM networks and generate features along the shortest path of dependencies. We also combine the predictions from the two systems with the goal of increasing performance. On the development set, our system combination approach outperforms the two individual systems, achieving an F-score of 61.09 (according to the official evaluation metric). On the test set, our SVM system achieves the highest result for our submissions with an F-score of 60.99.
Downloadable publication This is an electronic reprint of the original article. |