A4 Vertaisarvioitu artikkeli konferenssijulkaisussa

Extracting protein-protein interaction sentences by applying rough set data analysis




TekijätGinter F, Pahikkala T, Pyysalo S, Boberg J, Jarvinen J, Salakoski T

ToimittajaTsumoto Husaku, Slowinski Roman, Komorowski Jan, Grzymala-Busse Jerzy W

Konferenssin vakiintunut nimiFourth International Conference on Rough Sets and Current Trends in Computing

Julkaisuvuosi2004

JournalLecture Notes in Computer Science

Kokoomateoksen nimiProceedings of the Fourth International Conference on Rough Sets and Current Trends in Computing

Tietokannassa oleva lehden nimiROUGH SETS AND CURRENT TRENDS IN COMPUTING

Lehden akronyymiLECT NOTES ARTIF INT

Vuosikerta3066

Aloitussivu780

Lopetussivu785

Sivujen määrä6

ISBN3-540-22117-4

ISSN0302-9743


Tiivistelmä
In this paper, we introduce away to apply rough set data analysis to the problem of extracting protein-protein interaction sentences in biomedical literature. Our approach builds on decision rules of protein names, interaction words, and their mutual positions in sentences. In order to broaden the set of potential interaction words, we develop a morphological model which generates spelling and inflection variants of the interaction words. We evaluate the performance of the proposed method on a hand-tagged dataset of 1894 sentences and show a precision-recall break-even performance of 79,8% by using leave-one-out cross-validation.



Last updated on 2024-26-11 at 22:54