Julkaistu kehittämis- tai tutkimusraportti taikka -selvitys (D4)

TRC-Matcher and enhanced TRC-Matcher. New Tools for Automatic XML Schema Matching

Julkaisun tekijät: Lauri Mukkala, Jukka Arvo, Teijo Lehtonen, Timo Knuutila

Kustantaja: University of Turku

Paikka: Turku

Julkaisuvuosi: 2017

Sarjan nimi: University of Turku Technical Reports

Numero sarjassa: 13

eISBN: 978-951-29-6856-5

Verkko-osoite: http://urn.fi/URN:ISBN:978-951-29-6856-5

Rinnakkaistallenteen osoite: https://research.utu.fi/converis/portal/detail/Publication/29235897


Modern society depends on the access to a wide range of information that is located
in heterogeneous data sources. Schema matching is a task of finding relationships
among data source elements automatically. However, most of the existing schema
matching software are semi-automatic meaning that they need a lot of interaction
from an expert familiar with the systems being integrated. In this work, we propose
a new hybrid matcher algorithm, called TRC-matcher, that is targeted for matching
business oriented XML schemas with none or minor user assistance. When compared
to previously published schema matching methods, the efficiency of the new
algorithm is based on a new content profiling algorithm and on intelligent combination
of matching results of multiple matching algorithms. In addition, an enhanced
version of the TRC-Matcher is introduced that combines machine learning methods
together with few new matching algorithms.

Ladattava julkaisu

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.

Last updated on 2022-07-04 at 16:46