Edit Paper: SERIMI – Resource Description Similarity, RDF Instance Matching and Interlinking

Jump to: navigation, search

You do not have permission to edit this page, for the following reason:

The action you have requested is limited to users in the group: Users.


SERIMI – Resource Description Similarity, RDF Instance Matching and Interlinking

Abstract[edit]

[[has abstract:={{{Abstract}}}]]

Conclusion[edit]

[[has conclusion:={{{Conclusion}}}]]

Future work[edit]

[[has future work:={{{Future work}}}]]

Approach[edit]

Positive Aspects: [[Has PositiveAspects::{{{PositiveAspects}}}]]

Negative Aspects: [[Has NegativeAspects::{{{NegativeAspects}}}]]

Limitations: [[Has Limitations::{{{Limitations}}}]]

Challenges: [[Has Challenges::{{{Challenges}}}]]

Proposes Algorithm: [[Proposes Algorithm::{{{ProposesAlgorithm}}}]]

Methodology: [[uses Methodology::{{{Methodology}}}]]

Requirements: [[Has Requirements::{{{Requirements}}}]]

Limitations: [[Has Limitations::{{{Limitations}}}]]

Implementations[edit]

Download-page: [[Has Downloadpage::{{{Download-page}}}]]

Access API: [[access API::{{{API}}}]]

Information Representation: [[Has InfoRepresentation::{{{InfoRepresentation}}}]]

Data Catalogue: [[Has DataCatalouge::{{{Catalogue}}}]]

Runs on OS: [[ runsOn OS::{{{OS}}}]]

Vendor: [[Has vendor::{{{vendor}}}]]

Uses Framework: [[Uses Framework::{{{Framework}}}]]

Has Documentation URL: [[Has DocumentationURL::{{{DocumentationURL}}}]]

Programming Language: [[implementedIn ProgLang::{{{ProgLang}}}]]

Version: [[has Version ::{{{Version}}}]]

Platform: [[has platform::{{{Platform}}}]]

Toolbox: [[uses Toolbox::{{{Toolbox}}}]]

GUI: has GUI::No

Research Problem[edit]

Subproblem of: [[Has Subproblem::{{{Subproblem}}}]]

RelatedProblem: [[Has relatedProblem::{{{RelatedProblem}}}]]

Motivation: [[Has motivation::{{{Motivation}}}]]

Evaluation[edit]

Experiment Setup: [[Has ExperimentSetup::{{{ExperimentSetup}}}]]

Evaluation Method : [[Has EvaluationMethod::{{{EvaluationMethod}}}]]

Hypothesis: [[Has Hypothesis::{{{Hypothesis}}}]]

Description: [[Has Description::{{{Description}}}]]

Dimensions: [[Has Dimensions::{{{Dimensions}}}]]

Benchmark used: [[has Benchmark::{{{Benchmark}}}]]

Results: [[Has Results::{{{Results}}}]]


[edit]
Paper Title*: Full title of the given paper
Subject: Primary subject or topic of research of the paper.
Author(s):
Publication venue: Publication venue of the paper.
Year: The year in which the paper is published.
Keywords:
Abstract: Abstract of the paper.
Conclusion:
Future work: Future ideas of the paper.
Problem: The research problem.
Approach: Approach used to achieve the goal.
Implementation: The implementation/framework that is invoked when running.
Evaluation: Proposed work Evaluation.
Positive Aspects:
Negative Aspects:
Limitations:
Challenges:
Proposes Algorithm:
Proposes Model:
Methodology:
Requirements:
Download-page:
Access API:
Information Representation:
Data Catalogue:
OS:
Vendor: Organization: commercial, free or otherwise
Uses Framework: Framework name
Documentation URL: Link to Documentation webpage.
Programming Language:
Version:
Platform:
Toolbox:
GUI: whether the implementation has a GUI?
Subproblem of: A broader research problem.
Related Problem: A another research problem.
Motivation: The motivation to solve the problem.


Experiment setup:
Evaluation method : A broader research problem.
Hypothesis: A another research problem.
Description: The motivation to solve the problem.
Dimensions:
Select evaluation dimension.
Benchmark: Benchmark used for evaluation.
Results:



Paper description:

Cancel

Access APINo data available now. +
Event in seriesArXiv +
Has BenchmarkDBpedia +, Sider +, DrugBank +, LinkedCT +, Dailymed +, Diseasome + and TCM +
Has ChallengesNo data available now. +
Has DataCatalouge{{{Catalogue}}} +
Has DescriptionNo data available now. +
Has DimensionsAccuracy +
Has DocumentationURLhttp://No data available now. +
Has Downloadpagehttps://github.com/samuraraujo/SERIMI-RDF-Interlinking +
Has EvaluationAccuracy Evaluation +
Has EvaluationMethodIn order to evaluate the effectiveness of the proposed interlinking method, we used the precision, recall and F1 metrics. +
Has ExperimentSetupWe have loaded all these datasets into an
We have loaded all these datasets into an open-source instance of Virtuoso Universal server 10 , where around 2GB of data were loaded. An exception was the DBPedia dataset, which we accessed online via its Sparql endpoint. The Virtuoso server was installed in a Mac OS X – version 10.5.8, with 2.4 GHz Intel Core 2 Duo processor and with 4 GB 1067 MHz DDR3 of memory. We ran the script that implements the SERIMI approach directly over the local SPARQL endpoints and DBPedia online endpoint.
RQL endpoints and DBPedia online endpoint. +
Has GUINo +
Has HypothesisNo data available now. +
Has ImplementationSERIMI +
Has InfoRepresentationNo data available now. +
Has LimitationsNo data available now. +
Has NegativeAspectsNo data available now. +
Has PositiveAspectsNo data available now. +
Has RequirementsNo data available now. +
Has ResultsNo data available now. +
Has SubproblemNo data available now. +
Has VersionNo data available now. +
Has abstractThe interlinking of datasets published in
The interlinking of datasets published in the Linked Data Cloud is a challenging problem and a key factor for the success of the Semantic Web. Manual rule-based methods are the most effective solution for the problem, but they require skilled human data publishers going through a laborious, error prone and time-consuming process for manually describing rules mapping instances between two datasets. Thus, an automatic approach for solving this problem is more than welcome. In this paper, we propose a novel interlinking method, SERIMI, for solving this problem automatically. SERIMI matches instances between a source and a target datasets, without prior knowledge of the data, domain or schema of these datasets. Experiments conducted with benchmark collections demonstrate that our approach considerably outperforms state-of-the-art automatic approaches for solving the interlinking problem on the Linked Data Cloud.
rlinking problem on the Linked Data Cloud. +
Has approachNo data available now. +
Has authorsSamur Araujo +, Jan Hidders +, Daniel Schwabe +, Arjen P. de Vries + and Abraham Bernstein +
Has conclusionRDF instance matching in the context of in
RDF instance matching in the context of interlinking RDF datasets published in the Linked Data Cloud is the task of determining if two resources are referred to the same entity in the real world. This is a challenging task in high demand by data publishers that wish to interlink their datasets in the cloud.

In this work, we propose a novel approach, called SERIMI, for solving the RDF instance-matching problem automatically. SERIMI matches instances between a source and target datasets, without prior knowledge of the data, domain or schema of these datasets. It does so by approximating the notion of similarity by pairing instances based on entity labels as well as structural (ontological) context. As part of the SERIMI approach, we proposed the CRDS function to approximate that judgment of similarity.

We used two collections proposed by the OAEI 2010 initiative to evaluate SERIMI. On average, SERIMI outperforms two representative systems, RiMOM and ObjectCoref, which tried to solve the same problem using the same collections and reference alignment, in 70% of the cases.
reference alignment, in 70% of the cases. +
Has future workAs future work, we intend to investigate h
As future work, we intend to investigate how our model can be adjusted to consider partial string matching in the similarity function that we proposed, and to accommodate different score distribution metrics as the threshold for the parameter Also, we intend to evaluate this approach in different collections that may provide a more accurate reference alignment than the ones that we used in this work.
t than the ones that we used in this work. +
Has keywordsdata integration, RDF interlinking, instance matching, linked data, entity recognition, entity search. +
Has motivationNo data available now. +
Has platformNo data available now. +
Has problemLink Discovery +
Has relatedProblemNo data available now. +
Has subjectOntology matching +
Has vendorOpen Source +
Has year2011 +
ImplementedIn ProgLangRuby +
Proposes AlgorithmNo data available now. +
RunsOn OSMac OS X +
TitleSERIMI – Resource Description Similarity, RDF Instance Matching and Interlinking +
Uses FrameworkNo data available now. +
Uses MethodologyNo data available now. +
Uses ToolboxNo data available now. +