dc.contributor.author | Christodoulou, Klitos | |
dc.contributor.author | Paton, Norman W. | |
dc.contributor.author | Fernandes, Alvaro A.A. | |
dc.date.accessioned | 2015-12-06T08:02:40Z | |
dc.date.available | 2015-12-06T08:02:40Z | |
dc.date.issued | 2015-02-24 | |
dc.identifier.isbn | 9783662465622 | |
dc.identifier.uri | http://hdl.handle.net/11728/6276 | |
dc.description.abstract | Linked Data (LD) overlays the World Wide Web of documents with a Web of Data. This is becoming significant as shown in the growth of LD repositories available as part of the Linked Open Data (LOD) cloud. At the instance-level, LD sources use a combination of terms from various vocabularies, expressed as RDFS/OWL, to describe data and publish it to the Web. However, LD sources do not organise data to conform to a specific structure analogous to a relational schema; instead data can adhere to multiple vocabularies. Expressing SPARQL queries over LD sources – usually over a SPARQL endpoint that is presented to the user – requires knowledge of the predicates used so as to allow queries to express user requirements as graph patterns. Although LD provides low barriers to data publication using a single language (i.e., RDF), sources organise data with different structures and terminologies. This paper describes an approach to automatically derive structural summaries over instance-level data expressed as RDF triples. The technique builds on a hierarchical clustering algorithm that organises RDF instance-level data into groups that are then utilised to infer a structural summary over a LD source. The resulting structural summaries are expressed in the form of classes, properties and, relationships. Our experimental evaluation shows good results when applied to different types of LD sources. | en_UK |
dc.language.iso | en | en_UK |
dc.publisher | Springer Link | en_UK |
dc.relation.ispartofseries | Lecture Notes in Computer Science pp 1-25;Volume 8990 | |
dc.rights | © Springer-Verlag Berlin Heidelberg 2015 | en_UK |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | en_UK |
dc.subject | Schema | en_UK |
dc.subject | Linked Data | en_UK |
dc.subject | Clustering | en_UK |
dc.subject | Query formulation | en_UK |
dc.title | Structure Inference for Linked Data Sources Using Clustering | en_UK |
dc.type | Book chapter | en_UK |
dc.doi | 10.1007/978-3-662-46562-2 1 | |