Towards automatic topical classification of LOD datasets


Meusel, Robert ; Spahiu, Blerina ; Bizer, Christian ; Paulheim, Heiko



URL: http://ceur-ws.org/Vol-1409/paper-03.pdf
Additional URL: http://www.slideshare.net/BlerinaSpahiu/towards-au...
Document Type: Conference or workshop publication
Year of publication: 2015
Book title: LDOW 2015 : Proceedings of the Workshop on Linked Data on the Web ; co-located with the 24th International World Wide Web Conference (WWW 2015) ; Florence, Italy, May 19th, 2015
The title of a journal, publication series: CEUR Workshop Proceedings
Volume: 1409
Page range: Paper 03
Date of the conference: May 19th 2015
Publisher: Bizer, Christian
Place of publication: Aachen
Publishing house: RWTH
ISSN: 1613-0073
Publication language: English
Institution: School of Business Informatics and Mathematics > Wirtschaftsinformatik V (Bizer)
Subject: 004 Computer science, internet
Keywords (English): LOD , LDOW , Classification
Abstract: The datasets that are part of the Linking Open Data cloud diagramm (LOD cloud) are classified into the following topical categories: media, government, publications, life sciences, geographic, social networking, user-generated content, and cross-domain. The topical categories were manually assigned to the datasets. In this paper, we investigate to which extent the topical classification of new LOD datasets can be automated using machine learning techniques and the existing annotations as supervision. We conducted experiments with different classification techniques and different feature sets. The best classification technique/feature set combination reaches an accuracy of 81.62% on the task of assigning one out of the eight classes to a given LOD dataset. A deeper inspection of the classification errors reveals problems with the manual classification of datasets in the current LOD cloud.
Additional information: Online-Ressource

Dieser Eintrag ist Teil der Universitätsbibliographie.




+ Citation Example and Export

Meusel, Robert ; Spahiu, Blerina ; Bizer, Christian ; Paulheim, Heiko Towards automatic topical classification of LOD datasets. Bizer, Christian CEUR Workshop Proceedings 1409 Paper 03 In: LDOW 2015 : Proceedings of the Workshop on Linked Data on the Web ; co-located with the 24th International World Wide Web Conference (WWW 2015) ; Florence, Italy, May 19th, 2015 (2015) Aachen [Conference or workshop publication]


+ Search Authors in

+ Page Views

Hits per month over past year

Detailed information



You have found an error? Please let us know about your desired correction here: E-Mail


Actions (login required)

Show item Show item