Although the quantity of structured information on the Web and within organizations is increasing, the majorityof information remains available only in unstructured form. While different in form, both unstructuredand structured information sources provide information about entities in the world and their properties andrelations;still, frameworks for their seamless integration have not been deeply investigated. In this paper theauthors describe the KnowledgeStore, a scalable, fault-tolerant, and Semantic Web grounded open-sourcestorage system for interlinking structured and unstructured data. They present the concept, design, functionand implementation of the system, and report on its concrete usage in three application scenarios within theNewsReader EU project,where itstores and supports the querying of millions of news articles interlinked withmillions of RDF triples extracted from text and imported from Linked Open Data sources. The authors reporton data population and data retrieval performances of the system measured through a number of experiments,and they also discuss the practical issues and lessons learned from these experiences.

The KnowledgeStore: a Storage Framework for Interlinking Unstructured and Structured Knowledge

Rospocher Marco;
2015-01-01

Abstract

Although the quantity of structured information on the Web and within organizations is increasing, the majorityof information remains available only in unstructured form. While different in form, both unstructuredand structured information sources provide information about entities in the world and their properties andrelations;still, frameworks for their seamless integration have not been deeply investigated. In this paper theauthors describe the KnowledgeStore, a scalable, fault-tolerant, and Semantic Web grounded open-sourcestorage system for interlinking structured and unstructured data. They present the concept, design, functionand implementation of the system, and report on its concrete usage in three application scenarios within theNewsReader EU project,where itstores and supports the querying of millions of news articles interlinked withmillions of RDF triples extracted from text and imported from Linked Open Data sources. The authors reporton data population and data retrieval performances of the system measured through a number of experiments,and they also discuss the practical issues and lessons learned from these experiences.
2015
KnowledgeStore, Linked Open Data Sources, NewsReader EU Project, Structured Information, Unstructured Information
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11562/990117
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 14
  • ???jsp.display-item.citation.isi??? 9
social impact