We consider the feasibility of processing billions of RDFtriples on a single commodity machine using streaming andsorting techniques and focusing on RDF processing tasksrelevant for Linked Data consumption: data filtering andtransformation, RDFS inference, owl:sameAs smushing andstatistics extraction. To investigate this research questionwe built rdfpro (RDF Processor), an open source tool thatprovides streaming and sorting-based processors for the consideredtasks and allows their sequential and parallel compositionin complex pipelines. An empirical evaluation ofrdfpro in four application scenario—dataset analysis, filtering,merging and massaging—shows the effectiveness of thetool and allows to positively answer our research question.
Processing Billions of RDF Triples on a Single Machine using Streaming and Sorting
Rospocher, Marco;
2015-01-01
Abstract
We consider the feasibility of processing billions of RDFtriples on a single commodity machine using streaming andsorting techniques and focusing on RDF processing tasksrelevant for Linked Data consumption: data filtering andtransformation, RDFS inference, owl:sameAs smushing andstatistics extraction. To investigate this research questionwe built rdfpro (RDF Processor), an open source tool thatprovides streaming and sorting-based processors for the consideredtasks and allows their sequential and parallel compositionin complex pipelines. An empirical evaluation ofrdfpro in four application scenario—dataset analysis, filtering,merging and massaging—shows the effectiveness of thetool and allows to positively answer our research question.File | Dimensione | Formato | |
---|---|---|---|
sac2015.pdf
non disponibili
Dimensione
454.77 kB
Formato
Adobe PDF
|
454.77 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.