Resource details


Statistical data about cities, regions, and countries is collected for various purposes and by various institutions. Yet, while access to recent, high-quality data of this kind is crucial both for decision makers and for the public, all too often such collections of data remain isolated and not reusable, let alone properly integrated. In this paper we present the Open City Data Pipeline, a focused attempt to collect, integrate, and enrich statistical data collected at city level worldwide, and to republish this data in a reusable manner as Linked Data. The main features of the Open City Data Pipeline are: (i) we integrate and cleanse data from several sources in a modular, extensible, and always up-to-date fashion; (ii) we use both Machine Learning techniques and ontological reasoning over equational background knowledge to enrich the data by imputing missing values; (iii) we assess the estimated accuracy of such imputations per indicator. Additionally, (iv) we make the integrated and enriched data available both in a web browser interface and as machine-readable Linked Data, using standard vocabularies such as QB and PROV, and linking to, e.g., DBpedia. Lastly, in an exhaustive evaluation of our approach, we compare our enrichment and cleansing techniques to a preliminary version of the Open City Data Pipeline presented at ISWC2015: firstly, we demonstrate that the combination of equational knowledge and standard machine learning techniques significantly helps to improve the quality of our missing value imputations; secondly, we arguably show that the more data we integrate, the more reliable our predictions become. Hence, over time, the Open City Data Pipeline shall provide a sustainable effort to serve Linked Data about cities in increasing quality.
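
To make the enrichment idea in the abstract concrete, the following is a minimal sketch and not the authors' implementation. It assumes a single hypothetical equation (density = population / area), made-up city records, and a per-indicator median as a simple stand-in for the Machine Learning imputation the paper refers to. All names (`cities`, `apply_equation`, `impute_median`, `enrich`) are illustrative.

```python
from statistics import median

# Hypothetical toy records; the values are invented for illustration only.
cities = [
    {"city": "A", "population": 1_000_000.0, "area_km2": 400.0, "density": None},
    {"city": "B", "population": None, "area_km2": 200.0, "density": 3000.0},
    {"city": "C", "population": 500_000.0, "area_km2": None, "density": None},
    {"city": "D", "population": 200_000.0, "area_km2": 100.0, "density": 2000.0},
]

INDICATORS = ("population", "area_km2", "density")


def apply_equation(row):
    """Fill one missing term of density = population / area_km2
    when the other two terms are known (assumed equational knowledge)."""
    p, a, d = row["population"], row["area_km2"], row["density"]
    filled = dict(row)
    if d is None and p is not None and a:
        filled["density"] = p / a
    elif p is None and d is not None and a is not None:
        filled["population"] = d * a
    elif a is None and p is not None and d:
        filled["area_km2"] = p / d
    return filled


def impute_median(rows, indicator):
    """Replace remaining gaps of one indicator with the median of its known
    values (a simple stand-in for a learned imputation model)."""
    known = [r[indicator] for r in rows if r[indicator] is not None]
    if not known:
        return rows  # nothing to learn from; leave the gaps in place
    fallback = median(known)
    return [
        dict(r, **{indicator: fallback}) if r[indicator] is None else r
        for r in rows
    ]


def enrich(rows):
    """Exact equational fills first, statistical fallback second."""
    rows = [apply_equation(r) for r in rows]
    for indicator in INDICATORS:
        rows = impute_median(rows, indicator)
    return rows


enriched = enrich(cities)
for row in enriched:
    print(row)
```

Running the equation pass first means that values which can be derived exactly are never overwritten by an estimate, and the derived values also enlarge the set of known observations available to the fallback step. A fuller pipeline would additionally record which method produced each value and estimate the imputation error per indicator, as the abstract describes.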

Belongs to

ePub-WU OAI Archive (Vienna Univ. of Econ. and B.A.)  


Bischof, Stefan - Kämpgen, Benedikt - Harth, Andreas - Polleres, Axel - Schneider, Patrik

Id.: 69695179

Language: English

Version: 1.0

Status: Final

Type: application/pdf

Keywords: open data, data cleaning, data integration

Resource type: Paper - NonPeerReviewed

Interactivity type: Expositive

Interactivity level: very low

Audience: Student - Teacher - Author

Structure: Atomic

Cost: no

Copyright: yes

Formats: application/pdf

Technical requirements: Browser: Any

Relation: [References] http://epub.wu.ac.at/5438/

Contribution date: 09-Mar-2017



Other resources by the same author(s)

  1. Four heuristics to guide structured content crawling Provided by the author(s) and NUI Galway in accordance with publisher policies. Please cite the publ...
  2. Previous version: Copyright © 2004 DERI®, All Rights Reserved. DERI liability, trademark, document use, and soft...
  3. Towards Fine-grained Service Matchmaking by Using Concept Similarity Provided by the author(s) and NUI Galway in accordance with publisher policies. Please cite the publ...
  4. Scalable Authoritative OWL Reasoning on a Billion Triples available.

Other resources from the same collection

  1. Six Dimensions of Concentration in Economics: Scientometric Evidence from a Large-Scale Data Set This paper scientometrically investigates concentration in economics between 1956 and 2016 using a l...
  2. The macroeconomic effects of international uncertainty shocks We propose a large-scale Bayesian VAR model with factor stochastic volatility to investigate the mac...
  3. Structural breaks in Taylor rule based exchange rate models - Evidence from threshold time varying parameter models In this note we develop a Taylor rule based empirical exchange rate model for eleven major currencie...
  4. The shortage of safe assets in the US investment portfolio: Some international evidence This paper develops a Bayesian Global VAR (GVAR) model to track the international transmission dynam...
  5. The true art of the tax deal: Evidence on aid flows and bilateral double tax agreements Out of a total of 2,976 double tax agreements (DTAs), some 60% are signed between a developing and a...
