Open Access Article SciPap-749
The Performance Efficiency of the Virtual Hadoop Using Open Big Data
by Martin Lněnička 1,* and Jitka Komárková 2 iD icon

1 Faculty of Economics and Administration, Institute of System Engineering and Informatics, University of Pardubice, Studenská 84, Pardubice 53210, Czechia

2 Faculty of Economics and Administration, Institute of System Engineering and Informatics, University of Pardubice, Studenská 84, Pardubice 53210, Czechia

* Authors to whom correspondence should be addressed.

Abstract: Public sector institutions nowadays maintain a large amount of data from various domains. This data represents a potential resource that businesses and citizens can use to enhance their own datasets or which can be used to develop new products and public services. Open data support the emergence and realization of the big data potential. While it enhances the volume and velocity of available data, its main impact is on the variety of data sources. This paper deals with the deployment of the Virtual Hadoop for the processing of the open big data idea in the public sector. The first part of this paper is based on the literature review of the cloud computing, the distributed processing of data, big / open / linked data and theirs sources on the web. The primary aim of the Virtual Hadoop deployment is to test the performance efficiency using open big data in order to obtain the direction of the future research. The last part then introduces the most important findings and recommendations.

Keywords: Public Sector, Open Big Data, Virtual Hadoop, Data Processing, Performance Efficiency, Cloud Computing

JEL classification:   C55 - Large Data Sets: Modeling and Analysis,   C63 - Computational Techniques • Simulation Modeling,   H83 - Public Administration • Public Sector Accounting and Audits,   L86 - Information and Internet Services • Computer Software

SciPap 2015, 23(1), 749

Received: 19 June 2014 / Accepted: 8 April 2015 / Published: 27 April 2015