Pareto frontier for job execution and data transfer time in hybrid clouds
2014 (English)In: Future generations computer systems, ISSN 0167-739X, E-ISSN 1872-7115, Vol. 37, no 0, 321-334 p.Article in journal (Refereed) Published
This paper proposes a solution to calculate the Pareto frontier for the execution of a batch of jobs versus data transfer time for hybrid clouds. Based on the nature of the cloud application, jobs are assumed to require a number of data-files from either public or private clouds. For example, gene probes can be used to identify various infection agents such as bacteria, viruses, etc. The heavy computational task of aligning probes of a patient's DNA (private-data) with normal sequences (public-data) with various data sizes is the key to this process. Such files have different characteristics depends on their nature and could be either allowed for replication or not in the cloud. Files could be too big to replicate (big data), others might be small enough to be replicated but they cannot be replicated as they contain sensitive information (private data). To show the relationship between the execution time of a batch of jobs and the transfer time needed for their required data in hybrid cloud, we first model this problem as a bi-objective optimization problem, and then propose a Particle Swarm Optimization (PSO)-based approach, called here PSO-ParFnt, to find the relevant Pareto frontier. The results are promising and provide new insights into this complex problem.
Place, publisher, year, edition, pages
Elsevier, 2014. Vol. 37, no 0, 321-334 p.
Big data, Private data, Cloud bursting, Particle swarm optimization, Pareto frontier
Research subject Computer Science
IdentifiersURN: urn:nbn:se:kau:diva-46084ISI: 000337931200031OAI: oai:DiVA.org:kau-46084DiVA: diva2:970872
A, IF(2012) : 1.9782016-09-142016-09-142016-10-03Bibliographically approved