TY - JOUR
T1 - Performance evaluation of NoSQL big-data applications using multi-formalism models
AU - Barbierato, Enrico
AU - Gribaudo, Marco
AU - Iacono, Mauro
PY - 2014
Y1 - 2014
N2 - Starting with the birth of Web 2.0, the quantity of data managed by large-scale web services has grown exponentially, posing new challenges and infrastructure requirements. This has led to new programming paradigms and architectural choices, such as map-reduce and NoSQL databases, which constitute two of the main peculiarities of the specialized massively distributed systems referred to as Big Data architectures. The underlying computer infrastructures usually face complexity requirements, resulting from the need for efficiency and speed in computing over huge evolving data sets. This is achieved by taking advantage from the features of new technologies, such as the automatic scaling and replica provisioning of Cloud environments. Although performances are a key issue for the considered applications, few performance evaluation results are currently available in this field. In this work we focus on investigating how a Big Data application designer can evaluate the performances of applications exploiting the Apache Hive query language for NoSQL databases, built over a Apache Hadoop map-reduce infrastructure. This paper presents a dedicated modeling language and an application, showing first how it is possible to ease the modeling process and second how the semantic gap between modeling logic and the domain can be reduced, by means of vertical multiformalism modeling. © 2014 Elsevier B.V. All rights reserved.
AB - Starting with the birth of Web 2.0, the quantity of data managed by large-scale web services has grown exponentially, posing new challenges and infrastructure requirements. This has led to new programming paradigms and architectural choices, such as map-reduce and NoSQL databases, which constitute two of the main peculiarities of the specialized massively distributed systems referred to as Big Data architectures. The underlying computer infrastructures usually face complexity requirements, resulting from the need for efficiency and speed in computing over huge evolving data sets. This is achieved by taking advantage from the features of new technologies, such as the automatic scaling and replica provisioning of Cloud environments. Although performances are a key issue for the considered applications, few performance evaluation results are currently available in this field. In this work we focus on investigating how a Big Data application designer can evaluate the performances of applications exploiting the Apache Hive query language for NoSQL databases, built over a Apache Hadoop map-reduce infrastructure. This paper presents a dedicated modeling language and an application, showing first how it is possible to ease the modeling process and second how the semantic gap between modeling logic and the domain can be reduced, by means of vertical multiformalism modeling. © 2014 Elsevier B.V. All rights reserved.
KW - Big data
KW - Multiformalism modeling
KW - Performance evaluation
KW - Big data
KW - Multiformalism modeling
KW - Performance evaluation
UR - http://hdl.handle.net/10807/202857
U2 - 10.1016/j.future.2013.12.036
DO - 10.1016/j.future.2013.12.036
M3 - Article
SN - 0167-739X
VL - 37
SP - 345
EP - 353
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
ER -