NOSQL DATABASE PERFORMANCE BENCHMARKING - A CASE STUDY

Authors

  • Camelia-Florina ANDOR Faculty of Mathematics and Computer Science, Babeș-Bolyai University, Cluj-Napoca, Romania. Email: andorcamelia@cs.ubbcluj.ro https://orcid.org/0000-0002-1124-7805
  • Bazil PÂRV Faculty of Mathematics and Computer Science, Babeș-Bolyai University, Cluj-Napoca, Romania. Email: bparv@cs.ubbcluj.ro

DOI:

https://doi.org/10.24193/subbi.2018.1.06

Keywords:

NoSQL database, performance benchmarking, MongoDB, Cassandra.

Abstract

This paper describes an experimental study of NoSQL database performance. Two NoSQL databases (MongoDB and Cassandra) were considered, under two different workloads (update heavy and mostly read) and several degrees of parallelism. The results are reported in terms of the throughput performance metric, and the statistical analysis is performed on those throughput results. The conclusions confirm that MongoDB performs better than Cassandra under a mostly read workload, while Cassandra outperforms MongoDB under an update heavy workload when the number of operations is high.

Author Biographies

Camelia-Florina ANDOR, Faculty of Mathematics and Computer Science, Babeș-Bolyai University, 1 Mihail Kogălniceanu, RO-400084 Cluj-Napoca, Romania. Email: andorcamelia@cs.ubbcluj.ro

Bazil PÂRV, Faculty of Mathematics and Computer Science, Babeș-Bolyai University, 1 Mihail Kogălniceanu, RO-400084 Cluj-Napoca, Romania. Email: bparv@cs.ubbcluj.ro

Published

2018-06-19

How to Cite

ANDOR, C.-F., & PÂRV, B. (2018). NOSQL DATABASE PERFORMANCE BENCHMARKING - A CASE STUDY. Studia Universitatis Babeș-Bolyai Informatica, 63(1), 80–93. https://doi.org/10.24193/subbi.2018.1.06

Section

Articles