An Efficient and Performance-Aware Big Data Storage System


Paper Title: An Efficient and Performance-Aware Big Data Storage System

Published in: Cloud Computing and Services Science, Communications in Computer and Information Science, Volume 367, 2013, pp. 102-116.

Keywords: Big Data Storage, Cloud Computing, Cloud Storage, Amazon S3, CACSS

Abstract. Recent escalations in Internet development and volume of data have created a growing demand for large-capacity storage solutions. Although Cloud storage has yielded new ways of storing, accessing and managing data, there is still a need for an inexpensive, effective and efficient storage solution especially suited to big data management and analysis. In this paper, we take our previous work one step further and present an in-depth analysis of the key features of future big data storage services for both unstructured and semi-structured data, and discuss how such services should be constructed and deployed. We also explain how different technologies can be combined to provide a single, highly scalable, efficient and performance-aware big data storage system. We especially focus on the issues of data de-duplication for enterprises and private organisations. This research is particularly valuable for inexperienced solution providers like universities and research organisations, and will allow them to swiftly set up their own big data storage services.

Authors: Yang Li, Li Guo and Yike Guo


CACSS Cloud Storage System Web Login Portal

CACSS Cloud Storage System (Big Data Ready) Live Demonstration (Amazon S3, Rackspace, Google Cloud Storage Alternatives)


CACSS Cloud Storage System

  • Neither Amazon S3’s architecture nor its implementation has been made public. As such, it cannot be extended to build private clouds of any size. To expose the knowledge behind cloud storage services and thereby offer a generic solution, we present CACSS, a generic computational and adaptive cloud storage system that adapts existing storage technologies to provide efficient and scalable services.
  • CACSS Cloud Storage System is built as the implementation of one of my papers, "CACSS: Towards a Generic Cloud Storage Service" (in: The 2nd International Conference on Cloud Computing and Services Science, CLOSER 2012, Porto, Portugal, pages 27-36, SciTePress 2012, ISBN 978-989-8565-05-1).
  • It is Amazon S3 API Compatible (you only need to change the endpoint from s3.amazonaws.com to s3.bigdatapro.org)
  • It is currently deployed across a number of virtual machines.
  • Some functions, such as object versioning, are currently disabled.
  • Please feel free to contact me with any questions you may have.

Web Portal: http://www.bigdatapro.org or http://console.bigdatapro.org

Endpoint: http://s3.bigdatapro.org (use port 80; SSL is not yet supported)
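Since the service is described as Amazon S3 API compatible, switching a client over should mostly be a matter of changing the endpoint. The following is a minimal sketch of that idea, not CACSS client code: it only builds object URLs and sends nothing over the network. It assumes path-style addressing (`<endpoint>/<bucket>/<key>`), which the post does not confirm, and the helper `object_url`, the bucket name and the key are hypothetical. Real requests would also need to be signed with your credentials, which is omitted here.

```python
from urllib.parse import quote, urlsplit

AMAZON_ENDPOINT = "http://s3.amazonaws.com"
CACSS_ENDPOINT = "http://s3.bigdatapro.org"  # plain HTTP on port 80; SSL is not yet supported


def object_url(endpoint: str, bucket: str, key: str) -> str:
    """Build a path-style object URL: <endpoint>/<bucket>/<key>.

    Path-style addressing is an assumption made for this sketch.
    """
    parts = urlsplit(endpoint)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"not a valid endpoint: {endpoint!r}")
    # Percent-encode the key but keep "/" so nested keys stay readable.
    return f"{parts.scheme}://{parts.netloc}/{quote(bucket)}/{quote(key, safe='/')}"


# Hypothetical bucket and key, used only for illustration.
amazon_url = object_url(AMAZON_ENDPOINT, "my-bucket", "reports/2013 q1.csv")
cacss_url = object_url(CACSS_ENDPOINT, "my-bucket", "reports/2013 q1.csv")
print(amazon_url)
print(cacss_url)
```

The two URLs differ only in the host name, which is the point of an S3-compatible endpoint: the rest of the client logic can stay as it is.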

CACSS Open Source Cloud Storage System Web Login Portal

CACSS Open Source Cloud Storage System Web Control Panel Demonstration

Keywords: Amazon S3, Amazon S3 Architecture, Open Source Amazon S3,  Open Source Cloud Storage System, CACSS, Big Data Storage

 


CACSS: Towards a Generic Cloud Storage Service (reveal the secret knowledge behind cloud storage services)


Keywords: cacss cloud storage s3 architecture cloud computing amazon s3

Published in the 2nd International Conference on Cloud Computing and Services Science, CLOSER 2012, Porto, Portugal.

Title:      CACSS: Towards a Generic Cloud Storage Service
Author(s):      Yang Li, Li Guo, Yike Guo

Abstract

The advent of the cloud era has yielded new ways of storing, accessing and managing data. Cloud storage services enable the storage of data in an inexpensive, secure, fast, reliable and highly scalable manner over the internet. Although giant providers such as Amazon and Google have made a great success of their services, many enterprises and scientists are still unable to make the transition into the cloud environment due to often insurmountable issues of privacy, data protection and vendor lock-in. These issues demand that it be possible for anyone to set up or build their own storage solutions that are independent of commercially available services. However, the question persists as to how to provide an effective cloud storage service with regard to system architecture, resource management mechanisms, data reliability and durability, as well as how to provide proper pricing models. The aim of this research is to present an in-depth understanding and analysis of the key features of generic cloud storage services, and of how such services should be constructed and provided. This is achieved through the demonstration of design rationales and the implementation details of a real cloud storage system (CACSS). The method by which different technologies can be combined to provide a single high-performance, highly scalable and reliable cloud storage system is also detailed. This research serves as a knowledge source for inexperienced cloud providers, giving them the capability of swiftly setting up their own cloud storage services.
