A Taxonomy of Data Management Models in Distributed and Grid Environments

Full Text (PDF, 677KB), PP.19-32

Views: 0 Downloads: 0


Farrukh Nadeem 1,*

1. Department of Information Systems, Faculty of Computing and Information Technology King Abdulaziz University, Jeddah, Saudi Arabia

* Corresponding author.

DOI: https://doi.org/10.5815/ijitcs.2016.03.03

Received: 3 Jul. 2015 / Revised: 7 Oct. 2015 / Accepted: 11 Dec. 2015 / Published: 8 Mar. 2016

Index Terms

Data Management Tasks and Challenges, Data Management Models, Taxonomy of Data Management Models, Data Management in Distributed and Grid Environments


The distributed environments vary largely in their architectures, from tightly coupled cluster environment to loosely coupled Grid environment and completely uncoupled peer-to-peer environment, and thus differ in their working environments as well as performance. To meet the specific needs of these environments for data organization, replication, transfer, scheduling etc. the data management systems implement different data management models. In this paper, major data management tasks in distributed environments are identified and a taxonomy of the data management models in these environments is presented. The taxonomy is used to highlight the specific data management requirements of each environment and highlight the strengths and weakness of the implemented data management models. The taxonomy is followed by a survey of different distributed and Grid environments and the data management models they implement. The taxonomy and the survey results are used to identify the issues and challenges of data management for future exploration.

Cite This Paper

Farrukh Nadeem, "A Taxonomy of Data Management Models in Distributed and Grid Environments", International Journal of Information Technology and Computer Science(IJITCS), Vol.8, No.3, pp.19-32, 2016. DOI:10.5815/ijitcs.2016.03.03


[1]Australian Virtual Observatory. http://www.aus-vo.org/.

[2]BioGrid, Japan (biogrid-jp). http://www.biogrid.jp/.

[3]Biomedical Informatics Research Network (BIRN). birncommunity.org/.

[4]Earth System Grid. www.earthsystemgrid.org/.

[5]eDiaMoND Grid Computing Project. www.ediamond.ox.ac.uk/.

[6]Enabling Grids for E-SciencE (EGEE). http://public.eu-egee.org/.

[7]European Grid Infrastructure (EGI). http://www.egi.eu/.

[8]Grid Physics Network(GriPhyN). http://www.griphyn.org.

[9]Grid4All. http://www.cslab.ece.ntua.gr/cgi-bin/twiki/view/CSLab/AboutGrid4All.

[10]GridPP: UK Computing for Particle Physics. http://www.gridpp.ac.uk/.

[11]International Virtual Observatory Alliance. http://www.ivoa.net/.

[12]NEESgrid: Virtual Collaboratory for Earthquake Engineering. www.neesgrid.org.

[13]Organic Grid. http://www.csc.lsu.edu/~gb/OrganicGrid/.

[14]Particle Physics Data Grid (PPDG). http://www.ppdg.net/.

[15]Karl Aberer, Philippe Cudr′e-Mauroux, Anwitaman Datta, Zoran Despotovic, Manfred Hauswirth, Magdalena Punceva, and Roman Schmidt. P-grid: a self-organizing structured p2p system. SIGMOD Record, 32(3):29–33, 2003.

[16]Carlisle Adams and Steve Lloyd. Understanding the Public-Key Infrastructure: Concepts, Standards and Deployment Considerations. Addison-Wesley Professional, second edition.

[17]David P. Anderson, Jeff Cobb, Eric Korpela, Matt Lebofsky, and Dan Werthimer. Seti@home: an experiment in public- resource computing. Commun. ACM, 45(11):56–61, November 2002.

[18]Fran Berman, Geoffrey Fox, and Anthony J. G. Hey. Grid Computing: Making the Global Infrastructure a Reality. John Wiley & Sons, Inc., New York, NY, USA, 2003.

[19]Bhatia, K.1. Ogsa-p2p research group: peer-to-peer requirements on the open grid services architecture framework. Global Grid Forum Document GFD-I.049, 2005.

[20]CERN: European Council for Nuclear Researh. Worldwide LHC Computing Grid (WLCG). http://wlcg.web.cern.ch/.

[21]Ann Chervenak, Ian Foster, Carl Kesselman, Charles Salisbury, and Steven Tuecke. The data grid: Towards an architecture for the distributed management and analysis of large scienti?c datasets. JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 23:187–200, 1999.

[22]Ding Choon-Hoong, Sarana Nutanong, and Rajkumar Buyya. Peer-to-Peer Networks for Content Sharing, chapter 2, pages 28–65. Idea Group Inc, Hershey, PA, USA, 2005.

[23]Community Grids Lab, Indiana University. The Narada Brokering Project. http://grids.ucs.indiana.edu/ptliupages/projects/narada/.

[24]Data Management International. DAMA-DMBOK Guide (Data Management Body of Knowledge) Introduction & Project Status. http://www.dama.org/files/public/DI DAMA DMBOK Guide Presentation 2007.pdf.

[25]Brian D. Davison. A web caching primer. IEEE Internet Computing, 5(4):38–45, July 2001.

[26]John Dilley, Bruce Maggs, Jay Parikh, Harald Prokop, Ramesh Sitaraman, and Bill Weihl. Globally distributed content delivery. IEEE Internet Computing, 6(5):50–58, September 2002.

[27]David F. Ferraiolo, D. Richard Kuhn, and Ramaswamy Chandramouli. Role-Based Access Control. Artech House.

[28]Ian Foster, Nicholas R. Jennings, and Carl Kesselman. Brain meets brawn: Why grid and agents need each other. In Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems - Volume 1, AAMAS ’04, pages 8–15, Washington, DC, USA, 2004. IEEE Computer Society.

[29]Ian Foster, Carl Kesselman, and Steven Tuecke. The anatomy of the grid: Enabling scalable virtual organizations. Int. J. High Perform. Comput. Appl., 15(3):200–222, August 2001.

[30]Jim Gray and Andreas Reuter. Transaction Processing: Concepts and Techniques. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 1st edition, 1992.

[31]Wolfgang Hoschek, Francisco Javier Ja′en-Mart′?nez, Asad Samar, Heinz Stockinger, and Kurt Stockinger. Data management in an international data grid project. In Proceedings of the First IEEE/ACM International Workshop on Grid Computing, GRID ’00, pages 77–90, London, UK, UK, 2000. Springer-Verlag.

[32]Bart Jacob, Luis Ferreira, Norbert Bieberstein, Candice Gilzean, Jean-Yves Girard, Roman Strachowski, and Seong (Steve) Yu. Enabling applications for grid computing with globus. IBM Corp., Riverton, NJ, USA, first edition, 2003.

[33]Balachander Krishnamurthy, Craig Wills, and Yin Zhang. On the use and performance of content distribution networks. In Proceedings of the 1st ACM SIGCOMM Workshop on Internet Measurement, IMW ’01, pages 169–182, New York, NY, USA, 2001. ACM.

[34]Jagatheesan A. Rajasekar A. Wan M. ”Moore, R.W. and W” Schroeder. Data grid management systems. In the 21st IEEE/NASA Conference on Mass Storage Systems and Technologies (MSST), College Park, Maryland, USA, April 2004.

[35]Andy Oram, editor. Peer-to-Peer: Harnessing the Power of Disruptive Technologies. O’Reilly & Associates, Inc., Sebastopol, CA, USA, 2001.

[36]John Orcutt and Frank Vernon. Real-time Observatories, Applications, and Data Management Network [ROADNet]. http://roadnet.ucsd.edu/.

[37]Esen Ozkarahan. Database management: concepts, design, and practice. Prentice Hall, 1990.

[38]M. Tamer Ozsu. Principles of Distributed Database Systems. Prentice Hall Press, Upper Saddle River, NJ, USA, 3rd edition, 2007.

[39]Laura Pearlman, Carl Kesselman, Sridhar Gullapalli, B. F. Spencer, Jr., Joe Futrelle, Kathleen Ricker, Ian Foster, Paul Hubbard, and Charles Severance. Distributed hybrid earthquake engineering experiments: Experiences with a ground shaking grid application. In Proceedings of the 13th IEEE International Symposium on High Performance Distributed Computing, HPDC ’04, pages 14–23, Washington, DC, USA, 2004. IEEE Computer Society.

[40]Jun Qin and Thomas Fahringer. Advanced data flow support for scientific grid workflow applications. In Proceedings of the 2007 ACM/IEEE conference on Supercomputing, SC ’07, pages 42:1–42:12, New York, NY, USA, 2007. ACM.

[41]Amit P. Sheth and James A. Larson. Federated database systems for managing distributed, heterogeneous, and au tonomous databases. ACM Comput. Surv., 22(3):183–236, September 1990.

[42]Wei Song, Yuelong Zhao, Wenying Zeng, and Wenfeng Wang. Data grid model based on structured p2p overlay network. In Proceedings of the 7th international conference on Advanced parallel processing technologies, APPT’07, pages 282–291, Berlin, Heidelberg, 2007. Springer-Verlag.

[43]William Stallings. Cryptography And Network Security: Principles and Practices. Prentice Hall.

[44]Osamu Tatebe, Youhei Morita, Satoshi Matsuoka, Noriyuki Soda, and Satoshi Sekiguchi. Grid datafarm architecture for petascale data intensive computing. In Proceedings of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid, CCGRID ’02, pages 102–, Washington, DC, USA, 2002. IEEE Computer Society.

[45]The Globus Project. Globus toolkit. http://www.globus.org/toolkit/docs/latest-table/gridftp/.

[46]Srikumar Venugopal, Rajkumar Buyya, and Kotagiri Ramamohanarao. A taxonomy of data grids for distributed data sharing, management, and processing. ACM Comput. Surv., 38(1), 2006.

[47]Srikumar Venugopal, Rajkumar Buyya, and Kotagiri Ramamohanarao. A taxonomy of data grids for distributed data sharing, management, and processing. ACM Comput. Surv., 38, June 2006.