Replica Maintenance Strategy for Data Grid

Authors

  • Mohammed K. Madi Faculty of Engineering, Hasan Kalyoncu Üniversitesi, Gaziantep, Turkey
  • Yuhanis Yusof School of Computing, Universiti Utara Malaysia, 06010 Kedah, Malaysia
  • Hatim Mohamed Tahir School of Computing, Universiti Utara Malaysia, 06010 Kedah, Malaysia
  • Khuzairi Mohd Zaini School of Computing, Universiti Utara Malaysia, 06010 Kedah, Malaysia
  • Suhaidi Hassan School of Computing, Universiti Utara Malaysia, 06010 Kedah, Malaysia

Keywords:

Data Grid, Replica Deletion, Storage Usage,

Abstract

Data Grid is an infrastructure that manages huge amount of data files, and provides intensive computational resources across geographically distributed collaboration. Increasing the performance of such system can be achieved by improving the overall resource usage, which includes network and storage resources. Improving network resource usage is achieved by good utilization of network bandwidth that is considered as an important factor affecting job execution time. Meanwhile, improving storage resource usage is achieved by good utilization of storage space usage. Data replication is one of the methods used to improve the performance of data access in distributed systems by replicating multiple copies of data files in the distributed sites. Having distributed the replicas to various locations, they need to be monitored. As a result of dynamic changes in the data grid environment, some of the replicas need to be relocated. In this paper we proposed a maintenance replica placement strategy termed as Unwanted Replica Deletion Strategy (URDS) as a part of Replica maintenance service. The main purpose of the proposed strategy is to find the placement of unwanted replicas to be deleted. OptorSim is used to evaluate the performance of the proposed strategy. The simulation results show that URDS requires less execution time and consumes less network usage and has a best utilization of storage space usage compared to existing approaches.

References

A. Chervenak, E. Deelman, C. Kesselman, B. Allcock, I. Foster, V.

Nefedova, J. Lee, A. Sim, A. Shoshani, and B. Drach, "Highperformance remote access to climate simulation data: A challenge problem for data grid technologies," in Super Computing, 2003, 1335-1356.

I. Foster, E. Alpert, A. Chervenak, B. Drach, C. Kesselman, V. Nefedova, D. Middleton, A. Shoshani, A. Sim, and D. Williams, "The

Earth System Grid II: Turning climate datasets into community

resources," in Annual Meeting of the American Meteorological

Society, 2002.

B. Wilkinson, Grid computing: techniques and applications: Chapman & Hall/CRC, 2009.

C. Nicholson, D. G. Cameron, A. T. Doyle, A. P. Millar, and K. Stockinger, "Dynamic data replication in lcg 2008," Concurrency and Computation: Practice and Experience, 20, 1259-1271, 2008.

A. Chervenak, E. Deelman, I. Foster, W. Hoschek, A. Iamnitchi, C. Kesselman, M. Ripeanu, B. Schwartzkopf, H. Stockinger, and B. Tierney, "Giggle: A framework for constructing scalable replica

location services," in International IEEE Supercomputing Conference (SC 2002) Baltimore, USA, 2002, 1-17.

A. Chervenak, I. Foster, C. Kesselman, C. Salisbury, and S. Tuecke., "The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Datasets," Journal of Network and Computer Applications, 23, 2001.

L. Guy, P. Kunszt, E. Laure, H. Stockinger, and K. Stockinger, "Replica management in data grids," in Global Grid Forum. 5, 2002.

H. Lamehamedi, Z. Shentu, B. Szymanski, and E. Deelman,

"Simulation of dynamic data replication strategies in data grids," in Proceedings of 12th Heterogeneous Computing Workshop

(HCW2003), Nice, France, 2003.

H. Lamehamedi, B. Szymanski, Z. Shentu, and E. Deelman, "Data Replication Strategies in Grid Environments," in Fifth International Conference on Algorithms and Architectures for Parallel Processing, 2002, 378.

E. Otoo, F. Olken, and A. Shoshani, "Disk cache replacement algorithm for storage resource managers in data grids," in 2002 ACM/IEEE conference on Supercomputing, Baltimore, Maryland 2002, 1-15.

K. Ranganathan and I. Foster, "Identifying Dynamic Replication Strategies for a High-Performance Data Grid," International Grid Computing Workshop, 75-86, 2001.

X. You, G. Chang, X. Chen, C. Tian, and C. Zhu, "Utility-Based Replication Strategies in Data Grids," in Fifth International Conference on Grid and Cooperative Computing, 2006, 500-507.

M. Tang, B. S. Lee, X. Tang, and C. K. Yeo, "The impact of data replication on job scheduling performance in the Data Grid," Future Generation Computer Systems, 22, 254-268, 2006.

S. M. Park, J. H. Kim, Y. B. Ko, and W. S. Yoon, "Dynamic data grid replication strategy based on Internet hierarchy," International Workshop on Grid and Cooperative Computing, 1001, 1324–1331, 2004.

S. Venugopal, R. Buyya, and K. Ramamohanarao, "A taxonomy of data grids for distributed data sharing, management, and processing," ACM Computing Surveys (CSUR), 38, 2006.

R. M. Rahman, K. Barker, and R. Alhajj, "Replica placement strategies in data grid," Journal of Grid Computing, 6, 103-123, 2008.

R. M. Rahman, K. Barker, and R. Alhajj, "Performance evaluation of different replica placement algorithms," International Journal of Grid and Utility Computing, 1, 121-133, 2009.

M. R. Rahman, "Replica placement and selection strategies in data grids," in Department of Computer Science. vol. PhD. thesis Alberta: University of Calgary, 2007.

C. T. Yang, C. J. Huang, and T. C. Hsiao, "A Data Grid File Replication Maintenance Strategy Using Bayesian Networks," in Intelligent Systems Design and Applications, 2008. ISDA'08, 2008.

C. T. Yang, C. P. Fu, and C. J. Huang, "A dynamic file replication

strategy in data grids," in TENCON 2007-2007 IEEE Region 10 Conference, 2007, 1-5.

Saleh, A., Javidan, R., and FatehiKhajeh, M. T., “A four-phase data replication algorithm for data grid”, Journal of Advanced ComputerScience & Technology, 4(1), 163-174, 2015.

Tos, U., Mokadem, R., Hameurlain, A., Ayav, T., and Bora, S. “Dynamic replication strategies in data grid systems: a survey” The Journal of Supercomputing, 71(11), 2015, 4116-4140.

David G. Cameron, "Replica management and optimisation for data grids," PhD. Thesis, University of Glasgow, 2005.

Rahmani, A.M., Fadaie, Z. and Chronopoulos, A.T., “Data placement using Dewey Encoding in a hierarchical data grid”, Journal of Network and Computer Applications, 49, 2015. 88-98.

Grace, R.K. and Manimegalai, R., “HGASA: An Efficient Hybrid Technique for Optimizing Data Access in Dynamic Data Grid”. In Distributed Computing and Internet Technology, 132-136, 2016. Springer International Publishing.

K. Ranganathan, A. Iamnitchi, and I. Foster, "Improving data availability through dynamic model-driven replication in large peer-topeer communities," in Global and Peer-to-Peer Computing on Large Scale Distributed Systems Workshop, 2002, 376–381.

L. Yi-Fang, L. Pangfeng, and W. Jan-Jan, "Optimal placement of replicas in data grid environments with locality assurance," in Parallel and Distributed Systems, 2006. ICPADS 2006. 12th International Conference on, 2006,

Y. F. Lin, J. J. Wu, and P. Liu, "A List-Based Strategy for Optimal

Replica Placement in Data Grid Systems," in Proceedings of Parallel Processing, 2008. ICPP'08. 37th International Conference on, 2008, 198-205.

M.K. Madi, H.M. Tahir, Y. Yusof, and S. Hassan, S., “A novel dynamic replica creation mechanism for Data Grids”. In Game Physics and Mechanics International Conference (GAMEPEC), 2015, 1-5. IEEE.

Y. Mansouri, M. Garmehi, M. Sargolzaei, and M. Shadi, "Optimal Number of Replicas in Data Grid Environment," in First International Conference on Distributed Framework and Applications, 2008., 96-101.

The European Data Grid Project. http://eudatagrid.web.cern.ch/eudatagrid

C. Ruay-Shiung, C. Hui-Ping, and W. Yun-Ting, "A dynamic weighted data replication strategy in data grids," in AICCSA 2008: Proceedings of IEEE/ACS International Conference on computer systems and applications, 2008, 414-421.

D. G. Cameron, R. Carvajal-Schiaffino, A. P. Millar, C. Nicholson, K. Stockinger, and F. Zini, "Evaluating scheduling and replica

optimisation strategies in OptorSim," Journal of Grid Computing, 57-9, March 2004.

W. H. Bell, D. G. Cameron, L. Capozza, P. Millar, K. Stockinger, and F. Zini, "Simulation of Dynamic Grid Replication Strategies in OptorSim," Journal of High Performance Computing Applications, 17, 2003.

Downloads

Published

2017-03-01

How to Cite

Madi, M. K., Yusof, Y., Mohamed Tahir, H., Mohd Zaini, K., & Hassan, S. (2017). Replica Maintenance Strategy for Data Grid. Journal of Telecommunication, Electronic and Computer Engineering (JTEC), 9(1-2), 47–51. Retrieved from https://jtec.utem.edu.my/jtec/article/view/1650

Most read articles by the same author(s)

1 2 3 > >>