Replica Maintenance Strategy for Data Grid
Keywords:
Data Grid, Replica Deletion, Storage Usage
Abstract
A Data Grid is an infrastructure that manages huge amounts of data files and provides intensive computational resources across geographically distributed collaborations. The performance of such a system can be increased by improving overall resource usage, which covers both network and storage resources. Network resource usage improves with good utilization of network bandwidth, an important factor affecting job execution time, while storage resource usage improves with good utilization of storage space. Data replication is one of the methods used to improve data access performance in distributed systems: multiple copies of data files are placed at the distributed sites. Once replicas have been distributed to various locations, they need to be monitored, and because the data grid environment changes dynamically, some replicas need to be relocated. In this paper we propose a replica placement maintenance strategy, termed the Unwanted Replica Deletion Strategy (URDS), as part of the replica maintenance service. The main purpose of the proposed strategy is to determine the locations of unwanted replicas to be deleted. OptorSim is used to evaluate the performance of the proposed strategy. The simulation results show that URDS requires less execution time, consumes fewer network resources, and makes better use of storage space than existing approaches.
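The abstract does not give the selection rule that URDS uses, so the following is only a minimal sketch of the general idea of picking unwanted replicas for deletion. It is not the authors' algorithm. The `Replica` record, the `select_unwanted` function, the `min_accesses` threshold, and the rule itself (delete rarely accessed replicas while always keeping at least one copy of each file) are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Replica:
    file_id: str       # logical file name (hypothetical field)
    site: str          # grid site holding this copy (hypothetical field)
    size_mb: int       # storage consumed by this copy
    access_count: int  # accesses seen in the latest monitoring window


def select_unwanted(replicas, min_accesses=1):
    """Return deletion candidates under an assumed, simplified rule.

    A replica is a candidate when it was accessed fewer than
    `min_accesses` times AND another copy of the same file remains.
    """
    # Count how many copies of each file currently exist.
    copies = {}
    for r in replicas:
        copies[r.file_id] = copies.get(r.file_id, 0) + 1

    unwanted = []
    # Visit the least-accessed replicas first.
    for r in sorted(replicas, key=lambda rep: rep.access_count):
        if r.access_count < min_accesses and copies[r.file_id] > 1:
            unwanted.append(r)
            copies[r.file_id] -= 1  # never drop the last copy of a file
    return unwanted


# Illustrative data only; these are not results from the paper.
replicas = [
    Replica("f1", "siteA", 100, 0),
    Replica("f1", "siteB", 100, 5),
    Replica("f2", "siteA", 200, 0),
]
candidates = select_unwanted(replicas)
freed_mb = sum(r.size_mb for r in candidates)
```

In this toy input only the unused copy of `f1` at `siteA` is selected. The unused `f2` is kept because it is the last remaining copy. A real strategy would presumably also weigh bandwidth and access history across sites, which this sketch leaves out.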
License
TRANSFER OF COPYRIGHT AGREEMENT
The manuscript is herewith submitted for publication in the Journal of Telecommunication, Electronic and Computer Engineering (JTEC). It has not been published before, and it is not under consideration for publication in any other journal. It contains no material that is scandalous, obscene, libelous or otherwise contrary to law. When the manuscript is accepted for publication, I, as the author, hereby agree to transfer to JTEC all rights, including those pertaining to electronic forms and transmissions, under existing copyright laws, except for the following, which the author(s) specifically retain(s):
- All proprietary rights other than copyright, such as patent rights;
- The right to make further copies of all or part of the published article for my use in classroom teaching;
- The right to reuse all or part of this manuscript in a compilation of my own works or in a textbook of which I am the author; and
- The right to make copies of the published work for internal distribution within the institution that employs me.
I agree that copies made under these circumstances will continue to carry the copyright notice that appears in the original published work. I agree to inform my co-authors, if any, of the above terms. I certify that I have obtained written permission for the use of text, tables, and/or illustrations from any copyrighted source(s), and I agree to supply such written permission(s) to JTEC upon request.