Connecting arbitrary data resources to the grid

dc.contributor.authorZhang, S.
dc.contributor.authorCoddington, P.
dc.contributor.authorWendelborn, A.
dc.contributor.conferenceIEEE/ACM International Conference on Grid Computing (11th : 2010 : Brussels, Belgium)
dc.date.issued2010
dc.description.abstractMany scientific grid systems have been running and serving researchers for many years around the world. Among them, Globus Toolkit and its variants are playing an important role as the basis of most of those existing grid systems. However, the way data is stored and accessed varies. Proprietary protocols have been designed and developed to serve data by different storage systems or file systems. One example is the integrated Rule Oriented Data System (iRODS), which is a data grid system with the non-standard iRODS protocol and has its own client tools and API. Consequently, it is difficult for the grid to connect to it directly and stage data to computers in the grid for processing. It is usually an ad hoc process to transfer data between two data systems with different protocols. In addition, existing data transfer services are mostly designed for the grid and do not understand proprietary protocols. This requires users to transfer data from the source to a temporary space, and then transfer it from the temporary space to the destination, which is complex, inefficient and error-prone. Some work has been done on the client side to address this issue. In order to address the issues of data staging and data transfer in one solution, this paper describes a different but easy and generic approach to connect any data systems to the grid, by providing a service with an abstract framework to convert any underlying data system protocol to the GridFTP protocol, a de facto standard of data transfer for the grid.
dc.description.statementofresponsibilityShunde Zhang, Paul Coddington, Andrew Wendelborn
dc.identifier.citationProceedings of 2010 11th IEEE/ACM International Conference on Grid Computing, 2010: pp.185-192
dc.identifier.doi10.1109/GRID.2010.5697958
dc.identifier.isbn9781424493487
dc.identifier.issn1550-5510
dc.identifier.orcidCoddington, P. [0000-0003-1336-9686]
dc.identifier.urihttp://hdl.handle.net/2440/64300
dc.language.isoen
dc.publisherIEEE
dc.publisher.placeUSA
dc.rights© 2010 IEEE
dc.source.urihttps://doi.org/10.1109/grid.2010.5697958
dc.subjectGridFTP
dc.subjectdata staging
dc.subjectdata transfer
dc.titleConnecting arbitrary data resources to the grid
dc.typeConference paper
pubs.publication-statusPublished

Files