Welcome to www.ClusterGate.RU

Internet
ClusterGate.RU

Home: Why clusters and for what the clusters are
News: mainly about the site
Linux: General Information,
Distributions, etc.
Clustering: software systems to organize clusters
Virtualization:
Hardware: server hardware and hardware for clustering
General Purpose Software
File Systems:
local and distributed
Access Methods
to large volume of data
Data transfer
between clusters
Security: all aspects (antihacker software, spare backup copy, power control)
High Performance Computing:
Examples of powerful clusters
Examples: midrange clusters
Monitoring and Measurement tools
Batch/Load Balance systems
Grid ...
Further reading:
Journals, Reviews,
News, Books
Computing in High Energy Physics:
Computing sites,
application packages
This is Data Move page for ClusterGate.RU

Moving (copying, replication) large volume of data over WAN

  • PHEDEX complicated data transfer system over unreliable WAN. The PhEDEx components are:
    1. Transfer management database (TMDB), currently version is 2.
    2. Transfer agents that manage the movement of files between sites. This also includes agents to migrate files to mass storage, to manage local mass storage stager pools and stage in files efficiently based on transfer demand, and to calculate file checksums when necessary before transfers.
    3. Management agents, in particular the allocator agent which assigns files to destinations based on site data subscriptions, and agents to maintains file transfer topology routing information.
    4. Tools to manage transfer requests; CMS/RefDB/PubDB specific.
    5. Local agents for managing files locally, for instance as files arrive from a transfer request or a production farm, including any processing that needs to be done before they can be made available for transfer: massaging information, merging files, registering files into the catalogues, injecting into TMDB.
    6. Web monitoring tools.
  • BBFTP -- utility for bulk data transfer. bbFTP is a file transfer software. It implements its own transfer protocol, which is optimized for large files (larger than 2GB) and secure as it does not read the password in a file and encrypts the connection information. bbFTP main features are:
    • Encoded username and password at connection
    • SSH and Certificate authentication modules
    • Multi-stream transfer
    • Big windows as defined in RFC1323
    • On-the-fly data compression
    • Automatic retry
    • Customizable time-outs
    • Transfer simulation
    • AFS authentication integration
    • RFIO interface
    bbFTP is open-source software, released under the GNU General Public License. It was written by Gilles Farrache at IN2P3 Computing Center in Lyon, France.
  • BBCP another tool for bulk data tranfer
  • mirrordir (Mirrordir version 0.10.49 (International - builtin encryption) this utility to copy directory from one machine to another one. Mirrordir makes a minimal set of changes to the directory to make it identical to the directory . Mirrordir dives into subdirectories recursively and duplicates all types of files exactly.
  • GridFTP Grid/Globus data transfer tool. Client part is known as globus-url-copy.
Please email the
portalmaster@pnpi.spb.ru
with questions or comments.
Our smart sponsors:

All rights reserved. Copyright © 2006, 2007, 2008, 2009. Andrey Y Shevel.


Last revised: Monday, 27-Apr-2009 12:41:47 MSD
Current date/time: Saturday, 04-May-2024 13:56:27 MSK
This document URL http://hepd.pnpi.spb.ru:443/ClusterGate.RU/CG_DataMove/index.shtml