Aspera bulk file transfer
The Aspera Connect application (ascp) is a useful file transfer tool for downloading or uploading large files in bulk between the HPCC and data repository sites such as those operated by NCBI. In order to interact with a server via aspera, the remote host must be running the Aspera server.
This short tutorial will demonstrate how to load and use the command line version of Aspera to download files from the NCBI ftp site.
Step 1: Log onto HPCC rsync gateway node:
1 |
|
Step 2: Load Aspera 3.9.8 module:
1 |
|
You can only execute Aspera file transfers from a gateway node. Transfers on the dev-nodes will not work.
Tip
If you need a higer version than 3.9.8, you can try installing it with conda (conda install -c rpetit3 aspera-connect
). Conda will handle Glibc issues.
Example command for downloading data from NCBI:
1 |
|
For uploading files from the HPCC, please refer to the NCBI instructions for uploading SRA files.