Skip to content

Aspera bulk file transfer

The Aspera Connect application (ascp) is a useful file transfer tool for downloading or uploading large files in bulk between the HPCC and data repository sites such as those operated by NCBI.  In order to interact with a server via aspera, the remote host must be running the Aspera server.  This tutorial will demonstrate how to install and use the command line version of Aspera to download files from the NCBI ftp site.

You can only execute Aspera file transfers from gateway.  Transfers on the dev-nodes will not work correctly.

Go to https://www.ibm.com/products/aspera/downloads to download "aspera connect".

  1. Select the Linux OS and download aspera-connect-3.7.2.141527-linux-64.sh (version may change over time) to your home directory.
  2. Run chmod u+x aspera-connect-3.7.2.141527-linux-64.sh
  3. Run ./aspera-connect-3.7.2.141527-linux-64.sh

The installation will then be located in ~/.aspera/connect/; and the command ascp is in ~/.aspera/connect/bin/

Example use:

1
~/.aspera/connect/bin/ascp -T -k 1 -i ~/.aspera/connect/etc/asperaweb_id_dsa.openssh anonftp@ftp.ncbi.nlm.nih.gov:/refseq/uniprotkb ~/NCBI_data

More instructions and examples can be found here.