ABySS
ABySS is a de novo, parallel, paired-end sequence assembler. It can run as an MPI job on the HPCC cluster. The latest version currently installed on the HPCC is 2.1.5, provided as the module ABySS/2.1.5.
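As a minimal sketch, loading that version could look like the following. It assumes the module is named ABySS/2.1.5 (the name used in the text) and that no other modules need to be loaded first; check the HPCC module listing if the load fails.

```shell
# Assumed module name, taken from the version string quoted in the text.
abyss_module="ABySS/2.1.5"

# The module command only exists on systems with an environment-modules
# setup (such as the HPCC nodes), so check for it before calling it.
if command -v module >/dev/null 2>&1; then
    module load "$abyss_module"
    module list
else
    echo "module command not available on this machine" >&2
fi
```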
You can optionally load other tools as needed, provided that they have been installed under the same toolchain environment as ABySS/2.1.5. Loading such a module is valid after you've loaded ABySS.
A sample SLURM script for running ABySS as an MPI job is outlined below.
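A rough sketch of such a script follows, written out to a file and syntax-checked. The node count, tasks per node, np, j, and the output file pattern match the description in this section; the script file name abyss_test.sb, the time and memory limits, the k-mer size, the assembly name, and the read files reads1.fastq and reads2.fastq are hypothetical placeholders, not values from the original script.

```shell
# Write a job script to a hypothetical file name, abyss_test.sb.
cat > abyss_test.sb <<'EOF'
#!/bin/bash
#SBATCH --job-name=abyss_test
#SBATCH --nodes=4                  # 4 nodes ...
#SBATCH --ntasks-per-node=2        # ... with 2 MPI processes each
#SBATCH --time=01:00:00            # placeholder wall-time limit
#SBATCH --mem-per-cpu=4G           # placeholder memory request
#SBATCH --output=%x-%j.SLURMout    # expands to abyss_test-<job ID>.SLURMout

module load ABySS/2.1.5            # assumed module name

# np = total MPI processes, j = threads for the paired-end stage.
# k, name, and the read file names are placeholders.
abyss-pe np=8 j=2 k=25 name=test in='reads1.fastq reads2.fastq'
EOF

# Check the script for shell syntax errors without running it.
bash -n abyss_test.sb
```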
This script launches an MPI job by requesting 8 processes; they are distributed on 4 nodes (--nodes=4) with two processes each (--ntasks-per-node=2). Accordingly, in the abyss-pe command line, we specify np=8. Regarding parameter j, the manual states:
The paired-end assembly stage is multithreaded, but must run on a single machine. The number of threads to use may be specified with the parameter j. The default value for j is the value of np.
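Because the paired-end stage runs on a single machine, its thread count should not exceed the CPUs available on one node. So, rather than letting j default to the value of np (8), we set j=2, which is the number of CPUs requested per node (in this case a "task" is equivalent to a CPU). To submit the job, pass the script to sbatch with the following constraint: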
sbatch --constraint="[intel16|intel18]"
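The np and j values used with abyss-pe follow directly from the resource request. As a sketch, they could be derived inside the job script from the environment variables SLURM sets, rather than typed by hand. The fallback values 4 and 2 are the ones from this example and apply only when the variables are unset, for instance outside a job.

```shell
# Number of nodes and tasks per node, as set by SLURM inside a job.
# Fall back to the values from the example when run outside a job.
nodes="${SLURM_JOB_NUM_NODES:-4}"
tasks_per_node="${SLURM_NTASKS_PER_NODE:-2}"

np=$((nodes * tasks_per_node))   # total MPI processes across all nodes
j="$tasks_per_node"              # threads for the single-node paired-end stage

# Print the parameters that would be passed to abyss-pe.
echo "abyss-pe np=${np} j=${j}"
```

With the fallback values this gives np=8 and j=2, matching the request of 4 nodes with 2 tasks each.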
While the job is running, you can inspect the SLURM output file (in this example, abyss_test-<job ID>.SLURMout), which contains a running log, including the following:
Running on 8 processors
6: Running on host lac-391
0: Running on host lac-194
2: Running on host lac-225
4: Running on host lac-287
7: Running on host lac-391
3: Running on host lac-225
1: Running on host lac-194
5: Running on host lac-287
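To confirm that the ranks are spread across nodes as requested, the host lines in that log can be tallied. The helper below is a small sketch; the function name ranks_per_host is made up for illustration, and it assumes the log lines have exactly the "rank: Running on host name" form shown above.

```shell
# Count MPI ranks per host from lines like "6: Running on host lac-391".
# Prints one "count host" pair per line, sorted by host name.
ranks_per_host() {
    awk '/^[0-9]+: Running on host / { count[$NF]++ }
         END { for (h in count) print count[h], h }' "$1" | sort -k2
}

# Example call (replace <job ID> with the actual job ID):
#   ranks_per_host "abyss_test-<job ID>.SLURMout"
```

For the log shown above, each of the four hosts should be reported with a count of 2, matching --ntasks-per-node=2.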