AlphaFold via Singularity
Note
These instructions are for AlphaFold Singularitiy container for version 2.2.2 and earlier. If you are using an AlphFold 2.3.0 or later container, please see the update instructions the new documentation
AlphaFold can be run via Singularity.
AlphaFold database is located in
/mnt/research/common-data/alphafold/database
.
database
├── bfd
│ ├── bfd_metaclust_clu_complete_id30_c90_final_seq.sorted_opt_a3m.ffdata
│ └── bfd_metaclust_clu_complete_id30_c90_final_seq.sorted_opt.tar.gz
├── mgnify
│ └── mgy_clusters_2018_12.fa
├── params
│ ├── params_model_1.npz
│ ├── params_model_1_ptm.npz
│ ├── params_model_2.npz
│ ├── params_model_2_ptm.npz
│ ├── params_model_3.npz
│ ├── params_model_3_ptm.npz
│ ├── params_model_4.npz
│ ├── params_model_4_ptm.npz
│ ├── params_model_5.npz
│ └── params_model_5_ptm.npz
├── pdb70
│ ├── md5sum
│ ├── pdb70_a3m.ffdata
│ ├── pdb70_a3m.ffindex
│ ├── pdb70_clu.tsv
│ ├── pdb70_cs219.ffdata
│ ├── pdb70_cs219.ffindex
│ ├── pdb70_hhm.ffdata
│ ├── pdb70_hhm.ffindex
│ └── pdb_filter.dat
├── pdb_mmcif
│ ├── mmcif_files
│ └── obsolete.dat
├── small_bfd
│ └── bfd-first_non_consensus_sequences.fasta
├── uniclust30
│ └── uniclust30_2018_08
└── uniref90
├── uniref90.fasta
└── uniref90.fasta.1.gz
Before running AlphaFold, you need to set
export ALPHAFOLD_DATA_PATH="/mnt/research/common-data/alphafold/database"
export ALPHAFOLD_MODELS="/mnt/research/common-data/alphafold/database/params"
To run alphafold, please use the following template (for more information about options/flags, please refer to the README on Github).
In the script, input.fasta
is your input data, and you need to set up
output_dir.
#!/bin/bash
#SBATCH --job-name alphafold-run
#SBATCH --time=08:00:00
#SBATCH --gpus=1
#SBATCH --cpus-per-task=8
#SBATCH --mem=20G
echo "This script is from ICER's AlphaFold via Singularity tutorial"
export ALPHAFOLD_DATA_PATH="/mnt/research/common-data/alphafold/database"
export ALPHAFOLD_MODELS="/mnt/research/common-data/alphafold/database/params"
singularity run --nv \
-B $ALPHAFOLD_DATA_PATH:/data \
-B $ALPHAFOLD_MODELS \
-B .:/etc \
--pwd /app/alphafold /opt/.software-legacy/alphafold/2.0.0/alphafold.sif \
--data_dir=/data \
--output_dir=/mnt/gs18/scratch/users/my_id/alphafold/output \
--fasta_paths=/mnt/gs18/scratch/users/my_id/alphafold/input.fasta \
--uniref90_database_path=/data/uniref90/uniref90.fasta \
--mgnify_database_path=/data/mgnify/mgy_clusters_2018_12.fa \
--bfd_database_path=/data/bfd/bfd_metaclust_clu_complete_id30_c90_final_seq.sorted_opt \
--uniclust30_database_path=/data/uniclust30/uniclust30_2018_08/uniclust30_2018_08 \
--pdb70_database_path=/data/pdb70/pdb70 \
--template_mmcif_dir=/data/pdb_mmcif/mmcif_files \
--obsolete_pdbs_path=/data/pdb_mmcif/obsolete.dat \
--max_template_date=2020-05-14 \
--model_names=model_1 \
--preset=casp14