Databases on the HPCC

NOTE: This page is a work in progress. Some information is subject to change.

Databases

The following databases are maintained by the HPCC. Because these databases are large, we cannot commit to storing every version indefinitely, so each one has a retention period. If your project requires a version older than the retention period allows, please plan to mirror that database to your own storage.

| Database | Download Frequency | Retention Length | Module Name | Download Location |
|---|---|---|---|---|
| NCBI nr | 6 months | 3 yrs | db-ncbi | https://ftp.ncbi.nlm.nih.gov/blast/db/ |
| NCBI nt | 6 months | 3 yrs | db-ncbi | https://ftp.ncbi.nlm.nih.gov/blast/db/ |
| NCBI core_nt | 6 months | 3 yrs | db-ncbi | https://ftp.ncbi.nlm.nih.gov/blast/db/ |
| NCBI swissprot | 6 months | 3 yrs | db-swissprot | https://ftp.ncbi.nlm.nih.gov/blast/db/ |
| NCBI nr_clustered_seq | 6 months | 3 yrs | db-ncbi | https://ftp.ncbi.nlm.nih.gov/blast/db/experimental/ |
| uniref | As released/Manual | 3 yrs | db-uniprot | https://ftp.uniprot.org/pub/databases/uniprot/ |
| pfam | As released/Manual | 3 yrs | db-pfam | https://ftp.ebi.ac.uk/pub/databases/Pfam/releases/ |
| CAZy | As released/Manual | 3 yrs | db-cazy | https://pro.unl.edu/dbCAN2/browse_download.php?path=Databases |
| interpro | TBD | TBD | TBD | TBD |
| UNITE | As released/Manual | 3 yrs | db-unite | https://unite.ut.ee/repository.php |
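
For projects that need to keep a database version beyond its retention period, mirroring could look roughly like the sketch below. It is only an illustration: the function name `mirror_database` and both paths in the usage comment are hypothetical, and the actual on-disk location of each database on the cluster is not documented on this page. It assumes the database is an ordinary directory tree that you have permission to read.

```python
import shutil
from pathlib import Path


def mirror_database(source_dir: str, dest_dir: str) -> int:
    """Copy the tree under source_dir into dest_dir.

    Returns the number of regular files present in dest_dir afterwards.
    """
    source = Path(source_dir)
    if not source.is_dir():
        raise FileNotFoundError(f"source directory not found: {source}")

    # dirs_exist_ok=True (Python 3.8+) lets a rerun refresh an existing
    # mirror instead of failing because dest_dir already exists.
    shutil.copytree(source, dest_dir, dirs_exist_ok=True)

    return sum(1 for p in Path(dest_dir).rglob("*") if p.is_file())


# Hypothetical usage; replace both placeholder paths with real ones:
# mirror_database("/path/to/hpcc/database/version", "/path/to/my/storage/mirror")
```

Because several of these databases are very large, it may be worth checking your storage quota before copying, and running the copy as a batch job rather than on a login node.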
Last modified April 21, 2026: Added database page (bf1c2f9fd)