Download Provided Database
Download PhyloFisher’s Provided Database
- Retrieve the provided starting database via wget:
wget https://ndownloader.figshare.com/files/29093409 - Uncompress the .tar.gz file:
tar -xzvf 29093409
The uncompressed database directory contains the subdirectories and files detailed below.
database/backups/{Month}_{Day}_{Year}.tar.gza compressed file containing backups of the two database foldersorthologs/,paralogs/, the database filemetadata.tsv, andtree_colors.tsv
datasetdb/datasetdb.dmnda diamond database of the orthologsdatasetdb.fastaa fasta file of the orthologs used to construct the diamond database
orthologs/{gene_name}.fas240 fasta files of the orthologs
orthomcl/bacterialabbreviated names of bacterial species present in OrthoMCLgene_oga tsv file detailing the names of the 240 genes and their corresponding OrthoMCL orthogroup number(s)orthomcl.diamonddb.dmnddiamond database of OrthoMCL
paralogs/{gene_name}_paralogs.fas240 fasta files of the paralogs
- profiles
{gene_name}.hmm240 profile hmm files of the orthologs
- proteomes/
{Unique_ID}.faa.tar.gzcomplete proteomes of all taxa in the database
metadata.tsvtsv file of containing metadata for species in the database. Detailed extensively in the section “Detailed Explanation of the PhyloFisherDatabase_v1.0 Metadata File” of this manualtree_colors.tsva comma separated file used to color taxa based on taxonomy for manual inspection of single gene trees.