dataset_to_database.py
Convert an existing PhyloFisher database to the new SQLite database format. This is required for version 2.0+ of PhyloFisher.
dataset_to_database.py [OPTIONS]
Required arguments:
-i,--input <db_dir>Path to directory containing an existing PhyloFisher v1 database.
Optional arguments:
-o,--outputPath to user-defined output directory.- Default:
./dataset_to_database_<M.D.Y>
- Default:
dataset_to_database.py output:
- a directory called
./dataset_to_database_<M.D.Y>with:phylofisher.db- an SQLite database file containing metadata, sequence data, taxonomic data, and gene data.orthomcl/- a directory containing the orthomcl files:orthomcl.diamonddb.dmnd- a diamond database of OrthoMCL.gene_og- a tsv file detailing the names of the 240 genes and their corresponding OrthoMCL orthogroup number(s).bacterial- abbreviated names of bacterial species present in OrthoMCL.
proteomes- a directory containing the proteomes of the taxa in the database.nucleotides- a directory containing the CDSs of the taxa in the database.profiles- a directory containing hmm profiles of the genes in the database.