Command Line access points
After a successfull installation, refineGEMs can be accessed via the command line from inside the Python environment it was installed in using:
refinegems [OPTIONS] COMMAND [ARGS]
The following commands are available:
analyse: Provides commands for analysing the model(s).database: Provides commands for handling the internal database.gaps: Provides commands for performing gap-filling.media: Provides commands for interacting with the media (database).polish: Provides commands for polising a model generated by an automated pipeline.refine: Provides commands for refining a model.setup: Provides commands for the set-up or download of files and directories for the tool.
General Options
--help: Call the help page of the command.--version: Show the version and exit.
refinegems analyse
Analyse a model by testing for auxotrophies and growth on different media along with finding EGCs and looking at overall model statistics.
refinegems analyse compare [MODELPATH]
Compare multiple models.
Options:
--type/-t <sboterm|entities>: Type of comparison to perform. Multiple can be added.--all: Perform all comparisons. Overwrites previous option.
refinegems analyse memote [MODELPATH]
Perform a memote analysis on a model.
Options:
--score-only/-s: Return only the final score of the analysis.--file/-f: Name/Path to save output to. Only relevant if-swas not set.
refinegems analyse pancore [MODELPATH] [PCPATH]
Compare the pan-core(-novel) information content of a model to a pan-core model.
Options:
--based-on/-b: How to compare the models. Deafult is a comparison based on the IDs of the model entities--dir/-d: Path to the output directory.
refinegems analyse pathways [MODELPATH]
Analyse the KEGG pathways contained in a model.
Options:
--dir/-d: Path to the output directory.
refinegems analyse stats [MODELPATH]
Perform a statistical analysis of the model.
Options:
--dir/-d: Path to the output directory.--colors/-c: Abbreviation of a matplotlib color palette for the graphics.
refinegems analyse growth
Analyse the growth under different conditions.
refinegems analyse growth auxothrophies [MODELPATH]
For a given set of media, test simulate the amino acid auxotrohpies of the model.
Options:
--media/-m: Path to a media config file. REQUIRED.--namespace/-n: Namespace of the model.--colors/-c: Abbreviation of a matplotlib color palette for the graphics.--dir/-d: Path to the output directory.
refinegems analyse growth minimal-medium [MODELPATH]
Calculate the minimal medium of a model. Can either be calculated by minimising the fluxes of the current medium
(flux), finding the minimal number of compounds needed for growth based on the current medium (medium) or the
minimal number of compounds based on the available exchange reactions (exchanges).
Options:
--objective/-o: One of the three options listed to choose what to optimize for.--growth-rate/-r: Minimal growth rate that should be reached on the minimal medium. The smaller the value, the more computational expensive the calculation will be.--dir/-d: Path to the output directory.
refinegems analyse growth simulate [MODELPATHS]
Simulate the growth of one or multiple models on one or more media.
Options:
--media/-m: Path to a media config file. REQUIRED.--namespace/-n: Namespace of the model.--colors/-c: Abbreviation of a matplotlib color palette for the graphics.--dir/-d: Path to the output directory.
refinegems analyse growth sources [MODELPATH]
Simulate the growth for different sources. When choosing the starting medium, it is useful to have at least one additional source of every other neccessary element separated from the one that is tested against.
Options:
--element/-e: The element to perform the source test for. Should be a valid chemical symbol.--substances/-s: Substances for substitution. Multiple can be given. If none are given, all options found in the database will be utilised as potential sources.--medium/-m: Medium abbreviation of a medium in the database, if the medium currently saved with the model should be substitued before testing.--namespace/-n: Namespace of the model.--colors/-c: Abbreviation of a matplotlib color palette for the graphics.--dir/-d: Path to the output directory.
refinegems database
Access, curate, etc. the in-build database.
refinegems database initialise
Initialise or update the in-build database.
refinegems database add_namespace [DATABASENAME]
Add or update tables for additional namespaces/databases into/of the in-build database.
Options:
--chunksize/-c: Size (in kB) of data to download per chunk, if a download is required.
refinegems database reset
Reset the database by removing all additionally added tables (added with e.g. add_namespace)
refinegems gaps
refinegems gaps automated_gapfill [ALG] [MODELPATH] [OPTIONS]
Find gaps in a model and optionally try to fill them. Algorithm for the gap-filling is chosen by the first argument.
General Options
-o,--outdir: Path to a directory to write the output to. [default:./]-f,--fill: If True, tries to fill the gaps in the model.-fc,--formula-check: [none,existence,wildcard,strict] Set the filter for which metabolite formulas are valid to be added to the model. [default: existence]--no-dna: Exclude DNA reactions (name-based) from being added to the model.--no-rna: Exclude RNA reactions (name-based) from being added to the model.-p,--idprefix: Prefix for the pseudo-IDs, if an ID does not exists for the given namespace. [default: refineGEMs]-n,--namespace: [BiGG] Namespace used in the model. [default: BiGG]
ALG="KEGG"]--orgid: KEGG organism ID
ALG="BioCyc"]-gt,--genetable: Path to the BioCyc gene SmartTable.-rt,--reactable: Path to the BioCyc reaction SmartTable.--gff-bc: Path to the GFF.
ALG="Gene"]--gff-g: Path to the GFF.
--prot-prefix: Prefix for pseudo-protein IDs. [default: refineGEMs]--mail: Mail address for NCBI requests.--ncbi,--check-ncbi: Enable searching protein IDs in NCBI. This increases the runtime significantly.--fasta: Path to the protein FASTA of the model.--dmnd-db: Path to the SwissProt DIAMOND database.-sp-map,--swissprot-mapping: Path to the SwissProt mapping file (ID against EC and BRENDA)-s,--sensitivity: [sensitive,more-sensitive,very-sensitive,ultra-sensitive] Sensitivity mode for running DIAMOND. [default: more-sensitive]--cov: Coverage value (passed to DIAMOND) [default: 90.0]--pid: Percentage identity threshold value for filtering DIAMOND results. [default: 95.0]-t,--threads: Number of threads to be used by DIAMOND. [default: 2]
Constraints:
--mailis required if--check-ncbiis setif one of
--fasta,--dmnd-db,--swissprot-mappingis set, all need to be set--fasta,--dmnd-db,--swissprot-mappingare all required if any of--sensitivity,--cov,--pidand--threadsis set
refinegems media
Access the media part of the database.
refinegems media info
Retrieve information about the media database.
Options:
--list: List the available media.
refinegems polish
refinegems polish run [MODEL] [EMAIL] [PATH]
Options:
--id-db/-i: Main database that is the source of the IDs.--refseq-gff: Path to RefSeq GFF file of organism--protein-fasta/-p: File used as input for CarveMe, the protein fasta file.--lab-strain/-l: True if the strain was sequenced in a local lab.--kegg-organism-id/-k: KEGG Identifier for the organism, if available.
refinegems refine
Refine a model. Includes steps like biomass, charges, SBO annotation, reaction direction correction and addition of pathways and further gene product annotations.
refinegems refine biomass [MODELPATH]
Normalise the biomass objective function(s) of a model to improve the model consistency.
Options:
--cycles/-c: Maximal number of normalisation cycles to ensure the program is not running endlessly. The default is 10.--outfile/-o: Optional, filename to save the updated model under.
refinegems refine charges [MODELPATH]
Compare the charges in a model to the ModelSEED database and adjust them accordingly, if neccessary.
Options:
--dir/-d: Path to the output directory.
refinegems refine direction [MODELPATH]
Check and - if neccessary - correct the direction of the reactions in a model.
refinegems refine egcs [MODELPATH]
Indentify problematic energy generating cycles (EGCs) in the model an optionally, try to resolve them.
Options:
--solver/-s: Options to solve the EGCs. Following Solvers are currently available:greedy: Solve the EGCs by trying out modifications of single reactions (reversible, backward,forward,remove)
--namespace/-n: Namespace of the model.--compartment/-c: Compartments to check, separated by comma. Default isc,p--outfile/-o: File to save the updated model to.
refinegems refine annot
Add annotations to your model.
refinegems refine annot sboterms [MODELPATH]
Call the SBOannotator on a model to enhance/add SBOterms to the annotations.
refinegems refine annot pathways [MODELPATH]
Add the KEGG pathways as group entities to the model.
refinegems setup
Set-up tools, folder structure and more for running the program.
refinegems setup config
Download a configuration file for a specific functionality of the toolbox.
Options:
--filename/-f: Name or path of a file to save the config under.--type/-t: Type of config to download. Defaults tomedia.
refinegems setup data [DOWNLOADTYPE]
Download file(s) needed for a given functionality of the toolbox.
Current options include:
‘SwissProt_gapfill’: download the SwissProt sequences as a FASTA and a mapping of their ID to BRENDA/EC for gap-filling
Options:
--dir/-d: Path to directory to save the downloaded files to.--chunksize/-c: Size of the chunk to download (in kB) at once.
refinegems setup build_pancore [MODELS]
Using the given models, construct a pan-core model.
Options:
--based-on/-o: Option on how to compare the models, defaults toid, which compares and combines the models based in the IDs only.--name/-n: Set the name of the constructed pan-core model.--keep-genes/-g: Keep the genes in the pan-core model, otherwise they are deleted and only reactions and metabolites are kept.--resolve-compartments/--rcomp: Try to standardise the model’s compartment names.--dir/-d: Path to the output directory.