Welcome to refineGEMs!

refineGEMs is a Python package intended to help with the curation of genome-scale metabolic models (GEMS).

Hint

For bug reports please write issues on the GitHub page or open a discussion here.

Overview

Currently refineGEMs can be used for the investigation of a GEM, it can complete the following tasks:

  • loading GEMS with COBRApy and libSBML

  • report number of metabolites, reactions and genes

  • report orphaned, deadends and disconnected metabolites

  • report mass and charge unbalanced reactions

  • report Memote score

  • compare the genes present in the model to the genes found in: * the KEGG Database (Note: This requires a GFF file of your organism and the KEGG identifier of your organism.) * Or the BioCyc Database (Note: This requires that a database entry for your organism exists in BioCyc.)

  • compare the charges and masses of the metabolites present in the model to the charges and masses denoted in the ModelSEED Database.

Other applications of refineGEMs to curate a given model include:

  • The correction of a model created with CarveMe v.1.5.1 (for example moving all relevant information from the notes to the annotation field) this includes automated annotation of NCBI genes to the GeneProduct section of the model,

  • The addition of KEGG Pathways as Groups (using the libSBML Groups Plugin),

  • Updating the SBO-Term annotations based on SBOannotator[1],

  • Updating the annotation of metabolites and extending the model with reactions (for the purpose of filling gaps) based on a table filled by the user data/manual_annotations.xlsx (Note: This only works when the structure of the given table is used.),

  • And extending the model with all information surrounding reactions including the corresponding GeneProducts and metabolites by filling in the table data/modelName_gapfill_analysis_date_example.xlsx (Note: This also only works when the structure of the given Excel file is used).