# mgPipe
## Introduction
mgPipe is a MATLAB based pipeline to integrate microbial abundances (coming
from metagenomic data) with constraint-based modeling, creating
individuals' personalized models. The pipeline is divided into 3 parts: [PART 1]
Analysis of individuals' specific microbes abundances is computed. Individuals'
metabolic diversity in relation to microbiota size and disease presence as well
as Classical multidimensional scaling (PCoA) on individuals' reaction
repertoire are examples. [PART 2]: 1 Constructing a global metabolic model
(setup) containing all the microbes listed in the study. 2 Building
individuals' specific models integrating abundance data retrieved from
metagenomics. For each organism, reactions are coupled to their objective
function. [PART 3] Simulations under different diet regimesand analysis of the
predicted metabolic profile (PCoA of computed MNPCs of
individuals).
**WARNING:** Please take into consideration only the files listed in this
document. Everything present in the folder but not listed and explained in this
document is to be considered not relevant or obsolete.
## Requirements
mgPipe requires `Matrix Laboratory`, the `Parallel Computing Toolbox`, as well
as, the COBRA Toolbox to be installed. Please refer to
the [installation instructions](https://opencobra.github.io/cobratoolbox/stable/installation.html).
The usage of `ILOG CPLEX` solver is strongly advised to obtain the best speed performance
(required for the fastFVA.m function).
MgPipe was created (and tested) for AGORA 1.0 please first download AGORA
version 1.0 from https://www.vmh.life/#downloadview and place the mat files into
a dedicated folder.
## Main Folder Structure and Files
The following files are essential for the usage of the pipeline and are
supplied in the current folder and in `papers/2018_microbiomeModelingToolbox`
| Filename | Purpose |
| -----------------------------------------------|------------------------------------------------------------------------|
| startMgPipe.m | *driver, containing all the input variables, to be modified by the user* |
| initMgPipe.m | *function containing all the input variables launching the pipeline* |
| loadUncModels.m | *function to load unconstrain and prune the models* |
| fastSetupCreator.m | *function to create "global" setup* |
| checkNomenConsist.m | *function to check that microbes have the right nomenclature* |
| detectOutput.m | *function to check if a specific file was already created and saved* |
| getIndividualSizeName.m | *get information on number and ID of organisms and individuals*|
| addMicrobeCommunityBiomass.m | *function to add community biomass* |
| mgPipe.m | *pipeline* |
| parsave.m | *function to allow object saving in parallel loops* |
| getMappingInfo.m | *function to extract information from the mapping* |
| plotMappingInfo.m | *function plot extracted information from the mapping* |
| createPersonalizedModel.m | *function to create personalized models* |
| microbiotaModelSimulator.m | *function to simulate under different diets the created models (called from mgPipe)*|
| makeDummyModel.m | *function to create a dummy model* |
| mgSimResCollect.m | *function to collect and output simulation results* |
| extractFullRes.m | *function to retrieve and export all the results (fluxes) computed during the simulations*|
| README.md | *this file* |
| useDiet.m | *function to impose a specific diet and add essential elements to microbiota models*|
| adaptVMHDietToAGORA.m | *function to convert a specific diet from VMH into an AGORA compatible one*|
| ***compfile*** | *Results subfolder: contains objects saved in open format* |
## Usage
Once installed the necessary dependencies the pipeline is ready to be used at
the condition that some input variables are inserted or changed from the
default input file `startMgPipe.m` or directly in the input function
`initMgPipe.m`.
Running the script called `startMgPipe.m` (after having changed the necessary
inputs) is the only action required from the user to start the
pipeline.
The pipeline can be stopped at every moment as all the results are saved as
soon as they are computed. In case of accidental or volunteer halt in the
execution, the pipeline can be simply restarted without loss of time running
again `startMgPipe.m` : already saved results (from previous runs) are
automatically detected and not recomputed.
## Inputs
Some specific information files need to be loaded by the pipeline. For this
reason, they must be formatted and called in a specific way. See the examples
folder in `papers/2018_microbiomeModelingToolbox` for more information.
The files needed are
| File | Description |
| -----------------------|----------------------------------------------------------------------------------------------|
| normCoverage.csv | table with abundances for each species (normalized to 1, with a minimum value as a threshold to define presence)|
| sampInfo.csv | optional: table of the same length of the number of individuals (0 means patient with disease 1 means healthy)|
Some variables, in the input file, needs to be created/modified to specify
inputs (for example paths of directories containing organisms models). The
variables which need to be created or changed from default are
| Variables | Description |
| ----------------|--------------------------------------|
| modPath | path to microbes models |
| resPath | path to the directory containing results|
| dietFilePath | path to and name of the file with dietary information|
| abunFilePath | path to and name of the file with abundance information|
| indInfoFilePath | path to csv file for stratification criteria (if empty or not existent no criteria is used)|
| objre | name of the objective function of microbes|
| figForm | the output is a vectorized picture ('-depsc'), change to '-dpng' for .png|
| numWorkers | number of cores dedicated for parallelization|
| autoFix | option to automatically solve possible issues (true means on) |
| compMod | if outputs in open format should be produced for some sections |
| rDiet | if to simulate also a rich diet (rdiet=true)|
| extSolve | option to save microbiota models with diet to simulate with a different language (true means yes) |
| fvaType | which FVA function to use (fastFVA =true for fastFVA) |
The `autorun` variable controls the behavior of the pipeline. The autorun
functionality is automatically off. This functionality enables the pipeline to
automatically run and detect outputs. By changing `autorun` variable to false, it is
possible to enter in manual / debug mode.
**WARNING**: concerning the `autorun` v
没有合适的资源?快使用搜索试试~ 我知道了~
资源推荐
资源详情
资源评论
收起资源包目录
MATLAB工具箱大全-基于约束的重构分析工具箱Cobratoolbox (1739个子文件)
cplexFVA.c 47KB
lu1fac.c 8KB
lu1fac_old.c 8KB
sort.c 2KB
ChangeLog 7KB
COPYING 18KB
glpkcc.cpp 29KB
COBRA_structure_fields.csv 10KB
testData_normCoverageReduced.csv 280B
.documenter.enc 4KB
iAF1260_compare.eps 139KB
mi27lu.f 213KB
mi15blas.f 7KB
reactionCompareGUI.fig 78KB
metCompareGUI.fig 46KB
ReconstructionTool.fig 22KB
ReconstructionCreator.fig 16KB
model_stats.fig 14KB
addreactions.fig 6KB
load_reaction.fig 5KB
CreateGPR.fig 4KB
compartment.fig 4KB
model_description.fig 4KB
rBioNetAbout.fig 4KB
addmetabolites.fig 4KB
rBioNetSettings.fig 3KB
addMetInfo.fig 3KB
addmissing.fig 3KB
metabolites_in_database.fig 3KB
unbalanced.fig 3KB
met_charge_formula.fig 3KB
Keep_reactions.fig 3KB
sim_general.fig 2KB
GenesAndReactions.fig 2KB
similarities.fig 2KB
.gitattributes 175B
.gitignore 4KB
.gitignore 71B
.gitmodules 2KB
optForce.gms 14KB
findMustLL.gms 12KB
findMustUL.gms 12KB
findMustUU.gms 12KB
findMustU.gms 11KB
findMustL.gms 11KB
licememo.gms 4KB
nrutil.h 2KB
nearestSPD_demo.html 16KB
README.html 4KB
indexsidebar.html 466B
index.html 120B
INSTALL 3KB
poi-ooxml-schemas-3.8-20120326.jar 4.49MB
xmlbeans-2.3.0.jar 2.54MB
poi-3.8-20120326.jar 1.74MB
poi-ooxml-3.8-20120326.jar 911KB
MatlabPlot.jar 358KB
dom4j-1.6.1.jar 307KB
stax-api-1.0.1.jar 26KB
LICENSE 27KB
LICENSE 1KB
LICENSE 1KB
lu1fac.log 0B
2010_07_30_KEGG_reaction.lst 370KB
testTest_KEGG_Reaction_List.lst 289B
fgf_test_rxn_db.lst 78B
test4HumanFctExt.m 344KB
findjobj.m 154KB
getStructureEnum.m 143KB
getStructureFieldnames.m 112KB
getValueType.m 100KB
getDefaultValues.m 96KB
solveCobraLP.m 68KB
advancedColormap.m 63KB
optForceWithGAMS.m 58KB
forwardReversibleFigures2.m 54KB
setThermoReactionDirectionalityiAF1260.m 52KB
optForce.m 50KB
SteadyComCplex.m 46KB
computeMetFormulae.m 46KB
ReconstructionTool.m 43KB
ReconstructionCreator.m 43KB
setupThermoModel_vonB_1.m 42KB
SimPhenyIDs.m 42KB
SteadyCom.m 40KB
findStoichConsistentSubset.m 39KB
calcdGHT.m 36KB
SteadyComPOAgr.m 35KB
venn.m 34KB
forwardReversibleFigures.m 32KB
reactionCompareGUI.m 32KB
solveCobraLPCPLEXcard.m 32KB
draw_by_rxn.m 32KB
testSteadyCom.m 32KB
BuildMPS.m 29KB
testATPYieldFromCsources.m 29KB
findMustUU.m 28KB
findMustUL.m 28KB
findMustLL.m 28KB
checkModelProperties.m 28KB
共 1739 条
- 1
- 2
- 3
- 4
- 5
- 6
- 18
资源评论
- 崔鹏252024-05-05感谢资源主的分享,这个资源对我来说很有用,内容描述详尽,值得借鉴。
- weixin_433045772022-02-17用户下载后在一定时间内未进行评价,系统默认好评。
- weixin_399962242021-08-27用户下载后在一定时间内未进行评价,系统默认好评。
- Jonson·D2021-05-27用户下载后在一定时间内未进行评价,系统默认好评。
普通网友
- 粉丝: 12w+
- 资源: 9335
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功