# 
**workflow for the quantification, differential expression analysis and miRNA target prediction analysis of circRNAs in RNA-Seq data**.
[](https://github.com/nf-core/circrna/actions)
[](https://github.com/nf-core/circrna/actions)
[](https://www.nextflow.io/)
[](https://bioconda.github.io/)
[](https://hub.docker.com/r/nfcore/circrna)
[](https://nfcore.slack.com/channels/circrna)
## Introduction
**nf-core/circrna** is a bioinformatics pipeline used for the quantification, miRNA target prediction and differential expression analysis of circRNAs present in RNA sequencing data (currently supporting total RNA-Seq paired end sequencing data, mapped to *H. sapiens* Gencode reference genomes GRCh37, GRCh38 v34).
The pipleline has been developed in a modular fashion, permitting the user to select miRNA target prediction, differential expression analysis (or both) in addition to circRNA quantification to facilitate hypotheses surrounding circRNAs involvement in the competing endogenous RNA network.
The pipeline is built using [Nextflow](https://www.nextflow.io), a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with docker containers making installation trivial and results highly reproducible.
## Pipeline Summary
By default, `nf-core/circrna` utilises all 3 analysis modules: `circrna_discovery, mirna_prediction, differential_expression`.
The creation of reference genome indices and aligners used depends on the circRNA quantification tools selected, given by `--tool`.
### 1. Download Reference Genome Files
- Download Gencode GRCh37/GRCh38 *H. sapiens* reference genome, GTF files.
- Create customised annotation text file ([`gtfToGenePred`](https://anaconda.org/bioconda/ucsc-gtftogenepred))
- Download [`miRbase`](http://www.mirbase.org/ftp.shtml) mature miRNA sequences.
- Download [`TargetScan`](http://www.targetscan.org/cgi-bin/targetscan/data_download.vert72.cgi) miRNA sequences and family conservation information.
### 2. Create Reference Index Files
- [`SAMtools`](https://sourceforge.net/projects/samtools/files/samtools/) reference genome index.
- [`bwa`](https://sourceforge.net/projects/bio-bwa/files/) reference genome indices (`ciriquant` in `--tool`).
- [`HISAT2`](http://daehwankimlab.github.io/hisat2/download/) reference genome indices (`ciriquant` in `--tool`) .
- [`STAR`](https://github.com/alexdobin/STAR/releases) reference genome indices (`circexplorer2`, `circrna_finder`, `dcc` in `--tool`).
- [`Bowtie`](https://sourceforge.net/projects/bowtie-bio/) reference genome indices (`mapsplice` in `--tool`).
- [`Bowtie2`](http://bowtie-bio.sourceforge.net/bowtie2/index.shtml) reference genome indices (`find_circ` in `--tool`).
### 3. Miscellaneous circRNA Tool Requirements
- Split reference genome FASTA file per chromosome (`mapsplice`, `find_circ` in `--tool` ).
- Create `CIRIquant` input `.yml` file (`ciriquant` in `--tool`).
### 4. Stage Input Data
- Accept `BAM` (converted to FASTQ pairs using [`picard`](https://sourceforge.net/projects/picard/)) or FASTQ input data given as a path or `.csv` input file.
### 5. Quality Control
- Perform [`FastQC`](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) , [`MultiQC`](http://multiqc.info/) on raw input data.
- Optional adapter removal + read trimming performed by [`BBDUK`](https://sourceforge.net/projects/bbmap/).
- Perform [`FastQC`](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) , [`MultiQC`](http://multiqc.info/) on trimmed reads.
### 6. circRNA Quantification
- Align RNA-Seq reads and perform circRNA quantification ([`ciriquant`](https://sourceforge.net/projects/ciri/files/CIRIquant/), [`circrna_finder`](https://github.com/orzechoj/circRNA_finder), [`circexplorer2`](https://circexplorer2.readthedocs.io/en/latest/tutorial/setup/#installation), [`find_circ`](https://github.com/marvin-jens/find_circ), [`dcc`](https://github.com/dieterich-lab/DCC), [`mapsplice`](https://anaconda.org/bioconda/mapsplice)).
- Raw quantification tool output.
- Filtered quantification tool output as BED6 file.
- Create circRNA count matrix.
### 7. circRNA Filtering
- Filter to remove:
- circRNAs with low evidence of reads aligned to back splice junction (read count < 2).
- circRNAs called by only one tool (when 2+ quantification tools selected).
### 8. circRNA Annotation
- Remove unwanted biotypes from Gencode GTF file.
- Calculate circRNA mature spliced length ([`BEDTools`](https://sourceforge.net/projects/bedtools/)).
- Annotate circRNA exon-intron boundaries, mark as `circRNA`, `ciRNA`, `EIciRNA`.
- Identify circRNA parent gene ([`BEDTools`](https://sourceforge.net/projects/bedtools/)).
- Create circRNA FASTA files ([`BEDTools`](https://sourceforge.net/projects/bedtools/)).
- Create circRNA BED12 files ([`BEDTools`](https://sourceforge.net/projects/bedtools/)).
- Create master annotation file reporting circRNA ID, circRNA type, mature length, parent gene, strand.
### 9. miRNA Target Prediction
- Identify miRNA response elements in mature circRNA sequence using [`miRanda`](https://anaconda.org/bioconda/miranda) and [`TargetScan`](http://www.targetscan.org/cgi-bin/targetscan/data_download.vert72.cgi).
- Create filtered miRNA targets file for each circRNA.
- Create circos plot of circRNA exons / filtered MRE sites.
- Raw [`miRanda`](https://anaconda.org/bioconda/miranda) output.
- Raw [`TargetScan`](http://www.targetscan.org/cgi-bin/targetscan/data_download.vert72.cgi) output.
### 10. miRNA Target Filtering
- Filter to remove:
- 6mers from `TargetScan` output.
- miRNAs with MFE <= -20.00 Kcal/Mol.
- Duplicate miRNA IDs targeting same circRNA MRE site (keep miRNA ID with highest score).
### 11. Differential Expression Analysis
- Perform RNA-Seq quantification using [`StringTie`](https://ccb.jhu.edu/software/stringtie/).
- Perform circRNA differential expression analysis using RNA-Seq library size factors to normalise circRNA count matrix, QC plots, PCA and clustering ([`R`](https://www.r-project.org/), [`DESeq2`](https://bioconductor.org/packages/release/bioc/html/DESeq2.html)).
- Create circRNA expression plots, circRNA-parent gene expression plots ([`R`](https://www.r-project.org/)).
- Create differentially expressed circRNA master file reporting circRNA ID, circRNA Type, mature length, parent gene, strand, Log2FC, pvalue, padj, parent gene description.
## Quick Start
1. Install [`nextflow`](https://nf-co.re/usage/installation)
2. Install any of [`Docker`](https://docs.docker.com/engine/installation/), [`Singularity`](https://www.sylabs.io/guides/3.0/user-guide/) or [`Podman`](https://podman.io/) for full pipeline reproducibility _(please only use [`Conda`](https://conda.io/miniconda.html) as a last resort; see [docs](https://nf-co.re/usage/configuration#basic-configuration-profiles))_
3. Download the pipeline and test it on a minimal dataset with a single command:
```bash
nextflow run nf-core/circrna -profile test,<docker/singularity/podman/conda/institute>
```
> Please check [nf-core/configs](https://github.com/nf-core/configs#documentation) to see if a custom config file to run nf-core pipelines already exists for your Institute. If so, you can simply use `-profile <institute>` in your command. This will enable either `docker` or `singularity` and set the appropriate execution settings
没有合适的资源?快使用搜索试试~ 我知道了~
circrna:circRNA定量,差异表达分析和miRNA靶标预测RNA-Seq数据

共82个文件
yml:13个
sh:10个
md:10个


温馨提示
RNA-Seq数据中circRNA的定量,差异表达分析和miRNA目标预测分析的工作流程。 介绍 nf-core / circrna是一种生物信息学流水线,用于定量,miRNA靶标预测和RNA测序数据中存在的circRNA的差异表达分析(当前支持总RNA-Seq配对末端测序数据,已映射至智人Gencode参考基因组GRCh37, GRCh38 v34)。 pipleline已以模块化方式开发,除了circRNA定量外,还允许用户选择miRNA靶标预测,差异表达分析(或两者),以促进围绕circRNA参与竞争内源RNA网络的假设。 该管道是使用构建的, 是一种工作流工具,可以以非常便携的方式跨多个计算基础架构运行任务。 它带有docker容器,使安装变得简单,结果可高度重现。 管道摘要 默认情况下, nf-core/circrna使用所有3个分析模块: circrna_discovery
资源详情
资源评论
资源推荐
收起资源包目录






























































































共 82 条
- 1

























Jmoh
- 粉丝: 35
上传资源 快速赚钱
我的内容管理 展开
我的资源 快来上传第一个资源
我的收益
登录查看自己的收益我的积分 登录查看自己的积分
我的C币 登录后查看C币余额
我的收藏
我的下载
下载帮助


最新资源
- 互联网背景下的财务会计管理问题研究(1).docx
- 电子商务物流服务合同书(1)(1).docx
- 图书管理系统需求分析报告(3)(1).doc
- cad对象特性常用快捷键命令(1).docx
- 基于互联网+的高职学生创新创业能力培养策略研究(1).docx
- 基于PLC的物料分拣系统设计(1).docx
- 教育信息化建设经费保障制度(1).doc
- 基于Proteus的步进电机闭环控制仿真(3)(1).doc
- 网站设计基础教程整本书电子讲义完整版ppt课件全书教学教程最全教学课件(最新)(1).pptx
- 引领农村电子商务-助推农村建设工程中国新农村月刊大集大利移动应用电子平台启动(1).docx
- 软件公司管理制度-(1)(1).doc
- 基于互联网金融的中小企业融资创新路径研究(1).docx
- 传输技术在通信工程中的应用(1).docx
- 类分裂的代码混淆技术的论文-计算机应用论文(1).docx
- 解析电力自动化系统在配网运行管理中的应用(1).docx
- 自动化生产线安装与调试毕业论文(1).doc
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈



安全验证
文档复制为VIP权益,开通VIP直接复制

评论1