# ![nf-core/circrna](docs/images/nf-core-circrna_logo.png)
**workflow for the quantification, differential expression analysis and miRNA target prediction analysis of circRNAs in RNA-Seq data**.
[![GitHub Actions CI Status](https://github.com/nf-core/circrna/workflows/nf-core%20CI/badge.svg)](https://github.com/nf-core/circrna/actions)
[![GitHub Actions Linting Status](https://github.com/nf-core/circrna/workflows/nf-core%20linting/badge.svg)](https://github.com/nf-core/circrna/actions)
[![Nextflow](https://img.shields.io/badge/nextflow-%E2%89%A520.04.0-brightgreen.svg)](https://www.nextflow.io/)
[![install with bioconda](https://img.shields.io/badge/install%20with-bioconda-brightgreen.svg)](https://bioconda.github.io/)
[![Docker](https://img.shields.io/docker/automated/nfcore/circrna.svg)](https://hub.docker.com/r/nfcore/circrna)
[![Get help on Slack](http://img.shields.io/badge/slack-nf--core%20%23circrna-4A154B?logo=slack)](https://nfcore.slack.com/channels/circrna)
## Introduction
**nf-core/circrna** is a bioinformatics pipeline used for the quantification, miRNA target prediction and differential expression analysis of circRNAs present in RNA sequencing data (currently supporting total RNA-Seq paired end sequencing data, mapped to *H. sapiens* Gencode reference genomes GRCh37, GRCh38 v34).
The pipleline has been developed in a modular fashion, permitting the user to select miRNA target prediction, differential expression analysis (or both) in addition to circRNA quantification to facilitate hypotheses surrounding circRNAs involvement in the competing endogenous RNA network.
The pipeline is built using [Nextflow](https://www.nextflow.io), a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with docker containers making installation trivial and results highly reproducible.
## Pipeline Summary
By default, `nf-core/circrna` utilises all 3 analysis modules: `circrna_discovery, mirna_prediction, differential_expression`.
The creation of reference genome indices and aligners used depends on the circRNA quantification tools selected, given by `--tool`.
### 1. Download Reference Genome Files
- Download Gencode GRCh37/GRCh38 *H. sapiens* reference genome, GTF files.
- Create customised annotation text file ([`gtfToGenePred`](https://anaconda.org/bioconda/ucsc-gtftogenepred))
- Download [`miRbase`](http://www.mirbase.org/ftp.shtml) mature miRNA sequences.
- Download [`TargetScan`](http://www.targetscan.org/cgi-bin/targetscan/data_download.vert72.cgi) miRNA sequences and family conservation information.
### 2. Create Reference Index Files
- [`SAMtools`](https://sourceforge.net/projects/samtools/files/samtools/) reference genome index.
- [`bwa`](https://sourceforge.net/projects/bio-bwa/files/) reference genome indices (`ciriquant` in `--tool`).
- [`HISAT2`](http://daehwankimlab.github.io/hisat2/download/) reference genome indices (`ciriquant` in `--tool`) .
- [`STAR`](https://github.com/alexdobin/STAR/releases) reference genome indices (`circexplorer2`, `circrna_finder`, `dcc` in `--tool`).
- [`Bowtie`](https://sourceforge.net/projects/bowtie-bio/) reference genome indices (`mapsplice` in `--tool`).
- [`Bowtie2`](http://bowtie-bio.sourceforge.net/bowtie2/index.shtml) reference genome indices (`find_circ` in `--tool`).
### 3. Miscellaneous circRNA Tool Requirements
- Split reference genome FASTA file per chromosome (`mapsplice`, `find_circ` in `--tool` ).
- Create `CIRIquant` input `.yml` file (`ciriquant` in `--tool`).
### 4. Stage Input Data
- Accept `BAM` (converted to FASTQ pairs using [`picard`](https://sourceforge.net/projects/picard/)) or FASTQ input data given as a path or `.csv` input file.
### 5. Quality Control
- Perform [`FastQC`](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) , [`MultiQC`](http://multiqc.info/) on raw input data.
- Optional adapter removal + read trimming performed by [`BBDUK`](https://sourceforge.net/projects/bbmap/).
- Perform [`FastQC`](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/) , [`MultiQC`](http://multiqc.info/) on trimmed reads.
### 6. circRNA Quantification
- Align RNA-Seq reads and perform circRNA quantification ([`ciriquant`](https://sourceforge.net/projects/ciri/files/CIRIquant/), [`circrna_finder`](https://github.com/orzechoj/circRNA_finder), [`circexplorer2`](https://circexplorer2.readthedocs.io/en/latest/tutorial/setup/#installation), [`find_circ`](https://github.com/marvin-jens/find_circ), [`dcc`](https://github.com/dieterich-lab/DCC), [`mapsplice`](https://anaconda.org/bioconda/mapsplice)).
- Raw quantification tool output.
- Filtered quantification tool output as BED6 file.
- Create circRNA count matrix.
### 7. circRNA Filtering
- Filter to remove:
- circRNAs with low evidence of reads aligned to back splice junction (read count < 2).
- circRNAs called by only one tool (when 2+ quantification tools selected).
### 8. circRNA Annotation
- Remove unwanted biotypes from Gencode GTF file.
- Calculate circRNA mature spliced length ([`BEDTools`](https://sourceforge.net/projects/bedtools/)).
- Annotate circRNA exon-intron boundaries, mark as `circRNA`, `ciRNA`, `EIciRNA`.
- Identify circRNA parent gene ([`BEDTools`](https://sourceforge.net/projects/bedtools/)).
- Create circRNA FASTA files ([`BEDTools`](https://sourceforge.net/projects/bedtools/)).
- Create circRNA BED12 files ([`BEDTools`](https://sourceforge.net/projects/bedtools/)).
- Create master annotation file reporting circRNA ID, circRNA type, mature length, parent gene, strand.
### 9. miRNA Target Prediction
- Identify miRNA response elements in mature circRNA sequence using [`miRanda`](https://anaconda.org/bioconda/miranda) and [`TargetScan`](http://www.targetscan.org/cgi-bin/targetscan/data_download.vert72.cgi).
- Create filtered miRNA targets file for each circRNA.
- Create circos plot of circRNA exons / filtered MRE sites.
- Raw [`miRanda`](https://anaconda.org/bioconda/miranda) output.
- Raw [`TargetScan`](http://www.targetscan.org/cgi-bin/targetscan/data_download.vert72.cgi) output.
### 10. miRNA Target Filtering
- Filter to remove:
- 6mers from `TargetScan` output.
- miRNAs with MFE <= -20.00 Kcal/Mol.
- Duplicate miRNA IDs targeting same circRNA MRE site (keep miRNA ID with highest score).
### 11. Differential Expression Analysis
- Perform RNA-Seq quantification using [`StringTie`](https://ccb.jhu.edu/software/stringtie/).
- Perform circRNA differential expression analysis using RNA-Seq library size factors to normalise circRNA count matrix, QC plots, PCA and clustering ([`R`](https://www.r-project.org/), [`DESeq2`](https://bioconductor.org/packages/release/bioc/html/DESeq2.html)).
- Create circRNA expression plots, circRNA-parent gene expression plots ([`R`](https://www.r-project.org/)).
- Create differentially expressed circRNA master file reporting circRNA ID, circRNA Type, mature length, parent gene, strand, Log2FC, pvalue, padj, parent gene description.
## Quick Start
1. Install [`nextflow`](https://nf-co.re/usage/installation)
2. Install any of [`Docker`](https://docs.docker.com/engine/installation/), [`Singularity`](https://www.sylabs.io/guides/3.0/user-guide/) or [`Podman`](https://podman.io/) for full pipeline reproducibility _(please only use [`Conda`](https://conda.io/miniconda.html) as a last resort; see [docs](https://nf-co.re/usage/configuration#basic-configuration-profiles))_
3. Download the pipeline and test it on a minimal dataset with a single command:
```bash
nextflow run nf-core/circrna -profile test,<docker/singularity/podman/conda/institute>
```
> Please check [nf-core/configs](https://github.com/nf-core/configs#documentation) to see if a custom config file to run nf-core pipelines already exists for your Institute. If so, you can simply use `-profile <institute>` in your command. This will enable either `docker` or `singularity` and set the appropriate execution settings
没有合适的资源?快使用搜索试试~ 我知道了~
circrna:circRNA定量,差异表达分析和miRNA靶标预测RNA-Seq数据
共82个文件
yml:13个
sh:10个
md:10个
5星 · 超过95%的资源 需积分: 37 12 下载量 90 浏览量
2021-03-11
00:53:30
上传
评论
收藏 990KB ZIP 举报
温馨提示
RNA-Seq数据中circRNA的定量,差异表达分析和miRNA目标预测分析的工作流程。 介绍 nf-core / circrna是一种生物信息学流水线,用于定量,miRNA靶标预测和RNA测序数据中存在的circRNA的差异表达分析(当前支持总RNA-Seq配对末端测序数据,已映射至智人Gencode参考基因组GRCh37, GRCh38 v34)。 pipleline已以模块化方式开发,除了circRNA定量外,还允许用户选择miRNA靶标预测,差异表达分析(或两者),以促进围绕circRNA参与竞争内源RNA网络的假设。 该管道是使用构建的, 是一种工作流工具,可以以非常便携的方式跨多个计算基础架构运行任务。 它带有docker容器,使安装变得简单,结果可高度重现。 管道摘要 默认情况下, nf-core/circrna使用所有3个分析模块: circrna_discovery
资源详情
资源评论
资源推荐
收起资源包目录
circrna-dev.zip (82个子文件)
circrna-dev
bin
unwanted_biotypes.txt 648B
get_parent_genes.sh 543B
adapters.fa 14KB
filter_CIRIquant.sh 446B
targetscan_70_BL_PCT.pl 23KB
annotate_circs.R 2KB
get_mature_seq.sh 4KB
ID_to_BED.sh 319B
circ_report.R 7KB
annotate_report.R 947B
targetscan_count_8mers.pl 3KB
split_fasta.sh 119B
find_circ.py 18KB
DEA.R 19KB
targetscan_70_context_scores.pl 67KB
unmapped2anchors.py 3KB
circRNA_counts_matrix.py 921B
mirna_circos.R 6KB
filter_DCC.sh 274B
check_empty.sh 541B
consolidate_algorithms_original.R 3KB
targetscan_70_BL_bins.pl 9KB
scrape_software_versions.py 2KB
targetscan_70.pl 29KB
scorethresh.py 743B
merge_bed.py 2KB
cmp_bed.py 2KB
sum.py 557B
filter_circexplorer2.sh 1KB
markdown_to_html.py 3KB
DE.R 13KB
consolidate_algorithms.R 2KB
find_circ.sh 1KB
prep_circos.sh 1KB
UROBORUS.pl 82KB
conf
igenomes.config 31KB
base.config 2KB
test_full.config 1KB
nuig.config 713B
test.config 1KB
docs
images
nf-core-circrna_logo.png 19KB
circ_gene_lineplot.png 149KB
circos_plot.png 659KB
boxplot.png 100KB
README.md 510B
output.md 34KB
usage.md 8KB
main.nf 66KB
environment.yml 1KB
.github
PULL_REQUEST_TEMPLATE.md 866B
ISSUE_TEMPLATE
bug_report.md 1KB
feature_request.md 809B
config.yml 290B
workflows
linting_comment.yml 810B
linting.yml 2KB
push_dockerhub_release.yml 1KB
push_dockerhub_dev.yml 979B
branch.yml 2KB
awstest.yml 2KB
ci.yml 2KB
awsfulltest.yml 2KB
.dockstore.yml 135B
CONTRIBUTING.md 3KB
markdownlint.yml 113B
assets
email_template.txt 1KB
sendmail_template.txt 1KB
nf-core-circrna_social_preview.svg 26KB
nf-core-circrna_logo.png 11KB
multiqc_config.yaml 453B
nf-core-circrna_social_preview.png 58KB
email_template.html 3KB
Dockerfile 3KB
containers
py3
environment.yml 111B
Dockerfile 502B
LICENSE 1KB
nextflow_schema.json 34KB
.gitignore 73B
CHANGELOG.md 376B
CODE_OF_CONDUCT.md 3KB
README.md 10KB
.gitattributes 36B
nextflow.config 6KB
共 82 条
- 1
Jmoh
- 粉丝: 20
- 资源: 4676
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功
评论1