##### Contents
[Overview] (#overview)
[Copy right] (#copyright)
[How to cite BRANCH?] (#cite)
[Short Manual] (#manual)
<a name="overview"/>
### Overview
BRANCH is a software that extends de novo transfrags and identifies novel transfrags with DNA contigs or genes of close related species. BRANCH discovers novel exons first and then extends/joins fragmented de novo transfrags, so that the resulted transfrags are more complete.
<a name="copyright"/>
###Copy right
BRANCH is under the [Artistic License 2.0](http://opensource.org/licenses/Artistic-2.0).
<a name="cite"/>
### How to cite BRANCH?
If you use BRANCH, please cite the following paper:
Bao E, Jiang T, Girke T (2013). BRANCH: boosting RNA-Seq assemblies with partial or related genomic sequences. Bioinformatics: [epub](http://bioinformatics.oxfordjournals.org/content/29/10/1250).
<a name="manual"/>
### Short manual
1. System requirements
BRANCH is suitable for 32-bit or 64-bit machines with Linux operating systems. At least 4GB of system memory is recommended for assembling larger data sets.
2. Installation
The [LEMON](http://lemon.cs.elte.hu/trac/lemon) graph library is required to compile and run BRANCH.
The [BLAT](http://genome.ucsc.edu/FAQ/FAQblat.html) aligner is required to run BRANCH and the modified version (distributed with BRANCH) is highly recommended.
* Download the .cpp file.
* If LEMON is already installed in your system, execute the command line: `g++ -o BRANCH BRANCH.cpp -lemon -lpthread`; otherwise, down load LEMON, compile it, and execute: `g++ -o BRANCH -I PATH2LEMON/include BRANCH.cpp -L PATH2LEMON/bin -lpthread`.
* To use the modified BLAT, put it to your $PATH: `export PATH=PATH2BLAT:$PATH`.
3. Input
* Single- or paired-end RNA reads in FASTA format.
* De novo transfrags assembled by any de novo RNA assembler (Velvet/Oases, Trinity, etc.).
* DNA contigs assembled by any de novo DNA assembler (Velvet, ABySS, etc.) or genome/gene sequences from a closely related species.
4. Using BRANCH
```
BRANCH --read1 reads_1.fa --read2 reads_2.fa --transfrag transfrags.fa --contig contigs.fa --transcript transcripts.fa [--insertLow insertLow --insertHigh insertHigh --threshSize threshSize --threshCov threshCov --threshSplit threshSplit --threshConn threshConn --closeGap --noAlignment]
```
Inputs:
--read1 is the first pair of PE RNA reads or single-end RNA reads in fasta format
--read2 is the second pair of PE RNA reads in fasta format
--transfrag is the de novo RNA transfrags to be extended
--contig is the reference DNA contigs
Output:
--transcript is the extended de novo transfrags
Options:
--insertLow is the lower bound of insert length (highly recommended; default: 0)
--insertHigh is the upper bound of insert length (highly recommended; default: 99999)
--threshSize is the minimum size of a genome region that could be identified as an exon (default: 2 bp)
--threshCov is the minimum coverage of a genome region that could be identified as an exon (default: 2)
--threshSplit is the minimum upstream and downstream junction coverages to split a genome region into more than one exons (default: 2)
--threshConn is the minimum connectivity of two exons that could be identified as a splice junction (default: 2)
--closeGap closes sequencing gaps using PE read information (default: none)
--noAlignment skips the initial time-consuming alignment step, if all the alignment files have been provided in tmp directory (default: none)
--misassemblyRemoval detects and then breaks at or removes misassembed regions (default: none)
5. Output
BRANCH outputs the transfrag file in FASTA format. It contains all the improved transfrags.
6. Important things to note
* Single-end reads should have the same length and are not recommended, since the quality of single-end alignment is hard to be kept.
* It is better to use related gene sequences rather than related genome sequences to greatly reduce run time and memory usage.
* Though --insertLow and --insertHigh are options, they should always be specified to generate meaning result. Suppose the insert length is I, insertLow = I - 20 and insertHigh = I + 20 would be fine.
没有合适的资源?快使用搜索试试~ 我知道了~
使用部分或相关基因组序列增强RNA-Seq组装.zip
共97个文件
h:91个
pc:1个
md:1个
1.该资源内容由用户上传,如若侵权请联系客服进行举报
2.虚拟产品一经售出概不退款(资源遇到问题,请及时私信上传者)
2.虚拟产品一经售出概不退款(资源遇到问题,请及时私信上传者)
版权申诉
0 下载量 63 浏览量
2023-03-31
22:35:50
上传
评论
收藏 772KB ZIP 举报
温馨提示
使用部分或相关基因组序列增强RNA-Seq组装
资源推荐
资源详情
资源评论
收起资源包目录
使用部分或相关基因组序列增强RNA-Seq组装.zip (97个子文件)
BRANCH-master
BRANCH
BRANCH 845KB
lemon
include
lemon
dheap.h 11KB
lgf_reader.h 81KB
color.h 6KB
connectivity.h 50KB
smart_graph.h 22KB
suurballe.h 23KB
glpk.h 6KB
concept_check.h 2KB
kruskal.h 10KB
preflow.h 30KB
nauty_reader.h 3KB
math.h 2KB
dimacs.h 15KB
planarity.h 84KB
min_cost_arborescence.h 24KB
network_simplex.h 50KB
fib_heap.h 14KB
cost_scaling.h 42KB
bucket_heap.h 17KB
arg_parser.h 13KB
full_graph.h 17KB
lp_skeleton.h 6KB
pairing_heap.h 14KB
assert.h 7KB
cycle_canceling.h 38KB
graph_to_eps.h 38KB
cplex.h 7KB
bfs.h 54KB
binomial_heap.h 13KB
soplex.h 4KB
radix_sort.h 15KB
error.h 7KB
capacity_scaling.h 31KB
euler.h 8KB
bellman_ford.h 38KB
config.h 667B
static_graph.h 15KB
howard_mmc.h 18KB
lgf_writer.h 54KB
bin_heap.h 10KB
quad_heap.h 11KB
grid_graph.h 19KB
radix_heap.h 13KB
lp.h 2KB
windows.h 972B
matching.h 110KB
dim2.h 17KB
bits
bezier.h 5KB
edge_set_extender.h 14KB
alteration_notifier.h 15KB
enable_if.h 3KB
solver_bits.h 5KB
variant.h 14KB
graph_adaptor_extender.h 8KB
traits.h 7KB
default_map.h 5KB
path_dump.h 5KB
array_map.h 10KB
map_extender.h 7KB
graph_extender.h 17KB
vector_map.h 7KB
cbc.h 4KB
edge_set.h 37KB
tolerance.h 7KB
maps.h 117KB
path.h 30KB
lp_base.h 61KB
random.h 30KB
circulation.h 25KB
dfs.h 50KB
hartmann_orlin_mmc.h 19KB
adaptors.h 110KB
fractional_matching.h 63KB
gomory_hu.h 17KB
list_graph.h 45KB
counter.h 8KB
hypercube_graph.h 13KB
karp_mmc.h 17KB
clp.h 4KB
hao_orlin.h 31KB
unionfind.h 49KB
core.h 59KB
dijkstra.h 43KB
elevator.h 26KB
time_measure.h 14KB
concepts
graph_components.h 48KB
heap.h 10KB
maps.h 6KB
path.h 9KB
digraph.h 15KB
graph.h 25KB
lib
pkgconfig
lemon.pc 310B
blat 290KB
BRANCH.cpp 84KB
.gitignore 126B
README.md 4KB
共 97 条
- 1
资源评论
快撑死的鱼
- 粉丝: 1w+
- 资源: 9156
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功