没有合适的资源?快使用搜索试试~ 我知道了~
温馨提示
Technical Report Improvement of Fitch functionfor Maximum Parsimony in Phylogenetic Reconstructionwith Intel AVX2 assembler instructionsResearch Lab: LERIA TR20130624-1Version 1.0 24 June 2013JEAN-M ICHEL RICHER Office: H206 Address: 2 Boulevard Lavoisier, 49045 Angers Cedex 01 Phone: (+33) (0)2-41-73-52-34 Email: jean-michel.richer@univ-angers.frAbstractThe Maximum Parsimony problem aims at reconstructing a phylogenetic tree from DNA, RNA or protein sequences while minimizing the number of evol
资源推荐
资源详情
资源评论
Technical Report
Improvement of Fitch function
for Maximum Parsimony
in Phylogenetic Reconstruction
with Intel AVX2 assembler instructions
Research Lab: LERIA
TR20130624-1
Version 1.0
24 June 2013
JEAN-MICHEL RICHER
Office: H206
Address: 2 Boulevard Lavoisier, 49045 Angers Cedex 01
Phone: (+33) (0)2-41-73-52-34
Email: jean-michel.richer@univ-angers.fr
Abstract
The Maximum Parsimony problem aims at reconstructing a phylogenetic tree from DNA,
RNA or protein sequences while minimizing the number of evolutionary changes. Much
work has been devoted by the Computer Science community to solve this NP-complete prob-
lem and many techniques have been used or designed in order to decrease the computation
time necessary to obtain an acceptable solution. In this paper we report an improvement of the
evaluation of the Fitch function for Maximum Parsimony using AVX2 assembler instruction
of Intel
TM
processors.
1 Introduction
This report is an extension of the technical report TR20080428-1 by the same author. We give here a new
version of the assembler code and we have performed some tests on an Intel Haswell processor to verify if
the AVX2 assembler instruction set gave any improvement over SSE2. For more details we refer the reader
to the technical report TR20080428-1.
1.1 Software improvement using AVX2 instructions
The release of the new Haswell architecture of Intel processor in June 2013 led to the introduction of
AVX2 (Advanced Vector Extensions, version 2) assembler instructions. With AVX, introduced in 2008, the
width of the SIMD registers is increased from 128 bits to 256 bits and the SSE registers xmm0-xmm15 are
renamed to ymm0-ymm15 for a 64 bits architecture.
AVX2 extensions like SSE2 (Streaming SIMD Extensions) instructions of modern x86 processors (Intel,
AMD) help vectorize the code, i.e. apply the same instruction on multiple data at the same time conse-
quently reducing the overall execution time.
In our implementation of phylogenetic reconstruction with Maximum Parsimony using Fitch criterion,
the main function that benefits from the use of vectorization is the computation of a hypothetical parsimony
sequence from two existing sequences. The C code of this function is given figure 1 and takes as input two
sequences x and y of a given size. The outputs are the hypothetical taxon z and the number of changes
(or differences) returned by the function.
1 int fitch(char x[], char y[], char z[], int size) {
2 int i, changes=0;
3 for (i = 0; i < length; ++i) {
4 z[i] = x[i] & y[i];
5 if (z[i] == 0) {
6 ++changes;
7 z[i] = x[i] | y[i];
8 }
9 }
10 return changes;
11 }
Figure 1: Fitch Parsimony function
Modern compiler (gcc GNU, icc Intel) are not able to vectorize the code of this function efficiently if
no implementation specific information is provided. It is then necessary to code the function in assembler
to get a significant improvement during the execution of the program.
The implementation with AVX2 is nearly the same as the one given in report TR20080428-1 for SSE2:
1. we first load into registers ymm0 and ymm1 the first 32 bytes of each taxon (x and y)
2. in ymm2 and ymm3, we respectively compute the binary-AND and the binary-OR of ymm0 and
ymm1 using instructions vpand and vpor (for parallel AND and parallel OR).
2
剩余6页未读,继续阅读
资源评论
weixin_38584642
- 粉丝: 5
- 资源: 945
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
最新资源
- 电路分析基础 实验五 RLC串联谐振的multisim仿真
- 2个月涨粉8w,新玩法AI做漫画小说赛道,操作简单可批量制作,新手小白....mp4
- 高分辨率下的遥感目标分割
- 网络攻防原理与技术-实验8资料.7z
- 电机控制器,永磁同步电机调速控制软件工程PMSM,该工程主要基于DSP28335硬件控制平台,两电平IPM模块主回路,通过位置传感器,速度传感器实时检测位置和速度信号,电流传感器采集电流信号,控制器控
- 24年快手无人直播暴利变现3.0,直播间人气轻松破千上热门,普通人也能....mp4
- 2024年9月28日支付宝分成最新搬运玩法.mp4
- 西门子1200PLC模板通讯程序 modbus 包含多种通讯Modbus-RTU(485),S7通讯,Modbus-TCP,TCP IP等,博途V16及较新版本可打开,简单明了,初学者也能明白
- ICED Smart 网站部署教程文件
- 2024淘宝暴力掘金 单机500+.mp4
- 2024年最新暴力起店玩法,拼多多虚拟电商4.0,24小时实现成交,单人可以...mp4
- 2024影视解说最新玩法,AI一键生成原创影视解说, 十秒钟制作成品,解....mp4
- 2024掌握拼多多运营精髓:爆款流程、定价技巧与SKU设计实战课.mp4
- 房屋租赁推荐系统 房租租赁系统 基于协同过滤的房屋租赁推荐系统 特色功能:协同过滤推荐 对于房租租赁,结合实际场景选择用户对房子的浏览次数作为数据集,体现用户喜好度,应用余弦相似度,实现基于用户协同过
- X6程序模块 AE-10D00.GDL
- AspSweb网页服务器1.0.0.93
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功