Python库|pyfasta-0.2.5.tar.gz资源-CSDN文库

版权申诉

51 浏览量 2022-04-13 07:24:33 上传评论收藏 6KB GZ 举报

共14个文件

txt：5个

py：4个

pkg-info：2个

资源推荐

资源详情

资源评论

收起资源包目录

pyfasta-0.2.5.tar.gz （14个子文件）

pyfasta-0.2.5

PKG-INFO 3KB

README.txt 2KB

tests

data

three_chrs.fasta 4KB

test_fasta.py 2KB

setup.cfg 148B

setup.py 815B

pyfasta.egg-info

PKG-INFO 3KB

not-zip-safe 1B

SOURCES.txt 302B

entry_points.txt 42B

top_level.txt 8B

dependency_links.txt 1B

pyfasta

__init__.py 2KB

fasta.py 8KB

================================================== pyfasta: pythonic access to fasta sequence files. ================================================== :Author: Brent Pedersen (brentp) :Email: bpederse@gmail.com :License: MIT Implementation ============== Requires Python >= 2.5. Stores a flattened version of the fasta file without spaces or headers. And a pickle of the start, stop (for fseek) locations of each header in the fasta file for internal use. Now supports the numpy array interface. Usage ===== :: >>> from pyfasta import Fasta >>> f = Fasta('tests/data/three_chrs.fasta') >>> sorted(f.keys()) ['chr1', 'chr2', 'chr3'] >>> f['chr1'] FastaRecord('tests/data/three_chrs.fasta.flat', 0..80) Slicing ------- :: >>> f['chr1'][:10] 'ACTGACTGAC' # get the 1st basepair in every codon (it's python yo) >>> f['chr1'][::3] 'AGTCAGTCAGTCAGTCAGTCAGTCAGT' # the index stores the start and stop of each header from the fasta file. # (you should never need this) >>> f.index {'chr3': (160, 3760), 'chr2': (80, 160), 'chr1': (0, 80)} # can query by a 'feature' dictionary >>> f.sequence({'chr': 'chr1', 'start': 2, 'stop': 9}) 'CTGACTGA' # with reverse complement for - strand >>> f.sequence({'chr': 'chr1', 'start': 2, 'stop': 9, 'strand': '-'}) 'TCAGTCAG' --------------------- Numpy Array Interface --------------------- :: # FastaRecords support the numpy array interface. >>> import numpy as np >>> a = np.array(f['chr2']) >>> a.shape[0] == len(f['chr2']) True >>> a[10:14] array(['A', 'A', 'A', 'A'], dtype='|S1') # cleanup (though for real use these will remain for faster access) >>> import os >>> os.unlink('tests/data/three_chrs.fasta.gdx') >>> os.unlink('tests/data/three_chrs.fasta.flat')

评论收藏

内容反馈

版权申诉