## About
**SRU** is a recurrent unit that can run over 10 times faster than cuDNN LSTM, with no loss of accuracy on the many tasks we tested.
<p align="center">
<img width=620 src="https://raw.githubusercontent.com/taolei87/sru/master/imgs/speed.png"><br>
<i>Average processing time of LSTM, conv2d and SRU, tested on GTX 1070</i><br>
</p>
For example, the figure above shows the processing time of a single mini-batch of 32 samples. SRU achieves a 10 to 16 times speed-up over cuDNN LSTM, and runs as fast as (or faster than) word-level convolution using conv2d.
The [paper](https://arxiv.org/abs/1709.02755) has multiple versions; please check the latest one.
#### Reference:
Simple Recurrent Units for Highly Parallelizable Recurrence
```
@inproceedings{lei2018sru,
  title={Simple Recurrent Units for Highly Parallelizable Recurrence},
  author={Tao Lei and Yu Zhang and Sida I. Wang and Hui Dai and Yoav Artzi},
  booktitle={Empirical Methods in Natural Language Processing (EMNLP)},
  year={2018}
}
```
<br>
## Requirements
- [PyTorch](http://pytorch.org/) >= 1.0.0 recommended; see the [PyTorch installation details](docs/pytorch_installation.md)
- [ninja](https://ninja-build.org/)
Install requirements via `pip install -r requirements.txt`.
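If you want to sanity-check the environment first, a small script along these lines can help (a minimal sketch, not part of the repo; the ninja requirement reflects that the CUDA kernel is compiled when the package is loaded):
```python
import shutil
import torch

# SRU compiles its CUDA kernel with ninja the first time it is loaded,
# so both a recent PyTorch and the ninja binary should be visible.
print("PyTorch version:", torch.__version__)               # >= 1.0.0 recommended
print("CUDA available: ", torch.cuda.is_available())
print("ninja on PATH:  ", shutil.which("ninja") is not None)
```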
<br>
## Installation
#### From source:
SRU can be installed as a regular package via `python setup.py install` or `pip install .`.
#### From PyPi:
`pip install sru`
#### Directly use the source without installation:
Make sure this repo and the CUDA library can be found by the system, e.g.
```
export PYTHONPATH=path_to_repo/sru
export LD_LIBRARY_PATH=/usr/local/cuda/lib64
```
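Whichever installation route you use, a quick import is an easy smoke test (a minimal sketch; the `__version__` attribute is assumed to exist and is read defensively):
```python
# Succeeds only if the sru package is importable from the current environment.
import sru
print(getattr(sru, "__version__", "version attribute not found"))
```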
<br>
## Examples
The usage of SRU is similar to `nn.LSTM`, although SRU typically requires more stacked layers than LSTM. We recommend starting with 2 layers and using more if necessary (see our report for more experimental details).
```python
import torch
from sru import SRU, SRUCell

# input has length 20, batch size 32 and dimension 128
x = torch.randn(20, 32, 128).cuda()
input_size, hidden_size = 128, 128
rnn = SRU(input_size, hidden_size,
          num_layers=2,          # number of stacked RNN layers
          dropout=0.0,           # dropout applied between RNN layers
          bidirectional=False,   # bidirectional RNN
          layer_norm=False,      # apply layer normalization on the output of each layer
          highway_bias=0,        # initial bias of the highway gate (<= 0)
          rescale=True,          # whether to use scaling correction
)
rnn.cuda()
output_states, c_states = rnn(x) # forward pass
# output_states is (length, batch size, number of directions * hidden size)
# c_states is (layers, batch size, number of directions * hidden size)
```
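Like `nn.LSTM`, the module can also take an initial state. The sketch below assumes the forward pass accepts an optional `c0` (as in recent releases) and shows how `bidirectional=True` changes the output shapes:
```python
# Optional initial state, shaped (layers, batch size, directions * hidden size).
c0 = torch.zeros(2, 32, 128).cuda()
output_states, c_states = rnn(x, c0)

# A bidirectional SRU doubles the last dimension of both outputs.
birnn = SRU(input_size, hidden_size, num_layers=2, bidirectional=True).cuda()
out, c = birnn(x)
print(out.shape)   # torch.Size([20, 32, 256])
print(c.shape)     # torch.Size([2, 32, 256])
```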
<br>
## Contributing
Please read and follow the [guidelines](CONTRIBUTING.md).
### Other Implementations
[@musyoku](https://github.com/musyoku) has a very nice [SRU implementation](https://github.com/musyoku/chainer-sru) in Chainer.
[@adrianbg](https://github.com/adrianbg) implemented the first [CPU version](https://github.com/taolei87/sru/pull/42).
<br>