# Flax: A neural network library for JAX designed for flexibility
**NOTE**: This is alpha software, but we encourage trying it out.
Changes will come to the API, but we'll use deprecation warnings when we can, and
keep track of them in our [Changelog](CHANGELOG.md).
A growing community of researchers at Google is happily using
Flax daily for their research, and now we'd like to extend that support to the
open source community. GitHub issues are encouraged for open conversation, but
in case you need to reach us directly, we're at flax-dev@google.com.
## Quickstart
**⟶ [Full documentation and API reference](https://flax.readthedocs.io/)**
**⟶ [Annotated full end-to-end MNIST example](docs/annotated_mnist.md)**
**⟶ [The Flax Guide](https://flax.readthedocs.io/en/latest/notebooks/flax_intro.html)** -- a guided walkthrough of the parts of Flax
## Background: JAX
[JAX](https://github.com/google/jax) is NumPy + autodiff + GPU/TPU.
It allows for fast scientific computing and machine learning
with the normal NumPy API
(plus additional APIs for special accelerator ops when needed).
JAX comes with powerful primitives, which you can compose arbitrarily:
* Autodiff (`jax.grad`): Efficient any-order gradients w.r.t. any variables
* JIT compilation (`jax.jit`): Trace any function ⟶ fused accelerator ops
* Vectorization (`jax.vmap`): Automatically batch code written for individual samples
* Parallelization (`jax.pmap`): Automatically parallelize code across multiple accelerators (including across hosts, e.g. for large TPUs)
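For example, here is a toy sketch (the function and shapes are made up purely for illustration) of how these primitives compose:
```py
import jax
import jax.numpy as jnp

# A toy scalar-valued function: mean squared error of a linear model.
def loss(w, x, y):
  return jnp.mean((jnp.dot(x, w) - y) ** 2)

grad_loss = jax.grad(loss)                               # gradient w.r.t. the first argument, w
fast_grad_loss = jax.jit(grad_loss)                      # trace and compile into fused accelerator ops
per_example_loss = jax.vmap(loss, in_axes=(None, 0, 0))  # batch the per-example computation

w = jnp.zeros((3,))
x = jnp.ones((8, 3))
y = jnp.ones((8,))
print(fast_grad_loss(w, x, y))    # shape (3,)
print(per_example_loss(w, x, y))  # shape (8,)
```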
## What is Flax?
Flax is a high-performance neural network library for
JAX that is **designed for flexibility**:
Try new forms of training by forking an example and modifying the training
loop, not by adding features to the framework.
Flax comes with everything you need to start your research, including:
* A module abstraction (`flax.nn.Module`) for parameterized functions such as neural network layers.
* Common layers (`flax.nn`): Dense, Conv, {Batch|Layer|Group} Norm, Attention, Pooling, {LSTM|GRU} Cell, Dropout
* Optimizers (`flax.optim`): SGD, Momentum, Adam, LARS
* Utilities and patterns: replicated training, serialization and checkpointing, metrics, prefetching on device
* Educational examples that work out of the box: MNIST, LSTM seq2seq, Graph Neural Networks, Sequence Tagging
* HOWTO guides -- diffs that add functionality to educational base examples
* Fast, tuned large-scale end-to-end examples: CIFAR10, ResNet ImageNet, Transformer LM1b
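The optimizer entry above, for instance, follows a create / `apply_gradient` pattern. Here is a minimal sketch of it (the tiny model, data, and learning rate are made up for illustration, and Modules are explained below under "Flax Modules"; the full MNIST example further down shows the same pattern end to end):
```py
import jax
import jax.numpy as jnp
import flax

# A hypothetical one-layer regression model, defined with the nn.module decorator.
@flax.nn.module
def TinyModel(x):
  return flax.nn.Dense(x, features=1)

x = jnp.ones((4, 3))
y = jnp.zeros((4, 1))

_, params = TinyModel.init(jax.random.PRNGKey(0), x)
model = flax.nn.Model(TinyModel, params)

# Wrap the model in an optimizer. Updates are functional:
# apply_gradient returns a new optimizer holding the updated model.
optimizer = flax.optim.Adam(learning_rate=1e-3).create(model)

def loss_fn(model):
  return jnp.mean((model(x) - y) ** 2)

grad = jax.grad(loss_fn)(optimizer.target)
optimizer = optimizer.apply_gradient(grad)
```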
### An annotated MNIST example
See [docs/annotated_mnist.md](docs/annotated_mnist.md) for an MNIST
example with detailed annotations for each code block.
### Flax Modules
The core of Flax is the Module abstraction. Modules allow you to write parameterized functions just as if you were writing a normal NumPy function with JAX. The Module API allows you to declare parameters and use them directly with the JAX APIs.
Modules are the one part of Flax with "magic" -- the magic is constrained, and enables a very ergonomic style,
where modules are defined in a single function with minimal boilerplate.
A few things to know about Modules:
1. Create a new module by subclassing `flax.nn.Module` and implementing the `apply` method.
2. Within `apply`, call `self.param(name, shape, init_func)` to register a new parameter and return its initial value.
3. Apply submodules by calling `MySubModule(...args...)` within `MyModule.apply`. Parameters of `MySubModule` are stored
as a dictionary under the parameters of `MyModule`. **NOTE:** this returns the *output* of `MySubModule`, not an instance. To get access to an instance of `MySubModule` for re-use, use [`Module.partial`](https://flax.readthedocs.io/en/latest/flax.nn.html#flax.nn.Module.partial) or [`Module.shared`](https://flax.readthedocs.io/en/latest/notebooks/flax_intro.html#Parameter-sharing).
4. `MyModule.init(rng, ...)` is a pure function that calls `apply` in "init mode" and returns the module output along with a nested Python dict of initialized parameter values.
5. `MyModule.call(params, ...)` is a pure function that calls `apply` in "call mode" and returns the output of the module.
For example, you can define a learned linear transformation as follows:
```py
from flax import nn
import jax.numpy as jnp
class Linear(nn.Module):
  def apply(self, x, num_features, kernel_init_fn):
    input_features = x.shape[-1]
    # Register a kernel parameter of shape (input_features, num_features).
    W = self.param('W', (input_features, num_features), kernel_init_fn)
    return jnp.dot(x, W)
```
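As a minimal sketch of points 4 and 5 above, reusing the `Linear` module just defined (the input shape and the `jax.nn.initializers.lecun_normal()` initializer are arbitrary choices for illustration):
```py
import jax
import jax.numpy as jnp

x = jnp.ones((4, 3))  # a batch of 4 inputs with 3 features each

# "Init mode": returns the module output and the initialized parameters.
y, params = Linear.init(
    jax.random.PRNGKey(0), x,
    num_features=8, kernel_init_fn=jax.nn.initializers.lecun_normal())

# "Call mode": a pure function from parameters and inputs to outputs.
y = Linear.call(
    params, x,
    num_features=8, kernel_init_fn=jax.nn.initializers.lecun_normal())
```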
You can also use `nn.module` as a function decorator to create a new module, as
long as you don't need access to `self` for creating parameters directly:
```py
@nn.module
def DenseLayer(x, features):
  x = nn.Dense(x, features)
  x = nn.relu(x)
  return x
```
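A decorator-defined module composes like any other. As a small sketch (assuming the imports from the snippets above), each submodule call inside the body gets its own parameters, nested under the enclosing module's parameters:
```py
@nn.module
def MLP(x):
  # Each DenseLayer call creates its own submodule with its own parameters.
  x = DenseLayer(x, features=128)
  x = DenseLayer(x, features=10)
  return x

y, params = MLP.init(jax.random.PRNGKey(0), jnp.ones((4, 32)))
```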
Read more about Flax Modules and the other parts of the Flax API in the [Flax Guide](https://flax.readthedocs.io/en/latest/notebooks/flax_intro.html#Flax-Modules).
## CPU-only installation
You will need Python 3.5 or later.
Now install `flax` from GitHub:
```
> pip install git+https://github.com/google-research/flax.git@prerelease
```
## GPU-accelerated installation
First install `jaxlib`; please follow the instructions in the
[JAX readme](https://github.com/google/jax/blob/master/README.md).
If they are not already installed, you will need to install
[CUDA](https://developer.nvidia.com/cuda-downloads) and
[CuDNN](https://developer.nvidia.com/cudnn) runtimes.
Now install `flax` from GitHub:
```
> pip install git+https://github.com/google-research/flax.git@prerelease
```
## Full end-to-end MNIST example
```py
import jax
import flax
import numpy as onp
import jax.numpy as jnp
import tensorflow as tf
import tensorflow_datasets as tfds
class CNN(flax.nn.Module):
def apply(self, x):
x = flax.nn.Conv(x, features=32, kernel_size=(3, 3))
x = flax.nn.relu(x)
x = flax.nn.avg_pool(x, window_shape=(2, 2), strides=(2, 2))
x = flax.nn.Conv(x, features=64, kernel_size=(3, 3))
x = flax.nn.relu(x)
x = flax.nn.avg_pool(x, window_shape=(2, 2), strides=(2, 2))
x = x.reshape((x.shape[0], -1))
x = flax.nn.Dense(x, features=256)
x = flax.nn.relu(x)
x = flax.nn.Dense(x, features=10)
x = flax.nn.log_softmax(x)
return x
@jax.vmap
def cross_entropy_loss(logits, label):
return -logits[label]
def compute_metrics(logits, labels):
loss = jnp.mean(cross_entropy_loss(logits, labels))
accuracy = jnp.mean(jnp.argmax(logits, -1) == labels)
return {'loss': loss, 'accuracy': accuracy}
@jax.jit
def train_step(optimizer, batch):
def loss_fn(model):
logits = model(batch['image'])
loss = jnp.mean(cross_entropy_loss(
logits, batch['label']))
return loss
grad = jax.grad(loss_fn)(optimizer.target)
optimizer = optimizer.apply_gradient(grad)
return optimizer
@jax.jit
def eval(model, eval_ds):
logits = model(eval_ds['image'] / 255.0)
return compute_metrics(logits, eval_ds['label'])
def train():
train_ds = tfds.load('mnist', split=tfds.Split.TRAIN)
train_ds = train_ds.map(lambda x: {'image':tf.cast(x['image'], tf.float32),
'label':tf.cast(x['label'], tf.int32)})
train_ds = train_ds.cache().shuffle(1000).batch(128)
test_ds = tfds.as_numpy(tfds.load(
'mnist', split=tfds.Split.TEST, batch_size=-1))
test_ds = {'image': test_ds['image'].astype(jnp.float32),
'label': test_ds['label'].astype(jnp.int32)}
_, initial_params = CNN.init_by_shape(
jax.random.PRNGKey(0),
[((1, 28, 28, 1), jnp.float32)])
  model = flax.nn.Model(CNN, initial_params)
optimizer = flax.optim.Momentum(
learning_rate=0.1, beta=0.9).create(model)
for epoch in range(10):
for batch in tfds.as_numpy(train_ds):
batch['image'] = batch['image'] / 255.0
optimizer = train_step(optimizer, batch)
metrics = eval(optimizer.target, test_ds)
print('eval epoch: %d, loss: %.4f, accuracy: %.2f'
% (epoch+1,
metrics['loss'], metrics['accuracy'] * 100))
```
## More end-to-end examples