<!-- mdformat off(b/169948621#comment2) -->
# Micro Speech Example
This example shows how to run inference using TensorFlow Lite Micro (TFLM)
on two models for wake-word recognition.
The first model is an audio preprocessor that generates spectrogram data
from raw audio samples.
The second is the Micro Speech model, a model of less than 20 kB
that can recognize two keywords, "yes" and "no", from speech data.
The Micro Speech model takes the spectrogram data as input and produces
category probabilities.
## Table of contents
- [Audio Preprocessor](#audio-preprocessor)
- [Micro Speech Model Architecture](#micro-speech-model-architecture)
- [Run the C++ tests on a development machine](#run-the-c-tests-on-a-development-machine)
- [Run the evaluate.py script on a development machine](#run-the-evaluatepy-script-on-a-development-machine)
- [Run the evaluate_test.py script on a development machine](#run-the-evaluate_testpy-script-on-a-development-machine)
- [Converting models or audio samples to C++](#converting-models-or-audio-samples-to-c)
- [Train your own model](#train-your-own-model)
## Audio Preprocessor
The Audio Preprocessor model converts raw audio samples into a spectrographic feature.
Audio samples are input to the model in windowed frames, each window overlapping
the previous. When sufficient features have been accumulated, those features can
be provided as input to the Micro Speech model.
This model replicates the legacy preprocessing used during training
of the Micro Speech model. For additional information on audio preprocessing during training,
please refer to the [training README](train/README.md#preprocessing-speech-input) documentation.
Audio preprocessor models with `int8` and `float32` output, ready for use
with the Micro Speech model, are available in the [models](models/) directory.
These models expect the audio input to conform to:
* 30ms window frame
* 20ms window stride
* 16kHz sample rate
* 16-bit signed PCM data
* single channel (mono)
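As a quick sanity check of these constraints, here is a minimal sketch of the
window and stride arithmetic (the constants simply restate the values above):

```python
SAMPLE_RATE_HZ = 16000
WINDOW_MS = 30
STRIDE_MS = 20

# 30ms at 16kHz -> 480 samples per window, matching the preprocessor's (1, 480) input.
window_samples = SAMPLE_RATE_HZ * WINDOW_MS // 1000    # 480
# 20ms at 16kHz -> a new window starts every 320 samples.
stride_samples = SAMPLE_RATE_HZ * STRIDE_MS // 1000    # 320
```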
### Model Architecture
This model consists primarily of [Signal Library](https://github.com/tensorflow/tflite-micro/blob/main/python/tflite_micro/signal) operations.
The library is a set of Python methods and bindings to `C++` library code.
To allow for use with the `TFLM MicroInterpreter`, a set of [Signal Library kernels](https://github.com/tensorflow/tflite-micro/blob/main/signal/micro/kernels)
is also provided.
The [audio_preprocessor.py](audio_preprocessor.py) script provides a complete example
of how to use the `Signal Library` within your own Python application. This script
has support for TensorFlow eager-execution mode, graph-execution mode, and
`TFLM MicroInterpreter` inference operations.
[<img src="images/audio_preprocessor_int8.png" width="900" alt="model architecture"/>](images/audio_preprocessor_int8.png)
*This image was derived from visualizing the `models/audio_preprocessor_int8.tflite` file in
[Netron](https://github.com/lutzroeder/netron)*
The steps performed by the model are as follows:
1) Audio frame input with shape `(1, 480)`
1) Apply `Hann Window` smoothing using `SignalWindow`
1) Reshape tensor to match the input of `SignalFftAutoScale`
1) Rescale tensor data using `SignalFftAutoScale` and calculate one of the input
parameters to `SignalFilterBankSquareRoot`
1) Compute FFT using `SignalRfft`
1) Compute power spectrum using `SignalEnergy`. The tensor data is only updated
for elements between `[start_index, end_index)`.
1) Zero-fill the tensor data for elements outside of `[start_index, end_index)`
using the `Cast`, `StridedSlice`, and `Concatenation` operations
1) Compress the power spectrum tensor data into just 40 channels (frequency bands)
using `SignalFilterBank`
1) Scale down the tensor data using `SignalFilterBankSquareRoot`
1) Apply noise reduction using `SignalFilterBankSpectralSubtraction`
1) Apply gain control using `SignalPCAN`
1) Scale down the tensor data using `SignalFilterBankLog`
1) The remaining operations perform additional legacy down-scaling and convert
the tensor data to `int8`
1) Model output has shape `(40,)`
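The listing below is a conceptual NumPy sketch of this per-frame pipeline,
intended only to illustrate the data flow. It is not the exact legacy/Signal
Library arithmetic: the filter bank, noise reduction, PCAN, and scaling steps
are simplified or omitted.

```python
import numpy as np

def conceptual_feature(frame: np.ndarray, num_channels: int = 40) -> np.ndarray:
    """Conceptual sketch of the per-frame pipeline (not the exact Signal Library math)."""
    # Steps 1-2: smooth the 480-sample frame with a Hann window.
    windowed = frame * np.hanning(len(frame))
    # Step 5: real FFT of the windowed frame.
    spectrum = np.fft.rfft(windowed)
    # Step 6: power spectrum (energy per FFT bin).
    power = np.abs(spectrum) ** 2
    # Step 8: compress the FFT bins into 40 channels. Here this is simple
    # averaging over bin ranges; the model uses SignalFilterBank instead.
    edges = np.linspace(0, len(power), num_channels + 1, dtype=int)
    channels = np.array([power[a:b].mean() for a, b in zip(edges[:-1], edges[1:])])
    # Steps 9-13: square-root, noise reduction, PCAN, and log scaling are
    # replaced here by a plain log for illustration only.
    return np.log(channels + 1e-6)

feature = conceptual_feature(np.zeros(480, dtype=np.float32))
print(feature.shape)  # (40,), matching the model output shape
```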
### The `FeatureParams` Python Class
The `FeatureParams` class is located within the [audio_preprocessor.py](audio_preprocessor.py#L260)
script. This class allows for custom configuration of the `AudioPreprocessor` class.
Parameters such as sample rate, window size, window stride, number of output channels,
and many more can be configured. The parameters to be changed must be set during
class instantiation, and are frozen thereafter. The defaults for `FeatureParams`
match those of the legacy audio preprocessing used during Micro Speech model training.
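A minimal sketch of instantiation follows, assuming it is run from this example
directory so `audio_preprocessor.py` imports directly. The keyword names shown
are illustrative assumptions; the exact field names are defined in
`FeatureParams` itself.

```python
import audio_preprocessor

# Defaults replicate the legacy Micro Speech training preprocessing.
default_params = audio_preprocessor.FeatureParams()

# Overrides must be supplied at instantiation time; the object is frozen afterwards.
# NOTE: the keyword names below are illustrative assumptions -- check the
# FeatureParams definition in audio_preprocessor.py for the exact field names.
custom_params = audio_preprocessor.FeatureParams(
    sample_rate=16000,       # hypothetical field name
    window_size_ms=30,       # hypothetical field name
    window_stride_ms=20,     # hypothetical field name
    num_channels=40,         # hypothetical field name
)
```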
### The `AudioPreprocessor` Python Class
The `AudioPreprocessor` class in the [audio_preprocessor.py](audio_preprocessor.py#L338)
script provides easy-to-use convenience methods for creating
and using an audio preprocessing model. This class is configured through use of
a `FeatureParams` object, allowing some flexibility in how the audio preprocessing
model works.
A short summary of the available methods and properties:
* `load_samples`: load audio samples from a `WAV` format file and prepare
the samples for use by other `AudioPreprocessor` methods
* `samples`: tensor containing previously loaded audio samples
* `params`: the `FeatureParams` object the class was instantiated with
* `generate_feature`: generate a single feature using TensorFlow eager-execution
* `generate_feature_using_graph`: generate a single feature using TensorFlow graph-execution
* `generate_feature_using_tflm`: generate a single feature using the `TFLM MicroInterpreter`
* `reset_tflm`: reset the internal state of the `TFLM MicroInterpreter` and the
`Signal Library` operations
* `generate_tflite_file`: create a `.tflite` format file for the preprocessor model
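A minimal usage sketch follows, assuming it is run from this example directory
so `audio_preprocessor.py` imports directly. The file paths and exact call
signatures are assumptions; check the script for the authoritative API.

```python
import numpy as np
import audio_preprocessor

# Configure with the legacy training defaults.
params = audio_preprocessor.FeatureParams()
pre = audio_preprocessor.AudioPreprocessor(params)

# Load a 16-bit, 16kHz, mono WAV sample (example path from testdata/).
pre.load_samples('testdata/no_30ms.wav')

# Generate a single 40-channel feature with two of the execution modes and
# cross-check them (call signatures are assumptions -- see audio_preprocessor.py).
feature_eager = pre.generate_feature(pre.samples)
feature_tflm = pre.generate_feature_using_tflm(pre.samples)
np.testing.assert_allclose(np.asarray(feature_eager), np.asarray(feature_tflm))

# Reset TFLM/Signal Library state before processing an unrelated audio stream.
pre.reset_tflm()

# Export the preprocessor as a .tflite file (output path is an assumption).
pre.generate_tflite_file('audio_preprocessor_int8.tflite')
```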
### Run the audio_preprocessor.py script on a development machine
The [audio_preprocessor.py](audio_preprocessor.py#L532) script generates a `.tflite`
file for the preprocessing model, ready for use with the Micro Speech model.
To generate a `.tflite` model file with `int8` output:
```bash
bazel build tensorflow/lite/micro/examples/micro_speech:audio_preprocessor
bazel-bin/tensorflow/lite/micro/examples/micro_speech/audio_preprocessor --output_type=int8
```
To generate a `.tflite` model file with `float32` output:
```bash
bazel build tensorflow/lite/micro/examples/micro_speech:audio_preprocessor
bazel-bin/tensorflow/lite/micro/examples/micro_speech/audio_preprocessor --output_type=float32
```
### Run the audio_preprocessor_test.py script on a development machine
The [audio_preprocessor_test.py](audio_preprocessor_test.py) script performs
several tests to verify that inference produces correct results across all execution modes.
The tests are:
* cross-check inference results between eager, graph, and `TFLM MicroInterpreter`
execution modes
* check the `yes` and `no` 30ms samples in the [testdata](testdata/) directory for
correct generation of the feature tensor
* compare the preprocessor `int8` model against the same model in the [models](models/) directory
* compare the preprocessor `float32` model against the same model in the [models](models/) directory
```bash
bazel build tensorflow/lite/micro/examples/micro_speech:audio_preprocessor_test
bazel-bin/tensorflow/lite/micro/examples/micro_speech/audio_preprocessor_test
```
## Micro Speech Model Architecture
This is a simple model consisting of a Convolutional 2D layer, a Fully Connected
(or MatMul) layer (output: logits), and a Softmax layer
(output: probabilities), as shown below. Refer to the [`tiny_conv`](https://github.com/tensorflow/tensorflow/blob/master/tensorflow/examples/speech_commands/models.py#L673)
model architecture. The output probabilities correspond to four categories:
`silence`, `unknown`, `yes`, `no`.
The input to the model is 49 spectrographic features, each feature
consisting of 40 channels of data. The features are generated by the Audio
Preprocessor from overlapping 30ms windows of audio, one new feature per 20ms
stride, so 49 features cover approximately one second of audio.
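To make the input layout concrete, here is a minimal sketch assuming a rolling
buffer of the most recent 49 features; the exact tensor shape expected by the
`.tflite` file should be confirmed (for example with Netron) before use.

```python
import numpy as np

NUM_FEATURES = 49   # feature slices per inference (about one second of audio)
NUM_CHANNELS = 40   # channels per feature, produced by the Audio Preprocessor

# Rolling spectrogram that becomes the Micro Speech model input.
spectrogram = np.zeros((NUM_FEATURES, NUM_CHANNELS), dtype=np.int8)

def push_feature(spectrogram: np.ndarray, feature: np.ndarray) -> np.ndarray:
    """Drop the oldest feature and append the newest (one new feature per 20ms stride)."""
    return np.concatenate([spectrogram[1:], feature[np.newaxis, :]], axis=0)

# e.g. the latest AudioPreprocessor output (placeholder values here)
new_feature = np.zeros(NUM_CHANNELS, dtype=np.int8)
spectrogram = push_feature(spectrogram, new_feature)
print(spectrogram.shape)  # (49, 40); reshape to match the model's input tensor
```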