# Paddle HIgh reusability operator library (PHI) Design
Paddle HIgh reusability operator library (PHI), also called the 'functional operator library', supports implementing new operator kernels based on existing operator kernel functions and the 'Kernel Primitives API (KPS)', and supports plug-in access for new hardware and new acceleration libraries.

In order to solve the problems of the original operator library of the Paddle Fluid Framework, unclear operator interfaces, high cost of operator reuse, and poor scheduling performance, we refactored the operator library of the Paddle Framework and designed a flexible and efficient functional paradigm.

The PHI operator library can implement new operators by combining calls to functional operator interfaces, which greatly reduces the development cost of native operators and custom operators.
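To make this functional style concrete, here is a minimal sketch of a new kernel built by composing an existing kernel with plain function calls. The names and signatures (`AddKernel`, `DenseTensor`, the `Context` template parameter) follow PHI's conventions but are simplified assumptions, not exact library code.

```cpp
#include "paddle/phi/core/dense_tensor.h"

namespace phi {

// An existing functional kernel (declaration only; signature assumed).
template <typename T, typename Context>
void AddKernel(const Context& dev_ctx,
               const DenseTensor& x,
               const DenseTensor& y,
               DenseTensor* out);

// A hypothetical "x + y + z" kernel implemented purely by reusing AddKernel.
template <typename T, typename Context>
void Add3Kernel(const Context& dev_ctx,
                const DenseTensor& x,
                const DenseTensor& y,
                const DenseTensor& z,
                DenseTensor* out) {
  DenseTensor tmp;
  AddKernel<T, Context>(dev_ctx, x, y, &tmp);   // reuse is an ordinary call
  AddKernel<T, Context>(dev_ctx, tmp, z, out);  // no ExecutionContext needed
}

}  // namespace phi
```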
## 1. Background
> Introduce the problems to be solved in designing and building the PHI operator library
The PHI operator library project was initially launched to support the refactoring of the Paddle dynamic graph architecture, to reduce its scheduling overhead, and to improve the reusability of OpKernel development. However, we subsequently decided to take this opportunity to establish an operator library that can be used in both training and inference scenarios (including server-side and mobile-side scenarios) and that reduces the long-term cost of infrastructure development and operator maintenance in the Paddle ecosystem, so we expanded the target scope of the project.

Specifically, the PHI operator library project is expected to solve the following problems of Paddle.
### 1.1 Poor reusability between Op & OpKernel
Before version 2.3, the reusability between Operators (Ops) in Paddle was relatively poor. Only in a few backward Ops were simple Ops reused, by calling `SetType` in the `GradOpMaker` implementation. In most cases where an existing Op implementation could have been reused, the code was instead duplicated by copy-paste.
The root cause of poor reusability is the inflexibility of the original Op architecture design:
1. When an Op reuses the `OpKernel::Compute` method of another Op, an `ExecutionContext` needs to be constructed first, so the reuse method is relatively cumbersome (see the sketch after this list)
> It would be much more convenient if the Kernel could be called directly as a function
2. Due to the overhead introduced by constructing additional data structures and scheduling an independent Op, from the perspective of computing performance it is better to copy the calculation code directly when reusing an Op. This led us to gradually abandon the earlier principle that backward Ops reuse forward Ops and to implement a separate Kernel for each backward Op, so Paddle internally maintains a large number of backward OpKernel implementations
> Only when the overhead of reusing Ops is small enough can reusing existing Ops to implement new Ops be widely adopted
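To illustrate the contrast in the first point, here is a simplified side-by-side sketch. Both the fluid-style class and the functional signature are abbreviated, illustrative assumptions rather than exact framework code.

```cpp
// Legacy fluid style: the kernel lives behind OpKernel::Compute, and all
// inputs, outputs, and attributes are fetched by name from an
// ExecutionContext. Reusing it from another Op means assembling a full
// ExecutionContext first.
class ReluOpKernel : public framework::OpKernel<float> {
 public:
  void Compute(const framework::ExecutionContext& ctx) const override {
    auto* x = ctx.Input<framework::Tensor>("X");
    auto* out = ctx.Output<framework::Tensor>("Out");
    // ... elementwise max(x, 0) ...
  }
};

// PHI functional style: the kernel is a plain function, so another kernel
// (or the dynamic graph scheduler) can invoke it with an ordinary call.
template <typename T, typename Context>
void ReluKernel(const Context& dev_ctx, const DenseTensor& x, DenseTensor* out);
```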
### 1.2 Conciseness and fine-grained execution scheduling
#### 1.2.1 Dynamic graph
After the release of Paddle 2.0, we received extensive feedback from internal and external users that the performance of the dynamic graph was several times lower than that of competing products when executing small models on CPU.

The main reason for this problem is that the C++ execution path of the Paddle dynamic graph is relatively long and its scheduling overhead is relatively heavy. This stems from the early design of the dynamic graph, which was made compatible with the static graph and inherited many of the object-construction processes of static-graph Ops.

Therefore, the dynamic graph needs to be upgraded to a function-based scheduling architecture that abandons the original complex Op architecture; this in turn depends on rewriting OpKernels in a functional style, which would solve the problem.
#### 1.2.2 Static graph + IR
Our current static graph mode is not "static" enough. At present, it still performs a lot of dynamic selection at runtime, for example, selecting the OpKernel at runtime and judging whether to copy data across devices at runtime. These decisions can in fact be made while the static graph network is being compiled: the execution process can be fixed as a series of OpKernel executions with no dynamic judgment or selection, thereby further improving execution efficiency.

This in turn relies on fine-grained OpKernels themselves: decoupling the existing large, complex OpKernels into small Kernels for specific scenarios and specific devices.
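As a rough illustration of this direction, the sketch below registers the same small kernel separately per backend, in the spirit of PHI's kernel registration macros; the exact macro name and arguments are assumptions for illustration. With one concrete kernel per (op, backend, dtype) combination, the static graph compiler can bind a kernel ahead of time instead of selecting one at runtime.

```cpp
// Hypothetical registrations: one small, single-purpose kernel function,
// registered once per backend/dtype so scheduling needs no runtime lookup.
PD_REGISTER_KERNEL(scale, CPU, ALL_LAYOUT, phi::ScaleKernel, float, double) {}
PD_REGISTER_KERNEL(scale, GPU, ALL_LAYOUT, phi::ScaleKernel, float, double) {}
```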
### 1.3 Ease-of-use improvement requirements for custom operators
The new custom C++ external-operator paradigm released in early 2021 is relatively intuitive at the level of interface and function writing. However, because we lacked C++ APIs for basic operations, implementing the actual computation logic of a custom Op, such as basic addition, subtraction, multiplication, division, and matrix operations, still required re-implementation from scratch; Paddle's existing, optimized basic operations could not be reused, so development costs remained relatively high. In order to reuse the basic operations inside Paddle, the Op paradigm must be upgraded to a functional paradigm and a corresponding C++ API system must be built.
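Below is a minimal sketch of what such reuse could look like once a C++ API system for basic operations exists. `PD_BUILD_OP` follows the published custom-operator paradigm, and `paddle::experimental::matmul`/`paddle::experimental::add` follow the PHI C++ API naming; treat the exact header path and signatures as assumptions.

```cpp
#include "paddle/extension.h"  // custom-operator header (path assumed)

// A custom "linear" op that reuses Paddle's optimized basic operations
// through the C++ API instead of re-implementing matmul and add by hand.
std::vector<paddle::Tensor> MyLinearForward(const paddle::Tensor& x,
                                            const paddle::Tensor& w,
                                            const paddle::Tensor& b) {
  auto y = paddle::experimental::matmul(x, w, /*transpose_x=*/false,
                                        /*transpose_y=*/false);
  return {paddle::experimental::add(y, b)};
}

PD_BUILD_OP(my_linear)
    .Inputs({"X", "W", "B"})
    .Outputs({"Out"})
    .SetKernelFn(PD_KERNEL(MyLinearForward));
```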
### 1.4 Build an integrated training and inference operator library to reduce the maintenance cost of inference operators
For a long time, because Paddle and Paddle-Lite operators were maintained separately, a new Paddle operator that Paddle-Lite needed had to be manually re-implemented in Paddle-Lite. Moreover, when a Paddle operator was upgraded, Paddle-Lite did not perceive the change in time, which could directly cause bugs when Lite executed the inference model. This introduced high maintenance costs; only a unified operator library can solve this problem in the long run.

Therefore, this functional operator library will be jointly constructed by the training and inference teams, and will serve as an independent compilation component and underlying infrastructure (not yet independently split out), capable of serving the training, server-side inference, and mobile-side inference execution systems at the same time.
### 1.5 Adaptation to the new inference Runtime design 'infrt'
The inference team has designed a new runtime, `infrt`, which is expected to unify the execution systems of Paddle-Inference and Paddle-Lite. `infrt` needs to directly call the operators in the jointly built PHI operator library, so adaptation to `infrt` needs to be considered in the design. (Currently the `infrt` project is temporarily on hold.)
### 1.6 Op and Kernel parameter normalization
The Python 2.0 API project in 2020 standardized the argument lists of Paddle's Python-side APIs, making them concise, easy to use, and standard. However, due to cost considerations, the argument lists at the Op level were not standardized, so many operators developed early on differ greatly in arguments from the corresponding Python API. For example, the `conv` Python API has only 8 arguments, while the corresponding C++ `Conv` Op has 29 arguments. API and Op are essentially the same layer of concepts, both are descriptions of an operation, and their arguments should be consistent. To mitigate this problem, 'the operator definition enhancement project' was launched, and the declarations 'AsExtra' and 'AsQuant' were added to some unnecessary arguments, but this did not fundamentally solve the problem, which is what the construction of the PHI operator library hopes to do.

We hope to achieve consistent argument lists across the three layers of Python API -> Op (C++ API) -> Kernel API, so that the overall structure is clear and the reuse relationship between the layers is clear enough. Maintaining one set of official Python API documents can then basically satisfy the common reference requirements of all three API tiers, with no need to maintain additional sets of documents.
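As a concrete, simplified illustration of this three-layer consistency, consider a `scale` operation; the signatures below follow PHI's general conventions but are abbreviated assumptions, not exact library declarations.

```cpp
// Python API (shown as a comment for reference; signature assumed):
//   paddle.scale(x, scale=1.0, bias=0.0, bias_after_scale=True)

// C++ API: the same argument list as the Python API.
Tensor scale(const Tensor& x, const Scalar& scale, float bias,
             bool bias_after_scale);

// Kernel API: the same arguments again, plus device context and output.
template <typename T, typename Context>
void ScaleKernel(const Context& dev_ctx,
                 const DenseTensor& x,
                 const Scalar& scale,
                 float bias,
                 bool bias_after_scale,
                 DenseTensor* out);
```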