# Optical Character Recognition Using DeepLearning
![Er. Harshul Jain, author](https://img.shields.io/badge/Author-Er.%20Harshul%20Jain%20-blue.svg)
Text is everywhere! It is present in PDFs, docs as well as images. There are lots of applications where text data is useful for doing analytics. Such applications include receipts recognition, number plate detection, extracting the latex formulas from the images etc. General Computer Vision can be used for such task but it lacks in accuracy. In order to solve the low accuracy and variance problem, we use the state of the art deep neural networks.
This repository includes:
```
1. A TensorFlow implementation of the CNN+LSTM+CTC model for OCR.
2. supporting scripts to apply the RCNN appraoch for OCR.
```
### Architecture
![Architecture](images/cnn_lstm_Architecture.jpeg)
### Instructions on How to run
Get the repository
```
git clone https://github.com/harshul1610/OCR.git
```
Get the NIST19 dataset
```
mkdir data
wget https://s3.amazonaws.com/nist-srd/SD19/by_class.zip
unzip by_class.zip
mv by_class NIST19
```
Get the Captcha data
```
cd OCR
python2 generate_captcha.py
```
Run the final notebook for training and testing
```
CNN_LSTM_CTC_OCR-captcha.ipynb
```
### LICENSE
MIT
没有合适的资源?快使用搜索试试~ 我知道了~
OCR:使用深度学习进行光学字符识别
共23个文件
ipynb:14个
py:3个
jpeg:1个
需积分: 31 3 下载量 156 浏览量
2021-05-12
14:52:45
上传
评论 1
收藏 182KB ZIP 举报
温馨提示
使用DeepLearning进行光学字符识别 文字无处不在! 它存在于PDF,文档和图像中。 在许多应用程序中,文本数据可用于进行分析。 这样的应用包括收据识别,车牌检测,从图像中提取乳胶配方等。通用计算机视觉可以用于此类任务,但缺乏准确性。 为了解决低精度和方差问题,我们使用了最先进的深度神经网络。 该存储库包括: 1. A TensorFlow implementation of the CNN+LSTM+CTC model for OCR. 2. supporting scripts to apply the RCNN appraoch for OCR. 建筑学 有关如何运行的说明 获取仓库 git clone https://github.com/harshul1610/OCR.git 获取NIST19数据集 mkdir data wget https://s3.amazo
资源详情
资源评论
资源推荐
收起资源包目录
OCR-master.zip (23个子文件)
OCR-master
.ipynb_checkpoints
LSTM_CTC_OCR-captcha-checkpoint.ipynb 31KB
Combine_Images_annotations_data-checkpoint.ipynb 6KB
ocr_classification-checkpoint.ipynb 41KB
LSTM_CTC_OCR-checkpoint.ipynb 37KB
CNN_LSTM_CTC_OCR-captcha-checkpoint.ipynb 30KB
make_annotations-checkpoint.ipynb 36KB
make_pbtxt-checkpoint.ipynb 2KB
ocr_classification.ipynb 41KB
label_cls_name.json 860B
generate_captcha.py 710B
make_pbtxt.ipynb 2KB
images
cnn_lstm_Architecture.jpeg 50KB
LSTM_CTC_OCR-captcha.ipynb 31KB
generate_tfrecord.py 4KB
xml_to_csv.py 1KB
LICENSE 1KB
captcha 14KB
README.md 1KB
make_annotations.ipynb 36KB
CNN_LSTM_CTC_OCR-captcha.ipynb 30KB
LSTM_CTC_OCR.ipynb 37KB
Combine_Images_annotations_data.ipynb 6KB
.gitignore 16B
共 23 条
- 1
Mika.w
- 粉丝: 32
- 资源: 4592
上传资源 快速赚钱
- 我的内容管理 展开
- 我的资源 快来上传第一个资源
- 我的收益 登录查看自己的收益
- 我的积分 登录查看自己的积分
- 我的C币 登录后查看C币余额
- 我的收藏
- 我的下载
- 下载帮助
安全验证
文档复制为VIP权益,开通VIP直接复制
信息提交成功
评论0