Batch Text Recognition in Python (Batch Image OCR via a Text Recognition API)

4 files in total

png: 2

py: 1

docx: 1

Python image recognition

Text recognition

5 stars · rated above 95% of resources · 152 views · uploaded 2021-10-02 12:21:38 · 149KB RAR

批量文字识别.rar (4 files)
    批量文字识别/
        shiyan-docx.py    2KB
        shiyan.docx       37KB
        新建文件夹/
            1.PNG    75KB
            2.PNG    42KB

from aip import AipOcr
import os
import docx

# Constants: Baidu AI OCR credentials as published with this resource.
# Replace them with the credentials of your own application.
APP_ID = '16866256'
AIP_KEY = 'L5K8i7Rxx1Edx3kETL4nltsV'
SECRET_KEY = 'UzcmLlqOVGECVQdIHLrZNhzTBtQRoAgH'

client = AipOcr(APP_ID, AIP_KEY, SECRET_KEY)


def get_file_content(filePath):
    """Read an image file and return its raw bytes."""
    with open(filePath, 'rb') as fp:
        return fp.read()


def all_path(dir_name):
    """Return the full paths of all files under dir_name, including subdirectories."""
    result = []
    for main_dir, sub_dir, file_name_list in os.walk(dir_name):
        print("1:", main_dir)        # current directory
        print("2:", sub_dir)         # subdirectories of the current directory
        print("3:", file_name_list)  # files in the current directory
        for file_name in file_name_list:
            # Join directory and file name into a full path
            a_path = os.path.join(main_dir, file_name)
            result.append(a_path)
    return result


if __name__ == "__main__":
    result = all_path(r'E:/Desktop/新建文件夹')
    print(result)
    # Create an in-memory Word document object
    file = docx.Document()
    for i in result:
        # i is the full path of one image
        image = get_file_content(i)
        # Call general text recognition (high-accuracy version)
        img_info = client.basicAccurate(image)
        # img_info is a dict; 'words_result' is a list of recognized lines.
        # .get avoids a KeyError when the response has no 'words_result'
        # (for example, when the API reports an error).
        lists = img_info.get('words_result', [])
        for ls in lists:
            print(ls['words'])
            # Write each recognized line as a paragraph
            file.add_paragraph(ls['words'])
    # Save the document
    file.save("E:/Desktop/shiyan.docx")
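The script above passes every file found by os.walk to the OCR client and indexes the response directly. A minimal sketch of two optional helpers follows; both are hypothetical additions, not part of the uploaded script. list_image_files keeps only files whose extension is in an assumed set, and extract_words returns the recognized lines from a response dict, assuming a failed call is reported through an "error_code" key rather than an exception.

```python
import os

# Assumed set of extensions; adjust to the formats your OCR service accepts.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg", ".bmp"}


def list_image_files(dir_name):
    """Return sorted full paths of image files under dir_name (recursive)."""
    paths = []
    for main_dir, _sub_dirs, file_names in os.walk(dir_name):
        for file_name in file_names:
            # Compare extensions case-insensitively so "1.PNG" matches ".png".
            if os.path.splitext(file_name)[1].lower() in IMAGE_EXTENSIONS:
                paths.append(os.path.join(main_dir, file_name))
    return sorted(paths)


def extract_words(img_info):
    """Return the recognized text lines from an OCR response dict.

    Assumption: a failed call carries an "error_code" key. In that case an
    empty list is returned instead of raising KeyError.
    """
    if "error_code" in img_info:
        print("OCR failed:", img_info.get("error_msg", img_info["error_code"]))
        return []
    return [item["words"] for item in img_info.get("words_result", [])]
```

In the main loop, list_image_files(...) could stand in for all_path(...), and extract_words(img_info) for the direct lookup of 'words_result', so that a stray non-image file or a failed request does not stop the whole batch.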
