作者:AI拉呱

Python:数据预处理加速办法

预处理加速

  • 核心就是:多进程

例子

import glob
import os
import cv2


### Loop through all jpg files in the current folder 
### Resize each one to size 600x600
for image_filename in glob.glob("*.jpg"):
  ### Read in the image data
  img = cv2.imread(image_filename)
  
  ### Resize the image
  img = cv2.resize(img, (600, 600)) 

改进方式:多cpu同时运行

import glob
import os
import cv2
import concurrent.futures


def load_and_resize(image_filename):
  ### Read in the image data
  img = cv2.imread(image_filename)
  
  ### Resize the image
  img = cv2.resize(img, (600, 600)) 
  

### Create a pool of processes. By default, one is created for each CPU in your
lock