用Python实现OCR识别提取图片文字，操作简单，易上手

程序员CC_

于 2025-06-14 16:25:23 发布

阅读量1.8k

点赞数 29

CC 4.0 BY-SA版权

分类专栏： Python教程 Python入门文章标签： python ocr 开发语言 Python教程文本识别

本文链接：https://blog.csdn.net/2401_85428892/article/details/148654428

Python教程同时被 2 个专栏收录

75 篇文章

订阅专栏

Python入门

29 篇文章

订阅专栏

本文章已经生成可运行项目，

Python实现OCR识别提取图片文字

OCR（Optical Character Recognition，光学字符识别）技术可以从图片中提取文字信息。以下是使用Python实现OCR的几种方法：

方法1：使用Tesseract OCR（推荐）

Tesseract是一个开源的OCR引擎，由Google维护，支持多种语言。

安装步骤

首先安装Tesseract引擎：
- Windows: 下载安装包从 GitHub
- Mac: brew install tesseract
- Linux: sudo apt install tesseract-ocr (Ubuntu/Debian)
安装Python封装库：

pip install pytesseract pillow

示例代码

from PIL import Image
import pytesseract

# 如果Tesseract不在系统PATH中，需要指定路径
# pytesseract.pytesseract.tesseract_cmd = r'C:\Program Files\Tesseract-OCR\tesseract.exe'

def ocr_with_tesseract(image_path, lang='eng'):
    """
    使用Tesseract进行OCR识别
    
    参数:
        image_path: 图片路径
        lang: 语言代码 (默认英语 'eng', 中文 'chi_sim' 或 'chi_tra')
    
    返回:
        识别出的文本
    """
    try:
        # 打开图片文件
        img = Image.open(image_path)
        
        # 使用Tesseract进行OCR识别
        text = pytesseract.image_to_string(img, lang=lang)
        
        return text.strip()
    except Exception as e:
        print(f"OCR处理出错: {e}")
        return ""

# 使用示例
if __name__ == "__main__":
    image_path = "example.png"  # 替换为你的图片路径
    result = ocr_with_tesseract(image_path, lang='chi_sim')  # 中文简体
    print("识别结果:")
    print(result)

方法2：使用EasyOCR（基于深度学习）

EasyOCR是一个基于深度学习的OCR库，支持多种语言，识别准确率较高。

安装步骤

pip install easyocr

示例代码

import easyocr

def ocr_with_easyocr(image_path, langs=['ch_sim', 'en']):
    """
    使用EasyOCR进行OCR识别
    
    参数:
        image_path: 图片路径
        langs: 语言列表 (默认中文简体和英语)
    
    返回:
        识别出的文本
    """
    try:
        # 创建reader对象，指定语言
        reader = easyocr.Reader(langs)
        
        # 读取图片
        result = reader.readtext(image_path)
        
        # 提取文本
        texts = [detection[1] for detection in result]
        
        return '\n'.join(texts)
    except Exception as e:
        print(f"OCR处理出错: {e}")
        return ""

# 使用示例
if __name__ == "__main__":
    image_path = "example.png"  # 替换为你的图片路径
    result = ocr_with_easyocr(image_path)
    print("识别结果:")
    print(result)

方法3：使用PaddleOCR（百度开源OCR）

PaddleOCR是百度开源的OCR工具，支持多种语言和场景。

安装步骤

pip install paddlepaddle paddleocr

示例代码

from paddleocr import PaddleOCR

def ocr_with_paddleocr(image_path, lang='ch'):
    """
    使用PaddleOCR进行OCR识别
    
    参数:
        image_path: 图片路径
        lang: 语言 (默认中文 'ch', 英文 'en')
    
    返回:
        识别出的文本
    """
    try:
        # 初始化OCR
        ocr = PaddleOCR(use_angle_cls=True, lang=lang)
        
        # 读取图片
        result = ocr.ocr(image_path, cls=True)
        
        # 提取文本
        texts = [line[1][0] for line in result[0]]
        
        return '\n'.join(texts)
    except Exception as e:
        print(f"OCR处理出错: {e}")
        return ""

# 使用示例
if __name__ == "__main__":
    image_path = "example.png"  # 替换为你的图片路径
    result = ocr_with_paddleocr(image_path)
    print("识别结果:")
    print(result)

图像预处理提高识别率

对于质量较差的图片，可以先进行预处理：

from PIL import Image, ImageEnhance, ImageFilter
import numpy as np
import cv2

def preprocess_image(image_path):
    """
    图像预处理
    
    参数:
        image_path: 图片路径
    
    返回:
        预处理后的PIL Image对象
    """
    try:
        # 打开图片
        img = Image.open(image_path)
        
        # 转换为灰度图
        img = img.convert('L')
        
        # 增强对比度
        enhancer = ImageEnhance.Contrast(img)
        img = enhancer.enhance(2)
        
        # 锐化
        img = img.filter(ImageFilter.SHARPEN)
        
        # 使用OpenCV进行二值化 (可选)
        # img_cv = np.array(img)
        # _, img_cv = cv2.threshold(img_cv, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
        # img = Image.fromarray(img_cv)
        
        return img
    except Exception as e:
        print(f"图像预处理出错: {e}")
        return None

# 使用预处理后的图片进行OCR
if __name__ == "__main__":
    image_path = "example.png"
    preprocessed_img = preprocess_image(image_path)
    if preprocessed_img:
        result = ocr_with_tesseract(preprocessed_img)
        print("预处理后识别结果:")
        print(result)

注意事项

对于中文识别，需要下载相应的语言包：
- Tesseract: 安装时选择中文包，或下载后放到tessdata目录
- EasyOCR/PaddleOCR: 自动下载所需语言模型
识别准确率受图片质量影响较大，建议：
- 确保文字清晰
- 适当调整对比度和亮度
- 对于复杂背景，可以先进行图像分割
对于大量图片处理，可以考虑使用GPU加速（如EasyOCR和PaddleOCR支持GPU）
商业应用可以考虑使用云服务OCR API（如百度OCR、阿里云OCR、腾讯云OCR等）