Installation

Prerequisites

Linux (Windows is not officially supported)
Python 3.7
PyTorch 1.5 or higher
torchvision 0.6.0
CUDA 10.1
NCCL 2
GCC 5.4.0 or higher
mmcv 1.2.6

We have tested the following versions of OS and softwares:

OS: Ubuntu 16.04
CUDA: 10.1
GCC(G++): 5.4.0
mmcv 1.2.6
PyTorch 1.5
torchvision 0.6.0

MMOCR depends on Pytorch and mmdetection v2.9.0.

Step-by-Step Installation Instructions

a. Create a conda virtual environment and activate it.

conda create -n open-mmlab python=3.7 -y
conda activate open-mmlab

b. Install PyTorch and torchvision following the official instructions, e.g.,

conda install pytorch==1.5.0 torchvision==0.6.0 cudatoolkit=10.1 -c pytorch

Note: Make sure that your compilation CUDA version and runtime CUDA version match. You can check the supported CUDA version for precompiled packages on the PyTorch website.

E.g. 1 If you have CUDA 10.1 installed under /usr/local/cuda and would like to install PyTorch 1.5, you need to install the prebuilt PyTorch with CUDA 10.1.

conda install pytorch cudatoolkit=10.1 torchvision -c pytorch

E.g. 2 If you have CUDA 9.2 installed under /usr/local/cuda and would like to install PyTorch 1.3.1., you need to install the prebuilt PyTorch with CUDA 9.2.

conda install pytorch=1.3.1 cudatoolkit=9.2 torchvision=0.4.2 -c pytorch

If you build PyTorch from source instead of installing the prebuilt package, you can use more CUDA versions such as 9.0.

c. Create a folder called code and clone the mmcv repository into it.

mkdir code
cd code
git clone https://github.com/open-mmlab/mmcv.git
cd mmcv
git checkout -b v1.2.6 v1.2.6
pip install -r requirements.txt
MMCV_WITH_OPS=1 pip install -v -e .

d. Clone the mmdetection repository into it. The mmdetection repo is separate from the mmcv repo in code.

cd ..
git clone https://github.com/open-mmlab/mmdetection.git
cd mmdetection
git checkout -b v2.9.0 v2.9.0
pip install -r requirements.txt
pip install -v -e .
export PYTHONPATH=$(pwd):$PYTHONPATH

Note that we have tested mmdetection v2.9.0 only. Other versions might be incompatible.

e. Clone the mmocr repository into it. The mmdetection repo is separate from the mmcv and mmdetection repo in code.

cd ..
git clone https://github.com/open-mmlab/mmocr.git
cd mmocr

f. Install build requirements and then install MMOCR.

pip install -r requirements.txt
pip install -v -e .  # or "python setup.py build_ext --inplace"
export PYTHONPATH=$(pwd):$PYTHONPATH

Full Set-up Script

Here is the full script for setting up mmocr with conda.

conda create -n open-mmlab python=3.7 -y
conda activate open-mmlab

# install latest pytorch prebuilt with the default prebuilt CUDA version (usually the latest)
conda install pytorch==1.5.0 torchvision==0.6.0 cudatoolkit=10.1 -c pytorch

# install mmcv
mkdir code
cd code
git clone https://github.com/open-mmlab/mmcv.git
cd mmcv # code/mmcv
git checkout -b v1.2.6 v1.2.6
pip install -r requirements.txt
MMCV_WITH_OPS=1 pip install -v -e .

# install mmdetection
cd .. # exit to code
git clone https://github.com/open-mmlab/mmdetection.git
cd mmdetection # code/mmdetection
git checkout -b v2.9.0 v2.9.0
pip install -r requirements.txt
pip install -v -e .
export PYTHONPATH=$(pwd):$PYTHONPATH

# install mmocr
cd ..
git clone https://github.com/open-mmlab/mmocr.git
cd mmocr # code/mmocr

pip install -r requirements.txt
pip install -v -e .  # or "python setup.py build_ext --inplace"
export PYTHONPATH=$(pwd):$PYTHONPATH

Another option: Docker Image

We provide a Dockerfile to build an image.

# build an image with PyTorch 1.5, CUDA 10.1
docker build -t mmocr docker/

Run it with

docker run --gpus all --shm-size=8g -it -v {DATA_DIR}:/mmocr/data mmocr

Prepare Datasets

It is recommended to symlink the dataset root to mmocr/data. Please refer to datasets.md to prepare your datasets. If your folder structure is different, you may need to change the corresponding paths in config files.

The mmocr folder is organized as follows:

mmocr
.
├── configs
│   ├── _base_
│   ├── kie
│   ├── textdet
│   └── textrecog
├── demo
│   ├── demo_text_det.jpg
│   ├── demo_text_recog.jpg
│   ├── image_demo.py
│   └── webcam_demo.py
├── docker
│   └── Dockerfile
├── docs
│   ├── api.rst
│   ├── changelog.md
│   ├── code_of_conduct.md
│   ├── conf.py
│   ├── datasets.md
│   ├── getting_started.md
│   ├── index.rst
│   ├── install.md
│   ├── make.bat
│   ├── Makefile
│   ├── merge_docs.sh
│   ├── requirements.txt
│   └── stats.py
├── LICENSE
├── mmocr
│   ├── apis
│   ├── core
│   ├── datasets
│   ├── __init__.py
│   ├── models
│   ├── utils
│   └── version.py
├── README.md
├── requirements
│   ├── build.txt
│   ├── docs.txt
│   ├── optional.txt
│   ├── readthedocs.txt
│   ├── runtime.txt
│   └── tests.txt
├── requirements.txt
├── resources
│   ├── illustration.jpg
│   └── mmocr-logo.png
├── setup.cfg
├── setup.py
├── tests
│   ├── data
│   ├── test_apis
│   ├── test_dataset
│   ├── test_metrics
│   ├── test_models
│   ├── test_tools
│   └── test_utils
├── tools
│   ├── data
│   ├── dist_test.sh
│   ├── dist_train.sh
│   ├── kie_test_imgs.py
│   ├── kie_test_imgs.sh
│   ├── ocr_test_imgs.py
│   ├── ocr_test_imgs.sh
│   ├── publish_model.py
│   ├── slurm_test.sh
│   ├── slurm_train.sh
│   ├── test_imgs.py
│   ├── test_imgs.sh
│   ├── test.py
│   └── train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

install.md

install.md

Installation

Prerequisites

Step-by-Step Installation Instructions

Full Set-up Script

Another option: Docker Image

Prepare Datasets

Files

install.md

Latest commit

History

install.md

File metadata and controls

Installation

Prerequisites

Step-by-Step Installation Instructions

Full Set-up Script

Another option: Docker Image

Prepare Datasets