Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Performance of PAN in Total-text #526

Open
TianBaoGe opened this issue Oct 16, 2021 · 8 comments
Open

Performance of PAN in Total-text #526

TianBaoGe opened this issue Oct 16, 2021 · 8 comments
Labels
bug Something isn't working

Comments

@TianBaoGe
Copy link

Hi, thanks for your great job.
I run the script and get the H-mean of PAN in Total-text. I find my H-mean is about 0.73, while it is 0.835 in the original paper, without pre-training.

Are the experiment settings different between them?

@gaotongxiao
Copy link
Collaborator

ping @jeffreykuang

@TianBaoGe
Copy link
Author

ping @jeffreykuang

Is there a problem with the official generated code of Total-Text? I used the same settings as in the paper, have you conducted relevant experiments?

@gaotongxiao
Copy link
Collaborator

By "official generated code", do you mean the conversion script? https://github.com/open-mmlab/mmocr/blob/main/tools/data/textdet/totaltext_converter.py

I think it's a quite straightforward script so it's less likely a data conversion problem. The reason is unclear to us as we haven't conducted experiments on TotalText. It would be great if you could share more information, including

  1. Have you run other experiments on PANet? Were the results consistent with the paper?
  2. The config file that you used to train PANet on;
  3. The log file of training.

@jeffreykuang
Copy link
Collaborator

I didn't conduct comparative experiments on total text. I did experiments on icdar15 or icdar17. I suggest you to check the pretrained model used in the paper.

@TianBaoGe
Copy link
Author

By "official generated code", do you mean the conversion script? https://github.com/open-mmlab/mmocr/blob/main/tools/data/textdet/totaltext_converter.py

I think it's a quite straightforward script so it's less likely a data conversion problem. The reason is unclear to us as we haven't conducted experiments on TotalText. It would be great if you could share more information, including

  1. Have you run other experiments on PANet? Were the results consistent with the paper?
  2. The config file that you used to train PANet on;
  3. The log file of training.

I followed https://github.com/open-mmlab/mmocr/blob/main/tools/data/textdet/totaltext_converter.py to generate the Total-Text dataset

(1) Here is my config file I used to train PANet:
panet_r18_fpem_ffm_600e_totaltext.txt

(2) Here is my log file of training:
20211020_165607.log

The highest Hmean score in my experiments is 0.784, while is 83.5 in the original paper.

@TianBaoGe
Copy link
Author

I didn't conduct comparative experiments on total text. I did experiments on icdar15 or icdar17. I suggest you to check the pretrained model used in the paper.

I just used resnet18 without syntxt to train the model, but the final result was terrible.
You can check my experiment settings in the reply above.
Looking forward to your reply. Thanks a lot!

@jeffreykuang
Copy link
Collaborator

As for our experimental results, you can check https://github.com/open-mmlab/mmocr/blob/main/configs/textdet/panet/README.md.

For text detection, we found the following aspects are very important:

  • pretrained model
  • data augmentation
  • if ignore hard text bounding boxes.

@TianBaoGe
Copy link
Author

As for our experimental results, you can check https://github.com/open-mmlab/mmocr/blob/main/configs/textdet/panet/README.md.

For text detection, we found the following aspects are very important:

  • pretrained model
  • data augmentation
  • if ignore hard text bounding boxes.

In fact, in the official implementations of PAN, the config files of total-text and CTW1500 are exactly the same, so I just directly used your CTW config file in https://github.com/open-mmlab/mmocr/blob/main/configs/textdet/panet/README.md for training, but the result is really bad.

@gaotongxiao gaotongxiao added the bug Something isn't working label Nov 12, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants