Use SavedModel instead of HDF5 format, fix dewarping by bertsky · Pull Request #89 · OCR-D/ocrd_anybaseocr

bertsky · 2022-02-19T11:46:25Z

On Python 3.8, you get errors trying to load the existing HDF5 models for Tensorflow processors tiseg and layout-analysis.

However, TensorFlow offers a more stable alternative: SavedModel directories. I have converted the existing models and adapted the code to make them runnable again.

Now, how do we redistribute these? I have uploaded them as tarballs here and here. But really they should go to https://ocr-d-repo.scc.kit.edu/models/dfki as well.

As soon as we get OCR-D/core#800 done, we should then be able to update the resource list in ocrd-tool.json, right?

Another dependency is in the processors using ocrolib.morph, i.e. nlbin and textline: OCR-D/ocropy#2 – @kba, as soon as you have merged and published ocrd-fork-ocropy==1.4.0a4, this is ready to go.

- move model loading into `setup` in constructor context
- allow directories as models (TF SavedModel format), too
- use correct pageId
- simplify and polish
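To illustrate what "allow directories as models" can mean in practice, here is a minimal sketch that tells a SavedModel directory apart from an HDF5 file. The function name `detect_model_format` is hypothetical and not taken from this PR; the only fact relied on is that a TensorFlow SavedModel directory contains a `saved_model.pb` file.

```python
from pathlib import Path


def detect_model_format(path):
    """Return 'savedmodel' for a SavedModel directory, 'hdf5' for an HDF5 file.

    Hypothetical helper for illustration; not the processor's actual code.
    """
    path = Path(path)
    if path.is_dir():
        # A TensorFlow SavedModel directory contains a saved_model.pb file.
        if (path / "saved_model.pb").is_file():
            return "savedmodel"
        raise ValueError(f"{path} is a directory but not a SavedModel")
    if path.is_file() and path.suffix.lower() in {".h5", ".hdf5"}:
        return "hdf5"
    raise ValueError(f"unrecognized model path: {path}")
```

A processor could call such a check once in `setup`, before handing the path to the framework's loader, so that a wrong resource fails early with a clear message.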

… default

use custom dataset class for in-memory PIL.Image passing instead of the file-based, repurposed `AlignedDataset` (this is faster and more reliable: OCR-D does not guarantee us a `.filename` for derived images; it also no longer creates temporary files in the input fileGrp)
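A rough sketch of the idea, assuming a map-style dataset (an object with `__len__` and `__getitem__`, which is what PyTorch's `DataLoader` consumes). The class name `InMemoryImageDataset` and the dictionary keys are illustrative assumptions, not the names used in this PR.

```python
class InMemoryImageDataset:
    """Map-style dataset over images already held in memory.

    Hypothetical sketch: class name and item keys are assumptions.
    """

    def __init__(self, images, transform=None):
        self.images = list(images)
        self.transform = transform

    def __len__(self):
        return len(self.images)

    def __getitem__(self, index):
        image = self.images[index]
        if self.transform is not None:
            image = self.transform(image)
        # A derived image has no file on disk, so a synthetic identifier
        # stands in for the path.
        return {"label": image, "path": f"in-memory-{index}"}
```

Because nothing is read from or written to disk, the dataset neither depends on a `.filename` attribute nor leaves temporary files behind.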

after decoding, convert tensor to array with due respect for proper channel and dynamic range coding (instead of ad-hoc conversion); then resize while still in RGB and re-binarize (instead of ad-hoc binarization followed by resizing in binary)
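The order of operations described here can be sketched as follows, using NumPy and Pillow. This is an illustration under stated assumptions: the generator output is taken to be a channel-first float array in [-1, 1], and the fixed global threshold is a placeholder, since the commit message does not say which binarization method is used. Both function names are hypothetical.

```python
import numpy as np
from PIL import Image


def tensor_to_rgb_array(chw):
    """Convert a CHW float array in [-1, 1] to an HWC uint8 array in [0, 255]."""
    hwc = np.transpose(np.asarray(chw, dtype=np.float32), (1, 2, 0))
    scaled = (hwc + 1.0) / 2.0 * 255.0
    rounded = np.clip(np.rint(scaled), 0, 255).astype(np.uint8)
    return np.ascontiguousarray(rounded)


def resize_then_binarize(rgb, size, threshold=128):
    """Resize while still in RGB, then threshold the grayscale result to 1-bit.

    `size` is (width, height). The fixed threshold is an assumption made
    for illustration only.
    """
    resized = Image.fromarray(rgb).resize(size, Image.BICUBIC)
    gray = resized.convert("L")
    return gray.point(lambda p: 255 if p >= threshold else 0, mode="1")
```

Interpolating in RGB keeps intermediate gray values along stroke edges, which a later threshold can resolve. Resizing an already binary image offers no such information to work with.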

- rebase on pix2pixHD#293 (CPU-only option, Torch>=1.0, less verbose, arg passing)
- pass args to pix2pixHD directly (instead of `sys.argv` hijacking)
- no unnecessary verbosity (and only through loggers)
- move model loading into startup context via `setup` fn
- rename params:
  * `imgresize` → `resize_mode`
  * `resizeHeight` → `resize_height`
  * `resizeWidth` → `resize_width`
- add proper documentation
- fix region-level results
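The difference between passing arguments directly and hijacking `sys.argv` can be sketched with the standard `argparse` module. The option names below mirror the renamed parameters, but the parser itself is a hypothetical stand-in rather than pix2pixHD's actual option class, and the `None` defaults for width and height are placeholders.

```python
import argparse


def build_parser():
    # Hypothetical parser: option names follow the renamed parameters;
    # defaults other than resize_mode are placeholders.
    parser = argparse.ArgumentParser(prog="dewarp-sketch")
    parser.add_argument("--resize_mode", default="resize_and_crop")
    parser.add_argument("--resize_width", type=int, default=None)
    parser.add_argument("--resize_height", type=int, default=None)
    return parser


def parse_options(args):
    """Parse an explicit argument list; sys.argv is neither read nor modified."""
    return build_parser().parse_args(list(args))
```

Calling `parse_options(["--resize_mode", "none"])` leaves the process-wide `sys.argv` untouched, so the calling command-line tool keeps its own arguments intact.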

(just BIN is not enough / not as good / not realistic)

bertsky · 2022-02-20T22:22:18Z

Now also depends on NVIDIA/pix2pixHD#293, and contains various other fixes, mostly regarding dewarping.

Fixes #34, #35, #40, #60, #61, #72, #73, #77, #87, #88, and probably #42 (see below – with resize_mode=none).

With better upsampling/re-binarization, the quality of the dewarper has also improved a little. It is obviously not a good idea to downsample in the first place (which is the case with the default resize_mode=resize_and_crop). But one could always increase resize_width/resize_height, or use resize_mode=none to gain full size quality at the cost of higher memory and time demand.

Here are some examples based on the dfki-testdata test case (after binarization and cropping):

dewarped with default settings:

before	after

dewarped with default settings but on GPU:

before	after

dewarped with larger size (less resampling/interpolation):

before	after

dewarped with original/full image size:

before	after

dewarped on cropped but raw RGB (just to show that the models have not been trained on such data):

before	after

bertsky · 2022-02-22T22:23:00Z

> Now, how do we redistribute these? I have uploaded them as tarballs here and here. But really they should go to https://ocr-d-repo.scc.kit.edu/models/dfki as well.

Like I said, we still need to upload the new models, and update the resource URLs. (This is the reason the CI still fails.)

bertsky · 2022-02-22T23:33:15Z

> Fixes #34, #35, #40, #60, #61, #72, #73, #77, #87, #88, and probably #42

BTW I forgot to link these (and my formulation is not covered by autolinking). Please close them.

OCR-D/ocrd_anybaseocr#89

Robert Sachunsky added 4 commits February 19, 2022 03:40

Makefile: fix test dependencies; update to resmgr cwd semantics

2778e4a

layout-analysis: improve…

9c7e242

- move model loading into `setup` in constructor context
- allow directories as models (TF SavedModel format), too
- use correct pageId
- simplify and polish

ocrd-tool (tiseg/layout-analysis): use SavedFormat instead of HDF5 by…

a7d3b3c

… default

update requirements

4574397

This was referenced Feb 19, 2022

make Python 3.8 work OCR-D/ocrd_all#289

Merged

ocrd_all native installation fails due to tensorflow OCR-D/ocrd_all#235

Closed

Robert Sachunsky added 11 commits February 20, 2022 15:22

add test for dewarping

cee07d6

tests: fix relative import

f8db5f7

tests: fix initLogging

b853a63

test_dewarp: mets.find_files is a generator now

adc2f3e

dewarping: fix image post-processing…

5ba7890

after decoding, convert tensor to array with due respect for proper channel and dynamic range coding (instead of ad-hoc conversion); then resize while still in RGB and re-binarize (instead of ad-hoc binarization followed by resizing in binary)

test_dewarp: also when on CPU, use CROP as input

3575f9c

(just BIN is not enough / not as good / not realistic)

📦 1.7.0

e1e84bb

fix/update README

3cfc69b

update CHANGELOG

89f337e

bertsky changed the title ~~Use SavedModel instead of HDF5 format~~ Use SavedModel instead of HDF5 format, fix dewarping Feb 20, 2022

layout-analysis: fix parent for new chapter/section

0bbcb66

README: explain resmgr download and pip install

01aea45

kba merged commit 01aea45 into OCR-D:master Feb 22, 2022

kba added a commit to OCR-D/core that referenced this pull request Mar 16, 2022

change ocrd-anybaseocr-layout-analysis model

c354663

OCR-D/ocrd_anybaseocr#89

This was referenced Mar 16, 2022

change ocrd-anybaseocr-layout-analysis model OCR-D/core#819

Merged

tests do not work #88

Closed

dewarping: gpu_id does not default to -1 in any case #87

Closed

test_A dir in dewarping #77

Closed

Error in "ocrd-anybaseocr-layout-analysis" #73

Closed

kba merged 17 commits into OCR-D:master from bertsky:model-directories
