`tf.keras` version that allows any input resolution and doesn't use `Lambda` layers #16

prhbrt · 2024-01-11T18:21:22Z

Since the UNET architecture only uses layers that can scale with the image dimensions, the fixed dimensions seem artificial. I've added a zero-padding layer that increases the dimensions to the nearest multiple of 32. The padding is cut off in the end.

Moreover, I've removed the lambda-layer, as it creates marshaling warning, and used ZeroPadding2D's asymmetric padding feature. This removes a warning upon load_model.

I converted the eynollah-models to this architecture, and they should load without warnings now and use the tensorflow.keras API and can be found here.

This might allow you to skip the patching as used in Eynollah and speed up the whole process. Please let me know what you think.

Notes and sanity checks:

the enhancement model uses the light version without the last batchnorm and with a sigmoid rather than softmax.
the column classifier is a different architecture, I just copied the original model, and hence it's not dimension invariant.
Two models weren't in savemodel-format on your website, e.g. the light versions eynollah-main-regions_20220314 and eynollah-textline_light_20210425, so I couldn't convert them.

prhbrt · 2024-01-11T18:25:05Z

models.py

+import tensorflow as tf

 resnet50_Weights_path='./pretrained_model/resnet50_weights_tf_dim_ordering_tf_kernels_notop.h5'
-IMAGE_ORDERING ='channels_last'


This should be passed as a variable, and ideally use tensorflow's default.

TODO: default to image_data_format from keras' config (~/.keras/keras.json)

prhbrt · 2024-01-11T18:25:35Z

models.py


 resnet50_Weights_path='./pretrained_model/resnet50_weights_tf_dim_ordering_tf_kernels_notop.h5'
-IMAGE_ORDERING ='channels_last'
-MERGE_AXIS=-1


This should follow from data_format, e.g. should always be the channel-axis.

prhbrt · 2024-01-11T18:26:18Z

models.py

-    return x
-
-def identity_block(input_tensor, kernel_size, filters, stage, block):
+def identity_block(input_tensor, kernel_size, filters, stage, block, data_format='channels_last'):


Added data_format as a parameter everywhere, rather than the global constant IMAGE_FORMAT. Also used the same name as tf does for transparency.

prhbrt · 2024-01-11T18:27:04Z

models.py

-
-    x = ZeroPadding2D((3, 3), data_format=IMAGE_ORDERING)(img_input)
-    x = Conv2D(64, (7, 7), data_format=IMAGE_ORDERING, strides=(2, 2),kernel_regularizer=l2(weight_decay), name='conv1')(x)
+class PadMultiple(Layer):


This pads to a multiple of 32 or whatever is specified in dims.

prhbrt · 2024-01-11T18:27:26Z

models.py

+    padded_to_multiple = PadMultiple((32,32))(img_input)
+
+    bn_axis = 3 if data_format == 'channels_last' else 1
+    merge_axis = 3 if data_format == 'channels_last' else 1


Is merge_axis ever something else than the channel dimension?

prhbrt · 2024-01-11T18:28:16Z

models.py

-        model=Model( img_input , x ).load_weights(resnet50_Weights_path)
+        Model(img_input, x).load_weights(resnet50_Weights_path)

+    if light_version:


Duplicate code complicates code maintenance, so I merged the light version in here.

cneud · 2024-01-12T10:30:03Z

Hi @prhbrt, thanks a lot for looking into this and for contributing!

FYI, we are planning to update and refactor this repo and integrate the model training code with https://github.com/qurator-spk/eynollah for future maintenance, so this comes in very handy.

My colleagues @vahidrezanezhad and @michalbubula will be working on this - although due to various reasons, we likely won't be able to get our hands dirty much before March. But we will try our best to review and merge any contributions also beforehand.

Btw the two models that you were missing should be available from our HF:

eynollah-main-regions_20220314 -> https://huggingface.co/SBB/eynollah-main-regions
eynollah-textline_light_20210425 -> https://huggingface.co/SBB/eynollah-textline_light

prhbrt · 2024-01-12T13:57:34Z

@cneud Understood! Could you in the meantime provide a list of all model-architectures used for eynollah (specifically python code)? I couldn't find python-code for the column classifier in particular, so that one still has fixed dimensions.

Also note that this yolo-version might give slightly different outputs as your patching example, due to boundary conditions.

Thank you in advance!

vahidrezanezhad · 2024-01-12T14:19:55Z

@cneud Understood! Could you in the meantime provide a list of all model-architectures used for eynollah (specifically python code)? I couldn't find python-code for the column classifier in particular, so that one still has fixed dimensions.

Also note that this yolo-version might give slightly different outputs as your patching example, due to boundary conditions.

Thank you in advance!

@prhbrt
Sure, I'll try to make classifier code public in the meantime.

kba · 2025-10-16T18:41:07Z

I have tried to port this over to eynollah here: qurator-spk/eynollah#202

If I messed it up, this PR is still around but I am closing so we can archive the repository.

tf.keras version that allows any input resolution

3098700

prhbrt commented Jan 11, 2024

View reviewed changes

cneud requested review from cneud, kba, michalbubula and vahidrezanezhad April 12, 2024 08:42

kba mentioned this pull request Sep 30, 2025

Integrate training from sbb pixelwise segmentation qurator-spk/eynollah#187

Merged

3 tasks

kba closed this Oct 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

`tf.keras` version that allows any input resolution and doesn't use `Lambda` layers #16

`tf.keras` version that allows any input resolution and doesn't use `Lambda` layers #16

Uh oh!

prhbrt commented Jan 11, 2024 •

edited

Loading

Uh oh!

prhbrt Jan 11, 2024

Uh oh!

prhbrt Jan 11, 2024

Uh oh!

prhbrt Jan 11, 2024

Uh oh!

prhbrt Jan 11, 2024

Uh oh!

prhbrt Jan 11, 2024

Uh oh!

prhbrt Jan 11, 2024

Uh oh!

cneud commented Jan 12, 2024

Uh oh!

prhbrt commented Jan 12, 2024

Uh oh!

vahidrezanezhad commented Jan 12, 2024

Uh oh!

kba commented Oct 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

tf.keras version that allows any input resolution and doesn't use Lambda layers #16

tf.keras version that allows any input resolution and doesn't use Lambda layers #16

Uh oh!

Conversation

prhbrt commented Jan 11, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

prhbrt Jan 11, 2024

Choose a reason for hiding this comment

Uh oh!

prhbrt Jan 11, 2024

Choose a reason for hiding this comment

Uh oh!

prhbrt Jan 11, 2024

Choose a reason for hiding this comment

Uh oh!

prhbrt Jan 11, 2024

Choose a reason for hiding this comment

Uh oh!

prhbrt Jan 11, 2024

Choose a reason for hiding this comment

Uh oh!

prhbrt Jan 11, 2024

Choose a reason for hiding this comment

Uh oh!

cneud commented Jan 12, 2024

Uh oh!

prhbrt commented Jan 12, 2024

Uh oh!

vahidrezanezhad commented Jan 12, 2024

Uh oh!

kba commented Oct 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

`tf.keras` version that allows any input resolution and doesn't use `Lambda` layers #16

`tf.keras` version that allows any input resolution and doesn't use `Lambda` layers #16

prhbrt commented Jan 11, 2024 •

edited

Loading