LayerNorm after last skip connection #1157

SavvasMel · 2025-10-27T12:30:41Z

Description

This PR changes the MLP of the last block of the forecast engine. Specifically, it includes a LayerNorm with scale and bias turned off after the last skip connection of the last block.

Issue Number

This is a draft PR

Checklist before asking for review

I have performed a self-review of my code
My changes comply with basic sanity checks:
- I have fixed formatting issues with ./scripts/actions.sh lint
- I have run unit tests with ./scripts/actions.sh unit-test
- I have documented my code and I have updated the docstrings.
- I have added unit tests, if relevant
I have tried my changes with data and code:
- I have run the integration tests with ./scripts/actions.sh integration-test
- (bigger changes) I have run a full training and I have written in the comment the run_id(s): launch-slurm.py --time 60
- (bigger changes and experiments) I have shared a hegdedoc in the github issue with all the configurations and runs for this experiments
I have informed and aligned with people impacted by my change:
- for config changes: the MatterMost channels and/or a design doc
- for changes of dependencies: the MatterMost software development channel

MatKbauer

Thanks for sharing your code, Savvas. Let's please modify it slightly to minimize redundant lines. See my suggestions below.

MatKbauer · 2025-10-28T11:24:43Z

src/weathergen/model/engines.py

+                            norm_type=self.cf.norm_type,
+                            dim_aux=1,
+                            norm_eps=self.cf.mlp_norm_eps,
+                        )


For the sake of having less redundant code, can we modify the MLP to receive an additional argument with_residual_layer_norm=False instead of introducing FEMLP? When calling MLP here, we can set with_residual_layer_norm=(i + 1) == self.cf.fe_num_blocks to add the residual layer norm in the last MLP layer.

Why would we need to modify the MLP? It can--and in my opinion definitely should--be implemented in the forecast engine. Where we can just have the LayerNorm as the last block.

Can you check now? Is that better?

src/weathergen/model/layers.py

Savvas Melidonis and others added 2 commits October 25, 2025 23:20

changes to engines and layers

298c032

Correct bug with blocks

e36ee2a

github-project-automation bot added this to WeatherGen-dev Oct 27, 2025

SavvasMel changed the title ~~Fe experiments layer~~ LayerNorm after last skip connection Oct 27, 2025

grassesi deleted the branch ecmwf:mk/develop/fe_experiments October 27, 2025 14:18

grassesi closed this Oct 27, 2025

github-project-automation bot moved this to Done in WeatherGen-dev Oct 27, 2025

grassesi mentioned this pull request Oct 27, 2025

[ISSUE] Deleted remote branches #1161

Closed

38 tasks

grassesi reopened this Oct 27, 2025

MatKbauer requested changes Oct 28, 2025

View reviewed changes

github-project-automation bot moved this from Done to In Progress in WeatherGen-dev Oct 28, 2025

Add the LayerNorm as block

ec51425

SavvasMel requested a review from MatKbauer October 28, 2025 18:23

SavvasMel added 2 commits October 28, 2025 19:36

Add some doc comments

6c6b08b

change to original code and submit also the configs

a8ccaf5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

LayerNorm after last skip connection #1157

LayerNorm after last skip connection #1157

Uh oh!

SavvasMel commented Oct 27, 2025

Uh oh!

MatKbauer left a comment

Uh oh!

MatKbauer Oct 28, 2025

Uh oh!

clessig Oct 28, 2025

Uh oh!

SavvasMel Oct 28, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

LayerNorm after last skip connection #1157

Are you sure you want to change the base?

LayerNorm after last skip connection #1157

Uh oh!

Conversation

SavvasMel commented Oct 27, 2025

Description

Issue Number

Checklist before asking for review

Uh oh!

MatKbauer left a comment

Choose a reason for hiding this comment

Uh oh!

MatKbauer Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

clessig Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

SavvasMel Oct 28, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants