
Conversation

@iluise (Collaborator) commented Oct 29, 2025

Description

To better investigate spatial patterns in the scores and in the ensemble distributions, this PR adds the option to plot all scores (not only RMSE and spread) on a map.
The code recomputes the scores by averaging over the sample (initialization time) dimension and produces one plot per forecast step, ensemble member, and metric.
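
For intuition, the reduction is essentially a mean over the sample axis of a per-grid-point score array. A minimal sketch, assuming the scores live in an xarray DataArray with (sample, step, ens, cell) dimensions; names and shapes are illustrative, not the actual PR code:

import numpy as np
import xarray as xr

# Hypothetical per-grid-point scores with a sample (initialization time) axis.
scores = xr.DataArray(
    np.random.rand(10, 4, 2, 5000),  # (sample, step, ens, cell)
    dims=("sample", "step", "ens", "cell"),
)

# Averaging out the sample dimension leaves one value per grid cell for every
# forecast step / ensemble member combination, i.e. one map per (step, ens).
score_maps = scores.mean(dim="sample")  # dims: (step, ens, cell)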

Examples:

[Two screenshots (2025-10-29): example score maps]

Issue Number

Closes #1122 (Plot skill and spread maps in evaluation)

Is this PR a draft? Mark it as draft.

Checklist before asking for review

  • I have performed a self-review of my code
  • My changes comply with basic sanity checks:
    • I have fixed formatting issues with ./scripts/actions.sh lint
    • I have run unit tests with ./scripts/actions.sh unit-test
    • I have documented my code and I have updated the docstrings.
    • I have added unit tests, if relevant
  • I have tried my changes with data and code:
    • I have run the integration tests with ./scripts/actions.sh integration-test
    • (bigger changes) I have run a full training and I have written in the comment the run_id(s): launch-slurm.py --time 60
    • (bigger changes and experiments) I have shared a HedgeDoc in the GitHub issue with all the configurations and runs for these experiments
  • I have informed and aligned with people impacted by my change:
    • for config changes: the MatterMost channels and/or a design doc
    • for changes of dependencies: the MatterMost software development channel

@iluise self-assigned this Oct 29, 2025
@iluise added the "eval" label (anything related to the model evaluation pipeline) Oct 29, 2025
@iluise removed this from WeatherGen-dev Oct 29, 2025
@iluise moved this to In Progress in WeatherGen-dev Oct 29, 2025
@SavvasMel (Contributor)

How can I replicate this? What should I add to the config?

@SavvasMel (Contributor)

I am also a bit confused: since you average over all samples, how can you plot one map per ensemble member? Am I misunderstanding what an ensemble is?

@iluise (Collaborator, Author) commented Oct 30, 2025

How can I replicate this? What should I add to the config?

evaluation:
  metrics  : ["rmse", "mae"]
  regions: ["global", "nhem"]
  summary_plots : true
  summary_dir: "./plots/"
  plot_ensemble: "members" #supported: false, "std", "minmax", "members"
  plot_score_maps: true  # <-- new option added by this PR
  print_summary: false   # print score values to screen; can be verbose
  log_scale: false
  add_grid: false
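
For reference, the same config can also be exercised programmatically via the entry points that appear in the traceback further down this thread; a minimal sketch (the config filename is a placeholder):

from omegaconf import OmegaConf
from weathergen.evaluate.run_evaluation import evaluate_from_config

# Load an evaluation config containing the block above and run the pipeline.
# "eval_config.yaml" is a hypothetical filename.
evaluate_from_config(OmegaConf.load("eval_config.yaml"))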

@iluise (Collaborator, Author) commented Oct 30, 2025

I am also a bit confused, since you average over all samples how you plot for each ensemble member? Do I understand wrongly what an ensemble is?

There's a loop over ensemble members, so each ensemble member has its own plot:
https://github.com/ecmwf/WeatherGenerator/pull/1176/files#diff-fb043e50e73406a916b50c2ebd2247b619bcdeb851d3e23dc0bf71a522498c89R243
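
In other words, the plotting routine nests roughly as below; this is a sketch with assumed names (scores, plot_map, dimension labels), not the code from the diff:

# One figure per (metric, forecast step, ensemble member).
for metric in metrics:                    # e.g. ["rmse", "mae"]
    for step in range(n_forecast_steps):
        for member in range(n_ens_members):
            # average out the sample (initialization time) dimension
            field = scores[metric].isel(step=step, ens=member).mean(dim="sample")
            plot_map(field, title=f"{metric} | step {step} | member {member}")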

@SavvasMel (Contributor)

It is not working for me if I have something like:

global_plotting_options:
  # image_format : "png" #options: "png", "pdf", "svg", "eps", "jpg" ..
  # dpi_val : 300
  ERA5:
    marker_size: 2
    scale_marker_size: 1
    marker: "o"
    2t: 
      vmin: 
      vmax: 

I get:


  File "/p/project1/hclimrep/melidonis1/WG_Fork/WeatherGenerator/.venv/bin/evaluate", line 10, in <module>
    sys.exit(evaluate())
             ^^^^^^^^^^
  File "/p/project1/hclimrep/melidonis1/WG_Fork/WeatherGenerator/packages/evaluate/src/weathergen/evaluate/run_evaluation.py", line 36, in evaluate
    evaluate_from_args(sys.argv[1:])
  File "/p/project1/hclimrep/melidonis1/WG_Fork/WeatherGenerator/packages/evaluate/src/weathergen/evaluate/run_evaluation.py", line 56, in evaluate_from_args
    evaluate_from_config(OmegaConf.load(config))
  File "/p/project1/hclimrep/melidonis1/WG_Fork/WeatherGenerator/packages/evaluate/src/weathergen/evaluate/run_evaluation.py", line 101, in evaluate_from_config
    _ = plot_data(reader, stream, global_plotting_opts)
        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/p/project1/hclimrep/melidonis1/WG_Fork/WeatherGenerator/packages/evaluate/src/weathergen/evaluate/utils.py", line 338, in plot_data
    maps_config = common_ranges(
                  ^^^^^^^^^^^^^^
  File "/p/project1/hclimrep/melidonis1/WG_Fork/WeatherGenerator/packages/evaluate/src/weathergen/evaluate/utils.py", line 571, in common_ranges
    maps_config[var].update({"vmin": float(min(list_min))})
                                           ^^^^^^^^^^^^^
ValueError: The truth value of an array with more than one element is ambiguous. Use a.any() or a.all()
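
The traceback points at common_ranges in utils.py: built-in min() is applied to list_min, whose entries are evidently NumPy arrays rather than scalars, so the element-wise comparison cannot be coerced to a single bool. A minimal reproduction and one possible fix (variable names taken from the traceback; the fix is a sketch, not the actual patch):

import numpy as np

list_min = [np.array([1.0, 2.0]), np.array([0.5, 3.0])]
# float(min(list_min))  # ValueError: truth value of an array ... is ambiguous

# Reducing each entry to a scalar first avoids the ambiguous comparison;
# skipping None entries would also guard against the empty vmin/vmax fields
# in the config above.
vmin = float(min(np.min(x) for x in list_min if x is not None))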
