Add first version of the reduced-space ensemble Kalman filter method as part of #464 #491

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

m-rempel wants to merge 57 commits into pySTEPS:master from m-rempel:add_reduced_space_enkf

Contributor

m-rempel commented Sep 2, 2025

Hi @RubenImhoff and @dnerini!

Here's my first version of the implementation of the reduced-space ensemble Kalman filter method. By calling blending_method = blending.get_method("pca_enkf")

One can now compute a combined forecast by pass arrays of observed precipitation fields, the NWP ensemble model forecast, the motion vector field based on observations as well as the timestamps corresponding to the observations and NWP forecast, respectively.

It's not necessary that the temporal resolution of the NWP forecast is equal to the observation. However, one has to adjust the background inflation factor to >1.0 in this case.

There is also a comprehensive set of additional combination keyword arguments like:

combination_kwargs = dict(n_tapering=0,
                              non_precip_mask = True,
                              n_ens_prec=1,
                              lien_criterion=True,
                              n_lien=10,
                              post_prob_matching = False,
                              iterative_prob_matching = True,
                              inflation_factor_bg=1.0,
                              inflation_factor_obs=1.0,
                              offset_bg=0.0,
                              offset_obs=0.0,
                              nwp_hres_eff=14.0,
                              sampling_prob_source="ensemble",)

Currently, only an AR(1) process within the forecast step is implemented and the respective tests for the method are not implemented.

m-rempel and others added 11 commits

May 27, 2025 10:30


          Add importer for DWD radar products

c466b9c


          Update DWD HDF5 Importer

1e98e4f


          Add reprojection of unstructured grids

9d47508


          docs: add docstrings to function and run black

f4b1506


          Add suggestions of draft pull request in importer and reprojection

a632102


          Merge branch 'add_dwd_radar_data' into add_enkf_combination

0a78666


          Update doc strings in io/importers.py and utils/reprojection.py

1b063e0


          Add first version of reduced-space EnKF combination technique

7d477a4


          Clean up EnKF code

79427ec


          Begin documentation of EnKF code

1fc2f5c


          First version of reduced-space ens kalman filter code

f3b9f3d

m-rempel marked this pull request as ready for review

September 2, 2025 14:59

Contributor

RubenImhoff commented Sep 2, 2025

Fantastic, Martin! I will have a look at it tomorrow (already gave it a go earlier, of course, but I'll make it a more thorough review now).

m-rempel and others added 2 commits

September 6, 2025 13:18


          Move probability matching at the end of the forecast step (steps swap…

954e0f7

…ped)


          fix: use isinstance instead of type

7d0e2b4

RubenImhoff reviewed

View reviewed changes

pysteps/blending/ens_kalman_filter_methods.py Outdated

+              n_lien: int, (n_ens_members/2)
+                  Minimum number of ensemble members that forecast precipitation for the Lien
+                  criterion. As standard, half of the ensemble members should forecast precipitation.
+              post_prob_matching: bool, (False)

Contributor

RubenImhoff Sep 9, 2025

This one is currently not used from what I can see. Should we delete it?

Contributor Author

m-rempel Sep 9, 2025

I've merged the argumentes post_prob_matching and iter_prob_matching into a new one named prob_matching.

RubenImhoff reviewed

View reviewed changes

pysteps/blending/ens_kalman_filter_methods.py

+              iterative_prob_matching: bool, (True)
+                  Flag to specify whether a probability matching should be applied at each correction
+                  step.
+              inflation_factor_bg: float, (1.0)

Contributor

RubenImhoff Sep 9, 2025

Could you add a few lines here describing what the inflation factor is used for?

Contributor Author

m-rempel Sep 9, 2025

Added.

RubenImhoff reviewed

View reviewed changes

pysteps/blending/ens_kalman_filter_methods.py

+                  Inflation factor of the background (NWC) covariance matrix.
+              inflation_factor_obs: float, (1.0)
+                  Inflation factor of the observation (NWP) covariance matrix.
+              offset_bg: float, (0.0)

Contributor

RubenImhoff Sep 9, 2025

Could you add a few lines here describing what the offset is used for?

Contributor Author

m-rempel Sep 9, 2025

Done.

RubenImhoff reviewed

View reviewed changes

pysteps/blending/ens_kalman_filter_methods.py

+                  Offset of the background (NWC) covariance matrix.
+              offset_obs: float, (0.0)
+                  Offset of the observation (NWP) covariance matrix.
+              nwp_hres_eff: float

Contributor

RubenImhoff Sep 9, 2025

This one is currently not used from what I can see. Should we delete it?

Contributor Author

m-rempel Sep 9, 2025

nwp_hres_eff is now used in ForecastModel to simulate the standard deviation of the smaller scales.

RubenImhoff added 2 commits

September 9, 2025 09:32


          refactor: making a start with refactoring (to be continued)

af523b6


          fix: fix merge issues

a6acc99

RubenImhoff reviewed

View reviewed changes

pysteps/blending/ens_kalman_filter_methods.py

+                      return X_ana
+                  def get_covariance_matrix(self, M: np.ndarray, inflation_factor: float, offset: float):
+                      """

Contributor

RubenImhoff Sep 9, 2025

In this function, can you refer to the equation(s) from Nerini et al. (2019) that are used?

Contributor Author

m-rempel Sep 9, 2025

Done.

RubenImhoff and others added 3 commits

September 9, 2025 10:01


          refactor: change function docstrings and names

3e8db51


          refactor: adjust get_precipitation_mask

7ce65f1


          Add suggested documentation.

fee5871

Contributor

RubenImhoff commented Sep 9, 2025 •

edited

Loading

To do:

Refactor code.
Ensure that less energy/power is lost at the smallest scales.
Fix no-rain instances.
Ensure that we end with full NWP weight.
Add tests.
Test with real-world data.
Create gallery example.

m-rempel and others added 4 commits

September 9, 2025 09:55


          Change names of ensemble variables in EnsembleKalmanFilter.update()

3d8dee0


          refactor: adjust get_lien_criterion function

5f5abfc


          refactor: more refactoring of ens_kalman_filter_methods code

4824cc0


          refactor: refactor /utils/pca.py

fb451c0

m-rempel added 3 commits

September 11, 2025 12:58


          Merge branch 'add_reduced_space_enkf' of github.com:m-rempel/pysteps …

8a9354c

…into add_reduced_space_enkf


          Merge branch 'master' into add_reduced_space_enkf


          Run black

4b50e5f

dnerini reviewed

View reviewed changes

pysteps/utils/pca.py Outdated

+              """
+              import numpy as np
+              from sklearn import decomposition

Member

dnerini Sep 11, 2025

I think this should be handled as an an optional dependency (and hence documented as such), see other examples in the code

Contributor Author

m-rempel Sep 11, 2025

Thanks @dnerini, I've implemented it in this manner.

Contributor Author

m-rempel Sep 11, 2025

Hi again @dnerini,
the tests are still failing, since the PCA from sklearn is necessary for the correction step. I've talked with @RubenImhoff about this issue and he has also no idea how to handle this.
One option could be to compute a pure extrapolation nowcast instead of a combination when there's no sklearn installed. However, that makes no really sense for me.

Member

dnerini Sep 11, 2025

We can install sklearn for the tests, but it shouldn't be added to the base dependencies. This means that users that want to use your method need to install
Sklearn too, otherwise they get an error when they try to use the method

Member

dnerini Sep 11, 2025

See example here https://github.com/pySTEPS/pysteps/blob/master/pysteps/feature/tstorm.py#L117

which is documented in the installation page https://pysteps.readthedocs.io/en/stable/user_guide/install_pysteps.html

Member

dnerini Sep 11, 2025

Test dependencies are specified here

https://github.com/pySTEPS/pysteps/blob/master/ci/ci_test_env.yml

so you can add sklearn there

m-rempel added 3 commits

September 11, 2025 14:40


          Resolve codacy issues

84c898f


          Resolve codacy issues II

0510e37


          Add importorskip('sklearn') in PCA EnKF test

11ca2d3

codecov bot commented Sep 11, 2025 •

edited

Loading

Codecov Report

❌ Patch coverage is 87.51625% with 96 lines in your changes missing coverage. Please review.
✅ Project coverage is 84.14%. Comparing base (f4429b9) to head (e45ecdc).
⚠️ Report is 1 commits behind head on master.

Files with missing lines	Patch %	Lines
pysteps/blending/pca_ens_kalman_filter.py	89.83%	44 Missing ⚠️
pysteps/utils/reprojection.py	16.12%	26 Missing ⚠️
pysteps/blending/ens_kalman_filter_methods.py	89.43%	15 Missing ⚠️
pysteps/utils/pca.py	75.00%	11 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #491      +/-   ##
==========================================
+ Coverage   83.95%   84.14%   +0.19%     
==========================================
  Files         163      168       +5     
  Lines       13739    14507     +768     
==========================================
+ Hits        11534    12207     +673     
- Misses       2205     2300      +95

Flag	Coverage Δ
unit_tests	`84.14% <87.51%> (+0.19%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

RubenImhoff mentioned this pull request

Add importer for ICON-D2-RUC data as part of #464 pySTEPS/pysteps-nwp-importers#5

Merged

m-rempel and others added 6 commits

September 12, 2025 08:46


          Add scikit-learn as an optional test dependency

7b1cd8a


          minor updates

10ae024


          Merge branch 'add_reduced_space_enkf' of https://github.com/m-rempel/…

c6d259c

…pysteps into add_reduced_space_enkf


          Add test for PCA and resolve codacy issues

65fa254


          fix: fix some minor errors

89c58b9


          Merge branch 'add_reduced_space_enkf' of https://github.com/m-rempel/…

bede95d

…pysteps into add_reduced_space_enkf

Contributor Author

m-rempel commented Sep 12, 2025

Even after adding the test of the PCA, codacy issues that pca is import in pysteps/utils/init.py but never used. Do you have any ideas what the reason could be for this, @dnerini and @RubenImhoff?

Contributor

RubenImhoff commented Sep 12, 2025

Even after adding the test of the PCA, codacy issues that pca is import in pysteps/utils/init.py but never used. Do you have any ideas what the reason could be for this, @dnerini and @RubenImhoff?

It gives the same issue for the reprojection module. I'm also not entirely sure why. @dnerini, do you know?


          Solve bug in PCA test

c840e6d

Member

dnerini commented Sep 12, 2025 •

edited

Loading

We can safely ignore those, they shouldn't appear in the next iterations

Contributor Author

m-rempel commented Sep 12, 2025

Ok, thank you @dnerini,
I've fixed another bug in the PCA test and now it should be working again.

RubenImhoff added 3 commits

September 12, 2025 14:38


          fix: minor fixes found through testing the gallery example

2ff6d1b


          Merge branch 'add_reduced_space_enkf' of https://github.com/m-rempel/…

d9f2353

…pysteps into add_reduced_space_enkf


          feat: adjust gallery example

4fa4443

Contributor

RubenImhoff commented Sep 12, 2025

I adjusted the gallery example. Due to the very short NWP forecast horizon in pysteps_data, I have adjusted the time step artificially to 15 minutes, so that we can make a blend up to 120 minutes ahead. It works, but is maybe not the prettiest example this way.


          fix: swap indices

91c1cbf

Contributor

RubenImhoff commented Sep 12, 2025

@m-rempel, looks like we still have some failing pca tests :(


          fix: fix assertion margin in pca test

fb9f841

Contributor

RubenImhoff commented Sep 12, 2025

Not the prettiest plot, nor the best example of the reduced-space ensemble Kalman filter blending, but at least this is a sign that it also work on the KNMI ensemble data (30 members). On my laptop, I could only test it with hourly NWP data, so that explains the mismatches so far. I'll try it out with 5-min data next week. :)
Anyway, for now a sign that the model runs and finishes successfully.


          Fix bug in PCA test when n_components < n_ens_member

e45ecdc

Contributor Author

m-rempel commented Sep 12, 2025

Thank you @RubenImhoff for sharing these first results with other than DWD data! I've now fixed the bug in the PCA test for cases when n_components < n_ens_member.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet