GroupNotFoundError with datasets 2025-09+ #29

@ana-sher

While retrieving items from the "sentinel-2-l2a" collection for a specific area (given in the full code below) and the period '2025-07-01/2025-09-30', and after filtering out deprecated items, I got results with a gap between 2025-07-27 and 2025-09-23, see:

...
'https://objects.eodc.eu:443/e05ab01a9d56408d82ac32d69a5aae2a:202509-s02msil2a/23/products/cpm_v256/S2C_MSIL2A_20250923T095041_N0511_R079_T33UYR_20250923T150014.zarr',
 'https://objects.eodc.eu:443/e05ab01a9d56408d82ac32d69a5aae2a:202507-s02msil2a/27/products/cpm_v256/S2A_MSIL2A_20250727T095101_N0511_R079_T33UYR_20250727T121617.zarr'
...
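
To double-check the gap independently of the STAC response order, the sensing dates can be read straight out of the product URLs. This is only a minimal sketch: `sensing_date`, `find_gaps` and the 10-day `max_days` threshold are hypothetical names and values chosen for illustration, not part of pystac or the EOPF tooling.

```python
import re
from datetime import date, datetime

# Sensing start timestamp embedded in the product name,
# e.g. "MSIL2A_20250923T095041" -> 2025-09-23.
_SENSING_RE = re.compile(r"MSIL2A_(\d{8})T\d{6}")


def sensing_date(url: str) -> date:
    """Extract the sensing date from a Sentinel-2 L2A product URL."""
    match = _SENSING_RE.search(url)
    if match is None:
        raise ValueError(f"no sensing timestamp found in {url!r}")
    return datetime.strptime(match.group(1), "%Y%m%d").date()


def find_gaps(urls: list[str], max_days: int = 10) -> list[tuple[date, date]]:
    """Return (previous, next) date pairs that are more than max_days apart.

    max_days is an arbitrary threshold picked for this sketch.
    """
    dates = sorted({sensing_date(u) for u in urls})
    return [(a, b) for a, b in zip(dates, dates[1:]) if (b - a).days > max_days]
```

Calling `find_gaps(cloud_storage_urls)` on the list built in the reproduction code further down should report the 2025-07-27 to 2025-09-23 pair if no other product falls in between.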

The datatree for the 2025-07-27 Zarr store opens fine, but for the datasets from 2025-09 onward I get the following error:

GroupNotFoundError: No group found in store 'https://objects.eodc.eu:443/e05ab01a9d56408d82ac32d69a5aae2a:202509-s02msil2a/23/products/cpm_v256/S2C_MSIL2A_20250923T095041_N0511_R079_T33UYR_20250923T150014.zarr' at path ''

Full code to reproduce the issue:

from pystac import MediaType, ItemCollection
from pystac_client import Client
import xarray as xr

eopf_stac_api_root_endpoint = "https://stac.core.eopf.eodc.eu/"
collection = "sentinel-2-l2a"
eopf_catalog = Client.open(url=eopf_stac_api_root_endpoint)

def fetch_catalogue_collection(collection_name: str) -> ItemCollection:
    search_result = eopf_catalog.search(
        collections=collection_name,
        bbox=(19.081281771627136, 49.7814303219126, 19.10565768715448, 49.79195922883795),
        datetime='2025-07-01T00:00:00Z/2025-09-30T23:59:59.999999Z',
    )
    return search_result.item_collection()

search_result_s2 = fetch_catalogue_collection(collection)
id_date_collection = [(item.id, item.datetime) for item in search_result_s2 if 'deprecated' not in item.properties or item.properties['deprecated'] == False]

c_sentinel2 = eopf_catalog.get_collection(collection)
items = c_sentinel2.get_items(*[item[0] for item in id_date_collection])
assets = [item.get_assets(media_type=MediaType.ZARR) for item in items]
cloud_storage_urls = [asset['product'].href for asset in assets]

xr.open_datatree(
    cloud_storage_urls[7],
    engine="zarr",
    chunks="auto",
)
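
A possible way to narrow this down is to check which group metadata objects actually exist at the store root. Zarr v2 keeps group metadata in `.zgroup` (plus `.zmetadata` when consolidated), while Zarr v3 uses `zarr.json`. The sketch below is not a confirmed diagnosis: `candidate_metadata_urls` and `probe_store` are hypothetical helpers, and it assumes the object store answers plain HTTP HEAD requests for these keys.

```python
from urllib.request import Request, urlopen

# Root-level metadata keys: consolidated v2, plain v2 group, v3 group.
METADATA_KEYS = (".zmetadata", ".zgroup", "zarr.json")


def candidate_metadata_urls(store_url: str) -> dict[str, str]:
    """Map each known metadata key to its URL under the store root."""
    base = store_url.rstrip("/")
    return {key: f"{base}/{key}" for key in METADATA_KEYS}


def probe_store(store_url: str, timeout: float = 10.0) -> dict[str, bool]:
    """Report which metadata objects respond with HTTP 200 to a HEAD request."""
    found = {}
    for key, url in candidate_metadata_urls(store_url).items():
        try:
            with urlopen(Request(url, method="HEAD"), timeout=timeout) as resp:
                found[key] = resp.status == 200
        except OSError:
            # HTTPError, URLError and socket timeouts are all OSError subclasses.
            found[key] = False
    return found
```

Comparing `probe_store(...)` for the 2025-07-27 store, which opens, with the 2025-09-23 store, which fails, would show whether the root metadata layout differs between the two.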
