Skip to content

Add qa_count_clear_total to the outputs#161

Merged
vnewey merged 9 commits intodevelopfrom
add_total_count_clear
Mar 26, 2026
Merged

Add qa_count_clear_total to the outputs#161
vnewey merged 9 commits intodevelopfrom
add_total_count_clear

Conversation

@vnewey
Copy link
Copy Markdown
Collaborator

@vnewey vnewey commented Mar 26, 2026

Add qa_count_clear_total to the outputs
Add some GDAL parameters to get around processes giving up to quickly when a scene is in cold storage.
os.environ["GDAL_HTTP_TIMEOUT"] = "300" # default is 30s
os.environ["GDAL_HTTP_RETRY_DELAY"] = "5" # seconds between retries

Add qa_total_count_clear to the outputs
Add some GDAL parameters to get around processes giving up to quickly when a scene is in cold storage.
os.environ["GDAL_HTTP_TIMEOUT"] = "300"          # default is 30s
    os.environ["GDAL_HTTP_MAX_RETRY"] = "10"          # default is 0
    os.environ["GDAL_HTTP_RETRY_DELAY"] = "5"         # seconds between retries
@vnewey vnewey requested a review from robbibt March 26, 2026 01:58
@github-actions
Copy link
Copy Markdown

For full integration test results, refer to the Tests directory README.

Copy link
Copy Markdown
Member

@robbibt robbibt left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hey @vnewey, have made some suggestions below. I think you can accept the suggestions directly through Github if they look OK.

The GDAL change is the biggest unknown here I think - I was wondering if it might be safer to adopt the existing datacube GDAL cloud defaults instead. But I guess we can test it and see how it goes.

Comment thread intertidal/composites.py Outdated
# Record params in logs
log.info(f"{run_id}: Using parameters {input_params}")

# This is to help when scenes need to be moved from s3 cold storage
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't know enough about these specific options to know if they're likely to have any negative downstream impacts. I do wonder though whether we should just adopt the GDAL_CLOUD_DEFAULTS that are used by datacube when you run configure_s3_access:
https://github.com/opendatacube/odc-loader/blob/main/src/odc/loader/_rio.py#L76-L80

GDAL_CLOUD_DEFAULTS = {
    "GDAL_DISABLE_READDIR_ON_OPEN": "EMPTY_DIR",
    "GDAL_HTTP_MAX_RETRY": "10",
    "GDAL_HTTP_RETRY_DELAY": "0.5",
}

Comment thread intertidal/composites.py Outdated
Comment thread intertidal/composites.py Outdated
Comment thread intertidal/composites.py Outdated
Comment thread intertidal/composites.py Outdated
Comment thread intertidal/composites.py Outdated
Comment thread intertidal/composites.py Outdated
Comment thread intertidal/composites.py Outdated
@robbibt robbibt self-requested a review March 26, 2026 02:59
Copy link
Copy Markdown
Member

@robbibt robbibt left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Woo!

@vnewey vnewey merged commit cb9599c into develop Mar 26, 2026
@vnewey vnewey deleted the add_total_count_clear branch March 26, 2026 03:14
@robbibt robbibt changed the title Add qa_total_count_clear to the outputs Add qa_count_clear_total to the outputs Mar 26, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants