climate_ref.executor.reingest
#
Reingest existing execution results without re-running diagnostics.
This module provides functionality to re-run build_execution_result() and
re-ingest the results into the database for executions that have already completed.
This is useful when new series definitions or metadata extraction logic has been added
and you want to apply it to existing outputs without re-executing the diagnostics.
Reingest always creates a new Execution record under the same ExecutionGroup
with its own output directory, leaving the original execution untouched.
Results are treated as immutable.
get_executions_for_reingest(database, *, execution_group_ids=None, provider_filters=None, diagnostic_filters=None, include_failed=False)
#
Query executions eligible for reingest.
Always selects the oldest (original) execution per group so that reingest uses the execution whose scratch directory actually exists. Reingested executions only have results directories, not scratch.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
database
|
Database
|
Database instance |
required |
execution_group_ids
|
Sequence[int] | None
|
If provided, only include these execution group IDs |
None
|
provider_filters
|
list[str] | None
|
Filter by provider slug (substring, case-insensitive) |
None
|
diagnostic_filters
|
list[str] | None
|
Filter by diagnostic slug (substring, case-insensitive) |
None
|
include_failed
|
bool
|
If True, also include failed executions |
False
|
Returns:
| Type | Description |
|---|---|
list[tuple[ExecutionGroup, Execution]]
|
List of (ExecutionGroup, oldest Execution) tuples |
Source code in packages/climate-ref/src/climate_ref/executor/reingest.py
reconstruct_execution_definition(config, execution, diagnostic, output_fragment=None)
#
Reconstruct an ExecutionDefinition from database state.
This rebuilds the definition that was originally used to produce the execution, using the execution's stored datasets, output fragment, and the live diagnostic object from the provider registry.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
config
|
Config
|
Application configuration (provides |
required |
execution
|
Execution
|
The database |
required |
diagnostic
|
Diagnostic
|
The live |
required |
output_fragment
|
str | None
|
If provided, use this fragment instead of the execution's own fragment for the output directory. Used during reingest to point at the new scratch location after copying. |
None
|
Returns:
| Type | Description |
|---|---|
ExecutionDefinition
|
A reconstructed |
Source code in packages/climate-ref/src/climate_ref/executor/reingest.py
reingest_execution(config, database, execution, provider_registry)
#
Reingest an existing execution.
Re-runs build_execution_result() against the scratch directory
(which contains the raw outputs from the original diagnostic run),
creates a new Execution record with a unique output fragment,
copies results to the new location, and ingests metrics into the database.
The original execution is left untouched.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
config
|
Config
|
Application configuration |
required |
database
|
Database
|
Database instance |
required |
execution
|
Execution
|
The |
required |
provider_registry
|
ProviderRegistry
|
Registry of active providers (used to resolve the live diagnostic) |
required |
Returns:
| Type | Description |
|---|---|
bool
|
True if reingest was successful, False if it was skipped due to an error |
Source code in packages/climate-ref/src/climate_ref/executor/reingest.py
211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 | |