Model

mirror of https://github.com/Hestia-Homes/Model.git synced 2026-06-08 11:17:27 +00:00

Author	SHA1	Message	Date
Khalim Conn-Kowlessar	13dd5fe81a	feat(modelling): per-measure scoring — marginal cascade + per-Option signal (#1156 ) scoring.py adds the telescoping marginal cascade that serves two of the three ADR-0016 scoring roles: - marginal_impacts(scorer, baseline, overlays): applies overlays cumulatively in order and reports each measure's marginal MeasureImpact (sap_points + carbon/energy savings). Role 3 (final-package attribution) — the marginals telescope EXACTLY to the whole-package total. - independent_option_impacts(scorer, baseline, options): role 1 — scores each Option's overlay independently vs baseline, scoring each DISTINCT overlay once (Options sharing an overlay reuse the result). Approximate signal for the optimiser; never surfaced as a measure's true impact. Role 2 (whole-package re-score) is PackageScorer.score directly. Three behaviour tests on the real Sap10Calculator / a counting stand-in (hand-built EPD): single-overlay marginal == improvement-over-baseline; two-overlay marginals telescope to the package total; per-Option dedup scores each distinct overlay once. Closes #1156. pyright strict clean. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 08:50:49 +00:00
Khalim Conn-Kowlessar	7a478cff6e	feat(modelling): Package Scorer — compose overlays + score on the calculator PackageScorer(calculator: SapCalculator).score(baseline, simulations) folds the Simulation Overlays onto the baseline via the Overlay Applicator and scores the throwaway EpcPropertyData on the injected deterministic SAP calculator, returning Score(sap_continuous, co2_kg_per_yr, primary_energy_kwh_per_yr). Depends on the SapCalculator abstraction, not a concrete engine. This is the reusable scoring primitive (ADR-0016) — the same call serves the optimiser's whole-package re-score and a future live re-score of a user-assembled plan. Two behaviour tests against the real Sap10Calculator on a hand-built EPD: filling the main cavity improves SAP (right-directional through the real physics); an empty package scores the unmodified baseline (pins the SapResult->Score mapping). The Elmhurst before/after cascade PIN (#1154's acceptance) lands once cert 001431 parses (external _extract_windows fix). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 08:41:30 +00:00
Khalim Conn-Kowlessar	bb2c0068ff	feat(modelling): price the cavity Option from area x Product - closes #1155 recommend_cavity_wall now takes a ProductRepository and prices the Measure Option: Cost(total = gross_heat_loss_wall_area(MAIN) x product.unit_cost_per_m2, contingency_rate = product.contingency_rate). Detection is unchanged and runs before pricing, so ineligible walls still return None without a catalogue hit. Completes #1155: the cavity-wall Recommendation Generator now detects an uninsulated main cavity wall and emits a priced Option carrying the filled-cavity overlay. Four behaviour tests (detection x3 + fully-loaded cost). pyright strict clean. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 08:35:52 +00:00
Khalim Conn-Kowlessar	b2c8980dd2	feat(modelling): ProductRepository + Postgres materials-table source Product(measure_type, unit_cost_per_m2, contingency_rate). ProductRepository is the DDD port abstracting the catalogue source; ProductPostgresRepository reads the externally-owned material table (defensive SQLModel view MaterialRow) and maps an active row to a Product — total_cost becomes the fully-loaded unit_cost_per_m2 — joining the per-measure-type contingency (contingencies.py, mirrors Costs.CONTINGENCIES; cavity 0.10). Strict-raise on missing/inactive row. A JSON-backed impl will follow behind the same port for ETL-gap costs. Two DB tests against an ephemeral Postgres (map active row; raise on inactive-only). Toward #1155 cost (4b). Also generalises the CONTEXT Simulation Overlay wording: windows are targeted by index, building-part association carried via window_location (_window_bp_index). pyright clean. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 08:32:38 +00:00
Khalim Conn-Kowlessar	0ba0575877	feat(modelling): shared gross heat-loss wall area geometry helper domain/building_geometry.gross_heat_loss_wall_area(epc, identifier) sums heat_loss_perimeter x room_height across a building part's storeys — the heat-loss wall area (party walls excluded by construction), not total wall area. Lives outside the calculator so Modelling cost quantities can reuse it; the calculator computes the same quantity inline today and should be DRY'd onto this later (coordinated with the calculator branch). Pinned at 45.93 m^2 against the 000490 MAIN part. Toward #1155 cost (behaviour 4). pyright strict clean. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 22:53:12 +00:00
Khalim Conn-Kowlessar	214b38ff78	feat(modelling): wall Recommendation Generator - cavity-fill detection + overlay recommend_cavity_wall(epc) detects an uninsulated main cavity wall (wall_construction=4, wall_insulation_type=4) and emits a Recommendation whose single Measure Option carries the Simulation Overlay setting MAIN wall_insulation_type=2 (Table 6 'Filled cavity'; cf. domain/sap10_ml/rdsap_uvalues.py u_wall). Returns None for already-insulated or non-cavity main walls. Recommendation/MeasureOption reshaped per design review: the target is encoded in the Option's overlay (addresses a building part / window / system), not a typed key on Recommendation, which generalises to glazing and heating without changing the type. CONTEXT partition wording generalised to match. Three behaviour tests (hand-built EPD, no PDF). Cost (behaviour 4 of #1155) outstanding: needs net heat-loss wall area + ProductRepository. WIP on #1155. pyright strict clean. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 22:49:33 +00:00
Khalim Conn-Kowlessar	350f4c8e76	feat(modelling): Overlay Applicator folds EpcSimulation onto EpcPropertyData EpcSimulation is the Simulation Overlay — a narrow all-optional partial mirror of EpcPropertyData/SapBuildingPart (wall surface first), targeting building parts by BuildingPartIdentifier (composition, not inheritance). apply_simulations(baseline, simulations) deep-copies the baseline, folds overlays in order (later wins on a shared field) via a generic non-None field write, and returns a throwaway EpcPropertyData for the calculator; the baseline is never mutated. Four behaviour tests (hand-built EPD from the 000490 fixture, no PDF): targeted-write-leaves-others-untouched, empty-overlay no-op, sequential last-wins, baseline-immutability. pyright strict clean. Slice 1 of the Modelling stage rebuild (ADR-0016). Closes #1153. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 22:13:51 +00:00
Khalim Conn-Kowlessar	f179950519	feat(baseline): wire BillDerivation into the orchestrator and persist the Bill (ADR-0014) The PropertyBaselineOrchestrator now reads the current Fuel Rates snapshot once per batch, builds a BillDerivation, and prices each scored property's SapResult -> EnergyBreakdown into a Bill carried on PropertyBaselinePerformance (None only on the stub no-calculator path). The Bill is flattened onto nullable bill_* flat columns (per-section kwh+cost, standing charges, SEG credit, total) on the postgres table, with bill_total_annual_bill_gbp as the not-null discriminator on read-back. Section absent from the bill stays None, not 0. Updated all four orchestrator construction sites to inject the FuelRatesRepository port (handler + three test sites), and the FE migration doc to reflect the prefixed columns and that they are now populated. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 18:51:18 +00:00
Khalim Conn-Kowlessar	f7dc9dbccb	feat(baseline): Rebaseliner returns RebaselineResult carrying the SapResult The Rebaseliner is the assemble-and-score step (ADR-0013 amendment); its SapResult is the scored picture that Bill Derivation also prices (ADR-0014), so rebaseline() now returns a RebaselineResult{effective, reason, sap_result} instead of (Performance, reason). CalculatorRebaseliner sets sap_result on both branches (the bill prices it whether lodged or calculated figures win); StubRebaseliner returns sap_result=None (runs no calculator). Orchestrator unpacks the result; the bill wiring lands in the next slice. Also refreshes the stale ML-era docstrings in rebaseliner.py to the assemble-and-score model (the calculator, not ML, is the rebaseliner mechanism per ADR-0013). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 18:37:13 +00:00
Khalim Conn-Kowlessar	5e75fb474c	feat(baseline): EnergyBreakdown.from_sap_result + COOLING section The SapResult -> EnergyBreakdown adapter (ADR-0014), a classmethod on the target mirroring Performance.from_sap_result. Folds each positive per-end-use delivered kWh into a billable EnergyLine: main/main-2/secondary heating and hot water at their resolved fuel (sap_code_to_fuel); lighting/pumps-fans/appliances/cooking/cooling as electricity. PV export carries to exported_kwh for the SEG credit. Zero-kWh end uses emit no line; a positive kWh with no fuel code raises rather than billing at a default (strict, mirrors the calculator). Adds BillSection.COOLING (electricity, from space_cooling_fuel_kwh_per_yr). BillDerivation already prices any section it is given, so no change there. Also corrects the ADR-0014 amendment: SapResult carries the calculator's own fuel codes (raw API or Table-32 per mapper, ADR-0015); sap_fuel normalizes. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 18:28:22 +00:00
Khalim Conn-Kowlessar	2cb4dd5833	feat(baseline): sap_code_to_fuel normalizes via the calculator's own helper The fuel codes the calculator now puts on SapResult are its own codes — raw gov-API enums or already-Table-32, depending on the source mapper (ADR-0015). sap_code_to_fuel now runs the code through table_32.to_table_32_code (promoted from private _to_table_32_code) — T32-first, then API-translate, the SAME normalization the calculator's pricing/CO2 helpers use — before the Table-32 -> Fuel dispatch, so the bill's carrier matches what the calculator billed (incl. the API/T32 collision codes, e.g. 20 = wood-logs not heat-net). Falls back to the raw code for billing fuels the price table omits (the 41-58 heat-network range), which resolve to HEAT_NETWORK -> UnpricedFuel — stricter than, and intentionally divergent from, the calculator's lossy default-to-mains-gas for an unpriced code (ADR-0014 §5). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 18:24:39 +00:00
Khalim Conn-Kowlessar	4e9ff7c3cb	feat(calculator): thread per-end-use fuel codes + PV export onto SapResult ADR-0014 BillDerivation attributes each end-use (HEATING / HOT_WATER / SECONDARY / APPLIANCES / COOKING) to a fuel carrier and credits PV export. SapResult already carried the per-end-use kWh but not WHICH fuel each end-use burns, nor the annual exported kWh, so a downstream SapResult->EnergyBreakdown adapter could not pick the right tariff. Surfaces five output-only fields, threaded exactly like the recently merged appliances/cooking change (`2f039aeb`): main_heating_fuel_code / main_2_heating_fuel_code / secondary_heating_fuel_code / hot_water_fuel_code: RdSAP10 Table 32 / SAP 10.2 Table 12 fuel code column (the lodged fuel code, e.g. mains gas 26); None when the corresponding system is absent / fuel not resolvable. pv_exported_kwh_per_yr: SAP 10.2 Appendix M1 §3-4 annual export kWh (0.0 when no PV). cert_to_inputs.py populates the four fuel codes from the existing resolvers the cost/CO2 cascade already uses: `_main_fuel_code`, `_secondary_fuel_code`, `_water_heating_fuel_code` (not reinvented); Main 2 is the second `main_heating_details` entry, guarded for length. There is a single CalculatorInputs construction site (cert_to_demand_inputs delegates to cert_to_inputs). `pv_exported_kwh_per_yr` already existed on CalculatorInputs; SapResult collapses its Optional to 0.0. HARD CONSTRAINT honoured: output-only, zero rating drift. These fields do NOT feed ECF / total_fuel_cost_gbp / co2_kg_per_yr / primary_energy_* / sap_score / any monthly value. Every golden-fixture, Elmhurst e2e SapResult pin, section cascade pin, and heating-corpus residual stays byte-identical: calculator suite 1658 -> 1661 passed (+3 new tests), 4 skipped, 0 failed before and after. pyright net-zero (51 -> 51 in domain/; no new errors in the touched test files).
New tests: a synthetic threading test (four fuel codes + PV export pass unchanged through calculate_sap_from_inputs; None PV collapses to 0.0) and a cert-level pin (mains-gas combi cert 000516 -> main fuel code 26, no Main 2, secondary 30, HW 26). Synthetic CalculatorInputs / SapResult fixtures updated for the new SapResult fields (defaults cover Inputs). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 18:16:40 +00:00
Khalim Conn-Kowlessar	c431453d75	refactor(fuel-rates): name the adapter aggregate-first per house convention PR feedback: adapters here are <aggregate>_<backend>_repository (e.g. property_baseline_postgres_repository). Rename the fuel-rates adapter to match — file static_file_fuel_rates_repository.py -> fuel_rates_static_file_repository.py and class StaticFileFuelRatesRepository -> FuelRatesStaticFileRepository, plus its test. git mv preserves history. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 17:21:27 +00:00
Khalim Conn-Kowlessar	d7d5084f90	Move sap10_calculator tests to tests/domain/sap10_calculator/ for CI The calculator tests lived under domain/sap10_calculator/{tests,worksheet/tests,rdsap/tests,climate/tests,validation/tests}, none of which are in pytest.ini testpaths, so CI (which collects tests/) never ran them. Relocate all five dirs to tests/domain/sap10_calculator/{,worksheet,rdsap,climate,validation}, mirroring the tests/domain/property_baseline/ convention, so the cascade-pin / golden / e2e conformance suites run in CI. Mechanics: - git mv preserves history (110 files). - Flattening the trailing /tests keeps each file's depth-to-repo-root identical, so all 16 repo-root parents[4] fixture refs stay valid. Only test_pcdb_etl.py's parents[1] (→ pcdb data) and one hardcoded absolute golden-fixture path in test_cert_to_inputs.py needed rebasing. - Cross-imports rewritten domain.sap10_calculator.worksheet.tests → tests.domain.sap10_calculator.worksheet (21 files incl. the external importer backend/documents_parser/tests/test_summary_pdf_mapper_chain.py). - Golden-fixture path strings in test_summary_pdf_mapper_chain.py + scripts/fetch_cohort2_api_jsons.py updated to the new location (the JSONs moved with the rdsap tests). load_cells / gitignored worksheet xlsx: the xlsx-pinned tests (test_dimensions / ventilation / water_heating) read 2026-05-19-17-18 RdSap10Worksheet.xlsx, which is gitignored (.gitignore `*.xlsx`) and so absent in CI. _xlsx_loader.load_cells now pytest.skip()s when the file is absent, so those tests run locally and skip cleanly in CI instead of erroring: no new CI failures from the move, and the gitignore policy is respected. Verified: tests/domain/sap10_calculator + backend/documents_parser + tests/domain/property_baseline = 2248 pass, 1 skipped; pyright resolves the new import paths with zero import-resolution errors. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 16:58:00 +00:00
Khalim Conn-Kowlessar	69995edec8	Merge branch 'main' of https://github.com/Hestia-Homes/Model into feature/per-cert-mapper-validation	2026-06-02 16:10:41 +00:00
Jun-te Kim	4e02eb7c77	more tests to ensure we don't deploy something that is broken	2026-06-02 15:03:20 +00:00
Khalim Conn-Kowlessar	2f039aeb39	Thread appliances + cooking annual kWh onto SapResult for ADR-0014 bills ADR-0014 BillDerivation prices a per-end-use EnergyBreakdown (HEATING / HOT_WATER / LIGHTING / PUMPS_FANS / APPLIANCES / COOKING). SapResult already carried the first four but not appliances or cooking, so a downstream SapResult→EnergyBreakdown adapter had to stub those two at 0 kWh — understating the bill by the whole unregulated electricity load. Surface them so the property_baseline side can wire the sections. Adds two output-only fields to CalculatorInputs + SapResult, threaded exactly like lighting_kwh_per_yr: appliances_kwh_per_yr — SAP 10.2 Appendix L L13/L14/L16a annual E_A (sum of the §5 (68) monthly appliances kWh) cooking_kwh_per_yr — SAP 10.2 Appendix L L20 (p.91) ELECTRICITY estimate E_cook = 138 + 28×N Both values already existed in cert_to_inputs.py (appliances_monthly_kwh, cooking_monthly_kwh) — reused, not recomputed. Fuel attribution: cooking_kwh_per_yr is the L20 ELECTRICITY figure (the field docstring says so), distinct from the L18 cooking heat GAIN (35 + 7N W) the §5 internal-gains cascade uses. The bill adapter should treat cooking as an electricity carrier; a gas-cooker split, if ever needed, is a separate follow-up. HARD CONSTRAINT honoured — output-only, zero rating drift. Appliances + cooking are unregulated and are NOT fed into ECF / total_fuel_cost / CO2 / primary energy / sap_score. Every golden-fixture, Elmhurst e2e SapResult pin, section cascade pin, and heating-corpus residual stays byte-identical (1165 rated pins green). The synthetic CalculatorInputs fixtures set the new fields non-zero on purpose so the existing cost/PE reconciliation assertions act as leak detectors. New focused test asserts both fields are populated (non-zero) and threaded unchanged onto SapResult, with cooking equal to the L20 electricity figure (138 + 28×occupancy) to 1e-9. pyright net-zero 111 → 111. 
Note: 11 pre-existing failures in test_appendix_u.py / test_table_32.py arrived with the recently absorbed PR and are unrelated to this change (they fail identically on the clean branch); flagged separately. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 15:00:10 +00:00
Khalim Conn-Kowlessar	bce4a9f7ec	refactor(baseline): SapCalculator ABC replaces the Calculator Protocol PR feedback: prefer an abstract base the calculator inherits from over a structural Protocol. Define `SapCalculator(ABC)` in the calculator package (the engine owns its own contract) and have `Sap10Calculator` inherit it; a future methodology is another subclass. Placing the ABC with the engine — not in property_baseline — keeps the dependency pointing consumer -> engine (sap10_calculator imports nothing from property_baseline). Consistent with the repo's existing port convention (FuelRatesRepository(ABC)). CalculatorRebaseliner keeps its reference to SapCalculator type-only (under TYPE_CHECKING), so the module still does not import the calculator at runtime. Test fakes now inherit the ABC since structural conformance no longer applies. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 13:45:48 +00:00
Khalim Conn-Kowlessar	15da2d3970	feat(baseline): CalculatorRebaseliner — calculator goes load-bearing (ADR-0013 amend) Slice 5a: the promotion. Replaces StubRebaseliner in production and collapses the shadow runner into the rebaseliner (ADR-0013 amendment). - CalculatorRebaseliner runs Sap10Calculator on every Property: * sap_version < 10.2 -> Effective Performance IS the calculator output (band via Epc.from_sap_score, CO2 kg->t, PEUI rounded), reason "pre_sap10". * sap_version >= 10.2 -> Effective = lodged (API figures on-target), and the calculator only logs divergence (SAP>0.5, PEUI/CO2 1%) as a validation signal. * a calculator raise propagates -> batch aborts (ADR-0012); fix the cert at once. - Rebaseliner.rebaseline gains property_id (for the divergence log). - LoggingCalculatorShadow / the calculator_shadow seam removed from the orchestrator; its divergence-comparison logic now lives in the rebaseliner. - StubRebaseliner kept (signature updated) for orchestrator/repo unit tests. The SapResult->EnergyBreakdown adapter + BillDerivation wiring (to populate the bill block) follow once the appliances/cooking SapResult fields land. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 10:04:24 +00:00
Khalim Conn-Kowlessar	5f65b9be62	feat(baseline): SAP fuel-code -> Fuel mapping for billing (ADR-0014) Slice 3 of Bill Derivation. sap_code_to_fuel(code) maps a SAP 10.2 / Table 32 fuel code to the canonical billing Fuel — bounded to the ~47 Table 32 codes (the carrier, orthogonal to the PCDB product index, so all PCDB heat pumps share one electricity code). Mains gas / LPG / oil+bioliquids / coal / smokeless / wood / electricity (standard + off-peak) / heat-network groupings; an unmapped code (dual fuel, grid-export) raises UnmappedSapCode rather than guessing. Also: ADR-0014 deferred/TODO section records the stubbed appliances+cooking (pending the SapResult fields), the off-peak day/night split, the heat-network rate gap, and regional rates / ETL. The SapResult -> EnergyBreakdown adapter (next slice) is gated on the appliances/cooking fields landing on SapResult. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 09:50:10 +00:00
Khalim Conn-Kowlessar	8ae3b56f41	feat(baseline): BillDerivation prices an energy breakdown at Fuel Rates (ADR-0014) Slice 2 of Bill Derivation. BillDerivation(fuel_rates).derive(breakdown) takes a delivered-energy breakdown (per-section EnergyLine(section, fuel, kwh) + exported_kwh) and produces a Bill: per-section kWh + cost, standing charges, SEG credit, and total. - Each end-use line billed at its fuel's unit rate. - Standing charge added ONCE per distinct fuel used (a meter, not an end use); off-gas fuels carry 0 so contribute nothing — no metered/unmetered special case. - SEG export credit subtracted. - Deterministic (ADR-0006); raises UnpricedFuel (via FuelRates) on an unpriced fuel (e.g. heat network) rather than billing at a wrong default. Pure domain — no calculator dependency; the SapResult->EnergyBreakdown adapter is slice 3. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 09:38:44 +00:00
Khalim Conn-Kowlessar	14b45a1b3e	feat(fuel-rates): FuelRates snapshot + repository foundation (ADR-0014) Slice 1 of Bill Derivation — the reference-data foundation that later slices price the calculator's per-end-use kWh against: - Fuel enum (canonical billing fuels; the join key between the calculator's SAP-code fuels and the rates snapshot). COAL + HEAT_NETWORK are members with no national rate. - FuelRates value object: unit_rate_p_per_kwh / standing_charge_p_per_day / seg_export_p_per_kwh; raises UnpricedFuel on a fuel it has no rate for rather than billing at a wrong default. - FuelRatesRepository port (ADR-0011 Repo-reads-stored-reference-data) + StaticFileFuelRatesRepository reading a committed JSON snapshot. - Snapshot fuel_rates_2026_q2.json: GB national, Apr-Jun 2026 Ofgem cap (gas/electricity) + DESNZ/NEP May 2026 (off-gas). Carries the full researched data; the value object exposes single-rate fuels this slice. Off-peak (day/night), house coal and heat network raise UnpricedFuel until later slices. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 09:29:07 +00:00
Khalim Conn-Kowlessar	561e1b8b49	feat(baseline): run Sap10Calculator in shadow on Property Baseline (ADR-0013) Wire Sap10Calculator into PropertyBaselineOrchestrator as a non-load-bearing shadow runner. For each property it scores the Effective EPC beside the load-bearing Lodged/Effective write, catches any strict-raise -> log.error (never aborts the batch), and on success log.warning's divergence from Lodged: SAP \|continuous - lodged\| > 0.5; PEUI/CO2 > 1% relative (CO2 after kg->tonnes). Every line is tagged with sap_version so SAP-10.2 signal separates from older-spec drift (ADR-0010 Validation Cohort). Per ADR-0013, Calculated SAP10 Performance is not a persisted third value-set: effective = calculated in every baselining scenario, so the calculator IS the mechanism that produces Effective Performance (the Rebaseliner). It runs in shadow only while being hardened; when overrides/estimation land it is promoted to drive Effective and the failure posture flips to abort (ADR-0012, calculator now load-bearing). No table change. - ADR-0013 + CONTEXT (Calculated SAP10 Performance / Effective Performance / Rebaselining) record the decision. - CalculatorShadow port + LoggingCalculatorShadow + Calculator protocol. - FakeCalculatorShadow for orchestrator unit tests. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 08:01:47 +00:00
Jun-te Kim	bf3b689f15	Remove EPC and asset_list changes unrelated to SAL handler This branch's objective is the SAL ingestion handler (applications/SAL/handler.py) and its dependency tree. Drop work that crept in but is unreferenced by it: - EPC feature: domain/epc, infrastructure/epc (gov_uk + historical clients), tests/infrastructure/epc - datatypes/epc edits (instantaneous_wwhrs Optional) reverted to main - asset_list/app.py local data-file/column tweak reverted to main Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-06-01 16:39:09 +00:00
Jun-te Kim	bdf703ea00	updated rdsap option; separated s3 location in infrastructure; added open ai api	2026-06-01 16:33:14 +00:00
Khalim Conn-Kowlessar	1ea71a3acb	refactor(ara): rename FirstRunPipeline → AraFirstRunPipeline (PR #1139 review) Aligns the composition with its entry point (the `ara_first_run` lambda + `AraFirstRunTriggerBody`): clearer what the file does. - orchestration/first_run_pipeline.py → ara_first_run_pipeline.py - FirstRunPipeline → AraFirstRunPipeline; FirstRunCommand → AraFirstRunCommand - test files renamed to match Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	457d959b1f	refactor(property-baseline): rename baseline → property_baseline aggregate (PR #1139 review) Wholesale rename of the Baseline aggregate to PropertyBaseline for clarity / to disambiguate from baselines that appear elsewhere in Modelling. Scoped to this aggregate only — the distinct Rebaselining term (rebaseline_reason, StubRebaseliner, RebaselineNotImplemented) is deliberately untouched. - domain/baseline → domain/property_baseline; BaselinePerformance → PropertyBaselinePerformance. - repositories/baseline → repositories/property_baseline; BaselineRepository / BaselinePostgresRepository → PropertyBaseline*. - orchestration/baseline_orchestrator.py → property_baseline_orchestrator.py; BaselineOrchestrator → PropertyBaselineOrchestrator. BaselineStage → PropertyBaselineStage. - infrastructure/postgres: baseline_performance_table.py → property_baseline_performance_table.py; table `baseline_performance` → `property_baseline_performance`; Model renamed. - UnitOfWork attribute `.baseline` → `.property_baseline`. - Docs: ADR-0004 references + migration doc (renamed to property-baseline-performance-table.md) updated. CONTEXT.md glossary term ("Baseline Performance") left as-is pending a ubiquitous-language call (raised on the PR). 123 tests pass; pyright strict clean (only the unrelated pre-existing moto import errors remain). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	d2d008f5c5	perf(repos): bulk get_many / get_for_properties — batch reads, not N round-trips (#1138 ) Final slice of ADR-0012: collapse the per-property read round-trips a batch made (Baseline hydrated ~8 queries x 30 properties one at a time) into a handful of per-table IN queries. - EpcPostgresRepository: extracted a shared `_compose(rows)` from `get` (the windows + floor-dim fetches are now passed in, not fetched inline), so both `get` and the new `get_for_properties(property_ids)` build EpcPropertyData from pre-fetched rows. `get_for_properties` fetches each child table once (`WHERE epc_property_id IN ...`), groups in memory, and composes — load-whole per ADR-0002. - PropertyRepository.get_many(property_ids) -> Properties: one query for the property rows + one bulk EPC hydration, composed in input order. - BaselineOrchestrator / IngestionOrchestrator read the batch via get_many instead of N x get. - Ports + fakes gain the bulk methods. The #1129 round-trip fidelity test stays green (the compose extraction is behaviour-preserving). New tests: bulk hydration correctness + round-trips are constant w.r.t. batch size (one-per-table, proven by query count). 123 pass; pyright strict clean; AAA. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	7275850c9e	refactor(orchestration): wire stages onto the UnitOfWork; per-stage commit (#1138 ) Replaces the handler's whole-pipeline Session (one transaction across all three stages, connection pinned during Ingestion's external IO) with a Unit-of-Work per stage (ADR-0012, added here). Each stage runs its batch in one unit and commits once; any property raising aborts the batch and the subtask fails noisily. - BaselineOrchestrator(unit_of_work, rebaseliner): one unit for the batch, commit once. Raise on a pre-SAP10 property leaves the unit uncommitted. - IngestionOrchestrator(unit_of_work, epc_fetcher, geospatial_repo, solar_fetcher): fetch/write split — phase 1 fetches the whole batch (EPC / coords / solar) with NO unit open; phase 2 writes in one unit and commits. The connection is never held during external IO. Geospatial S3 repo stays injected (reference data, not transactional). - Handler: module-scoped engine (pool reused across warm invocations) + a UoW factory; whole-pipeline `with Session` gone. `build_first_run_pipeline` composes on the factory. Source clients still behind the raising seam. - ADR-0012 records the decision (per-stage boundary, all-or-nothing batch, idempotent re-run, fetch/write split, module-scoped engine). Modelling stub left untouched (no-op, no DB) per the ADR. Tests: orchestrators on a shared FakeUnitOfWork (assert persisted batch + exactly-once commit + no-commit-on-raise). New real-DB E2E integration test: real PostgresUnitOfWork, Ingestion writes the EPC → Baseline reads it back through the repo → re-run replaces, not duplicates (1 EPC row, 1 baseline row after two runs). 121 pass in tests/; pyright strict clean; AAA. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	fa5224b6ed	feat(repos): idempotent EPC + Baseline writes (replace by property_id) (#1138 ) Re-runs of a First Run batch re-save a property's data; that must replace, not duplicate (ADR-0012 idempotent batch writes). - `EpcPostgresRepository.save` deletes the property's existing EPC graph (parent + all child tables, floor-dims via their building parts) before inserting, when a `property_id` is given. Anonymous saves still insert. - `BaselinePostgresRepository.save` deletes the existing row for the `property_id` before inserting — no more unique-constraint violation on re-save; also what the re-score-on-override path needs. - Solar already upserts, so it's unchanged. The #1129 round-trip fidelity test stays green (delete-first is a no-op on a first save). 2 new tests (re-save replaces, not duplicates). pyright strict clean; AAA. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	5524385984	feat(uow): UnitOfWork port + PostgresUnitOfWork adapter (#1138 ) First slice of the per-stage batch-transaction refactor (ADR-0012). A UnitOfWork is the single transaction a stage runs its batch in: a context manager exposing the DB repos bound to one session, committing once on `commit()` and rolling back on exception or exit-without-commit (all-or-nothing per batch, fail noisily). - `UnitOfWork` (port): `property` / `epc` / `solar` / `baseline` repos + `commit()` / `rollback()`; `__exit__` rolls back uncommitted work. - `PostgresUnitOfWork(session_factory)`: opens a Session from an injected factory (a module-scoped engine + sessionmaker in prod, so the pool is reused across warm invocations), binds the Postgres repos to it, closes on exit. Not yet wired into any orchestrator — that lands in the Baseline / Ingestion refactor slices. 3 tests against ephemeral PG (commit durable across units; exception rolls back; no-commit persists nothing). pyright strict clean; AAA. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	61846665b1	feat(first-run): FirstRunPipeline E2E — Ingestion → Baseline → Modelling (#1136 ) Completes the First Run spine. Replaces the #1130 stub FirstRunPipeline with the real three-stage composition and wires it into the handler. - `FirstRunPipeline.run(command)` sequences Ingestion → Baseline → Modelling, threading only `property_ids` between stages (and `scenario_ids` into Modelling, off the command — never a prior stage's output). Stages are injected behind thin `IngestionStage` / `BaselineStage` / `ModellingStage` Protocols (the EpcFetcher/SolarFetcher idiom), so the handler owns wiring and tests substitute fakes (ADR-0011). - `ModellingOrchestrator` stub + `ScenarioRepository` / `MaterialsRepository` seam ports — `run(property_ids, scenario_ids)` reads through repos, does no scoring yet. Method shapes deferred to the Modelling per-service grills (Scenario / Scenario Phase / Snapshot / Optimised Package / Plans are rich — not pre-empted here). - Handler delegates to the real pipeline via `build_first_run_pipeline` (Postgres-backed repos off the session). The Ingestion source clients (EPC API / Google Solar / geospatial S3) are isolated behind one `_source_clients_from_env` seam that raises until the deploy/Terraform config settles — out of scope for this slice. Subtask complete/failed + CloudWatch URL still come from `@subtask_handler`. Integration test (the criterion's centrepiece): wires REAL Ingestion + REAL Baseline + stub Modelling through a shared fake EPC repo, with a repo-backed PropertyRepo composing the Property from that slice. Proves Baseline reads the very EPC Ingestion persisted — the through-repos hand-off, no in-memory coupling. Plus a composition test pinning stage order + only-property_ids threading. TDD, one test → one impl. pyright strict clean; AAA layout. 116 pass in the tests/ tree, no regressions. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	9f22b0aae8	feat(baseline): BaselineOrchestrator + BaselinePerformance aggregate (#1135 ) Stage 2 of First Run. Establishes each Property's Baseline Performance from persisted source data and writes it back — reads only from repos, never a Fetcher or HTTP (ADR-0003), so it is byte-identical whether Ingestion ran milliseconds ago or last week. Domain (`domain/baseline/`): - `Performance` VO — the four rated quantities: SAP / EPC Band / CO2 / Primary Energy Intensity. `lodged_performance(epc)` reads them off the EPC's recorded fields (PEUI = `energy_consumption_current`). - `BaselinePerformance` (ADR-0004) — the paired `lodged` + `effective` Performance + `rebaseline_reason`, plus the no-derivation part of the energy block (`space_heating_kwh` / `water_heating_kwh`, off the RHI, deterministic per ADR-0006). Both halves always populated. - `Rebaseliner` port + `StubRebaseliner`: the re-score-on-override seam (ADR-0011). SAP10 certs pass through (effective == lodged, reason "none"); a pre-SAP10 cert raises `RebaselineNotImplemented` rather than fabricating a plausible-but-wrong "none" — ML rebaselining is not wired yet. Mirrors the repo's strict-raise culture. Persistence: new `BaselineRepository` port + `BaselinePostgresRepository` + flat-column `baseline_performance` SQLModel (one row per Property). Per ADR-0004's amendment this is a standalone table, NOT columns on the retiring `property_details_epc`. Production migration is FE-owned (Drizzle) — docs/migrations/baseline-performance-table.md. Docs (grill-with-docs): corrected CONTEXT.md Lodged/Effective Performance to Primary Energy Intensity (the term collided with its own _Avoid_ entry under "heat demand") + fixed stale RHI field names; amended ADR-0004 Consequences for the standalone-table decision. Fuel split + bills (rest of EPC Energy Derivation) deferred to a follow-up — they need a Fuel Rates source (Ofgem-cap ETL) that does not exist yet. TDD, one test -> one impl: 7 tests (lodged read, rebaseliner pass-through + raise, orchestrator establish-and-persist + pre-SAP10 raise, Postgres round-trip + absent). pyright strict clean; AAA layout. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	a910ce9855	feat(ara): AraFirstRunTriggerBody + ara_first_run lambda skeleton (#1130 ) Stage-2 entry point for the First Run use case. Adds the `ara_first_run` Lambda package mirroring the `postcode_splitter` template, its typed trigger contract, and a stub `FirstRunPipeline`. - `AraFirstRunTriggerBody`: thin command of five fields — `task_id`, `sub_task_id` (UUID, lifecycle), `portfolio_id`, `property_ids`, `scenario_ids` (int business IDs). No `model_config` override, so Pydantic's default `extra="ignore"` lets the FastAPI backend add fields without breaking deployed lambdas. UPRNs / Scenario defs are deliberately off the event — read from source-of-truth tables. - Thin `handler.py`: validate-and-delegate only, via a named `dispatch_first_run` seam (testable without the Lambda runtime). Subtask status (in-progress/complete/failed) + CloudWatch log URL come for free from the existing `@subtask_handler()` decorator. - `FirstRunPipeline` (orchestration/) stub: `run(command)` receives the validated command. Declares a structural `FirstRunCommand` Protocol (the three business fields) that `AraFirstRunTriggerBody` satisfies, so orchestration needs no application-layer import — rhymes with the `EpcFetcher`/`SolarFetcher` Protocols on IngestionOrchestrator (ADR-0011). Full Ingestion→Baseline→Modelling composition lands in #1136. - Dockerfile / requirements.txt / local_handler/ mirror postcode_splitter. TDD: 7 new tests (trigger-body validation incl. forward-compat + id-types, pipeline seam, handler delegation). pyright strict clean. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	454456bf22	feat(ingestion): IngestionOrchestrator end-to-end (#1134 ) Stage 1 of the pipeline: per property, read its UPRN from the property row, fetch its EPC, resolve coordinates from the Geospatial reference repo, thread those into the Solar fetcher, and persist EPC + solar via repos. Fetchers never call each other — the orchestrator threads the coordinate (ADR-0011). Coordinates are reference data (deterministic from UPRN), resolved transiently to drive the solar fetch rather than persisted per-property. Depends on thin EpcFetcher/SolarFetcher Protocols (EpcClientService and GoogleSolarApiClient satisfy them structurally). Unit-tested against fakes — no DB, gov API, or network: persists EPC, threads coords into solar, skips UPRN-less properties and skips solar when coordinates are absent. pyright clean. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	285e7f8824	feat(geospatial): GeospatialRepo — OS Open-UPRN coordinate lookup (#1131 ) Add Coordinates value object + GeospatialRepository port + GeospatialS3Repository adapter. Resolves a Property's lon/lat from the partitioned Ordnance Survey Open-UPRN parquet (filename_meta -> partition -> UPRN row). A Repo, not a Fetcher (ADR-0011): no live OS API call. The parquet reader is injected, so it's unit-tested against fixture parquets with no S3/network; returns None when the UPRN is uncovered or absent. pyright strict clean. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	5a3be9d672	feat(ingestion): relocate EpcClientService to infrastructure + SolarRepo (#1133 ) Move the EpcClientService package (client + _retry + exceptions + tests) from the dying backend/ tree to infrastructure/epc_client/ as the New-EPC-API Fetcher; update the two callers (address2UPRN, a script). All 14 client tests pass. Add SolarRepository port + SolarPostgresRepository persisting Google Solar building insights as JSONB (solar_building_insights table), one row per Property. The EPC repo half of this slice already landed in #1129. pyright strict clean. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	460970af39	feat(property): Property aggregate + PropertyRepository (#1132 ) Add the Ara modelling aggregate root (ADR-0002): domain/property/ with PropertyIdentity, SiteNotes, Property, Properties. Property.source_path implements the two disjoint source paths + Recency Tie-Break (ADR-0001; survey wins on an equal date); effective_epc resolves to the surveyed data (Site Notes path) or the public EPC (epc_with_overlay path — Landlord Overrides overlay is a later slice). Pure dataclasses, no infrastructure imports. PropertyRepository port + PropertyPostgresRepository hydrate the aggregate whole from a defensive view of the FE-owned 'property' table (identity columns) plus the EPC slice via EpcRepository.get_for_property. Reads only from repos (ADR-0003). 8 domain + 1 hydration test; pyright strict clean. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	3e1d3acfbf	feat(epc): persist renewable_heat_incentive — full round-trip equality (#1137 ) Add epc_renewable_heat_incentive table (space_heating_kwh, water_heating_kwh + the three insulation-impact kWh fields), wired into EpcPostgresRepository save/get. This is the P0 gap: RenewableHeatIncentive carries the baseline space-heating/hot-water kWh that EPC Energy Derivation consumes. The round-trip test now asserts full deep-equality (dropped the renewable_heat_incentive exclusion) and passes for RdSAP 21.0.0 + 21.0.1. DB migration for the new table documented in docs/migrations/epc-property-round-trip-fidelity.md. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	559616d3bb	feat(epc): EPC persistence round-trip fidelity + JSONB code columns (Slice 1 #1129 ) Relocate EpcPropertyModel + child tables from the dying backend/ tree to infrastructure/postgres/epc_property_table.py (re-export shim keeps documents_parser working). Add EpcRepository port + EpcPostgresRepository with a full reverse mapper (epc_property tables -> EpcPropertyData). Round-trip test surfaced two fidelity gaps: 1. Union[int,str] SAP code fields were str()-coerced on save, losing the int (API) vs str (Site Notes) distinction. Now stored as JSONB (type-preserving). 2. The schema was a partial projection. Closed the cheap gaps on the model (heating shower/bath counts, roof_construction_type, curtain_wall_age, addendum, mechanical_vent_duct_insulation_level, SAP 10.2 §2 ventilation fields + a ventilation_present flag). Structural gaps tracked as follow-ups; renewable_heat_incentive (P0, #1137) excluded from the assertion until landed. Round-trip passes for RdSAP-Schema-21.0.0 and 21.0.1; pyright strict clean. Migration inventory for the DB: docs/migrations/epc-property-round-trip-fidelity.md Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Daniel Roth	b09ef8248e	GoogleSolarApi translates BuildingInsightsNotFoundError to sentinel dict 🟩	2026-06-01 16:28:47 +00:00
Daniel Roth	3f43dacfb9	GoogleSolarApi delegates get_building_insights to GoogleSolarApiClient 🟥	2026-06-01 16:28:47 +00:00
Daniel Roth	074cbf2f5a	GoogleSolarApiClient propagates exception after retry exhaustion 🟩	2026-06-01 16:28:47 +00:00
Daniel Roth	89e9c962cb	GoogleSolarApiClient raises BuildingInsightsNotFoundError on 404 entity-not-found 🟥	2026-06-01 16:28:47 +00:00
Daniel Roth	7bc00fdac8	GoogleSolarApiClient retries on transient HTTP errors 🟥	2026-06-01 16:28:47 +00:00
Daniel Roth	fe463f7eea	GoogleSolarApiClient fetches building insights from the Solar API 🟥	2026-06-01 16:28:47 +00:00
Jun-te Kim	5470fa1d93	move landlord overrides	2026-06-01 15:46:46 +00:00
Jun-te Kim	8a9d14a45c	landlord overrides moved into one repo	2026-06-01 15:16:23 +00:00
Khalim Conn-Kowlessar	305bffd284	refactor(ara): rename FirstRunPipeline → AraFirstRunPipeline (PR #1139 review) Aligns the composition with its entry point (the `ara_first_run` lambda + `AraFirstRunTriggerBody`): clearer what the file does. - orchestration/first_run_pipeline.py → ara_first_run_pipeline.py - FirstRunPipeline → AraFirstRunPipeline; FirstRunCommand → AraFirstRunCommand - test files renamed to match Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 15:00:33 +00:00
Khalim Conn-Kowlessar	c3691d9af2	refactor(property-baseline): rename baseline → property_baseline aggregate (PR #1139 review) Wholesale rename of the Baseline aggregate to PropertyBaseline for clarity / to disambiguate from baselines that appear elsewhere in Modelling. Scoped to this aggregate only — the distinct Rebaselining term (rebaseline_reason, StubRebaseliner, RebaselineNotImplemented) is deliberately untouched. - domain/baseline → domain/property_baseline; BaselinePerformance → PropertyBaselinePerformance. - repositories/baseline → repositories/property_baseline; BaselineRepository / BaselinePostgresRepository → PropertyBaseline*. - orchestration/baseline_orchestrator.py → property_baseline_orchestrator.py; BaselineOrchestrator → PropertyBaselineOrchestrator. BaselineStage → PropertyBaselineStage. - infrastructure/postgres: baseline_performance_table.py → property_baseline_performance_table.py; table `baseline_performance` → `property_baseline_performance`; Model renamed. - UnitOfWork attribute `.baseline` → `.property_baseline`. - Docs: ADR-0004 references + migration doc (renamed to property-baseline-performance-table.md) updated. CONTEXT.md glossary term ("Baseline Performance") left as-is pending a ubiquitous-language call (raised on the PR). 123 tests pass; pyright strict clean (only the unrelated pre-existing moto import errors remain). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 14:54:59 +00:00
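
The UnitOfWork commit above (5524385984) describes a context manager that commits once on `commit()` and rolls back on an exception or on exit without a commit, and fa5224b6ed describes saves that replace by `property_id`. A minimal sketch of those semantics follows, assuming an in-memory dict in place of Postgres. `InMemoryUnitOfWork` and its `save` method are hypothetical names for illustration only, not the repository's `UnitOfWork` port or `PostgresUnitOfWork` adapter.

```python
from __future__ import annotations

from types import TracebackType


class InMemoryUnitOfWork:
    """Hypothetical stand-in: stages writes, persists them only on commit()."""

    def __init__(self, store: dict[int, str]) -> None:
        self._store = store  # plays the role of the durable database
        self._pending: dict[int, str] = {}

    def __enter__(self) -> InMemoryUnitOfWork:
        self._pending = {}
        return self

    def save(self, property_id: int, value: str) -> None:
        # Staged only. Keyed by property_id, so a re-save replaces
        # the earlier value instead of adding a duplicate.
        self._pending[property_id] = value

    def commit(self) -> None:
        # Single point at which staged work becomes durable.
        self._store.update(self._pending)
        self._pending = {}

    def rollback(self) -> None:
        self._pending = {}

    def __exit__(
        self,
        exc_type: type[BaseException] | None,
        exc: BaseException | None,
        tb: TracebackType | None,
    ) -> None:
        # Uncommitted work is discarded on any exit; returning None
        # lets an in-flight exception propagate (fail noisily).
        self.rollback()


store: dict[int, str] = {}

# Commit makes the batch durable; the second save for id 1 replaces the first.
with InMemoryUnitOfWork(store) as uow:
    uow.save(1, "first")
    uow.save(1, "second")
    uow.commit()

# Exiting without commit persists nothing.
with InMemoryUnitOfWork(store) as uow:
    uow.save(2, "never committed")
```

The three cases exercised here (commit is durable, exit without commit persists nothing, and an exception would roll back) mirror the tests the commit message lists; a real adapter would delegate `commit()` and `rollback()` to a database session rather than a dict.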

102 commits