Model

mirror of https://github.com/Hestia-Homes/Model.git synced 2026-06-08 11:17:27 +00:00

Author	SHA1	Message	Date
Khalim Conn-Kowlessar	58a9547210	Slice S0380.168: Bio-liquid mapper extensions + Table 32 FAME price flip Mapper extensions (`_ELMHURST_MAIN_HEATING_EES_TO_FUEL_CODE`): "BFD": 71, # HVO — corpus variant oil 2 (SAP 127) "BXE": 73, # FAME — corpus variant oil 3 (SAP 128) "BXF": 73, # FAME alt — corpus variant oil 4 (SAP 129) "BZC": 76, # Bioethanol — corpus variant oil 5 (SAP 126) "B3C": 75, # B30K — corpus variant oil 6 (SAP 126) `_ELMHURST_MAIN_FUEL_TO_SAP10` water-side labels: "Bio-liquid HVO from used cooking oil": 71, "Bio-liquid FAME from animal/vegetable oils": 73, "Bioethanol": 76, "B30K": 75, Values are direct Table 32 codes (the bio-liquid codes 71/73/75/76 don't collide with any API enum value so they pass through `unit_price_p_per_kwh` etc. unchanged). Spec: SAP 10.2 Table 12 (PDF p.189) notes (d)/(e)/(f). Pre-slice all 5 oil 2-6 variants raised `MissingMainFuelType` per S0380.132. Post-mapper-extension cascade results: oil 2 (HVO): SAP / cost / CO2 / PE all EXACT first try ✓ oil 5 (Bioethanol): SAP / cost / CO2 / PE all EXACT first try ✓ oil 3 (FAME): SAP +17.34, cost −£398 oil 4 (FAME alt): SAP +16.06, cost −£367 oil 6 (B30K): SAP +3.05, cost −£70 Slice S0380.131 had left a deferred TODO in `table_32.py` for FAME code 73 ("worksheet 7.64 vs spec 5.44 — flipping has no measurable cascade effect today, deferred until a cert that exercises it surfaces"). Now exercised — flipping `73: 5.44 → 7.64` closes 85 % of the oil 3/4 cost gap: oil 3 (FAME): SAP +17.34 → +2.59, cost −£398 → −£62 oil 4 (FAME alt): SAP +16.06 → +2.56, cost −£367 → −£57 The Elmhurst-engine canonical 7.64 ↔ spec PDF 5.44 divergence is the same pattern S0380.131 applied to heating oil (code 4: 7.64 → 5.44) per [[feedback-software-no-special-handling]]. Remaining residuals on oil 3 / oil 4 / oil 6 are cascade-side (HW kWh under by ~250-900, SH demand small diff, CO2/PE blend artifacts) — pinned at observed values as forcing functions for follow-up slices. Open fronts: - HW kWh discrepancy on FAME (cascade applies different efficiency path than Elmhurst for SAP codes 128/129) - B30K (oil 6) Δcost −£70 with prices matching: SH/HW kWh gap Closures `oil 2` / `oil 5`: ±0.0000 on all 4 metrics. Moves all 5 oil variants out of `_BLOCKED_BY_MISSING_MAIN_FUEL_TYPE` into `_EXPECTATIONS`. Blocked tier now: 6 variants (community heating × 5, no system). Cascade-OK tier: 32 variants (up from 30), 30 EXACT + 3 (oil 3/4/6) pinned with non-zero residuals + 1 (pcdb 1 SH residual closed in S0380.165). Tests: - test_elmhurst_main_heating_ees_maps_bio_liquid_codes_to_table_32_fuel_codes - test_elmhurst_main_fuel_to_sap10_maps_bio_liquid_water_heating_labels - corpus pins: oil 2/3/4/5/6 expected residuals Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-02 10:14:10 +00:00
Khalim Conn-Kowlessar	1b1f45b679	Slice S0380.148: Table 4f — liquid fuel boiler flue fan and fuel pump (100 kWh/yr) SAP 10.2 Table 4f (PDF p.174) "Electricity for fans, pumps and other auxiliary uses" row: Liquid fuel boiler — flue fan and fuel pump 100 kWh/yr c) d) Note c): "Applies to all liquid fuel boilers that provide main heating, but not if boiler provides hot water only. Where there are two main heating systems include two figures from this table." Pre-slice the cascade's `_table_4f_additive_components` only wired: - (230a) MEV / MVHR - (230e) Main 2 gas-boiler flue fan (45 kWh) - (230g) Solar HW pump The liquid-fuel sibling row was missing — oil 1 worksheet (230d) and oil pcdb 3 worksheet (230d) both lodge 100 kWh/yr "oil boiler pump" that the cascade was silently skipping. Implementation: - Add `_LIQUID_FUEL_CODES = frozenset({4, 71, 73, 75, 76})` and new `is_liquid_fuel_code(fuel_code)` helper in `domain/sap10_calculator/tables/table_32.py`. Mirror of `is_electric_fuel_code` — routes through `_to_table_32_code` normalisation so Elmhurst-derived Table 32 codes (e.g. code 23 = bulk wood pellets, solid) don't collide with API enum codes (where 23 = B30D community). - Extend `_table_4f_additive_components` to add 100 kWh for Main 1 when `is_liquid_fuel_code(main.main_fuel_type)` returns True (`isinstance(int)` guard for the `Union[int, str]` field). Mirror the same gate for Main 2 per Note c) "Where there are two main heating systems include two figures". - LPG is GAS (Table 4b/4f convention, Ecodesign classification) — `_LIQUID_FUEL_CODES` deliberately excludes 2/3/5/9 LPG codes. Cascade impact across heating-systems corpus: \| Variant \| SAP Δ \| Cost Δ \| PE Δ \| \|-----------\|-------------\|-------------\|-------------\| \| oil 1 \| +1.18→+0.60 \| -£27→-£14 \| -276→-124 \| \| oil pcdb 1\| +0.42→-0.15 \| -£10→+£3.4 \| -84→+67 \| \| oil pcdb 2\| +0.42→-0.15 \| -£10→+£3.4 \| -84→+67 \| \| oil pcdb 3\| +1.16→+0.59 \| -£27→-£14 \| -271→-120 \| \| pcdb 1 \| +0.57→-0.03 \| -£13→+£0.6 \| -109→+42 \| Cohort closures: pcdb 1 EXACT (-0.03), oil pcdb 1/2 closed to -0.15. Golden fixtures impact: - cert 0240 (dual-main oil combi 130): SAP integer 73→72 (resid +0→-1), PE +1.02→+2.52, CO2 +0.11→+0.14. Dual-main certs add 2 × 100 = 200 kWh aux per Note c). Cert's published SAP 73 suggests the dual-main Q_space split (main_heating_fraction) may also need wiring — slice candidate. - cert 0390 (Firebird PCDF 9005 oil combi): PE -28.50→-28.08 (CLOSER to zero), CO2 -2.75→-2.73 (CLOSER to zero), SAP +7 unchanged. Test: test_sap_table_4f_liquid_fuel_boiler_flue_fan_and_fuel_pump_adds_ 100_kwh — asserts oil pcdb 3 inputs.pumps_fans_kwh_per_yr ≥ 230 (130 base + 100 liquid fuel boiler aux). Extended handover suite: 891 pass, 0 fail. Pyright net-zero (44=44). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-01 08:53:23 +00:00
Khalim Conn-Kowlessar	7dceeff24b	Slice S0380.147: Appendix D Eq D1 — Table 4b non-PCDB boilers (winter/summer monthly cascade) SAP 10.2 Appendix D §D2.1 (2) Equation (D1) (PDF p.57): If the boiler provides both space and water heating, and the summer seasonal efficiency is lower than the winter seasonal efficiency, the efficiency is a combination of winter and summer seasonal efficiencies according to the relative proportion of heat needed from the boiler for space and water heating in the month concerned: Q_space + Q_water η_water,m = ─────────────────────────────── Q_space/η_winter + Q_water/η_summer where Q_space (kWh/month) is the quantity calculated at (98c)m multiplied by (204) or by (205); Q_water (kWh/month) is the quantity calculated at (64)m; η_winter and η_summer are the winter and summer seasonal efficiencies (from Table 4b). Pre-slice the cascade only wired Eq D1 for PCDB-tested boilers (the `pcdb_record` branch in `_apply_water_efficiency`). For non-PCDB Table 4b boilers (`sap_main_heating_code` 101-141) where the cert lodges no `main_heating_index_number`, the cascade fell through to the scalar `water_efficiency_pct` divisor — which resolved via WHC 901 inherit to Table 4b WINTER eff (wrong direction; spec wants the monthly Eq D1 blend). This slice: - Adds `domain/sap10_calculator/tables/table_4b.py` with the full 41-row Table 4b (winter, summer) pair dict for codes 101-141 verbatim from SAP 10.2 PDF p.168 (Table 4b). - Refactors `_apply_water_efficiency` parameter from `pcdb_record: Optional[GasOilBoilerRecord]` to `eq_d1_winter_summer_pct: Optional[tuple[float, float]]` — decouples the Eq D1 input from the PCDB record so a Table 4b fallback can populate it without faking a PCDB record. - Resolves Eq D1 inputs at the call site with priority order: 1. PCDB Table 105 winter/summer (existing path) 2. SAP 10.2 Table 4b (PDF p.168) winter/summer when PCDB absent + WHC=901 (`_WHC_FROM_MAIN_HEATING`, the spec form of "boiler provides both space and water heating"). §9.4.11 -5pp interlock applies symmetrically to both columns of whichever (winter, summer) tuple is resolved. Oil 1 cert worksheet (217)m verified Jan 81.83 / Apr 81.42 / May 79.94 / Jun-Sep 72.00 / Dec 81.86 — exact back-solve to Eq D1 with Table 4b code 127 (winter 84, summer 72). Annual HW fuel (219) = Σ (64)m × 100 / (217)m = 3638.99 kWh/yr ≡ cascade post-slice. Cascade impact: Heating-systems corpus (worksheet-pinned, oil 1 only on pin grid): oil 1 SAP +1.76 → +1.18 (Δ -0.59) cost -£40.60 → -£27.12 (Δ +£13.48) CO2 -129.22 → -55.36 (Δ +73.86 kg/yr) PE -590.02 → -275.52 (Δ +314.50 kWh/yr) Remaining oil 1 residual is Table 4f auxiliary energy (cascade pumps_fans 130 kWh vs worksheet 265 kWh — missing the oil-boiler pump 100 kWh + CH pump 130 vs ws 165). Follow-up slice. Golden fixtures (cert-pinned, integer-rounded PE): cert 0240 (dual oil combi 130, no cylinder): PE +0.05 → +1.02 cert 6035 (gas combi 104, no cylinder): PE +46.10 → +47.29 Both shifts reflect spec-correct Eq D1 now firing for non-PCDB combi-no-cylinder configs. The pre-slice near-zero pin on cert 0240 was masking offsetting cascade gaps (likely Table 4f auxiliary energy and/or dual-main Q_space split per (98c)m × (204) which the cascade currently treats as full demand). Following [[reference-unmapped-sap-code]] discipline, the new Table 4b dict is the canonical spec-source — `domain.sap10_ml.sap_ efficiencies._SPACE_EFF_BY_CODE` still carries the winter column for the ML feature cascade and is left in place per the sap10_ml deprecation plan (separate migration). Test: test_sap_appendix_d_eq_d1_water_efficiency_monthly_for_non_pcdb_ table_4b_boiler_with_cylinder — asserts cert 1431 oil 1 HW fuel annual = 3638.99 ± 1.0 kWh/yr (matches worksheet (219)). Extended handover suite: 890 pass, 0 fail. Pyright net-zero (44=44). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-01 08:22:46 +00:00
Khalim Conn-Kowlessar	4d004790db	Slice S0380.136: route _is_electric_main / _is_electric_water via the canonical T32-first normaliser (dual-fuel closure) `_is_electric_main` and `_is_electric_water` hand-rolled a literal set check `code in {10, 25, 29}` ∪ `{30..40}` to classify a fuel code as electricity. The set conflated two enums: - {10, 25, 29} — API enum codes (epc_codes.csv row main_fuel): 10 = electricity (backwards compat) 25 = electricity (community) 29 = electricity (not community) - {30, 31, ..., 40} — Table 32 codes (RdSAP 10 spec p.95): 30 = standard tariff 31/32 = 7-hour low/high 33/34 = 10-hour low/high 35 = 24-hour heating 38/40 = 18-hour high/low API enum codes 1-29 collide with Table 32 codes 1-29 for unrelated fuels — API 10 = "electricity" vs Table 32 10 = "dual fuel (mineral + wood)". S0380.135's EES dispatch sets `main_fuel_type` to Table 32 codes (BDI → 10 for dual fuel), so a dual-fuel main was silently mis-classified as electric. The `_space_heating_fuel_cost_gbp_per_kwh` tariff branch then re-routed solid fuel 6's space heating cost through the 18-hour-low electric rate (5.50 p/kWh) instead of dual-fuel 3.99 p/kWh — solid fuel 6 SAP residual −7.38 → −11.37 in S0380.135. The fix promotes the existing `table_32._is_electric_code` to public `is_electric_fuel_code` and routes both `_is_electric_main` and `_is_electric_water` through it. The canonical helper normalises a fuel code via T32-first then API-translate fallback (same convention as `unit_price_p_per_kwh`), so a Table-32-code-10 dual-fuel main classifies as non-electric correctly. Subtle behaviour change: API enum code 25 ("electricity (community)") maps via API_FUEL_TO_TABLE_32 to Table 32 code 41 ("heat from electric heat pump (community)") which is a heat network billed at the heat- network rate (4.24 p/kWh single rate), not at the off-peak electric tariff. Pre-S0380.136 the literal-set check would have treated this as direct electric and applied the Table 12a high/low-rate split — that was wrong; community heat networks don't have an off-peak split. The new canonical helper correctly excludes code 41 from _ELECTRIC_FUEL_CODES. Heating-systems corpus impact: solid fuel 6 (Dual Fuel Anthracite Wood, SAP 160): ΔSAP −11.3731 → +1.9493 (now in cluster with other solid-fuel) Δcost +£268.44 → −£44.91 ΔPE unchanged (PE wasn't affected by the cost mis-routing) No other corpus variants moved — none have `main_fuel_type` in the ambiguous API/T32 collision range that was previously mis-classified. Extended handover suite: 879 pass / 0 fail (+2 from new AAA tests covering both `_is_electric_main` and `_is_electric_water` dual-fuel non-electric classification + API code 29 → electric / API code 25 → heat-network non-electric semantics). Pyright net-zero on touched files (43 → 43). No golden fixture impact — no golden cert lodges `main_fuel_type=10` (dual fuel) on the cascade path. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-31 13:37:14 +00:00
Khalim Conn-Kowlessar	14eee259b4	Slice S0380.131: flip Table 32 heating-oil price 7.64 → 5.44 (empirical) The published RdSAP 10 Specification 10-06-2025 PDF Table 32 (p.95) lists heating oil at 7.64 p/kWh. Two independent operational sources both use 5.44 p/kWh for the same fuel: - Elmhurst P960 worksheets across all five oil-fired variants in `sap worksheets/heating systems examples/` (oil 1, oil pcdb 1/2/3, pcdb 1) lodge 5.4400 p/kWh on (240) "Space heating - main system 1" and (247) "Water heating (other fuel)" for every "FuelType: Heating oil" worksheet. - The gov.uk EPC register's lodging software back-solves to ~5.48 p/kWh from cert 0240-0200-5706-2365-8010's lodged SAP 73 (oil + PV detached, age J). With heating-oil at 5.44 in the cascade this cert closes to ΔSAP = 0 exactly against its lodged value. The BRE technical papers (`docs/specs/sap10 technical papers/`) carry no Table 32 errata or fuel-price update, so the change is grounded in empirical cross-source evidence rather than a spec citation — the worksheet PDF is the source of truth per the project convention. Post-flip residuals: Heating-systems corpus (cascade − worksheet ΔSAP_c): oil 1 −9.7030 → +2.6578 oil pcdb 1 −11.6343 → +0.4239 ← within 1 SAP of closure oil pcdb 2 −11.6343 → +0.4239 oil pcdb 3 −10.8674 → +1.1597 pcdb 1 −9.4083 → +6.9521 ← largest remaining oil-cohort gap Golden fixtures (cascade − lodged SAP): 0240-0200-5706-2365-8010 resid −10 → +0 ← EXACT closure 0390-2954-3640-2196-4175 resid −6 → +7 ← oil-price bug was masking +13 SAP of opposite-direction cascade gaps; now exposed for follow-up PE / CO2 residuals are unaffected by the unit-price flip (cost-only change). The 41-variant corpus regression guard (S0380.129) holds; all other golden cohorts pass unchanged. Extended handover suite: 874 pass. Bio-FAME (code 73) shows the inverse divergence on oil 3/4 worksheets (worksheet 7.64 vs spec 5.44 — possible row-swap typo in the spec PDF) but flipping it has no measurable cascade effect today, so deferred until a cert that exercises it surfaces. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-31 09:21:29 +00:00
Khalim Conn-Kowlessar	d8cdee4e53	Slice S0380.125: map Elmhurst Summary "18 Hour" meter_type to EIGHTEEN_HOUR The Elmhurst Summary §14.2 Meters section lodges the electricity meter type as the bare RdSAP enum form "18 Hour", but `_METER_STR_TO_INT` only carried the legacy "off-peak 18 hour" alias. All 41 P960-format heating-system fixtures at `sap worksheets/heating systems examples/` lodge meter_type "18 Hour", so `cert_to_inputs` strict-raised on every one of them before this slice. Per RdSAP 10 Specification §17 page 85 (Electricity meter row 10-2): > "Electricity meter: Dual/single/10-hour/18-hour/24-hour/unknown" Per RdSAP 10 §12 page 62: > "if the meter is dual 18-hour/24-hour it is 18-hour/24-hour tariff" So the bare "18 Hour" lodging routes directly to enum 5 (Off-peak 18 hour) → `Tariff.EIGHTEEN_HOUR`, bypassing the §12 Rules 1-4 dispatch (which only fires for Dual meters that aren't 18-hour or 24-hour). After this slice the heating-system corpus probe (`/tmp/probe_.py` across 41 variants of the same property × different heating systems) shifts from "32 raises + 7 mapper gaps + 2 emitter gaps" to "32 cascade-OK + 7 community-heating + 2 underfloor-emitter + 1 cylinder-size 'No Access'". The 32 newly-OK variants surface a positive ΔSAP cluster (cascade SAP_c > worksheet SAP_c by +0.87..+30 across boiler types) — that residual layer is queued for the next slice. Extended handover suite at HEAD post-slice: 829 pass, 0 fail* (baseline 775 + test_table_12a.py's 54 incl. the new "18 Hour" entry). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-30 23:51:47 +00:00
Khalim Conn-Kowlessar	433f4a49ce	Slice S0380.99: PCDB Table 329 (MV In-Use Factors) ETL + parser + lookup (PCDF Spec §A.20) PCDF Spec Rev 6b §A.20 (May 2021) Format 430 — Mechanical Ventilation In-Use Factors Table. Pcdb10.dat carries Format 432 (header `$329,432,4,2021,11,25,2`), an extended-field version where Format 430 fields 1-4 (system_type + 3 SFP factors for the "no approved scheme" variant) align at positions 0..3. The remainder of Format 432 carries MVHR adjustments + "with approved scheme" variants + additional Format 432 columns, preserved verbatim in `raw` for follow-up slices. Per PCDF Spec §A.20 field 1 — system types: 1 = centralised MEV 2 = decentralised MEV 3 = balanced whole-house MV (with or without heat recovery) 5 = positive input ventilation (PIV) 10 = default data (used with SAP Table 4g defaults) Decentralised MEV (system_type=2) IUFs: SFP × ducting type: flexible: 1.45 (field 2) rigid: 1.30 (field 3) no-duct: 1.15 (field 4 — through-wall fans) Per spec Note: "If there is no applicable approved installation scheme the values for with and without scheme are the same." Cert 000565 lodges "Approved Installation: No" → use the "no scheme" IUFs. Validation for cert 000565 against worksheet line (230a): Σ(SFP_j × FR_j × IUF_j) for the 4 lodged fans: in-room kitchen: 1×0.15×13×1.45 = 2.8275 in-room other wet: 1×0.15× 8×1.45 = 1.7400 through-wall kitchen: 2×0.11×13×1.15 = 3.2890 through-wall other wet: 3×0.14× 8×1.15 = 3.8640 Σ = 11.7205 W (matches worksheet "total watage = 11.7205") Σ(FR_j) = 92.0 l/s (matches worksheet "total flow = 92.0000") SFPav = 11.7205 / 92.0 = 0.1274 W/(l/s) ✓ matches worksheet Foundation only this slice — typed parser + ETL + runtime lookup `mv_in_use_factors_record(system_type)`. No cascade integration; no behavioural change on any cert. Next slice S0380.100 wires the SFPav formula. 5 Table 329 records ingested. Pyright net-zero per touched file. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-30 15:20:02 +00:00
Khalim Conn-Kowlessar	b3330821e7	Slice S0380.98: PCDB Table 322 (Decentralised MEV) ETL + parser + lookup (PCDF Spec §A.19) PCDF Spec Rev 6b §A.19 (May 2021) Format 427 — Decentralised MEV Systems Table. Pcdb10.dat carries the per-fan-configuration block in Format 428 (header `$322,428,72,...`), which drops the spec's per- group "Fan speed setting" string. Each group is a 3-field triplet: (config_code, flow_l_per_s, sfp_w_per_l_per_s). Per the spec § field 14, the 6 fan configurations are: 1 = In-room fan, kitchen 2 = In-room fan, other wet room 3 = In-duct fan, kitchen 4 = In-duct fan, other wet room 5 = Through-wall fan, kitchen 6 = Through-wall fan, other wet room Some configurations may be blank per spec Note 1 — these are not valid SAP selections and are excluded from the SFPav summation downstream. This slice lands the foundation only — typed parser, ETL promotion to typed write, and a runtime lookup `decentralised_mev_record(pcdb_ id)`. No cascade integration yet → no behavioural change on any cert; full test suite + cert 000565 expected fails unchanged. Subsequent slices in the arc: - S0380.99: PCDB Table 329 (In-Use Factors) ETL + lookup - S0380.100: SAP 10.2 §2.6.4 SFPav cascade helper - S0380.101: HP SAP code 211-227 / 521-527 → main_heating_category=4 - S0380.102: wire MEV cascade into pumps_fans Cert 000565 lodges `MV PCDF Reference Number = 500755` (Titon Ultimate dMEV), resolving via this lookup to: config 1 (in-room kitchen): flow=13.0, SFP=0.15 W/(l/s) config 2 (in-room other wet): flow=8.0, SFP=0.15 config 3 (in-duct kitchen): not tested config 4 (in-duct other wet): not tested config 5 (thru-wall kitchen): flow=13.0, SFP=0.11 config 6 (thru-wall other wet): flow=8.0, SFP=0.14 48 Table 322 records ingested. Pyright net-zero per touched file. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-30 15:12:55 +00:00
Khalim Conn-Kowlessar	9bfb852483	Slice S0380.90: 6 strict-raise dispatches + UnmappedSapCode promoted to shared module Bundled slice closing the next 6 silent-fallback dispatch sites flagged by the post-S0380.89 audit per [[reference-unmapped-sap-code]]: 1. PV pitch (RdSAP 10 §11.1 — codes 1..5 → 0/30/45/60/90°) 2. PV overshading (SAP 10.2 Table M1 — codes 1..4 → 1.0/0.8/0.5/0.35) 3. Meter type (RdSAP cert enum 1..5 → Tariff enum) 4. Tariff → (high, low) rate (RdSAP 10 Table 32 — 4 of 5 Tariffs) 5. Heat-network DLF by age band (SAP 10.2 Table 12c — A..M) 6. Secondary heating fraction by main_heating_category (SAP Table 11) Each dispatch follows the established strict / total split: - Absent lodging (None / 0 / "") → cascade's modal-default value - Lodging present but unmapped → `UnmappedSapCode(field, value)` `UnmappedSapCode` promoted from `cert_to_inputs.py` to new module `domain/sap10_calculator/exceptions.py` so `tables/table_12a.py` can raise it too (the meter-type dispatch lives there). `cert_to_inputs` re-exports it for backward compat with existing test imports. Corpus audit at HEAD `6d02d205` (full JSON sweep): PV pitch codes: {2, 3} — covered PV overshading codes: {1, 2} — covered meter_type codes: {1, 2, 3} — covered (incl. digit-string '2') main_heating_category: {2, 4, 6, 7, 10} — covered All corpus codes already in dispatch dicts — no production regression expected. One silent runtime fix surfaced by the strict-raise rollout: the GOV.UK API lodges `meter_type` as a digit-string (e.g. `'2'`) on many certs, but the original `_METER_STR_TO_INT` dict only had word aliases ("single", "dual", "unknown"). Pre-S0380.90 the digit-string fell through to the silent `return Tariff.STANDARD` default. Adding a `key.isdigit() → int(key)` short-circuit routes these through the int enum correctly. Confirmed 125 golden cert fixtures previously running on this silent default — all now passing with explicit STANDARD via the int dispatch path (not via the silent fallback). Tests (6 new, AAA-structure): - `test_pv_pitch_deg_full_table_coverage_per_rdsap_10_section_11_1` - `test_pv_overshading_factor_full_table_m1_coverage` - `test_meter_type_dispatch_full_table_12a_coverage` (incl. digit-string) - `test_tariff_high_low_rates_full_dispatch_coverage` - `test_heat_network_dlf_full_table_12c_age_band_coverage` - `test_secondary_heating_fraction_for_category_full_table_11_coverage` Each test pins: spec-correct codes → expected dispatch result; absent lodging → modal default; lodging present but unmapped → `UnmappedSapCode` with field + value attached. Test baseline: 574 pass (was 568 + 6 new) + 9 expected `test_sap_result_pin[000565-*]` fails unchanged. Cohort + golden + cert 9501 unaffected. Pyright net-zero per touched file. Open silent-fallback inventory now empty per [[reference-unmapped-sap-code]] — the cascade dispatch boundary is now fully strict-raise-gated for code translations. Cascade VALUE defaults (u_wall, u_floor, etc.) remain total per RdSAP §6.2.3. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-30 09:46:55 +00:00
Khalim Conn-Kowlessar	488492a927	Slice S0380.60: RdSAP 10 §12 page 62 — Dual-meter tariff dispatch (Rules 1-4) Cert 000565 surfaced the spec gap. Worksheet shows "Electricity Tariff: 10 Hour Off Peak" while the Summary PDF only lodges "Electricity meter type: Dual" — no separate tariff-hour field is exported. Elmhurst SAP picks 10-hour because RdSAP 10 §12 page 62 contains a published inference algorithm: > If the meter is dual 18-hour/24-hour it is 18-hour/24-hour tariff. > Otherwise the choice between 7-hour and 10-hour is determined as > follows. > 1. If the main heating system (or main system if there are two) > is an electric CPSU (192) it is 10-hour tariff. > 2. Otherwise, if … electric storage heaters (401 to 409), or > electric dry core or water storage boiler (193 or 195), or > electric underfloor heating (421 or 422) — it is 7-hour tariff. > 3. If that has not resolved it then if … direct-acting electric > boiler (191), or heat pump (211 to 224, 521 to 524, or > database), or electric room heaters — it is 10-hour tariff. > 4. If none of the above applies it is 7-hour tariff. Cert 000565 Main 1 SAP code 224 (ASHP) + Dual meter → Rule 3 → 10-hour. Matches the worksheet exactly. New `rdsap_tariff_for_cert(meter_type, main_1_sap_code=..., main_2_sap_code=..., main_1_is_heat_pump_database=..., main_2_is_heat_pump_database=...)` implements the dispatch. "or database" branch covers PCDB Table 362 heat-pump lodgements per the spec's "or database" wording. Callers compute the boolean via `heat_pump_record(main_heating_index_number) is not None`. The pre-existing `tariff_from_meter_type(meter_type)` keeps its contract for legacy call sites — returns SEVEN_HOUR as the Dual default (the §12 Rule 4 fallback). Docstring updated to point at the new helper for callers that need spec-correct dispatch. Code sets (verbatim §12 page 62): - `_RULE_1_CPSU_CODES` = {192} - `_RULE_2_STORAGE_CODES` = {401..409, 193, 195, 421, 422} (NOT 423/424/425) - `_RULE_3_TEN_HOUR_CODES` = {191, 211..224, 521..524} - electric room heater codes (Table 4a 6xx) deferred with TODO until a fixture surfaces them — Rule 4 fallback is correct in the interim (electric room heater certs would currently get 7-hour, biasing their cost residual; not on the active fixture front). This commit is the FOUNDATIONAL change — no cost helpers are wired to the new dispatch yet, so cohort/golden tests are unchanged (354 pass + 10 expected 000565 fails). The next slice wires `_space_heating_fuel_cost_gbp_per_kwh` / `_hot_water_fuel_cost_gbp_ per_kwh` / `_other_fuel_cost_gbp_per_kwh` to use the new dispatch + Table 12a high-rate fractions for off-peak certs. Spec source: `domain/sap10_calculator/docs/specs/RdSAP 10 Specification 10-06-2025.pdf` §12 page 62. Verified verbatim per [[feedback-verify-handover-claims]] before implementing. Pyright net-zero (0 / 0). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 23:42:45 +00:00
Khalim Conn-Kowlessar	081bb8fd7e	Slice S0380.28: SAP 10.2 Appendix N footnote 43 reciprocal η interpolation — closes the +0.03..+0.06 ASHP precision-floor cluster Per SAP 10.2 Appendix N, PDF p.101 footnote 43 (line 7053): "For the efficiency values, the interpolated efficiency is the reciprocal of linear interpolation between the reciprocals of the efficiencies." i.e. 1/η_interp = (1 − t)·1/η_low + t·1/η_high, the weighted harmonic mean at t = (PSR − PSR_low) / (PSR_high − PSR_low). Cascade was using linear interpolation directly on η — a +0.15..+0.25% over-estimate in the typical PSR range (1.2..1.5) for ASHPs in the cohort. Cohort fixture: cert 3336-2825-9400-0512-8292 (Mitsubishi PUZ-WM50VHA, PCDB 104568). MIT/η-zone cascade matches worksheet EXACTLY (every line 86..92, every month), but η_main_heating cascade 225.443 vs worksheet 224.923 → main_heating_fuel +5.24 kWh/yr too high → ECF 1.5474 vs ws 1.5503 → SAP +0.04 vs worksheet 78.3739. Back-solving the worksheet's η_main implies η_space_1 = 224.923 / 0.95 ≈ 236.76. Closed form at PSR=1.40151, bracketing PCDB rows PSR 1.2 (η_space_1=253.9) and PSR 1.5 (η_space_1=229.2): Linear (pre-slice): 253.9 + (229.2 − 253.9) × 0.6717 = 237.31 ✗ Reciprocal (footnote 43): 1 / ((1 − 0.6717)/253.9 + 0.6717/229.2) = 1 / 0.004224 = 236.74 ✓ The harmonic mean is curvature-aware: linear interpolation under- penalises efficiency drops at higher PSR (η typically falls off as PSR increases past the system's design point) by averaging on η rather than 1/η. SAP 10.2 footnote 43 is explicit about which side of the reciprocal the interpolation sits. Outcome: Cohort-2 Summary path (38 certs): exact (<1e-4): 23 → 33 (+10) ≤±0.07: 15 → 5 (-10: HP certs close to exact) ±0.07..0.5: 0 → 0 ±0.5..1: 0 → 0 ±1+: 0 → 0 RAISES: 0 → 0 Cohort-2 HP cluster post-slice: 0100 +0.00003 ← was +0.00283 0320 -0.00001 ← was +0.01801 0330 -0.00004 ← was +0.01772 2336 +0.00003 ← was +0.01778 3336 +0.00001 ← was +0.04005 (worst residual closes exact) 4536 -0.00002 ← was +0.01312 9036 -0.00003 ← was +0.02159 9796 +0.00000 ← was +0.00174 (post-S0380.27) 2536 +0.00072 ← was +0.00163 2800 +0.00068 ← was +0.00436 4800 +0.00068 ← was +0.02939 9370 +0.00002 ← was +0.00174 9421 +0.00001 ← was +0.00117 Cohort-1 ASHP cohort (7-cert cohort + new chain test certs): cert 0380: +1e-6 ← was +0.034 (Mitsubishi PUZ-WM50VHA, the canonical first-HP cohort cert) cert 3800: -2e-5 ← was +0.021 cert 9418: -3e-7 ← was +0.00004 cert 9285: -3e-5 ← was +0.021 cert 2636: -0.015 ← was +0.003 (cantilever fixture; remaining residual is non-η in nature) 5 of 7 cohort-1 ASHP certs now hit delta < 1e-4 vs worksheet — the +0.04 spec-precision-floor cluster diagnosed in HANDOVER_CERT_0380_MIT_CASCADE.md is the linear-vs-reciprocal η interpolation bug, not a spec-floor at all. The handover doc's "no public spec or BRE data field would distinguish these" claim was incorrect — SAP 10.2 footnote 43 is the resolution. API path (golden fixtures): 6 ASHP cohort residuals updated to reflect the cascade closure: cert 0380 PE: -14.7865 → -14.6848 kWh/m²; CO2: +0.2774 → +0.2780 t/yr cert 0350 PE: -7.9281 → -7.8741; CO2: +0.1697 → +0.1701 cert 2225 PE: -11.9175 → -11.8557; CO2: +0.2617 → +0.2621 cert 2636 PE: -9.7153 → -9.6692; CO2: +0.2189 → +0.2193 cert 3800 PE: -9.7551 → -9.6838; CO2: +0.2598 → +0.2603 cert 9285 PE: -8.1110 → -8.0466; CO2: +0.1559 → +0.1564 All SAP integer residuals unchanged (cascade tracks the EPC integer SAP at residual 0 across the cohort). PSR interpolation unit test (`test_interpolate_heat_pump_efficiency_at _cert_0380_psr_per_sap_app_n`) updated to reflect the reciprocal formula with the SAP-10.2-footnote-43 spec citation and closed-form asserts (η_space_1 ≈ 234.5235; η_water_3 ≈ 285.0861 at PSR=1.43). Pyright net-zero (1 → 1 across touched files: pcdb/parser.py, tests/test_pcdb_table_362_lookup.py, rdsap/tests/test_golden_fixtures.py). Tests: 710 pass (was 710 pre-slice with linear interp + un-updated pins; net-zero because the 6 golden pin updates + 1 interp test update exactly offset the 6 + 1 failures the formula change introduced), 10 expected fails unchanged. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 11:44:11 +00:00
Khalim Conn-Kowlessar	4879e8c3d7	Slice S0380.20: extract PCDB keep-hot fields + strict-raise for no-keep-hot combis Surfaces the SAP 10.2 Appendix J Table 3a sub-row dispatch gap that masked +0.2..+0.4 SAP residuals on 11 cohort-2 PCDB-listed combi certs. Identified via cert 7800-1501-0922-7127-3563 (Potterton Promax Combi 28 HE Plus A, PCDF 15709): cascade used the keep-hot 600 kWh/yr default; worksheet (61) sums to ~428 kWh/yr via the no-keep-hot sub-row formula. Root cause: the PCDB Table 105 record carries keep-hot metadata at field positions 58 (`keep_hot_facility`) and 59 (`keep_hot_timer`) per the SAP 10 PCDB spec (private feed for SAP software vendors — not surfaced on the public PCDB website nor the Open EPC API). The parser preserved these in `raw=fields` but didn't surface them as typed attributes, so the cascade had no signal to dispatch the right Table 3a sub-row. Two-part change: 1. `domain/sap10_calculator/tables/pcdb/parser.py` — adds typed `keep_hot_facility` and `keep_hot_timer` fields to `GasOilBoilerRecord`, parsed from fields[57] and fields[58]. Field enums (per BRE STP09-B04 + SAP 10 PCDB spec): Field 58: 0=no keep-hot, 1=fuel keep-hot, 2=electric keep-hot, 3=gas+electric keep-hot Field 59: 0=no timer, 1=overnight time-switch Verified against cohort-1 fixture 000490 (Vaillant Ecotec Pro 28, PCDF 10328) — record lodges keep_hot_facility=1, keep_hot_timer=1, exactly matching the hand-built fixture comment "Combi keep hot type = Gas/Oil, time clock" at `_elmhurst_worksheet_000490.py: 277-280`. 2. `domain/sap10_calculator/rdsap/cert_to_inputs.py` — adds `UnresolvedPcdbCombiLoss` exception. `pcdb_combi_loss_override` now raises (instead of silently returning None) when the PCDB record has `separate_dhw_tests=0/None` AND `keep_hot_facility=0/None`. The cascade's only implemented Table 3a row is "with keep-hot, time clock" (600 kWh/yr), which is the wrong spec row for no-keep-hot combis — silently using it masked the cohort-2 negative band. The ETL was re-run to refresh `pcdb_table_105_gas_oil_boilers.jsonl` with the new typed fields (raw fields unchanged, just additional columns surfacing what was previously buried). Cohort distribution after slice: cohort-1 cert 000490 (Vaillant PCDF 10328, kh=1): NO RAISE — cascade keep-hot 600 default IS the spec-correct row. Tests still GREEN. cohort-2: 10 exact + 13 sub-±0.07 + 2 ±0.07..0.5 + 1 ±0.5..1 + 1 ±5+ + 11 RAISES. The 11 raising certs are now blocked until the Table 3a no-keep-hot sub-row is implemented (BRE STP09-B04 methodology — pending slice). Previously these certs silently produced +0.2..+0.4 SAP errors AND ranged into the big-gap band; raising surfaces the gap rather than shipping wrong numbers. Two golden cert tests blocked alongside (Firebird oil PCDF 9005 also hits this path): - test_golden_cert_residual_matches_pin[0390-2954-3640-2196-4175] - test_api_to_domain_mapper_preserves_main_heating_index_number[0390-2954-3640-2196-4175] Re-enable when the Table 3a no-keep-hot row lands. Two other tests updated: - test_main_heating_index_number_in_pcdb_overrides_seasonal_efficiency: switched from Baxi 98 (sdt=0, kh=None, would raise) to Worcester PCDF 10241 (sdt=1, routes via Table 3b row 1). Asserts 0.885 not 0.66. - test_pcdb_combi_loss_override_returns_none_or_raises_for_untested _or_storage_combis: renamed + extended to pin the new strict-raise behaviour. Pyright net-zero per file: - domain/sap10_calculator/rdsap/cert_to_inputs.py: 35 (baseline 35) - domain/sap10_calculator/tables/pcdb/parser.py: 0 - domain/sap10_calculator/tables/pcdb/__init__.py: 0 - domain/sap10_calculator/rdsap/tests/test_cert_to_inputs.py: 13 (baseline 13) - domain/sap10_calculator/rdsap/tests/test_golden_fixtures.py: 1 (was 2 — improved) Regression baseline: 697 pass + 10 fail (= prior 699 + 10 - 2 dropped golden parametrize entries for cert 0390-2954-3640-2196-4175). Spec refs: - SAP 10 PCDB spec (private SAP software vendor feed) — keep-hot facility / timer / electric-heater fields at positions 58 / 59 / 60. - BRE STP09-B04 (combi boiler test methodology) — origin of the keep-hot Table 3a derivation. URL: https://bregroup.com/documents/d /bre-group/stp09-b04_combi_boiler_tests - SAP 10.2 Appendix J Table 3a row-selection — to be implemented per PCDB keep-hot dispatch in a follow-up slice. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 08:10:27 +00:00
Khalim Conn-Kowlessar	7adb6c7907	Slice 102f-prep.1: PCDB Table 362 heating_duration_code field SAP 10.2 Appendix N3.5 (PDF p.105 line 6099) — heat-pump packages lodge a "Daily heating duration" field encoded as "24" / "16" / "9" / "V" (Variable). Footnote 48 (PDF p.105): "Daily heating durations of 24, 16 and 9 hours are retained for legacy purposes" — modern records always lodge "V". Format-465 position 48 holds the code; cohort ground truth: "V" on Mitsubishi PUZ-WM50VHA (104568) and Daikin EDLQ05CAV3 (102421). The field drives Appendix N3.5 + Table N4/N5 day allocation for the extended-heating MIT cascade (slice 102f-prep.2 onward). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 13:09:07 +00:00
Khalim Conn-Kowlessar	5b78a1e2c8	Slice 102c.2: PCDB Table 362 PSR groups + APM linear interpolation SAP 10.2 Appendix N3.6 / N3.7(a) (PDF p.108) compute heat-pump efficiencies from a PSR-dependent dataset in the PCDB record. Spec PDF p.100 line 5957 instructs: "The PSR-dependent results applicable to the dwelling are then obtained by linear interpolation between the two datasets whose PSRs enclose that of the dwelling." This slice decodes the format-465 PSR-group block (idx[58] count followed by N groups × 9 raw fields apiece) and adds the interpolation primitive. Field positions within each 9-field group reverse-engineered against Mitsubishi PUZ-WM50VHA (104568) by back-solving cert 0380's worksheet pin η_space=223.0480, η_water=171.0746: group offset 0 → PSR group offset 2 → η_space,1 (% gross) group offset 6 → η_water,3 (% gross — Appendix N3.7(a) + footnote 49, PSR-dependent and calculated via the annual performance method, used directly for HPs providing both space + water heating) Offsets 1 / 3 / 4 / 5 / 7 / 8 are unpopulated for record 104568 and not yet ground-truthed. They likely hold the secondary results documented under format 464 field 42-43 (specific electricity consumed, running hours) plus additional format-465 extensions. The clamping behaviour at the PSR ends is taken from SAP 10.2 PDF p.101 lines 6007-6008: "if the PSR is greater than the largest PSR in the database record then the heat pump space and water heating fractions for the largest PSR should be used, and if the PSR is less than the smallest PSR in the database record then the heat pump space and water heating fractions for the smallest PSR should be used". Verified against cohort: - Record 104568 (Mitsubishi PUZ-WM50VHA) → 14 PSR groups decoded; interpolation at PSR=1.43 yields η_space,1≈234.96 and η_water,3 ≈285.09, matching back-solved worksheet values (slice 102e applies the N3.6 ×0.95 and N3.7 ×0.60 in-use factors to close the chain).	2026-05-27 12:01:04 +00:00
Khalim Conn-Kowlessar	70aa709c1c	Slice 102c.1: typed PCDB Table 362 (heat pumps) header parser SAP 10.2 Appendix N (N3.6 / N3.7(a)) requires PSR-interpolated values from PCDB Table 362 for any heat-pump cert. The published PCDF Spec Rev 6b §A.23 documents format 464 for that table; the live pcdb10.dat (April 2026) ships format 465, which extends 464 with additional header fields between fields 11 and 12 and a larger PSR group set. The parser-layer test pins the format-465 offsets against the BRE web entry for Mitsubishi Ecodan 5.0 kW PUZ-WM50VHA (pcdb_id=104568, the cohort's dominant heat-pump model — 6 of 7 ASHP certs use it). This slice lands only the header fields the downstream APM cascade needs (PSR-group decoding + linear interpolation follow in slice 102c.2): field spec ref format-465 idx brand_name §A.23 field 7 6 model_name §A.23 field 8 7 model_qualifier §A.23 field 9 8 fuel §A.23 field 13 16 service_provision §A.23 field 17 22 hw_vessel_mode §A.23 field 18 23 vessel_volume_l §A.23 field 19 24 vessel_heat_loss_kwh_per_day §A.23 field 20 25 vessel_heat_exchanger_area_m2 §A.23 field 21 26 max_output_kw §A.23 field 30 47 `max_output_kw` is the PSR-denominator per SAP 10.2 PDF p.100 line 5946 ("maximum nominal output of the package … divided by the design heat loss of the dwelling"); BRE labels it "Output power @ -4.7°C" on the web entry. Cohort header parse verified end-to-end against BRE web ground truth for record 104568. Identical field positions apply to the Daikin EDLQ05CAV3 (102421, cert 9418), confirmed by spot-checking the populated raw indices.	2026-05-27 11:56:06 +00:00
Khalim Conn-Kowlessar	a7b08a4e8f	refactor: move docs/sap-spec/ contents into domain/sap10_calculator/ Locality of reference — SAP-specific docs, specs, and runtime data now live alongside the calculator that consumes them, mirroring the prior packages→domain layout moves. Move targets: - Narrative MDs → domain/sap10_calculator/docs/ NEXT_AGENT_PROMPT.md, HANDOVER_NEXT.md, SAP_CALCULATOR.md - Spec PDFs → domain/sap10_calculator/docs/specs/ RdSAP 10 Specification 10-06-2025.pdf PCDF_Spec_Rev-06b_12_May_2021.pdf sap-10-2-full-specification-2025-03-14.pdf sap-10-3-full-specification-2026-01-13.pdf - PCDB runtime data → domain/sap10_calculator/tables/pcdb/data/ pcdb10.dat (8.3MB) + 7× pcdb_table_*.jsonl (18MB total) Path code rewrites (load-bearing): - tables/pcdb/__init__.py: replaced parents[4]/'docs'/'sap-spec' with Path(__file__).resolve().parent/'data' for Table 105 JSONL loading. - tables/pcdb/postcode_weather.py: same rebase for the pcdb10.dat path read by _postcode_climate_table(). - tables/pcdb/etl.py __main__: same rebase for the manual ETL invocation (source + output_dir both now point inside the package). - tests/test_pcdb_etl.py: _PCDB_DAT_PATH now derives from parents[1]/'tables'/'pcdb'/'data' (was parents[3]/'docs'/'sap-spec'). Citation rewrites: - 12 .py docstrings and 4 .md docs (ADRs + READMEs + narrative docs) had `docs/sap-spec/<file>` strings rewritten to their new locations. - Two cases where the catch-all sed misfired (an ADR-0009 line about a PCDB extract; the pcdb __init__.py docstring about ETL output) were hand-corrected to point at tables/pcdb/data/ rather than docs/specs/. docs/sap-spec/ is now empty (will be removed in a follow-up sweep or left as a vestigial empty dir for future repurposing). ADRs 0009 and 0010 remain at docs/adr/ — they're part of the chronological cross-cutting decision log, not calculator-specific narrative. Verified: - Calculator's 1e-4 production gate (test_api_001479_full_chain_sap_matches_worksheet_pdf_exactly) GREEN. - Wider sweep (domain/sap10_calculator/ + domain/sap10_ml/): 1654 passed / 20 failed — exact pre-move baseline. All 20 failures pre-existing (10 hand-built skeleton + 4 cohort chain + 6 cohort diff). - Pyright net-zero on the 4 touched runtime/test files (0 errors) and unchanged on heat_transmission.py (13) / cert_to_inputs.py (35) / mapper.py (33). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-26 13:17:18 +00:00
Khalim Conn-Kowlessar	68401c517a	refactor: lift-and-shift packages/domain/src/domain/ml → domain/sap10_ml Sibling migration to the sap10_calculator move — `domain.ml` now lives at the root-level layout (`domain/sap10_ml/`) matching the pattern already used by `domain.addresses`, `domain.tasks`, `domain.postcode`, and `domain.sap10_calculator`. Changes: - `git mv packages/domain/src/domain/ml → domain/sap10_ml` (19 files; history preserved). - Subpackage rename: `domain.ml` → `domain.sap10_ml`. 32 references rewritten across .py and .md files: 11 internal + 21 external (datatypes/epc/domain/mapper.py, 14 files in domain/sap10_calculator, 2 backend tests, 2 ADRs, 1 README, 1 design doc). - Path-string updates: `pytest.ini` testpath `packages/domain/src/domain/ml/tests` → `domain/sap10_ml/tests` so ML tests stay in the default auto-discovered sweep. `CONTEXT.md` also updated. `packages/domain/src/domain/` is now empty — the workspace `domain/` tree has been fully migrated. Together with the `domain/__init__.py` deletions from the sap10_calculator commit (`29ac35cc`), `domain` is now a single root-level namespace package with subpackages {addresses, sap10_calculator, sap10_ml, tasks} + the standalone `postcode.py` module. Verified: - Focused sweep (backend mapper-chain + sap10_calculator worksheet e2e + golden fixtures): 99 passed / 19 failed — identical baseline. - Wider sweep (all sap10_calculator + sap10_ml): 1654 passed / 20 failed (same pre-existing failures). - domain/sap10_ml/tests: 210/210 PASSED at new path. - Pyright net-zero: heat_transmission.py 13, cert_to_inputs.py 35, mapper.py 33, rdsap_uvalues.py 1 (all unchanged from baseline). Note: `packages/domain/pyproject.toml` still declares `packages = ["src/domain"]` for the hatchling wheel — that target directory is now empty and the wheel build is effectively a no-op. Retiring the workspace package or repointing the wheel is a follow-up. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-26 13:01:35 +00:00
Khalim Conn-Kowlessar	29ac35ccbe	refactor: lift-and-shift packages/domain/src/domain/sap → domain/sap10_calculator Migration of the SAP 10.2 calculator package from the uv-workspace src-layout (`packages/domain/src/domain/sap`) to the root-level layout (`domain/sap10_calculator`), matching the pattern already used by `domain.addresses` / `domain.tasks` / `domain.postcode`. Changes: - `git mv packages/domain/src/domain/sap → domain/sap10_calculator` (92 files; git auto-detected all as renames so blame/history is preserved). - Subpackage rename: `domain.sap` → `domain.sap10_calculator`. 48 Python files rewritten (`from domain.sap.X` → `from domain.sap10_ calculator.X`); zero remaining `domain.sap` refs after the sed pass. - Path-string updates: 3 .py files (test fixtures + xlsx loader) + 6 markdown docs (CONTEXT.md, 2 ADRs, 3 sap-spec docs, sap10_ calculator/README.md) had hard-coded `packages/domain/src/domain/ sap/...` paths rewritten to `domain/sap10_calculator/...`. - `Path(__file__).parents[N]` rebasing: the old tree was 3 levels deeper than the new one (`packages/domain/src/`), so 4× `parents[7]` became `parents[4]` and 1× `parents[6]` became `parents[3]` across `tables/pcdb/{__init__.py, postcode_weather.py, etl.py}`, `worksheet/tests/_xlsx_loader.py`, and `tests/test_pcdb_etl.py`. - PEP 420 namespace package: deleted both `domain/__init__.py` (root + workspace, both load-bearing only as empty/docstring) so Python combines `domain.sap10_calculator` (root) and `domain.ml` (workspace) into one namespace package. Confirmed via `domain.__path__ == ['/workspaces/model/domain', '/workspaces/model/packages/domain/src/domain']`. Without this, the root `domain/__init__.py` shadowed the workspace one and `domain.ml` was unreachable. Verified: - Full sweep (`backend/documents_parser/tests/test_summary_pdf_ mapper_chain.py + domain/sap10_calculator/worksheet/tests/test_ e2e_elmhurst_sap_score.py + domain/sap10_calculator/rdsap/tests/ test_golden_fixtures.py`): 99 passed / 19 failed — exact same counts as pre-refactor. All 19 failures pre-existing (9 hand-built 001479 + 6 cohort diff + 4 cohort chain non-spec). - Wider sweep (all sap10_calculator + domain.ml): 1654 passed / 20 failed (the +1 vs the focused sweep is the pre-existing `test_roof_insulated_assumed_with_ni_thickness_uses_50mm_per_ section_5_11_4` which was already failing on the previous baseline). - Pyright net-zero on the three load-bearing baselines: `heat_transmission.py` 13, `cert_to_inputs.py` 35, `mapper.py` 33. Lift-and-shift only — no semantic renames (`Sap10Calculator` stays `Sap10Calculator`), no testpaths edits in pytest.ini (sap tests continue to be invoked by explicit pytest paths). Note: `domain.ml` still lives at `packages/domain/src/domain/ml/`. Migrating it would close out the dual-`domain/` layout but is out of scope for this commit. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-26 12:22:37 +00:00

18 commits