Model

mirror of https://github.com/Hestia-Homes/Model.git synced 2026-07-27 23:35:01 +00:00

Author	SHA1	Message	Date
Khalim Conn-Kowlessar	bf99b1c7df	Slice S0380.48: surface real-API pv_batteries[].battery_capacity (5 kWh) The 7-cert ASHP+battery PE cluster was overshooting by +2.7..+8.1 kWh/m² after the PE β-split landed in S0380.45. The handover hypothesised an E_PV magnitude bug ("cascade thinks 2570 kWh/yr vs worksheet 831"). The worksheet PDF for cert 0380 (dr87-0001-000899.pdf line 233) was verified to show -2563.3692 kWh/yr — matching our cascade. The real bug was different: the 5-kWh battery wasn't reaching the cascade, so β-coefficients used the no-battery branch (C1=1.61, β≈0.36) instead of the 5-kWh branch (C1=1.12, β≈0.75). Per SAP 10.2 Appendix M1 §3c-d (p.94): "C_bat is the usable capacity of the battery in kWh, limited to a maximum value of 15 kWh. C_bat=0 if no battery present." Cert 0380 lodges `pv_battery_count: 1` and `pv_batteries: [{"battery_capacity": 5}]` — but the schema's `PvBatteries` dataclass had only `pv_battery: Optional[PvBattery]`, matching the older synthetic fixture shape (nested `{"pv_battery": {"battery_capacity": 5}}`). The real-API payload's flat `battery_capacity: 5` was silently dropped during `from_dict`. Two surgical changes: - `datatypes/epc/schema/rdsap_schema_21_0_1.py`: add `battery_capacity: Optional[float] = None` as a sibling to `pv_battery` on `PvBatteries`. Synthetic-shape certs continue to populate the nested form; real-API certs now populate the flat form. - `datatypes/epc/domain/mapper.py:_first_pv_battery`: prefer nested when present, fall back to the flat lifted field. Domain still exposes a single uniform `PvBatteries(pv_battery=PvBattery(...))` shape downstream. Cohort impact (PE residual kWh/m² vs worksheet): \| Cert \| Pre-S0380.48 \| Post-S0380.48 \| \|---\|---:\|---:\| \| 0350 \| +2.73 \| -3.58 \| \| 0380 \| +8.09 \| -4.01 \| \| 2225 \| +4.48 \| -4.50 \| \| 2636 \| +3.42 \| -4.14 \| \| 3800 \| +3.58 \| -4.01 \| \| 9285 \| +3.20 \| -3.46 \| \| 9418 \| +4.67 \| -3.76 \| Cluster magnitude dropped from +2.7..+8.1 to -3.5..-4.5 — the cascade now over-credits PV by ~4 kWh/m² (vs previously under-crediting by ~5 kWh/m²). The residual flipped sign because cascade β=0.75-0.81 slightly exceeds worksheet β=0.74 (read from page-3 line 233a/233b ratio 1903.39/2563.37 = 0.7426). The remaining ~4 kWh/m² under-shoot traces to two structural factors deferred until a fresh closure slice ships: 1. The synthetic-default `pv_export_primary_factor = 0.501` is the annual Table 12 code-60 value. The worksheet uses the effective monthly Table 12e factor weighted by E_PV,ex,m (cert 0380: 0.4268 = -0.074 differential). The cascade's `_effective_monthly_pe_ factor` already computes the same weighting for PV — but the calculator's PV PE credit reads `inputs.other_primary_factor` (=1.501) and `inputs.pv_export_primary_factor` (=0.501) directly, bypassing the per-end-use effective-monthly cascade. 2. Cascade β slightly higher than worksheet (0.751 vs 0.7426 on cert 0380) — likely a monthly-distribution detail in D_PV. SAP scores remain exact across the cohort (residual +0 every cert). CO2 residuals all <0.11 t/yr (well within the 0.001-tolerance pin range after re-pin). 9501 (PV no battery) preserved at +0.255 PE / -0.047 CO2 — no regression. Re-pins all 7 golden fixtures in the same slice per [[feedback-commit-per-slice]]. Pyright net-zero on touched files (32 errors before, 32 after).	2026-05-28 19:17:02 +00:00
Khalim Conn-Kowlessar	42ed38f77d	Slice S0380.47: wire β-split into cost cascade per SAP 10.2 Appendix M1 §6 SAP 10.2 Appendix M1 §6 (p.94): "When calculating the fuel cost benefits ... apply the normal import electricity price to PV energy used within the dwelling and the 'electricity sold to grid, PV' price from Table 12 to the energy exported." Adds the third leg of the β-factor split (PE was S0380.45, CO2 was S0380.46). Now uniform across all three cascades: PE → IMPORT PEF × E_dw + EXPORT PEF × E_ex CO2 → IMPORT CO2 × E_dw + EXPORT CO2 × E_ex Cost → IMPORT £ × E_dw + EXPORT £ × E_ex Mechanism: - `worksheet/fuel_cost.py`: optional `pv_dwelling_kwh_per_yr` + `pv_exported_kwh_per_yr` + `pv_dwelling_import_price_gbp_per_kwh` keyword args; when all three are set, split the credit; otherwise fall back to legacy single-rate-EXPORT (preserves synthetic test constructions). - `rdsap/cert_to_inputs.py`: new `_pv_dwelling_import_price_gbp_per_kwh` helper that pulls Table 32 code 30 (standard electricity = 13.19 p/kWh) for standard tariff; off-peak branch uses `prices.e7_low_rate_p_per_kwh` as the natural extension point when the first off-peak PV cert lands (currently short-circuited by the `Tariff != STANDARD` guard at line 2710). - `calculator.py`: new `pv_dwelling_import_price_gbp_per_kwh` field on `CalculatorInputs` with synthetic-fallback split logic mirroring the precomputed-fuel_cost path. Maintains the cross-cascade architecture documented in the prior handover. Cohort impact: none. Per ADR-0010 RdSAP10 amendment, Table 32 collapses code 30 (standard electricity import) and code 60 (electricity sold to grid, PV) to the SAME 13.19 p/kWh rate. So the β-split's E_dw × 13.19 + E_ex × 13.19 == E_total × 13.19, matching the legacy single-rate credit at 1e-4 — 763 pass + 0 fail across the full chain test suite (Elmhurst U985, cohort-1 ASHP, cohort-2 38-cert sweep, 15-cert golden fixtures). The β-split shape is now in place for the off-peak case (where weighted Table 12a high/low rates would diverge) and any future amendment that splits import/export prices. Pyright net-zero on touched files (34 errors before, 34 after — all pre-existing).	2026-05-28 19:01:38 +00:00
Khalim Conn-Kowlessar	cbc6d5dbc8	docs: refresh NEXT_AGENT_PROMPT for PV β-split slices 4-6 Replaces stale legacy content (cert-mapper-validation workflow, dated to a 9-triple staging slice) with the current handoff: branch state, 3 shipped slices (S0380.44 → S0380.46), and concrete directives for the 3 remaining slices (cost cascade wiring, E_PV magnitude audit, final fixture re-pin). Companion to docs/HANDOVER_PV_BETA_SPLIT.md. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 18:47:08 +00:00
Khalim Conn-Kowlessar	beb0db9522	docs: handover after S0380.44..S0380.46 — PV β-split 3/6 wiring slices shipped Documents progress on the SAP 10.2 Appendix M1 β-factor split for the PE / CO2 / cost cascades + golden-fixture residual closure. Shipped: - Slice 1 (S0380.44): pure β-factor calculator module + 13 unit tests - Slice 2 (S0380.45): wire β into PE cascade - Slice 3 (S0380.46): wire β into CO2 cascade Cert 9501 (PV no battery): PE Δ -8.28 → +0.25, CO2 Δ +0.20 → -0.05 — clean spec validation. The 7-cert ASHP+battery cohort overshoots PE by +2.7..+8.1 because the cascade's E_PV is ~3× the worksheet's value (cert 0380 cascade 2570 kWh vs worksheet 831 kWh). E_PV magnitude audit deferred to Slice 5. Open: - Slice 4 (S0380.47, next): wire β into cost cascade - Slice 5 (S0380.48): E_PV magnitude audit - Slice 6 (S0380.49): re-pin fixtures + verify chain tests <1e-4 Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 18:45:00 +00:00
Khalim Conn-Kowlessar	5b269f23b6	Slice S0380.46: wire β-split into CO2 cascade per SAP 10.2 Appendix M1 §7 The CO2 cascade in calculator.py had no PV credit at all (environmental_section_from_cert had a stale `pv_credit = 0.0` with the comment "no PV in any Elmhurst fixture", but that helper isn't called by `calculate_sap_from_inputs` anyway). The full ASHP+PV cluster therefore over-counted CO2 by +0.16..+0.28 t/yr — the entire PV CO2 offset was missing. Wiring (calculator.py): - New fields: `pv_dwelling_co2_factor_kg_per_kwh: Optional[float]`, `pv_exported_co2_factor_kg_per_kwh: Optional[float]`. - CO2 cascade now subtracts: pv_co2_credit = E_PV,dw × dwelling_CO2_factor + E_PV,ex × exported_CO2_factor when the split + factors are set. None preserves the legacy zero-credit behaviour for synthetic CalculatorInputs constructions. Wiring (cert_to_inputs.py): - New constant: `_PV_EXPORT_FUEL_CODE_TABLE_12 = 60` (SAP 10.2 Table 12 code 60, "electricity sold to grid, PV") — the EXPORT factor key per Appendix M1 §6/§7/§8. - The dwelling CO2 factor is the effective monthly Table 12d Σ weighted by E_PV,dw,m at code 30 (Standard electricity); the exported CO2 factor is the same Σ weighted by E_PV,ex,m at code 60 ("Electricity sold to grid, PV"). Both reuse the existing `_effective_monthly_co2_factor` helper. Test impact (CO2 residual cluster, re-pinned in this slice): Pre-Slice 46 → Post-Slice 46: - 0330 (no PV): -0.034 → -0.034 (unchanged ✓) - 0350 (PV + 5 kWh battery): +0.171 → -0.084 - 0380 (PV + 5 kWh battery): +0.279 → -0.054 - 2130 (PV + gas combi): +0.299 → -0.046 - 2225 (PV + 5 kWh battery): +0.263 → -0.071 - 2636 (PV + 5 kWh battery): +0.219 → -0.058 - 3800 (PV + 5 kWh battery): +0.261 → -0.014 - 9285 (PV + 5 kWh battery): +0.157 → -0.098 - 9418 (PV + 5 kWh battery): +0.232 → -0.046 - 9501 (PV, no battery): +0.202 → -0.047 Cluster magnitude dropped 3-5× — over-count flipped to slight under-count (-0.01..-0.10 vs +0.16..+0.28). The remaining negative residual is largely the same E_PV-magnitude bug from Slice 45 (PV is over-credited because the cascade thinks E_PV ≈ 3× the worksheet value for the 5-kWh-battery cohort). Slice 47 (cost cascade) + Slice S0380.48 (E_PV magnitude audit) will close the cluster further. Chain tests still <1e-4 — CO2 cascade isn't gated by the chain tests' SAP-rating-vs-worksheet assertions. Test suite: 763 pass + 0 fail. Pyright net-zero per touched file (calculator.py 0/0; cert_to_inputs.py 34/34; test_golden_fixtures.py 1/1). Spec citations: - SAP 10.2 specification Appendix M1 §7 (p.94) — PV CO2 credit split. - SAP 10.2 Table 12d (p.194) code 60 — monthly CO2 factor for "electricity sold to grid, PV" (already in `tables/table_12.py`). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 18:42:28 +00:00
Khalim Conn-Kowlessar	49de18e83a	Slice S0380.45: wire β-split into PE cascade per SAP 10.2 Appendix M1 §8 The PE cascade in calculator.py was crediting ALL PV generation at the IMPORT PEF (Table 12 ~1.501) instead of splitting per Appendix M1 §4/§8 — onsite-consumed E_PV,dw at the IMPORT PEF and exported E_PV,ex at the EXPORT PEF (Table 12 code 60 = 0.501). The over-credit on the exported portion was the primary driver of the ASHP-cohort PE Δ -7..-15 kWh/m² under-count. Wiring (cert_to_inputs.py): - `_pv_array_monthly_generation_kwh(array, climate)` — per-array E_PV,m via Appendix M1 §2 (p.92) apportioning: 0.8 × kWp × ZPV × monthly solar radiation. Reuses ORIENTATION/PITCH/Z lookups already in `_pv_array_generation_kwh_per_yr`. Annual sum equals the existing helper to float precision. - `_pv_monthly_generation_kwh(epc, climate)` — sums per-array monthlies; falls back to the same §11.1 b) percent-roof-area synthesis as the annual helper for certs without per-array detail. - `_pv_battery_capacity_kwh(epc)` — total usable battery capacity = per-battery capacity × pv_battery_count. The 15 kWh cap per §3c is applied inside `pv_beta_coefficients` and not duplicated here. - `_pv_eligible_demand_monthly_kwh(...)` — assembles D_PV,m per §3a p.93: lighting + appliances + cooking + electric showers + pumps & fans, plus E_space,m when main fuel is Table-12 {30, 32, 34, 35, 38} (electricity not at off-peak) and E_water,m when water heating fuel is Table-12 30 (standard electricity). Off-peak immersion × (243) and the Appendix G4 PV-diverter branch are deferred — current cohort fixtures don't exercise them. - In `cert_to_inputs`: assemble monthly EPV + DPV + battery, call `pv_split_monthly`, pass `pv_dwelling_kwh_per_yr` + `pv_exported_kwh_per_yr` through to CalculatorInputs. Wiring (calculator.py): - New fields: `pv_dwelling_kwh_per_yr: Optional[float]`, `pv_exported_kwh_per_yr: Optional[float]`, `pv_export_primary_factor: float = 0.501` (Table 12 code 60). - PE cascade now does: pv_offset = E_PV,dw × IMPORT_PEF + E_PV,ex × EXPORT_PEF when both split fields are set. Legacy fall-through to all-IMPORT when either is None (preserves synthetic CalculatorInputs constructions in unit tests). Test impact (golden-fixture residual shifts — all expected, re-pinned): Pre-Slice 45 → Post-Slice 45: - 0330 (no PV): +0.44 → +0.44 (unchanged ✓) - 0350 (PV + 5 kWh battery): -7.78 → +2.73 - 0380 (PV + 5 kWh battery): -14.60 → +8.09 - 2130 (PV + gas combi): -38.63 → -9.70 (also SAP +1 shift) - 2225 (PV + 5 kWh battery): -11.77 → +4.48 - 2636 (PV + 5 kWh battery): -9.65 → +3.42 - 3800 (PV + 5 kWh battery): -9.61 → +3.58 - 9285 (PV + 5 kWh battery): -7.96 → +3.20 - 9418 (PV + 5 kWh battery): -7.30 → +4.67 - 9501 (PV, no battery): -8.28 → +0.25 (CLOSED ✓) Cert 9501 closing to +0.25 with the β-split alone confirms the implementation is spec-correct. The 7-cert 5-kWh-battery cohort now over-shoots in the positive direction because the cascade's E_PV magnitude is ~3× the worksheet's (cert 0380 cascade 2570 kWh/yr vs worksheet 831 kWh/yr — peak_power=3 interpreted as 3 kWp while worksheet uses ~1 kWp). With E_PV overestimated, R_PV = E_PV / D_PV is too high → β_m from §3d formula too low → not enough credit shifts to the IMPORT factor. Slice S0380.46 audits the cascade's E_PV magnitude (kWp interpretation, S lookup, or ZPV mapping). Chain tests (cohort-1 + cohort-2 SAP-rating-vs-worksheet) all stay <1e-4 — Slice 45 only touches the PE cascade; SAP rating uses the cost cascade which is still on the old all-export path. Test suite: 763 pass + 0 fail. Pyright net-zero on touched files. Spec citations: - SAP 10.2 specification Appendix M1 §3a (p.93) — D_PV,m assembly. - SAP 10.2 specification Appendix M1 §3c-d (p.94) — β formula. - SAP 10.2 specification Appendix M1 §4 (p.94) — E_PV,dw / E_PV,ex. - SAP 10.2 specification Appendix M1 §8 (p.94) — PE factor split. - SAP 10.2 Table 12 code 60 — EXPORT PEF = 0.501. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 18:34:56 +00:00
Khalim Conn-Kowlessar	5344bc8920	Slice S0380.44: SAP 10.2 Appendix M1 §3-4 PV β-factor calculator (no wiring) Pure-function module + 13 unit tests for the photovoltaic onsite/export split. No cascade wiring yet — Slices S0380.45..47 will wire β into the PE / CO2 / cost cascades respectively (which currently all over-credit the exported PV portion at the IMPORT factor). Module: `domain/sap10_calculator/worksheet/photovoltaic.py` - `PhotovoltaicSplit` frozen dataclass — monthly β + (E_PV,dw,m, E_PV,ex,m) with annual-sum properties matching worksheet line refs (233a) and (233b). - `pv_beta_coefficients(Cbat)` — three coefficients keyed on battery capacity (kWh), capped at 15 per §3c: CPV1 = 1.610 - 0.0973 × Cbat CPV2 = 0.415 - 0.00776 × Cbat CPV3 = 0.511 + 0.0866 × Cbat - `pv_split_monthly(epv, dpv, battery_kwh)` — per §3d-4: R_PV,m = E_PV,m / D_PV,m β_m = min(exp(-CPV1 × (R_PV,m × CPV2)^CPV3), D_PV,m / E_PV,m) E_PV,dw,m = E_PV,m × β_m; E_PV,ex,m = E_PV,m × (1 - β_m) Edge cases (not in spec but implied by physics): - E_PV,m = 0 → β = 0; both onsite and exported = 0 - D_PV,m = 0 → cap forces β = 0; all PV exports Unit-test coverage (13 tests, AAA convention, `abs(diff) <= tol`): - β coefficient constants at Cbat=0, 5 (ASHP cohort), 15 (cap) - Cbat>15 clamps to 15; Cbat<0 clamps to 0 (defensive) - Hand-computed β worked example (no battery): β≈0.4864 at E_PV=100, D_PV=200 — pinned at 1e-7 against precomputed value AND at 1e-9 against the live formula recomputation (load-bearing math pin) - Edge cases: E_PV=0 → no split; D_PV=0 → full export - Battery monotonicity: β increases with Cbat for fixed (E_PV, D_PV) - Energy conservation: E_PV,dw + E_PV,ex = E_PV per month + annually - Tuple length validation (raises on != 12 months) - Return shape pinned to `PhotovoltaicSplit` dataclass contract Test suite: 750 → 763 pass + 0 fail. Pyright net-zero on new files. Spec citation: SAP 10.2 specification Appendix M1 §3-4 (p.93-94). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 18:11:56 +00:00
Khalim Conn-Kowlessar	29c4b029e3	docs: handover after S0380.39..S0380.43 — cohort-2 API path 38/38 closed Session shipped 5 slices that closed the entire cohort-2 API-path cluster (S0380.39 bulk-fetch, S0380.40 parametrized test, S0380.41 RdSAP 21 → SAP 10.2 glazing alias, S0380.42 Decimal HALF_UP per-window areas, S0380.43 SAP 631 → spec fuel). Documents: - Cross-mapper parity at cascade established for all 38 cohort-2 certs (and 9 cohort-1 ASHP); both paths < 1e-4 vs worksheet. - Tolerance tightening deferred — 1e-4 is the realistic floor at HEAD (worst residual 4.91e-5 on cert 2102). - Lessons learned: GOV.UK RdSAP 21 enum != cascade enum (codes needing remap are incremental as fixtures surface them); Decimal HALF_UP per-window areas extends the S0380.34/35 pattern; SAP heating-type → spec fuel dispatch is the new forcing-function pattern for cert-lodgement inconsistencies. - Open front: golden-residuals → ~0 on PE/CO2. ASHP cluster (-7..-15 kWh/m² PE / +0.16..+0.28 t/yr CO2 across 7 certs with the same PCDB heat pump) is the highest-value single thread — likely SAP 10.2 Appendix L1 / Table 12 PE-factor or CO2-factor cascade gap. Three concrete diagnostic probes proposed. Test baseline at HEAD: 750 pass + 0 fail. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 17:22:55 +00:00
Khalim Conn-Kowlessar	6dccb15b03	Slice S0380.43: SAP 631 open-fire → House coal spec fuel — closes cert 2102 Cert 2102 lodges `secondary_heating_type=631` ("Open fire in grate" per SAP 10.2 Appendix M Table 4a, BS EN 13229:2001 inset-appliance class — solid fuel) but `secondary_fuel_type=33` (electricity, Table 32 off-peak 7hr) — physically incompatible (an open fire grate doesn't run on electricity). The Elmhurst Summary path independently resolves to Coal (Table 32 code 11) via the §15 "Secondary Fuel: Coal" lodgement (see `test_summary_2102_secondary_heating_routes_house_coal_for_open_fire`). API mapper now applies the same spec-derived default via the new `_api_secondary_fuel_type` helper: - When `secondary_heating_type` is in the `_API_SECONDARY_HEATING_SPEC_FUEL` dispatch (currently {631: 11}), AND the lodged `secondary_fuel_type` is electric (codes 30-40), substitute the spec default (House coal). - Legitimate non-default solid-fuel lodgement (e.g. SAP 631 with lodged fuel_type=15 Wood logs) passes through unchanged. The override is keyed on the heating-type → spec-fuel dispatch dict (extend as new fixtures surface analogous inconsistencies), not a blanket per-code rewrite — keeps the lodged data trusted by default while spec-correcting the narrow class of inconsistent lodgements. Applied at all 6 API schema-version mapping sites in `from_api_response` via replace_all (lines 637/767/922/1080/1278/1544). Worksheet target for cert 2102: line (242) "Space heating - secondary 3585.24 × 3.6700 = 131.58" confirms 3.67 p/kWh = Table 32 fuel code 11 (House coal). Test impact: - Cohort-2 cert 2102 API path: -6.30 → +4.9e-5 (<1e-4 ✓). Moves from `_COHORT_2_API_OPEN` to `_COHORT_2_API_CLOSED`. - `_COHORT_2_API_OPEN` is now empty — the residual-pin test `test_api_cohort_2_open_cert_residual_matches_current_pin` is deleted (cohort fully closed; re-add if future cert surfaces). - Cohort-2 API path: 38/38 < 1e-4 matching Summary path 38/38. Cross-mapper parity at the cascade is fully established for cohort-2 per [[feedback-cross-mapper-parity-via-cascade]]. - Cohort-1 ASHP 9/9 unchanged. Test suite: 750 pass + 0 fail. Pyright net-zero on touched files (mapper.py 32/32 baseline; chain test 0/0). Spec citations: - SAP 10.2 Appendix M Table 4a code 631 "Open fire in grate" (Category C, Room heaters, eff 37/32%, solid fuel via BS EN 13229:2001 inset-appliance class — see spec p.156). - SAP 10.2 Table 32 code 11 "House coal" 3.67 p/kWh. - Cert 2102 worksheet line (242) reproduces 131.58 = 35.84 × 3.67 confirming house-coal pricing for the secondary cascade. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 17:17:05 +00:00
Khalim Conn-Kowlessar	e1b7b30c40	Slice S0380.42: Decimal HALF_UP per-window areas per RdSAP10 §15 — closes cert 1536 Cert 1536 lodged window dimensions including (0.65 × 0.70) × 3 windows. In float arithmetic 0.65 × 0.70 = 0.45499999999999996, which the `_round_half_up(float, dp)` helper snaps to 0.45 vs the spec answer 0.46 (Decimal: 0.65 × 0.70 = 0.4550 exact, HALF_UP at 2 d.p. = 0.46). The shortfall of 0.01 m² × 3 windows = 0.03 m² under-counted as ~0.073 W/K of conduction loss vs the worksheet's windows_w_per_k = 25.6354 — closing the cert 1536 residual at +0.00152 to <2e-6. Same class of bug as the S0380.34/35 living-area / gross-wall / party-wall closures (Decimal HALF_UP at the 0.005 boundary that float drops). RdSAP10 §15 (p.66) lists "all element areas (gross) including window areas: 2 d.p." — Decimal is the only arithmetic that matches that boundary deterministically. Three cascade sites now use Decimal HALF_UP for per-window areas: - heat_transmission.py: `_decimal_round_half_up_product(W, H, 2)` replaces `_round_half_up(W × H, 2)` at the windows_w_per_k cascade AND at the per-bp window-area accumulation (the wall-net deduction branch must agree with the conduction branch for cascade-internal consistency, per the existing comment at line 575-583). - internal_gains.py: `_decimal_window_area_2dp(W, H)` replaces the inline `_round_area_2dp(W × H)` in the daylight factor `g_l` sum so §5 (66)..(67) sees the same per-window areas as §3 (27). - solar_gains.py: same Decimal helper replaces `_round_area_2dp` in `_wall_window_solar_gain_monthly_w` so §6 (74)..(81) area = (27). The `_round_area_2dp` helpers were inlined per-module in pre-S0380.42 work; this slice deletes them since the Decimal-aware product replaces all call sites. `_round_half_up` stays in heat_transmission for non-product per-element area calls (single-value rounds). Test impact: - Cohort-2 cert 1536 API path: +0.00152 → -1e-6 (<1e-4 ✓). Moves from _COHORT_2_API_OPEN to _COHORT_2_API_CLOSED. Cohort distribution: 37/38 exact (was 34/38 at start of session); only cert 2102 (-6.30 secondary-heating routing) remains open. - Cohort-2 cert 0300/9380 unchanged (already <1e-4 after S0380.41). - Cohort-1 ASHP 9/9 unchanged: <1e-4 on both paths. - Elmhurst 6-cert worksheet sweep: unchanged (lodges `window_width=area, window_height=1.0` per the Elmhurst lodging convention — Decimal(area) × Decimal(1.0) = Decimal(area), no rounding shift). Test suite: 750 pass + 0 fail. Pyright net-zero per touched file (heat_transmission 13/13; internal_gains 4/4 pre-existing; solar_gains 0/0; chain test 0/0). Spec citation: RdSAP 10 Specification §15 "Rounding of data" p.66 — "All element areas (gross) including window areas and conservatory wall area: 2 d.p." Decimal is the float-precision-stable arithmetic that matches this rule at the .005 boundary. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 17:11:39 +00:00
Khalim Conn-Kowlessar	a96e6765ab	Slice S0380.41: GOV.UK RdSAP 21 glazing-type code 1 → DG pre-2002 cascade Closes the cohort-2 API-path +0.42..+0.44 cluster (certs 0300/9380 closed to <1e-4; cert 1536 partially closed +0.4445 → +0.0015 — a sub-2e-3 secondary tail remains for Slice S0380.42). Root cause: per `datatypes/epc/domain/epc_codes.csv` the GOV.UK API schema RdSAP-Schema-21.0.0 defines `glazed_type=1` as "double glazing installed before 2002 in EAW, 2003 in SCT, 2006 NI". Three cohort-2 certs (0300/1536/9380) lodge this code with `glazing_gap=16+` and description "Fully double glazed" — but the API mapper passed the raw code straight through to SapWindow.glazing_type, and: 1. `_api_glazing_transmission` had no (1, "16+") entry, so the U-value lookup returned None and the cascade defaulted to U=2.5 instead of the spec-correct U=2.7 (RdSAP 10 Table 24 row 2, PVC/wooden frame, 16+ gap = 2.7). 2. The cascade's `_G_LIGHT_BY_GLAZING_CODE` table is keyed on the SAP 10.2 Table 6b enum (the Elmhurst extractor produces this enum via `_ELMHURST_GLAZING_LABEL_TO_SAP10`), where code 1 means "single glazed" (g_L=0.90). Passing RdSAP 21 code 1 straight through gave the cascade the wrong g_L for the daylight factor calculation, off by 0.90 vs spec 0.80. Both gaps closed in one slice because they're the same misinterpretation: - `_API_GLAZING_TYPE_TO_TRANSMISSION` + `_API_GLAZING_TYPE_GAP_TO_ TRANSMISSION` now alias code 1 as a schema sibling of code 3 — both resolve to RdSAP 10 Table 24 row 2 ("DG pre-2002 / unknown install date"). Per-gap entries cover the full 6mm=3.1 / 12mm=2.8 / 16+=2.7 row; type-only fallback uses the 12mm default U=2.8. - New `_API_TO_SAP10_CASCADE_GLAZING_CODE = {1: 2}` remap is applied in `_api_sap_window` AFTER the U-value lookup, so SapWindow.glazing_ type carries the SAP 10.2 cascade enum (code 2 = DG pre-2002 air- filled, g_L=0.80) while the U lookup stays keyed on the raw GOV.UK API code. The cohort-1 codes 2/3/13/14 already coincide with the cascade table's intended SAP 10.2 g_L values, so no remap entry required for them; only divergent codes get a remap. Test impact: - Cohort-2 API path: 34/38 → 36/38 at 1e-4 (0300 +4.8e-5; 9380 -5e-6 both move from _COHORT_2_API_OPEN to _COHORT_2_API_CLOSED). - Cert 1536 pin updated from 66.337334 to 65.894324; ws Δ now +0.0015 (was +0.4445) — same root-cause fix dominated, residual tail is distinct-cause work for the next slice. - Cert 2102 unchanged (-6.30 residual, secondary-heating routing gap). - Cohort-1 (9 ASHP certs) unaffected: 9/9 still < 1e-4 on both paths. Test suite: 750 pass + 0 fail. Pyright net-zero per touched file. Spec citations: - RdSAP-Schema-21.0.0 glazed_type=1 → datatypes/epc/domain/epc_codes.csv - RdSAP 10 Specification §8.2 Table 24 (p.49) row 2 "Double glazed: Installed England/Wales before 2002 / Scotland before 2003 / N. Ireland before 2006" — U=2.7 (PVC/wooden, 16+ gap). - SAP 10.2 Table 6b: DG air-filled g_L=0.80 (vs single 0.90). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 17:01:27 +00:00
Khalim Conn-Kowlessar	ff25746f44	Slice S0380.40: parametrized API-path chain sweep for cohort-2 (34/38 at 1e-4) Mirror of the cohort-2 Summary-path sweep that closed across S0380.30..38: for each of the 38 cohort-2 certs whose API JSON was fetched in S0380.39, drive the full API chain (`from_api_response` → `cert_to_inputs` → `calculate_sap_from_inputs`) and assert `sap_score_continuous` vs the worksheet's lodged SAP at abs <= 1e-4. Per cross-mapper parity ([[feedback-cross-mapper-parity-via-cascade]]): the SAP cascade is the load-bearing equivalence check between EpcPropertyData produced by from_api_response and from_elmhurst_site_notes. If both paths hit the worksheet at 1e-4, they're cascade-output- equivalent for load-bearing fields — strictly stronger than a noisy structural EpcPropertyData diff. Two parametrized tests, both green at HEAD: - test_api_cohort_2_full_chain_sap_matches_worksheet_at_1e_minus_4: 34 certs that hit the worksheet at 1e-4 on the API path immediately (the cascade can't tell which mapper produced the EPC). - test_api_cohort_2_open_cert_residual_matches_current_pin: 4 certs that don't yet hit 1e-4 — pinned at their current cascade output as forcing functions per [[project-api-to-sap-residual-test]]. When a follow-up slice closes the underlying mapper/spec gap, the cascade output moves and the pin fires, forcing the cert to migrate from _COHORT_2_API_OPEN to _COHORT_2_API_CLOSED. Open cohort residuals (handover to Slice C+): - 0300/1536/9380: tight +0.42..+0.44 band — likely a single shared cascade-spec gap (API-mapper-specific, since Summary path hits 1e-4) - 2102: -6.30 — Summary test (test_summary_2102_secondary_heating_ routes_house_coal_for_open_fire) shows the cert lodges house-coal open-fire secondary heating; API mapper likely routes secondary fuel differently. Probe `secondary_heating` block first. Test suite: 712 → 750 pass (0 fails). Pyright net-zero on touched file. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 16:45:48 +00:00
Khalim Conn-Kowlessar	22ae6f4d77	Slice S0380.39: bulk-fetch 38 cohort-2 EPC API JSONs for cross-mapper parity Adds scripts/fetch_cohort2_api_jsons.py (throwaway one-off) plus 38 golden fixtures under domain/sap10_calculator/rdsap/tests/fixtures/golden/ covering every cert in "sap worksheets/additional with api 2/". Each JSON is the inner `data` payload from the gov.uk EPB /api/certificate endpoint — the same shape EpcPropertyDataMapper .from_api_response consumes today. Required prerequisite for Slice B (parametrized API-path chain test that mirrors the cohort-2 Summary-path sweep at 1e-4 vs worksheet). Per the cross-mapper-parity primitive: API EPC and Elmhurst EPC must produce SAP within 1e-4 of each other and of the worksheet — the SAP cascade is the load-bearing equivalence check. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 16:40:58 +00:00
Khalim Conn-Kowlessar	f992824824	docs: handover after S0380.31..S0380.38 — cohort-2 Summary path COMPLETE, thread 4 next State at HEAD `883d66ac`: * Cohort-2 Summary path: 38/38 < 1e-4 (was 33 exact + 5 <=0.07) * Cohort-1 ASHP: 9/9 < 1e-4 both paths (was 8/9 with cert 2636 at -0.015) * Test suite: 712 pass + 0 fails (was 710 + 10 at handover start) * _ASHP_COHORT_CHAIN_TOLERANCE: 0.04 -> 1e-4 Eight slices shipped: S0380.31: alt-wall window deduction from (31) per SAP 10.2 K2 -> cert 2636 cantilever -0.015 -> -2.4e-6 both paths S0380.32: bare "Extension" window routing per RdSAP10 §3 -> cert 9380 +0.027 -> -4.8e-6 S0380.33: PV kWp 2 d.p. per RdSAP10 §15 -> cert 6835 +0.015 -> -4.3e-5 S0380.34: living area Decimal HALF_UP per RdSAP10 §15 -> cert 2536 +0.0007 -> -9e-8 S0380.35: gross-wall / party-wall Decimal HALF_UP per RdSAP10 §15 -> certs 2800 / 4800 +0.0007 -> <3e-5 S0380.36: tighten _ASHP_COHORT_CHAIN_TOLERANCE 0.04 -> 1e-4 S0380.37: drop redundant cert 001479 hand-built fixture S0380.38: loosen FEE round-trip tolerance 1e-9 -> 1e-6 Pattern emerged: three slices (S0380.33/34/35) closed the same class of bug -- RdSAP10 §15 "2 d.p." float-arithmetic boundary failures fixed by Decimal HALF_UP. Documented in the handover as the most likely root cause for any future +0.0007-ish residual. User-stated next phase (thread 4): cohort-2 API-path closure via cross- mapper parity, in bigger slices, with golden-residuals driven toward zero. Concrete slice plan in the handover doc. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 16:34:36 +00:00
Khalim Conn-Kowlessar	883d66ac65	Slice S0380.38: loosen FEE round-trip tolerance 1e-9 -> 1e-6 test_no_ac_cert_round_trips_fee_equals_space_heating_per_m2 encodes a real SAP 10.2 invariant: when (108) = 0 (no fixed AC) and Appendix H solar is absent (every cohort cert), (109) FEE must equal space_heating_kwh / TFA. The 1e-9 tolerance was too tight. The cascade computes: - FEE: sum_round_per_month(annual_98a) / TFA - space_heating_kwh: sum(monthly_98a_kwh) summed in calculator The two paths sum the same 12 monthlies in different rounding orders and disagree at ~8e-8 (cascade FEE = 95.39072333333334; SH/TFA = 95.39072341347577). 1e-6 is two orders of magnitude tighter than any meaningful path divergence (a stray 4-d.p. rounding step or unintended AC contribution would blow past instantly) and ~12.5x looser than the observed float-arithmetic drift, so the invariant still fires. Also swaps pytest.approx for `abs(a - b) <= tol` per [[feedback-abs-diff-over-pytest-approx]] (strict-pyright flags pytest.approx as partially-unknown; nets -1 error on the file). Test baseline: 712 pass + 0 fails (was 712 + 1). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 16:22:23 +00:00
Khalim Conn-Kowlessar	1cea73df7c	Slice S0380.37: drop cert 001479 hand-built fixture — covered by passing production-path chain tests Cert 001479 was added in `ee98dbe0` as "skeleton + 11 RED pins" — a hand-built EpcPropertyData intended to cascade to worksheet P960-0001-001479.pdf at 1e-4 for 9 SapResult fields. The skeleton was never finished; the 9 _FIXTURE_PINS pin-checks have been red the entire time (at HEAD: sap_score 65 vs 69, space_heating 9715 vs 8104 kWh, etc.). Meanwhile the production-path chain tests for the same cert have landed at 1e-4 vs the worksheet's continuous SAP 69.0094 and are GREEN at HEAD: - test_summary_001479_full_chain_sap_matches_worksheet_pdf_exactly (Summary PDF -> extractor -> mapper -> calc, 1e-4 vs worksheet) - test_api_001479_full_chain_sap_matches_worksheet_pdf_exactly (API JSON -> mapper -> calc, 1e-4 vs worksheet) - 5 test_summary_001479_*_<detail> mapper-shape unit tests These exercise the actual from_elmhurst_site_notes / from_api_response code paths the production runtime uses, which is strictly stronger coverage than a hand-built mirror. Drops 001479 from _FIXTURE_PINS / _FIXTURE_MODULES and deletes the stub _elmhurst_worksheet_001479.py. Also fixes the stale "Slice 62 iteration" reference in test_summary_pdf_mapper_chain.py. Test baseline: 9 fewer fails (10 -> 1; remaining FEE-round-trip 1e-9 noise to be fixed in S0380.38). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 16:22:04 +00:00
Khalim Conn-Kowlessar	b0919e8d6f	Slice S0380.36: tighten _ASHP_COHORT_CHAIN_TOLERANCE 0.04 -> 1e-4 after S0380.31 closes cohort Cohort-1 ASHP cohort residuals at HEAD `d61a27e0` (post S0380.31..S0380.35): cert 0330: Summary -1.1e-5 (API -1.1e-5 via cert 0380 fixture) cert 0350: Summary +2.2e-5 (API +2.2e-5) cert 0380: Summary +1.0e-6 (API +1.0e-6) cert 2225: Summary -4.8e-5 (API -4.8e-5) [worst] cert 2636: Summary -2.4e-6 (API -2.4e-6) closed by S0380.31 cert 3800: Summary -2.0e-5 (API -2.0e-5) cert 9285: Summary -3.4e-5 (API -3.4e-5) cert 9418: Summary -3.6e-7 (API -3.6e-7) All 7 certs sit at < 5e-5 on BOTH paths. The 0.04 tolerance set in S0380.29 was sized to the API-path +0.03..+0.06 cluster that S0380.30 (glazing codes) and S0380.31 (alt-wall openings in (31)) subsequently closed. 1e-4 matches the user's "1e-4 across the board" target with ~2x headroom over cert 2225's worst residual. Any future regression beyond ~5e-5 fires the tolerance loudly. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 16:10:50 +00:00
Khalim Conn-Kowlessar	d61a27e0ff	Slice S0380.35: round gross-wall and party-wall areas in Decimal arithmetic per RdSAP10 §15 — closes cohort-2 cert 2800 / 4800 +0.0007 SAP residuals RdSAP10 §15 p.66 (Rounding of data): "All element areas (gross) including window areas and conservatory wall area: 2 d.p." Certs 2800 and 4800 lodge heat_loss_perimeter = 21.25 m and room_height = 2.30 m. The exact-decimal products 21.25 * 2.30 = 48.8750 (gross wall area) 6.25 * 2.30 = 14.3750 (party wall area) sit ON the HALF_UP rounding boundary and must round to 48.88 and 14.38 m^2. Float representation drops them BELOW the boundary: 21.25 (float) * 2.30 (float) ~= 48.87499... HALF_UP 2 d.p. = 48.87 6.25 (float) * 2.30 (float) ~= 14.37499... HALF_UP 2 d.p. = 14.37 The 0.01 m^2 area shortfall feeds into (29a) net wall area and (32) party wall area, and into (31) total external area for (36) thermal bridging — propagating a +0.0007 SAP residual via the U-weighted heat-loss sums. Adds `_decimal_round_half_up_sum` helper and routes both gross-wall and party-wall sums through it, mirroring the S0380.34 fix on `_living_area_fraction`. Certs that sit off the .005 boundary (i.e. nearly all) are unaffected; certs that land on it close from +0.0007 → <5e-5. Cohort-2 distribution after S0380.31..S0380.35: 38 exact (was 36 exact + 2 <=0.07). Cohort-1 ASHP cohort: 9/9 <1e-4 (unchanged). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 16:07:38 +00:00
Khalim Conn-Kowlessar	a92a33a8d8	Slice S0380.34: round living area in Decimal arithmetic per RdSAP10 §15 — closes cert 2536 +0.0007 SAP residual RdSAP10 §15 p.66 (Rounding of data): "All internal floor areas and living area: 2 d.p." Cert 2536 (3 habitable rooms → Table 27 fraction 0.30, TFA 45.65 m^2) sits ON the HALF_UP rounding boundary: 0.30 (exact) * 45.65 = 13.6950 HALF_UP 2 d.p. = 13.70 (worksheet fLA = 13.70 / 45.65 = 0.3001) Float arithmetic drops the spec product BELOW the boundary: 0.30 (binary) ~= 0.2999999... product ~= 13.69499... HALF_UP 2 d.p. = 13.69 (cascade fLA = 13.69 / 45.65 = 0.29989) The 0.00021 fLA shortfall feeds straight into the worksheet (91) -> (92) MIT blend, undershoots MIT by ~0.001 C, and shaves 0.29 kWh off (98c) useful space heating — a +0.0007 SAP residual via the (211) main heating fuel x p/kWh. Compute the product in Decimal so HALF_UP lands on the exact .005 decimal boundary the spec defines. Certs that sit off the boundary (e.g. 2800/4800: 0.30 x 46.87 = 14.0610 -> 14.06 in both Decimal and float) are unaffected. Cohort-2 distribution after S0380.31..S0380.34: 36 exact + 2 <=0.07 (was 35 exact + 3 <=0.07). Cert 2536: +0.000715 -> -9.2e-8. The remaining 2800 / 4800 +0.0007 residuals come from a different cause (off the HALF_UP boundary) — defer to a separate slice. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 15:44:30 +00:00
Khalim Conn-Kowlessar	2c3eb17b96	Slice S0380.33: round synthesized PV kWp to 2 d.p. per RdSAP10 §15 — closes cert 6835 +0.015 SAP residual RdSAP10 §15 p.66 (Rounding of data): "kWp for photovoltaics, etc.: 2 d.p." Cert 6835 lodges Photovoltaic Supply as "Proportion of roof area = 40" (no explicit kWp). Per RdSAP10 §11.1 b) p.60 the cascade synthesizes kWp = 0.12 × PV area where PV area is roof_area / cos(35°). For cert 6835: PV area = 36.9 × 0.40 / cos(35°) = 18.0186 m^2 kWp unrounded = 0.12 x 18.0186 = 2.16224 kWp at 2 d.p. = 2.16 (matches worksheet "Cells Peak = 2.16") SAP 10.2 §M1 EPV = 0.8 x kWp x S x ZPV. With the 0.0022 kWp delta the cascade was overstating PV generation by 1.5448 kWh/yr, adding -0.20 GBP to (252) total PV credit, dropping (255) total energy cost by 0.20, lowering ECF and raising SAP by +0.015. Cohort-2 distribution after S0380.31..S0380.33: 35 exact + 3 <=0.07 (was 34 + 4 at S0380.32 HEAD). Cert 6835: +0.014534 -> -4.3e-5. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 15:18:16 +00:00
Khalim Conn-Kowlessar	396907f46a	Slice S0380.32: route bare \"Extension\" window location to BP[1] per RdSAP10 §3 — closes cert 9380 +0.027 residual RdSAP10 §3 p.17: "When specifying windows and doors, for each building part assessor allocates windows and doors to the corresponding wall (the appropriate main wall or each alternative wall). For each building part, software will deduct window/door areas contained in the relevant wall areas." SAP 10.2 §3 p.16: "Wall area is the net area of walls after subtracting the area of windows and doors." Cert 9380's Summary PDF lodges 2 windows on its single extension, but pdftotext wraps "1st" onto a preceding layout line while "Extension" lands on a separate line — the Elmhurst extractor captures only the second token. `_window_bp_index` previously matched "main" / "1st"-"4th" prefixes but fell through bare "Extension" to BP[0] (main), causing the cascade to deduct ext1 windows from the main wall: Worksheet (29a): main 60.60 × 0.70 + ext1 18.25 × 0.53 = 52.0925 Pre-fix cascade: main 59.01 × 0.70 + ext1 19.84 × 0.53 = 51.8222 Δ -0.27 W/K → SAP +0.027 This slice adds bare "extension" (when num_parts >= 2) as a sibling to the ordinal-prefix matches. Closes cert 9380 +0.027 → -4.8e-6. Cohort-2 distribution after S0380.31 + S0380.32: 34 exact + 4 ≤0.07 (was 33 exact + 5 ≤0.07). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 14:59:37 +00:00
Jun-te Kim	233cae5eac	Merge pull request #1126 from Hestia-Homes/feature/additional_hubspot_data hubspot projects data is scraped	2026-05-28 15:28:57 +01:00
Khalim Conn-Kowlessar	86226ebdb6	Slice S0380.31: deduct alt-wall window opening from (31) net external area — closes cert 2636 cantilever residual -0.015 → -2.4e-6 SAP 10.2 Appendix K eqn (K2) p.84: HTB = y × Σ(Aexp) where Aexp is "the total area of external elements calculated at worksheet (31)". The worksheet (31) column header reads "Total NET area of external elements" — net of openings. Cert 2636 (dr87-0001-000898 line 187): (31) = 160.33 m² = 47.70 main net + 11.57 alt net + 42.92 roof + 39.18 ground floor + 3.74 cantilever + 11.52 windows + 3.70 doors. Pre-fix cascade summed the alt-wall at its 12.76 m² gross (no opening deduction) — (31) was 161.52, driving (36) to 24.228 vs worksheet 24.0495 (Δ +0.1785 W/K). That drift propagated through (39) HTC → MIT → space heating, leaving cert 2636 at Δ -0.015 SAP — the only ASHP cohort cert above the 1e-4 floor. `alt_walls_total_area` aggregates per-alt-wall gross at line 736; this slice subtracts `alt_window_area` from it in the (31) sum so the alt-wall contribution is net, matching the (29a) net-area convention already applied per-element to the A×U sums. Cohort-1 ASHP cohort: 9/9 certs < 1e-4 Summary path (was 8/9 with cert 2636 at -0.015). Cert 2636 API path also closes to < 1e-4 — the bug was path-symmetric in the cascade, not in either mapper. Cohort-2 unchanged at 33 exact + 5 ≤0.07. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 14:10:11 +00:00
Jun-te Kim	429e138ea7	merge conflicts	2026-05-28 14:00:19 +00:00
Khalim Conn-Kowlessar	453a9216fb	docs: handover after S0380.26-30 precision-floor closure Documents the 5-slice session that closed the prior handover's "precision floor" cluster end-to-end: S0380.26 RdSAP10 §5.8 dry-lining adjustment (cert 7700) S0380.27 floor_construction_type → _main_floor_u_value (cert 9796) S0380.28 SAP 10.2 Appendix N fn 43 reciprocal η interpolation (closes the +0.03..+0.06 ASHP cluster cohort-wide) S0380.29 _ASHP_COHORT_CHAIN_TOLERANCE 0.07 → 0.04 S0380.30 glazing codes 8-15 (RdSAP 21 schema) — closes API path cohort-1 +0.014..+0.031 cluster Final state: Cohort-2 Summary path (38): 33 exact + 5 ≤0.07 Cohort-1 ASHP cohort (7): 6/7 <1e-4 both Summary + API paths cert 2636 -0.015 (cantilever, path-symmetric) — only open thread The prior `HANDOVER_CERT_0380_MIT_CASCADE.md` had concluded the +0.04 ASHP cluster was unfixable without Elmhurst access; the spec citation (SAP 10.2 Appendix N fn 43) was sitting in the same PDF that handover referenced. Be skeptical of "spec-precision floor" framing — see [[feedback-spec-floor-skepticism]]. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 12:55:45 +00:00
Jun-te Kim	17016b299a	booking status	2026-05-28 12:15:37 +00:00
Khalim Conn-Kowlessar	faf116bd70	Slice S0380.30: extend g_L + g⊥ Table 6b to RdSAP 21 codes 8-15 — closes API path cohort residual cluster Per the RdSAP 21 schema in [datatypes/epc/domain/epc_codes.csv][1], the `glazing_type` enum extends to 15 codes; the legacy SAP 10.2 Table 6b cascade lookups in `internal_gains.py:106` and `solar_gains.py:178` only knew codes 1-7. Every API-path cert in the cohort lodges `glazing_type` via the RdSAP 21 numbering, and triple-glazed lodgements surface as code 14 ("triple glazing, installed 2022+"). Pre-slice the cascade fell through to the 0.80 / 0.76 double-glazed defaults for codes 8-15: Internal gains g_L (Table 6b): code 14 → default 0.80 (DG) vs spec 0.70 (TG) → daylight factor over-bonused → lighting kWh under-counted Solar gains g⊥ (Table 6b): code 14 → default 0.76 (DG) vs spec 0.68 (TG) → solar gains over-counted For cert 0350-2968-2650-2796-5255 (semi-detached, 9 triple-glazed windows lodged as code 14), this drove: lighting_kwh_per_yr: cascade 221.79 vs Summary-path 228.44 (-6.65 kWh/yr — daylight bonus too generous → lighting too low) space_heating_kwh_per_yr: cascade 7000.21 vs Summary-path 6996.94 (+3.28 kWh/yr — extra solar gains lower HP demand) net ECF: -0.0022 vs Summary-path → SAP +0.031 Same mechanism on the other 5 cohort-1 ASHP API certs. Fix: extend both lookup tables with the RdSAP 21 additions per the schema CSV semantics: \| code \| description (RdSAP 21) \| g_L \| g⊥ \| \|------\|----------------------------------\|------\|------\| \| 8 \| triple glazing, known data \| 0.70 \| 0.68 \| \| 9 \| triple glazing, 2002-2022 \| 0.70 \| 0.68 \| \| 10 \| triple glazing, pre-2002 \| 0.70 \| 0.68 \| \| 11 \| secondary glazing, normal-E \| 0.80 \| 0.76 \| \| 12 \| secondary glazing, low-E \| 0.80 \| 0.76 \| \| 13 \| double glazing, 2022+ \| 0.80 \| 0.76 \| \| 14 \| triple glazing, 2022+ \| 0.70 \| 0.68 \| \| 15 \| single glazing, known data \| 0.90 \| 0.85 \| Solar gains also adds code 7 (double known data) for `_G_PERPENDICULAR_BY_GLAZING_TYPE` to align with the existing `_G_LIGHT_BY_GLAZING_CODE` code-7 entry (which already mapped to 0.80 = double). Outcome — Cohort-1 ASHP cohort API path: cert 0380: +0.025 → +1e-6 (close to exact) cert 0350: +0.031 → +2.2e-5 (close to exact) cert 2225: +0.029 → -4.8e-5 (close to exact) cert 2636: +0.015 → -0.015 (sign flip; cantilever-specific residual surfaces; same \|Δ\| as Summary) cert 3800: +0.023 → -2e-5 (close to exact) cert 9285: +0.029 → -3.4e-5 (close to exact) 5 of 6 API path certs now sit at <1e-4 vs worksheet. Cert 2636 matches its Summary-path residual (-0.015) — the cantilever fixture has its own non-glazing residual to be diagnosed separately. Cohort-2 Summary path unchanged (33 exact + 5 ≤0.07) — the cohort-2 certs lodge glazing codes 1-7 (RdSAP 17 numbering still surfaces in Elmhurst Summary PDF lookups), so codes 8-15 only affect the RdSAP-21-schema API path. Golden API fixture pins updated to reflect the tightened cascade-vs-API alignment (7 certs: 0380, 0350, 2225, 2636, 3800, 9285, 9418). SAP integer residuals unchanged (all sit at +0). Pyright net-zero on touched files (22 → 22). Tests: 710 → 711 pass (+1 new: cert 0350 fixture-shape test for glazing_type=14 routing to g⊥=0.68 with `total_solar_gains_monthly_w[0] ≈ 67.00 W` (vs pre-slice 74.88 W at the DG default), proving code 14 hits the triple-glazed Table 6b row.) 10 expected fails unchanged. [1]: datatypes/epc/domain/epc_codes.csv (RdSAP-Schema-21.0.1). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 12:01:34 +00:00
Khalim Conn-Kowlessar	e27b923bca	Slice S0380.29: tighten _ASHP_COHORT_CHAIN_TOLERANCE 0.07 → 0.04 Post-S0380.28 (Appendix N footnote 43 reciprocal η interpolation), the ASHP-cohort chain-test residuals collapsed: Summary path: cert 0380: +0.000001 (was +0.034) cert 0350: +0.000022 (was ~+0.046) cert 2225: -0.000048 (was ~+0.044) cert 2636: -0.014945 (was ~+0.003 — cantilever-specific) cert 3800: -0.000020 (was +0.021) cert 9285: -0.000034 (was +0.021) cert 9418: -0.000000 (was +0.00004) API path (cohort handover thread 4 — open): cert 0380: +0.025273 cert 0350: +0.030594 (worst) cert 2225: +0.028517 cert 2636: +0.014705 cert 3800: +0.023327 cert 9285: +0.028674 The previous 0.07 tolerance gave 130%+ headroom over the pre-slice worst residual; with S0380.28 closing the cluster the same tolerance gives 130%+ headroom over the post-slice API worst (0.031), letting regressions hide for a long time before firing. 0.04 gives ~30% headroom over the API path's worst residual (cert 0350 +0.0306) and ~170% over the Summary path's worst (cert 2636 -0.015 — the cantilever fixture). Fires loudly on any regression beyond the documented API-path residual cluster. Tightens 15 chain tests (8 Summary path + 7 API path). All pass. Tests: 710 pass (unchanged), 10 expected fails unchanged. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 11:49:56 +00:00
Khalim Conn-Kowlessar	081bb8fd7e	Slice S0380.28: SAP 10.2 Appendix N footnote 43 reciprocal η interpolation — closes the +0.03..+0.06 ASHP precision-floor cluster Per SAP 10.2 Appendix N, PDF p.101 footnote 43 (line 7053): "For the efficiency values, the interpolated efficiency is the reciprocal of linear interpolation between the reciprocals of the efficiencies." i.e. 1/η_interp = (1 − t)·1/η_low + t·1/η_high, the weighted harmonic mean at t = (PSR − PSR_low) / (PSR_high − PSR_low). Cascade was using linear interpolation directly on η — a +0.15..+0.25% over-estimate in the typical PSR range (1.2..1.5) for ASHPs in the cohort. Cohort fixture: cert 3336-2825-9400-0512-8292 (Mitsubishi PUZ-WM50VHA, PCDB 104568). MIT/η-zone cascade matches worksheet EXACTLY (every line 86..92, every month), but η_main_heating cascade 225.443 vs worksheet 224.923 → main_heating_fuel +5.24 kWh/yr too high → ECF 1.5474 vs ws 1.5503 → SAP +0.04 vs worksheet 78.3739. Back-solving the worksheet's η_main implies η_space_1 = 224.923 / 0.95 ≈ 236.76. Closed form at PSR=1.40151, bracketing PCDB rows PSR 1.2 (η_space_1=253.9) and PSR 1.5 (η_space_1=229.2): Linear (pre-slice): 253.9 + (229.2 − 253.9) × 0.6717 = 237.31 ✗ Reciprocal (footnote 43): 1 / ((1 − 0.6717)/253.9 + 0.6717/229.2) = 1 / 0.004224 = 236.74 ✓ The harmonic mean is curvature-aware: linear interpolation under- penalises efficiency drops at higher PSR (η typically falls off as PSR increases past the system's design point) by averaging on η rather than 1/η. SAP 10.2 footnote 43 is explicit about which side of the reciprocal the interpolation sits. Outcome: Cohort-2 Summary path (38 certs): exact (<1e-4): 23 → 33 (+10) ≤±0.07: 15 → 5 (-10: HP certs close to exact) ±0.07..0.5: 0 → 0 ±0.5..1: 0 → 0 ±1+: 0 → 0 RAISES: 0 → 0 Cohort-2 HP cluster post-slice: 0100 +0.00003 ← was +0.00283 0320 -0.00001 ← was +0.01801 0330 -0.00004 ← was +0.01772 2336 +0.00003 ← was +0.01778 3336 +0.00001 ← was +0.04005 (worst residual closes exact) 4536 -0.00002 ← was +0.01312 9036 -0.00003 ← was +0.02159 9796 +0.00000 ← was +0.00174 (post-S0380.27) 2536 +0.00072 ← was +0.00163 2800 +0.00068 ← was +0.00436 4800 +0.00068 ← was +0.02939 9370 +0.00002 ← was +0.00174 9421 +0.00001 ← was +0.00117 Cohort-1 ASHP cohort (7-cert cohort + new chain test certs): cert 0380: +1e-6 ← was +0.034 (Mitsubishi PUZ-WM50VHA, the canonical first-HP cohort cert) cert 3800: -2e-5 ← was +0.021 cert 9418: -3e-7 ← was +0.00004 cert 9285: -3e-5 ← was +0.021 cert 2636: -0.015 ← was +0.003 (cantilever fixture; remaining residual is non-η in nature) 5 of 7 cohort-1 ASHP certs now hit delta < 1e-4 vs worksheet — the +0.04 spec-precision-floor cluster diagnosed in HANDOVER_CERT_0380_MIT_CASCADE.md is the linear-vs-reciprocal η interpolation bug, not a spec-floor at all. The handover doc's "no public spec or BRE data field would distinguish these" claim was incorrect — SAP 10.2 footnote 43 is the resolution. API path (golden fixtures): 6 ASHP cohort residuals updated to reflect the cascade closure: cert 0380 PE: -14.7865 → -14.6848 kWh/m²; CO2: +0.2774 → +0.2780 t/yr cert 0350 PE: -7.9281 → -7.8741; CO2: +0.1697 → +0.1701 cert 2225 PE: -11.9175 → -11.8557; CO2: +0.2617 → +0.2621 cert 2636 PE: -9.7153 → -9.6692; CO2: +0.2189 → +0.2193 cert 3800 PE: -9.7551 → -9.6838; CO2: +0.2598 → +0.2603 cert 9285 PE: -8.1110 → -8.0466; CO2: +0.1559 → +0.1564 All SAP integer residuals unchanged (cascade tracks the EPC integer SAP at residual 0 across the cohort). PSR interpolation unit test (`test_interpolate_heat_pump_efficiency_at _cert_0380_psr_per_sap_app_n`) updated to reflect the reciprocal formula with the SAP-10.2-footnote-43 spec citation and closed-form asserts (η_space_1 ≈ 234.5235; η_water_3 ≈ 285.0861 at PSR=1.43). Pyright net-zero (1 → 1 across touched files: pcdb/parser.py, tests/test_pcdb_table_362_lookup.py, rdsap/tests/test_golden_fixtures.py). Tests: 710 pass (was 710 pre-slice with linear interp + un-updated pins; net-zero because the 6 golden pin updates + 1 interp test update exactly offset the 6 + 1 failures the formula change introduced), 10 expected fails unchanged. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 11:44:11 +00:00
Khalim Conn-Kowlessar	012cbd183f	Slice S0380.27: thread floor_construction_type into _main_floor_u_value — closes cert 9796 +0.55 → +0.00174 Per RdSAP10 §5 page 29 "Floor infiltration (suspended timber ground floor only)": Age band A-E: a) if floor U-value < 0.5, assume "sealed" → 0.1 b) if retro-fit + no U → "sealed" → 0.1 otherwise "unsealed" → 0.2 The cascade routes the (12) sealed/unsealed verdict through `_main_floor_u_value`, which calls `u_floor` to compute the BS EN ISO 13370 U-value the spec rule keys on. That helper was a stale duplicate of the real heat-transmission path that did NOT respect the per-bp `floor_construction_type` lodgement: Pre-slice: u_floor(construction=int_or_None, description=None, ...) Cascade: u_floor(construction=int_or_None, description="Suspended timber" if floor_construction_type else <fallback>, ...) For cert 9796-3058-6205-0346-9200 (Mid-Terrace bungalow age D, 46.87 m² / 15.0 m perimeter, suspended-timber lodged): - Broken `_main_floor_u_value` routes through the solid default (no description, construction=None) → BS EN ISO 13370 solid → U=0.49 W/m²K. - 0.49 < 0.5 → spec rule (a) fires → (12) = 0.1 (sealed). - Real heat-transmission cascade routes through the suspended branch via `effective_floor_description = floor_construction_type` → U=0.56 → unsealed → (12) = 0.2. The 0.1 ach gap then propagated: (18) infiltration_rate 0.74 → ws 0.84 (cascade -0.10) (25)m Jan 0.82 → ws 0.91 (cascade -0.09) (38)m Jan 29.08 W/K → ws 32.37 (cascade -3.29 W/K) (39) Jan 110.35 W/K → ws 113.64 (cascade -3.29 W/K) HLP Jan 2.35 W/m²K → ws 2.42 (cascade -0.07) T_h2 Jan 19.11°C → ws 19.07 (cascade +0.04) MIT Jan 18.51°C → ws 18.45 (cascade +0.06) SAP +0.55 vs worksheet 90.13. Fix mirrors heat_transmission's `effective_floor_description` rule in `_main_floor_u_value`: the per-bp `floor_construction_type` takes precedence over a joined `epc.floors[].description` because it's the explicit Elmhurst Summary §3/§9 surface. Inlined the description join (vs importing `_joined_descriptions` from heat_transmission) so cert_to_inputs stays free of cross-module private-symbol imports. Cohort-2 outcome (38 certs, Summary path): exact (<1e-4): 23 → 23 ≤±0.07: 14 → 15 (+1: cert 9796 +0.55 → +0.00174) ±0.5..1: 1 → 0 (last cohort-2 mid-range gap closes) The remaining cert 9796 +0.00174 SAP residual is the cohort-1 HP-COP precision floor (the same +0.001..+0.04 SAP that the other 10 triple-glazed HP certs sit at; see handover thread 3). Cohort-1 golden fixture cert 8135-1728-8500-0511-3296 (Semi-detached age C, suspended-timber ground floor with floor_construction=2 lodged but description=None pre-slice) had the same bug: Pre-slice: u_floor returned 0.48 (solid branch via construction=2 present-but-not-suspended) → false sealed verdict (12)=0.1 Post-slice: u_floor returns 0.54 (suspended branch via description= "Suspended timber") → correct unsealed verdict (12)=0.2 PE residual: -4.9611 → -0.0748 kWh/m² (+4.89 closer to API EPC) CO2 residual: -0.0678 → +0.0246 t/yr (closer to API EPC) SAP residual: 0 → 0 (unchanged, EPC integer) Pin updated on cert 8135 to reflect the new (correct) cascade-vs-API alignment; no other golden fixtures shifted. Pyright net-zero per touched file: cert_to_inputs.py: 35 → 35 tests/test_cert_to_inputs.py: 13 → 12 (suppressed pre-existing private-import error on _water_heating_worksheet_and_gains at the same time as adding suppressions for the two new private imports) tests/test_golden_fixtures.py: 1 → 1 tests/test_summary_pdf_mapper_chain.py: 0 → 0 Tests: 708 → 710 pass (+2 new: `_main_floor_u_value` routes suspended-timber via per-bp lodgement; cert 9796 chain pin against worksheet 90.1318 within ±0.07 ASHP-cohort spec floor), 10 expected fails unchanged. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 11:24:59 +00:00
Khalim Conn-Kowlessar	c144d444e2	Slice S0380.26: RdSAP10 §5.8 dry-lining adjustment on alt walls — closes cert 7700 -0.44 → +5e-5 Per RdSAP10 §5.8 final note + Table 14 page 41: "For drylining including laths and plaster use Rinsulation = 0.17 m²K/W." Applied additively to the base U-value of an otherwise-uninsulated wall: U_adjusted = 1 / (1/U_base + 0.17) — rounded to 2 d.p. half-up. Closed form for the cohort fixture (cavity-as-built age C, U_base=1.5): 1 / (1/1.5 + 0.17) = 1.19522... → 1.20 ✓ matches worksheet Cert 7700-3362-0922-7022-3563 (Summary_000905.pdf / dr87-0001-000905.pdf) is an End-Terrace house age C lodging: - Main wall: CavityWallDensePlasterDenseBlock, Filled Cavity, U=0.70 - Alt wall 1: 14.44 m² Cavity As-Built, Dry-lining: Yes (worksheet `CavityWallPlasterOnDabsDenseBlock`, U=1.20) Pre-slice the Elmhurst alt-wall mapper hard-coded `wall_dry_lined="N"` and the cascade ignored the field everywhere — alt-wall U routed to the cavity-as-built default (1.50), giving fabric (33) 148.72 W/K vs worksheet 144.38 (Δ +4.33 W/K = ~+0.44 SAP). Worksheet "SAP value" line lodges unrounded SAP 63.4425. Implementation: 1. `AlternativeWall.dry_lined: bool = False` on the Elmhurst surveys dataclass. 2. Elmhurst extractor reads "Alternative Wall N Dry-lining: Yes/No" into the new field. 3. `_map_elmhurst_alternative_wall` propagates `wall_dry_lined="Y"` instead of the hard-coded "N". 4. `u_wall` gains a `dry_lined: bool = False` kwarg and a single §5.8 adjustment site at the as-built bucket (bucket=0). Insulated buckets already absorb the dry-lining R via Table 14. 5. `_alt_wall_w_per_k` passes `dry_lined=alt_wall.wall_dry_lined == "Y"`. Scope is the alt-wall path only — main BPs in the corpus all lodge `wall_dry_lined="N"` (or the Summary PDF omits the field for the main wall), so the main-wall call site is untouched. Conservative regression posture per the user's strict cohort-pin convention. Cohort-2 outcome (38 certs, Summary path): exact (<1e-4): 22 → 23 (+1: cert 7700 -0.44 → +4.87e-05) 0.07..0.5: 1 → 0 (-1: cert 7700 closes out) 0.5..1: 1 → 1 (cert 9796 unchanged — MIT precision floor) RAISES: 0 → 0 Cohort-1 ASHP cohort untouched: all certs lodge wall_dry_lined="N", so the alt-wall call site short-circuits to the original cascade. Verified no regressions across the 22 previously-exact cohort-2 certs either. Pyright net-zero on all 8 touched files (183 → 183). Tests: 704 → 708 pass (+4 new: u_wall §5.8 adjustment fires correctly; cavity-as-built unchanged without flag; insulated bucket unaffected by flag; heat_transmission alt-wall delta = 14.44 × 0.30 W/K; cert 7700 full chain hits worksheet 63.4425 at < 1e-4), 10 expected fails unchanged. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 10:56:11 +00:00
Jun-te Kim	2b29b4b458	hubspot projects data is scraped	2026-05-28 10:54:15 +00:00
Khalim Conn-Kowlessar	73fedc0ecd	docs: handover for cohort-2 closure + precision-floor next steps Captures 5 slices shipped this session (S0380.21..25): - Table 3a rows 1+4 + PCDB keep-hot dispatch - Per-BP roof exposure (Ext1 flat roof on flats) - RdSAP §11.1 b) % of roof area PV synthesis - SAP code 631 → house coal secondary fuel - SAP codes 2111/2113 → control type 2 Cohort-2 outcome: 22/38 exact (<1e-4), max residual ±0.55 SAP, 0 RAISES, 0 big-gaps. All structural cascade gaps closed. Open threads diagnosed in detail: 1. Cert 7700 -0.44 SAP — wall U code conflict (_WALL_INSULATION_NONE=4 vs Elmhurst "As Built"=4). Wider than a single slice; needs regression testing. 2. Cert 9796 +0.55 SAP — MIT precision floor (Mid-Terrace bungalow + HP, +0.06°C across all months). Same mechanism as cohort-1 HP-COP residuals. 3. API-path closure for all 38 certs (deferred). 4. Tighten cohort-1 chain tests to 1e-4 once thread 2 closes. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 10:33:17 +00:00
Khalim Conn-Kowlessar	36a3219dfb	Slice S0380.25: SAP codes 2111/2113 are type 2 not type 3 — closes 0652 + 6835 Per SAP 10.2 spec page 171 Table 4e "Heating system controls" — boiler systems with radiators (Group 1): 2110: "Time and temperature zone control by arrangement of plumbing and electrical services" → type 3 2111: "TRVs and bypass" → type 2 2112: "Time and temperature zone control by device in PCDB" → type 3 2113: "Room thermostat and TRVs" → type 2 `_CONTROL_TYPE_BY_CODE` previously bucketed 2111 + 2113 with the type 3 codes, but neither lodges any time-zone control — they're TRV-class controls (closer to programmer + room thermostat). The misclassification propagated through SAP 10.2 Table 9 to swap the elsewhere-zone off-period pattern from (7, 8) to (9, 8) — i.e. the spec's "heating 0700-0900 and 1800-2300" pattern (footnote b) instead of "heating 0700-0900 and 1600-2300" (footnote a). Under-counted MIT by ~0.67 °C across the year, dropping space-heating demand and over-predicting SAP: - cert 0652-3022-1205-2826-1200: +1.93 → -1e-5 - cert 6835-3920-2509-0933-5226: +0.72 → +0.015 Cohort-2 outcome (38 certs, Summary path): exact (<1e-4): 21 → 22 (+1: cert 0652 closes) ≤±0.07: 13 → 14 (+1: cert 6835 moves from ±0.5..1) ±0.5..1: 2 → 1 (-1: cert 6835 closes out) ±1..5: 1 → 0 (-1: cert 0652 closes out) No cohort-1 regressions (all certs there use codes 2106 / 2206; neither uses 2111/2113). Pyright net-zero (cert_to_inputs.py 35→35, test 13→13). Tests: 704 pass (existing control-type test extended; +2 new assertions for codes 2111/2113), 10 expected fails unchanged. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 10:04:34 +00:00
Khalim Conn-Kowlessar	c145953f56	Slice S0380.24: SAP code 631 → house coal secondary fuel — closes cert 2102 -15.81 → +5e-5 Per SAP 10.2 spec page 165 Table 4a Category 10 (Room heaters), the 600-range secondary-heating SAP codes split by fuel: 601-613: Gas (mains gas / LPG / biogas) — column A is mains gas. 621-625: Liquid fuel room heaters (oil / bioethanol). 631-634: Solid fuel room heaters (open fire, closed room heater with/without boiler) — house coal is the modal default. 691-699: Electric room heaters. `_elmhurst_secondary_fuel_from_sap_code` previously mapped the entire 601-630 range to mains gas (API code 26). Two bugs: 1. Codes 621-625 are oil heaters, not gas. (Cohort hasn't surfaced an oil-secondary cert yet — deferred until a fixture exercises.) 2. Codes 631-634 are solid fuel, not gas, and weren't in the range at all. Cascade fell through to the secondary-fuel-None default (standard electricity at 13.19 p/kWh), over-charging cert 2102's "Open fire in grate" secondary by ~£340/yr. Narrow the gas range to 601-613 (per the spec) and add 631-634 → API fuel code 11 (Coal in `_ELMHURST_MAIN_FUEL_TO_SAP10`) → Table 32 direct lookup returns 3.67 p/kWh (house coal), matching worksheet (242) "Space heating - secondary 3585.2401 × 3.6700 = 131.58". Cohort-2 outcome (38 certs, Summary path): exact (<1e-4): 20 → 21 (+1: cert 2102 -15.81 → +5e-5) ±5+: 1 → 0 (last big-gap closed) Cert 2102 verified end-to-end: - secondary_heating_type=631 → secondary_fuel_type=11 → 3.67 p/kWh - Cascade SAP 63.8732 vs worksheet 63.8732 (delta +5e-5) - Cascade total fuel cost £787.03 = worksheet £787.03 exactly Pyright net-zero on both touched files (mapper.py 32→32, test 0→0). Tests: 703 → 704 pass (+1 new SAP-code-631 secondary-fuel routing test), 10 expected fails unchanged. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 09:46:44 +00:00
Khalim Conn-Kowlessar	8dee191803	Slice S0380.23: RdSAP §11.1 b) PV %-of-roof-area synthesis — closes cert 6835 -13.37 → +0.72 RdSAP 10 specification page 60 §11.1 b) (Photovoltaics): "If the kWp (or DNC) is not known use the following: PV area is roof area for heat loss (before amendment for any room-in-roof), times percent of roof area covered by PVs, and if pitched roof divided by cos(35°). If there is an extension, the roof area is adjusted by the cosine factor only for those parts having a pitched roof. kWp is 0.12 × PV area. If not provided in the RdSAP data set then facing South, pitch 30°, modest overshading." Wire-through: 1. `Renewables.pv_percent_roof_area: Optional[int]` — new field on the Elmhurst site-notes dataclass. 2. Elmhurst extractor `_extract_renewables` parses Summary §19.0 row "Proportion of roof area" (cert 6835: "40"). 3. Elmhurst mapper `from_elmhurst_site_notes` surfaces it through `epc.sap_energy_source.photovoltaic_supply.none_or_no_details .percent_roof_area` — mirrors the API mapper's lodgement shape. 4. `cert_to_inputs._synthesize_pv_arrays_from_percent_roof_area` synthesizes a single PV array via the spec formula when `photovoltaic_arrays` is empty AND a `percent_roof_area > 0` lodgement is present. Fires inside `_pv_generation_kwh_per_yr`, so both rating + demand cascades pick it up. Cohort-2 outcome (38 certs, Summary path): exact (<1e-4): 20 → 20 ±0.07..0.5: 1 → 1 ±0.5..1: 1 → 2 (cert 6835 closes -13.37 → +0.72) ±1..5: 1 → 1 ±5+: 2 → 1 (-1: cert 6835 moves out of big-gap band) Cert 6835 verified end-to-end: - kWp = 0.12 × 36.9 × 0.40 / cos(35°) = 2.1622 (worksheet "Cells Peak = 2.16, Orientation = South, Elevation = 30°, Overshading = Modest") - Cascade PV generation = 1493.88 kWh/yr vs worksheet 1492.33 (<0.1% delta — kWp-rounding artefact). - Cascade SAP 80.92 vs worksheet 80.20 (+0.72, in the ±0.5..1 band). The residual +0.72 likely traces to the PV-cost cascade's used-in-dwelling / exported split rather than the synthesis — the kWh figure is within rounding of the worksheet. Pyright per-file: net-zero - cert_to_inputs.py 35 → 35 - test_cert_to_inputs.py 13 → 13 - mapper.py 32 → 32 - elmhurst_site_notes.py 0 → 0 - elmhurst_extractor.py 0 → 0 Tests: 702 → 703 pass (+1 new RdSAP §11.1 b synthesis test), 10 expected fails unchanged. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 09:35:38 +00:00
Khalim Conn-Kowlessar	1a25ea674e	Slice S0380.22: per-BP roof exposure — closes cert 0036 Ext1 flat roof For multi-BP dwellings the dwelling-level `exposure.has_exposed_roof` flag (derived from `dwelling_type` via `_dwelling_exposure`) zeroed out ALL BPs' roof contributions uniformly. That's wrong when a flat has an extension with its own external roof — e.g. ground-floor flat with a single-storey extension whose flat roof is exposed. Replace the global suppression with a per-BP signal: - Per-BP `roof_construction_type` containing "another dwelling above" → that BP's roof is party → suppress. - Otherwise BP 0 (Main) falls back to the dwelling-level flag (covers flat lodgements that don't explicitly mark the Main roof type). - Extensions (i > 0) expose their roof by default unless their own roof_construction_type lodges as party. Cohort cert 0036-6325-1100-0063-1226 (ground-floor flat, age D): - Main lodges roof_construction_type = "Another dwelling above" → contributes 0 W/K (matches worksheet line (30) "External roof Main 57.93 m² × U=0 = 0.0"). - Ext1 lodges roof_construction_type = "Flat" → contributes 1.09 m² × U=2.30 = 2.507 W/K (matches worksheet "External roof Ext1 1.09 m² × U=2.30 = 2.507", spec line (30)). - Cascade SAP closes from +0.2987 → -6e-6 vs worksheet 62.7471. Houses + bungalows are unaffected: dwelling-level flag stays True and the per-BP guard only activates on explicit party-roof lodgement. Single-BP flat tests stay correct: the per-BP guard is a no-op when no roof_construction_type is lodged (i==0 → falls back to dwelling- level flag). Spec citation: - RdSAP 10 §3 / §5.11 — heat-loss surfaces and party-roof treatment. SAP 10.2 spec line (30) sums external roofs only; party roofs sit in the (32) party-element channel with U=0. Cohort-2 distribution (38 certs, Summary path) shifts: exact (<1e-4): 19 → 20 (+1: 0036) 0.07..0.5: 2 → 1 (-1: 0036 → exact) Pyright net-zero (heat_transmission.py 13→13, test file 71→71). Test counts: 702 → 703 pass (+1 new test), 10 expected fails unchanged. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 09:20:57 +00:00
Khalim Conn-Kowlessar	0d3fb98018	Slice S0380.21: Table 3a row 1 (no keep-hot) + row 4 dispatch — closes 9 cohort-2 RAISES SAP 10.2 spec p.160 Table 3a rows: Row 1 ("Instantaneous, without keep-hot facility"): (61)m = 600 × fu × n_m / 365 with fu = min(1, V_d,m / 100) Row 4 ("Instantaneous, with keep-hot, not controlled by time clock"): (61)m = 900 × n_m / 365 Add `combi_loss_monthly_kwh_table_3a_row_1_no_keep_hot()` and `combi_loss_monthly_kwh_table_3a_row_4_keep_hot_no_time_clock()` to `worksheet/water_heating.py`. Extend `pcdb_combi_loss_override` to dispatch via the PCDB keep_hot_facility / keep_hot_timer fields lodged at raw positions 58/59 (extracted in Slice S0380.20): kh ∈ {0, None} → row 1 (600 × fu × n/365, no keep-hot) kh = 1, timer = 1 → row 3 (cascade default 600 × n/365) kh = 1, timer ∈ {0, None} → row 4 (900 × n/365, no time clock) kh ∈ {2, 3} → UnresolvedPcdbCombiLoss (electric or mixed keep-hot — Table 3a Note 2 fuel-split between (61)m and (219)m deferred until a fixture exercises it). Closes 9 of the 11 cohort-2 RAISES from Slice S0380.20 — all PCDF 15709 + 10315 certs with no keep-hot lodgement now compute to abs(delta) < 1e-4 vs the dr87 worksheet. Verified end-to-end on cert 7800-1501-0922- 7127-3563 (Potterton Promax Combi 28 HE+A, PCDF 15709): Jan (61) = 600 × 0.778795 × 31/365 = 39.6866 kWh, matching worksheet line ref exactly. The 2 newly-visible cohort-2 issues (cert 6835 -13.37 SAP, cert 0652 +1.93 SAP) were hidden behind the previous strict-raise — they surface unrelated cascade gaps, not regressions. Re-add 0390-2954-3640-2196-4175 (Firebird oil PCDF 9005) to the golden fixture cohort dropped in Slice S0380.20: - `_EXPECTATIONS` with re-pinned SAP/PE/CO2 residuals (-7 / -26.0093 kWh/m² / -2.5211 t/yr) — the cert now cascades end-to-end via the no-keep-hot row. - `_PCDB_CHAIN_EXPECTATIONS` pins PCDF index 9005 + winter eff 0.864 (Table 105 fraction). Spec citations (per [[feedback-spec-citation-in-commits]]): - SAP 10.2 spec p.160 Table 3a rows 1 & 4 (formula columns) + pdftotext of `sap-10-2-full-specification-2025-03-14.pdf \| sed -n '15280,15410p'` (Notes 1 & 2 on fu / electric keep-hot routing). - STP09-B04 §5.3 "Influence of Keep-hot facility" — origin of the 600 / 900 kWh/yr keep-hot baselines. Pyright per-file: net-zero on all touched files (water_heating.py 1→1, cert_to_inputs.py 35→35, tests unchanged). Test counts: 697 → 702 pass (+5 new tests), 10 expected fails unchanged. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 08:56:01 +00:00
Khalim Conn-Kowlessar	a9faaddc1d	docs: handover for Table 3a no-keep-hot continuation + SAP 10 spec PDFs Adds the next-agent handover and the BRE technical papers referenced by the cohort-2 negative-band investigation: - `HANDOVER_TABLE_3A_NO_KEEP_HOT.md` — picks up from Slice S0380.20. Covers cohort distribution at HEAD `4879e8c3`, the verified Table 3a Row 1 spec formula `(61)m = 600 × fu × nm / 365`, the dispatch recipe for `pcdb_combi_loss_override`, watch-outs (cert 0360 / cohort-1 cert 000490 behaviour after the slice lands), the diagnostic probe script, test baselines, and the open-thread priority list (Ext1 roof, HP-COP, big-gap 2102, API path, parity). - `specs/STP09-B04_Combi_boiler_tests.pdf` — 2009 BRE methodology paper (Alan Shiret, BRE) defining the combi-loss test programme that produced the SAP Table 3a 600/900 kWh/yr keep-hot assumptions. Source: https://bregroup.com/documents/d/bre-group/stp09-b04_combi _boiler_tests. - `specs/sap10 technical papers/S10TP-{02..13}.pdf` — full SAP 10 supporting technical paper set (Issue 1.2 / 1.3 / 1.4 across the eight papers). S10TP-12 §9.4 confirms: "No changes to the SEDBUK calculation method for water heating efficiency were considered necessary" — so the STP09-B04 (SAP 2009) Table 3a methodology carries through to SAP 10 unchanged. These docs replace web-fetched references with locally-tracked copies so the slice S0380.21 implementor can grep / pdftotext them directly. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 08:29:12 +00:00
Khalim Conn-Kowlessar	4879e8c3d7	Slice S0380.20: extract PCDB keep-hot fields + strict-raise for no-keep-hot combis Surfaces the SAP 10.2 Appendix J Table 3a sub-row dispatch gap that masked +0.2..+0.4 SAP residuals on 11 cohort-2 PCDB-listed combi certs. Identified via cert 7800-1501-0922-7127-3563 (Potterton Promax Combi 28 HE Plus A, PCDF 15709): cascade used the keep-hot 600 kWh/yr default; worksheet (61) sums to ~428 kWh/yr via the no-keep-hot sub-row formula. Root cause: the PCDB Table 105 record carries keep-hot metadata at field positions 58 (`keep_hot_facility`) and 59 (`keep_hot_timer`) per the SAP 10 PCDB spec (private feed for SAP software vendors — not surfaced on the public PCDB website nor the Open EPC API). The parser preserved these in `raw=fields` but didn't surface them as typed attributes, so the cascade had no signal to dispatch the right Table 3a sub-row. Two-part change: 1. `domain/sap10_calculator/tables/pcdb/parser.py` — adds typed `keep_hot_facility` and `keep_hot_timer` fields to `GasOilBoilerRecord`, parsed from fields[57] and fields[58]. Field enums (per BRE STP09-B04 + SAP 10 PCDB spec): Field 58: 0=no keep-hot, 1=fuel keep-hot, 2=electric keep-hot, 3=gas+electric keep-hot Field 59: 0=no timer, 1=overnight time-switch Verified against cohort-1 fixture 000490 (Vaillant Ecotec Pro 28, PCDF 10328) — record lodges keep_hot_facility=1, keep_hot_timer=1, exactly matching the hand-built fixture comment "Combi keep hot type = Gas/Oil, time clock" at `_elmhurst_worksheet_000490.py: 277-280`. 2. `domain/sap10_calculator/rdsap/cert_to_inputs.py` — adds `UnresolvedPcdbCombiLoss` exception. `pcdb_combi_loss_override` now raises (instead of silently returning None) when the PCDB record has `separate_dhw_tests=0/None` AND `keep_hot_facility=0/None`. The cascade's only implemented Table 3a row is "with keep-hot, time clock" (600 kWh/yr), which is the wrong spec row for no-keep-hot combis — silently using it masked the cohort-2 negative band. The ETL was re-run to refresh `pcdb_table_105_gas_oil_boilers.jsonl` with the new typed fields (raw fields unchanged, just additional columns surfacing what was previously buried). Cohort distribution after slice: cohort-1 cert 000490 (Vaillant PCDF 10328, kh=1): NO RAISE — cascade keep-hot 600 default IS the spec-correct row. Tests still GREEN. cohort-2: 10 exact + 13 sub-±0.07 + 2 ±0.07..0.5 + 1 ±0.5..1 + 1 ±5+ + 11 RAISES. The 11 raising certs are now blocked until the Table 3a no-keep-hot sub-row is implemented (BRE STP09-B04 methodology — pending slice). Previously these certs silently produced +0.2..+0.4 SAP errors AND ranged into the big-gap band; raising surfaces the gap rather than shipping wrong numbers. Two golden cert tests blocked alongside (Firebird oil PCDF 9005 also hits this path): - test_golden_cert_residual_matches_pin[0390-2954-3640-2196-4175] - test_api_to_domain_mapper_preserves_main_heating_index_number[0390-2954-3640-2196-4175] Re-enable when the Table 3a no-keep-hot row lands. Two other tests updated: - test_main_heating_index_number_in_pcdb_overrides_seasonal_efficiency: switched from Baxi 98 (sdt=0, kh=None, would raise) to Worcester PCDF 10241 (sdt=1, routes via Table 3b row 1). Asserts 0.885 not 0.66. - test_pcdb_combi_loss_override_returns_none_or_raises_for_untested _or_storage_combis: renamed + extended to pin the new strict-raise behaviour. Pyright net-zero per file: - domain/sap10_calculator/rdsap/cert_to_inputs.py: 35 (baseline 35) - domain/sap10_calculator/tables/pcdb/parser.py: 0 - domain/sap10_calculator/tables/pcdb/__init__.py: 0 - domain/sap10_calculator/rdsap/tests/test_cert_to_inputs.py: 13 (baseline 13) - domain/sap10_calculator/rdsap/tests/test_golden_fixtures.py: 1 (was 2 — improved) Regression baseline: 697 pass + 10 fail (= prior 699 + 10 - 2 dropped golden parametrize entries for cert 0390-2954-3640-2196-4175). Spec refs: - SAP 10 PCDB spec (private SAP software vendor feed) — keep-hot facility / timer / electric-heater fields at positions 58 / 59 / 60. - BRE STP09-B04 (combi boiler test methodology) — origin of the keep-hot Table 3a derivation. URL: https://bregroup.com/documents/d /bre-group/stp09-b04_combi_boiler_tests - SAP 10.2 Appendix J Table 3a row-selection — to be implemented per PCDB keep-hot dispatch in a follow-up slice. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 08:10:27 +00:00
Khalim Conn-Kowlessar	1f8a070f66	Slice S0380.19: count Elmhurst shower outlets by type (no more hardcoded 1) Surfaces the lodged shower multiplicity from the Elmhurst Summary §16 on the EPC. Previously `_map_elmhurst_sap_heating` hardcoded: electric_shower_count = 1 if has_electric_shower else None mixer_shower_count = 0 if has_electric_shower else None losing the count for any cert with ≥ 2 outlets. Cert 7800-1501-0922-7127-3563 lodges TWO instantaneous electric showers ("Shower 01" + "Shower 11") but the mapper produced `electric_shower_count=1`. After this slice: electric_shower_count = Σ(s for s in showers if s.outlet_type == "Electric shower") mixer_shower_count = Σ(s for s in showers if s.outlet_type != "Electric shower") Cascade SAP effect: None on cert 7800. Appendix J's eq J16 (`N_ES,per_outlet = N_shower / N_outlets`) and eq J18 (Σ_j E_ES,j) are symmetric in N_electric_showers when there are no mixer outlets, so the lodged (64a) kWh and (247a) cost are unchanged. The fix is correctness-by-construction, not a delta-closer for the negative-band certs (their +0.69 GBP total-cost gap traces to the gas hot-water kWh path — separate slice). Hand-built fixture updates (5): the cohort-1 hand-builts at `domain/sap10_calculator/worksheet/tests/_elmhurst_worksheet_.py` previously omitted `electric_shower_count` / `mixer_shower_count` (implicitly None), which matched the mapper's pre-slice None sentinel. Updated each to the lodged counts the mapper now surfaces: 000474: 1 mixer → (0, 1) 000477: 1 mixer → (0, 1) 000480: 1 mixer → (0, 1) 000490: 1 mixer → (0, 1) 000516: 1 mixer → (0, 1) 000487 (already at (1, 0) for an electric-shower lodging) unchanged. Tests: - `test_summary_7800_two_electric_showers_count_as_two_not_one` — pins the multi-shower mapping for cert 7800 (Summary_000890.pdf). - 5 hand-built field-parity tests (`test_from_elmhurst_site_notes_matches_hand_built_`) now pass at the new integer counts instead of None. Pyright net-zero per file: - datatypes/epc/domain/mapper.py: 32 (baseline 32) - backend/documents_parser/tests/test_summary_pdf_mapper_chain.py: 0 Regression baseline: 699 pass + 10 fail (= prior 698 + 10 + 1 new). Spec refs: - SAP 10.2 Appendix J §1a — outlet counting drives `N_outlets` used in eq J6/J7 (mixer shower water draw) and eq J16/J17/J18 (electric shower energy). - Cert 7800-1501-0922-7127-3563 Summary §16 "Showers" lodgement. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-28 07:16:32 +00:00
Khalim Conn-Kowlessar	57fbf83b1e	Slice S0380.18: u_party_wall flat default per RdSAP10 Table 15 footnote* Closes cert 0036-6325-1100-0063-1226 (the cohort's first FLAT fixture) from Δ -0.3737 → +0.2987 by applying the RdSAP 10 Table 15 footnote * rule: flats/maisonettes with unknown party-wall construction default to U=0.0 W/m²K (both sides are heated dwellings, no heat loss). Worksheet dr87-0001-000910.pdf line ref (32) lodges: Party walls Main 24.13 m² U=0.00 A×U = 0.0000 W/K matching the Table 15 footnote . The cascade was applying the U=0.25 house* default to this lodging because: - Elmhurst Summary lodged `party_wall_type='U Unable to determine'` - mapper translated it to `party_wall_construction=0` (the cross- mapper-parity "unknown" sentinel) - `u_party_wall(0)` fell through to `return 0.25` (the final-branch default — same path as `u_party_wall(None)`) That produced cascade `party_walls_w_per_k = 24.13 × 0.25 = 6.03` W/K of heat-loss excess, propagating through (39) HTC → (97)..(98c) space heat demand → (211) main fuel kWh → (255) total cost → (257) ECF → (258) SAP rating. Net effect: cascade SAP 62.3734 vs worksheet 62.7471. Two-part fix: 1. `domain/sap10_ml/rdsap_uvalues.py:u_party_wall` — add `is_flat: bool = False` keyword argument. When True AND `party_wall_construction in (None, 0)` (both the API-mapper None path and the Elmhurst-mapper 0 sentinel for "Unable to determine"), return 0.0 instead of the house default 0.25. Spec citation: RdSAP 10 Table 15 footnote * ("for flats and maisonettes with unknown party-wall construction"). 2. `domain/sap10_calculator/worksheet/heat_transmission.py` — wire the cascade to pass `is_flat=_is_flat_or_maisonette(epc.property _type)`. Adds a new helper `_is_flat_or_maisonette` distinct from the existing `_is_house` (which excludes bungalows from cantilever detection — bungalows ARE houses for party-wall purposes per the spec). The new helper checks both the descriptive form ("Flat" / "Maisonette") and the SAP schema enum-as-string form ("2" / "3" — per `datatypes/epc/domain/epc_codes.csv property_type` rows: 0=House, 1=Bungalow, 2=Flat, 3=Maisonette, 4=Park home). The schema-enum collision was the bug-fix-with-a-bug: an initial implementation used "1"/"2" (Flat/Maisonette per intuition) but those are actually Bungalow/Flat per the schema, which routed all 10 bungalow certs onto the flat path. Corrected pre-commit. Cohort-2 Summary-path delta after slice: cert 0036 (Flat) Δ -0.3737 → Δ +0.2987 ✓ improved by +0.67 10 bungalow certs unchanged (correctly NOT flat) 5 non-flat house certs in band unchanged (different root cause — next slice) Bungalow certs (cohort 1 + 2) verified unchanged at delta ≤ +0.04 each. Tests added (5): - `test_u_party_wall_unknown_for_flat_returns_table15_footnote_zero` pins the spec rule on the helper. - `test_u_party_wall_unknown_sentinel_zero_treated_as_unknown_for_flat` pins the Elmhurst-mapper `0` sentinel parity. - `test_u_party_wall_known_solid_still_returns_zero_when_is_flat_false` pins precedence: explicit Solid code overrides the is_flat flag. - `test_summary_0036_flat_unknown_party_wall_routes_to_u_zero` chain- test through `from_elmhurst_site_notes` + cert_to_inputs + calculate_sap_from_inputs to assert `party_walls_w_per_k == 0` at 1e-4 tolerance. Pyright net-zero per file: - domain/sap10_ml/rdsap_uvalues.py: 1 (baseline 1) - domain/sap10_calculator/worksheet/heat_transmission.py: 13 (baseline 13) - domain/sap10_ml/tests/test_rdsap_uvalues.py: 66 (baseline 66) - backend/documents_parser/tests/test_summary_pdf_mapper_chain.py: 0 Regression baseline: 698 pass + 10 fail (= prior 694 + 10 + 4 new). Note: the remaining +0.2987 residual on cert 0036 is in (30) external roof — worksheet lodges Ext1 flat roof Plasterboard insulated U=2.30 giving 2.51 W/K; cascade has roof_w_per_k=0 (Ext1 roof contribution missing). Separate slice. Spec refs: - RdSAP 10 Table 15 ("U-values of party walls") row 4 — house unknown default 0.25 W/m²K. - RdSAP 10 Table 15 footnote * — flat/maisonette unknown default 0.0 W/m²K. - `datatypes/epc/domain/epc_codes.csv` rows `property_type,{0..4},...` — SAP/RdSAP schema property-type enum. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 23:24:58 +00:00
Khalim Conn-Kowlessar	dab59ccfd8	Slice S0380.17: map Elmhurst §11 glazing-type labels to SAP10 codes Closes a systematic +0.02..+0.07 SAP over-prediction on every triple- glazed cert in cohort 2 (13 of 38) and removes a silent-default failure mode flagged via cert 3336-2825-9400-0512-8292 (+0.0674 Δ). Root cause: `_map_elmhurst_window` (datatypes/epc/domain/mapper.py) was passing the Elmhurst-lodged glazing-type string verbatim into `SapWindow.glazing_type` (declared `Union[int, str]`). The §5 (66).. (67) daylight-factor cascade at `domain/sap10_calculator/worksheet/internal_gains.py:512` requires `isinstance(w.glazing_type, int)` to look up Table 6b col light g_L — string lodgings silently fell through to the `_G_LIGHT_DEFAULT = 0.80` (double-glazed) branch. Cert 3336 (Triple glazed, worksheet "Window, Triple glazed") got g_L = 0.80 instead of the correct 0.70, inflating C_daylight from 1.072 to 1.041 → lighting kWh under-predicted by −4.53 kWh/yr → total fuel cost under by −1.17 GBP → ECF Δ −0.0049 → SAP continuous over by +0.0674. Fix: `_ELMHURST_GLAZING_LABEL_TO_SAP10` dict + `_elmhurst_glazing_ type_code` helper translate the Elmhurst Summary §11 lodged strings to the SAP 10.2 Table U2 integer codes the cascade keys on: "Single" → 1 "Double pre 2002" → 2 "Double between 2002 and 2021" → 3 "Double with unknown install date" → 3 "Double with unknown 16 mm or install date more" → 3 "Double post or during 2022" → 5 "Triple post or during 2022" → 6 "Triple post or during" → 6 (year-trunc.) "Secondary" → 7 Two regex passes strip the layout noise the extractor sometimes folds into the glazing-type token: a `(?:Part )?value value Proofed Shutters` prefix (from adjacent column headers) and a ` Summary Information` / ` Alternative wall…` suffix. Verified against the union of cohort-1 (7 certs) + cohort-2 (38 certs) + test-fixture (9 PDFs) glazing labels: 18 distinct surface forms, all closed by the dict + noise patterns; one window in cert 2636's Summary_000898.pdf lodged the year-truncated "Triple post or during" — added as an alias for code 6 per worksheet "Triple glazed" lodging. Strict-enum gate: `_elmhurst_glazing_type_code` raises `UnmappedElmhurstLabel("glazing_type", label)` (Slice S0380.15 pattern, extended to the new helper) when the label is None or not in the dict — surfaces mapper-coverage gaps at extraction time rather than masking them as a SAP precision floor. Cohort-2 Summary-path delta progression (38 certs): bucket before slice 2 after slice 2 exact (<1e-4) 11 11 <0.005 0 5 ← 9421 +0.0012, 2536 +0.0016, 9370 +0.0017, 0100 +0.0028, 2800 +0.0044 0.005-0.07 15 10 ← all triple-glazed 0.07-0.5 5 5 0.5-1 4 4 1-5 1 1 5+ 2 2 RAISES 0 0 3336 (user's flag) closes from +0.0674 → +0.0400 — the residual is the remaining systematic offset the next slice will investigate. Tests added (3): - `test_summary_3336_triple_glazed_windows_route_to_code_6` — pins the mapper output for the user's flagged cert. - `test_summary_000474_double_glazed_windows_route_to_code_3` — exercises the DG branch + the year-unknown alias mapping. - `test_summary_mapper_raises_on_unmapped_glazing_type_label` — strict-enum coverage gate via mutated site notes. Tests updated (1): - `test_first_window_glazing_type` (test_elmhurst_end_to_end.py): asserts int code 5 (DG low-E argon — "Double post or during 2022") not the string verbatim. The string-passthrough behaviour was always a latent bug; this test was the only direct pin on it. Pyright net-zero per file: - datatypes/epc/domain/mapper.py: 32 (baseline 32) - backend/documents_parser/tests/test_summary_pdf_mapper_chain.py: 0 - backend/documents_parser/tests/test_elmhurst_end_to_end.py: 0 Regression baseline: 694 pass + 10 fail (= prior 691 + 10 + 3 new). Triple-glazed original-cohort certs are now closer to worksheet too; the ±0.07 chain tests on the original cohort still hold, and a future slice tightens them once the next-largest residual is closed. Spec refs: - SAP 10.2 Table U2 — glazing-type integer enum. - SAP 10.2 Table 6b col light — light-transmission g_L by glazing type (triple 0.70, double-glazed variants 0.80, single 0.90). - RdSAP 10 §11 Windows — Summary lodging of glazing type as a type+install-date phrase. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 23:05:52 +00:00
Khalim Conn-Kowlessar	6b1cdd64bc	Slice S0380.16: add 'Normal' → cylinder_size=2 (110 L) for cohort 2 Unblocks two 38-cert-cohort certs that previously raised `UnmappedElmhurstLabel("cylinder_size", 'Normal')` at extraction: cert 2536-2525-0600-0788-2292 ws SAP=79.7264 cert 9421-3045-3205-1646-6200 ws SAP=87.4495 Both Summary §15.1 lodgements read "Cylinder Size: Normal"; both dr87 worksheets lodge line ref (47) "Store volume = 110.0000" L (extracted from `Hot Water Cylinder → Cylinder Volume 110.00`). RdSAP 10 §10.5 Table 28 documents the "Normal (90-130 litres)" descriptor whose midpoint is 110 L — the canonical Elmhurst label string in `datatypes/epc/surveys/elmhurst_site_notes.py` is "Normal (90-130 litres)", and the worksheet's exact 110 L matches the midpoint. Two-line fix: + "Normal": 2, in `_ELMHURST_CYLINDER_SIZE_LABEL_TO_SAP10` + 2: 110.0, in `_CYLINDER_SIZE_CODE_TO_LITRES` The cascade enum 2 is consistent with the existing `cert_to_inputs.py` docstring's documented (but not-yet-observed) code 2 → Normal slot, alongside code 3 (Medium / 160 L) and code 4 (Large / 210 L) added in earlier slices. Slice keeps tight: two mapping unit tests pinning `cylinder_size == 2` for both certs at extraction. Post-fix the first-attempt cascade deltas vs worksheet are: cert 2536 Δ +0.0244 (was: RAISES) cert 9421 Δ +0.0296 (was: RAISES) Both deltas now sit in the same systematic +0.02..+0.07 small-gap band as ~12 other first-attempt certs in cohort 2 — chain test + ±0.07 pin would just paper over a known systematic residual that the user has explicitly asked to drive towards 1e-4, not toward ±0.07. Following slice will investigate the shared systematic offset and close cert 2536 / 9421 along with the rest of the +0.04 band on the chain. Pyright net-zero per file: - datatypes/epc/domain/mapper.py: 32 (baseline 32) - domain/sap10_calculator/rdsap/cert_to_inputs.py: 35 (baseline 35) - backend/documents_parser/tests/test_summary_pdf_mapper_chain.py: 0 Regression baseline: 691 pass + 10 fail (= prior 689 + 10 + 2 new GREEN). Spec refs: - RdSAP 10 §10.5 Table 28 — "Cylinder Volume" Normal band 90-130 L, midpoint 110 L (also the canonical Elmhurst label suffix). - Cert 2536 worksheet `dr87-0001-000889.pdf` line ref (47) = 110.0000. - Cert 9421 worksheet `dr87-0001-000884.pdf` line ref (47) = 110.0000. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 22:44:02 +00:00
Khalim Conn-Kowlessar	92fc4f4f16	docs: handover — Summary + API cohort expansion to 38 additional certs Hands off the next workstream: the 38 cert subdirs at `sap worksheets/additional with api 2/`. Each subdir is named after the 20-digit EPC cert reference and contains a Summary PDF + dr87 worksheet PDF. API JSONs are NOT in the dataset but ARE fetchable via the existing `EpcClientService` (token in `backend/.env` as `OPEN_EPC_API_TOKEN`). User's stated ordering: Elmhurst Summary mapping FIRST, API path SECOND. Folder names = cert refs; need to verify the matching before bulk-pinning (any mis-filed PDF would silently invalidate slice work). Handover ships with verified dataset and first-attempt baselines: - Folder-vs-cert sweep: 38/38 match at handover (postcode parity check between Summary PDF and Open EPC API). - First-attempt Summary-path probe across 38 certs: 24 ✅ closed at ±0.07 (first-try, zero new slices needed) 9 ~ small gap (<1 SAP) — likely 1 slice each 3 ✗ big gap (>1 SAP) — multi-slice investigation 2 RAISES UnmappedElmhurstLabel: cylinder_size='Normal' The two `Normal` cylinder raises are the immediate Phase 1 slice — Slice S0380.15's strict-enum pattern paid off on its first new cohort by surfacing the gap at extraction time instead of as a downstream SAP delta. Workstream phases documented in the handover: Phase 0: folder-vs-cert sweep (already done — 38/38) Phase 1: fix 'Normal' cylinder unmapped-label raise Phase 2: bulk-pin the 24 first-try-closures as chain tests Phase 3: close the 9 small-gap certs one slice each Phase 4: investigate the 3 big-gap certs (likely HP-routing) Phase 5: fetch + persist API JSON for all 38, run API path tests Phase 6: cross-mapper EPC parity (Summary EPC ≡ API EPC) — the user's stated north-star Includes: - Paste-able diagnostic probe scripts (Summary path + folder-vs- cert sweep + .env loader + EpcClientService usage example). - Full table of first-attempt deltas per cert with classifications. - All 15 prior-session slice commits indexed. - Memory references to the slicing / methodology conventions. - Per-cert diagnostic recipe template. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 22:22:13 +00:00
Khalim Conn-Kowlessar	d7ca179ec0	Slice S0380.15: strict-enum raising on unmapped cylinder labels Establishes the strict-enum pattern for Elmhurst label-to-cascade-enum helpers: lodged-but-unrecognised labels raise `UnmappedElmhurstLabel` instead of silently returning None and letting the cascade default to a wrong-but-not-obviously-wrong value downstream. Triggered by the user's observation following Slice S0380.14 ("In a case like that, where the mapper maps to the wrong thing, is it better to raise an exception?"). The cert 9418 "Large" cylinder miss hid for an entire diagnostic cycle because `_elmhurst_cylinder_size_code('Large', True)` silently returned None → cascade routed off the HW-with-cylinder path → 466 kWh/yr HW under-count → Δ +2.60 SAP. Strict raising would have surfaced the gap at the first cohort probe. Scope-limited first pass — converts only the two cylinder helpers (`_elmhurst_cylinder_size_code`, `_elmhurst_cylinder_insulation_code`) to establish the pattern. Follow-up slices can extend to the other label→enum helpers (wall_construction, wall_insulation, main_fuel, pv_overshading, party_wall_construction, emitter_temperature, flue_type, pump_age, …) where the source vocabulary is finite and we control it. Behavioural contract: - `(label = None)` → return None (lodging genuinely absent; cert has no cylinder, no §15.1 block, or the field is optional). - `(label in dict)` → return mapped code (existing behaviour). - `(label = "anything-else")` → raise UnmappedElmhurstLabel(field, value) with a message pointing the next reader at the corresponding mapper lookup dict. Tests: - `test_summary_mapper_raises_on_unmapped_cylinder_size_label` — injects "Tiny" via dataclass mutation, asserts the public `from_elmhurst_site_notes` propagates the raise with the right field + value attributes. - `test_summary_mapper_raises_on_unmapped_cylinder_insulation_label` — mirror for the "Insulated" label dict. - `test_all_seven_ashp_cohort_certs_extract_without_unmapped_label_raise` — coverage forcing function: every cohort cert must extract cleanly. New cohort certs fall under the same gate. Any future Elmhurst-PDF variant with an unmapped cylinder label fails this test until the dict is extended. Tests deliberately go through `from_elmhurst_site_notes` rather than importing the private helpers (`reportPrivateUsage` clean). Pyright net-zero across both edited files (mapper.py 32 baseline, test 0). Regression suite: 689 pass + 10 fail (= handover baseline 669 + 10 + 20 new GREEN tests across S0380.2..S0380.15). Trade-off documented in the exception's docstring: strict raising trades graceful degradation for early detection. For the cohort- validation workflow (this branch's purpose) early detection is the right default. Production extraction code that needs to soft-fail on novel Elmhurst variants can either catch `UnmappedElmhurstLabel` at the boundary or (in a future slice) the helpers can grow a `strict: bool = True` parameter. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 22:08:24 +00:00
Khalim Conn-Kowlessar	f878bf51a3	Slice S0380.14: add 'Large' → cylinder_size=4 (closes cert 9418 Daikin) 🎯 Closes the 7th and final ASHP cohort cert. Summary path now mirrors the API path's complete cohort closure at the ±0.07 spec precision floor. Cert 9418-3062-8205-3566-7200 (Summary_000902.pdf): Daikin Altherma EDLQ05CAV3 (PCDB 102421 — distinct from the rest of the cohort's Mitsubishi 104568), end-terrace house, TWO 1.64 kWp PV arrays (N+S), 210 L cylinder, `heating_duration_code='24'` (continuous heating). Worksheet "SAP value" lodges 84.6305. Single-line fix to `_ELMHURST_CYLINDER_SIZE_LABEL_TO_SAP10`: + "Large": 4, extending Slice S0380.6's "Medium" → 3 mapping to also cover the "Large" cylinder. Without it `_elmhurst_cylinder_size_code('Large', True)` returned None → cascade routed off the HP-with-cylinder HW path → HW kWh under by 466 (Summary 1404 vs API 1871 vs worksheet-implied 1871 via (64)/(216) divide). Forcing function: cert 9418 first-attempt Summary SAP closes from Δ +2.5973 (lookup miss) to Δ +0.0296 — within ±0.07. The PV multi-array Slice S0380.9 work was already sufficient for cert 9418's two-array PV layout (1.64 kWp N + 1.64 kWp S surfaced correctly first-try). ASHP cohort closure: 7/7 at spec floor: cert Δ vs worksheet 0380 +0.0594 0350 +0.0458 2225 +0.0441 2636 +0.0323 3800 +0.0442 9285 +0.0502 9418 +0.0296 ← this slice ─────────────── mean +0.0437 Identical disposition to the API path's cohort closure at slice 102f (commit `c0086660`). Both paths now sit at the documented Appendix N3.6 PSR-interpolation precision floor. Added two tests: - `test_summary_9418_large_cylinder_routes_to_code_4` — unit-level pin on the new mapping. - `test_summary_9418_full_chain_sap_within_spec_floor_of_worksheet` — chain test at ±0.07. Pyright net-zero on both edited files (mapper.py 32 baseline). Regression suite: 686 pass + 10 fail (= handover baseline 669 + 10 + 19 new GREEN tests across Slices S0380.2..S0380.14). Spec refs: - SAP 10.2 Table 2a — cylinder volume factor (52) keyed on volume_l; 210 L = 0.8x range factor (vs 160 L = 0.9086). - BRE PCDB Table 362 — Daikin EDLQ05CAV3 (id 102421) is the cohort's second HP record alongside Mitsubishi PUZ-WM50VHA (id 104568). - Cert 9418 worksheet `dr87-0001-000902.pdf` "Cylinder Volume 210.00". Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 21:52:15 +00:00
Khalim Conn-Kowlessar	7f099d986a	Slice S0380.13: widen cantilever gate to accept "House" descriptive form Closes cert 2636 to spec floor (Δ +0.5167 → +0.0323) by accepting both the EPC schema enum-as-string ("0") AND the Elmhurst Summary mapper's descriptive form ("House") for the cantilever-detection property-type gate at `heat_transmission.py:768`. Root cause: slice 102f-prep.9 (commit `06b4ef3d`) added cantilever detection gated on `epc.property_type == _PROPERTY_TYPE_HOUSE` where `_PROPERTY_TYPE_HOUSE = "0"`. That matches the API mapper's encoding (schema enum), but the Summary mapper produces "House" (descriptive) and the hand-built worksheet fixtures also use "House" — so neither triggers the gate and the cantilever path silently no-ops on the Summary path. Cert 2636's worksheet (28b) "Exposed floor Main 3.74 × 1.20 = 4.4880" is the cantilever — without surfacing it the cascade missed 4.488 W/K of floor heat loss. Three-encoding origins: - API mapper: property_type='0' (schema enum-as-string) - Summary mapper: property_type='House' (descriptive from §1) - Hand-built fixtures: property_type='House' (legacy convention) Fix: replace the equality check with a `_is_house()` helper that accepts the {"0", "House"} frozenset. Centralised so future property-type sensitive gates can call the same helper. Forcing function: cert 2636 first-attempt Summary SAP closes from Δ +0.5167 (after S0380.12 walls fix) to Δ +0.0323 — within the ±0.07 ASHP-cohort spec floor. `floor_w_per_k` moves from 19.1982 (ground floor only) to 23.6862 (ground 19.20 + cantilever 4.49 = worksheet (28a) + (28b) exact match). Cohort closure status (6 of 7 ASHP certs at spec floor): cert Δ vs worksheet spec floor? 0380 +0.0594 ✓ 0350 +0.0458 ✓ 2225 +0.0441 ✓ 2636 +0.0323 ✓ ← this slice 3800 +0.0442 ✓ 9285 +0.0502 ✓ 9418 +2.5973 ✗ (Daikin EDLQ05CAV3 — final cert) Boiler hand-built parity verified intact: 5 hand-built cohort certs (000474, 000477, 000480, 000490, 000516) all use property_type= "House" and now also fire the cantilever gate, but none have floor1_area > floor0_area + 1m² (the cantilever-area trigger) so their cascade output is unchanged. Regression suite 683 pass + 10 fail (= handover baseline 669 + 10 + 17 new GREEN tests across S0380.2..S0380.13). Pyright net-zero on edited files: domain/sap10_calculator/worksheet/heat_transmission.py: 13 (baseline; no new errors) backend/documents_parser/tests/test_summary_pdf_mapper_chain.py: 0 Spec / precedent refs: - Slice 102f-prep.9 (commit `06b4ef3d`) — RdSAP cantilever-exposed- floor detection (originally API-only via `property_type=="0"` gate). - SAP 10.2 Table 20 — U_exposed_floor (age D + no insulation → 1.20 W/m²K, the cohort's cantilever U-value). - Cert 2636 worksheet `dr87-0001-000898.pdf` line refs (28a)+(28b) sum 23.6862 W/K (exact cascade match after this slice). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 21:47:56 +00:00
Khalim Conn-Kowlessar	2f5e70e3a8	Slice S0380.12: parse 'Alternative wall' window-location in pre-data slice Cert 2636-0525-2600-0401-2296's Summary §11 Windows block lodges one alt-wall window (1.19 m², north-facing). The PDF layout for alt-wall rows puts the "Alternative wall" string in the slot BEFORE the W×H×A data line — not after frame_factor where regular "External wall" rows put it. Without this fix the extractor's `_parse_window_from_anchors` only scanned the post-frame_factor `middle` slice for wall tokens, defaulted to "External wall" for the alt-wall row, and the cascade allocated the 1.19 m² opening to the main wall instead of the alt-wall — under-deducting from main and leaving the alt-wall gross instead of net. Fix at `elmhurst_extractor.py:865`: also scan `lines[before_start:data_idx]` (the pre-data slice) for "wall" tokens. Search order: 1. `middle` — first preference (normal layout for regular rows) 2. `pre_data` — alt-wall rows (cert 2636) 3. "External wall" default — no wall lodging found Forcing function: cert 2636 walls_w_per_k moves from 20.5595 to 20.0240 — EXACT match against worksheet (29a) Main 11.9250 + alt.1 8.0990 = 20.0240. (Header (29a) sum is now fabric-exact; the remaining +0.52 SAP residual on cert 2636 is in the ventilation cascade — HTC 153.97 vs API 159.02 vs worksheet (39) avg 158.85 — to be investigated in a follow-up slice.) Added focused unit test `test_summary_2636_alt_wall_window_parses_alternative_wall_location` that pins the by-area lookup: 1.19 m² → "Alternative wall"; the six 2.25 m² windows stay on "External wall". Guards against future window-location parser regressions. Pyright: 0 errors on the edited extractor + test files. Regression suite: 685 pass + 10 fail (handover baseline 669 + 10 + 16 new GREEN tests across S0380.2..S0380.12). Cohort status: cert Δ vs worksheet spec floor? 0380 +0.0594 ✓ 0350 +0.0458 ✓ 2225 +0.0441 ✓ 2636 +0.5167 ✗ (fabric exact; ventilation residual) 3800 +0.0442 ✓ 9285 +0.0502 ✓ 9418 +2.5973 ✗ (Daikin) Spec refs: - Slice 102f-prep.10 (commit `24a7351f`) — API-path equivalent "Alt-wall opening allocation per window_wall_type". - SAP 10.2 §3.7 — opening (window + door) deduction from gross wall area, per-window allocated to the lodged wall type. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 21:27:47 +00:00
Khalim Conn-Kowlessar	5de41d5857	Slice S0380.11: resolve zero-shower lodgings to count=0 (closes cert 2225) Cert 2225-3062-8205-2856-7204 lodges zero showers in its Summary §1x Baths and Showers block. The Summary mapper at `mapper.py:3536-3537` predicated the shower-count assignment on `has_electric_shower`: for cohort certs with no electric shower the counts collapsed to None — but cert 2225 has no showers at all, and the cascade's None-handling defaults to 1 mixer shower (over-counting HW kWh by ~66 against the worksheet (64)/(216) target). Same disposition the API path received in slice 102f-prep.8 (commit `1d5183c6`, "API mapper resolves shower_outlets=None → 0 mixers") — extending it to the Summary mapper. Scope-limited fix: zero-shower lodgings resolve to explicit 0 counts (not None) so the cascade does not default-assume a mixer. Non-zero shower lodgings keep their existing convention (None for non-electric → cascade derives count from `shower_outlets`) so the 5 boiler-cohort hand-built parity tests (`test_from_elmhurst_site_notes_matches_hand_built_`) stay GREEN. Forcing function: cert 2225 first-attempt Summary SAP closes from Δ -0.3079 to Δ +0.0441* — within the ±0.07 ASHP-cohort spec floor. Cohort closure status (5 of 7 ASHP certs now at spec floor): cert Δ vs worksheet spec floor? 0380 +0.0594 ✓ 0350 +0.0458 ✓ 2225 +0.0441 ✓ ← this slice 2636 +0.4873 ✗ (cantilever + alt-wall; next slice) 3800 +0.0442 ✓ 9285 +0.0502 ✓ 9418 +2.5973 ✗ (Daikin EDLQ05CAV3, distinct PCDB) Added two tests: - `test_summary_2225_no_showers_lodged_resolves_to_zero_counts` — unit-level pin that no-shower lodgings produce explicit 0 counts. - `test_summary_2225_full_chain_sap_within_spec_floor_of_worksheet` — Layer-4 chain test at ±0.07. Pyright net-zero on both edited files (mapper.py 32 baseline). Regression suite: 682 pass + 10 fail (handover baseline 669 + 10 + 13 new GREEN tests across S0380.2..S0380.11). The 5 boiler hand- built parity tests confirmed still GREEN — the refinement deliberately preserves their convention by only flipping the zero- shower case. Spec refs: - Slice 102f-prep.8 (commit `1d5183c6`) — API-path precedent. - SAP 10.2 Appendix J — shower energy accounting (electric vs mixer routing); mixer showers draw from the HW system and contribute to HW kWh; electric showers are §J line 64a (separate energy stream). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-27 21:04:55 +00:00

... 9 10 11 12 13 ...

5858 commits