Model

mirror of https://github.com/Hestia-Homes/Model.git synced 2026-06-08 11:17:27 +00:00

Author	SHA1	Message	Date
Khalim Conn-Kowlessar	d793ae8851	Slice S0380.160: SAP 10.2 Table 5a wet-pump gate for central heating gain SAP 10.2 Table 5a (PDF p.177) row "Central heating pump in heated space" only applies to mains with a water-loop circulation pump. Footnote a) names two exclusions verbatim ("Does not apply if a heating system used solely for domestic hot water. ... Not applicable for electric heat pumps from database."), and the row's name carries the implicit third: dry mains with no central heating pump (electric storage heaters, electric direct-acting, solid-fuel room heaters without back-boilers) — the row simply doesn't list them. Pre-slice `internal_gains_from_cert` gated only on Note a) (HP exclusion), applying `central_heating_pump_w(date_category=...)` to every non-HP main. The default UNKNOWN-date branch added 7 W of pump gain to (70)m for every dry-system fixture in the controlled-variable corpus, even though the worksheet (70)m = 0 every month. Per-line walk on electric 3 (SAP code 401 "Manual charge control"): cascade (73)[Jan] = 640.21 W worksheet (73)[Jan] = 633.21 W delta = +7.00 W cascade (70)[Jan] = 7.00 W worksheet (70)[Jan] = 0.00 W Table 5a inapplicable The +7 W winter-month gain lowered cascade SH demand by ~38 kWh/yr (cascade 11050 vs worksheet 11088). At Table 32 18-hour low-rate ~7.4 p/kWh that's £2.50/yr under-charging — matching the cluster's uniform Δcost = -£1.96..-£2.80 pattern. Continuous SAP rose ~+0.10 because cost dominates the ECF. Fix: new `_any_main_system_has_central_heating_pump(epc)` predicate in `internal_gains.py`, mirroring `cert_to_inputs._is_wet_boiler_main` (S0380.149 — Table 4f kWh side). Wet if any non-HP main lodges: - sap_main_heating_code in {101-141, 151-161, 191-196} (gas/oil/ solid-fuel/electric boilers per Table 4a/4b), - main_heating_index_number (PCDB Table 322 record), - main_heating_category in {1, 2} (RdSAP central heating), OR - heat_emitter_type in {1, 3} (radiators / fan-coil per Table 4d). Dead `_all_main_systems_are_heat_pumps` helper removed (the new predicate subsumes its role). Cluster closures (10 variants): electric 3: SAP +0.1215 → -0.0000, cost -£2.80 → -£0.00 electric 5: SAP +0.1081 → -0.0000, cost -£2.49 → -£0.00 electric 6: SAP +0.1081 → -0.0000, cost -£2.49 → -£0.00 electric 7: SAP +0.1017 → -0.0000, cost -£2.34 → -£0.00 electric 8: SAP +0.0941 → -0.0000, cost -£2.17 → -£0.00 electric 9: SAP +0.1199 → -0.0000, cost -£2.76 → -£0.00 solid fuel 4: SAP +0.0850 → -0.0000, cost -£1.96 → -£0.00 solid fuel 9: SAP +0.1072 → -0.0000, cost -£2.47 → -£0.00 solid fuel 10: SAP +0.1134 → +0.0000, cost -£2.61 → -£0.00 solid fuel 11: SAP +0.0912 → +0.0000, cost -£2.10 → +£0.00 Σ \|ΔSAP_c\| across 25-variant cohort: 1.24 → 0.18. All 10 cluster variants now join the lighting-PE +48.66 / CO2 +11.95 deferred cohort (Elmhurst-vs-spec monthly factor quirk, same shape as electric 1 + solid fuel 5/6/7/8 from prior closures). Verbatim spec quote (SAP 10.2 Table 5a row 1, PDF p.177): "Central heating pump in heated space, 2013 or later 3 a)" "Central heating pump in heated space, 2012 or earlier 10 a)" "Central heating pump in heated space, unknown date 7 a)" The row name ("Central heating pump") gates by construction: dry systems have no central heating pump and the row's three sub-rows don't apply. No regressions on the other 31 variants or any golden fixture; the 6 Elmhurst U985 fixtures lodge PCDB index → the new predicate returns True → pump_w unchanged. Tests: 904 pass (+1), 0 fail. Pyright net-zero (35 → 35). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 15:40:25 +00:00
Khalim Conn-Kowlessar	5d086f3ed1	Slice S0380.159: SAP 10.2 Table 4a R tariff-aware dispatch for electric storage SAP 10.2 Table 4a (PDF p.166) Cat 7 "Electric storage heaters" splits the responsiveness R between two sub-tables: Off-peak tariff: Slimline storage heaters ... R = 0.2 402 Convector storage heaters ... R = 0.2 403 Slimline + Celect-type control ... R = 0.4 405 Convector + Celect-type ctrl ... R = 0.4 406 24-hour heating tariff: Slimline storage heaters ... R = 0.4 402 Convector storage heaters ... R = 0.4 403 Slimline + Celect-type control ... R = 0.6 405 Convector + Celect-type ctrl ... R = 0.6 406 Per SAP 10.2 §12.4.3 (PDF p.36) the 18-hour tariff has electricity at low rate for 18 hours per day with at most 6h of interruption / 2h max each — operationally equivalent to 24-hour for storage-heater charging. The cascade therefore routes EIGHTEEN_HOUR + TWENTY_FOUR_ HOUR through the 24-hour Table 4a sub-row. Pre-slice `_responsiveness` keyed on `sap_main_heating_code` only and returned R=0.2 for code 402 regardless of tariff. The existing docstring already flagged the gap: 402: 0.20, # Slimline storage heaters (24-hr tariff: 0.40) ... "promote to (sap_code, tariff) lookup when 24-hour fixture surfaces; until then the off-peak default applies (under-shoots R for the 24-hour case)." Per-line walk on electric 5 (sap_main_heating_code=402 + meter_type="18 Hour"): cascade T_living (87)[Jan] = 20.1213 vs worksheet 19.6519, (92)[Jan] = 18.6996 vs worksheet 18.2063, (93) [Jan] = 19.0996 vs worksheet 18.6063 (cascade +0.4933 K throughout the cascade). Back-solve from worksheet T_living=19.6519 via the Table 9b Tsc formula: Tsc(R=0.4) = 0.6 × (21-2) + 0.4 × (4.3 + 0.9933 × 705.4/210.23) = 11.4 + 0.4 × 7.6325 = 14.4528 ΔT = 21 - 14.4528 = 6.5472 u_sum = 0.5 × 6.5472 × (7² + 8²) / (24 × 11.43) = 1.3481 T_living = 21 - 1.3481 = 19.6519 EXACT match. Adds: - `_CONTINUOUS_CHARGING_TARIFFS: frozenset[Tariff]` = {EIGHTEEN_ HOUR, TWENTY_FOUR_HOUR} — the tariffs treated as "24-hour heating" for Table 4a R selection. - `_RESPONSIVENESS_24_HOUR_OVERRIDE_BY_SAP_CODE: dict[int, float]` — the override table for codes 402/403/405/406 (404, 407, 409 keep the same R in both sub-tables). - `tariff: Optional[Tariff]` parameter to `_responsiveness`, with the override consulted before the off-peak default. - Tariff threaded through both call sites of MIT cascade (rating + demand paths) via `tariff_from_meter_type`. Closures electric 5: ΔSAP −1.1759 → +0.1081 (91% reduction) Δcost +£27.09 → −£2.49 ΔCO2 +62.72 → +7.30 kg ΔPE +438.03 → +0.07 kWh (essentially EXACT) Electric 5 now joins the same residual cluster as electric 3/6/7/8/ 9 (+0.09..+0.12 SAP, −£2..−£3 cost, +£7 CO2) — the cluster that the prior handovers suspected was a shared shave-the-residual gap. No regressions on the other 24 cohort variants. Extended handover suite: 903 pass / 0 fail (was 902 — +1 from the new AAA test). Pyright net-zero (43 → 43). Σ \|ΔSAP_c\| across the 25-variant cohort: 2.30 → 1.24 (~46% reduction from this slice). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 15:40:25 +00:00
Khalim Conn-Kowlessar	4291193ba8	Slice S0380.158: SAP 10.2 Table 4f warm-air heating system fans SAP 10.2 Table 4f (PDF p.174) row "Warm air heating system fans" + footnote e) — verbatim: Warm air heating system fans e) SFP × 0.4 × V e) SFP is the specific fan power from the database record for the warm air unit if applicable; otherwise 1.5 W/(l/s). These values of SFP include the in-use factor. If the heating system is a warm air unit and there is balanced whole house mechanical ventilation, the electricity for warm air circulation should not be included in addition to the electricity for mechanical ventilation. However it is included for a warm air system and MEV or PIV from outside. V is the volume of the dwelling in m³. Per Table 4a (PDF p.165-166), warm-air systems are: - Category 5: heat pumps with warm-air distribution (codes 521, 523, 524 electric; 525, 526, 527 gas-fired) - Category 9: warm-air systems NOT heat pump (501-511, 520 gas- fired; 512-514 liquid-fired; 515 Electricaire electric) Pre-slice the cascade's `_table_4f_additive_components` docstring explicitly listed "(230b) Warm-air heating fans + (230c) for warm- air pump" as "Not yet wired" — every Cat 5 / Cat 9 warm-air corpus variant resolved `pumps_fans_kwh_per_yr` to 0. For electric 2 (code 524 Cat 5 air-source warm-air HP, no MV, V = 227.25 m³), the P960 worksheet block 11a (249) lodges 136.35 kWh × 13.67 p/kWh = £18.64 where the cascade computed 0. New `_TABLE_4A_WARM_AIR_SAP_CODES` frozenset (22 codes) + leaf helper `_table_4f_warm_air_heating_fans_kwh(main, dwelling_volume_m3, has_balanced_mv)` wired at the orchestrator pumps_fans summation alongside the existing circulation-pump and gas-flue-fan helpers. Footnote-e balanced-MV omission reads `epc.sap_ventilation. mechanical_ventilation_kind` via the new `_has_balanced_mechanical_ventilation` predicate (returns True for MVHR / MV; False for MEV / PIV / NATURAL). Per-line walk evidence: cascade `pumps_fans_kwh_per_yr` = 0.0000 vs worksheet (249) = 136.3500 = 1.5 × 0.4 × 227.25 exactly. Default SFP from footnote e matches; PCDB warm-air-unit SFP lookup deferred until a fixture exercises it. Closures electric 2: pumps_fans_kwh_per_yr: 0 → 136.35 (EXACT match to worksheet) ΔSAP +0.7002 → −0.1087 (residual swung past worksheet — the +0.70 pre-slice was an under-counted-fan offset; spec-correct fix lands just past zero, exposing a small upstream SH cascade gap likely in the Cat 5 warm-air HP Table 4a SH efficiency or Table 9c MIT cascade for warm-air mains — follow-up slice) Δcost −£16.14 → +£2.50 ΔCO2 −2.37 → +16.54 kg ΔPE −108.58 → +97.69 kWh No regressions on the other 24 cohort variants — the warm-air-code gate fires only when `sap_main_heating_code` is in the new frozenset and only electric 2 has a warm-air SAP code in the corpus. Extended handover suite: 902 pass / 0 fail (was 901 — +1 from the new AAA test). Pyright net-zero (43 → 43). Σ \|ΔSAP_c\| across the 25-variant cohort: 2.87 → 2.30 (~20% reduction from this slice). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 15:40:25 +00:00
Khalim Conn-Kowlessar	8507728237	Slice S0380.157: SAP 10.2 Table 2b note b) WHC=903 electric-immersion guard SAP 10.2 Table 2b note b) (PDF p.159) — verbatim: Multiply Temperature Factor by 0.9 if there is separate time control of domestic hot water (boiler systems, warm air systems and heat pump systems). The parenthetical list restricts the rule to systems where the heat generator (boiler / warm-air / HP) is the device heating the cylinder. Electric immersion is NOT in that list because the immersion isn't a heat-generator system feeding DHW — it sits inside the cylinder. The ×0.9 multiplier reflects shorter cylinder-heating periods when a boiler / HP / warm-air operates on a separate timer for DHW vs SH; if the heat generator doesn't feed the cylinder at all (because the immersion does), there's no such timing effect. Pre-slice `_separately_timed_dhw` returned True for any Cat 4 HP main BEFORE consulting WHC (line 3872 `if main.main_heating_category == 4: return True`). For electric 2 (sap_main_heating_code=524 Cat 5 warm-air ASHP, main_heating_category=4 per Elmhurst mapper, WHC=903 electric immersion + cylinder + cylinder thermostat lodged), the cat-4 branch fired before the existing `_is_electric_water` check could route the cert to False. The cascade applied ×0.9 to the Temperature Factor (53), pulling (55) from 1.2294 → 1.1064 → cascade annual (56) = 403.87 vs worksheet (56) annual = 448.73. Same WHC=903 principle as the prior slice S0380.156 (Table 3 zero- loss list for electric immersion): when HW is independent of the main heating, main-heating-specific DHW rules don't apply — even when the main happens to be a HP / boiler / warm-air system. Fix: new top-of-function `if epc.sap_heating.water_heating_code == _WHC_ELECTRIC_IMMERSION: return False` guard in `_separately_timed_dhw`. Reuses the constant introduced in S0380.156. Closures electric 2: Cylinder (56) storage loss annual 403.87 → 448.73 (matches worksheet 1.2294 × 365 = 448.73 EXACT within rounding) HW kWh demand 2339.24 → 2384.12 (matches worksheet (62)/(64) = 2384.116 EXACT) ΔSAP +0.8118 → +0.7002 Δcost −£18.71 → −£16.14 ΔCO2 −7.21 → −2.37 kg ΔPE −161.68 → −108.58 kWh The remaining +0.70 SAP residual is a separate upstream gap (likely warm-air-HP SH cascade or Table 4a SH efficiency for code 524) — follow-up slice. No regressions on the other 24 cohort variants. Cohort-1 ASHP certs (Cat 4 HP + WHC=901 = HW from HP + cylinder) keep ×0.9 as before because their WHC=901 doesn't trigger the new guard. Extended handover suite: 901 pass / 0 fail (was 900 — +1 from the new AAA test). Pyright net-zero (43 → 43). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 15:40:25 +00:00
Khalim Conn-Kowlessar	c727b3cb5a	Slice S0380.156: SAP 10.2 Table 3 WHC=903 electric-immersion zero-loss guard SAP 10.2 Table 3 (PDF p.160) verbatim: Primary loss is set to zero for the following: Electric immersion heater Combi boiler ... CPSU ... Boiler and thermal store within a single casing Separate boiler and thermal store connected by no more than 1.5 m of insulated pipework Direct-acting electric boiler Heat pump (...) with hot water vessel integral to package The Elmhurst WHC=903 lodging signals exactly the first row: "HW from a separate electric immersion heater" — the cylinder is heated by an immersion element inside the tank, no primary pipework between any heat generator and the cylinder. The rule is universal: regardless of what main heating exists for space heating, electric immersion means no primary circuit means no primary loss. Pre-slice `_primary_loss_applies` only consulted `water_heating_code` in the Table 4a wet-boiler branch (codes 151-161 / 191-196). The Cat 4 HP branch returned True unconditionally when no PCDB record was lodged; the Cat 1/2 boiler branch returned True unconditionally; the PCDB Table 322 + Table 4b non-PCDB branches likewise. For the electric 2 corpus variant (sap_main_heating_code=524 Cat 5 warm-air ASHP, main_heating_category=4 per Elmhurst mapper, no PCDB record, WHC=903 + cylinder), the Cat-4 branch falsely returned True and the cascade added ~510 kWh/yr primary loss to a system with no primary circuit at all. Per-line walk discipline applied: cascade `water_heating_from_cert` output dump showed `primary_loss_monthly_kwh_annual = 509.98` while worksheet (59)m = 0 every month → spec lookup found Table 3 verbatim "Electric immersion heater" zero-loss line. Adds `_WHC_ELECTRIC_IMMERSION: Final[int] = 903` constant + a top-of-function `if water_heating_code == _WHC_ELECTRIC_IMMERSION: return False` guard that fires before any of the system-type-keyed branches. Closures electric 2: HW kWh 2849.22 → 2339.24 (matches worksheet (62)/(64) = 2384.12 within the residual ~45 kWh storage-loss gap) ΔSAP −0.4584 → +0.8118 (cascade swung past the worksheet by +1.27 — the pre-slice 'near-correct' value was offsetting cascade bugs per [[feedback-software-no-special-handling]]; the +0.81 residual exposes a separate upstream gap to chase in a follow-up slice) Δcost +£10.56 → −£18.71 ΔCO2 +47.89 → −7.21 kg ΔPE +443.13 → −161.68 kWh No regressions on the other 24 cohort variants — only electric 2 has the (Cat 4 HP, no PCDB, WHC=903) combination in the corpus. Extended handover suite: 900 pass / 0 fail (was 899 — +1 from the new AAA test). Pyright net-zero (43 → 43). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-06-04 15:40:25 +00:00
Daniel Roth	a68ab01d70	All downloaded PasHub files uploaded to SharePoint property folder 🟩	2026-06-04 15:40:25 +00:00
Daniel Roth	3d2a21d9d9	All downloaded PasHub files uploaded to SharePoint property folder 🟥	2026-06-04 15:40:25 +00:00
Daniel Roth	815173bc79	tweak local trigger	2026-06-04 15:40:25 +00:00
Daniel Roth	35b65d0d8d	adjust comment about evidence type	2026-06-04 15:40:25 +00:00
Daniel Roth	5fa6f32644	Service persists mcs_certificate file_type when evidence_category is MCS cert 🟩	2026-06-04 15:40:25 +00:00
Daniel Roth	367f6f6cf9	Downloaded files carry evidence_category as DownloadedFile 🟩	2026-06-04 15:40:25 +00:00
Daniel Roth	1e2b997f2f	Downloaded files carry evidence_category as DownloadedFile 🟥	2026-06-04 15:40:25 +00:00
Daniel Roth	e18c300deb	MCS cert identified by evidence_category in get_core_file_type 🟩	2026-06-04 15:40:25 +00:00
Daniel Roth	1af1d4a82c	MCS cert identified by evidence_category in get_core_file_type 🟥	2026-06-04 15:40:25 +00:00
Daniel Roth	5ed3bf73e8	evidence categories plus typehinting	2026-06-04 15:40:25 +00:00
Daniel Roth	020a24d345	run() returns core and other file paths 🟩	2026-06-04 15:40:25 +00:00
Daniel Roth	d8ec12065f	run() returns core and other file paths 🟥	2026-06-04 15:40:25 +00:00
Daniel Roth	0aa6a4fc30	Other files persisted to DB with file_type OTHER 🟩	2026-06-04 15:40:25 +00:00
Daniel Roth	1650762ae2	Other files persisted to DB with file_type OTHER 🟥	2026-06-04 15:40:25 +00:00
Daniel Roth	c86dbeb4a1	Upload other files to S3 when get_other_files is True 🟩	2026-06-04 15:40:25 +00:00
Daniel Roth	098f60ecfd	Upload other files to S3 when get_other_files is True 🟥	2026-06-04 15:40:25 +00:00
Daniel Roth	9c38f45c98	tidying for readability	2026-06-04 15:40:25 +00:00
Daniel Roth	c9a2ce4921	Service deletes other-file temp paths after run 🟩	2026-06-04 15:40:25 +00:00
Daniel Roth	f8d2bb8049	Service deletes other-file temp paths after run 🟥	2026-06-04 15:40:25 +00:00
Daniel Roth	49e7b7fea6	Wire service to `get_evidence_files_by_job_id`; retire `get_core_evidence_files_by_job_id` 🟪	2026-06-04 15:40:25 +00:00
Daniel Roth	662f6de0ab	`get_evidence_files_by_job_id` downloads other files when `include_other=True` 🟩	2026-06-04 15:40:25 +00:00
Daniel Roth	c4ffaaa069	`get_evidence_files_by_job_id` downloads other files when `include_other=True` 🟥	2026-06-04 15:40:25 +00:00
Daniel Roth	f95b6bdd7d	`get_evidence_files_by_job_id` returns `DownloadedFiles` with empty `other` when `include_other=False` 🟩	2026-06-04 15:40:25 +00:00
Daniel Roth	665dc69ad5	`get_evidence_files_by_job_id` returns `DownloadedFiles` with empty `other` when `include_other=False` 🟥	2026-06-04 15:40:25 +00:00
Daniel Roth	e7c679e0db	Group evidence into core and other via `_group_into_core_and_other_files` 🟪	2026-06-04 15:40:25 +00:00
Daniel Roth	99229844b5	`_select_other_files` returns non-core evidence files 🟩	2026-06-04 15:40:25 +00:00
Daniel Roth	db796747d9	`_select_other_files` returns non-core evidence files 🟥	2026-06-04 15:40:25 +00:00
Daniel Roth	6cb6c8c756	allow for missing deal stage column when triggering sqs from file	2026-06-04 15:40:25 +00:00
Daniel Roth	790e430aff	rename local handler trigger script	2026-06-04 15:40:25 +00:00
Jun-te Kim	dfd05ba28b	tests files	2026-06-04 11:47:42 +00:00
Khalim Conn-Kowlessar	fe59c4d8a2	S0380.208: case 7 combi e2e fixture — condensing-oil-combi path validated exact Adds simulated case 7: case 6 (P960-0001-001431) with the heating swapped to a CONDENSING OIL COMBI (SAP code 130, Table 4b 82/73) and the cylinder removed — combi instantaneous DHW (WHC 901), Table 3a keep-hot combi loss (61) = 600 kWh/yr, no primary/storage loss, boiler interlock PRESENT (no −5pp). This is the heating archetype golden cert 0240-0200-5706-2365-8010 uses, which case 6 (SAP code 127, a regular condensing oil boiler + cylinder) never exercised. The cascade reproduces the case-7 worksheet EXACTLY at abs=1e-4 on every top-level SapResult output with ZERO calculator changes: (211) 7865.4304 (213) 7556.9821 (219) 3496.8121 (98c) 12646.3783 (255) 1123.3372 (257) 1.9631 (272) 5738.9315 (258) 73 This validates the SAP 10.2 Appendix D Eq D1 combi efficiency blend + Table 3a keep-hot combi loss + Table 4b code 130 (82/73) path, and exonerates the combi mechanism as the source of 0240's API-path residual — which therefore lives in 0240's fabric/demand or the API mapper. Test-only slice (no impl change). New fixture file: 0 pyright errors. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 17:57:22 +00:00
Khalim Conn-Kowlessar	2b1afa7339	S0380.204: extract Main Heating2's own emitter + control (§14.1) Prerequisite for the SAP 10.2 p.186 two-systems-different-parts MIT. When two main systems heat different parts of a dwelling, §14.1 Main Heating2 lodges its OWN "Heat Emitter" + "Main Heating Controls Sap" (simulated case 6: Main 1 radiators / control 2106 serving the living area, Main 2 underfloor / control 2110 serving elsewhere). The extractor + mapper dropped both — `MainHeatingDetail.heat_emitter_type` and `main_heating_control` came through as empty-string sentinels, so the cascade saw system 2 as having no responsiveness (defaulted R=1.0) and no control type. - `MainHeating2` datatype gains `heat_emitter` + `heating_controls_sap`. - The extractor reads them from the §14.1 block. - `_map_elmhurst_main_heating_2` maps them via the same helpers as Main 1 (`_elmhurst_heat_emitter_int` → underfloor-in-screed = emitter 2, Table 4d R=0.75; `_elmhurst_sap_control_code` → 2110, Table 4e type 3), threading the dwelling floor + age band for the underfloor subtype. Empty-string fallback preserved for the legacy DHW-only Main 2 (cert 000565 §14.1 omits emitter/control). No cascade output changes yet — the MIT consumer lands in S0380.205. Full suite 2358 pass + 0 fail. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 15:53:32 +00:00
Khalim Conn-Kowlessar	2b1f90a7de	S0380.199: site-notes "Roof of Room" windows → roof windows (cross-mapper parity with S0380.198) The Elmhurst extractor crashed parsing simulated-case-6's room-in-roof window rows: the §11 "Location" cell "Roof of Room in Roof" wraps across the layout prefix/suffix blocks and leaked into the glazing-type phrase ("Double between 2002 Roof of Room and 2021 in Roof" → UnmappedElmhurst- Label). Fix (`_parse_window_from_anchors`): detect the roof-of-room location tokens, strip them from the before/after blocks so the glazing phrase reconstructs cleanly, and set location="Roof of Room". Mapper: `_is_elmhurst_roof_window` gains a "Roof of Room" location branch (highest-confidence rooflight signal, above the BP-roof-type / U>3.0 gates); `_ELMHURST_ROOF_WINDOW_U_BY_GLAZING` gains "Double between 2002 and 2021" → 2.30 (case 6 lodges the already-inclined roof-window U, so the +0.30 inclination adjustment must not double-apply). This is the site-notes mirror of S0380.198 (API window_wall_type=4): both paths now route room-in-roof rooflights to (27a) at the inclined U. Validated against the case-6 P960 worksheet at abs=1e-4: (27) Windows = 22.7408 (cascade 22.7407) (27a) Roof Windows = 13.0375 (cascade 13.0375, EXACT) (31) ext area = 336.13 Case 6 is pinned only on the §3 window line refs (new standalone test, not added to the section-pin `_FIXTURES`) because its DUAL main heating (51% rads + 49% underfloor, oil) makes the §10/§12 per-system lines non-comparable to SapResult's aggregated fields — documented in the fixture module. Summary mirrored to Summary_001431_case6.pdf. Suite: 2355 passed, 1 skipped. New code: 0 pyright errors. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 12:46:18 +00:00
Jun-te Kim	c614ff6388	save local changes	2026-06-03 12:41:56 +00:00
Khalim Conn-Kowlessar	570df83459	S0380.197: simulated case 5 e2e fixture — detached sandstone RR validates S0380.196 (RdSAP 10 §3.9.1 + Table 4 p.22) Promotes user-simulated "case 5" (detached, sandstone-walled, room-in-roof cousin of golden cert 0240) to an e2e worksheet fixture pinning the WHOLE extractor → mapper → calculator pipeline at abs=1e-4 on all 11 Block-1 line refs. Its worksheet prints the exact RR-gable routing S0380.196 implements, validating that fix against ground truth: Roof room Main Gable Wall 1 15.68 U=0.35 (29a) Exposed → walls @ main-wall U Roof room Main remaining area 61.73 U=0.30 (30) A_RR shell − Σ gables External roof Main 14.52 U=0.11 (30) loft residual Roof room Main Gable Wall 2 15.68 U=0.25 (32) Party → party @ 0.25 gable area = 6.40 × 2.45 (§3.9.1 default RR storey height); A_RR remaining = 12.5√(83.2/1.5) − 2×15.68 = 93.09 − 31.36 = 61.73 (RdSAP 10 §3.9.1(e)). Confirms a DETACHED dwelling can lodge a Party RR gable (Table 4 p.22 row 2) — so my S0380.196 mapping (gable_wall_type 0=Party, 1=Exposed) is correct; do not flip it. Two extractor/mapper gaps surfaced and fixed (case 5 is the forcing test): - Sandstone wall label "SS Stone: sandstone or limestone" had no `_ELMHURST_WALL_CODE_TO_SAP10` entry (raised UnmappedElmhurstLabel). Added "SS" → 2 (WALL_STONE_SANDSTONE), matching 0240's API wall_construction=2 (cross-mapper parity). - Roof "Insulation Thickness 400+ mm" was silently dropped: the four thickness parsers used `.split()[0].isdigit()`, which rejects the trailing "+" → None → u_roof fell back to the age-J default 0.16 instead of 0.11 (+1.09 W/K roof, the whole 0.12 SAP gap). Added `_parse_thickness_mm` (strips to leading digits) and applied it at all four sites (walls / alt-wall / roof / floor). The only existing fixture with "400+ mm" (000565 Stud Wall) routes via the RIR regex, unaffected. Result: case 5 cascade ≡ worksheet at 1e-4 on SAP/ECF/cost/CO2 + every energy stream. Neither gap affects 0240 (its API path captures both the sandstone code and "400mm+"); 0240's residual is therefore non-fabric. Suite: 2353 passed, 1 skipped. New code: 0 pyright errors. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 11:41:16 +00:00
Khalim Conn-Kowlessar	4a21717de6	S0380.195: pin sim case 4 (6035 floor geometry) e2e at 1e-4 — 6035 +19 PE is lodged divergence Adds the user-simulated case-4 worksheet as e2e fixture `001431_6035` — reproduces golden cert 6035's full floor geometry (Main ground-floor HLP 15.99 + first-floor HLP 8.32, the asymmetric upper storey) and 8 windows. All 11 Block-1 line refs pin at abs=1e-4 against the worksheet (SAP 68, ECF 2.2802, cost 937.2341, CO2 4682.3494, space 15745.3260, main fuel 18744.4357). This is the 4th independent 1e-4 confirmation across the 6035 archetype (sim cases 1-4). Case 4 matches 6035 on floors + window areas; the residual ~50 kWh / £11 cascade delta vs 6035 is two lodged inputs only (largest window orientation N vs S; meter type "Dual" vs API 2), not calculator behaviour. Conclusion: the cascade reproduces the spec engine exactly for 6035's geometry, so 6035's +19 PE vs the lodged register is lodged-register divergence (the gov.uk register's rounded value vs the spec-exact worksheet), NOT a calculator gap. 6035 is a "pin-forever" lodged-only cert. Bugs surfaced + fixed along the way: S0380.192 (Simplified-RR remaining area) and S0380.193 (suspended-floor sealed rule). 2341 passed (+11), 0 failed; pyright net-zero. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 09:56:39 +00:00
Khalim Conn-Kowlessar	e7a0c9885e	S0380.194: pin sim case 3 (near-exact 6035 replica) e2e at 1e-4 Adds the user-simulated case-3 worksheet as e2e fixture `001431_rr8` — Main + Extension + Simplified room-in-roof with 8 windows (≈14.15 m², reproducing golden cert 6035's glazing) and Main ground-floor HLP 15.99. All 11 Block-1 line refs pin at abs=1e-4 against the worksheet (SAP 68, cost 951.3425, CO2 4767.4862, space 16086.3557, main fuel 19150.4235, HW 3307.2639, lighting 262.0885). This is the third independent 1e-4 confirmation that the cascade reproduces the spec engine for the 6035 archetype (after S0380.192 Simplified-RR + S0380.193 suspended-floor). It differs from 6035 in one input only — the Main first-floor HLP (15.99 here vs 6035's 8.32) — so 6035's +19 PE vs the lodged register is lodged-register divergence, not a calculator gap. A byte-identical 6035 replica (first-floor HLP 8.32) would let 6035 itself be pinned directly to close that out. 2330 passed (+11), 0 failed; pyright net-zero. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 09:46:56 +00:00
Khalim Conn-Kowlessar	62fc27a5cc	S0380.193: suspended-floor (12) sealed rule fires only on a SUPPLIED U-value RdSAP 10 §5 (PDF p.29) "Floor infiltration (suspended timber ground floor only)", age band A-E, splits on whether a floor U-value is supplied: a) [U-value supplied] if floor U-value < 0.5 → "sealed", (12) = 0.1 b) [no U-value supplied] retro-fitted insulation → "sealed" 0.1; otherwise "unsealed", (12) = 0.2 `_has_suspended_timber_floor_per_spec` fed the cascade's COMPUTED default U into rule (a), so an as-built/uninsulated suspended-timber floor whose default U happens to be < 0.5 was marked "sealed" (0.1) where Elmhurst uses "unsealed" (0.2). That dropped (18) infiltration 0.85 → 0.75, (25) effective ACH, HTC, and understated space heating ~450 kWh. Fix: gate rule (a) on `floor_u_value_known` — a computed default U is not a supplied value, so it falls through to (b). Verified against the cert 001431 sim-case-2 worksheet: floor "As built", U=0.43 (matches the worksheet's (28a) 0.4300 exactly), (12)=0.2 unsealed. Golden cert 6035 (also a suspended uninsulated floor) is unaffected — its U=0.63 ≥ 0.5 already routed to unsealed. Promotes sim case 2 to the e2e harness as `001431_rr` (Main + Extension + Simplified room-in-roof — the 6035 archetype). All 11 Block-1 line refs pin at abs=1e-4, locking BOTH this fix and S0380.192 (Simplified-RR remaining area) end-to-end: SAP 69, cost 920.5046, CO2 4566.7090, space 15269.8593, main fuel 18178.4039. 2319 passed (+11), 0 failed; pyright net-zero. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 09:16:25 +00:00
Khalim Conn-Kowlessar	ec9ef0e8bb	fix(extractor): drop windows-table header remnant from first window glazing type Summary PDFs preprocessed from `pdftotext -layout` wrap the windows-table header across several lines. The third header line's tail ("U value / g value / Draught Proofed / Permanent Shutters") tokenises to "value value Proofed Shutters" and lands directly above the FIRST window's data row. Because the first window in a building part has `before_start = 0`, its prefix block reaches back into that header remnant. The remnant is neither an orientation nor a building-part fragment, so it survived the pops in `_compose_window_descriptors` and leaked into glazing_type as "value value Proofed Shutters Double between 2002 and 2021" (windows 2-3, whose prefix starts after the previous window's manufacturer line, were clean). Fix: the glazing-type phrase always starts with a glazing-start word (Single/Double/Triple/Secondary), so trim any prefix fragments preceding that word before joining the glazing type. Orientation/bp pops still run on the full prefix, so they are unaffected. Reproduced from `sap worksheets/Recommendations Elmhurst Files/ cavity_wall_insulation - main wall/before/Summary_001431.pdf`. Added a regression test driving the real `_extract_windows_from_layout` path with the verbatim tokenised header+rows. 2306 passed (+4), pyright net-zero. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 22:54:49 +00:00
Khalim Conn-Kowlessar	896b5740c3	S0380.191: pin simulated 001431 gas-combi end-to-end at 1e-4 (e2e harness) Adds the user-simulated 001431 case (the cert that drove S0380.189/.190) as an Elmhurst-only e2e fixture: Summary PDF → extractor → mapper → calculator, every Block-1 SapResult field pinned against the P960-0001-001431 worksheet at abs=1e-4. All 11 pins pass with zero residual — the case is clean, confirming the S0380.190 gas-combi fuel derivation closes the Summary path natively. Verified the handover's flagged "+0.0007 SAP" was a target artifact, not a cascade gap: the worksheet displays ECF (257) rounded to 1.6047 and integer SAP (258)=78; the cascade's continuous SAP is computed from the UNROUNDED ECF = (255)(256)/((4)+45) = 660.97500.4200/173.0, giving 77.6147 — which matches the worksheet's own unrounded value. Pinning the continuous SAP from the display-rounded ECF (→ 77.6144) was the wrong target. Block-1 line refs all match exactly: (211) 10699.7225, (219) 3327.1592, (231) 86.0, (232) 283.2229, (255) 660.9750, (272) 3000.1664, Σ(98) 8987.7669. Summary mirrored into the tracked fixtures dir as Summary_001431_gas_combi.pdf (distinct name — the corpus reuses cert 001431 across every heating variant); source Summary + worksheet tracked under sap worksheets/golden fixture debugging/ as the pin ground truth. 2302 passed (+11), 0 failed; pyright net-zero on new/changed files. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 22:44:32 +00:00
Khalim Conn-Kowlessar	57241322ea	S0380.185: record CH6 pin-forever proof — distribution-loss is a Summary-export gap CH6's P960 worksheet input lodges Distribution Loss = "Two adjoining dwellings sharing a single heating system" → (306) DLF = 1.0000, vs CH4's "Calculated" → 1.5 → (306) = 1.4500. That DLF choice swings SAP/cost/CO2/PE materially, but it is NOT present in the Summary PDF that the corpus pipeline consumes (Summary → ElmhurstSiteNotesExtractor → mapper → calculator). Proven empirically with a user-supplied controlled pair (CH adjoined dwellings/Summary_001431 (1) vs (2)): the two Summaries are byte-identical across every RdSAP INPUT field, differing only in the derived header (SAP 80 vs 75, bill £954 vs £1237, emissions 5.407 vs 7.394 t). A case-insensitive scan of the CH6 Summary for "distribution"/"adjoin" returns 0 hits. Since CH4/CH6 Summaries are themselves identical bar fuel type, no Summary-derivable rule can yield CH4=1.45 AND CH6=1.0. Doc-only change (comment in _EXPECTATIONS); 20/20 community-heating corpus tests pass. Closes the CH6 re-litigation: pin held. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 19:21:28 +00:00
Khalim Conn-Kowlessar	82f7315f8d	S0380.184: community electric-HP network CO2/PE uses monthly Table 12d/12e — closes CH3 SAP 10.2 worksheet block 12b/13b (367)/(467) for a community heating electric heat pump (Table 4a code 304 → Table 12 fuel 41 "heat from electric heat pump"). The HP meters grid electricity, so per Table 12 note (s)/(t) + block 12b/13b footnote (a) its emission/PE factor is the MONTHLY Table 12d/12e cascade (fuel 41 = standard-electricity profile), weighted by the network heat profile, then × 1/heat-source-eff (1/COP): (367)/(467) = [(307)+(310)] / COP × Σ((307+310)_m × factor_m)/Σ(...) Per-line walk of CH3 (the displayed (367) 0.1535 / (467) 1.5717 are PDF artifacts; the (373)/(473) totals reconcile only with): CO2 factor = 0.15040 (monthly Table 12d wtd) vs cascade annual 0.136 PE factor = 1.55692 (monthly Table 12e wtd) vs cascade annual 1.501 Pre-slice the cascade routed code 304 through the non-electric branch (`_co2_factor_kg_per_kwh(main) × 1/COP` = annual × scaling). New `_is_heat_network_electric_main` (heat-network main whose fuel has a Table 12d monthly set — i.e. fuel 41) routes all four factor helpers (main + HW, CO2 + PE) through the monthly cascade × 1/COP. Non-electric heat networks (gas 51 / oil 53 / coal 54) have no monthly set → annual path unchanged (CH1, CH6 untouched). Closure (CH3 was already SAP+cost EXACT): CH3 (HP/Elec) CO2 −75.32→+0.0000 (= [(307+310)/3]×(0.1504−0.136)), PE −249.32→−0.0000 (× (1.5569−1.501)) — FULLY EXACT Corpus now 40/41 EXACT on all four metrics. Only CH6 remains: its worksheet lodges a manual DLF=1.0 ("two adjoining dwellings") absent from the Summary PDF (byte-identical to CH4 bar fuel type) — an architectural limit, not a cascade gap. 2226 pass + 1 skip + 0 fail (tolerances 1e-4 all metrics); pyright net-zero 43→43. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 18:43:16 +00:00
Khalim Conn-Kowlessar	803da062a2	S0380.183: community-heating HW bills at heat-network rate (§10b) — closes CH2/CH4 fully SAP 10.2 §10b: hot water for a community-heating dwelling bills at the heat-network rate, not the cert-lodged fuel. Elmhurst §15.0 lodges `water_heating_fuel_type = "Mains gas"` (3.48 p/kWh) as a placeholder on community certs; the worksheet (342) Water-heating cost = (310) × the S0380.171 CHP heat-fraction blend — the SAME rate as space heating (340). Per-line walk of the CH2 block 10b: (340) space = 11837.83 × 0.037955 = 449.3047 (cascade EXACT) (342) water = 3854.12 × 0.037955 = 146.2830 (cascade billed 3854.12 × 0.0348 = 134.12 → −£12.16, the whole residual) (350) lighting + (351) standing → (355) 754.1502. `_hot_water_fuel_cost_gbp_per_kwh`'s `inherit_main_for_community_heating` path already routes HW cost through `_fuel_cost_gbp_per_kwh(main)` (the CHP blend), but its gate `_is_community_heating_hw_from_main` excluded code 302. S0380.182 wired the 302 CO2/PE credit via `_heat_network_code_302_effective_factor`, which intercepts the HW CO2/PE helpers ABOVE this predicate's branch — so extending the predicate to include 302 now affects ONLY the cost path. Closures: CH2 (CHP/Gas) SAP +0.5277→−0.0000, cost −£12.16→−£0.00 — FULLY EXACT CH4 (CHP/Oil) SAP +0.5277→−0.0000, cost −£12.16→−£0.00 — FULLY EXACT CH6 (CHP/Coal) SAP −7.49→−8.02, cost +£172.68→+£184.84 — its HW now also bills the blend, compounding the DLF=1.0 quirk (cascade DLF=1.45); same separate CH6 DLF front. Corpus now 39 variants EXACT on all four metrics (CH2/CH4 join). Open: CH3 CO2/PE (code-304 community-HP COP), CH6 all-metric (DLF=1.0 manual override the Summary doesn't carry). 2225 pass + 1 skip + 0 fail (tolerances 1e-4 all metrics); pyright net-zero 32→32. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 18:29:57 +00:00
Khalim Conn-Kowlessar	8e86de2257	S0380.182: community-heating CHP+boilers CO2/PE credit (§12b/13b) — closes CH2/CH4 CO2+PE SAP 10.2 worksheet block 12b (CO2) / 13b (PE) for community heating "CHP and boilers" (SAP code 302). Per unit of network heat fuel H = (307)+(310) the effective generation factor is: chp×100/(362)×f_fuel − chp×(361)/(362)×f_disp + (1−chp)×100/(367)×f_fuel (363)/(463) CHP fuel = chp_frac × 100/heat_eff × f_fuel (364)/(464) less credit = −chp_frac × elec_eff/heat_eff × f_disp (368)/(468) boiler fuel = (1−chp_frac) × 100/boiler_eff × f_fuel f_fuel = Table 12 heat-network fuel factor (the CHP unit and the back-up boilers burn the same community fuel — verified vs CH2 gas / CH4 oil / CH6 coal worksheets (363)/(368)); f_disp = Table 12f (PDF p.196) credit for the CHP-generated electricity. RdSAP 10 §C (p.58) defaults: heat eff 50% (362), electrical eff 25% (361), boiler eff 80% (367); CHP heat frac 0.35 per-cert via community_heating_chp_fraction. New `_heat_network_code_302_effective_factor` + Table 12f flexible constants (0.420 CO2 / 2.369 PE) + RdSAP §C efficiency constants, wired into all four factor helpers (main + HW, CO2 + PE) ahead of the existing single-fuel / 1-over-heat-source-eff path. The worksheet (368)/(468) boiler emissions DISPLAY rounded/mis-aligned in the PDF, but the (373)/(473)/(386)/(486) totals reconcile only with the boiler at the full Table 12 factor — verified EXACT. Two spec citations applied: - Table 12f flexible-operation default for RdSAP community CHP is an Elmhurst engine choice (Table 12f notes make "standard" the default); mirrored per [[feedback-software-no-special-handling]] and documented in SAP_CALCULATOR.md §8.3. - Table 12 heat-network oil/biodiesel CO2 (codes 53/56) corrected 0.298 → 0.335 per Table 12 (p.189) "assumes 'gas oil'"; the code-302 oil cascade (CH4) was the first to exercise it. PE 1.180 was already correct. No other variant uses these codes (no regression). Closures (CO2 + PE only — the CHP credit does not touch cost/SAP): CH2 (CHP/Gas) CO2 −1411.49→+0.0000, PE +1331.23→+0.0000 EXACT CH4 (CHP/Oil) CO2 −4378.24→−0.0000, PE +319.81→−0.0000 EXACT CH6 (CHP/Coal) CO2/PE re-pinned (+2411.54 / +5023.48) — its worksheet lodges a manual DLF=1.0 the Summary doesn't carry, so cascade DLF=1.45 over-scales H; same root as the CH6 SAP −7.49 / cost +£172 (separate DLF front). CH2/CH4 are now CO2+PE-exact but still carry the heat-network cost/SAP residual (+0.5277 SAP / −£12.16 cost, exposed by S0380.175 — cost-side, untouched here). CH3 unchanged (code 304 community-HP COP front). Corpus state: 37 variants EXACT on all four metrics (incl. CH1); remaining residuals are CH2/CH4 cost+SAP, CH3 CO2+PE (HP COP), CH6 all-metric (DLF quirk). 2223 pass + 1 skip + 0 fail (tolerances 1e-4 all metrics per S0380.181); pyright net-zero 43→43. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 18:23:17 +00:00
Khalim Conn-Kowlessar	02a89bcb39	S0380.181: tighten heat-systems corpus residual tolerances to 1e-4 (all metrics) The corpus residual-pin tolerances had drifted looser than the comment above them claimed ("pin at 1e-4 relative to lodged precision"): SAP was 1e-3, cost ±£0.01, CO2 ±0.1 kg, PE ±0.1 kWh. A ±0.1 kg CO2 band could silently mask a ~0.09 kg drift on a variant we report as EXACT. The worksheet pins are extracted from the P960 PDF text, which prints 4 d.p., so the hard residual floor is ~5e-5 (half a unit in the last printed digit) regardless of cascade precision. 1e-4 sits just above that floor. All 41 variants hold at uniform 1e-4 on continuous SAP, cost, CO2 AND PE — confirming the 37 EXACT variants are genuinely exact to PDF print-rounding and the looser bands were masking nothing. Aligns the guard with [[feedback-zero-error-strict]] / [[feedback-continuous-sap-tolerance]] (basically zero error across all four metrics). Test-only change; no cascade behaviour touched. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 18:11:28 +00:00

1 2 3 4 5 ...

2298 commits