Model

mirror of https://github.com/Hestia-Homes/Model.git synced 2026-07-27 23:35:01 +00:00

Author	SHA1	Message	Date
Khalim Conn-Kowlessar	26d7fc036e	docs(modelling): record Valuation Uplift design (ADR-0018 + glossary) From the grill-with-docs pass on the depth+scale phase. Splits the overloaded "valuation" into two glossary terms — Property Valuation (current market value, a Baseline attribute, mostly missing) and Valuation Uplift (plan-conditional, percentage-primary; absolute £ only when a Property Valuation exists, 2x ROI cap on the £ form). ADR-0018 records the percentage-primary decision and why (the EPC scale corpus has no market values, so a value-primary model produces nothing), plus the deferred sourcing / per-measure / rental-yield items. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-04 07:52:33 +00:00
Khalim Conn-Kowlessar	b76d0f814b	docs(modelling): design the plan_recommendations retirement (ADR-0017 amendment) Rewrite the migration spec into the full expand/contract sequence (add plan_id → backfill → dual-write → cut reads → drop) with the two load-bearing rules: backfill before any read cuts over, and dual-write the m2m until all reads are off it (the Drizzle FE reads the tables directly, so the repos can't deploy atomically). Amend ADR-0017 from "m2m retired for new writes" to "m2m dropped + one SQLModel definition per table under infrastructure/postgres/modelling/". Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 20:24:30 +00:00
Khalim Conn-Kowlessar	75ba5dd744	docs(modelling): ADR-0014 amendment — cross-stage billing + Modelling post-package bills Records the /grill-with-docs design for the Modelling Bill-Derivation slice: Bill Derivation is cross-stage (relocate Bill/EnergyBreakdown/BillDerivation/ sap_fuel to a neutral domain/billing/); Modelling bills the fully-overlaid post-package SapResult (so fuel-switch measures price at the new fuel for free), diffing against the baseline at the same FuelRates snapshot; the post-package and baseline SapResults are captured from scores the optimiser/orchestrator already compute (Score.sap_result), so no second calculate; FuelRatesRepository is constructor-injected into ModellingOrchestrator mirroring Baseline; plan-level columns this slice, per-measure telescoping bill cascade next (energy_savings is vestigial, left NULL). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 17:17:03 +00:00
Khalim Conn-Kowlessar	af501fce0e	feat(modelling): ventilation-aware selection — price the forced dependency in The warm-start (and max-gain fallback) now price each forced Measure Dependency the candidate triggers, not just inject it afterwards: optimise/optimise_min_cost fold dependencies into each candidate's cost+gain via _augmented_cost_gain, and optimise_package scores each dependency's true role-1 signal (_with_role1_signals) instead of the 0.0 placeholder. This stops the min-cost objective (i) ignoring the ~£900 a wall drags in (a wall-free package reaching target can be cheaper) and (ii) picking a small-gain wall whose mandatory ventilation (down to -5 SAP) makes it net-negative, which repair cannot un-pick. Budget is now a hard envelope: the constraint applies to the augmented (measure + its ventilation) cost, so a wall that fits alone but whose ventilation would bust the budget is DROPPED rather than forced over budget. This reverses the earlier 'forced regardless of budget' call (which made sense when selection was ventilation-blind). Safety invariant intact — presence still injected on every path; we just never recommend a wall we can't afford to ventilate. ADR-0016 amendment updated. 94 modelling+orchestration tests pass. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 16:16:26 +00:00
Khalim Conn-Kowlessar	5620f49f18	docs(modelling): ADR-0016 amendment — optimiser objective is least-cost-to-target The original ADR-0016 mis-specified the warm-start objective as maximise-gain- subject-to-budget (with the target a repair floor); the rebuild faithfully implemented that wrong objective. The intended behaviour is the legacy StrategicOptimiser Case 1: minimise cost subject to (true) SAP gain >= target and cost <= budget, falling back to max-gain-within-budget only when the target is unreachable. For Increasing EPC this is least-cost-to-target: cheapest package reaching the band, stops at the target (no overshoot into a higher band), surplus budget unspent. Also records: target predicate sap_continuous >= band floor (conservative, no legacy slack — re-score+repair supersede it); ventilation-aware selection (the forced dependency, -1 to -5 SAP, is folded into candidate evaluation with a real negative role-1 signal, not just injected afterwards); presence-vs-awareness enforcement; warm-start+re-score+repair structure and scalability rationale kept. Sharpened the CONTEXT.md Optimised Package definition to match. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 15:26:02 +00:00
Khalim Conn-Kowlessar	772cdd4f5a	docs(modelling): #1157 Plan-persistence design review Outcome of the /grill-with-docs session scoping #1157. - CONTEXT.md: add Plan Measure (the persisted selected Option + role-3 attribution + cost); Recommendation stays the candidate. Remove Scenario Phase / Plan Phase / Rolled-over Options — multi-phase is deferred. Reshape Scenario + Plan to single-phase; fix relationships, dialogue, and the "phase" ambiguity note. - ADR-0005: rewritten to Deferred (multi-phase was speculative prospective-client work; single-phase now; future plan_phase back-fill path preserved). Stray phase refs cleaned in ADR-0016 / ADR-0009. - ADR-0017 (new): Plan persistence — reuse the live plan/recommendation tables via SQLModel mirrors + a PlanRepository on the UoW; add recommendation.plan_id, retire the plan_recommendations m2m; flat post-retrofit on plan; idempotent replace; CO2 in tonnes. Unselected alternatives + bills noted as deferred directions. - docs/migrations/recommendation-plan-id.md: the FE-owned Drizzle change. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 11:12:54 +00:00
Khalim Conn-Kowlessar	0ba45a09cc	docs(modelling): record stage design — CONTEXT terms + ADR-0016 Reframe Recommendation as a target surface (partitions the EpcPropertyData surface, so selected overlays never collide); add Measure Option, Simulation Overlay (EpcSimulation), Product, Cost, Contingency, and Measure Dependency. ADR-0016 fixes the scoring/optimisation approach (warm-start grouped-knapsack MILP -> deterministic package re-score -> greedy repair, with a final-package marginal cascade for display attribution), resolving the open question in ADR-0005 §14. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 22:13:51 +00:00
Khalim Conn-Kowlessar	5e75fb474c	feat(baseline): EnergyBreakdown.from_sap_result + COOLING section The SapResult -> EnergyBreakdown adapter (ADR-0014), a classmethod on the target mirroring Performance.from_sap_result. Folds each positive per-end-use delivered kWh into a billable EnergyLine: main/main-2/secondary heating and hot water at their resolved fuel (sap_code_to_fuel); lighting/pumps-fans/ appliances/cooking/cooling as electricity. PV export carries to exported_kwh for the SEG credit. Zero-kWh end uses emit no line; a positive kWh with no fuel code raises rather than billing at a default (strict, mirrors the calculator). Adds BillSection.COOLING (electricity, from space_cooling_fuel_kwh_per_yr). BillDerivation already prices any section it is given, so no change there. Also corrects the ADR-0014 amendment: SapResult carries the calculator's own fuel codes (raw API or Table-32 per mapper, ADR-0015); sap_fuel normalizes. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 18:28:22 +00:00
Khalim Conn-Kowlessar	19a56461ba	docs(baseline): Bill Derivation design — fuel as calculator output + rebaselining is assemble-and-score Captures a /grill-with-docs session resolving how BillDerivation gets the fuel each end use burns, and what Rebaselining actually is. - ADR-0014 amendment: per-end-use fuel is a calculator OUTPUT (resolved Table-32 codes on SapResult: main-1/main-2/secondary/HW + pv_exported_kwh); the adapter is a pure SapResult->EnergyBreakdown map. Corrects stale §3 (is_gas_code... -> sap_fuel.sap_code_to_fuel). Adds COOLING section. Interim, pending ADR-0015. - ADR-0013 amendment: the calculator is the SCORING ENGINE within Rebaselining (assemble the Effective EPC picture, then score), not the whole of it; the Rebaseliner exposes its SapResult so the orchestrator composes Effective Performance AND the Bill from one scoring. - ADR-0015 (new): mappers own cert normalization; EpcPropertyData becomes a strict type. Explains why fuel resolution sits in the calculator today. - CONTEXT.md: Effective EPC = the assembled picture; Rebaselining = assemble (overrides / neighbour-estimation / old-schema remap) then score. - EpcPropertyData docstring points at ADR-0015. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 18:04:55 +00:00
Khalim Conn-Kowlessar	69995edec8	Merge branch 'main' of https://github.com/Hestia-Homes/Model into feature/per-cert-mapper-validation	2026-06-02 16:10:41 +00:00
Khalim Conn-Kowlessar	5f65b9be62	feat(baseline): SAP fuel-code -> Fuel mapping for billing (ADR-0014) Slice 3 of Bill Derivation. sap_code_to_fuel(code) maps a SAP 10.2 / Table 32 fuel code to the canonical billing Fuel — bounded to the ~47 Table 32 codes (the carrier, orthogonal to the PCDB product index, so all PCDB heat pumps share one electricity code). Mains gas / LPG / oil+bioliquids / coal / smokeless / wood / electricity (standard + off-peak) / heat-network groupings; an unmapped code (dual fuel, grid-export) raises UnmappedSapCode rather than guessing. Also: ADR-0014 deferred/TODO section records the stubbed appliances+cooking (pending the SapResult fields), the off-peak day/night split, the heat-network rate gap, and regional rates / ETL. The SapResult -> EnergyBreakdown adapter (next slice) is gated on the appliances/cooking fields landing on SapResult. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 09:50:10 +00:00
Khalim Conn-Kowlessar	57867832f6	docs(adr): Bill Derivation (ADR-0014) + calculator goes load-bearing (ADR-0013 amend) Pin the bills design from a /grill-with-docs session: - ADR-0014: whole-home annual bill from SAP10 Calculation's delivered kWh per end use, re-priced at real Fuel Rates (NOT the calculator's SAP-notional total_fuel_cost_gbp, which is RdSAP Table 32 standardised prices ~half real electricity). Fuel enum + FuelRates + FuelRatesRepository static snapshot; per-section + total flat columns; raise on unpriced fuel (house coal / heat network are the named gaps). - ADR-0013 amendment: the shadow stepping-stone is collapsed — the calculator is load-bearing now. effective=calculated for sap_version<10.2 (StubRebaseliner floor 10.0->10.2); >=10.2 keeps lodged + logs divergence; a strict-raise aborts the batch (load-bearing for bills regardless of version). - CONTEXT: EPC Energy Derivation -> Bill Derivation (no "service" suffix); Baseline Performance energy block = per-end-use kWh + per-section bill + total; Fuel Rates = committed static snapshot; Rebaselining trigger threshold 10.2. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 09:20:50 +00:00
Khalim Conn-Kowlessar	561e1b8b49	feat(baseline): run Sap10Calculator in shadow on Property Baseline (ADR-0013) Wire Sap10Calculator into PropertyBaselineOrchestrator as a non-load-bearing shadow runner. For each property it scores the Effective EPC beside the load-bearing Lodged/Effective write, catches any strict-raise -> log.error (never aborts the batch), and on success log.warning's divergence from Lodged: SAP \|continuous - lodged\| > 0.5; PEUI/CO2 > 1% relative (CO2 after kg->tonnes). Every line is tagged with sap_version so SAP-10.2 signal separates from older-spec drift (ADR-0010 Validation Cohort). Per ADR-0013, Calculated SAP10 Performance is not a persisted third value-set: effective = calculated in every baselining scenario, so the calculator IS the mechanism that produces Effective Performance (the Rebaseliner). It runs in shadow only while being hardened; when overrides/estimation land it is promoted to drive Effective and the failure posture flips to abort (ADR-0012, calculator now load-bearing). No table change. - ADR-0013 + CONTEXT (Calculated SAP10 Performance / Effective Performance / Rebaselining) record the decision. - CalculatorShadow port + LoggingCalculatorShadow + Calculator protocol. - FakeCalculatorShadow for orchestrator unit tests. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 08:01:47 +00:00
Khalim Conn-Kowlessar	457d959b1f	refactor(property-baseline): rename baseline → property_baseline aggregate (PR #1139 review) Wholesale rename of the Baseline aggregate to PropertyBaseline for clarity / to disambiguate from baselines that appear elsewhere in Modelling. Scoped to this aggregate only — the distinct Rebaselining term (rebaseline_reason, StubRebaseliner, RebaselineNotImplemented) is deliberately untouched. - domain/baseline → domain/property_baseline; BaselinePerformance → PropertyBaselinePerformance. - repositories/baseline → repositories/property_baseline; BaselineRepository / BaselinePostgresRepository → PropertyBaseline*. - orchestration/baseline_orchestrator.py → property_baseline_orchestrator.py; BaselineOrchestrator → PropertyBaselineOrchestrator. BaselineStage → PropertyBaselineStage. - infrastructure/postgres: baseline_performance_table.py → property_baseline_performance_table.py; table `baseline_performance` → `property_baseline_performance`; Model renamed. - UnitOfWork attribute `.baseline` → `.property_baseline`. - Docs: ADR-0004 references + migration doc (renamed to property-baseline-performance-table.md) updated. CONTEXT.md glossary term ("Baseline Performance") left as-is pending a ubiquitous-language call (raised on the PR). 123 tests pass; pyright strict clean (only the unrelated pre-existing moto import errors remain). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	7275850c9e	refactor(orchestration): wire stages onto the UnitOfWork; per-stage commit (#1138 ) Replaces the handler's whole-pipeline Session (one transaction across all three stages, connection pinned during Ingestion's external IO) with a Unit-of-Work per stage (ADR-0012, added here). Each stage runs its batch in one unit and commits once; any property raising aborts the batch and the subtask fails noisily. - BaselineOrchestrator(unit_of_work, rebaseliner): one unit for the batch, commit once. Raise on a pre-SAP10 property leaves the unit uncommitted. - IngestionOrchestrator(unit_of_work, epc_fetcher, geospatial_repo, solar_fetcher): fetch/write split — phase 1 fetches the whole batch (EPC / coords / solar) with NO unit open; phase 2 writes in one unit and commits. The connection is never held during external IO. Geospatial S3 repo stays injected (reference data, not transactional). - Handler: module-scoped engine (pool reused across warm invocations) + a UoW factory; whole-pipeline `with Session` gone. `build_first_run_pipeline` composes on the factory. Source clients still behind the raising seam. - ADR-0012 records the decision (per-stage boundary, all-or-nothing batch, idempotent re-run, fetch/write split, module-scoped engine). Modelling stub left untouched (no-op, no DB) per the ADR. Tests: orchestrators on a shared FakeUnitOfWork (assert persisted batch + exactly-once commit + no-commit-on-raise). New real-DB E2E integration test: real PostgresUnitOfWork, Ingestion writes the EPC → Baseline reads it back through the repo → re-run replaces, not duplicates (1 EPC row, 1 baseline row after two runs). 121 pass in tests/; pyright strict clean; AAA. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	9f22b0aae8	feat(baseline): BaselineOrchestrator + BaselinePerformance aggregate (#1135 ) Stage 2 of First Run. Establishes each Property's Baseline Performance from persisted source data and writes it back — reads only from repos, never a Fetcher or HTTP (ADR-0003), so it is byte-identical whether Ingestion ran milliseconds ago or last week. Domain (`domain/baseline/`): - `Performance` VO — the four rated quantities: SAP / EPC Band / CO2 / Primary Energy Intensity. `lodged_performance(epc)` reads them off the EPC's recorded fields (PEUI = `energy_consumption_current`). - `BaselinePerformance` (ADR-0004) — the paired `lodged` + `effective` Performance + `rebaseline_reason`, plus the no-derivation part of the energy block (`space_heating_kwh` / `water_heating_kwh`, off the RHI, deterministic per ADR-0006). Both halves always populated. - `Rebaseliner` port + `StubRebaseliner`: the re-score-on-override seam (ADR-0011). SAP10 certs pass through (effective == lodged, reason "none"); a pre-SAP10 cert raises `RebaselineNotImplemented` rather than fabricating a plausible-but-wrong "none" — ML rebaselining is not wired yet. Mirrors the repo's strict-raise culture. Persistence: new `BaselineRepository` port + `BaselinePostgresRepository` + flat-column `baseline_performance` SQLModel (one row per Property). Per ADR-0004's amendment this is a standalone table, NOT columns on the retiring `property_details_epc`. Production migration is FE-owned (Drizzle) — docs/migrations/baseline-performance-table.md. Docs (grill-with-docs): corrected CONTEXT.md Lodged/Effective Performance to Primary Energy Intensity (the term collided with its own _Avoid_ entry under "heat demand") + fixed stale RHI field names; amended ADR-0004 Consequences for the standalone-table decision. Fuel split + bills (rest of EPC Energy Derivation) deferred to a follow-up — they need a Fuel Rates source (Ofgem-cap ETL) that does not exist yet. TDD, one test -> one impl: 7 tests (lodged read, rebaseliner pass-through + raise, orchestrator establish-and-persist + pre-SAP10 raise, Postgres round-trip + absent). pyright strict clean; AAA layout. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Khalim Conn-Kowlessar	8291f29721	docs(ara): composable stage-orchestrator design (ADR-0011 + ADR-0003 amend + CONTEXT) Records the grill-with-docs outcomes for the ara_first_run rebuild: three composable stage orchestrators (Ingestion/Baseline/Modelling), one lambda per use case chaining them through repos (not in-memory), and the Fetcher-vs-Repo data-source taxonomy. Amends ADR-0003's chaining rule to generalise beyond RefreshOrchestrator. Adds the pipeline-composition + First Run vocabulary to CONTEXT.md. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 16:28:48 +00:00
Jun-te Kim	5470fa1d93	move landlord overrides	2026-06-01 15:46:46 +00:00
Jun-te Kim	8a9d14a45c	landlord overrids moved into one repo	2026-06-01 15:16:23 +00:00
Khalim Conn-Kowlessar	c3691d9af2	refactor(property-baseline): rename baseline → property_baseline aggregate (PR #1139 review) Wholesale rename of the Baseline aggregate to PropertyBaseline for clarity / to disambiguate from baselines that appear elsewhere in Modelling. Scoped to this aggregate only — the distinct Rebaselining term (rebaseline_reason, StubRebaseliner, RebaselineNotImplemented) is deliberately untouched. - domain/baseline → domain/property_baseline; BaselinePerformance → PropertyBaselinePerformance. - repositories/baseline → repositories/property_baseline; BaselineRepository / BaselinePostgresRepository → PropertyBaseline*. - orchestration/baseline_orchestrator.py → property_baseline_orchestrator.py; BaselineOrchestrator → PropertyBaselineOrchestrator. BaselineStage → PropertyBaselineStage. - infrastructure/postgres: baseline_performance_table.py → property_baseline_performance_table.py; table `baseline_performance` → `property_baseline_performance`; Model renamed. - UnitOfWork attribute `.baseline` → `.property_baseline`. - Docs: ADR-0004 references + migration doc (renamed to property-baseline-performance-table.md) updated. CONTEXT.md glossary term ("Baseline Performance") left as-is pending a ubiquitous-language call (raised on the PR). 123 tests pass; pyright strict clean (only the unrelated pre-existing moto import errors remain). Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-01 14:54:59 +00:00
Khalim Conn-Kowlessar	48a488d1e9	refactor(orchestration): wire stages onto the UnitOfWork; per-stage commit (#1138 ) Replaces the handler's whole-pipeline Session (one transaction across all three stages, connection pinned during Ingestion's external IO) with a Unit-of-Work per stage (ADR-0012, added here). Each stage runs its batch in one unit and commits once; any property raising aborts the batch and the subtask fails noisily. - BaselineOrchestrator(unit_of_work, rebaseliner): one unit for the batch, commit once. Raise on a pre-SAP10 property leaves the unit uncommitted. - IngestionOrchestrator(unit_of_work, epc_fetcher, geospatial_repo, solar_fetcher): fetch/write split — phase 1 fetches the whole batch (EPC / coords / solar) with NO unit open; phase 2 writes in one unit and commits. The connection is never held during external IO. Geospatial S3 repo stays injected (reference data, not transactional). - Handler: module-scoped engine (pool reused across warm invocations) + a UoW factory; whole-pipeline `with Session` gone. `build_first_run_pipeline` composes on the factory. Source clients still behind the raising seam. - ADR-0012 records the decision (per-stage boundary, all-or-nothing batch, idempotent re-run, fetch/write split, module-scoped engine). Modelling stub left untouched (no-op, no DB) per the ADR. Tests: orchestrators on a shared FakeUnitOfWork (assert persisted batch + exactly-once commit + no-commit-on-raise). New real-DB E2E integration test: real PostgresUnitOfWork, Ingestion writes the EPC → Baseline reads it back through the repo → re-run replaces, not duplicates (1 EPC row, 1 baseline row after two runs). 121 pass in tests/; pyright strict clean; AAA. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-31 09:54:47 +00:00
Khalim Conn-Kowlessar	76717dfc3a	feat(baseline): BaselineOrchestrator + BaselinePerformance aggregate (#1135 ) Stage 2 of First Run. Establishes each Property's Baseline Performance from persisted source data and writes it back — reads only from repos, never a Fetcher or HTTP (ADR-0003), so it is byte-identical whether Ingestion ran milliseconds ago or last week. Domain (`domain/baseline/`): - `Performance` VO — the four rated quantities: SAP / EPC Band / CO2 / Primary Energy Intensity. `lodged_performance(epc)` reads them off the EPC's recorded fields (PEUI = `energy_consumption_current`). - `BaselinePerformance` (ADR-0004) — the paired `lodged` + `effective` Performance + `rebaseline_reason`, plus the no-derivation part of the energy block (`space_heating_kwh` / `water_heating_kwh`, off the RHI, deterministic per ADR-0006). Both halves always populated. - `Rebaseliner` port + `StubRebaseliner`: the re-score-on-override seam (ADR-0011). SAP10 certs pass through (effective == lodged, reason "none"); a pre-SAP10 cert raises `RebaselineNotImplemented` rather than fabricating a plausible-but-wrong "none" — ML rebaselining is not wired yet. Mirrors the repo's strict-raise culture. Persistence: new `BaselineRepository` port + `BaselinePostgresRepository` + flat-column `baseline_performance` SQLModel (one row per Property). Per ADR-0004's amendment this is a standalone table, NOT columns on the retiring `property_details_epc`. Production migration is FE-owned (Drizzle) — docs/migrations/baseline-performance-table.md. Docs (grill-with-docs): corrected CONTEXT.md Lodged/Effective Performance to Primary Energy Intensity (the term collided with its own _Avoid_ entry under "heat demand") + fixed stale RHI field names; amended ADR-0004 Consequences for the standalone-table decision. Fuel split + bills (rest of EPC Energy Derivation) deferred to a follow-up — they need a Fuel Rates source (Ofgem-cap ETL) that does not exist yet. TDD, one test -> one impl: 7 tests (lodged read, rebaseliner pass-through + raise, orchestrator establish-and-persist + pre-SAP10 raise, Postgres round-trip + absent). pyright strict clean; AAA layout. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-30 21:21:34 +00:00
Khalim Conn-Kowlessar	5aebd90ef7	docs(ara): composable stage-orchestrator design (ADR-0011 + ADR-0003 amend + CONTEXT) Records the grill-with-docs outcomes for the ara_first_run rebuild: three composable stage orchestrators (Ingestion/Baseline/Modelling), one lambda per use case chaining them through repos (not in-memory), and the Fetcher-vs-Repo data-source taxonomy. Amends ADR-0003's chaining rule to generalise beyond RefreshOrchestrator. Adds the pipeline-composition + First Run vocabulary to CONTEXT.md. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-05-30 19:26:17 +00:00
Jun-te Kim	429e138ea7	merge conflicts	2026-05-28 14:00:19 +00:00
Jun-te Kim	8422041215	landlord overrid orchestration	2026-05-26 15:27:45 +00:00
Khalim Conn-Kowlessar	a7b08a4e8f	refactor: move docs/sap-spec/ contents into domain/sap10_calculator/ Locality of reference — SAP-specific docs, specs, and runtime data now live alongside the calculator that consumes them, mirroring the prior packages→domain layout moves. Move targets: - Narrative MDs → domain/sap10_calculator/docs/ NEXT_AGENT_PROMPT.md, HANDOVER_NEXT.md, SAP_CALCULATOR.md - Spec PDFs → domain/sap10_calculator/docs/specs/ RdSAP 10 Specification 10-06-2025.pdf PCDF_Spec_Rev-06b_12_May_2021.pdf sap-10-2-full-specification-2025-03-14.pdf sap-10-3-full-specification-2026-01-13.pdf - PCDB runtime data → domain/sap10_calculator/tables/pcdb/data/ pcdb10.dat (8.3MB) + 7× pcdb_table_*.jsonl (18MB total) Path code rewrites (load-bearing): - tables/pcdb/__init__.py: replaced parents[4]/'docs'/'sap-spec' with Path(__file__).resolve().parent/'data' for Table 105 JSONL loading. - tables/pcdb/postcode_weather.py: same rebase for the pcdb10.dat path read by _postcode_climate_table(). - tables/pcdb/etl.py __main__: same rebase for the manual ETL invocation (source + output_dir both now point inside the package). - tests/test_pcdb_etl.py: _PCDB_DAT_PATH now derives from parents[1]/'tables'/'pcdb'/'data' (was parents[3]/'docs'/'sap-spec'). Citation rewrites: - 12 .py docstrings and 4 .md docs (ADRs + READMEs + narrative docs) had `docs/sap-spec/<file>` strings rewritten to their new locations. - Two cases where the catch-all sed misfired (an ADR-0009 line about a PCDB extract; the pcdb __init__.py docstring about ETL output) were hand-corrected to point at tables/pcdb/data/ rather than docs/specs/. docs/sap-spec/ is now empty (will be removed in a follow-up sweep or left as a vestigial empty dir for future repurposing). ADRs 0009 and 0010 remain at docs/adr/ — they're part of the chronological cross-cutting decision log, not calculator-specific narrative. Verified: - Calculator's 1e-4 production gate (test_api_001479_full_chain_sap_matches_worksheet_pdf_exactly) GREEN. - Wider sweep (domain/sap10_calculator/ + domain/sap10_ml/): 1654 passed / 20 failed — exact pre-move baseline. All 20 failures pre-existing (10 hand-built skeleton + 4 cohort chain + 6 cohort diff). - Pyright net-zero on the 4 touched runtime/test files (0 errors) and unchanged on heat_transmission.py (13) / cert_to_inputs.py (35) / mapper.py (33). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-26 13:17:18 +00:00
Khalim Conn-Kowlessar	68401c517a	refactor: lift-and-shift packages/domain/src/domain/ml → domain/sap10_ml Sibling migration to the sap10_calculator move — `domain.ml` now lives at the root-level layout (`domain/sap10_ml/`) matching the pattern already used by `domain.addresses`, `domain.tasks`, `domain.postcode`, and `domain.sap10_calculator`. Changes: - `git mv packages/domain/src/domain/ml → domain/sap10_ml` (19 files; history preserved). - Subpackage rename: `domain.ml` → `domain.sap10_ml`. 32 references rewritten across .py and .md files: 11 internal + 21 external (datatypes/epc/domain/mapper.py, 14 files in domain/sap10_calculator, 2 backend tests, 2 ADRs, 1 README, 1 design doc). - Path-string updates: `pytest.ini` testpath `packages/domain/src/domain/ml/tests` → `domain/sap10_ml/tests` so ML tests stay in the default auto-discovered sweep. `CONTEXT.md` also updated. `packages/domain/src/domain/` is now empty — the workspace `domain/` tree has been fully migrated. Together with the `domain/__init__.py` deletions from the sap10_calculator commit (`29ac35cc`), `domain` is now a single root-level namespace package with subpackages {addresses, sap10_calculator, sap10_ml, tasks} + the standalone `postcode.py` module. Verified: - Focused sweep (backend mapper-chain + sap10_calculator worksheet e2e + golden fixtures): 99 passed / 19 failed — identical baseline. - Wider sweep (all sap10_calculator + sap10_ml): 1654 passed / 20 failed (same pre-existing failures). - domain/sap10_ml/tests: 210/210 PASSED at new path. - Pyright net-zero: heat_transmission.py 13, cert_to_inputs.py 35, mapper.py 33, rdsap_uvalues.py 1 (all unchanged from baseline). Note: `packages/domain/pyproject.toml` still declares `packages = ["src/domain"]` for the hatchling wheel — that target directory is now empty and the wheel build is effectively a no-op. Retiring the workspace package or repointing the wheel is a follow-up. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-26 13:01:35 +00:00
Khalim Conn-Kowlessar	29ac35ccbe	refactor: lift-and-shift packages/domain/src/domain/sap → domain/sap10_calculator Migration of the SAP 10.2 calculator package from the uv-workspace src-layout (`packages/domain/src/domain/sap`) to the root-level layout (`domain/sap10_calculator`), matching the pattern already used by `domain.addresses` / `domain.tasks` / `domain.postcode`. Changes: - `git mv packages/domain/src/domain/sap → domain/sap10_calculator` (92 files; git auto-detected all as renames so blame/history is preserved). - Subpackage rename: `domain.sap` → `domain.sap10_calculator`. 48 Python files rewritten (`from domain.sap.X` → `from domain.sap10_ calculator.X`); zero remaining `domain.sap` refs after the sed pass. - Path-string updates: 3 .py files (test fixtures + xlsx loader) + 6 markdown docs (CONTEXT.md, 2 ADRs, 3 sap-spec docs, sap10_ calculator/README.md) had hard-coded `packages/domain/src/domain/ sap/...` paths rewritten to `domain/sap10_calculator/...`. - `Path(__file__).parents[N]` rebasing: the old tree was 3 levels deeper than the new one (`packages/domain/src/`), so 4× `parents[7]` became `parents[4]` and 1× `parents[6]` became `parents[3]` across `tables/pcdb/{__init__.py, postcode_weather.py, etl.py}`, `worksheet/tests/_xlsx_loader.py`, and `tests/test_pcdb_etl.py`. - PEP 420 namespace package: deleted both `domain/__init__.py` (root + workspace, both load-bearing only as empty/docstring) so Python combines `domain.sap10_calculator` (root) and `domain.ml` (workspace) into one namespace package. Confirmed via `domain.__path__ == ['/workspaces/model/domain', '/workspaces/model/packages/domain/src/domain']`. Without this, the root `domain/__init__.py` shadowed the workspace one and `domain.ml` was unreachable. Verified: - Full sweep (`backend/documents_parser/tests/test_summary_pdf_ mapper_chain.py + domain/sap10_calculator/worksheet/tests/test_ e2e_elmhurst_sap_score.py + domain/sap10_calculator/rdsap/tests/ test_golden_fixtures.py`): 99 passed / 19 failed — exact same counts as pre-refactor. All 19 failures pre-existing (9 hand-built 001479 + 6 cohort diff + 4 cohort chain non-spec). - Wider sweep (all sap10_calculator + domain.ml): 1654 passed / 20 failed (the +1 vs the focused sweep is the pre-existing `test_roof_insulated_assumed_with_ni_thickness_uses_50mm_per_ section_5_11_4` which was already failing on the previous baseline). - Pyright net-zero on the three load-bearing baselines: `heat_transmission.py` 13, `cert_to_inputs.py` 35, `mapper.py` 33. Lift-and-shift only — no semantic renames (`Sap10Calculator` stays `Sap10Calculator`), no testpaths edits in pytest.ini (sap tests continue to be invoked by explicit pytest paths). Note: `domain.ml` still lives at `packages/domain/src/domain/ml/`. Migrating it would close out the dual-`domain/` layout but is out of scope for this commit. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-26 12:22:37 +00:00
Khalim Conn-Kowlessar	6c966ffe2b	docs: handover for Table 3c two-profile combi loss → close 4 Elmhurst fixtures Rewrites HANDOVER_NEXT.md for the next agent. Two-ticket sequence: 1. Table 3c (immediate): implement SAP10.2 Appendix J §J3 two-profile combi-loss formula + route PCDB records with separate_dhw_tests=2 through it. Closes 000477/000480/000487/000516 from SAP delta +1/+12/+11/+12 to delta=0. Currently those fall through to Table 3a keep-hot 600 kWh/yr default = ~25× overshoot. 2. RdSAP API integration test (end-state): real RdSAP10 API response → EpcPropertyDataMapper → cert_to_inputs → SAP integer == lodged. User generating exotic fixtures to pressure-test first. SPEC_COVERAGE §4 row updated to call out the Table 3c gap. ADR-0010 gains a "Cohort residual hunt + SAP 10.2 rating constants" amendment documenting the 5 component closures (secondary heating, ventilation cert lodgement, Table 4f pumps_fans, SAP 10.2 rating constants, 000477 partial) and naming the deferred Table 3c work. Carries a PCDF parser concern: raw row at index 52 has 13.729 which looks like F2-annual-kWh but parser reads F2 from fields[55] = 0.0. Verify field positions per BRE PCDF Spec §7.11 before assuming F2=0. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-22 12:14:00 +00:00
Khalim Conn-Kowlessar	fd9df9e502	Appendix L slice 3: docs — SPEC_COVERAGE rows + ADR-0010 amendment + heuristic deprecation note SPEC_COVERAGE: - §5 row: note new `annual_lighting_kwh` public leaf + InternalGainsResult field + per-fixture U985 (232) abs=1e-4 pin across all 6 Elmhurst fixtures. - Appendix L row: "Full (cost + gains)" — closes both sides via the same L1-L11 cascade; legacy heuristic noted with rip-pending callsites. ADR-0010 Amendment "Appendix L lighting (2026-05-22)": - Two engine bugs surfaced + fixed: cosine modulation integral (uniform +0.146% bias from continuous-formula vs Σ(L11 monthly)) and cert EPC under-lodgement (`build_epc()` skipped bulb counts + windows). - 000474 hits SAP integer delta=0 (first Elmhurst fixture across the gate). - 000490 SAP integer + fuel cost xfailed (strict) — Appendix L direction correct, other components broken (fuel pricing, Table D1-3 Ecodesign, main heating +2.5%). Tracked as next ticket. - Golden cohort PE tolerance widened 30→35 with rationale. - Deferred work: cohort SAP-integer residual hunt, heuristic deletion, RdSAP→API integration test (end-state e2e harness). `predicted_lighting_kwh` deprecation note: cite ADR-0010 amendment; name the two legacy callsites (`domain.ml.ecf`, `domain.ml.transform`) that block deletion. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-22 09:34:09 +00:00
Khalim Conn-Kowlessar	ae8c946179	docs: §10a slice 3 — ADR-0010 amendment + SPEC_COVERAGE row - ADR-0010 amendment: narrow the SAP10.2 spec target — §10a/§10b cost prices source from RdSAP10 Table 32 (per RdSAP10 §19.1), not SAP10.2 Table 12. CO2 + PEF stay on Table 12 (RdSAP10 §19.2 says they're identical). Closes out the 000490 "spec-version drift" framing as wrong-table + missing-standing-charges, not corpus drift. Names §4 HW + Appendix L as the next-ticket upstream debt that pre-§10a wrong-prices had been masking. - SPEC_COVERAGE: new §10a row (32-field FuelCostResult, three new tables/* + worksheet/* modules, per-line-ref status, Remaining §10a work list). Updates §12 to "folded into §10a". Updates header attribution. No code changes in this commit — docs only. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-21 20:11:37 +00:00
Khalim Conn-Kowlessar	bb9c5ac017	docs: ADR-0010 retargets calculator to SAP 10.2; rewrite handover Adds ADR-0010 superseding ADR-0009's spec-version target, PCDB sequencing, and cert-calibration layer. Captures the conclusions of a grill-with-docs session: 1. Active spec target is SAP 10.2 (14-03-2025), not SAP 10.3 — no SAP-10.3-lodged certs exist in the corpus to validate against. 2. table_12_cert_calibration is deleted (not "re-derived at the end"). It was pre-March-2025 spec prices fit against a mixture distribution of two spec-version regimes, with downstream- component bugs absorbed into the fit — not Elmhurst deviation. 3. Validation Cohort: filter the corpus to inspection_date ≥ 2025-07-01 so every cert in the probe was lodged on SAP 10.2 (14-03-2025) prices. One spec, one signal. 4. PCDB integration is promoted from "Session C deferred" to prerequisite P4 — dominates residual variance on heat pumps and the 78% of gas-boiler certs lodging main_heating_data_source=1. 5. Trace mode (SapResult.intermediate) and BRE worked-example fixtures replace the 7 cert-based golden fixtures, which contained compensating errors. 6. Strict-type EpcPropertyData via codes.csv-derived canonical enums (P6) — the in-source motivation lives at dimensions.py:74-82 (Khalim's comment, included in this commit). 7. Worksheet-faithful structure is a sweep-time principle: each worksheet module mirrors SAP 10.2 worksheet line numbering. CONTEXT.md additions: - Refined "Calculated SAP10 Performance" and "SAP10 Calculation" to reference SAP 10.2 + ADR-0010. - New term "SAP Spec Version" — domain-meaningful because the same EpcPropertyData yields different sap_score under different spec revisions. - New term "Validation Cohort" — the version-locked sub-corpus. HANDOVER_SYSTEMATIC_REVIEW.md is rewritten section-by-section to reflect ADR-0010: §1 framing, §2 status pointer, new §2.5 with the six prerequisites P1–P6 in dependency order, §3 diagnosis (cert-cal was stale prices, not Elmhurst deviation), §4 scope (PCDB IN, SAP 10.3 stays OUT), §5 approach (worksheet-faithful principle as §5.5), §7 tension dissolved, §7b findings re-framed, §8 dead-ends re-classified as conditional, §9 cohort filter, §10 fixture strategy, §11 trace mode as prerequisite, §12 prereqs-first, §13 Phase 0/Phase 1 workflow, §14 ADR-0010 reference, §15 final note. P2.1 (commit `ac1aa56a`) already lands the first ADR-0010 slice (probe swap to spec prices). Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-19 09:54:24 +00:00
Khalim Conn-Kowlessar	8dbe873daf	ADR-0009: pivot to deterministic SAP 10.3 calculator (Accepted) Promotes ADR-0009 from Proposed to Accepted after the grill-with-docs session resolved all seven open questions. Bundles the SAP 10.3 and RdSAP 10 specifications under docs/sap-spec/ plus a calculator design sketch (module layout, monthly-loop pseudo-code, status table). CONTEXT.md adds three new domain terms parallel to existing performance language: - Calculated SAP10 Performance (parallel to Effective / Lodged) - SAP10 Calculation (process; implemented by Sap10Calculator) - Measure Application (process; implemented by MeasureApplicator) ML pipeline is NOT retired — it stays as the residual head once the calculator reaches parity in Session B. ADR-0009 §"Grill outcomes" carries the seven binding scope decisions plus three Session-A-scope changes discovered during the grill (RdSAP §19 EER formula, SAP 10.2 Appendix A cross-reference, RdSAP Table 29 cascade defaults).	2026-05-17 21:27:21 +00:00
Khalim Conn-Kowlessar	f61d74a327	docs: ADR-0008 physics-as-feature + v16.0.0 schema bump Captures the slice-16 plan decisions before code lands: - Mid-physics: predicted_ecf + predicted_log10_ecf, NOT predicted_sap_score - Cost scope: heating + DHW + lighting (no PV/pumps/secondary) - Crude annual heat-demand calc (HLC * HDH / efficiency) - Cascade-defaulting U-value imputation - envelope_heat_loss_w_per_k sums all parts; extension_1 only as discrete features (88% null drops extension_2) - v16.0.0 MAJOR bump (rename secondary_dwelling_* -> extension_1_*); coordinated cutover with AutoGluon repo + scoring lambda - LightGBM objective="mape" for sap_score+peui_ucl in 16g; sample weights deferred	2026-05-17 11:20:40 +00:00
Khalim Conn-Kowlessar	611ff24eb6	scaffolding for ml pipeline	2026-05-16 14:15:56 +00:00
Khalim Conn-Kowlessar	d9c1696085	added architechtural decisions, added to prd	2026-05-13 21:26:18 +00:00

36 commits