Model

mirror of https://github.com/Hestia-Homes/Model.git synced 2026-06-08 11:17:27 +00:00

History

Khalim Conn-Kowlessar fd8d71eb05 slice 15e: per-decile residuals reporting in train_baseline Adds `_per_decile_residuals` and writes `residuals_<target>.json` next to metrics.json. Buckets test-set rows by deciles of the true target value; each bucket carries count + MAPE + MAE + mean residual + true_min/max. Lets us tell whether errors concentrate in the tails of the true distribution (e.g. SAP<40 / SAP>85) vs the mid-band — which the global MAPE alone hides. Baseline for slice 16's MAPE-improvement ablations.		2026-05-17 11:18:40 +00:00
..
__init__.py	slice 14a: ml_training_data pkg + sample.py (CSV filter + random sample)	2026-05-16 17:39:43 +00:00
test_build_features.py	slice 14h: handle real bulk-JSON shape (NDJSON wrappers + document payload)	2026-05-16 19:45:52 +00:00
test_bulk_zip_reader.py	slice 14h: handle real bulk-JSON shape (NDJSON wrappers + document payload)	2026-05-16 19:45:52 +00:00
test_sample.py	slice 14a: ml_training_data pkg + sample.py (CSV filter + random sample)	2026-05-16 17:39:43 +00:00
test_storage.py	slice 14c: BulkZipReader streams certs from gov bulk JSON ZIP	2026-05-16 18:27:24 +00:00
test_train_baseline.py	slice 15e: per-decile residuals reporting in train_baseline	2026-05-17 11:18:40 +00:00
test_write_parquet.py	slice 14e: write_training_dataset emits parquet + schema.json + manifest.json	2026-05-16 18:43:31 +00:00