mirror of
https://github.com/Hestia-Homes/Model.git
synced 2026-06-08 11:17:27 +00:00
The 6 worksheet fixtures build EpcPropertyData by hand, validating the cascade in isolation from the mapper. This commit lands the first half of the OTHER validation: Summary_000474.pdf → ElmhurstSiteNotesExtractor → from_elmhurst_site_notes → EpcPropertyData, asserting it produces the same shape as the hand-built fixture. Test is strict-xfail on sap_building_parts count (mapper produces 1, cert lodges 3). Includes a pdftotext-layout preprocessor that converts spatial label/value layout into the Textract-style sequence the existing extractor expects (test-only). Full punch list of 28 mapper-output diffs captured in project memory. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> |
||
|---|---|---|
| .. | ||
| elmhurst_site_notes_1_text.json | ||
| elmhurst_site_notes_2_text.json | ||
| ElmhurstSiteNotes.pdf | ||
| ElmhurstSiteNotes_2.pdf | ||
| pashub_site_notes_1_text.json | ||
| pashub_site_notes_2_text.json | ||
| pashub_site_notes_3_text.json | ||
| pashub_site_notes_4_text.json | ||
| pashub_site_notes_5_text.json | ||
| pashub_site_notes_6_text.json | ||
| pashub_site_notes_7_text.json | ||
| PasHubSiteNotes_1.pdf | ||
| PasHubSiteNotes_2.pdf | ||
| PasHubSiteNotes_3.pdf | ||
| PasHubSiteNotes_4.pdf | ||
| PasHubSiteNotes_5.pdf | ||
| PasHubSiteNotes_6.pdf | ||
| PasHubSiteNotes_7.pdf | ||
| Summary_000474.pdf | ||