survey-extraction/etl
2025-04-01 12:21:53 +00:00
..
db made db thing in serpeate file 2025-03-26 17:11:31 +00:00
files save changes 2025-03-20 07:39:52 +00:00
pdfReader sap score 2025-03-21 18:31:15 +00:00
ratecard/jjcRateCards made jjc automation work if I'm giving the survey sheet 2025-03-26 12:02:18 +00:00
scraper merge something I shuldn't 2025-03-27 11:38:08 +00:00
surveyedData fix windows problems 2025-03-20 15:04:02 +00:00
temp completed jjc invidiual values 2025-03-24 17:05:51 +00:00
tests finsihed writing floor area, add more test as I need them@ 2025-03-31 10:37:14 +00:00
transform made db thing in serpeate file 2025-03-26 17:11:31 +00:00
utils make logging less verbose 2025-03-13 06:57:03 +00:00
validator make output nicer 2025-03-17 21:25:17 +00:00
__init__.py moved files 2025-03-08 06:29:07 +00:00
daily_script.py added a new script 2025-03-19 09:14:26 +00:00
development.py save changes 2025-03-20 07:39:52 +00:00
jjc_invoice.py made jjc automation work if I'm giving the survey sheet 2025-03-26 12:02:18 +00:00
load_metadata.py load db first 2025-04-01 12:21:53 +00:00
README.md added a scraper class to do some calculation outside of script 2025-03-05 14:00:56 +00:00
scis_invoice.py local changes 2025-03-21 07:30:51 +00:00
sgec_invoice.py added new scripts 2025-03-21 10:22:52 +00:00
simple_load_example.py simple script to run tests 2025-03-28 16:09:03 +00:00

ETL

Extract, transform and load DATA

We get data from multiple places and merge them into one place.

Definition of multiple places: - Retro Team Sharepoint - Future Osmosis Sharepoint

Definition of one place: - into a CSV...today (03/03/2025)

  • Added sharepointclient that khalim made - Need to proof it works
  • Read a file from what khalim has shared

Add a local file:

  • mount a local folder directory wiht what Khalim sharepoint he has shared
  • REad files and file path

Once I have sharepoint api working:

  • [] Make validator for retro team
  • [] once validated, produce a csv file
  • [] show some cool productivity metric

Currently working on:

  • [On hold until i get sharepoint working] Validator

    • check names
    • [in progress, blocked unitl sharepoint. Easy to add] check it has dates
  • Useful file reader:

    • Khalim showed me a useful pdf, that I should try to extract and get some information
  • [] Share point connection Figure out how to use the sharepoint connector

  • With Khalim:

  • Check if I have access to sharepoint

  • [] Try and get his client API working and see if I can read files

MVP: Script we can run that will Go to share point fetch all the data ( in progress ) provide some form of output that shows the number of surverys done (Get this information!!!)

Flat table

Billing: Billing table, left join