Files
Andrew Charlwood adc1dbfc58 feat: complete Task 2.2 - test refresh pipeline with Snowflake data
Tested full refresh pipeline end-to-end with real Snowflake data:
- Fixed trust filter to read Name column from defaultTrusts.csv
- Fixed Decimal type handling in calculate_cost_per_patient_per_annum
- Fixed array handling in convert_to_records for average_administered
- Added required reference CSV files to data/ directory
- Configured Snowflake connection (account, warehouse, user)

Results:
- Snowflake fetch: 656,695 records in ~7s
- Transformations: 519,848 records after UPID/drug/directory
- Pathway nodes: 293 for all_6mo (8 trusts, 14 directories)
- Total processing time: ~6.2 minutes
2026-02-05 00:20:12 +00:00

10 lines
417 B
CSV

Code,Name
RM1,NORFOLK AND NORWICH UNIVERSITY HOSPITALS NHS FOUNDATION TRUST
RGR,WEST SUFFOLK NHS FOUNDATION TRUST
RGT,CAMBRIDGE UNIVERSITY HOSPITALS NHS FOUNDATION TRUST
RCX,THE QUEEN ELIZABETH HOSPITAL
RGP,JAMES PAGET UNIVERSITY HOSPITALS NHS FOUNDATION TRUST
RGM,ROYAL PAPWORTH HOSPITAL NHS FOUNDATION TRUST
RGN,NORTH WEST ANGLIA NHS FOUNDATION TRUST
RRV,UNIVERSITY COLLEGE LONDON HOSPITALS NHS FOUNDATION TRUST