HighCostDrugsDemo/data/defaultTrusts.csv
Andrew Charlwood adc1dbfc58 feat: complete Task 2.2 - test refresh pipeline with Snowflake data
Tested full refresh pipeline end-to-end with real Snowflake data:
- Fixed trust filter to read Name column from defaultTrusts.csv
- Fixed Decimal type handling in calculate_cost_per_patient_per_annum
- Fixed array handling in convert_to_records for average_administered
- Added required reference CSV files to data/ directory
- Configured Snowflake connection (account, warehouse, user)
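
The Decimal fix noted above can be illustrated with a minimal sketch. It assumes the Snowflake connector returns numeric columns as `decimal.Decimal`, which cannot be mixed with `float` in arithmetic. The helper names `to_float` and `annualised_cost_per_patient`, and the formula itself, are hypothetical and are not the repository's `calculate_cost_per_patient_per_annum`.

```python
from decimal import Decimal


def to_float(value):
    """Convert Decimal values to float; pass other values through unchanged."""
    if isinstance(value, Decimal):
        return float(value)
    return value


def annualised_cost_per_patient(total_cost, patients, days_observed):
    """Hypothetical formula: cost per patient scaled to a 365.25-day year."""
    total_cost = to_float(total_cost)
    patients = to_float(patients)
    days_observed = to_float(days_observed)
    # Guard against missing or zero denominators instead of raising.
    if total_cost is None or not patients or not days_observed:
        return None
    per_patient = total_cost / patients
    return per_patient * (365.25 / days_observed)


# Mixing Decimal and float directly raises TypeError, which is why
# values are normalised before any arithmetic.
try:
    Decimal("1.5") * 0.5
    mixed_ok = True
except TypeError:
    mixed_ok = False

result = annualised_cost_per_patient(Decimal("730.50"), 2, 365.25)
```

Normalising at the function boundary keeps the rest of the arithmetic in plain floats. If exact currency precision matters, converting the float operands to `Decimal` instead would be the alternative.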

Results:
- Snowflake fetch: 656,695 records in ~7s
- Transformations: 519,848 records after UPID/drug/directory
- Pathway nodes: 293 for all_6mo (8 trusts, 14 directories)
- Total processing time: ~6.2 minutes
2026-02-05 00:20:12 +00:00

417 B

Code,Name
RM1,NORFOLK AND NORWICH UNIVERSITY HOSPITALS NHS FOUNDATION TRUST
RGR,WEST SUFFOLK NHS FOUNDATION TRUST
RGT,CAMBRIDGE UNIVERSITY HOSPITALS NHS FOUNDATION TRUST
RCX,THE QUEEN ELIZABETH HOSPITAL
RGP,JAMES PAGET UNIVERSITY HOSPITALS NHS FOUNDATION TRUST
RGM,ROYAL PAPWORTH HOSPITAL NHS FOUNDATION TRUST
RGN,NORTH WEST ANGLIA NHS FOUNDATION TRUST
RRV,UNIVERSITY COLLEGE LONDON HOSPITALS NHS FOUNDATION TRUST
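
The commit message says the trust filter reads the Name column from this file. A minimal sketch of that idea follows, assuming a comma-delimited file with a `Code,Name` header. The function names `load_trust_names` and `filter_by_trust` and the `provider_name` record key are hypothetical, not taken from the repository.

```python
import csv
import io

# Inline copy of the first rows so the sketch runs without the file on disk.
SAMPLE = """Code,Name
RM1,NORFOLK AND NORWICH UNIVERSITY HOSPITALS NHS FOUNDATION TRUST
RGR,WEST SUFFOLK NHS FOUNDATION TRUST
"""


def load_trust_names(handle):
    """Return the set of non-empty values in the Name column."""
    reader = csv.DictReader(handle)
    return {row["Name"].strip() for row in reader if row.get("Name")}


def filter_by_trust(records, trust_names, key="provider_name"):
    """Keep only records whose trust name appears in trust_names."""
    return [record for record in records if record.get(key) in trust_names]


trust_names = load_trust_names(io.StringIO(SAMPLE))

# Hypothetical records standing in for rows fetched from Snowflake.
records = [
    {"provider_name": "WEST SUFFOLK NHS FOUNDATION TRUST", "row_id": 1},
    {"provider_name": "SOME OTHER TRUST", "row_id": 2},
]
kept = filter_by_trust(records, trust_names)
```

With the real file, `load_trust_names(open("data/defaultTrusts.csv", newline=""))` would replace the inline sample. Matching on Name rather than Code only works if the fetched records carry the trust name in exactly the same form.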