docs: update progress.txt with Iteration 6 results (Task 3.2)

This commit is contained in:
Andrew Charlwood
2026-02-05 23:55:26 +00:00
parent c6e426e36c
commit b674543878
+1 -1
View File
@@ -343,7 +343,7 @@ This project extends the indication-based pathway charts (Phase 1-5 complete) wi
- data_processing/diagnosis_lookup.py (batch_size default 500 → 5000) - data_processing/diagnosis_lookup.py (batch_size default 500 → 5000)
- cli/refresh_pathways.py (batch_size 500 → 5000) - cli/refresh_pathways.py (batch_size 500 → 5000)
- IMPLEMENTATION_PLAN.md (marked 3.2 subtasks [x]) - IMPLEMENTATION_PLAN.md (marked 3.2 subtasks [x])
### Committed: [pending] ### Committed: c6e426e "fix: increase network timeout and batch size for GP lookup queries (Task 3.2)"
### Patterns discovered: ### Patterns discovered:
- Snowflake Python connector `network_timeout` (set via connection_timeout in config) controls client-side wait time for ALL query responses, not just connection establishment. Must be high enough for slow queries. - Snowflake Python connector `network_timeout` (set via connection_timeout in config) controls client-side wait time for ALL query responses, not just connection establishment. Must be high enough for slow queries.
- PrimaryCareClinicalCoding query performance is dominated by CTE compilation (~40s fixed cost), not by patient count. Larger batches (5000 vs 500) are dramatically more efficient. - PrimaryCareClinicalCoding query performance is dominated by CTE compilation (~40s fixed cost), not by patient count. Larger batches (5000 vs 500) are dramatically more efficient.