admin/portfolio

Fork 0

Files

T

admin 657d2f299e Added .env, and ammended .gitignore

2026-02-15 20:32:46 +00:00

13 KiB

Raw Blame History

PRD: Semantic Search & AI Chat

Introduction

The portfolio's command palette currently uses Fuse.js for fuzzy string matching across ~40 palette items. While it handles typos, it doesn't understand intent — searching "NHS leadership" won't surface relevant roles unless those exact words appear in the keywords field. This PRD covers two complementary features:

Phase 1 — Semantic Vector Search: Replace Fuse.js with pre-computed embeddings and cosine similarity, enabling meaning-based search in the existing command palette. Zero runtime API cost.
Phase 2 — AI Chat Widget: A floating chat button (bottom-right, like a support chat) powered by Google Gemini Flash. Visitors can ask natural language questions about Andy's experience. Hybrid responses: conversational answer + relevant portfolio items.

Goals

Enable meaning-based search (e.g., "data visualization" matches Power BI dashboards, analytics roles)
Maintain instant search performance (<50ms) in the command palette via client-side vectors
Add a conversational "Ask about me" chat widget powered by Gemini Flash
Keep the existing command palette UX (Ctrl+K, keyboard nav, grouped results) intact
Hybrid chat responses: short natural language answer + clickable portfolio items

User Stories

Phase 1: Semantic Vector Search

US-001: Generate embeddings at build time

Description: As a developer, I want a build script that generates embeddings for all palette items so they ship as a static asset.

Acceptance Criteria:

Node script scripts/generate-embeddings.ts reads all palette data from src/lib/search.ts
Uses the same ONNX model (all-MiniLM-L6-v2 via @xenova/transformers) as the browser runtime to generate embeddings
Builds a rich text representation of each item (title + subtitle + keywords + extended context from data files)
Outputs src/data/embeddings.json — array of { id: string, embedding: number[] }
Script is runnable via npm run generate-embeddings
No external API key required — model runs locally via Node.js
Embeddings file is committed to repo (static asset, not generated per-build)
Typecheck passes

US-002: Preload ONNX model during boot sequence

Description: As a visitor, I want the semantic search model to be ready by the time I reach the dashboard, without slowing down the initial experience.

Acceptance Criteria:

Model download (all-MiniLM-L6-v2 via @xenova/transformers) begins when App.tsx mounts (during 'boot' phase)
Download runs in background — does not block or affect boot/ECG/login animations
Model is cached in browser (IndexedDB) — second visit loads from cache instantly
A global ready state (React context or module-level promise) signals when model is available
If model fails to load (network error, etc.), the app continues normally — no error shown to user
Typecheck passes

US-003: Client-side cosine similarity search

Description: As a visitor, I want the command palette to understand what I mean, not just match strings.

Acceptance Criteria:

New src/lib/semantic-search.ts module with cosine similarity function
Loads embeddings.json and provides a semanticSearch(query: string, items: PaletteItem[]) function
Query embedding computed in-browser using all-MiniLM-L6-v2 ONNX model via @xenova/transformers
Returns ranked PaletteItem[] with similarity scores
Typecheck passes

US-004: Integrate semantic search into command palette

Description: As a visitor, I want the command palette to use semantic search with Fuse.js as a fallback.

Acceptance Criteria:

Command palette uses semantic search as primary ranking when embeddings are available
Falls back to Fuse.js if embeddings fail to load
Search latency remains <100ms for all queries
Existing keyboard navigation, grouping, and action routing unchanged
Typecheck passes
Verify in browser: search "data analysis" surfaces analytics-related roles/skills, not just items with "data" in the title

US-005: Enrich embedding content with deep context

Description: As a developer, I want embeddings to capture rich context beyond just titles, so semantic search is truly useful.

Acceptance Criteria:

Consultation embeddings include: role, org, duration, history narrative, examination bullets, coded entry descriptions
Skill embeddings include: name, category, frequency, proficiency, years
KPI embeddings include: value, label, explanation, story context/outcomes
Investigation embeddings include: name, methodology, tech stack, results
Education embeddings include: title, institution, type, research detail
Each item's embedding text is a natural-language paragraph, not a keyword list
Typecheck passes

Description: As a visitor, I see a floating chat button at the bottom-right of the dashboard that opens a chat panel.

Acceptance Criteria:

Floating circular button, bottom-right corner, consistent with design system (teal accent, shadow-md)
Button shows a chat/message icon (lucide-react)
Click toggles the chat panel open/closed
Button has a subtle entrance animation after dashboard loads (delayed ~1s)
Button respects prefers-reduced-motion
Button is above all dashboard content but below command palette overlay (z-index layering)
Typecheck passes
Verify in browser using dev server

US-007: Chat panel UI

Description: As a visitor, I want a chat panel that feels like a support chat — compact, positioned above the floating button.

Acceptance Criteria:

Panel opens above the chat button, anchored to bottom-right
Panel dimensions: ~380px wide, ~480px tall max, with scroll for overflow
Header with title ("Ask about Andy" or similar), close button
Message area showing conversation history (user messages right-aligned, AI responses left-aligned)
Input area at bottom with text field and send button
AI responses show: natural language answer paragraph, then clickable portfolio item cards below (hybrid format)
Clicking a portfolio item card triggers the same action routing as command palette (scroll, panel, link, etc.)
Panel entrance/exit animation (scale + fade, 200ms)
Respects prefers-reduced-motion
Responsive: on mobile (<640px), panel goes full-width with adjusted height
Typecheck passes
Verify in browser using dev server

US-008: Gemini Flash integration

Description: As a visitor, I can ask natural language questions and get intelligent answers about Andy's experience.

Acceptance Criteria:

API calls to Google Gemini Flash model
System prompt includes full CV context (structured from data files) so the model can answer accurately
API key sourced from environment variable VITE_GEMINI_API_KEY (exposed to client via Vite)
Responses are streamed token-by-token for perceived speed
Response format: JSON with { answer: string, relevantItems: string[] } where items are palette item IDs
If API key is missing or call fails, show a graceful fallback message ("Chat unavailable" or similar)
Loading state shown while waiting for response
Typecheck passes

US-009: Chat context and conversation history

Description: As a visitor, I want multi-turn conversation so I can ask follow-up questions.

Acceptance Criteria:

Conversation history maintained in component state (not persisted across page loads)
Previous messages included in Gemini API calls for context
History capped at last 10 messages to manage token usage
"Clear conversation" option available (button or typing /clear)
Typecheck passes

Functional Requirements

Phase 1

FR-1: Build script generates embeddings using all-MiniLM-L6-v2 ONNX model (same model used at runtime)
FR-2: Embeddings stored as committed static JSON (src/data/embeddings.json)
FR-3: Client-side cosine similarity ranks items by semantic relevance
FR-4: Command palette uses semantic search as primary, Fuse.js as fallback
FR-5: ONNX model preloaded during boot sequence (before dashboard renders)
FR-6: Query embedding computed in-browser — no runtime API calls

Phase 2

FR-7: Floating chat button rendered in DashboardLayout, bottom-right, above content
FR-8: Chat panel opens/closes on button click with animation
FR-9: User messages sent to Gemini Flash API with CV context as system prompt
FR-10: Gemini responses parsed into answer text + relevant item IDs
FR-11: Relevant items rendered as clickable cards using existing palette item styling and action routing
FR-12: Streaming responses displayed progressively
FR-13: Conversation state managed per-session (cleared on page reload)

Non-Goals

No server-side search infrastructure (everything client-side or direct API calls)
No persistent chat history across sessions
No user authentication or rate limiting (API key cost is accepted)
No voice input or speech-to-text
No training or fine-tuning of models
Chat widget does not replace the command palette — they coexist
No analytics or tracking of search queries

Design Considerations

Command Palette (Phase 1)

No visual changes to the command palette UI
Semantic search is a drop-in replacement for the ranking logic
Same grouped sections, icons, keyboard navigation, and action routing

Button: 48px circle, teal bg (var(--accent)), white icon, shadow-md. Hover: shadow-lg + slight scale
Panel: White surface, 12px border-radius, shadow-lg. Same card/border tokens as rest of design system
Messages: User messages in teal-tinted bubbles (right). AI messages in light gray bubbles (left) with font-ui
Item cards: Reuse icon/color mapping from command palette results. Compact horizontal layout
Typography: Body text 13px font-ui, timestamps 11px font-geist
Position: Fixed, bottom: 24px, right: 24px. Panel above button with 8px gap
Mobile: Button smaller (40px), panel full-width with bottom: 0, right: 0 and rounded top corners only

Existing components to reuse

iconByType and iconColorStyles mappings from CommandPalette.tsx
PaletteItem, PaletteAction types from src/lib/search.ts
buildPaletteData() for building the searchable dataset
handlePaletteAction() in DashboardLayout.tsx for action routing
Design tokens from index.css and tailwind.config.js

Technical Considerations

Phase 1: ONNX Model Strategy

Decision: all-MiniLM-L6-v2 via @xenova/transformers for both build-time and runtime embedding.

Same model used everywhere — embeddings live in the same vector space, so cosine similarity works correctly
No external API keys required for embedding generation or search
Build script runs the model in Node.js to pre-compute item embeddings
Browser loads the same model at runtime for query embedding

Preloading strategy:

Model download (~23MB, ONNX format) begins during the boot sequence phase ('boot' in App.tsx)
The boot → ECG → login flow takes 8-10s, giving ample time for download + cache
Model is cached by the browser (IndexedDB via Transformers.js) — subsequent visits load instantly
If model hasn't finished loading when user opens command palette, fall back to Fuse.js silently

Embedding content:

Each palette item gets a natural-language paragraph for embedding, not just keywords
E.g., a consultation becomes: "Senior Data Analyst at Norfolk and Waveney ICB, 2021 to present. Led medicines optimisation analytics, built Power BI dashboards for 200+ clinicians..."
Richer text = better semantic matching

Phase 2: Gemini Flash

Use gemini-2.0-flash (or latest) — fast, cheap, good for short-form Q&A
System prompt should be a structured summary of all CV data, not raw data dumps
Response schema enforced via Gemini's JSON mode or structured output
VITE_GEMINI_API_KEY exposed to client — acceptable for a portfolio (low traffic, low cost)
Consider a soft rate limit in the UI (e.g., 1 request per 2 seconds) to prevent abuse

Shared

Both features use buildPaletteData() as the canonical item dataset
Action routing through handlePaletteAction() is shared
DetailPanelContent union type supports all drill-down destinations

Success Metrics

Semantic search returns relevant results for intent-based queries (e.g., "healthcare leadership" surfaces ICB roles)
Command palette search latency stays <100ms
Chat widget responds within 2-3 seconds for typical questions
Chat answers are factually accurate to the CV content (no hallucinated roles or dates)
Both features degrade gracefully when APIs are unavailable

Open Questions

Chat widget on mobile: Should the chat panel be a full-screen modal on small screens, or a bottom sheet?
Suggested questions: Should the chat widget show 2-3 starter questions when first opened (e.g., "What's Andy's NHS experience?", "Tell me about his data skills")?
Model CDN: Transformers.js downloads models from Hugging Face by default. Should we self-host the ONNX model files for reliability, or trust HF's CDN?

13 KiB

Raw Blame History

PRD: Semantic Search & AI Chat

Introduction

Goals

User Stories

Phase 1: Semantic Vector Search

US-001: Generate embeddings at build time

US-002: Preload ONNX model during boot sequence

US-003: Client-side cosine similarity search

US-004: Integrate semantic search into command palette

US-005: Enrich embedding content with deep context

Phase 2: AI Chat Widget

US-006: Chat widget UI — floating button

US-007: Chat panel UI

US-008: Gemini Flash integration

US-009: Chat context and conversation history

Functional Requirements

Phase 1

Phase 2

Non-Goals

Design Considerations

Command Palette (Phase 1)

Chat Widget (Phase 2)

Existing components to reuse

Technical Considerations

Phase 1: ONNX Model Strategy

Phase 2: Gemini Flash

Shared

Success Metrics

Open Questions

13 KiB Raw Blame History

PRD: Semantic Search & AI Chat

Introduction

Goals

User Stories

Phase 1: Semantic Vector Search

US-001: Generate embeddings at build time

US-002: Preload ONNX model during boot sequence

US-003: Client-side cosine similarity search

US-004: Integrate semantic search into command palette

US-005: Enrich embedding content with deep context

Phase 2: AI Chat Widget

US-006: Chat widget UI — floating button

US-007: Chat panel UI

US-008: Gemini Flash integration

US-009: Chat context and conversation history

Functional Requirements

Phase 1

Phase 2

Non-Goals

Design Considerations

Command Palette (Phase 1)

Chat Widget (Phase 2)

Existing components to reuse

Technical Considerations

Phase 1: ONNX Model Strategy

Phase 2: Gemini Flash

Shared

Success Metrics

Open Questions

13 KiB

Raw Blame History