<- AI Systems Dossier
ENTRY NO. 01Hospitality / F&B AnalyticsLive deliveryRAG / Semantic SearchLIVE

TasteTrend Analytics

Identity · Retrieval intelligence

RAG-powered intelligence layer over 74k+ restaurant reviews across 200+ locations

RAG / Semantic Search
Problem

74k+ reviews accumulated across Google, TripAdvisor, and internal channels — no queryable layer.

System Flow
6 stages · click to inspect
->
->
->
->
->
Click any stage to inspect
Signature Module · Retrieval intelligence

Inspect retrieval

RAG / Semantic Search

Watch one question move through the retrieval stack: query, ranked chunks, citations, and grounded answer.

Retrieval Trace · System Output● live
QUERY"top service complaints Q4 downtown"
↓ embed · text-embedding-3-small· 1,536-dim · ~80ms
ANNFAISS IVF100 · top-20 candidates· 40ms
↓ BGE cross-encoder rerank· 180ms
TOP 5├─ Rev #47821 "Slow service at..." 0.94
├─ Rev #12903 "Staff was unresponsive..." 0.87
└─ Rev #29841 "Waited 40 min for..." 0.81
↓ GPT-4o · citation enforcement· ~1.4s
OUTAnswer grounded [¹][²][³] — 87% faithful

Business Impact

Outcomes
Before

Analysis time per cycle: 3–5 weeks

->
After

Analysis time per cycle: < 4 hours

The review corpus became a queryable operational asset. Analysis time dropped from multi-week sprints to sub-four-hour sessions. Every generated insight is traceable to a specific cited review. The team now runs ad-hoc queries — "What are the top service complaints in Q4 across downtown locations?" — and receives grounded answers in seconds.

52×
Faster review analysis
200+
Restaurant locations indexed
74k+
Reviews in active corpus

Engineering Evaluation

Nightly eval
81%
Recall@5
87%
Grounding Rate
95%
Dedup Coverage
84%
Query Answer Relevancy
Headline business result52×
Faster review analysis

Why This Is Hard

4 engineering challenges
Challenge · 01

Hallucination on multi-location queries

when a query spans ambiguous location signals, the LLM blended review content across sites. Solved by injecting explicit location metadata as context anchors in the system prompt and requiring per-citation location attribution in the output schema.

Challenge · 02

Near-duplicate review noise at scale

chain-level copy-paste reviews (same text across multiple franchise locations) inflated the corpus without adding information. MinHash LSH at a 0.85 Jaccard threshold, applied at ingestion, removed ~12% of raw corpus volume as near-duplicates.

Challenge · 03

Incremental index updates without full reindex

FAISS flat indexes do not support in-place deletion. Solved via an ID map append-and-compact strategy: new embeddings appended to a staging index, merged into main during off-peak compaction triggered when delta exceeds 5% of total.

Challenge · 04

Multilingual corpus (EN/HU/DE)

text-embedding-3-small outperformed multilingual-e5-large and mE5 on mixed-language retrieval recall in internal benchmarks on this specific corpus. Language detection is applied at ingestion to enable language-band filtering at retrieval time.

Engineering Depth

5 topics · click to expand

Two-stage retrieval: FAISS IVF100 flat index returns top-20 ANN candidates in ~40ms, followed by BGE-reranker-large cross-encoder scoring each candidate against the full query text. Top-5 advance to generation. The reranker adds ~180ms latency but improved Recall@5 by 11 pp over retrieve-only in internal A/B evaluation on 150 labelled queries. Metadata pre-filters on location/date run before ANN, reducing search space by up to 60% on scoped queries.

Key numbers
312kTotal indexed chunks
81%Recall@5
Built withRAGSemantic SearchFAISSGPT-4oAWS