ALL-Sort
improved retrieval augmented generation
Trelis Research
All-sort
(assisted large language sorting)
Text chunks
Database
--------
--------
--------
--------
Helper LLM (Smaug 34B)
Sorted Chunks
Chunk | Relevance |
Lorem ipsum | 5 |
Eval | 4 |
Total | 3 |
Text chunks
Rate Relevance
all-sort
Prompt
=
{ context = high relevance chunks }
+
{ Query }
Sorted Chunks
Chunk | Relevance |
Lorem ipsum | 5 |
Eval | 4 |
Total | 3 |
overview
costing
Assumptions:
Costing
| Full context | rag | All-sort (self) | All-sort (hosted) |
Prep | n/A | $0 | $0.05 | $0.05 |
Eval | $1.00 | $0.02 | $0.02 | $0.02 |
Total | $1.00 | $0.02 | $0.07 | $0.07 |
Latency (very unoptimised)
| Full context | rag | All-sort (self) | All-sort (hosted) |
Eval | 25 s | 40 s | 75 s | ? 30-40 s ? |