When it comes to the evals for this kind of thing, is there a standard set of te...

		urbandw311er 81 days ago \| parent \| context \| favorite \| on: So you wanna build a local RAG? When it comes to the evals for this kind of thing, is there a standard set of test data out there that one can work with to benchmark against? ie a collection of documents with questions that should result in particular documents or chunks being cited as the most relevant match.

Yes check out haiku-rag benchmarks and evaluations