Standard datasets can no longer be used for benchmarking against LLMs since they...

		47282847 on Aug 9, 2024 \| parent \| context \| favorite \| on: Show HN: LLM-aided OCR – Correcting Tesseract OCR ... Standard datasets can no longer be used for benchmarking against LLMs since they have already been fed into it and are thus too well-known to compare to lesser known documents.