Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

>their embedding model is not SOTA, does not even outperform the open ones out there, and reranking is a dead end in 2025.

Are you referring to the MTEB leaderboard? It's widely believed many of those test datasets are considered during the training of most open-source text embedding models, hence why you see novel + private benchmarks discussed in many launch blogs that don't exclusively refer to MTEB. There are problems there, and it would be great to see more folks in the search benchmark dataset production space like what Marqo AI has done in recent months.

Also what makes you say reranking is dead? Mongo doesn't provide it out of the box but many other search providers like ES, Pinecone, Opensearch do so it must provide some value to their customers? Maybe you're saying it's overrated in terms of how many apps actually need it?

disclosure: I work on vector search at Mongo



>Maybe you're saying it's overrated in terms of how many apps actually need it?

Yes, my comment leans more towards that, rather than suggesting is useless.


Taking a step back, accuracy/quality of retrieval is critical as input to anything generated b/c your generated output is only as good as your input. And right now folks are struggling to adopt generative use cases due to risk and fear of how to control outputs. Therefore I think this could be bigger than you think.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: