danielsgriffin's comments

> Information retrieval is a field founded on inescapable uncertainty. We cannot fully model the meaning of the documents, because language is fundamentally ambiguous. We cannot fully comprehend the user’s notion of relevance, because that notion is defined with respect to the entire cognitive state of the person, which changes as they use the system. When we receive a query from a user, it is a poor representation of the user’s need or goal, and systems are forced to guess where to look. Industry systems make extensive use of behavioral observations, which are uncertain in their implications. So even though we have many tools, we cannot actually perform matching of documents to information needs with absolute certainty.


This discusses tradeoffs, optimizations, evaluations, proxying, etc.

I also appreciate a comment from Eric Drummond, which lists out what this type of embedding search might support: "There are 16 kinds of search-y things in Quora (web UI)..."


Someone on Twitter just tried asking Bard [are you familiar with a paper called "A Short History of Searching"] and Bard responded by linking Claude Shannon to a fabrication. https://twitter.com/nunohipolito/status/1710063374145343511

