タイトル & 超要約:eコマース検索を賢く!LLMでカテゴリ分けするスゴ技!
ギャル的キラキラポイント✨
● ユーザーの検索ワードを、商品の種類にピッタリ分類するんだって!✨検索がマジで捗る予感! ● なんでそのカテゴリになったのか、理由まで教えてくれるの!賢すぎ!😲 ● 大規模サイト(Amazonとか)でも、サクサク動くように工夫されてるみたい!🚀
詳細解説
背景
続きは「らくらく論文」アプリで
Search in e-Commerce is powered at the core by a structured representation of the inventory, often formulated as a category taxonomy. An important capability in e-Commerce with hierarchical taxonomies is to select a set of relevant leaf categories that are semantically aligned with a given user query. In this scope, we address a fundamental problem of search query categorization in real-world e-Commerce taxonomies. A correct categorization of a query not only provides a way to zoom into the correct inventory space, but opens the door to multiple intent understanding capabilities for a query. A practical and accurate solution to this problem has many applications in e-commerce, including constraining retrieved items and improving the relevance of the search results. For this task, we explore a novel Chain-of-Thought (CoT) paradigm that combines simple tree-search with LLM semantic scoring. Assessing its classification performance on human-judged query-category pairs, relevance tests, and LLM-based reference methods, we find that the CoT approach performs better than a benchmark that uses embedding-based query category predictions. We show how the CoT approach can detect problems within a hierarchical taxonomy. Finally, we also propose LLM-based approaches for query-categorization of the same spirit, but which scale better at the range of millions of queries.