Cluster-based Collection Selection for Information Retrieval - Bertold Van Voorst - 書籍 - LAP LAMBERT Academic Publishing - 9783844318852 - 2011年3月11日
カバー画像とタイトルが一致しない場合、正しいのはタイトルです

Cluster-based Collection Selection for Information Retrieval

価格
€ 40,49
税抜

遠隔倉庫からの取り寄せ

発送予定日 年12月29日 - 2026年1月8日
クリスマスプレゼントは1月31日まで返品可能です
iMusicのウィッシュリストに追加

The focus of this research is collection selection for distributed information retrieval. The collection descriptions that are necessary for selecting the most relevant collections are often created from information gathered by random sampling. Collection selection based on an incomplete index constructed by using random sampling instead of a full index leads to inferior results. We propose to use collection clustering to compensate for the incompleteness of the indexes. When collection clustering is used we do not only select the collections that are considered relevant based on their collection descriptions, but also collections that have similar content in their indexes. We describe a new clustering algorithm that allows us to specify the sizes of the produced clusters instead of the number of clusters. Our experiments show that that collection clustering can indeed improve the performance of distributed information retrieval systems that use random sampling. There is not much difference in retrieval performance between our clustering algorithm and the well-known k-means algorithm. We suggest to use the algorithm we proposed because it is more scalable.

メディア 書籍     Paperback Book   (ソフトカバーで背表紙を接着した本)
リリース済み 2011年3月11日
ISBN13 9783844318852
出版社 LAP LAMBERT Academic Publishing
ページ数 84
寸法 226 × 5 × 150 mm   ·   143 g
言語 ドイツ語