NUS-WIDE - Concepedia

TLDR

The paper introduces the NUS‑Wide web image dataset and outlines its use for studying web image annotation and retrieval, highlighting key characteristics and four research issues. The dataset contains 269,648 Flickr images with 5,018 tags, 81 concept ground‑truth labels, and six low‑level feature sets ranging from color histograms to SIFT‑based bag‑of‑words. Baseline experiments using k‑NN demonstrate that models trained on this large dataset can effectively annotate images and improve general image retrieval performance.

Abstract

This paper introduces a web image dataset created by NUS's Lab for Media Search. The dataset includes: (1) 269,648 images and the associated tags from Flickr, with a total of 5,018 unique tags; (2) six types of low-level features extracted from these images, including 64-D color histogram, 144-D color correlogram, 73-D edge direction histogram, 128-D wavelet texture, 225-D block-wise color moments extracted over 5x5 fixed grid partitions, and 500-D bag of words based on SIFT descriptions; and (3) ground-truth for 81 concepts that can be used for evaluation. Based on this dataset, we highlight characteristics of Web image collections and identify four research issues on web image annotation and retrieval. We also provide the baseline results for web image annotation by learning from the tags using the traditional k-NN algorithm. The benchmark results indicate that it is possible to learn effective models from sufficiently large image dataset to facilitate general image retrieval.

References

Page 1

	Year	Citations

Page 1