Publication | Closed Access
Design and analysis of a content-based pathology image retrieval system
126
Citations
22
References
2003
Year
Wavelet CoefficientsEngineeringImage RetrievalDigital PathologyContent SimilarityDiagnosisPathologyImage DatabaseImage SearchImage AnalysisInformation RetrievalData SciencePattern RecognitionPathological Image AnalysisBiostatisticsRadiologyMedical ImagingVector Dot ProductHistopathologyImage SimilarityMedical Image ComputingMedicineHealth InformaticsContent-based Image Retrieval
Retrieval accuracy in pathology image databases varies by disease category, depending on training sample size and feature set effectiveness. The system is a client/server prototype that accesses supercomputing resources to retrieve microscopic pathology images and annotations from a networked database, matching query images using color, texture, Fourier, and wavelet features combined via a vector dot‑product distance metric, which was validated by agglomerative clustering informed by domain knowledge. The algorithm’s distance values correlate with pathological significance and visual similarity, outperforming traditional statistical tests in small‑sample settings, and multidimensional scaling reveals a low‑dimensional embedded space for the test set.
A prototype, content-based image retrieval system has been built employing a client/server architecture to access supercomputing power from the physician's desktop. The system retrieves images and their associated annotations from a networked microscopic pathology image database based on content similarity to user supplied query images. Similarity is evaluated based on four image feature types: color histogram, image texture, Fourier coefficients, and wavelet coefficients, using the vector dot product as a distance metric. Current retrieval accuracy varies across pathological categories depending on the number of available training samples and the effectiveness of the feature set. The distance measure of the search algorithm was validated by agglomerative cluster analysis in light of the medical domain knowledge. Results show a correlation between pathological significance and the image document distance value generated by the computer algorithm. This correlation agrees with observed visual similarity. This validation method has an advantage over traditional statistical evaluation methods when sample size is small and where domain knowledge is important. A multi-dimensional scaling analysis shows a low dimensionality nature of the embedded space for the current test set.
| Year | Citations | |
|---|---|---|
Page 1
Page 1