Links
|
Fast Wavelet-Based Visual Classification
Guoshen Yu, Jean-Jacques Slotine
Summary: We
investigate a biologically motivated approach to fast visual classification,
directly inspired by the recent work
[Serre et al 07]. Specifically, trading off biological accuracy for
computational efficiency, we explore using standard wavelet transforms and
patch transforms to parallel the tuning of visual cortex V1 and V4 cells,
alternated with max operations
References:
Outline:
Algorithm overview: 4 steps (S1, C1, S2, C2)
S1: Wavelet Transform
C1: Local Maximum and Subsampling
C2: Global Maximum
Attention Focusing
Object Classification (object vs background)
Object Recognition with Attention Focusing
Accuracy without/with attention focusing: 74% / 98%
Texture Classification (111-class Brodatz Database, only 10 are shown here)
Accuracy: 87.8%. The state-of-the-art texture classification algorithm in [Lazebnik et al 05] achieves 88.2%.
Satellite Image Classification (4-class, multi-resolultion: forest, urban areas, rural areas, sea.)
Accuracy: 100%. (Images provided by CNES.)
Alphabet Classification (8-class: Arabic, Chinese, English, Greek, Hebrew, Japanese, Korean, Russian.)
Accuracy: 100%.
Sound Classification (5-class)
Accuracy: 100%.
|