论文标题
基于距离的图像分类:生成分类的难题的解决方案?
Distance Based Image Classification: A solution to generative classification's conundrum?
论文作者
论文摘要
大多数分类器都依靠歧视界限,这些边界将每个类别的实例与其他所有内容区分开。我们认为,歧视界限是违反直觉的,因为它们是通过什么来定义语义的。并应由生成分类器代替,这些分类器将其定义为他们的语义。不幸的是,生成分类器的准确性明显较小。这可能是由于生成模型倾向于专注于易于建模语义生成因素而忽略重要但难以建模的非语义因素的趋势。我们提出了一个新的生成模型,其中通过实例特定的噪声项来容纳Shell理论的层次生成过程和非语义因素的语义因素。我们使用该模型来开发一种分类方案,该方案在保留语义提示的同时抑制了噪声的影响。结果是一个令人惊讶的精确生成分类器,它采用了修改后的最近邻居算法的形式。我们认为IT距离分类。与判别性分类器不同,距离分类器:通过什么样的人来定义语义;适合增量更新;并按照类的数量很好地缩放。
Most classifiers rely on discriminative boundaries that separate instances of each class from everything else. We argue that discriminative boundaries are counter-intuitive as they define semantics by what-they-are-not; and should be replaced by generative classifiers which define semantics by what-they-are. Unfortunately, generative classifiers are significantly less accurate. This may be caused by the tendency of generative models to focus on easy to model semantic generative factors and ignore non-semantic factors that are important but difficult to model. We propose a new generative model in which semantic factors are accommodated by shell theory's hierarchical generative process and non-semantic factors by an instance specific noise term. We use the model to develop a classification scheme which suppresses the impact of noise while preserving semantic cues. The result is a surprisingly accurate generative classifier, that takes the form of a modified nearest-neighbor algorithm; we term it distance classification. Unlike discriminative classifiers, a distance classifier: defines semantics by what-they-are; is amenable to incremental updates; and scales well with the number of classes.