<?xml version="1.0" encoding="UTF-8"?>
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/ http://www.openarchives.org/OAI/2.0/OAI-PMH.xsd">
  <responseDate>2018-01-15T18:41:19Z</responseDate>
  <request identifier="oai:HAL:hal-00664462v1" verb="GetRecord" metadataPrefix="oai_dc">http://api.archives-ouvertes.fr/oai/hal/</request>
  <GetRecord>
    <record>
      <header>
        <identifier>oai:HAL:hal-00664462v1</identifier>
        <datestamp>2018-01-12</datestamp>
        <setSpec>type:ART</setSpec>
        <setSpec>subject:info</setSpec>
        <setSpec>subject:stat</setSpec>
        <setSpec>collection:CNRS</setSpec>
        <setSpec>collection:I3S</setSpec>
        <setSpec>collection:UNICE</setSpec>
        <setSpec>collection:UNIV-AG</setSpec>
        <setSpec>collection:BNRMI</setSpec>
        <setSpec>collection:CEREGMIA</setSpec>
        <setSpec>collection:UCA-TEST</setSpec>
        <setSpec>collection:UNIV-COTEDAZUR</setSpec>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:publisher>HAL CCSD</dc:publisher>
          <dc:title xml:lang="en">Leveraging k-NN for generic classification boosting</dc:title>
          <dc:creator>Piro, Paolo</dc:creator>
          <dc:creator>Nock, Richard</dc:creator>
          <dc:creator>Nielsen, Frank</dc:creator>
          <dc:creator>Barlaud, Michel</dc:creator>
          <dc:contributor>Laboratoire d'Informatique, Signaux, et Systèmes de Sophia-Antipolis (I3S) / Equipe IMAGES-CREATIVE ; Signal, Images et Systèmes (SIS) ; Laboratoire d'Informatique, Signaux, et Systèmes de Sophia Antipolis (I3S) ; Université Nice Sophia Antipolis (UNS) ; Université Côte d'Azur (UCA) - Université Côte d'Azur (UCA) - Centre National de la Recherche Scientifique (CNRS) - Université Nice Sophia Antipolis (UNS) ; Université Côte d'Azur (UCA) - Université Côte d'Azur (UCA) - Centre National de la Recherche Scientifique (CNRS) - Laboratoire d'Informatique, Signaux, et Systèmes de Sophia Antipolis (I3S) ; Université Nice Sophia Antipolis (UNS) ; Université Côte d'Azur (UCA) - Université Côte d'Azur (UCA) - Centre National de la Recherche Scientifique (CNRS) - Université Nice Sophia Antipolis (UNS) ; Université Côte d'Azur (UCA) - Université Côte d'Azur (UCA) - Centre National de la Recherche Scientifique (CNRS)</dc:contributor>
          <dc:contributor>Centre de Recherche en Economie, Gestion, Modélisation et Informatique Appliquée (CEREGMIA) ; Université des Antilles et de la Guyane (UAG)</dc:contributor>
          <dc:contributor>Sony Corporation ; Sony Corporation</dc:contributor>
          <dc:description>International audience</dc:description>
          <dc:source>ISSN: 0925-2312</dc:source>
          <dc:source>Neurocomputing</dc:source>
          <dc:publisher>Elsevier</dc:publisher>
          <dc:identifier>hal-00664462</dc:identifier>
          <dc:identifier>https://hal.inria.fr/hal-00664462</dc:identifier>
          <dc:source>https://hal.inria.fr/hal-00664462</dc:source>
          <dc:source>Neurocomputing, Elsevier, 2012, 80, pp.3-9. 〈http://www.sciencedirect.com/science/article/pii/S0925231211005984〉</dc:source>
          <dc:source>http://www.sciencedirect.com/science/article/pii/S0925231211005984</dc:source>
          <dc:language>en</dc:language>
          <dc:subject xml:lang="en">kNN</dc:subject>
          <dc:subject xml:lang="en">Boosting</dc:subject>
          <dc:subject xml:lang="en">Machine Learning</dc:subject>
          <dc:subject xml:lang="en">classification</dc:subject>
          <dc:subject>[INFO.INFO-TI] Computer Science [cs]/Image Processing</dc:subject>
          <dc:subject>[STAT.ML] Statistics [stat]/Machine Learning [stat.ML]</dc:subject>
          <dc:type>info:eu-repo/semantics/article</dc:type>
          <dc:type>Journal articles</dc:type>
          <dc:description xml:lang="en">Voting rules relying on k-nearest neighbors (k-NN) are an effective tool in countless machine learning techniques. Thanks to its simplicity, k-NN classification is very attractive to practitioners, as it achieves very good performance in many practical applications. However, it suffers from various drawbacks, such as sensitivity to "noisy" instances and poor generalization properties when dealing with sparse high-dimensional data. In this paper, we tackle the k-NN classification problem at its core by providing a novel k-NN boosting approach. Namely, we propose a supervised learning algorithm, called Universal Nearest Neighbors (UNN), that induces a leveraged k-NN rule by globally minimizing a surrogate risk upper bounding the empirical misclassification rate over the training data. Interestingly, this surrogate risk can be arbitrarily chosen from a class of Bregman loss functions, including the familiar exponential, logistic and squared losses. Furthermore, we show that UNN makes it possible to efficiently filter a dataset of instances, keeping only a small fraction of the data. Experimental results on the synthetic Ripley dataset show that such a filtering strategy rejects "noisy" examples and yields a classification error close to the optimal Bayes error. Experiments on standard UCI datasets show significant improvements over the current state of the art.</dc:description>
          <dc:date>2012-03</dc:date>
        </oai_dc:dc>
      </metadata>
    </record>
  </GetRecord>
</OAI-PMH>
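A record like the one above can be retrieved programmatically from the OAI-PMH endpoint named in the <request> element. The following is a minimal sketch using only the Python standard library; the endpoint URL, verb, identifier, and metadataPrefix are copied verbatim from the response, while variable names and the printed fields are just illustrative choices.

# Fetch this GetRecord response from the HAL OAI-PMH endpoint and pull out
# a few Dublin Core fields. Endpoint and parameters come from the record above.
from urllib.parse import urlencode
from urllib.request import urlopen
import xml.etree.ElementTree as ET

BASE_URL = "http://api.archives-ouvertes.fr/oai/hal/"
params = {
    "verb": "GetRecord",
    "identifier": "oai:HAL:hal-00664462v1",
    "metadataPrefix": "oai_dc",
}

with urlopen(BASE_URL + "?" + urlencode(params)) as resp:
    root = ET.fromstring(resp.read())

# Namespaces used by OAI-PMH responses and the oai_dc metadata format.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}
title = root.find(".//dc:title", NS)
creators = [c.text for c in root.findall(".//dc:creator", NS)]
print(title.text if title is not None else None)
print(creators)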
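The abstract's "leveraged k-NN rule" is a k-NN vote in which each training example j carries a leveraging coefficient alpha_j, so the prediction is the sign of the sum of alpha_j * y_j over the k nearest neighbors of the query. The sketch below illustrates only that voting form, with placeholder uniform coefficients; it is not the UNN fitting procedure itself, which learns the coefficients by minimizing the chosen Bregman surrogate risk, and all names in it are hypothetical.

# Hypothetical illustration of a leveraged k-NN voting rule for binary
# labels in {-1, +1}. The coefficients alpha would be learned by UNN;
# here they are placeholders set to 1, which reduces to plain k-NN voting.
import numpy as np

def leveraged_knn_predict(X_train, y_train, alpha, x, k=5):
    # Rank training examples by Euclidean distance to the query point.
    d = np.linalg.norm(X_train - x, axis=1)
    nn = np.argsort(d)[:k]
    # Leveraged vote: each neighbor j contributes alpha[j] * y[j].
    return np.sign(np.sum(alpha[nn] * y_train[nn]))

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 2))
y = np.sign(X[:, 0] + rng.normal(scale=0.1, size=100))
alpha = np.ones(100)  # placeholder leveraging coefficients
print(leveraged_knn_predict(X, y, alpha, np.array([0.5, 0.0])))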