Oops, I Sampled it Again: Reinterpreting Confidence Intervals in Few-Shot Learning

Raphael Lafargue; Luke Anthony Smith; Franck Vermet; Mathias Löwe; Ian Reid; Vincent Gripon; Jack Valmadre

Article Dans Une Revue Transactions on Machine Learning Research Journal Année : 2024

Oops, I Sampled it Again: Reinterpreting Confidence Intervals in Few-Shot Learning

Oups, j'ai échantillonné avec remise : Réinterprétation des intervalles de confiance dans l'apprentissage parcimonieux

(1, 2, 3) , (2) , (4) , (5) , (6, 2) , (3, 1) , (2)

1
2
3
4
5
6

Raphael Lafargue

Fonction : Auteur
PersonId : 1146579
ORCID : 0000-0003-4385-5749

Equipe Better Representations for Artificial Intelligence

University of Adelaide

Département Mathematical and Electrical Engineering

Luke Anthony Smith

Fonction : Auteur
PersonId : 1416713

University of Adelaide

Franck Vermet

Fonction : Auteur
PersonId : 874245
IdHAL : franck-vermet
ORCID : 0000-0003-3816-5401

Laboratoire de Mathématiques de Bretagne Atlantique

Mathias Löwe

Fonction : Auteur
PersonId : 1416714

Westfälische Wilhelms-Universität Münster = University of Münster

Ian Reid

Fonction : Auteur
PersonId : 1416715

Mohamed bin Zayed University of Artificial Intelligence

University of Adelaide

Vincent Gripon

Fonction : Auteur
PersonId : 21307
IdHAL : vincent-gripon
ORCID : 0000-0002-4353-4542
IdRef : 16122203X

Département Mathematical and Electrical Engineering

Equipe Better Representations for Artificial Intelligence

Jack Valmadre

Fonction : Auteur
PersonId : 1416716

University of Adelaide

Résumé

The predominant method for computing confidence intervals (CI) in few-shot learning (FSL) is based on sampling the tasks with replacement, i.e. allowing the same samples to appear in multiple tasks. This makes the CI misleading in that it takes into account the randomness of the sampler but not the data itself. To quantify the extent of this problem, we conduct a comparative analysis between CIs computed with and without replacement. These reveal a notable underestimation by the predominant method. This observation calls for a reevaluation of how we interpret confidence intervals and the resulting conclusions in FSL comparative studies. Our research demonstrates that the use of paired tests can partially address this issue. Additionally, we explore methods to further reduce the (size of the) CI by strategically sampling tasks of a specific size. We also introduce a new optimized benchmark, which can be accessed at https://github.com/RafLaf/FSL-benchmark-again.

La méthode prédominante pour calculer les intervalles de confiance (IC) dans l'apprentissage parcimonieux (FSL) repose sur l'échantillonnage des tâches avec remise, c'est-à-dire en permettant aux mêmes échantillons d'apparaître dans plusieurs tâches. Cela rend les IC trompeurs, car ils prennent en compte l'aléatoire de l'échantillonnage des tâches mais pas celui des données elles-mêmes. Pour quantifier l'ampleur de ce problème, nous menons une analyse comparative entre des IC calculés avec et sans remise. Celle-ci révèle une sous-estimation notable par la méthode prédominante. Cette observation appelle à une réévaluation de notre interprétation des intervalles de confiance et des conclusions qui en découlent dans les études comparatives en FSL. Nos recherches montrent que l'utilisation de tests appariés peut en partie résoudre ce problème. De plus, nous explorons des méthodes pour réduire davantage la taille des IC en dimensionnant stratégiquement les tâches. Nous introduisons également un nouveau benchmark optimisé, accessible à l'adresse https://github.com/RafLaf/FSL-benchmark-again.

Mots clés

Few-shot Learning Statistics Confidence Intervals

Domaines

Intelligence artificielle [cs.AI]

Fichier sous embargo

0	―	3	―	8
Année		Mois		Jours

Avant la publication
lundi 31 mars 2025

Raphael Lafargue : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04702128

Soumis le : lundi 14 octobre 2024-11:51:56

Dernière modification le : jeudi 17 octobre 2024-03:25:59

Dates et versions

hal-04702128 , version 1 (14-10-2024)

Licence

Paternité

Identifiants

HAL Id : hal-04702128 , version 1
ARXIV : 2409.02850

Citer

Raphael Lafargue, Luke Anthony Smith, Franck Vermet, Mathias Löwe, Ian Reid, et al.. Oops, I Sampled it Again: Reinterpreting Confidence Intervals in Few-Shot Learning. Transactions on Machine Learning Research Journal, 2024, https://openreview.net/forum?id=JxxkKt9yrx. ⟨hal-04702128⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-BREST CNRS INSMI LAB-STICC_UBO LMBA UBS ENIB LAB-STICC CHL IMT-ATLANTIQUE IBNM ANR LAB-STICC_BRAIN INSTITUT-MINES-TELECOM

48 Consultations

9 Téléchargements

Oops, I Sampled it Again: Reinterpreting Confidence Intervals in Few-Shot Learning

Oups, j'ai échantillonné avec remise : Réinterprétation des intervalles de confiance dans l'apprentissage parcimonieux

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager