Unified Binary and Multiclass Margin-Based Classification
Authors: Yutong Wang, Clayton Scott
JMLR 2024
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Theoretical | In this work, we show that a broad range of multiclass loss functions, including many popular ones, can be expressed in the relative margin form, a generalization of the margin form of binary losses. The relative margin form is broadly useful for understanding and analyzing multiclass losses as shown by our prior work (Wang and Scott, 2020, 2021). To further demonstrate the utility of this way of expressing multiclass losses, we use it to extend the seminal result of Bartlett et al. (2006) on classification-calibration of binary margin losses to multiclass. We then analyze the class of Fenchel-Young losses, and expand the set of these losses that are known to be classification-calibrated. Keywords: Classification, loss functions, consistency, margins, label encodings |
| Researcher Affiliation | Academia | Yutong Wang (1,2) and Clayton Scott (1,3): (1) Department of Electrical Engineering and Computer Science, (2) Michigan Institute for Data Science, (3) Department of Statistics, University of Michigan, Ann Arbor, MI 48109, USA |
| Pseudocode | No | The paper presents theoretical concepts, mathematical proofs (e.g., Theorem 2.5, Theorem 4.7, Theorem 5.10), and detailed derivations. It contains no explicitly labeled pseudocode blocks, algorithm listings, or code-style step-by-step procedures. |
| Open Source Code | No | The paper does not contain any explicit statements about making code available, nor does it provide links to any code repositories for the methodology described. |
| Open Datasets | No | The paper is theoretical and focuses on mathematical properties of loss functions. It discusses 'training data' and 'joint distribution P over X × [k]' in an abstract sense for theoretical analysis but does not mention any specific public datasets, provide links, or citations to datasets used in empirical experiments. |
| Dataset Splits | No | The paper is theoretical and does not conduct experiments on specific datasets. Therefore, there is no discussion or specification of training/test/validation dataset splits. |
| Hardware Specification | No | The paper presents theoretical research, focusing on mathematical analysis and proofs. It does not describe any experiments that would require specific hardware, and thus, no hardware specifications are mentioned. |
| Software Dependencies | No | This paper is theoretical and does not involve empirical experiments. Therefore, it does not list any software dependencies or versions required to reproduce experimental results. |
| Experiment Setup | No | The paper is theoretical, presenting mathematical analysis, proofs, and characterizations of loss functions. It does not describe any experimental setup, specific hyperparameter values, or training configurations. |
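As context for the abstract quoted above: the paper's central notion is that many multiclass losses can be written in "relative margin" form, i.e., as a function of the score differences relative to the true class. The sketch below is our own illustration of this idea (the paper itself releases no code); it shows the standard cross-entropy loss rewritten so that it depends on the scores only through the relative margins `z_i - z_y`.

```python
import numpy as np

def cross_entropy(scores, y):
    """Multiclass cross-entropy loss for score vector `scores` and
    true class index `y`, in the usual log-sum-exp form."""
    shifted = scores - scores.max()  # shift for numerical stability
    return np.log(np.exp(shifted).sum()) - shifted[y]

def cross_entropy_relative_margin(scores, y):
    """The same loss written in relative-margin form: it is a function
    of the differences scores[i] - scores[y] for i != y only."""
    rel = np.delete(scores - scores[y], y)  # relative margins z_i - z_y
    return np.log1p(np.exp(rel).sum())

# The two expressions agree for every choice of true class.
scores = np.array([1.0, -0.5, 2.0])
for y in range(len(scores)):
    assert np.isclose(cross_entropy(scores, y),
                      cross_entropy_relative_margin(scores, y))
```

This mirrors the binary case, where a margin loss depends on the product of the label and the score; the relative-margin form is the multiclass generalization the abstract refers to.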