reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

LIBOL: A Library for Online Learning Algorithms

Authors: Steven C.H. Hoi, Jialei Wang, Peilin Zhao

JMLR 2014 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	To illustrate the online learning procedure, we take two data sets from the LIBSVM website, including one small data set svmguide3 with 1243 instances and one large data set ijcnn1 with 141,691 instances. In the following example, we use the default Perceptron algorithm to demo the usage of LIBOL for a binary classiﬁcation ( bc ) task: $ demo( bc , Perceptron , svmguide3 ) The results output by the above command are summarized as follows: Algorithm: mistake rate nb of updates cpu time (seconds) Perceptron 0.3318 +/0.0118 412.45 +/14.66 0.0516 +/0.0008.
Researcher Affiliation	Academia	Steven C.H. Hoi EMAIL Jialei Wang EMAIL Peilin Zhao EMAIL School of Computer Engineering Nanyang Technological University Singapore 639798
Pseudocode	Yes	Algorithm 1: LIBOL: Online Learning Framework for Linear Classiﬁcation. 1 Initialize: w1 = 0 2 for t = 1, 2, . . . , T do 3 The learner receives an incoming instance: xt X; 4 The learner predicts the class label: ˆyt = sgn(f(xt; wt)); 5 The true class label is revealed from the environment: yt Y; 6 The learner calculates the suﬀered loss: ℓ(wt; (xt, yt)); 7 if ℓ(wt; (xt, yt)) > 0 then 8 The learner updates the classiﬁcation model: 9 wt+1 wt + (wt; (xt, yt));
Open Source Code	Yes	The LIBOL software is available at http://libol.stevenhoi.org/.
Open Datasets	Yes	To illustrate the online learning procedure, we take two data sets from the LIBSVM website, including one small data set svmguide3 with 1243 instances and one large data set ijcnn1 with 141,691 instances.
Dataset Splits	No	The paper uses the svmguide3 and ijcnn1 datasets from the LIBSVM website but does not provide specific details on how these datasets were split into training, validation, or test sets, or if cross-validation was used. It describes an online learning process where data is processed sequentially, but not explicit splits for reproduction.
Hardware Specification	No	The paper mentions 'cpu time (seconds)' in its experimental results, but it does not provide any specific details about the CPU model, GPU, memory, or any other hardware components used for running the experiments.
Software Dependencies	No	The paper mentions 'MATLAB library' and 'C/C++ implementation for the core functions' and that 'The data formats used by this software are compatible with popular machine learning and data mining packages, such as LIBSVM, SVM-light, and WEKA, etc.'. However, it does not provide specific version numbers for MATLAB, C/C++ compilers, or any of the mentioned compatible packages.
Experiment Setup	No	The paper states that the `run_experiment` command 'will run side-by-side comparison of varied online learning algorithms on the given data set fully automatically, including all the parameter settings and selection.' However, it does not explicitly list these 'parameter settings and selection' (e.g., learning rates, epochs, specific algorithm hyperparameters) within the text.