Online PCA with Optimal Regret

Authors: Jiazhong Nie, Wojciech Kotlowski, Manfred K. Warmuth

JMLR 2016

Reproducibility Variable | Result | LLM Response
Research Type | Theoretical | We evaluate online algorithms in terms of their worst-case regret, which is a bound on the additional total loss of the online algorithm on all instance matrices over the compression loss of the best k-dimensional subspace (chosen in hindsight). We focus on two popular online algorithms for generalized PCA: the Gradient Descent (GD) and Matrix Exponentiated Gradient (MEG) algorithms. We show that if the regret is expressed as a function of the number of trials, then both algorithms are optimal to within a constant factor on worst-case sequences of positive definite instance matrices with trace norm at most one (which subsumes the original PCA problem with outer products). In this paper, we carefully studied two popular online algorithms for PCA: the Gradient Descent (GD) and Matrix Exponentiated Gradient (MEG) algorithms. For the case when the instance matrices are L1-bounded, we showed that both algorithms are optimal to within a constant factor when the worst-case regret is expressed as a function of the number of trials.
Researcher Affiliation | Academia | Jiazhong Nie (EMAIL), Department of Computer Science, University of California, Santa Cruz; Wojciech Kotlowski (EMAIL), Institute of Computing Science, Poznan University of Technology, Poland; Manfred K. Warmuth (EMAIL), Department of Computer Science, University of California, Santa Cruz
Pseudocode | No | The paper describes algorithms (GD, Loss MEG, Gain MEG) in text and mathematical formulas (e.g., the GD update: descent step Ŵ_{t+1} = W_t − η X_t, followed by the projection step W_{t+1} = argmin_{W ∈ W_m} ‖W − Ŵ_{t+1}‖²_F) rather than providing structured pseudocode blocks or algorithm listings.
Open Source Code | No | The paper does not contain any explicit statements about releasing source code, nor does it provide links to a code repository or mention code in supplementary materials for the methodology described.
Open Datasets | No | The paper is theoretical, analyzing algorithms and their regret bounds on abstract "instance matrices" and "loss vectors" (e.g., "L1-bounded instance matrices", "L∞-bounded instance matrices"). It does not describe experiments on specific real-world or benchmark datasets, and therefore provides no information about public dataset access.
Dataset Splits | No | The paper is theoretical and does not perform experiments on datasets, thus no information regarding dataset splits (e.g., training/test/validation splits) is provided.
Hardware Specification | No | The paper is theoretical and focuses on proving regret bounds and analyzing computational complexity (e.g., "O(n³) in general", "O(n²) computation"). It does not describe any experiments that would require specific hardware, and therefore no hardware specifications are provided.
Software Dependencies | No | The paper is theoretical and describes algorithms at a mathematical level. It does not provide implementation details or mention specific software dependencies with version numbers.
Experiment Setup | No | The paper is theoretical and presents mathematical analysis of online algorithms (GD, MEG) and their regret bounds. It does not describe any empirical experiments, and thus no experimental setup details, hyperparameters, or system-level training settings are provided.
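Since the paper provides no pseudocode or source code, the GD update it describes (a gradient descent step followed by a Frobenius-norm projection back onto the feasible set) can be sketched as follows. This is a minimal illustrative sketch, not code from the paper: it assumes the standard capped-density-matrix parameterization of online PCA, where the feasible set is {W : 0 ≼ W ≼ I, tr(W) = m}, so the Frobenius projection reduces to projecting the eigenvalues onto a capped simplex (implemented here by bisection). The function names are our own.

```python
import numpy as np

def project_capped_simplex(v, m):
    # Euclidean projection of v onto {x : 0 <= x_i <= 1, sum(x) = m},
    # via bisection on the dual shift theta, where x_i = clip(v_i - theta, 0, 1).
    lo, hi = v.min() - 1.0, v.max()
    for _ in range(100):
        theta = 0.5 * (lo + hi)
        if np.clip(v - theta, 0.0, 1.0).sum() > m:
            lo = theta  # sum too large: increase the shift
        else:
            hi = theta  # sum too small: decrease the shift
    return np.clip(v - 0.5 * (lo + hi), 0.0, 1.0)

def gd_pca_step(W, X, eta, m):
    # Descent step: W_hat = W - eta * X  (X is the symmetric instance matrix).
    W_hat = W - eta * X
    # Projection step: Frobenius projection onto {W : 0 <= W <= I, tr(W) = m},
    # which acts on the eigenvalues of the symmetrized iterate.
    lam, U = np.linalg.eigh(0.5 * (W_hat + W_hat.T))
    lam = project_capped_simplex(lam, m)
    return U @ np.diag(lam) @ U.T
```

After each step the iterate remains symmetric, has trace m, and keeps its eigenvalues in [0, 1], matching the constraint set assumed above.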