reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

mvlearn: Multiview Machine Learning in Python

Authors: Ronan Perry, Gavin Mischler, Richard Guo, Theodore Lee, Alexander Chang, Arman Koul, Cameron Franz, Hugo Richard, Iain Carmichael, Pierre Ablin, Alexandre Gramfort, Joshua T. Vogelstein

JMLR 2021 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	mvlearn has been tested on Linux, Mac, and PC platforms, and adheres to strong code quality principles. Continuous integration ensures compatibility with past versions, PEP8 style guidelines keep the source code clean, and unit tests provide over 95% code coverage at the time of release. Additionally, mvlearn preprocessing tools can be used to generate multiple views from a single original data matrix, expanding the use-cases of multiview methods and potentially improving results over typical single-view methods with the same data.
Researcher Affiliation	Academia	1 Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21218; 9 Department of Electrical Engineering, Columbia University, New York, NY 10027; 5 Universit e Paris-Saclay, Inria, Palaiseau, France; 6 Department of Statistics, University of Washington, Seattle, WA 98195; 7 CNRS and DMA, Ecole Normale Sup erieure, PSL University, Paris, France.
Pseudocode	No	No explicit pseudocode or algorithm blocks are present in the paper. The paper describes the algorithms conceptually and lists them in Table 1, but does not provide structured, step-by-step procedures in pseudocode format.
Open Source Code	Yes	mvlearn is a Python library which implements the leading multiview machine learning methods. ... The package can be installed from Python Package Index (Py PI) and the conda package manager and is released under the MIT open-source license. The documentation, detailed examples, and all releases are available at https://mvlearn.github.io/.
Open Datasets	Yes	A synthetic multiview data generator as well as dataloaders for the Multiple Features Data Set (Breukelen et al., 1998) in the UCI repository (Dua and Graﬀ, 2017) and the genomics Nutrimouse data set (Martin et al., 2007) are included.
Dataset Splits	No	The paper introduces a software library and provides data loaders for existing datasets, but it does not describe specific training/test/validation dataset splits used for experiments within this paper.
Hardware Specification	No	The paper states 'mvlearn has been tested on Linux, Mac, and PC platforms', but does not provide any specific hardware details such as GPU/CPU models, processor types, or memory used for testing or development.
Software Dependencies	No	The paper mentions 'scikit-learn' for API design and 'matplotlib' and 'seaborn' for plotting, and installation via 'Python Package Index (Py PI)' and 'conda package manager', but does not specify version numbers for any software dependencies.
Experiment Setup	No	The paper describes a software library and its functionalities, but it does not detail specific experimental setup parameters such as hyperparameters, learning rates, batch sizes, or training configurations for a particular model.