Heterogeneous Sufficient Dimension Reduction and Subspace Clustering

Authors: Lei Yan, Xin Zhang, Qing Mai

ICML 2025

Reproducibility Variable | Result | LLM Response
Research Type | Experimental | Through extensive simulation studies, mixPFC demonstrates superior performance compared to existing methods across various settings. Applications to real-world datasets further highlight its effectiveness and practical advantages. 5. Numerical Results; 5.1. Simulations; 5.2. Real Data Analysis
Researcher Affiliation | Academia | Department of Statistics, Florida State University, Tallahassee, Florida, United States. Correspondence to: Xin Zhang <EMAIL>.
Pseudocode | Yes | Algorithm 1: Penalized EM algorithm for mixture PFC
Open Source Code | Yes | The code is available on GitHub at https://github.com/leiyan-ly/mixPFC.
Open Datasets | Yes | The Australian Institute of Sport (AIS) dataset, available in the R package dr, contains lean body mass data for 102 male and 100 female athletes. The Cancer Cell Line Encyclopedia (CCLE) dataset contains 8-point dose-response curves for 24 chemical compounds across over 400 cell lines, with 18,926 gene expression features for each cell line, accessible at https://sites.broadinstitute.org/ccle.
Dataset Splits | Yes | The dataset is randomly partitioned into 80% training and 20% testing samples, with 100 repetitions.
Hardware Specification | No | The paper does not provide specific hardware details used for running its experiments.
Software Dependencies | No | The paper mentions the 'dr' R package but does not provide specific version numbers for any software dependencies.
Experiment Setup | Yes | The sample size is fixed at n = 200K with p = 1000, and for each simulation setting, 100 independent datasets are generated. To explore the influence of different covariance structures, we examine four configurations: 0.1I_p, I_p, AR(0.3), AR(0.5)... we set µ_w = 0, f(Y) = (Y, |Y|)^T, and π_w = 1/K. ... tune λ with cross-validated distance correlation (Székely et al., 2007). ... the number of clusters set to 3 and 5 for Nutlin-3 and AZD6244 when using mixPFC.
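
The covariance configurations and the repeated 80%/20% split quoted above can be sketched in Python as follows. This is a minimal illustration and not the authors' code: the function names are hypothetical, AR(ρ) is assumed to denote the covariance with entries ρ^|i-j|, and the seed and rounding rule for the training size are arbitrary choices.

```python
import numpy as np


def ar_covariance(p, rho):
    """Assumed meaning of AR(rho): Sigma[i, j] = rho ** |i - j|."""
    idx = np.arange(p)
    return rho ** np.abs(idx[:, None] - idx[None, :])


def covariance_configs(p):
    """The four covariance structures named in the experiment setup."""
    return {
        "0.1I_p": 0.1 * np.eye(p),
        "I_p": np.eye(p),
        "AR(0.3)": ar_covariance(p, 0.3),
        "AR(0.5)": ar_covariance(p, 0.5),
    }


def repeated_splits(n, n_reps=100, train_frac=0.8, seed=0):
    """Yield (train_idx, test_idx) index arrays for repeated random splits.

    The seed and the rounding of the training size are assumptions;
    the summary only states an 80%/20% partition with 100 repetitions.
    """
    rng = np.random.default_rng(seed)
    n_train = int(round(train_frac * n))
    for _ in range(n_reps):
        perm = rng.permutation(n)
        yield perm[:n_train], perm[n_train:]
```

For example, `covariance_configs(1000)` would give the four p = 1000 matrices, and `repeated_splits(n)` can be iterated over to fit on the training indices and evaluate on the held-out indices in each repetition.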