Heterogeneous Sufficient Dimension Reduction and Subspace Clustering
Authors: Lei Yan, Xin Zhang, Qing Mai
ICML 2025 | Venue PDF | Archive PDF | Plain Text | LLM Run Details
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | Through extensive simulation studies, mix PFC demonstrates superior performance compared to existing methods across various settings. Applications to real world datasets further highlight its effectiveness and practical advantages. 5. Numerical Results 5.1. Simulations 5.2. Real Data Analysis |
| Researcher Affiliation | Academia | 1Department of Statistics, Florida State University, Tallahassee, Florida, United States. Correspondence to: Xin Zhang <EMAIL>. |
| Pseudocode | Yes | Algorithm 1 Penalized EM algorithm for mixture PFC |
| Open Source Code | Yes | The code is available on Git Hub at https: //github.com/leiyan-ly/mixPFC. |
| Open Datasets | Yes | The Australian Institute of Sport (AIS) dataset, available in the R package dr, contains lean body mass data for 102 male and 100 female athletes. The Cancer Cell Line Encyclopedia (CCLE) dataset contains 8-point dose-response curves for 24 chemical compounds across over 400 cell lines, with 18,926 gene expression features for each cell line, accessible at https: //sites.broadinstitute.org/ccle. |
| Dataset Splits | Yes | The dataset is randomly partitioned into 80% training and 20% testing samples, with 100 repetitions. |
| Hardware Specification | No | The paper does not provide specific hardware details used for running its experiments. |
| Software Dependencies | No | The paper mentions the 'dr' R package but does not provide specific version numbers for any software dependencies. |
| Experiment Setup | Yes | The sample size is fixed at n = 200K with p = 1000, and for each simulation setting, 100 independent datasets are generated. To explore the influence of different covariance structures, we examine four configurations: 0.1Ip, Ip, AR(0.3), AR(0.5)... we set µw = 0, f(Y ) = (Y, |Y |)T , and πw = 1/K. ... tune λ with cross-validated distance correlation (Sz ekely et al., 2007). ... the number of clusters set to 3 and 5 for Nutlin-3 and AZD6244 when using mix PFC. |