reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

Beyond Confidence: Exploiting Homogeneous Pattern for Semi-Supervised Semantic Segmentation

Authors: Rui Sun, Huayu Mai, Wangkai Li, Yujia Chen, Naisong Luo, Yuan Wang, Tianzhu Zhang

ICML 2025 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Extensive experimental results on challenging benchmarks show that integrating Ag Score into existing semi-supervised segmentation frameworks yields consistent improvements. Also, sections like '5. Experiments' and tables (Table 1, Table 2, Table 3, Table 4) detailing quantitative results on various datasets confirm empirical studies.
Researcher Affiliation	Academia	1Shenzhen International Graduate School, Tsinghua University 2Mo E Key Laboratory of Brain-inspired Intelligent Perception and Cognition, University of Science and Technology of China 3National Key Laboratory of Deep Space Exploration, Deep Space Exploration Laboratory. Correspondence to: Tianzhu Zhang <EMAIL>. All listed affiliations are academic institutions or affiliated public research labs, and the email domain `ustc.edu.cn` is academic.
Pseudocode	Yes	Algorithm 1 Agent Construction. Inputs: Pixel Embeddings F RB HW C, Prediction P RB HW , #Positive Agents N, #Negative Agents M Output: Positive Agents Ap RN C, Negative Agents An RM C and Algorithm 2 Orthogonal Selection Strategy. Inputs: Pixel Embedding F RM C, #Agents N Output: Agents A RN C are provided in Appendix A.
Open Source Code	No	The paper does not contain an explicit statement about releasing the source code for the described methodology, nor does it provide a direct link to a code repository. Phrases like 'We release our code' or links to platforms like GitHub are absent.
Open Datasets	Yes	Datasets: (1) PASCAL VOC 2012 (Everingham et al., 2010) is an object-centric semantic segmentation dataset... (2) Cityscapes (Cordts et al., 2016) is an urban scene understanding dataset... (3) COCO (Lin et al., 2014), composed of 118k/5k training/validation images...
Dataset Splits	Yes	We report m Io U (%) under various partition protocols... 1/16 (92) 1/8 (183) 1/4 (366) 1/2 (732) Full (1464) (Table 1). Similar partition protocols are mentioned for other datasets in Tables 2, 3, and 4.
Hardware Specification	Yes	The model is trained for 80 epochs on PASCAL and 240 epochs on Cityscapes with a batch size of 8, using 8 RTX 3090 GPUs (memory is 24G/GPU).
Software Dependencies	No	The paper mentions using Res Net-50/101 as the backbone and Deep Labv3+ as the decoder, and stochastic gradient descent (SGD) optimizer. However, it does not provide specific version numbers for any software, libraries, or frameworks used, which are required for reproducibility.
Experiment Setup	Yes	The crop size is set as 513 × 513 for PASCAL and 801 × 801 for Cityscapes, respectively. We adopt stochastic gradient descent (SGD) optimizer with an initial learning rate of 0.001 for PASCAL and 0.005 for Cityscapes. Polynomial Decay learning rate policy is applied throughout the whole training... We set the number of positive agents N = 64 and the number of negative agents M = 256 for all experiments. The model is trained for 80 epochs on PASCAL and 240 epochs on Cityscapes with a batch size of 8...