reproducibilityindex.ai

Notice: The reproducibility variables underlying each score are classified using an automated LLM-based pipeline, validated against a manually labeled dataset. LLM-based classification introduces uncertainty and potential bias; scores should be interpreted as estimates. Full accuracy metrics and methodology are described in [1].

DuMo: Dual Encoder Modulation Network for Precise Concept Erasure

Authors: Feng Han, Kai Chen, Chao Gong, Zhipeng Wei, Jingjing Chen, Yu-Gang Jiang

AAAI 2025 | Venue PDF | LLM Run Details

Reproducibility Variable	Result	LLM Response
Research Type	Experimental	Our method achieves state-of-the-art performance on Explicit Content Erasure (detecting only 34 nude parts), Cartoon Concept Removal (with an average LPIPSda of 0.428, 0.113 higher than SOTA at 0.315), and Artistic Style Erasure (with an average LPIPSda of 0.387, 0.088 higher than SOTA at 0.299), clearly outperforming alternative methods. ... We conduct a variety of experiments on SD v1.4 to demonstrate the effectiveness of our method. Specifically, we compare ours with 10 baseline methods, including ESD (Gandikota et al. 2023), UCE (Gandikota et al. 2024), SLD-Med (Schramowski et al. 2023), SA (Heng and Soh 2024), CA (Kumari et al. 2023), SDD (Kim et al. 2023), RECE (Gong et al. 2024) MACE (Lu et al. 2024), and SPM (Lyu et al. 2024). Following SPM (Lyu et al. 2024), we perform three tasks: explicit content erasure (Sec. 4.1), cartoon concept removal (Sec. 4.2), and artistic style erasure (Sec. 4.3), to evaluate these methods. Besides, we conduct the ablation study (Sec. 4.4) to verify the effects of each component.
Researcher Affiliation	Academia	Feng Han1,2, Kai Chen1,2, Chao Gong1,2, Zhipeng Wei1,2, Jingjing Chen1,2*, Yu-Gang Jiang1,2 1Shanghai Key Lab of Intell. Info. Processing, School of Computer Science, Fudan University 2Shanghai Collaborative Innovation Center on Intelligent Visual Computing EMAIL, EMAIL
Pseudocode	No	The paper describes the methodology using prose and mathematical equations but does not contain a clearly labeled pseudocode block or algorithm.
Open Source Code	Yes	Code https://github.com/Maplebb/Du Mo
Open Datasets	Yes	Following ESD (Gandikota et al. 2023), we utilize the Inappropriate Image Prompts (I2P) dataset (Schramowski et al. 2023), which contains 4,703 toxic text prompts... With regards to the generation of non-target concepts, we employ COCO-30K, the validation set of MS-COCO (Lin et al. 2014) dataset.
Dataset Splits	Yes	With regards to the generation of non-target concepts, we employ COCO-30K, the validation set of MS-COCO (Lin et al. 2014) dataset.
Hardware Specification	No	The paper does not provide specific hardware details (e.g., GPU/CPU models, processor types, or memory amounts) used for running its experiments.
Software Dependencies	No	The paper does not specify any software dependencies with version numbers (e.g., Python, PyTorch, CUDA versions) needed to replicate the experiments.
Experiment Setup	Yes	We set the threshold of the Nudenet to 0.6 to detect naked parts of images on I2P... Where ... η represents the erasure strength. ... L = Lera2 + λLpre where λ is the preserve scale of unrelated concepts. ... We equally split those 1000 denoising steps into 20 groups, and each group owns 13 parameters corresponding to the number of skip connection layers. ... We conduct a variety of experiments on SD v1.4 ... For each template, 5 images are generated with the seed 2024. ... The modulation factors in TLMO are initialized to 1