GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation
Authors: Hongyin Zhang, Pengxiang Ding, Shangke Lyu, Ying Peng, Donglin Wang
ICLR 2025
| Reproducibility Variable | Result | LLM Response |
|---|---|---|
| Research Type | Experimental | In this section, we evaluate the state generation and visual manipulation capabilities of GEVRM. To this end, our experiments aim to investigate the following questions: 1) Does GEVRM have strong generalization ability to generate expressive goals in various environments? 2) Does GEVRM exhibit a higher success rate in executing robot tasks than the baselines in various environments? 3) How important are the core components of GEVRM for achieving robust action decisions? |
| Researcher Affiliation | Academia | Hongyin Zhang (1,2), Pengxiang Ding (1,2), Shangke Lyu (2), Ying Peng (2), Donglin Wang (2). 1: Zhejiang University. 2: Westlake University. |
| Pseudocode | Yes | Algorithm 1 GEVRM: Test-time Execution |
| Open Source Code | No | The paper does not contain a clear, affirmative statement about releasing the source code for GEVRM, nor does it provide a specific link to a code repository. It mentions 'open-source video generative models' in the context of baselines but not for its own method. |
| Open Datasets | Yes | We utilized two types of datasets (realistic Bridge (Walke et al., 2023) and simulated CALVIN (Mees et al., 2022b)) to evaluate the generalization of goal generation. |
| Dataset Splits | Yes | We study zero-shot multi-environment training on A, B, and C, and testing on D, varying in table texture, furniture positioning, and color patches. |
| Hardware Specification | No | The paper does not explicitly state the specific hardware (e.g., GPU models, CPU types) used for training or inference of the GEVRM model. It mentions a 'UR5' robotic arm for real-world tasks, but this is the robot being controlled, not the computational hardware. |
| Software Dependencies | No | The paper mentions several models and algorithms like T5, Rectified Flow, DDPM, and ResNet34, along with their respective research papers. However, it does not specify software dependencies with version numbers, such as Python versions or library versions (e.g., PyTorch 1.9). |
| Experiment Setup | Yes | The hyperparameters are shown in Appendix Tab. 8 and Tab. 9. The policy training hyperparameters are shown in Appendix Tab. 10. |