Artifact Evaluation

Call for Artifacts

Authors of accepted CGO 2020 papers are invited to formally submit their supporting materials to the Artifact Evaluation (AE) process. The artifact evaluation committee attempts to reproduce (at least the main) experiments and assesses if submitted artifacts support the claims made in the paper. The submission is voluntary and does not influence the final decision regarding the paper acceptance.

We invite every author of an accepted CGO paper to consider submitting an artifact. At CGO we follow ACM’s artifact reviewing and badging policy. ACM describes a research artifact as follows:

By “artifact” we mean a digital object that was either created by the authors to be used as part of the study or generated by the experiment itself. For example, artifacts can be software systems, scripts used to run experiments, input datasets, raw data collected in the experiment, or scripts used to analyse results.

Artifact Evaluation Chairs

Bastian Hagedorn, University of Münster
Michel Steuwer, University of Glasgow
Michael Laurenzano, University of Michigan/Clinc

Deadlines

15 November 2019	Artifact Submission
25 November – 6 December 2019	Artifact Evaluation Technical Clarification Period
13 December 2019	Artifact Evaluation Notification

Submission

Submissions are made via the artifact evaluation submission website: https://cgo20ae.hotcrp.com/

Authors submit:

the paper that has been accepted at CGO, extended with
an appendix providing a link to and describing the artifact.
We recommend to use this AE appendix template from ctuning.org where you can also find a detailed description what information to provide.

For the artifact itself, we encourage the use of container or VM technologies like Docker, Singularity, Virtual Box or Vagrant to package the artifact in one stand-alone container or VM which provides all required dependencies. Giving AE reviewers remote access to your machines with preinstalled (proprietary) software is also possible.

If you have an unusual experimental setup which requires specific hardware (i.e., custom hardware, oscilloscopes for measurements …) or proprietary software please contact the artifact evaluation chairs before the submission.

There are more tips preparing a submission available on the ctuning website.

Evaluation Process

Each submitted artifact is evaluated by at least two members of the artifact evaluation committee.

During the process authors and evaluators are allowed to anonymously communicate with each other to overcome technical difficulties.
Ideally, we hope to see all submitted artifacts to successfully pass artifact evaluation.

The evaluators are asked to evaluate the artifact based on the following criteria, that are defined by ACM.

Is the artifact functional?

Package complete? All components relevant to evaluation are included in the package?
Well documented? Enough to understand, install and evaluate artifact?
Exercisable? Includes scripts and/or software to perform appropriate experiments and generate results?
Consistent? Artifacts are relevant to the associated paper and contribute in some inherent way to the generation of its main results?

The artifacts associated with the paper will receive an “Artifacts Evaluated - Functional” badge only if they are found to be documented, consistent, complete, exercisable, and include appropriate evidence of verification and validation.

Is the artifact customizable and reusable?

Can this artifact and experimental workflow be easily reused and customized?
For example, can it be used on a different platform, with different benchmarks, data sets, compilers, tools, under different conditions and parameters, etc.?

The artifacts associated with the paper will receive an “Artifact Evaluated - Reusable” badge only if they are of a quality that significantly exceeds minimal functionality. That is, they have all the qualities of the Artifacts Evaluated - Functional level, but, in addition, they are very carefully documented and well-structured to the extent that reuse and repurposing are facilitated. In particular, norms and standards of the research community for artifacts of this type are strictly adhered to.

Have the results been validated?

Can all main results from the paper be validated using provided artifacts?
Evaluators are asked to report any unexpected artifact behavior (depends on the type of artifact such as unexpected output, scalability issues, crashes, performance variation, etc).

The artifacts associated with the paper will receive a “Results replicated” badge only if the main results of the paper have been obtained in a subsequent study by a person or team other than the authors, using, in part, artifacts provided by the author. Note that variation of empirical and numerical results is tolerated. In fact it is often unavoidable in computer systems research - see “how to report and compare empirical results?” in AE FAQ on ctuning.org!

Based on the results the following badges are awarded.

Badges

ACM recommends awarding three different type of badges to communicate how the artifact has been evaluated.
A single paper can receive up to three badges — one badge of each type.

The green Artifacts Available badge indicates that an artifact is publicly accessible in an archival repository. For this badge to be awarded the paper does not have to be independently evaluated. ACM requires that a qualified archival repository is used, for example Zenodo, figshare, Dryad. Personal webpages, GitHub repositories or alike are not sufficient as it can be changed after the submission deadline!

		The red Artifacts Evaluated badges indicate that a research artifact has been successfully completed an independent audit. A reviewer has verified that the artifact is documented, complete, consistent, and exercisable.
		The lighter red Artifacts Evaluated — Functional badge indicates a basic level of functionality. The darker red Artifacts Evaluated — Reusable badge indicates a higher quality artifact which significantly exceeds minimal functionality so that reuse and repurposing is facilitated.

		The blue Results Validated badges indicate that the main results of the paper have been successfully obtained by an independent reviewer.
		The lighter blue Results Replicated badge indicates that the main results of the paper have been successfully obtained using the provided artifact. The darker blue Results Reproduced badge indicates that the main results of the paper have been independently obtained without using the author-provided research artifact.

At CGO the artifact evaluation committee awards for each successfully evaluated paper one of the two red Artifacts Evaluated badges as well as the lighter blue Results Replicated badge. We do not award the darker blue Results Reproduced badge in this artifact evaluation process. The green Artifact Available badge does not require the formal audit and, therefore, is awarded directly by the publisher — if the authors provide a link to the deposited artifact.