2026 Evaluation System

The 2026 evaluation system is accessible here.

Please use a non-commercial (institutional or organizational) email address when registering for an evaluation account. Commercial email providers such as Gmail, QQ Mail, and similar services will not be approved for submissions.

To create an account:

    1. Click “Sign Up” in the upper-right corner of the evaluation system.
    2. Register using your email address and create a password.
    3. Complete your team information and submit the registration request.
    4. Verify your email address through the confirmation email you receive.
    5. After verification, you will be able to log in to the system.

Please note that submission privileges require administrator approval. Approval typically takes 24–48 hours.

Once your account has been approved, the “Add” button under the corresponding track on the Submissions page will become available, allowing you to submit results. Submission access may be enabled for different tracks at different times, so the Add button may appear only for selected tracks.

Teams may submit results to either the Public or General leaderboard:

    • Public Leaderboard
      • Results are publicly visible and may be included in the workshop summary paper.
      • To be eligible for challenge awards and prizes, teams must not use private data to train their models.
      • Teams must also submit their code, trained models, and any annotations created from the provided training data to the organizers before the challenge concludes.
    • General Leaderboard
      • Includes all Public leaderboard submissions as well as submissions that do not meet Public leaderboard requirements.

The following submission limits apply:

    • Maximum 5 submissions per track per day (Pacific Time).
    • Maximum 20 submissions per track throughout the challenge.
    • Limits apply collectively across both Public and General leaderboards.

Evaluation and leaderboard policy:

    • Results displayed during the competition are computed on a 50% subset of the test data.
    • After the competition deadline, final scores will be automatically recomputed using the full test set.
    • During the competition, the leaderboard displays only the top three teams and your team’s current ranking (if not in the top three).
    • After the challenge concludes, the full leaderboard will be published, including team names. Please choose a descriptive team name when registering.

Note: The 2021-2025 evaluation server remains available. Please refer to the individual page (PAST CHALLENGES>[Year] CHALLENGE>Evaluation System) for the corresponding evaluation server link.