Welcome to General-Bench Documentation

General-Level Scoring

  • Introduction of General-Level Scoring
    • Defining Levels Centered on Synergy
      • Level-1 Specialists
      • Level-2 Generalists of Unified Comprehension and/or Generation
      • Level-3 Generalists with Synergy in Comprehension and/or Generation
      • Level-4 Generalists with Synergy across Comprehension and Generation
      • Level-5 Generalists with Total Synergy across Comprehension, Generation, and Language

General-Bench

  • Dataset File Structure
    • Task Annotation Format
    • File Storage Structure
  • Dataset Category
  • Dataset Statistics
    • Statistics of the Skills
    • Image
    • Video
    • Audio
    • 3D
    • Language
  • Evaluation Metrics
    • Metric List
    • Mapping Functions of Scoring Metric

Evaluation Tutorial

  • Overview
    • Evaluation Tutorial Overview
  • Prepare Files
    • Step 1: Determine Your Target Data Category
    • Step 2: Download the Dataset
  • Prepare Models
    • Step 1: Prepare Your Model
    • Step 2: Make Predictions
  • Run Evaluation
    • Step 1: Install Dependencies
    • Step 2: Run the Evaluation Script
    • Step 3: Metrics Reference
  • Submit Results
    • Submit Your Evaluation Results

How to Contribute

  • New Dataset
    • Contribute a New Dataset
  • New Model
    • Contribute a New Model

FAQ

  • FAQ