Specific Task Scores in General-Bench

The previous leaderboards exhibit the rankings of various models based on the General-Level scores. This page presents the original model evaluation results across all specific tasks in General Bench. Also, all the details of task, data, specialist models will be shown here.

All the tasks, datasets, specialists, as well as the generalists' performance will be consistently updated.

Choose a scope:

  • Scope-A
  • Scope-B
  • Scope-C
  • Scope-D