FlagEval and SuperCLUE are both popular model benchmarks. Here is how they compare.
FlagEval alternatives · SuperCLUE alternatives · 中文版