| A | B | C | D | E | F | G | H | I | J | |
|---|---|---|---|---|---|---|---|---|---|---|
1 | ||||||||||
2 | Rank | Team | JudgeLM Score | Reference-based metrics | ||||||
3 | rougeL (%) | bleu (%) | bertscore (%) | gen_len | novelty (%) | |||||
4 | 1 | CODEOFCONDUCT run1 | 2465.5 | 8.2 | 1.5 | 66.4 | 67.5 | 86.8 | ||
5 | 2 | CODEOFCONDUCT run3 | 2382.5 | 10.4 | 2.2 | 67.5 | 69.1 | 87.5 | ||
6 | 3 | CODEOFCONDUCT run2 | 2371.0 | 9.8 | 2.2 | 67.0 | 66.2 | 87.1 | ||
7 | 4 | MilaNLP run1 | 2242.5 | 10.7 | 1.0 | 69.0 | 44.6 | 87.8 | ||
8 | 5 | NLP@IIMAS run2 | 2086.0 | 8.9 | 0.6 | 67.7 | 34.6 | 87.5 | ||
9 | 6 | HuaweiTSC run2 | 1881.5 | 17.7 | 5.6 | 72.4 | 34.5 | 86.8 | ||
10 | 7 | HuaweiTSC run3 | 1722.0 | 23.3 | 10.5 | 74.2 | 32.1 | 86.5 | ||
11 | 8 | ground truth | 1534.5 | 100.0 | 100.0 | 100.0 | 26.5 | 85.3 | ||
12 | 9 | HuaweiTSC run1 | 1484.5 | 18.3 | 6.3 | 72.1 | 30.2 | 87.2 | ||
13 | 10 | TrenTeam run2 | 1394.5 | 32.8 | 20.9 | 77.1 | 27.5 | 85.7 | ||
14 | 11 | TrenTeam run1 | 1364.5 | 33.8 | 22.4 | 77.6 | 28.2 | 85.2 | ||
15 | 12 | Hyderabadi Pearls run2 | 1322.0 | 27.6 | 15.5 | 75.5 | 27.8 | 85.3 | ||
16 | 13 | TrenTeam run3 | 1246.0 | 31.7 | 18.2 | 76.6 | 24.0 | 85.9 | ||
17 | 14 | SemanticCUETSync run1 | 1194.0 | 26.5 | 15.4 | 75.1 | 26.0 | 85.4 | ||
18 | 15 | Northeastern Uni run2 | 1158.0 | 27.6 | 13.5 | 75.7 | 24.5 | 83.4 | ||
19 | 16 | Northeastern Uni run3 | 1145.0 | 30.9 | 17.6 | 76.2 | 29.6 | 85.2 | ||
20 | 17 | Northeastern Uni run1 | 1107.5 | 25.6 | 13.3 | 74.6 | 24.8 | 84.3 | ||
21 | 18 | Hyderabadi Pearls run3 | 1023.5 | 29.2 | 17.4 | 75.5 | 26.2 | 85.6 | ||
22 | 19 | Hyderabadi Pearls run1 | 1011.5 | 29.2 | 17.4 | 75.5 | 26.2 | 85.6 | ||
23 | 20 | counterspeech go run1 | 904.0 | 31.8 | 15.6 | 76.7 | 18.0 | 84.9 | ||
24 | 21 | counterspeech go run3 | 855.5 | 31.6 | 15.3 | 76.5 | 17.7 | 85.1 | ||
25 | 22 | counterspeech go run2 | 837.0 | 32.4 | 15.8 | 77.1 | 18.0 | 85.1 | ||
26 | 23 | NLP@IIMAS run1 | 720.5 | 29.2 | 17.6 | 74.9 | 24.9 | 86.0 | ||
27 | 24 | NLP@IIMAS run3 | 720.0 | 29.2 | 17.6 | 74.9 | 24.9 | 86.0 | ||
28 | 25 | MilaNLP run2 | 430.0 | 18.5 | 6.9 | 70.4 | 50.5 | 87.4 | ||
29 | 26 | MilaNLP run3 | 422.5 | 17.9 | 6.8 | 70.7 | 72.8 | 88.3 | ||
30 | 27 | bhavanark run1 | 74.0 | 5.5 | 0.5 | 61.7 | 32.4 | 88.7 | ||