b94f0b3c03b0652f94d15d6dba47e8d127b49c71
s444498 prod test another repo algo

 

challenge
"He Said She Said" classification challenge (2nd edition)
submitter
[anonymized]
submitted
2023-11-03 12:15:41.829443 UTC
file basename
out

test-A / 0a1087b71a5ac387ac9b7123aa90700bc9d0f2a9
Metric Score
Likelihood 0.00000
Accuracy 0.51464
Likelihood Accuracy
+H 0.00000 0.47250
+C 0.00000 0.50000
-C 0.00000 0.51510

dev-1 / 3e8de864fd04a33cb986589d89ae888d47fb489c
Metric Score
Likelihood 0.00000
Accuracy 0.51665
Likelihood Accuracy
+H 0.00000 0.00000
+C 0.00000 0.00000
-C 0.00000 0.51665

dev-0 / 77a487ba232f2c111e6436b3082d3dc643442e2d
Metric Score
Likelihood 0.00000
Accuracy 0.51690
Likelihood Accuracy
+H 0.00000 0.51000
+C 0.00000 0.00000
-C 0.00000 0.51691

worst items

note: the gold standard is taken from the submission itself, not from the challenge data!
# input expected output actual output dev-1 Accuracy +C
1 Cierpiałem na straszne lagi – kilkanaście sekund lub dłużej czarnego ekranu przy próbie przełączenia się / uruchomienia prawie każdej aplika… 1 0 0.00000

Compare with other submission