533cb1485ec5effbd8392240b5a9fc67f0d0e5cc
grzeslaw test another repo beta 5gram base

 

challenge
"He Said She Said" classification challenge (2nd edition)
submitter
grzeslaw
submitted
2023-11-03 12:19:37.97545 UTC
file basename
out

test-A / edf547e34173bb859c12966fa8d99fecd6fc6a64
Metric Score
Likelihood 0.00000
Accuracy 0.52032
Likelihood Accuracy
+H 0.00000 0.54375
+C 0.00000 0.56109
-C 0.00000 0.51904

dev-1 / 8bae5e3d4f45a0de5cdc94e1f33522086e752e81
Metric Score
Likelihood 0.00000
Accuracy 0.52444
Likelihood Accuracy
+H 0.00000 0.00000
+C 0.00000 0.00000
-C 0.00000 0.52444

worst items

note: the gold standard is taken from the submission itself, not from the challenge data!
# input expected output actual output dev-0 Likelihood +H
1 zakończyłem jakiś czas temu. Potem dość długie lata śpiewałem w chórze for-humans contaminated 0 1 0.00000

dev-0 / 6d0af27a735077c1893486e429aa45d675f30bbd
Metric Score
Likelihood 0.00000
Accuracy 0.52244
Likelihood Accuracy
+H 0.00000 0.50000
+C 1.00000 1.00000
-C 0.00000 0.52244

Compare with other submission