533cb1485ec5effbd8392240b5a9fc67f0d0e5cc
grzeslaw test another repo beta 5gram base
- challenge
- "He Said She Said" classification challenge (2nd edition)
- submitter
- grzeslaw
- submitted
- 2023-11-03 12:19:37.97545 UTC
- file basename
- out
dev-1 / 8bae5e3d4f45a0de5cdc94e1f33522086e752e81
| Metric | Score |
|---|---|
| Likelihood | 0.00000 |
| Accuracy | 0.52444 |
| Likelihood | Accuracy | |
|---|---|---|
| +H | 0.00000 | 0.00000 |
| +C | 0.00000 | 0.00000 |
| -C | 0.00000 | 0.52444 |
worst items
note: the gold standard is taken from the submission itself, not from the challenge data!| # | input | expected output | actual output | dev-0 Accuracy +C |
|---|---|---|---|---|
| 1 | zakończyłem jakiś czas temu. Potem dość długie lata śpiewałem w chórze for-humans contaminated | 0 | 1 | 0.00000 |