In your README, I see that you use the following classes:
one, two, three, four, five, front, back, left, right, stop, none
But the dataset page lists these classes:
yes, no, up, down, left, right, on, off, stop, go
Why is there a difference?