Hi there,
I´m trying to read several compressed files to train Gensim´s word2vec. So far I´ve been able to do it, but as soon as I try to get more than a dozen or so per batch I get this error:
File "/usr/local/lib/python2.7/dist-packages/bz2file.py", line 214, in _fill_buffer
raise EOFError("Compressed file ended before the "
EOFError: Compressed file ended before the end-of-stream marker was reached
Is there any line I can add to my script to avoid this error, skip it or tweak the buffer if possible to make it all work? Thanks