Data sets are currently spread across strangely named folders, with no reference to where the data points came from, etc. We need:

- [ ] proper folder names,
- [ ] a separate `README.md` file for each folder,
- [ ] references for the origin of each data point.