*** This repository is a modification of the following one: ***
https://github.com/infolab-usc/SCAWG
- Added straightforward running parameters to easily generate synthetic/synthesis data. Please see the main.Entry for all parameters.
- Added support to other dataset format
- Real dataset should be put in
dataset/real/{name}folders. Please see each processor under org.geocrowd.synthesis for details. - The result is put in
dataset/{name}/taskanddataset/{name}/workerfor synthesis data. For synthetic data, the result is put indataset/{distribution}/taskanddataset/{distribution}/worker. - Please remember clean the result folder, so that the output of different runs will not mix together.