@@ -190,10 +190,10 @@ We have tried to resolve any conflicts in the *best* possible manner.
190190 We excluded the ` DIM ` -sets as they turn out to be too easy
191191 for most algorithms.
192192
193- 5 . [ ` uci ` ] ( catalog /uci.md) -
193+ 5 . [ ` uci ` ] ( catalogue /uci.md) -
194194 a selection of datasets available at the University of California, Irvine,
195195 [ Machine Learning Repository] ( http://archive.ics.uci.edu/ml/ )
196- (Dua and Graff, 2018 )
196+ (Dua and Graff, 2019 )
197197
198198 Some of these datasets in this selection were considered
199199 for benchmark purposes
@@ -203,14 +203,14 @@ We have tried to resolve any conflicts in the *best* possible manner.
203203
2042046 . [ ` wut ` ] ( catalogue/wut.md ) -
205205 authored by the fantastic students
206- of Marek's [ Python for Data Analysis course] ( http://www.gagolewski.com/teaching/padpy/ ) @
207- [ Warsaw University of Technology] ( https://ww4.mini.pw.edu.pl/ ) :
206+ of Marek Gagolewski 's Python for Data Analysis course at
207+ Warsaw University of Technology:
208208 Przemysław Kosewski, Jędrzej Krauze, Eliza Kaczorek, Anna Gierlak,
209209 Adam Wawrzyniak, Aleksander Truszczyński, Mateusz Kobyłka and Michał Maciąg.
210210
211211
2122127 . [ ` g2mg ` ] ( catalogue/g2mg.md ) -
213- a modified version of the SIPU ` G2 ` -sets with variances
213+ a modified version of ` G2 ` -sets from SIPU with variances
214214 dependent on datasets' dimensionalities, i.e., s* np.sqrt(d/2),
215215 which makes these problems more difficult.
216216
@@ -278,40 +278,43 @@ We have tried to resolve any conflicts in the *best* possible manner.
278278| 43 | sipu/s4 | 5000| 2|
279279| 44 | sipu/spiral | 312| 2|
280280| 45 | sipu/unbalance | 6500| 2|
281- | 46 | uci/ecoli | 336| 7|
282- | 47 | uci/glass | 214| 9|
283- | 48 | uci/ionosphere | 351| 34|
284- | 49 | uci/sonar | 208| 60|
285- | 50 | uci/statlog | 2310| 19|
286- | 51 | uci/wdbc | 569| 30|
287- | 52 | uci/wine | 178| 13|
288- | 53 | uci/yeast | 1484| 8|
289- | 54 | wut/circles | 4000| 2|
290- | 55 | wut/cross | 2000| 2|
291- | 56 | wut/graph | 2500| 2|
292- | 57 | wut/isolation | 9000| 2|
293- | 58 | wut/labirynth | 3546| 2|
294- | 59 | wut/mk1 | 300| 2|
295- | 60 | wut/mk2 | 1000| 2|
296- | 61 | wut/mk3 | 600| 3|
297- | 62 | wut/mk4 | 1500| 3|
298- | 63 | wut/olympic | 5000| 2|
299- | 64 | wut/smile | 1000| 2|
300- | 65 | wut/stripes | 5000| 2|
301- | 66 | wut/trajectories | 10000| 2|
302- | 67 | wut/trapped_lovers | 5000| 3|
303- | 68 | wut/twosplashes | 400| 2|
304- | 69 | wut/windows | 2977| 2|
305- | 70 | wut/x1 | 120| 2|
306- | 71 | wut/x2 | 120| 2|
307- | 72 | wut/x3 | 185| 2|
308- | 73 | wut/z1 | 192| 2|
309- | 74 | wut/z2 | 900| 2|
310- | 75 | wut/z3 | 1000| 2|
311-
312-
313-
314- We recommend that ` h2mg ` sets should be studied separately
281+ | 46 | sipu/worms_2 | 105600| 2|
282+ | 47 | sipu/worms_64 | 105000| 64|
283+ | 48 | uci/ecoli | 336| 7|
284+ | 49 | uci/glass | 214| 9|
285+ | 50 | uci/ionosphere | 351| 34|
286+ | 51 | uci/sonar | 208| 60|
287+ | 52 | uci/statlog | 2310| 19|
288+ | 53 | uci/wdbc | 569| 30|
289+ | 54 | uci/wine | 178| 13|
290+ | 55 | uci/yeast | 1484| 8|
291+ | 56 | wut/circles | 4000| 2|
292+ | 57 | wut/cross | 2000| 2|
293+ | 58 | wut/graph | 2500| 2|
294+ | 59 | wut/isolation | 9000| 2|
295+ | 60 | wut/labirynth | 3546| 2|
296+ | 61 | wut/mk1 | 300| 2|
297+ | 62 | wut/mk2 | 1000| 2|
298+ | 63 | wut/mk3 | 600| 3|
299+ | 64 | wut/mk4 | 1500| 3|
300+ | 65 | wut/olympic | 5000| 2|
301+ | 66 | wut/smile | 1000| 2|
302+ | 67 | wut/stripes | 5000| 2|
303+ | 68 | wut/trajectories | 10000| 2|
304+ | 69 | wut/trapped_lovers | 5000| 3|
305+ | 70 | wut/twosplashes | 400| 2|
306+ | 71 | wut/windows | 2977| 2|
307+ | 72 | wut/x1 | 120| 2|
308+ | 73 | wut/x2 | 120| 2|
309+ | 74 | wut/x3 | 185| 2|
310+ | 75 | wut/z1 | 192| 2|
311+ | 76 | wut/z2 | 900| 2|
312+ | 77 | wut/z3 | 1000| 2|
313+
314+
315+
316+
317+ We recommend that the ` h2mg ` sets should be studied separately
315318(there are too many of them -- they can easily overshadow the
316319above ones).
317320
@@ -336,7 +339,7 @@ above ones).
336339| 72 | h2mg/h2mg_128_90 | 2048| 128|
337340
338341
339- We recommend that ` g2mg ` sets should be studied separately as well .
342+ The ` g2mg ` sets should be studied separately too .
340343
341344
342345| | dataset | n| d|
0 commit comments