Commit 8cd0187
committed
recount3 sub-package updated
API-breaking changes:
* The output data frames produced by the 'dgd_get_recount3_data' executable now contain both gene expression data and metadata, unless otherwise filtered (see below).
Other changes:
* Now the 'experiment_attributes' column, if present in the metadata columns of a SRA study, will be split into its constituent components when writing the output data frames for the 'dgd_get_recount3_data' executable (as it is already the case with the 'sample_attributes' column).
* The user can now pass a YAML file to 'dgd_get_recount3_data' to download data from the Recount3 platform in bulk and filter them.
* The user can now pass 'metadata_to_keep' and 'metadata_to_drop' lists of metadata columns in the input file to 'dgd_get_recount3_data' to keep or drop specific metadata columns in the output data frames. These can be passed both as columns, if the input file is a CSV file, or as specific keywords, if the input file is a YAML file.
* The 'recount3.util.get_metadata' function now returns the metadata data frame with the 'recount3_project_name' and 'recount3_samples_category' columns added.
* The 'model_untrained.yaml' configuration file was added to the examples of configuration files available within the package.
Internal changes (for contributors):
* Two new internal functions in the 'bulkDGD.recount3.util' module ('_load_samples_batches_csv' and 'load_samples_batches_yaml') were introduced to parse the input files to 'dgd_get_recount3_data'. The public function 'load_samples_batches' simply calls one of them depending on the file's extension.
* The 'bulkDGD.util.get_handlers' function now accepts two new arguments 'log_level_console' and 'log_level_file' instead of the old 'log_level' to have a more fine-grained control over the log level of the handlers.
* The log level of the console handler for the '_dgd_get_recount3_data_single_batch' execuable was changed to ERROR not to clutter the console too much with all the INFO messages from the subprocesses (which get logged to their own log files anyway if the overall log level is INFO or below).
* The header of the 'bulkDGD/recount3/data/sra_metadata_fields.txt' file was changed to better describe the metadata fileds included in it.
Documentation:
* The documentation was updated to reflect the user-facing changes.
* The readme files for the configurations were removed because of the redundancy with the content of the documentation and of the configuration files themselves.
* The 'model_config_options.rst' file was removed from the documentation because it was empty and not referenced anywhere.1 parent 49848d4 commit 8cd0187
File tree
15 files changed
+752
-512
lines changed- bulkDGD
- execs
- ioutil/configs
- model
- plot
- representations
- recount3
- data
- doc/source
15 files changed
+752
-512
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
65 | 65 | | |
66 | 66 | | |
67 | 67 | | |
68 | | - | |
69 | 68 | | |
70 | 69 | | |
71 | 70 | | |
| |||
80 | 79 | | |
81 | 80 | | |
82 | 81 | | |
83 | | - | |
84 | 82 | | |
85 | 83 | | |
86 | 84 | | |
| |||
92 | 90 | | |
93 | 91 | | |
94 | 92 | | |
95 | | - | |
96 | 93 | | |
97 | 94 | | |
98 | 95 | | |
| |||
102 | 99 | | |
103 | 100 | | |
104 | 101 | | |
105 | | - | |
106 | 102 | | |
107 | 103 | | |
108 | 104 | | |
| |||
139 | 135 | | |
140 | 136 | | |
141 | 137 | | |
142 | | - | |
143 | 138 | | |
144 | 139 | | |
145 | 140 | | |
146 | 141 | | |
147 | 142 | | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
| 148 | + | |
| 149 | + | |
| 150 | + | |
| 151 | + | |
| 152 | + | |
| 153 | + | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
| 157 | + | |
| 158 | + | |
| 159 | + | |
| 160 | + | |
| 161 | + | |
| 162 | + | |
| 163 | + | |
| 164 | + | |
| 165 | + | |
| 166 | + | |
| 167 | + | |
| 168 | + | |
| 169 | + | |
148 | 170 | | |
149 | 171 | | |
150 | 172 | | |
151 | 173 | | |
152 | 174 | | |
153 | 175 | | |
154 | | - | |
155 | 176 | | |
156 | 177 | | |
157 | 178 | | |
| |||
187 | 208 | | |
188 | 209 | | |
189 | 210 | | |
| 211 | + | |
| 212 | + | |
190 | 213 | | |
191 | 214 | | |
192 | 215 | | |
| |||
197 | 220 | | |
198 | 221 | | |
199 | 222 | | |
200 | | - | |
| 223 | + | |
201 | 224 | | |
202 | 225 | | |
203 | 226 | | |
| |||
219 | 242 | | |
220 | 243 | | |
221 | 244 | | |
| 245 | + | |
222 | 246 | | |
223 | | - | |
224 | | - | |
| 247 | + | |
| 248 | + | |
| 249 | + | |
225 | 250 | | |
226 | 251 | | |
227 | 252 | | |
| |||
247 | 272 | | |
248 | 273 | | |
249 | 274 | | |
250 | | - | |
251 | | - | |
252 | | - | |
253 | | - | |
| 275 | + | |
| 276 | + | |
254 | 277 | | |
255 | 278 | | |
256 | 279 | | |
| |||
271 | 294 | | |
272 | 295 | | |
273 | 296 | | |
274 | | - | |
275 | | - | |
276 | | - | |
| 297 | + | |
277 | 298 | | |
278 | 299 | | |
279 | 300 | | |
280 | 301 | | |
281 | 302 | | |
| 303 | + | |
| 304 | + | |
| 305 | + | |
| 306 | + | |
| 307 | + | |
| 308 | + | |
| 309 | + | |
| 310 | + | |
| 311 | + | |
| 312 | + | |
| 313 | + | |
| 314 | + | |
| 315 | + | |
| 316 | + | |
| 317 | + | |
| 318 | + | |
| 319 | + | |
| 320 | + | |
| 321 | + | |
| 322 | + | |
282 | 323 | | |
283 | 324 | | |
284 | 325 | | |
| |||
303 | 344 | | |
304 | 345 | | |
305 | 346 | | |
306 | | - | |
307 | | - | |
308 | | - | |
309 | | - | |
310 | | - | |
311 | | - | |
312 | | - | |
313 | 347 | | |
314 | 348 | | |
315 | 349 | | |
316 | 350 | | |
317 | | - | |
318 | | - | |
319 | | - | |
| 351 | + | |
| 352 | + | |
320 | 353 | | |
321 | 354 | | |
322 | 355 | | |
| |||
328 | 361 | | |
329 | 362 | | |
330 | 363 | | |
331 | | - | |
332 | | - | |
| 364 | + | |
| 365 | + | |
| 366 | + | |
| 367 | + | |
| 368 | + | |
333 | 369 | | |
334 | | - | |
335 | | - | |
336 | | - | |
| 370 | + | |
| 371 | + | |
| 372 | + | |
| 373 | + | |
| 374 | + | |
| 375 | + | |
| 376 | + | |
| 377 | + | |
| 378 | + | |
| 379 | + | |
| 380 | + | |
| 381 | + | |
| 382 | + | |
| 383 | + | |
| 384 | + | |
| 385 | + | |
| 386 | + | |
| 387 | + | |
| 388 | + | |
| 389 | + | |
| 390 | + | |
| 391 | + | |
| 392 | + | |
| 393 | + | |
| 394 | + | |
| 395 | + | |
| 396 | + | |
| 397 | + | |
| 398 | + | |
| 399 | + | |
| 400 | + | |
| 401 | + | |
| 402 | + | |
| 403 | + | |
| 404 | + | |
| 405 | + | |
| 406 | + | |
| 407 | + | |
| 408 | + | |
| 409 | + | |
| 410 | + | |
| 411 | + | |
| 412 | + | |
| 413 | + | |
| 414 | + | |
| 415 | + | |
| 416 | + | |
| 417 | + | |
| 418 | + | |
| 419 | + | |
| 420 | + | |
| 421 | + | |
| 422 | + | |
| 423 | + | |
| 424 | + | |
| 425 | + | |
| 426 | + | |
| 427 | + | |
| 428 | + | |
| 429 | + | |
| 430 | + | |
| 431 | + | |
| 432 | + | |
| 433 | + | |
| 434 | + | |
| 435 | + | |
| 436 | + | |
| 437 | + | |
| 438 | + | |
| 439 | + | |
| 440 | + | |
| 441 | + | |
| 442 | + | |
| 443 | + | |
| 444 | + | |
| 445 | + | |
| 446 | + | |
| 447 | + | |
| 448 | + | |
| 449 | + | |
| 450 | + | |
| 451 | + | |
| 452 | + | |
337 | 453 | | |
338 | 454 | | |
339 | 455 | | |
| |||
364 | 480 | | |
365 | 481 | | |
366 | 482 | | |
367 | | - | |
368 | | - | |
| 483 | + | |
| 484 | + | |
369 | 485 | | |
370 | 486 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
198 | 198 | | |
199 | 199 | | |
200 | 200 | | |
| 201 | + | |
201 | 202 | | |
202 | 203 | | |
203 | 204 | | |
204 | | - | |
| 205 | + | |
205 | 206 | | |
206 | 207 | | |
207 | 208 | | |
| |||
0 commit comments