Skip to content

Conversation

@rfejgin
Copy link
Collaborator

@rfejgin rfejgin commented Dec 20, 2025

Added a command line option to select a subset of datasets to run inference on.

Reasoning: for day-to-day work we need a way to select a subset of datasets to run inference on. Directly editing the JSON file leads to non-reproducible local testing as the JSON file is edited over and over in a non-traceable way. Hence adding a programmatic way to choose datasets. It's entirely optional to specify this new command line argument; if not specified, all datasets in the JSON file will be processed.

New command line argument format: --datasets <dataset1,dataset2,...> where
dataset1, dataset2, ... are the names of datasets to process in the
datasets_json_path file.

If not specified, all datasets in the datasets_json_path will be processed.
If specified, only the datasets in the list will be processed.

New command line argument: --datasets <dataset1,dataset2,...> where
dataset1, dataset2, ... are the names datasets to process in the
datasets_json_path file.

If not specified, all datasets in the datasets_json_path will be processed.
If specified, only the datasets in the list will be processed.

Signed-off-by: Fejgin, Roy <rfejgin@nvidia.com>
* Correctly handle comma-separated list of dataset names in the --datasets argument.
* Help text

Signed-off-by: Fejgin, Roy <rfejgin@nvidia.com>
@github-actions github-actions bot added the TTS label Dec 20, 2025
@rfejgin rfejgin changed the title [TTS] MagpieTTS inference: Add command line to select a subset of datasets to run inference on [TTS] MagpieTTS inference: Add command line option to select a subset of datasets to run inference on Dec 20, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant