speechcommand_dataset: Speech Commands Dataset
In curso-r/torchaudio: R Interface to 'pytorch''s 'torchaudio'

Speech Commands Dataset

speechcommand_dataset(
  root,
  url = "speech_commands_v0.02",
  folder_in_archive = "SpeechCommands",
  download = FALSE,
  normalization = NULL
)

`root`	(str): Path to the directory where the dataset is found or downloaded.
`url`	(str, optional): The URL to download the dataset from, or the type of the dataset to download. Allowed type values are `"speech_commands_v0.01"` and `"speech_commands_v0.02"`. (default: `"speech_commands_v0.02"`)
`folder_in_archive`	(str, optional): The top-level directory of the dataset. (default: `"SpeechCommands"`)
`download`	(bool, optional): Whether to download the dataset if it is not found at the `root` path. (default: `FALSE`)
`normalization`	(NULL, bool, int or function): Optional normalization. If `TRUE`, the output is divided by 2^31; assuming the input is signed 32-bit audio, this normalizes it to [-1, 1]. If numeric, the output is divided by that number. If a function, the output is passed as an argument to that function, and the output is then divided by the result. (default: `NULL`)
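
The `normalization` rules above can be illustrated with a small base-R sketch. The helper `apply_normalization()` and the `samples` vector are hypothetical names introduced only for this illustration; this is not the package's internal code, and it assumes that `NULL` (and `FALSE`) leave the output unchanged.

```r
# Hypothetical helper mirroring the documented `normalization` rules.
# It is an illustration only, not the torchaudio implementation.
apply_normalization <- function(x, normalization = NULL) {
  if (is.null(normalization) || isFALSE(normalization)) {
    return(x)                      # assumption: no normalization requested
  }
  if (isTRUE(normalization)) {
    return(x / 2^31)               # signed 32-bit audio maps to [-1, 1]
  }
  if (is.numeric(normalization)) {
    return(x / normalization)      # divide by the supplied number
  }
  if (is.function(normalization)) {
    return(x / normalization(x))   # divide by the function's result
  }
  stop("unsupported `normalization` value")
}

# Made-up sample values standing in for raw 32-bit audio
samples <- c(-2^31, 0, 2^30)

by_flag   <- apply_normalization(samples, TRUE)
by_number <- apply_normalization(samples, 2^30)
by_peak   <- apply_normalization(samples, function(v) max(abs(v)))
```

With the package installed, the same kinds of values would be passed through the documented argument, for example `speechcommand_dataset(root, download = TRUE, normalization = TRUE)`.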