convert_txtCollection: Converts a collection of audio files and plain text...

View source: R/emuR-convert_txtCollection.R

convert_txtCollection    R Documentation

Converts a collection of audio files and plain text transcriptions into an emuDB

Description

This function takes pairs of media files (e.g. wav files) and plain text transcription files as input. It creates a new emuDB with one bundle per media file and turns the associated transcription into an item in that bundle. For this to work, the media file and the text file belonging to the same bundle must have identical names apart from their file extensions. The newly created emuDB is stored in the target directory, and its handle is returned.
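
The naming requirement can be checked before conversion. The following is a minimal base R sketch, not emuR's own implementation: pair_by_basename is a hypothetical helper that lists which base names have both a media file and a text file, and which files are unpaired. How emuR itself treats unpaired files is not stated here.

```r
# Hypothetical helper (not part of emuR): match media and text files
# in a directory by file name without extension.
pair_by_basename <- function(source_dir, txt_ext = "txt", media_ext = "wav") {
  txt_files <- list.files(source_dir, pattern = paste0("\\.", txt_ext, "$"))
  media_files <- list.files(source_dir, pattern = paste0("\\.", media_ext, "$"))
  txt_base <- tools::file_path_sans_ext(txt_files)
  media_base <- tools::file_path_sans_ext(media_files)
  list(
    paired = intersect(media_base, txt_base),
    media_without_txt = setdiff(media_base, txt_base),
    txt_without_media = setdiff(txt_base, media_base)
  )
}

# Demo with empty placeholder files; only the names matter for this check.
demo_dir <- tempfile("txt_collection_")
dir.create(demo_dir)
invisible(file.create(
  file.path(demo_dir, c("utt1.wav", "utt1.txt", "utt2.wav", "utt3.txt"))
))
pairs <- pair_by_basename(demo_dir)
print(pairs)
```

In this demo only utt1 has both files, so it is the only base name that satisfies the naming requirement.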

Usage

convert_txtCollection(
  dbName,
  sourceDir,
  targetDir,
  txtExtension = "txt",
  mediaFileExtension = "wav",
  attributeDefinitionName = "transcription",
  cleanWhitespaces = TRUE,
  verbose = TRUE
)
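
A call might look like the sketch below. It is an illustration under stated assumptions rather than an official example: write_silent_wav is a hypothetical base R helper that writes a short 16-bit mono PCM file so that the source directory contains a real media file, the directory and database names are made up, and the conversion is only attempted if emuR is installed.

```r
# Hypothetical helper (not part of emuR): write a silent 16-bit mono PCM WAV.
write_silent_wav <- function(path, n_samples = 1600L, sample_rate = 16000L) {
  data_bytes <- n_samples * 2L
  con <- file(path, "wb")
  on.exit(close(con))
  writeBin(charToRaw("RIFF"), con)
  writeBin(36L + data_bytes, con, size = 4L, endian = "little")
  writeBin(charToRaw("WAVE"), con)
  writeBin(charToRaw("fmt "), con)
  writeBin(16L, con, size = 4L, endian = "little")               # fmt chunk size
  writeBin(1L, con, size = 2L, endian = "little")                # PCM format
  writeBin(1L, con, size = 2L, endian = "little")                # one channel
  writeBin(sample_rate, con, size = 4L, endian = "little")
  writeBin(sample_rate * 2L, con, size = 4L, endian = "little")  # byte rate
  writeBin(2L, con, size = 2L, endian = "little")                # block align
  writeBin(16L, con, size = 2L, endian = "little")               # bits per sample
  writeBin(charToRaw("data"), con)
  writeBin(data_bytes, con, size = 4L, endian = "little")
  writeBin(integer(n_samples), con, size = 2L, endian = "little")
  invisible(path)
}

# Assumed demo layout: one wav/txt pair sharing the base name "utt1".
source_dir <- tempfile("txt_source_")
target_dir <- tempfile("txt_target_")
dir.create(source_dir)
dir.create(target_dir)
wav_path <- write_silent_wav(file.path(source_dir, "utt1.wav"))
writeLines("this is   a test", file.path(source_dir, "utt1.txt"))

# Run the conversion only if emuR is available; other arguments keep their defaults.
if (requireNamespace("emuR", quietly = TRUE)) {
  db_handle <- emuR::convert_txtCollection(
    dbName = "demo",
    sourceDir = source_dir,
    targetDir = target_dir,
    verbose = FALSE
  )
}
```

Setting verbose = FALSE suppresses the progress bar, which is convenient in scripts and non-interactive sessions.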

Arguments

dbName

name of the new emuDB

sourceDir

directory containing the plain text transcription files and media files

targetDir

directory where the new emuDB will be stored

txtExtension

file extension of transcription files

mediaFileExtension

file extension of media files

attributeDefinitionName

label name of the transcription items

cleanWhitespaces

if TRUE, any sequence of whitespace characters in the transcription (including newlines and tabs) is replaced by a single space

verbose

if TRUE, display a progress bar
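
The effect described for cleanWhitespaces can be approximated in base R as follows. This is a sketch of the described behaviour, not emuR's actual code; the regular expression is an assumption, and whether emuR also trims leading or trailing whitespace is not stated above.

```r
# Collapse every run of whitespace (spaces, tabs, newlines) into one space.
raw_text <- "hello   world\n\tthis is\r\n a  test"
cleaned <- gsub("[[:space:]]+", " ", raw_text)
print(cleaned)
```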

See Also

convert_BPFCollection, convert_TextGridCollection


emuR documentation built on Nov. 4, 2023, 1:06 a.m.