Voice
Voice(
sample_rate = 16000,
n_fft = 1024,
win_length = NULL,
hop_length = 128,
f_min = 50,
f_max = 8000,
pad = 0,
n_mels = 128,
window_fn = torch()$hann_window,
power = 2,
normalized = FALSE,
wkwargs = NULL,
mel = TRUE,
to_db = TRUE
)
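sample_rate: sample rate of the audio, in Hz
n_fft: size of the FFT, in samples per transform
win_length: window length, in samples
hop_length: hop length between successive windows, in samples
f_min: minimum frequency, in Hz
f_max: maximum frequency, in Hz
pad: amount of padding added to both sides of the signal
n_mels: number of mel filterbanks
window_fn: function that creates the window tensor
power: exponent applied to the magnitude spectrogram (1 for magnitude, 2 for power)
normalized: whether to normalize by magnitude after the STFT
wkwargs: additional arguments passed to the window function
mel: whether to compute a mel-spectrogram
to_db: whether to convert the output to decibels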
None
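The defaults in the usage above imply a particular time and frequency resolution. The following base R sketch works that arithmetic out. It does not call Voice() itself, and the variable names are illustrative only. It assumes that a NULL win_length falls back to n_fft, as in the torchaudio transforms; that behaviour is not stated on this page.

```r
# Defaults copied from the Voice() usage above
sample_rate <- 16000
n_fft       <- 1024
win_length  <- NULL
hop_length  <- 128

# Assumption: a NULL win_length falls back to n_fft (torchaudio convention)
if (is.null(win_length)) win_length <- n_fft

n_freq_bins  <- n_fft %/% 2 + 1                  # one-sided FFT bins before the mel projection
window_ms    <- 1000 * win_length / sample_rate  # duration covered by one window
hop_ms       <- 1000 * hop_length / sample_rate  # time step between frames
frames_per_s <- sample_rate / hop_length         # spectrogram frames per second of audio
nyquist_hz   <- sample_rate / 2                  # highest representable frequency

cat(n_freq_bins, window_ms, hop_ms, frames_per_s, nyquist_hz, "\n")
# 513 64 8 125 8000
```

Under these assumptions each frame covers 64 ms of audio and advances by 8 ms, and the default f_max of 8000 coincides with the Nyquist frequency for 16 kHz audio.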