TY - ADVS
T1 - Datasets used to train and test the Cortical Spectro-Temporal Model (CSTM)
AU - Dematties, Dario
AU - Thiruvathukal, George K.
AU - Rizzi, Silvio
AU - Wainselboim, Alejandro Javier
AU - Zanutto, Bonifacio Silvano
N1 - Dematties, Dario, Thiruvathukal, George K., Rizzi, Silvio, Wainselboim, Alejandro Javier, & Zanutto, Bonifacio Silvano. (2019). Datasets used to train and test the Cortical Spectro-Temporal Model (CSTM). (Version v1.0) [Data set]. Zenodo. http://doi.org/10.5281/zenodo.2576130
PY - 2019/3/1
Y1 - 2019/3/1
N2 - ZIP files of folders containing all the datasets (audio file corpora) employed in our research to train the Encoder Layer (EL) and the SVMs and to test the complete CSTM. This folder includes a set of 840 corpora which are distributed in 2 corpora for each configuration organized by 2 sets of synthesized voices, 3 syllabic conditions (i.e. mono-, di- and tri-syllabic English words) and 10 completely different vocabularies all distributed in 6 acoustic variants, beyond the original version of the corpora. The 6 acoustic variants corresponds to: two levels of white noise (19.8 dB and 13.8 dB Signal to Noise Ratio (SNR) average Root Mean Square (RMS) power rate), two levels of reverberation (Reveberation-Time 60 dB (RT-60) value of 0.61 seconds and 1.78 seconds) and variations of pitch on both directions (from E to G and from E to C).
AB - ZIP files of folders containing all the datasets (audio file corpora) employed in our research to train the Encoder Layer (EL) and the SVMs and to test the complete CSTM. This folder includes a set of 840 corpora which are distributed in 2 corpora for each configuration organized by 2 sets of synthesized voices, 3 syllabic conditions (i.e. mono-, di- and tri-syllabic English words) and 10 completely different vocabularies all distributed in 6 acoustic variants, beyond the original version of the corpora. The 6 acoustic variants corresponds to: two levels of white noise (19.8 dB and 13.8 dB Signal to Noise Ratio (SNR) average Root Mean Square (RMS) power rate), two levels of reverberation (Reveberation-Time 60 dB (RT-60) value of 0.61 seconds and 1.78 seconds) and variations of pitch on both directions (from E to G and from E to C).
UR - https://ecommons.luc.edu/cs_facpubs/212
UR - https://doi.org/10.5281/zenodo.2576130
M3 - Digital or Visual Products
ER -