-
Student behaviour and engagement with adaptive exercises on a thermodynamics course
Authors:
Matti Harjula,
Ville Havu,
Inkeri Kontro,
Kimmo Kulmala,
Jarmo Malinen,
Petri Salo
Abstract:
A teaching experiment was carried out in a university-level thermodynamics course using adaptive and interactive e-learning material, created in the new Moodle question type Stateful extending the original e-learning platform STACK. The system collects data about the students that is used to algorithmically classify them according to their behaviour in solving problems. It is observed that the cla…
▽ More
A teaching experiment was carried out in a university-level thermodynamics course using adaptive and interactive e-learning material, created in the new Moodle question type Stateful extending the original e-learning platform STACK. The system collects data about the students that is used to algorithmically classify them according to their behaviour in solving problems. It is observed that the classification of this data predicts students' success in the other parts of the course for a majority of students. Also, the classification is statistically consistent with Thermodynamic Concept Survey and Maryland Physics Expectation Survey.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
Notes on glottal flow and acoustic inertial effects
Authors:
Jarmo Malinen
Abstract:
This text is a compilation of some of the notes that the author has written during the development of the low-order model "DICO" [2, 8, 10, 11] for vowel phonation and the even more rudimentary glottal flow model [9] for processing high-speed glottal video data. The following subject matters are covered: (i) Incompressible, laminar, lossless flow models for idealised rectangular and wedge shape vo…
▽ More
This text is a compilation of some of the notes that the author has written during the development of the low-order model "DICO" [2, 8, 10, 11] for vowel phonation and the even more rudimentary glottal flow model [9] for processing high-speed glottal video data. The following subject matters are covered: (i) Incompressible, laminar, lossless flow models for idealised rectangular and wedge shape vocal fold geometries. Equations of motion and the pressure distribution are computed in a closed form for each model using the unsteady Bernoulli's theorem; (ii) The assumption of incompressibility and energy loss (i.e., irrecoverable pressure drop) of the airflow in airways (including the glottis) is discussed using steady compressible Bernoulli theorem as the main tool; (iii) Inertia of an uniform waveguide is studied in terms of the low-frequency limit of the the (acoustic) impedance transfer function. It is observed that the inductive loading in the boundary condition sums up with the waveguide inertance in an expected way; (iv) It is shown that an acoustic waveguide, modelled by Webster's lossless equation with Dirichlet boundary condition at the far end, will produce the expected mass inertance of the fluid column as the low-frequency limit of the impedance transfer function.
△ Less
Submitted 11 November, 2019;
originally announced November 2019.
-
An acoustic glottal source for vocal tract physical models
Authors:
Antti Hannukainen,
Juha Kuortti,
Jarmo Malinen,
Antti Ojalammi
Abstract:
A sound source was proposed for acoustic measurements of physical models of the human vocal tract. The physical models are produced by Fast Prototyping, based on Magnetic Resonance Imaging during prolonged vowel production. The sound source, accompanied by custom signal processing algorithms, is used for two kinds of measurements: (i) amplitude frequency response and resonant frequency measurement…
▽ More
A sound source was proposed for acoustic measurements of physical models of the human vocal tract. The physical models are produced by Fast Prototyping, based on Magnetic Resonance Imaging during prolonged vowel production. The sound source, accompanied by custom signal processing algorithms, is used for two kinds of measurements: (i) amplitude frequency response and resonant frequency measurements of physical models, and (ii) signal reconstructions at the source output according to a target waveform with measurements at the mouth position of the physical model. The proposed source and the software are validated by measurements on a physical model of the vocal tract corresponding to vowel [a] of a male speaker.
△ Less
Submitted 26 April, 2017; v1 submitted 5 April, 2017;
originally announced April 2017.
-
Post-processing speech recordings during MRI
Authors:
Juha Kuortti,
Jarmo Malinen,
Antti Ojalammi
Abstract:
We discuss post-processing of speech that has been recorded during Magnetic Resonance Imaging (MRI) of the vocal tract. Such speech recordings are contaminated by high levels of acoustic noise from the MRI scanner. Also, the frequency response of the sound signal path is not flat as a result of severe restrictions on recording instrumentation due to MRI technology.
The post-processing algorithm…
▽ More
We discuss post-processing of speech that has been recorded during Magnetic Resonance Imaging (MRI) of the vocal tract. Such speech recordings are contaminated by high levels of acoustic noise from the MRI scanner. Also, the frequency response of the sound signal path is not flat as a result of severe restrictions on recording instrumentation due to MRI technology.
The post-processing algorithm for noise reduction is based on adaptive spectral filtering. The speech material consists of samples of prolonged vowel productions that are used for validation of the post-processing algorithm. The comparison data is recorded in anechoic chamber from the same test subject. Formant analysis is carried out for the post-processed speech and the comparison data. Artificially noise-contaminated vowel samples are used for validation experiments to determine performance of the algorithm where using true data would be difficult.
The properties of recording instrumentation or the post-processing algorithm do not explain the consistent frequency dependent discrepancy between formant data from experiments during MRI and in anechoic chamber. It is shown that the discrepancy is statistically significant, in particular, where it is largest at 1 kHz and 2 kHz. The reflecting surfaces of the MRI head and neck coil are suspected to change the speech acoustics which results in "external formants" at these frequencies. However, the role of test subject adaptation to noise and constrained space acoustics during an MRI examination cannot be ruled out.
△ Less
Submitted 21 June, 2016; v1 submitted 17 September, 2015;
originally announced September 2015.
-
Modal locking between vocal fold and vocal tract oscillations: Simulations in time domain
Authors:
Atte Aalto,
Tiina Murtola,
Jarmo Malinen,
Daniel Aalto,
Martti Vainio
Abstract:
During voiced speech, the human vocal folds interact with the vocal tract acoustics. The resulting glottal source-resonator coupling has been observed using mathematical and physical models as well as in in vivo phonation. We propose a computational time-domain model of the full speech apparatus that, in particular, contains a feedback mechanism from the vocal tract acoustics to the vocal fold osc…
▽ More
During voiced speech, the human vocal folds interact with the vocal tract acoustics. The resulting glottal source-resonator coupling has been observed using mathematical and physical models as well as in in vivo phonation. We propose a computational time-domain model of the full speech apparatus that, in particular, contains a feedback mechanism from the vocal tract acoustics to the vocal fold oscillations. It is based on numerical solution of ordinary and partial differential equations defined on vocal tract geometries that have been obtained by Magnetic Resonance Imaging. The model is used to simulate rising and falling pitch glides of [a, i] in the fundamental frequency (f_o) interval [150 Hz, 320 Hz]. The interval contains the first vocal tract resonance f_R1 and the first formant F1 of [i] as well as the fractions of the first resonance f_R1/4 and fR1/3 of [a].
The simulations reveal a locking pattern of the fo-trajectory at f_R1 of [i] in falling and rising glides. The resonance fractions of [a] produce perturbations in the pressure signal at the lips but no locking. All these observations from the model behaviour are consistent and robust within a wide range of feasible model parameter values and under exclusion of secondary model components.
△ Less
Submitted 15 March, 2017; v1 submitted 3 June, 2015;
originally announced June 2015.
-
Measurement of acoustic and anatomic changes in oral and maxillofacial surgery patients
Authors:
Daniel Aalto,
Olli Aaltonen,
Risto-Pekka Happonen,
Päivi Jääsaari,
Atle Kivelä,
Juha Kuortti,
Jean-Marc Luukinen,
Jarmo Malinen,
Tiina Murtola,
Riitta Parkkola,
Jani Saunavaara,
Tero Soukka,
Martti Vainio
Abstract:
We describe an arrangement for simultaneous recording of speech and geometry of vocal tract in patients undergoing surgery involving this area. Experimental design is considered from an articulatory phonetic point of view. The speech and noise signals are recorded with an acoustic-electrical arrangement. The vocal tract is simultaneously imaged with MRI. A MATLAB-based system controls the timing o…
▽ More
We describe an arrangement for simultaneous recording of speech and geometry of vocal tract in patients undergoing surgery involving this area. Experimental design is considered from an articulatory phonetic point of view. The speech and noise signals are recorded with an acoustic-electrical arrangement. The vocal tract is simultaneously imaged with MRI. A MATLAB-based system controls the timing of speech recording and MR image acquisition. The speech signals are cleaned from acoustic MRI noise by a non-linear signal processing algorithm. Finally, a vowel data set from pilot experiments is compared with validation data from anechoic chamber as well as with Helmholtz resonances of the vocal tract volume.
△ Less
Submitted 11 September, 2013;
originally announced September 2013.
-
Modal locking between vocal fold and vocal tract oscillations: Experiments and statistical analysis
Authors:
Daniel Aalto,
Jarmo Malinen,
Martti Vainio
Abstract:
The human vocal folds are known to interact with the vocal tract acoustics during voiced speech production; namely a nonlinear source-filter coupling has been observed both by using models and in \emph{in vivo} phonation. These phenomena are approached from two directions in this article. We first present a computational dynamical model of the speech apparatus that contains an explicit filter-sour…
▽ More
The human vocal folds are known to interact with the vocal tract acoustics during voiced speech production; namely a nonlinear source-filter coupling has been observed both by using models and in \emph{in vivo} phonation. These phenomena are approached from two directions in this article. We first present a computational dynamical model of the speech apparatus that contains an explicit filter-source feedback mechanism from the vocal tract acoustics back to the vocal folds oscillations. The model was used to simulate vocal pitch glideswhere the trajectory was forced to cross the lowest vocal tract resonance, i.e., the lowest formant $F_1$. Similar patterns produced by human participants were then studied. Both the simulations and the experimental results reveal an effect when the glides cross the first formant (as may happen in \textipa{[i]}). Conversely, this effect is not observed if there is no formant within the glide range (as is the case in \textipa{[\textscripta]}). The experiments show smaller effect compared to the simulations, pointing to an active compensation mechanism.
△ Less
Submitted 16 November, 2015; v1 submitted 20 November, 2012;
originally announced November 2012.