2. This work presents a new technique to produce fast and reliable auralizations with a computer code for
room acoustics simulation. It discusses the binaural room impulse responses generation classic method
and presents a new technique using radial basis functions type of artificial neural networks. The radial basis
functions type of artificial neural networks is briefly presented and its training and testing proce-dures are
discussed. The artificial neural network models the filtered head-related impulse responses for 64,442
directions uniformly distributed around the head with a significant reduction in computa-tional cost of
around 90% in the generation of binaural impulse responses. It is shown that the filtered head-related
impulse responses calculated with the classical convolution method and with the artificial neural network
technique are almost indistinguishable. It is concluded that the new technique produces fastest and
reliable binaural room impulse responses for auralization purposes.
a b s t r a c t
3. This work deals with room acoustics computer simulation and its techniques to generate auralization at
selected seats in the room. In general, the room acoustics simulation follows the requirements of
geometrical acoustics. This means that the sound waves can be treated as acoustic rays that leave the
sound source and propagate in the room, reflecting and refracting on their internal surfaces. There are two
main ways of modeling acous-tic rays: the ray-tracing method and the image source method. There are
also hybrid algorithms that use the image source method for the calculation of first specular reflections
and the ray-tracing method for the calculus of the remaining ones. However, as already pointed out by
several authors, diffuse reflection plays an important role in room acoustics, providing a greater uniformity
in the sound field. Having in mind the roomâs auralization, the diffuse reflections are also fundamental,
leading to greater authenticity. In this case, it seems essential to have a good model to deal with diffuse
reflections, since the ray-tracing technique cannot handle properly. One of the ways to approxi-mately
model diffuse reflections is the radiosity technique.
Introduction
4. â Radial basis functions type of artificial neural network
â Training and testing the artificial neural network set
â Fast auralization with ann technique
â Computational cost
â Comparative results for filtered HRIRs
â Conclusion remarks
Index
5. An artificial neural network (ANN) is an information processing system based on simplified mathematical
models of biological neurons whose learning process results from experience. The knowledge gained by
the network through the examples are stored in the form of connection synaptic weights that are adjusted
in order to make the correct decisions when presented to new entries. In other words, the network has the
ability to generalize the learned information. The process of adjusting synaptic weights is performed by the
learning algorithm. Artificial neural networks are useful tools for solving many types of problems as, for
instance, classification, grouping, optimization, approximation and forecast-ing. One of the main
applications of ANNs is on pattern recognition, and this is the application under consideration here: the
ANNs are trained to learn the HRIRs patterns.
Radial basis functions type of artificial neural network
8. The RBF parameters were calculated as follows: first the centers are obtained using the non-supervised K-
means algorithm. Once the centers have been calculated, the widths are deter-mined. Finally, after
defining the parameters of the radial func-tions, the free parameters of the output layer are computed
using the same procedures that are used for the output layer of other types of neural networks.
Training and testing the artificial neural network set
12. Once the acoustic field in the room simulation is completed, the goal in sequel consists in the
determination of the room impulse mono (RIRs) and binaural (BRIRs) responses at selected points. As
regards the calculation of RIRs, it is about converting the energy arrival, via Hilbertâs transform [48] and
filtering in octave bands, obtaining filtered impulse responses, whose computational cost is relatively
small. In order to compute the BRIRs, however, it is necessary to take into account the head-related
impulse responses (HRIRs) â or their corresponding in frequency domain, the so-called head-related
transfer functions (HRTFs). In the computa-tional codes that generate auralization, this is usually done via
the convolution procedure.
Fast auralization with ann technique
14. In order to examine the convolution method (CM) numerical efficiency and that of the artificial neural
network method (ANNM), a comparison is made as to the number of arithmetic operations that each
technique requires.
The number of operations in the convolution method to com-pute the filtered HRIRs equals the sum of two
parts. The first one corresponds to the number of multiplications between the ray spectrum (in octave
bands) and the HRTF of the considered direc-tion. Note, however, that due to the HRTF symmetry, only l/2
prod-ucts, with l being the number of samples, are necessary. The second one corresponds to the number
of operations for calculating the inverse Fourier transform.
Computational cost
18. As mentioned, the convolution technique is the classic BRIR generation method and it is present in almost
all acoustic field sim-ulation software with auralization, to our knowledge. Therefore, in order to verify the
reliability of the method of generating BRIRs with artificial neural networks of the radial basis function type,
a comparison between the two methods is presented in the sequel. Since, once the filtered HRIRs are
generated, the procedure is iden-tical, involving the delays and sum to generate the BRIRs, the com-
parison between the two methods will be done among the filtered HRIRs. In other words, since the
procedures of delay and sum of the filtered HRIRs are exactly the same in the two techniques, if the
filtered HRIRs computed by both techniques are almost identical, the resulting BRIR will be also the same.
Comparative results for filtered HRIRs
20. List of references:
[1] Kuttruff H. Room acoustics. 5th ed. London: Spon Press; 2009.
[2] Savioja L, Svensson UP. Overview of geometrical room acoustic modeling techniques. J
Acoust Soc Am 2015;138(2):708â30.
[3] Ondet M, Barbry JL. Modeling of sound propagation in fitted workshops using ray
tracing. J Acoust Soc Am 1989;85(2):787â96.
[4] Borish J. Extension of the image model to arbitrary polyhedra. J Acoust Soc Am
1984;75:1827.
[5] VorlÀnder M. Simulation of the transient and steady-state sound propagation in rooms
using a new combined ray-tracing/image-source algorithm. J Acoust Soc Am
1989;86(1):172â8.
Refrences
21. List of references:
[6] DâAntonio P, Cox TJ. Diffusor application in rooms. Appl Acoust 2000;60:113â42.
[7] Cox TJ, DalenbĂ€ck BI, DâAntonio P, Embrechts JJ, Jeon JY, Mommertz E, et al. A tutorial
on scattering and diffusion coefficients for room acoustic surfaces. Acta Acustica United
Acustica 2006;92(1):1â15.
[8] AlarcĂŁo D, Bento Coelho JL, Tenenbaum RA, On modeling of room acoustics by a sound
energy transition approach. In: Proceedings of EEA Symposium on Architectural Acoustics,
2000.
[9] Tenenbaum RA, Camilo TS, Torres JCB, Gerges SNY. Hybrid method for numerical
simulation of room acoustics: part 1 â theoretical and numerical aspects. J Braz Soc Mech
Sci Eng 2007;29(2):211â21.
Refrences
22. List of references:
[10] Schröeder M, Digital computers in room acoustics. In: Proc. 4th ICA, Copenhagen, 1962.
[11] Blauert J, Lehnert H, Pompetzki W, Xiang N. Binaural room simulation. Acustica
1990;72:295â6.
[12] Ahnert W, Feistel R, Binaural auralization from a sound system simulation programme.
In: Proc. 91th AES Convention, New York, 1991.
[13] Lehnert H, Blauert J. Principles of binaural room simulation. Appl Acoust 1992;36:259.
[14] MĂžller H. Fundamentals of binaural technology. Appl Acoust 1992;36:171.
[15] Vian J-P, Martin J. Binaural room acoustics simulation: practical uses and applications.
Appl Acoust 1992;36:293.
Refrences
23. List of references:
[16] Kleiner M, DalenbĂ€ck B-I, Svensson P. Auralization â an overview. J Audio Eng Soc
1993;41:861.
[17] Begault D. 3-D sound for virtual reality and multimedia. Cambridge: Academic Press
Professional; 1994.
[18] DalenbÀck B-I, McGrath D. Narrowing the gap between virtual reality and auralization.
In: Proc. 15th ICA, Trondheim, 1995.
[19] Sottek R, Virtual binaural auralization of product sound quality: importance and
application in practice. Proc. EURONOISE, Naples, 2003.
[20] Rindel JH, Otondo F, Christensen CL. Sound source representation for auralization. In:
Proc. Int. Symp. on Room Acoust., Awaji, 2004.
Refrences
24. List of references:
[21] Torres JCB, Petraglia MR, Tenenbaum RA. An efficient wavelet-based HRTF for
auralization. Acta Acustica United Acustica 2004;90:108.
[22] Otondo F, Rindel JH. A new method for the radiation representation of musical
instruments in auralizations. Acta Acustica United Acustica 2005;91:902.
[23] DalenbĂ€ck B-I, Strömberg M. Real time walkthrough auralization â the first year. In: Proc.
IOA Spring Conference, Copenhagen, 2006.
[24] Summers JE. What exactly is meant by the term âauralization?â. J Acoust Soc Am
2008;124(2):697.
[25] DalenbÀck B-I. A new model for room acoustics prediction and auralization Doctoral
thesis. Gothenburg: Chalmers University; 1995.
Refrences
25. List of references:
[26] HammershĂži D. Binaural technique â a method of true 3-D sound reproduction.
Doctoral thesis. Denmark: Aalborg University; 1995.
[27] Savioja L. Modelling techniques for virtual acoustics Doctoral thesis. Finland: Helsinki
University of Technology; 1999.
[28] Lokki T. Physically-based auralization â design implementation and evaluation Doctoral
thesis. Finland: Helsinki University of Technology; 2002.
[29] Torres JCB. Efficient auralization system using wavelet transforms Doctoral thesis.
Brazil: Federal University of Rio de Janeiro; 2004.
Refrences
26. List of references:
[30] Schröeder D. Integration of real-time room acoustical simulations in VR environments
Doctoral thesis. Germany: RWTH Aachen University; 2004.
[31] Thaden R. Auralization in building acoustics Doctoral thesis. Germany: RWTH Aachen
University; 2005.
[32] Naranjo JFL. Machine learning applied in the generation of binaural room impulse
responses and in auralization in rooms Doctoral thesis. State University of Rio de Janeiro;
2014.
[33] Taminato FO. Artificial neural networks applied to model head-related impulse
responses to generate auralization Ph.D. Thesis. State University of Rio de Janeiro; 2018.
Refrences
27. List of references:
[34] Savioja L, Xiang N. Introduction to the special issue on room acoustic modeling and
auralization. J Acoust Soc Am 2019;145(4):2597â600.
[35] Brinkmann F, Aspöck L, Ackermann D, Lepa S, VorlÀnder M, Weinzierl S. A round robin
on room acoustical simulation and auralization. J Acoust Soc Am 2019;145(4):2746â60.
[36] Lindau A, Erbes V, Lepa S, Maempel HJ, Brinkmann F, Weinzierl S. A spatial audio
quality inventory for virtual acoustic environments (SAQI). Acta Acustica United Acustica
2014;100(5):984â94.
[37] Peng J. Feasibility of subjective speech intelligibility assessment based on auralization.
Appl Acoust 2005;66:591â601.
Refrences
28. List of references:
[38] Hodgson M, York N, Yang W, Bliss M. Comparison of predicted, measured and
auralized sound fields with respect to speech intelligibility in classrooms using CATT-
Acoustic and ODEON. Acta Acustica United Acustica 2008;94(6):883â90.
[39] Tenenbaum RA, Taminato FO, Melo VSG, Torres JCB. Auralization generated by
modeling HRIRs with artificial neural networks and its validation using articulation tests.
Appl Acoust 2018;130:260â9.
[40] Haykin S. Neural networks and learning machines. 3rd ed. New Jersey: Prentice Hall;
2009.
[41] Mulgrew B. Applying radial basis functions. IEEE Signal Process Mag 1996:50â65.
Refrences
29. List of references:
[42] Bishop C. Neural networks for pattern recognition. Oxford: Oxford University Press;
2005.
[43] Broomhead DS, Lowe D. Multivariable functional interpolation and adaptive networks.
Complex Syst 1988;2:321â55.
[44] Moody J, Darken CJ. Fast learning in networks of locally-tuned processing units. Neural
Comput 1989;1:281â94.
[45] Mulgrew B. Applying radial basis functions. IEEE Signal Process Mag 1996;50â65.
[46] Chen S, Muldrew B, Mclaughlin S, Adaptative bayesian feedback equalizer based on a
radial basis function network. IEEE International Conference on Communications, Chicago,
3, pp. 1267â1271, 1992.
Refrences
30. List of references:
[47] Brinkmann F., Lindau A., Weinzierl S., Geissler G., van de Par S., MĂŒller-Trapet M.,
Opdam R., VorlÀnder M. The FABIAN head-related transfer function data base. (2017),
available at http://dx.doi.org/10.14279/depositonce-5718.2 (Last viewed September, 2018).
[48] Kak S. Number theoretic Hilbert transform. Circ Syst Signal Process 2014;33:2539â48.
[49] Blauert J. Spatial hearing. Cambridge: The MIT Press; 1997.
[50] Willcox RR. Introduction to robust estimation and hypothesis testing. New York:
Academic Press; 2005.
[51] Proakis JG, Manolakis DG. Digital signal processing: principles, algorithms and
applications. 3rd ed. New Jersey: Prentice-Hall; 1996.
Refrences
31. List of references:
[52] Conte SD, Boor C. Elementary numerical analysis: an algorithmic approach. 3rd ed. New
York: McGraw-Hill; 1980.
[53] Xie B, Zhong X, He N. Typical data and cluster analysis on head-related transfer
functions. Appl Acoust 2015;94:1â13.
Refrences