Data is an essential raw material for the technological development of intelligent algorithms, digital applications and innovative products of our time. Data not only help to describe our (surrounding) world, but also to predict changes or challenges. Data is therefore generated, collected and analysed in almost all areas of life and work.
In particular, the development, training and validation of AI models rely on data. These must be of suitable quality and describe the part of reality in which the models will later be used. By "suitable quality" we mean not only the knowledge and control of disturbing influences, but also the correct and detailed description (annotation) of the processes under consideration.
There are several aspects to modelling acoustic events or environments. An acoustic observation usually consists of one or more individual sound sources with different acoustic properties as well as intentional or unintentional background noise. The radiated sound is then influenced on its way to the receiver by effects of sound propagation, e.g. level decrease with distance, diffraction at objects, refraction at air layers, absorption, transmission and reflection at walls. A sound receiver then converts the sound into an analysable signal. This can be, for example, a human being who perceives his environment with his or her auditory system. Other, technical receivers are electroacoustic sensors, e.g. microphones or microphone arrays and accelerometers, which are specified as part of a sensor system for a specific application.
Depending on the use case, the following therefore applies to the development, training and validation of acoustic AI models: The more detailed the aforementioned parameters are recorded, the better an acoustic data set can be used for the training and validation of a robust and powerful AI model.