Glossary

A-D

Acoustic Monitoring

Acoustic monitoring processes/methods can be used for sensor-based process monitoring or to determine/evaluate product properties. Applications can be found, for example, in quality assurance or predictive maintenance, where acoustic signatures (and/or changes to them) are analysed for the condition and error monitoring of machines, plant and products. Increased/higher product quality and its assurance, as well as less/plannable/minimised downtime can generate added value. 

Artificial Intelligence (AI)

Artificial intelligence (AI) is a form of machine learning that imitates human abilities, such as problem-solving strategies, text comprehension, audio enhancement, acoustic pattern recognition or image analysis. Typically, deep artificial neural networks are used, which model the human nervous system and transpose it into machine-readable form. These DNNs can be trained or adopted with the help of a large amount of data. 

Artificial Neuronal Network

An artificial neural network is mimicking some structures of the human brain. Both consist of a large number of individual neurons connected to each other. These neurons receive, weight, and summarise incoming signals and forward the modified result. In artificial neural networks, several groups of neurons arranged on top of each other form what is called a layer. Many successive layers are aggregated to form a deep artificial neural network. The parameters that determine how the individual inputs of a single neuron are weighted are variable and optimised during training so that the task in question is optimally solved. In this way, phonemes, sounds, speech, and other items can be recognised. 

Audio Signal Enhancement

Audio signal enhancement describes methods used to process acoustic signals in such a way that the highest possible quality is achieved at the point of processing. Processing can be done by a person or a machine. Apart from the selection of optimal sound transducer components with the associated digitalisation and specification of their number, audio signal enhancement also includes processing using digital signal processing algorithms. By means of AI-based background noise reduction, voices can be extracted from a mixture with such noise, for example, or a single voice with which the AI has previously been trained can be extracted from a babble of voices. However, the target signal can also be something other than a voice, e.g. people snoring at the same time can be split into individual audio tracks.

Augmentative and Alternative Communication

Augmentative and Alternative Communication (AAC) encompasses various no-tech and low-tech options for people of all ages who have severe speech or language problems. In addition to gestures and facial expressions, symbol cards or communication boards, speech-generating devices, tablets, and other technical methods are used to find other ways to communicate besides talking.

Automated Speech Recognition

Speech recognition is the process that enables computers to automatically transcribe spoken language into written words and phrases. This can be used, for example, to control devices with voice commands or to transcribe free text automatically.

Beamforming

What is understood by beamforming is a signal processing strategy in which several microphones are offset in such a way that sound is no longer received from all directions simultaneously but from a defined direction – basically a kind of audio focus. Intelligent algorithms or the user can select and process specific directions from which the microphone array captures the audio signals so that, for example, the microphone is directed in telephone conferences to where it is needed at the time – for instance towards the speaker. 

Binaural Hearing

Binaural hearing, also known as directional hearing, refers to spatial hearing using both ears. It involves processing two incoming acoustic signals to create an auditory impression. This enables us to localize the direction of a sound source and distinguish between different sound sources. Binaural hearing allows us to focus on a specific voice even in challenging listening environments, especially when it originates from a different direction than the background noise.

Biomarker

Biomarkers for medicine or biology are measurable parameters of biological processes that provide prognostic or diagnostic information. These can be measurable changes, e. g. in blood count or in continuous data series such as speech or EEG. Our “Assistive Speech and Language Analysis Group”, for example, uses features of the speech signal as biomarkers that can help in the early detection and prevention of diseases. Biomarkers in this context can be the voice, the speed of speech or the words a person uses in everyday life.

Computer Linguistics

Computer linguistics develops and studies models that can be used to understand and interpret human language in both oral and written form based on algorithms. It forms the interface between linguistics and computer science and is often referred to as computational linguistics or natural language processing in English-language literature.