The process of content verification can be considered a "search for truth" that applies falsification, similar to how scientific theories and hypotheses are or should be tested according to "critical rationalism": To answer an overall question (typically something along the lines of ‘does the material at hand capture a real event and its appropriate context?’), there must be falsifiable claims about the material. For an audio recording, for instance, this could look like this:
"This file was recorded on Dec 6, 2022 in Amsterdam, NL, using an iPhone 6 and its standard recording app. The recording was not processed afterwards. The SHA-512 hash of the original file is 9f86d081 … The file was uploaded to cloud service XYZ… no transcoding or other modifications were applied."
Content verification is the process of testing such claims (which can also be implicit) against facts and findings using human assessment and various tools. The more and the “richer” the claims provided and verified (i. e. not rejected), the more trustworthy the content. However, human capabilities are limited with respect to perception and speed necessary to conduct the testing - therefore, our goal is to develop approaches and tools that support this process in the best possible manner, focusing on a broad technological coverage for the audio domain. The objective is to provide solutions and methods to support verification, including
- Technologies for the analysis of acquisition, editing and synthesis traces within A/V material, to understand whether and how it was recorded, encoded, edited or synthesized, and then use it for the falsification process, and especially for manipulation and synthesis detection and localization.
- Technologies for content provenance analysis, i. e. detect relationships between A/V content items, to understand whether and how they were reused and transformed, and in which order they were created (including the detection of “root” items).
- Technologies for automatic annotation of A/V material, to quickly research relevant material for content verification, i. e. for a specific event, a specific person, or to retrieve information about circumstances that can be used for the verification process, i. e. acoustic scene classification and event detection.
We focus primarily on broad technological coverage for the audio domain and collaborate with other organizations that specialize in other tasks and modalities. The aim is to provide a comprehensive set of tools that can enhance and accelerate the verification of content.
In addition, our research also includes development of technologies for active media authentication, which are based on a combination of digital signatures and signal analysis. The idea is that content providers can use this to proactively sign and “mark” content and related metadata, including synthetic content, to allow other stakeholders to check its authenticity afterwards. Both approaches, (passive) falsification and (active) authentication, have distinct advantages and disadvantages, and we believe the two approaches are not mutually exclusive. On the contrary, they are complementary and should be considered and used together wherever possible.