Media Forensics - Fraunhofer IDMT

Trustworthy media content

Fraunhofer IDMT is focusing on research and development of various media forensics technologies to analyze, detect and localize manipulation, decontextualization, and fabrication in media content (audio, visual, text), using a combination of different techniques and competencies, including signal analysis, machine learning, reasoning and IT security methods. The goal is to support e. g. journalists, law enforcement, or media platforms in the process of content verification. By doing so, it helps avoiding potential negative impacts of misinformation, deepfakes, and other malicious uses of manipulated media.

News and upcoming events

Workshop / 30.6.2025

MAD'25

The 4th ACM International Workshop on Multimedia AI against Disinformation welcomes your contributions.

Submit your paper

29.10.2024

EBU webinar on audio analysis

Luca Cuccovillo and Milica Gerhardt showcased AI-based audio tools to verify information developed in veraAI.

EBU webinar

Project Results / 1.10.2024

Highlighted innovation of AI4Media

High-potential innovation of Fraunhofer IDMT and NISV “Improvement of search and retrieval of audio content” featured in the European's commission innovation radar

Innovation Radar

Tabbed contents

Expand all Close all

Research focus

Disinformation as a challenge

Thanks to the availability of ever-growing amounts of content, low-cost editing tools, advanced synthesis techniques for content generation, and an abundance of distribution and communication channels, the creation and distribution of disinformation in all forms has become cheap and easy, and increasingly common.

Obvious forms of disinformation include decontextualization, i.e. presentation of authentic material in a misleading or inappropriate context; manipulation, i.e. modification of existing material; and fabrication, where material is made up from scratch. Two terms that are commonly used in this context are

Shallowfakes/cheapfakes, a term that refers to media content created through transformation or editing of genuine content, for example, through deletion, splicing or doctoring, with the aim of manipulation or decontextualisation. Until now, most fakes have fallen into this category, as they are simple to create and yet can be very effective and convincing.
Deepfakes, a term that refers to media content that is fabricated using AI. Until now, these remain less common than shallowfakes/cheapfakes, but ease of use, pervasiveness and availability of technologies to create them improve by the day, and it is clear that they pose serious challenge for disinformation detection.

Content verification – the search for truth

The process of content verification can be considered a "search for truth" that applies falsification, similar to how scientific theories and hypotheses are or should be tested according to "critical rationalism": To answer an overall question (typically something along the lines of ‘does the material at hand capture a real event and its appropriate context?’), there must be falsifiable claims about the material. For an audio recording, for instance, this could look like this:

"This file was recorded on Dec 6, 2022 in Amsterdam, NL, using an iPhone 6 and its standard recording app. The recording was not processed afterwards. The SHA-512 hash of the original file is 9f86d081 … The file was uploaded to cloud service XYZ… no transcoding or other modifications were applied."

Content verification is the process of testing such claims (which can also be implicit) against facts and findings using human assessment and various tools. The more and the “richer” the claims provided and verified (i. e. not rejected), the more trustworthy the content. However, human capabilities are limited with respect to perception and speed necessary to conduct the testing - therefore, our goal is to develop approaches and tools that support this process in the best possible manner, focusing on a broad technological coverage for the audio domain. The objective is to provide solutions and methods to support verification, including

Technologies for the analysis of acquisition, editing and synthesis traces within A/V material, to understand whether and how it was recorded, encoded, edited or synthesized, and then use it for the falsification process, and especially for manipulation and synthesis detection and localization.
Technologies for content provenance analysis, i. e. detect relationships between A/V content items, to understand whether and how they were reused and transformed, and in which order they were created (including the detection of “root” items).
Technologies for automatic annotation of A/V material, to quickly research relevant material for content verification, i. e. for a specific event, a specific person, or to retrieve information about circumstances that can be used for the verification process, i. e. acoustic scene classification and event detection.

We focus primarily on broad technological coverage for the audio domain and collaborate with other organizations that specialize in other tasks and modalities. The aim is to provide a comprehensive set of tools that can enhance and accelerate the verification of content.

In addition, our research also includes development of technologies for active media authentication, which are based on a combination of digital signatures and signal analysis. The idea is that content providers can use this to proactively sign and “mark” content and related metadata, including synthetic content, to allow other stakeholders to check its authenticity afterwards. Both approaches, (passive) falsification and (active) authentication, have distinct advantages and disadvantages, and we believe the two approaches are not mutually exclusive. On the contrary, they are complementary and should be considered and used together wherever possible.

How to proceed

Media forensics research includes various disciplines, such as signal analysis, machine learning, but also security and adversarial thinking. At Fraunhofer IDMT, we address the topic with a focus on audio, and a combination of signal analysis and machine learning. Both provide specific and somewhat complementary advantages and disadvantages regarding interpretability / explainability, robustness, and performance for several tasks.

We believe that there are particular challenges in media forensic research that need to be addressed:

The need to design and develop technologies that users can work with, to enable them to make the best possible decisions within the verification process; this also includes addressing trust aspects, especially explainability, bias (most importantly sample bias, by ensuring suitable selection of training data wherever applicable), adversarial robustness and generalizability, all of which needs to be supported by systematic evaluation.
The need to establish a “falsification culture” that ensures that whoever provides content to be verified, also provides enough information to enable a proper verification process.
The need to cooperate with many other disciplines related to disinformation analysis, including textual analysis, visual analysis, social network analysis, legal analysis, and others; similarly, disinformation analysis needs to be understood as a complex interplay between technology, market, law and norms.
The understanding that media forensics is a cat-and-mouse game, which requires continuous research and development and sustainable business models as well as cooperation between companies and research institutions – financing such activities solely through publicly funded projects will not be enough.
The understanding that not only machine bias, but also human bias is a topic for content verification, and that organizational and technical measures are needed to address this.

Projects and activities

The projects REWIND (EU) and AudioTrust+ (TMWWDG) were aiming at the development of a core set of technologies for audio forensics, providing a technological basis for the research domain. The technologies developed were then adapted and optimized in DIGGER (Google DNI), particularly with regard to user-friendliness and verification workflows, and integrated into the TrulyMedia verification platform, where they are available to journalists. This was key to ensure practical applicability, and the project delivered the basis for further work that also included selected trustworthiness aspects for audio manipulation detection in the AI4Media (EU) project.

More recent and current activities focus on the difficult domain of synthesis detection and the improvement of manipulation detection and origin analysis. On one hand, the SpeechTrust+ (BMBF) project focuses on speech synthesis detection for the requirements from law enforcement in the project. The vera.ai project (EU) involves content provenance analysis, scalability improvements and research on speaker-informed speech synthesis detection for the media domain. Finally, the news-polygraph (BMBF) project aims at embedding such tools in a large multi-modal disinformation analysis technology platform that also includes the use of crowdsourcing and recommendation systems to support and improve the content verification process.

As for the development of technologies for active media authentication, first prototypes were developed many years ago in AudioTrust+ (TMWWDG) and TRA-ICT (FhG), and they are now enhanced for the demands of journalistic purposes in news-polygraph (BMBF) and authentication of AI-based synthetic speech for media production in GEISST (BMBF).

Beyond the research and networking activities within the aforementioned projects, Fraunhofer IDMT is also active within the European Network of Forensic Science Institutes (ENFSI) and The European Digital Media Observatory (EDMO).

Research project

news-polygraph

Multi-modal, crowd-supported technology platform for disinformation analysis

news-polygraph

Research project

vera.ai

Explainable AI-based disinformation and provenance analysis

vera.ai

Research project

AI4Media

Center of excellence for AI in media – Our contributions: Audio forensics, audio provenance analysis, music analysis, privacy and recommendation systems

AI4Media

Research project

SpeechTrust+

Detection of AI-based speech synthesis and voice alienation for law enforcement

SpeechTrust+

Research project

DIGGER

Integration of audio forensic tools into the content verification platform TrulyMedia

DIGGER

Research project

GEI§T

Robust authentication of synthetic audio material

GEI§T

Research project

REWIND

REVerse engineering of audio-VIsual coNtent Data: Detection of recording and manipulation traces, development of a test framework for media forensics

REWIND

Research project

AudioTrust+

Advanced audio analysis for audio manipulation detection and provenance analysis

AudioTrust+

Range of services

Products

Content Verification Toolbox

Advanced Tools for Audio Manipulation and Synthesis Detection

Content Verification Toolbox

Media Verification Support

Support in the use of the latest analysis technologies for manipulation and synthesis detection (only for official request e.g. for judicial expert opinions and investigations, not for private requests).

Audio Forensics Research and Development

Customized research and development: Development and evaluation of technologies for forensic analyses, audio manipulation and speech synthesis detection

use cases

With our technical solutions and services, we provide companies and institutions with concrete support and real added value for their use cases. Contact us to discuss your application!

Detection of manipulation, synthesis, and reuse of audio material

use case

Forensic audio analysis for law enforcement

use case

Active authentication and marking of media content

use case

Interested in further use cases?

Here you will find an overview of our use cases.

Overview use cases

Publications

Jahr Year	Titel/Autor:in Title/Author	Publikationstyp Publication Type
2025	Sound recurrence analysis for acoustic scene classification Abeßer, Jakob; Liang, Zhiwei; Seeber, Bernhard	Zeitschriftenaufsatz Journal Article
2025	Automatic Retrieval of Indicator Sounds for Acoustic Geo-Tagging Abeßer, Jakob	Poster
2024	Calibrating neural networks for synthetic speech detection: A likelihood-ratio-based approach Cuccovillo, Luca; Aichroth, Patrick; Köllmer, Thomas	Zeitschriftenaufsatz Journal Article
2024	Advancing Audio Phylogeny: A Neural Network Approach for Transformation Detection Gerhardt, Milica; Cuccovillo, Luca; Aichroth, Patrick	Konferenzbeitrag Conference Paper
2024	Audio Transformer for Synthetic Speech Detection via Benford's Law Distribution Analysis Talagini Ashoka, Anitha Bhat; Cuccovillo, Luca; Aichroth, Patrick	Konferenzbeitrag Conference Paper
2024	Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol Apostolidis, Konstantinos; Abeßer, Jakob; Cuccovillo, Luca; Vasileios, Mezaris	Konferenzbeitrag Conference Paper
2024	Audio Spectrogram Transformer for Synthetic Speech Detection via Speech Formant Analysis Cuccovillo, Luca; Gerhardt, Milica; Aichroth, Patrick	Konferenzbeitrag Conference Paper
2024	Generative AI and Disinformation Bontcheva, Kalina; Papadopoulous, Symeon; Tsalakanidou, Filareti; Gallotti, Riccardo; Dutkiewicz, Lidia; Krack, Noémie; Nucci, Francesco Severio; Spangenberg, Jochen; Srba, Ivan; Aichroth, Patrick; Cuccovillo, Luca; Verdoliva, Luisa	Paper
2024	Audio Transformer for Synthetic Speech Detection via Formant Magnitude and Phase Analysis Cuccovillo, Luca; Gerhardt, Milica; Aichroth, Patrick	Konferenzbeitrag Conference Paper
2024	Audio Provenance Analysis in Heterogeneous Media Sets Gerhardt, Milica; Cuccovillo, Luca; Aichroth, Patrick	Konferenzbeitrag Conference Paper
2024	An Open Dataset of Synthetic Speech Yaroshchuk, Artem; Papastergiopoulos, Christoforos; Cuccovillo, Luca; Aichroth, Patrick; Votis, Konstantinos; Tzovaras, Dimitrios	Konferenzbeitrag Conference Paper
2024	Audio Transformer for Synthetic Speech Detection via Multi-Formant Analysis Cuccovillo, Luca; Gerhardt, Milica; Aichroth, Patrick	Konferenzbeitrag Conference Paper
2023	MAD ’23 Workshop: Multimedia AI against Disinformation Cuccovillo, Luca; Ionescu, Bogdan; Kordopatis-Zilos, Giorgos; Papadopoulos, Symeon; Popescu, Adrian	Konferenzbeitrag Conference Paper
2023	Audio Splicing Detection and Localization Based on Acquisition Device Traces Leonzio, Daniele Ugo; Cuccovillo, Luca; Bestagini, Paolo; Marcon, Marco; Aichroth, Patrick; Tubaro, Stefano	Zeitschriftenaufsatz Journal Article
2023	MAD'23 Workshop Chairs' Welcome Message Cuccovillo, Luca; Ionescu, Bogdan; Kordopatis-Zilos, Giorgos; Papadopoulos, Symeon; Popescu, Ana Maria	Konferenzbeitrag Conference Paper
2022	Speaker-Independent Microphone Identification in Noisy Conditions Giganti, Antonio; Cuccovillo, Luca; Bestagini, Paolo; Aichroth, Patrick; Tubaro, Stefano	Konferenzbeitrag Conference Paper
2022	MAD 2022, 1st International Workshop on Multimedia AI against Disinformation. Proceedings	Tagungsband Conference Proceeding
2022	Spectral Denoising for Microphone Classification Cuccovillo, Luca; Giganti, Antonio; Bestagini, Paolo; Aichroth, Patrick; Tubaro, Stefano	Konferenzbeitrag Conference Paper
2022	Mad '22 Workshop Chairs' Welcome Message Ionescu, B.; Kordopatis-Zilos, G.; Papadopoulos, S.; Popescu, A.; Cuccovillo, Luca	Konferenzbeitrag Conference Paper
2022	Environment Classification via Blind Roomprints Estimation Baum, Malte; Cuccovillo, Luca; Yaroshchuk, Artem; Aichroth, Patrick	Konferenzbeitrag Conference Paper
2022	Open Challenges in Synthetic Speech Detection Cuccovillo, Luca; Papastergiopoulos, Christoforos; Vafeiadis, Anastasios; Yaroshchuk, Artem; Aichroth, Patrick; Votis, Konstantinos; Tzovaras, Dimitrios	Konferenzbeitrag Conference Paper
2021	Audio forensics and provenance analysis: Technologies for media verification and asset management Aichroth, Patrick; Cuccovillo, Luca; Gerhardt, Milica	Zeitschriftenaufsatz Journal Article
2021	Detection and localization of partial audio matches in various application scenarios Maksimovic, Milica; Aichroth, Patrick; Cuccovillo, Luca	Zeitschriftenaufsatz Journal Article
2019	Audioforensik, partielles Audio-Matching und Audio Phylogenie-Analyse Aichroth, Patrick; Lukashevich, Hanna	Zeitschriftenaufsatz Journal Article
2019	Inverse Decoding of PCM A-law and µ-law Cuccovillo, Luca; Aichroth, Patrick	Zeitschriftenaufsatz Journal Article
2019	Copy-Move Forgery Detection and Localization via Partial Audio Matching Maksimovic, Milica; Cuccovillo, Luca; Aichroth, Patrick	Zeitschriftenaufsatz Journal Article
2017	Increasing the temporal resolution of ENF analysis via harmonic distortion Cuccovillo, Luca; Aichroth, Patrick	Konferenzbeitrag Conference Paper
2017	Phylogeny analysis for MP3 and AAC coding transformations Maksimovic, Milica; Cuccovillo, Luca; Aichroth, Patrick	Konferenzbeitrag Conference Paper
2016	AAC encoding detection and bitrate estimation using a convolutional neural network Seichter, Daniel; Cuccovillo, Luca; Aichroth, Patrick	Konferenzbeitrag Conference Paper
2016	Open-set microphone classification via blind channel analysis Cuccovillo, Luca; Aichroth, Patrick	Konferenzbeitrag Conference Paper
2014	Video camera identification using audio-visual features Milani, S.; Cuccovillo, Luca; Tagliasacchi, M.; Tubaro, S.; Aichroth, Patrick	Konferenzbeitrag Conference Paper
2014	Efficient Cross-Codec Framing Grid Analysis for Audio Tampering Detection Gärtner, Daniel; Dittmar, Christian; Aichroth, Patrick; Schroth, P.; Cuccovillo, Luca; Mann, Sebastian; Schuller, Gerald	Konferenzbeitrag Conference Paper
2014	A multi-codec audio dataset for codec analysis and tampering detection Gärtner, Daniel; Cuccovillo, Luca; Mann, Sebastian; Aichroth, Patrick	Konferenzbeitrag Conference Paper
2013	Audio tampering detection via microphone classification Cuccovillo, Luca; Mann, Sebastian; Tagliasacchi, Marco; Aichroth, Patrick	Konferenzbeitrag Conference Paper
2013	Blind microphone analysis and stable tone phase analysis for audio tampering detection Cuccovillo, Luca; Mann, Sebastian; Aichroth, Patrick; Tagliasacchi, Marco; Dittmar, Christian	Konferenzbeitrag Conference Paper
2013	Estimating MP3PRO encoder parameters from decoded audio Bießmann, Paul; Gärtner, Daniel; Dittmar, Christian; Aichroth, Patrick; Schuller, Gerald; Schnabel, Michael; Geiger, Ralf	Konferenzbeitrag Conference Paper
2013	Combining ENF phase discontinuity checking and temporal pattern matching for audio tampering detection Mann, Sebastian; Cuccovillo, Luca; Aichroth, Patrick; Dittmar, Christian	Konferenzbeitrag Conference Paper

Diese Liste ist ein Auszug aus der Publikationsplattform Fraunhofer-Publica

This list has been generated from the publication platform Fraunhofer-Publica

Highlights and news

2024

5.11.2024

Content verification toolbox at WSDB 2024

Presentation, demonstration and exchange at German networking event for the broadcasting industry in Erfurt

WSDB 2024 (in German)

29.10.2024

EBU webinar on audio analysis

Luca Cuccovillo and Milica Gerhardt showcased AI-based audio tools to verify information developed in veraAI.

EBU webinar

News / 11.10.2024

IEEE Best Poster Award

Kay Fuhrmeister, Hao Cui, Artem Yaroshchuk and Thomas Köllmer received a Best Poster Award at the IEEE 5th International Symposium on the Internet of Sounds

IEEE Best Poster Award

1.10.2024

Highlighted innovation of AI4Media

High-potential innovation of Fraunhofer IDMT and NISV “Improvement of search and retrieval of audio content” featured in the European's commission innovation radar

Innovation Radar

Trade fair / 13.9.2024

Content verification toolbox at IBC 2024

Successful presentation and inspring exchange at IBC 2024 in Amsterdam.

IBC 2024

News / 7.8.2024

AES Best Paper Award

Luca Cuccovillo, Patrick Aichroth and Thomas Köllmer honored for best paper at the AES International Conference on Audio Forensics 2024.

AES Best Paper Award

Article / 21.6.2024

dpa fact checking

Article about an audio analysis for dpa on fan chants during a European Championship match

Read article

Workshop / 10.6.2024

MAD'24 Workshop

Luca Cuccovillo, part of the organizing comitee of the Workshop on Multimedia AI against Disinformation, captures the essence of the event in his article on the veraAI blog.

Workshop

Workshop / 12.3.2024

Data Technology Seminar 2024

Presentation "Digital Traces: Verification of audio-visual content" at EBU Data Technology Seminar on data and AI for media in Geneva.

DTS 2024

White paper / 12.2.2024

Generative AI and Disinformation

The EU-funded projects on AI and disinformation including vera.ai and AI4Media published a white paper.

White paper

2023

1.11.2023

New Chair ENFSI Expert Working Group

Luca Cuccovillo is the new Chair of the ENFSI Expert Working Group on “Forensic Speech and Audio Analysis" (FSAAWG)

ENFSI Chair

Event / 23.11.2023

news-polygraph conference on desinformation and verification

External project kick-off news-polygraph