Synthesize and recognize artificial voices

Audio deepfakes pose a challenge for Telekom. The solution synthesizes the voices of board members and can distinguish genuine recordings from fakes.

The problem

As a large corporation with a high level of digital affinity, Telekom is exposed to numerous risks and fraud attempts. In addition to classic phishing, these have increasingly included attacks using audio deepfakes in recent years. These are used, for example, to obtain internal Group information, to initiate bank transfers, or to manipulate the market with false audio and video messages.

With ever-improving open-source tools - and an almost endless pool of publicly available recordings of appearances by Telekom executives - these deepfakes are becoming increasingly difficult for the human ear to identify as such.

For this reason, a project was launched with Telekom's innovation arm, T-Labs, to use AI to reliably automate this very distinction and consistently prevent fraud attempts.

The solution

Our solution approach consisted of two parts: speech synthesis (used to test the robustness of our detection model) and the actual deepfake classification tool.

Speech synthesis consisted of an encoder that encodes a target's voice using an audio sample, a synthesizer that creates the audio spectrogram (i.e., a 2D image) of a given text using the encoded voice, and a vocoder that finally generates the audio from the spectrogram.
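The sketch below illustrates this three-stage structure (speaker encoder, spectrogram synthesizer, vocoder). The class names, layer sizes, and the Griffin-Lim inversion used as a stand-in vocoder are illustrative assumptions for the sake of the example, not the production models used in the project.

```python
# Illustrative sketch of a voice-cloning pipeline:
# speaker encoder -> spectrogram synthesizer -> vocoder.
import librosa
import numpy as np
import torch
import torch.nn as nn


class SpeakerEncoder(nn.Module):
    """Maps a mel spectrogram of a reference utterance to a fixed-size voice embedding."""
    def __init__(self, n_mels: int = 80, emb_dim: int = 256):
        super().__init__()
        self.lstm = nn.LSTM(n_mels, 256, num_layers=3, batch_first=True)
        self.proj = nn.Linear(256, emb_dim)

    def forward(self, mels: torch.Tensor) -> torch.Tensor:
        # mels: (batch, frames, n_mels)
        _, (h, _) = self.lstm(mels)
        emb = self.proj(h[-1])  # hidden state of the last LSTM layer
        return torch.nn.functional.normalize(emb, dim=-1)


class Synthesizer(nn.Module):
    """Predicts a mel spectrogram from text, conditioned on the voice embedding
    (a heavily simplified stand-in for a Tacotron-style model)."""
    def __init__(self, vocab_size: int = 100, emb_dim: int = 256, n_mels: int = 80):
        super().__init__()
        self.text_emb = nn.Embedding(vocab_size, emb_dim)
        self.decoder = nn.GRU(emb_dim * 2, 512, batch_first=True)
        self.to_mel = nn.Linear(512, n_mels)

    def forward(self, text_ids: torch.Tensor, voice_emb: torch.Tensor) -> torch.Tensor:
        # text_ids: (batch, chars), voice_emb: (batch, emb_dim)
        text = self.text_emb(text_ids)
        cond = voice_emb.unsqueeze(1).expand(-1, text.size(1), -1)
        hidden, _ = self.decoder(torch.cat([text, cond], dim=-1))
        return self.to_mel(hidden)  # (batch, frames, n_mels)


def vocode(mel: np.ndarray, sr: int = 16000) -> np.ndarray:
    """Vocoder step: invert a power mel spectrogram (n_mels, frames) to audio.
    Griffin-Lim via librosa is used here for simplicity; a trained neural
    vocoder would produce far more natural speech."""
    return librosa.feature.inverse.mel_to_audio(mel, sr=sr)
```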

The forgery detection tool was trained on publicly available datasets. Normalized audio sequences trimmed to 2 s were converted to fixed-dimension mel spectrograms, on which a CNN-based network was trained.
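The following is a minimal sketch of this preprocessing and classification setup, assuming 16 kHz mono audio and a small convolutional network; the sample rate, layer configuration, and file path are illustrative assumptions rather than the project's trained model.

```python
# Sketch of the fake/real classifier: fixed-length 2 s clips are converted to
# log-mel spectrograms and fed to a small CNN.
import librosa
import numpy as np
import torch
import torch.nn as nn

SR, CLIP_SECONDS, N_MELS = 16000, 2.0, 64  # assumed preprocessing parameters


def preprocess(path: str) -> torch.Tensor:
    """Load, peak-normalize, trim/pad to 2 s, and compute a log-mel spectrogram."""
    y, _ = librosa.load(path, sr=SR, mono=True)
    y = y / (np.max(np.abs(y)) + 1e-9)                     # peak normalization
    target = int(SR * CLIP_SECONDS)
    y = np.pad(y, (0, max(0, target - len(y))))[:target]   # fixed 2 s window
    mel = librosa.feature.melspectrogram(y=y, sr=SR, n_mels=N_MELS)
    logmel = librosa.power_to_db(mel)
    return torch.from_numpy(logmel).float().unsqueeze(0)   # (1, n_mels, frames)


class FakeVoiceCNN(nn.Module):
    """Binary classifier: real (class 0) vs. synthetic (class 1) speech."""
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        self.head = nn.Linear(64, 2)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.head(self.features(x).flatten(1))


# Usage (hypothetical file path):
# logits = FakeVoiceCNN()(preprocess("sample.wav").unsqueeze(0))
# prob_fake = torch.softmax(logits, dim=-1)[0, 1]
```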

As a result, deepfakes could be created for numerous executives from publicly available material alone, and the tool classified recordings as real or fake with 98.6% reliability.

Our result

10+
Executive voices synthesized with deceptive realism
98.6%
Reliability in identifying fakes
>2s
Audio material needed for a classification
