KI-Glossar

Das vollständige Wörterbuch der Künstlichen Intelligenz

162

Kategorien

2.032

Unterkategorien

23.060

Begriffe

📖

Begriffe

Neural Vocoding

Audio reconstruction process from intermediate acoustic representations using neural networks to generate realistic audio waveforms.

📖

Begriffe

Zero-shot TTS

Voice synthesis approach capable of generating speech in never-before-seen voices during training, using a short audio sample as reference.

📖

Begriffe

Audio Diffusion Models

Generative models based on the diffusion process that progressively add and then remove noise to generate high-quality audio samples.

📖

Begriffe

Mel-spectrogram

Spectrographic representation of audio on the Mel scale that better matches human hearing perception, used as input for many TTS models.

📖

Begriffe

Griffin-Lim Algorithm

Iterative algorithm to reconstruct an audio waveform from a magnitude spectrogram by estimating the missing phase through successive projections.

📖

Begriffe

Neural Audio Codec

Audio compression-decompression system based on deep learning that encodes and decodes audio with superior quality to traditional codecs.

📖

Begriffe

Audio Style Transfer

Technique that applies the stylistic characteristics of a source audio signal to a target signal while preserving the original semantic content.

📖

Begriffe

Voice Conversion

Technique that transforms the vocal characteristics of a source speaker to those of a target speaker while preserving the linguistic content of the message.

📖

Begriffe

Music Generation

Process of automatically creating original musical compositions using AI models like Transformers or GANs to generate melodies and harmonies.

📖

Begriffe

Sound Effect Synthesis

Procedural generation of realistic sound effects to enrich training datasets or create audio content for interactive media.

📖

Begriffe

Neural Source Separation

AI technique individually isolating mixed sound sources in an audio recording, allowing voice/music separation or multiple instruments.

📖

Begriffe

Audio Super-resolution

Process of improving the temporal or frequency resolution of existing audio signals to restore or enhance their perceived quality.

📖

Begriffe

Adversarial Audio Generation

Use of generative adversarial networks (GANs) to create realistic audio samples through competition between a generator and a discriminator.

🔍

KI-Glossar

Neural Vocoding

Zero-shot TTS

Audio Diffusion Models

Mel-spectrogram

Griffin-Lim Algorithm

Neural Audio Codec

Audio Style Transfer

Voice Conversion

Music Generation

Sound Effect Synthesis

Neural Source Separation

Audio Super-resolution

Adversarial Audio Generation

Keine Ergebnisse gefunden