audio-dataset-loading-and-stft-feature-extraction

Load audio files from a directory, parse labels from filenames, generate random VAD segments, extract STFT features (mean along axis 1, converted to dB), and split the dataset into train/test sets.
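The pipeline described above can be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation, and it uses only NumPy and the standard library. Every function name here (`load_wav`, `parse_label`, `random_vad_segments`, `stft_db_features`, `split_dataset`, `build_dataset`) is hypothetical. It assumes 16-bit mono WAV files, a `label_xxx.wav` naming scheme, an STFT laid out as (frequency bins, time frames) so that "axis 1" is time, and that "random VAD segments" means randomly drawn (start, end) sample ranges.

```python
import os
import wave

import numpy as np


def load_wav(path):
    """Read a WAV file as floats in [-1, 1). Assumes 16-bit mono PCM."""
    with wave.open(path, "rb") as wf:
        pcm = np.frombuffer(wf.readframes(wf.getnframes()), dtype=np.int16)
    return pcm.astype(np.float32) / 32768.0


def parse_label(filename):
    """Assumed naming scheme: 'cat_001.wav' -> 'cat'."""
    return os.path.basename(filename).split("_")[0]


def random_vad_segments(n_samples, n_segments, rng, min_len=256):
    """Draw random (start, end) sample ranges of at least min_len samples.

    Requires n_samples > min_len.
    """
    segments = []
    for _ in range(n_segments):
        start = int(rng.integers(0, n_samples - min_len))
        end = int(rng.integers(start + min_len, n_samples + 1))
        segments.append((start, end))
    return segments


def stft_db_features(signal, n_fft=256, hop=128):
    """Magnitude STFT, averaged along axis 1 (time), then converted to dB."""
    signal = np.asarray(signal, dtype=np.float64)
    if len(signal) < n_fft:
        signal = np.pad(signal, (0, n_fft - len(signal)))
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack(
        [signal[i * hop:i * hop + n_fft] * window for i in range(n_frames)],
        axis=1,
    )
    spec = np.abs(np.fft.rfft(frames, axis=0))  # (n_fft // 2 + 1, n_frames)
    mean_mag = spec.mean(axis=1)                # one value per frequency bin
    return 20.0 * np.log10(np.maximum(mean_mag, 1e-10))


def split_dataset(features, labels, test_ratio=0.2, rng=None):
    """Shuffle the samples and split them into train and test parts."""
    if rng is None:
        rng = np.random.default_rng()
    idx = rng.permutation(len(features))
    n_test = max(1, int(round(len(features) * test_ratio)))
    test_idx, train_idx = idx[:n_test], idx[n_test:]
    return (features[train_idx], features[test_idx],
            labels[train_idx], labels[test_idx])


def build_dataset(directory, n_segments=2, seed=0):
    """Load every .wav file in a directory and extract per-segment features."""
    rng = np.random.default_rng(seed)
    feats, labels = [], []
    for name in sorted(os.listdir(directory)):
        if not name.lower().endswith(".wav"):
            continue
        signal = load_wav(os.path.join(directory, name))
        for start, end in random_vad_segments(len(signal), n_segments, rng):
            feats.append(stft_db_features(signal[start:end]))
            labels.append(parse_label(name))
    return np.array(feats), np.array(labels)
```

A typical call would be `X, y = build_dataset("path/to/wavs")` followed by `split_dataset(X, y)`. Each feature vector has `n_fft // 2 + 1` entries, one per frequency bin, so segments of different lengths still produce features of the same size.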

Maintainer: ECNU-ICALK
Updated: 3/13/2026
Stars: 304
Forks: 34
Quick start

Installation and usage

Installation
$ install --globalskills.sh
Usage

After installation, you can use this skill by running the following command in your terminal:

skills use audio-dataset-loading-and-stft-feature-extraction