audio-dataset-loading-and-stft-feature-extraction

Load audio files from a directory, parse labels from filenames, generate random VAD segments, extract STFT features (mean along axis 1, converted to dB), and split the dataset into train/test sets.
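
The steps above can be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation: it assumes 16-bit PCM WAV input, a hypothetical `label_index.wav` filename convention, and that the mean is taken over the STFT magnitude before conversion to dB. All function names (`parse_label`, `random_vad_segments`, `stft_features_db`, `build_dataset`, `train_test_split`) are made up for the example, and the STFT is written with NumPy alone rather than an audio library.

```python
import random
import wave
from pathlib import Path

import numpy as np


def parse_label(path):
    # Assumption: the label is the filename part before the first underscore,
    # e.g. "dog_001.wav" -> "dog".
    return Path(path).stem.split("_")[0]


def load_wav(path):
    # Read a 16-bit PCM WAV file as mono float32 samples in [-1, 1).
    with wave.open(str(path), "rb") as wf:
        if wf.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        sr, n_channels = wf.getframerate(), wf.getnchannels()
        raw = wf.readframes(wf.getnframes())
    y = np.frombuffer(raw, dtype="<i2").astype(np.float32) / 32768.0
    if n_channels > 1:
        y = y.reshape(-1, n_channels).mean(axis=1)  # downmix to mono
    return y, sr


def random_vad_segments(n_samples, rng, n_segments=3, min_len=512):
    # Stand-in for voice activity detection: random (start, end) sample ranges.
    if n_samples <= min_len:
        return [(0, n_samples)]
    segments = []
    for _ in range(n_segments):
        start = rng.randint(0, n_samples - min_len)
        end = rng.randint(start + min_len, n_samples)
        segments.append((start, end))
    return segments


def stft_features_db(y, n_fft=512, hop=128):
    # Hann-windowed STFT magnitude with shape (n_fft // 2 + 1, n_frames),
    # averaged along axis 1 (time frames), then converted to dB.
    if len(y) < n_fft:
        y = np.pad(y, (0, n_fft - len(y)))
    window = np.hanning(n_fft)
    n_frames = 1 + (len(y) - n_fft) // hop
    frames = np.stack(
        [y[i * hop:i * hop + n_fft] * window for i in range(n_frames)], axis=1
    )
    magnitude = np.abs(np.fft.rfft(frames, axis=0))
    mean_magnitude = magnitude.mean(axis=1)
    return 20.0 * np.log10(np.maximum(mean_magnitude, 1e-10))


def build_dataset(directory, seed=0):
    # One feature vector per random segment, labelled from the filename.
    rng = random.Random(seed)
    features, labels = [], []
    for path in sorted(Path(directory).glob("*.wav")):
        y, _sr = load_wav(path)
        for start, end in random_vad_segments(len(y), rng):
            features.append(stft_features_db(y[start:end]))
            labels.append(parse_label(path))
    if not features:
        raise ValueError("no .wav files found")
    return np.stack(features), labels


def train_test_split(items, test_ratio, rng):
    # Shuffle a copy of the items, then cut off the test portion.
    shuffled = list(items)
    rng.shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_ratio))
    return shuffled[n_test:], shuffled[:n_test]
```

A typical call would build the feature matrix with `X, labels = build_dataset("path/to/wavs")` and then split row indices with `train_test_split(range(len(labels)), 0.2, random.Random(0))`. Splitting by segment, as here, can put segments of the same file in both sets; splitting by file first avoids that leakage.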

Maintainer: ECNU-ICALK
Updated 3/13/2026
quick start

Installation and usage

Installation
$ install --globalskills.sh
Usage

Once installed, you can use this skill by running the following command in your terminal:

skills use audio-dataset-loading-and-stft-feature-extraction