Wavaugmentate 0.2.5 Package Documentation


The wavaugmentate package performs augmentation of audio signals. It provides the MultiChannelSignal and SignalAugmentation classes and the wavaug-cli console utility.

  • MultiChannelSignal provides basic operations with multi-channel signals.

  • SignalAugmentation performs augmentation of multi-channel signals for training AI models.

PyPI: https://pypi.org/project/wavaugmentate/

GitHub: https://github.com/chetverovod/wavaugmentate

Input Data

A WAV file or a NumPy array.

Array shape: (num_channels, num_samples).
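As an illustration of this layout, the following minimal sketch builds an array with one row per channel and one column per sample. It uses plain NumPy only and does not call the wavaugmentate API; the sample rate and channel count are assumed values chosen for the example.

```python
import numpy as np

sample_rate = 44100        # samples per second (assumed value for this example)
num_channels = 2           # assumed stereo signal
num_samples = sample_rate  # one second of audio

# Channels-first layout: one row per channel, one column per sample.
signal = np.zeros((num_channels, num_samples), dtype=np.float32)

# Some WAV readers, such as scipy.io.wavfile.read, return arrays shaped
# (num_samples, num_channels). Transposing such an array gives the
# channels-first layout described above.
samples_first = np.zeros((num_samples, num_channels), dtype=np.float32)
channels_first = samples_first.T
```

Here both signal.shape and channels_first.shape are (2, 44100).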

Output Data

The same types as in the Input Data section.

Augmentation Methods

  1. Amplitude (volume change, inversion).

  2. Time shift.

  3. Echo.

  4. Adding noise.

  5. Time stretching (not implemented).

  6. Tempo change (not implemented).

  7. Pitch shift (not implemented).

  8. Adding silence.

  9. Frequency masking (not implemented).

  10. Time masking (not implemented).

  11. Combinations of methods.
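To make the implemented methods concrete, here is a minimal sketch of amplitude change, time shift, echo, and noise addition on a (num_channels, num_samples) array. It is written in plain NumPy and is not the package's implementation; the function names change_amplitude, delay, echo, and add_noise and their parameters are hypothetical and chosen only for this illustration.

```python
import numpy as np


def change_amplitude(signal, gains):
    """Scale each channel by its own gain; a negative gain inverts the channel."""
    return signal * np.asarray(gains, dtype=float)[:, np.newaxis]


def delay(signal, delays):
    """Shift each channel right by its own number of samples, padding with zeros."""
    num_channels, num_samples = signal.shape
    out = np.zeros((num_channels, num_samples + max(delays)))
    for ch, d in enumerate(delays):
        out[ch, d:d + num_samples] = signal[ch]
    return out


def echo(signal, delays, gains):
    """Add a delayed, scaled copy of each channel to the original channel."""
    out = change_amplitude(delay(signal, delays), gains)
    out[:, :signal.shape[1]] += signal
    return out


def add_noise(signal, noise_levels, seed=None):
    """Add Gaussian noise to each channel, scaled by a per-channel level."""
    rng = np.random.default_rng(seed)
    noise = rng.standard_normal(signal.shape)
    return signal + change_amplitude(noise, noise_levels)


# Combination of methods: attenuate, add an echo, then add noise.
source = np.ones((2, 4))
augmented = add_noise(
    echo(change_amplitude(source, [0.5, -0.5]), [1, 2], [0.3, 0.3]),
    [0.01, 0.01],
    seed=0,
)
```

Each function takes one value per channel, so every channel can be augmented differently. The time shift and echo lengthen the signal by the largest delay, which is one possible design choice; truncating to the original length would be another.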

Additional Functionality

  1. Generation of multichannel tonal signals with a desired frequency, amplitude, and duration.

  2. Generation of multichannel speech-like signals with desired formant frequencies, amplitude, and duration.
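The idea behind tonal signal generation can be sketched as follows. This is a plain NumPy illustration, not the package's generator; the function name generate_tones, its parameters, and the default sample rate are assumptions made for the example.

```python
import numpy as np


def generate_tones(frequencies, amplitudes, duration, sample_rate=44100):
    """Return a (num_channels, num_samples) array with one sine tone per channel.

    frequencies and amplitudes hold one value per channel; duration is in seconds.
    """
    t = np.arange(int(duration * sample_rate)) / sample_rate
    freqs = np.asarray(frequencies, dtype=float)[:, np.newaxis]
    amps = np.asarray(amplitudes, dtype=float)[:, np.newaxis]
    # Broadcasting (num_channels, 1) against (num_samples,) yields one row per channel.
    return amps * np.sin(2 * np.pi * freqs * t)


# Two channels: 440 Hz at full amplitude and 880 Hz at half amplitude, 0.5 s long.
tones = generate_tones([440, 880], [1.0, 0.5], duration=0.5, sample_rate=8000)
```

The result has shape (2, 4000), matching the (num_channels, num_samples) layout used for input data.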

Interfaces

Signal augmentation can be applied in two ways:

  1. As a Python module, through the methods of the Mcs and Aug classes.

  2. As a console application, wavaug-cli, through its command-line options.

Note

This project is under active development.