Audio Data
Audio DataBlocks and show_batch
Spectrogram
Spectrogram (n_fft:Listified[int]=1024, win_length:Listified[int]|None=None, hop_length:Listified[int]|None=None, pad:Listified[int]=0, window_fn:Listified[Callable[...,Tensor]]=<built-in method hann_window of type object at 0x7f95e7c335c0>, power:Listified[float]=2.0, normalized:Listified[bool]=False, wkwargs:Listified[dict]|None=None, center:Listified[bool]=True, pad_mode:Listified[str]='reflect', onesided:Listified[bool]=True, norm:Listified[str]|None=None)
Convert a TensorAudio
into one or more TensorSpec
MelSpectrogram
MelSpectrogram (sample_rate:Listified[int]=16000, n_fft:Listified[int]=1024, win_length:Listified[int]|None=None, hop_length:Listified[int]|None=None, f_min:Listified[float]=0.0, f_max:Listified[float]|None=None, pad:Listified[int]=0, n_mels:Listified[int]=128, window_fn:Listified[Callable[...,Tensor]]=<built-in method hann_window of type object at 0x7f95e7c335c0>, power:Listified[float]=2.0, normalized:Listified[bool]=False, wkwargs:Listified[dict]|None=None, center:Listified[bool]=True, pad_mode:Listified[str]='reflect', norm:Listified[str]|None=None, mel_scale:Listified[str]='htk')
Convert a TensorAudio
into one or more TensorMelSpec
TransformBlocks for audio
Audio data blocks for using with the fastai data block API.
AudioBlock
AudioBlock (cls=<class 'fastxtend.audio.core.TensorAudio'>)
A TransformBlock
for audio of cls
SpecBlock
SpecBlock (cls=<class 'fastxtend.audio.core.TensorAudio'>, n_fft:Union[in t,Iterable[int],MutableSequence[int],fastcore.foundation.L,fas tcore.basics.fastuple]=1024, win_length:Union[int,Iterable[int ],MutableSequence[int],fastcore.foundation.L,fastcore.basics.f astuple,NoneType]=None, hop_length:Union[int,Iterable[int],Mut ableSequence[int],fastcore.foundation.L,fastcore.basics.fastup le,NoneType]=None, pad:Union[int,Iterable[int],MutableSequence [int],fastcore.foundation.L,fastcore.basics.fastuple]=0, windo w_fn:Union[Callable[...,torch.Tensor],Iterable[Callable[...,to rch.Tensor]],MutableSequence[Callable[...,torch.Tensor]],fastc ore.foundation.L,fastcore.basics.fastuple]=<built-in method hann_window of type object at 0x7f95e7c335c0>, power:Union[flo at,Iterable[float],MutableSequence[float],fastcore.foundation. L,fastcore.basics.fastuple]=2.0, normalized:Union[bool,Iterabl e[bool],MutableSequence[bool],fastcore.foundation.L,fastcore.b asics.fastuple]=False, wkwargs:Union[dict,Iterable[dict],Mutab leSequence[dict],fastcore.foundation.L,fastcore.basics.fastupl e,NoneType]=None, center:Union[bool,Iterable[bool],MutableSequ ence[bool],fastcore.foundation.L,fastcore.basics.fastuple]=Tru e, pad_mode:Union[str,Iterable[str],MutableSequence[str],fastc ore.foundation.L,fastcore.basics.fastuple]='reflect', norm:Uni on[str,Iterable[str],MutableSequence[str],fastcore.foundation. L,fastcore.basics.fastuple,NoneType]=None)
A TransformBlock
to read TensorAudio
and then use the GPU to turn audio into one or more Spectrogram
s
Type | Default | Details | |
---|---|---|---|
cls | _TensorMeta | TensorAudio | |
n_fft | Listified[int] | 1024 | Spectrogram args |
win_length | Listified[int] | None | None | |
hop_length | Listified[int] | None | None | |
pad | Listified[int] | 0 | |
window_fn | Listified[Callable[…, Tensor]] | hann_window | |
power | Listified[float] | 2.0 | |
normalized | Listified[bool] | False | |
wkwargs | Listified[dict] | None | None | |
center | Listified[bool] | True | |
pad_mode | Listified[str] | reflect | |
norm | Listified[str] | None | None |
MelSpecBlock
MelSpecBlock (cls=<class 'fastxtend.audio.core.TensorAudio'>, sr:Union[in t,Iterable[int],MutableSequence[int],fastcore.foundation.L, fastcore.basics.fastuple]=16000, n_fft:Union[int,Iterable[i nt],MutableSequence[int],fastcore.foundation.L,fastcore.bas ics.fastuple]=1024, win_length:Union[int,Iterable[int],Muta bleSequence[int],fastcore.foundation.L,fastcore.basics.fast uple,NoneType]=None, hop_length:Union[int,Iterable[int],Mut ableSequence[int],fastcore.foundation.L,fastcore.basics.fas tuple,NoneType]=None, f_min:Union[float,Iterable[float],Mut ableSequence[float],fastcore.foundation.L,fastcore.basics.f astuple]=0.0, f_max:Union[float,Iterable[float],MutableSequ ence[float],fastcore.foundation.L,fastcore.basics.fastuple, NoneType]=None, pad:Union[int,Iterable[int],MutableSequence [int],fastcore.foundation.L,fastcore.basics.fastuple]=0, n_ mels:Union[int,Iterable[int],MutableSequence[int],fastcore. foundation.L,fastcore.basics.fastuple]=128, window_fn:Union [Callable[...,torch.Tensor],Iterable[Callable[...,torch.Ten sor]],MutableSequence[Callable[...,torch.Tensor]],fastcore. foundation.L,fastcore.basics.fastuple]=<built-in method hann_window of type object at 0x7f95e7c335c0>, power:Union[ float,Iterable[float],MutableSequence[float],fastcore.found ation.L,fastcore.basics.fastuple]=2.0, normalized:Union[boo l,Iterable[bool],MutableSequence[bool],fastcore.foundation. L,fastcore.basics.fastuple]=False, wkwargs:Union[dict,Itera ble[dict],MutableSequence[dict],fastcore.foundation.L,fastc ore.basics.fastuple,NoneType]=None, center:Union[bool,Itera ble[bool],MutableSequence[bool],fastcore.foundation.L,fastc ore.basics.fastuple]=True, pad_mode:Union[str,Iterable[str] ,MutableSequence[str],fastcore.foundation.L,fastcore.basics .fastuple]='reflect', norm:Union[str,Iterable[str],MutableS equence[str],fastcore.foundation.L,fastcore.basics.fastuple ,NoneType]=None, mel_scale:Union[str,Iterable[str],MutableS equence[str],fastcore.foundation.L,fastcore.basics.fastuple ]='htk')
A TransformBlock
to read TensorAudio
and then use the GPU to turn audio into one or more MelSpectrogram
s
Type | Default | Details | |
---|---|---|---|
cls | _TensorMeta | TensorAudio | |
sr | Listified[int] | 16000 | MelSpectrogram args |
n_fft | Listified[int] | 1024 | |
win_length | Listified[int] | None | None | |
hop_length | Listified[int] | None | None | |
f_min | Listified[float] | 0.0 | |
f_max | Listified[float] | None | None | |
pad | Listified[int] | 0 | |
n_mels | Listified[int] | 128 | |
window_fn | Listified[Callable[…, Tensor]] | hann_window | |
power | Listified[float] | 2.0 | |
normalized | Listified[bool] | False | |
wkwargs | Listified[dict] | None | None | |
center | Listified[bool] | True | |
pad_mode | Listified[str] | reflect | |
norm | Listified[str] | None | None | |
mel_scale | Listified[str] | htk |