从音频中提取特征的不同方法和技术_JupyterNotebook_Python

共28个文件

py：11个

png：6个

pyc：3个

版权申诉

134 浏览量 2023-04-26 11:38:37 上传评论收藏 27.86MB ZIP 举报

资源推荐

资源详情

资源评论

收起资源包目录

从音频中提取特征的不同方法和技术_Jupyter Notebook_Python_下载.zip （28个子文件）

Cough-signal-processing-master

setup.py 439B

csp

__init__.py 437B

tools

__init__.py 133B

io.py 1KB

spectrogram_features

__init__.py 176B

audio_data_augmentation.py 2KB

core_features.py 2KB

__pycache__

core_features.cpython-37.pyc 2KB

audio_data_augmentation.cpython-37.pyc 2KB

audio_features

frequency_domain.py 9KB

time_domain.py 20KB

data_utils.py 8KB

__pycache__

__init__.cpython-37.pyc 358B

Notebook Examples

featureExtraction.ipynb 7KB

data.h5 61.14MB

ResultMEtrics.png 22KB

coughDetectionTrial.py 6KB

Cough-detection-Trial.ipynb 240KB

Example.ipynb 493KB

Python.gitattributes 572B

.gitignore 3KB

Images

spectrogram_two.png 72KB

spectrogram_one.png 147KB

WAVE.png 16KB

spectrogram_three.png 148KB

Audio Feature Classification.PNG 184KB

readme.txt 1B

README.md 4KB

<p align="center"> <img width="250" src="./Images/WAVE.png"> </p> <h2 align="center">Cough Signal Processing ( csp ) </h2> <p align="center">A micro framework for cough singal processing </p> <p align="center"> Contribute and Support </p> [![GitHub license](https://img.shields.io/badge/License-Creative%20Commons%20Attribution%204.0%20International-blue)](https://github.com/coughresearch/Cough-signal-processing/blob/master/LICENSE) [![GitHub commit](https://img.shields.io/github/last-commit/coughresearch/Cough-signal-processing)](https://github.com/coughresearch/Cough-signal-processing/commits/main) [![PRs Welcome](https://img.shields.io/badge/PRs-welcome-brightgreen.svg?style=flat-square)](http://makeapullrequest.com) ### Features - Spectrogram features extraction - Contiguous features - Cough event detection - Experiments on noise removal, Silence in cough sounds - Applying different types of filters - Audio augmentation techniques | Feature ID | Feature Name | Description | | :-------------: |:-------------:|-----| | 1 | Zero Crossing Rate | The rate of sign-changes of the signal during the duration of a particular frame. | | 2 | Energy | The sum of squares of the signal values, normalized by the respective frame length. | | 3 | Entropy of Energy | The entropy of sub-frames' normalized energies. It can be interpreted as a measure of abrupt changes. | | 4 | Bispectrum Score (BGS) | 3rd order spectrum of the signal is known as the bispectrum.| | 5 | Non-gaussianity score(NGS) | NGS gives the measure of non-gaussianity of a given segment of data. | | 6 | Formant frequencies (FF) | A formant is the spectral shaping that results from an acoustic resonance of the human vocal tract. | | 7 | log energy (LogE) | The log energy for every subsegment | | 8 | kurtosis (Kurt) | kurtosis is a measure of the "tailedness" of the probability distribution of a real-valued random variable. | | 9 | MFCCs | Mel Frequency Cepstral Coefficients form a cepstral representation where the frequency bands are not linear but distributed according to the mel-scale. | 10 | MFCC delta, delta2 | Delta-MFCC and Delta-Delta-MFCC are used to extract the features of speakers. | | 11 | Skewness | skewness is a measure of the asymmetry of the probability distribution | | 12 | Power Spectral Density (PSD) | A Power Spectral Density (PSD) is the measure of signal's power content versus frequency. | | 13 | Linear Predictive Coding (LPC) | Representing the spectral envelope of a digital signal of speech in compressed form | | 14 | Continuous Wavelet Transform (CWT) | provides an overcomplete representation of a signal by letting the translation and scale parameter of the wavelets vary continuously. | ## More features and suggestions are welcome. ### Quick Start ```python from csp import SpectrogramFeatures # path of the cough audio sp = SpectrogramFeatures('cough_sound_9412.m4a') data = sp.spectrogram_data() ``` #### output <img width="350" src="./Images/spectrogram_one.png"> ### Audio augmentation techniques #### Speed tuning ```python from csp import AudioAugmentation # Audio_augmentation speed tuning Audio_aug = AudioAugmentation.speed_tuning(data['signal']) ``` #### output <img width="350" src="./Images/spectrogram_two.png"> #### Time shifting ```python # Audio augmentation time shifting aug = AudioAugmentation.time_shifting(data['signal']) ``` #### output <img width="350" src="./Images/spectrogram_three.png"> #### Feature extraction { @thileepanp } <p align="center"> <img width=850" src="./Images/Audio Feature Classification.PNG"> </p>

评论收藏

内容反馈

版权申诉