PESQ MOS Speech Quality Evaluation Code: PESQ Audio Alignment Algorithm Resource - CSDN Library

10 files in total: 5 .c, 3 .h, 1 .dsp


Uploaded 2011-10-21 10:46:44 · 39KB RAR

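**_PESQ MOS speech quality evaluation code_**

PESQ (Perceptual Evaluation of Speech Quality) is an internationally adopted method for objectively assessing speech quality, used mainly to measure speech quality in voice communication systems. MOS (Mean Opinion Score) is the scale it predicts: the average of subjective listener ratings, normally between 1 and 5, where 5 is the best quality and 1 the worst. The raw PESQ score itself ranges from -0.5 to 4.5. This archive contains PESQ source code set up for the VC6 (Visual C++ 6.0) build environment, which implements the evaluation procedure.

In voice communications, especially at stages such as coding, transmission, and noise suppression, assessing speech quality is essential. By modelling how the human auditory system perceives speech, PESQ provides an objective measure that correlates closely with subjective listening-test results. Its main characteristics are:

1. **Non-linear processing**: PESQ accounts for the ear's differing sensitivity to different frequency components by applying non-linear processing, which brings it closer to human auditory perception.
2. **Time- and frequency-domain analysis**: PESQ considers both the time-domain and frequency-domain properties of the speech signal, so it captures fine changes in the signal.
3. **Bandwidth support**: P.862 itself targets narrowband telephone speech, and the P.862.2 extension covers wideband speech. Super-wideband signals are outside the scope of PESQ.
4. **Multi-language applicability**: PESQ can be applied to speech in many languages and is not tied to any particular one.
5. **Standardized testing**: PESQ is defined in ITU-T Recommendation P.862, which gives it authority and general applicability.

According to the file listing, the `pesq` folder in the archive contains:

- **Source files**: .c files with the implementation of the PESQ algorithm, including the pre-processing, computation, and post-processing steps.
- **Header files**: .h files that define the function interfaces and data structures used by the other source files.
- **Project files**: `pesq.dsp` and `pesq.dsw`, the VC6 project and workspace files used to build the source.
- **Sample input/output**: the listing shows no sample speech files or reference MOS scores, so test material for verifying the build has to be obtained separately.

With this code, developers can run PESQ against their own speech processing systems to evaluate speech quality automatically. Note that the licence notice in the source permits only evaluation and conformance testing; any other use requires a licence from OPTICOM GmbH or Psytechnics Limited. The typical steps are:

1. **Compile the code**: build the PESQ source in VC6 to produce an executable.
2. **Prepare the input**: prepare the degraded speech signal to be evaluated and the reference signal, and make sure both are in the expected format.
3. **Run the evaluation**: invoke the pesq executable, passing the sample rate (the reference program accepts `+8000` or `+16000`) and the paths of the reference and degraded files.
4. **Analyse the output**: the program prints a predicted MOS score, which is used to judge the speech quality.

Understanding the principles and usage of PESQ MOS evaluation helps in optimizing the performance of voice communication systems and improving the user experience. For anyone working in speech processing, communications engineering, or audio coding, it is an essential skill.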
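To make the 1-to-5 scale concrete, the following minimal sketch maps a score onto the five listening-quality categories of the ITU-T P.800 absolute category rating scale (5 Excellent, 4 Good, 3 Fair, 2 Poor, 1 Bad). The function names `mos_category` and `mos_label` are hypothetical and are not part of the PESQ source, and rounding to the nearest category is only one assumed convention for reading a fractional score.

```c
/* Hypothetical helper (not part of the PESQ source): round a MOS-style
 * score to the nearest category on the 1..5 scale, clamping values that
 * fall outside it. Rounding to nearest is an assumed convention. */
int mos_category(float score)
{
    int category = (int)(score + 0.5f);   /* rounds half up for positive scores */

    if (category < 1) category = 1;       /* e.g. a raw PESQ score of -0.5 */
    if (category > 5) category = 5;
    return category;
}

/* Category labels of the ITU-T P.800 absolute category rating scale. */
const char *mos_label(int category)
{
    static const char *labels[] = { "Bad", "Poor", "Fair", "Good", "Excellent" };

    if (category < 1) category = 1;
    if (category > 5) category = 5;
    return labels[category - 1];
}
```

For example, `mos_label(mos_category(3.2f))` returns `"Fair"`. Keeping the fractional score is usually more informative when comparing systems; the label is only a reading aid.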


pesq.rar (10 files)

pesq

pesqdsp.c 33KB

pesq.dsw 533B

dsp.c 15KB

pesq.dsp 5KB

pesqmain.c 22KB

pesqmod.c 52KB

pesq.h 9KB

Release

dsp.h 5KB

pesqio.c 13KB

pesqpar.h 20KB

    /*****************************************************************************

    Perceptual Evaluation of Speech Quality (PESQ)
    ITU-T Recommendation P.862.
    Version 1.2 - 2 August 2002.

                  ****************************************
                  PESQ Intellectual Property Rights Notice
                  ****************************************

    DEFINITIONS:
    ------------
    For the purposes of this Intellectual Property Rights Notice
    the terms Perceptual Evaluation of Speech Quality Algorithm
    and PESQ Algorithm refer to the objective speech quality
    measurement algorithm defined in ITU-T Recommendation P.862;
    the term PESQ Software refers to the C-code component of P.862.

    NOTICE:
    -------
    All copyright, trade marks, trade names, patents, know-how and
    all or any other intellectual rights subsisting in or used in
    connection with including all algorithms, documents and manuals
    relating to the PESQ Algorithm and or PESQ Software are and remain
    the sole property in law, ownership, regulations, treaties and
    patent rights of the Owners identified below. The user may not
    dispute or question the ownership of the PESQ Algorithm and
    or PESQ Software.

    OWNERS ARE:
    -----------
    1. British Telecommunications plc (BT), all rights assigned
       to Psytechnics Limited
    2. Royal KPN NV, all rights assigned to OPTICOM GmbH

    RESTRICTIONS:
    -------------
    The user cannot:

    1. alter, duplicate, modify, adapt, or translate in whole or in
       part any aspect of the PESQ Algorithm and or PESQ Software
    2. sell, hire, loan, distribute, dispose or put to any commercial
       use other than those permitted below in whole or in part any
       aspect of the PESQ Algorithm and or PESQ Software

    PERMITTED USE:
    --------------
    The user may:

    1. Use the PESQ Software to:
       i)   understand the PESQ Algorithm; or
       ii)  evaluate the ability of the PESQ Algorithm to perform
            its intended function of predicting the speech quality
            of a system; or
       iii) evaluate the computational complexity of the PESQ Algorithm,
            with the limitation that none of said evaluations or its
            results shall be used for external commercial use.

    2. Use the PESQ Software to test if an implementation of the PESQ
       Algorithm conforms to ITU-T Recommendation P.862.

    3. With the prior written permission of both Psytechnics Limited
       and OPTICOM GmbH, use the PESQ Software in accordance with the
       above Restrictions to perform work that meets all of the following
       criteria:
       i)   the work must contribute directly to the maintenance of an
            existing ITU recommendation or the development of a new ITU
            recommendation under an approved ITU Study Item; and
       ii)  the work and its results must be fully described in a
            written contribution to the ITU that is presented at a formal
            ITU meeting within one year of the start of the work; and
       iii) neither the work nor its results shall be put to any
            commercial use other than making said contribution to the ITU.
            Said permission will be provided on a case-by-case basis.

    ANY OTHER USE OR APPLICATION OF THE PESQ SOFTWARE AND/OR THE PESQ
    ALGORITHM WILL REQUIRE A PESQ LICENCE AGREEMENT, WHICH MAY BE OBTAINED
    FROM EITHER OPTICOM GMBH OR PSYTECHNICS LIMITED.

    EACH COMPANY OFFERS OEM LICENSE AGREEMENTS, WHICH COMBINE OEM
    IMPLEMENTATIONS OF THE PESQ ALGORITHM TOGETHER WITH A PESQ PATENT LICENSE
    AGREEMENT. PESQ PATENT-ONLY LICENSE AGREEMENTS MAY BE OBTAINED FROM OPTICOM.

    ***********************************************************************
    *  OPTICOM GmbH                    *  Psytechnics Limited             *
    *  Am Weichselgarten 7,            *  Fraser House, 23 Museum Street, *
    *  D- 91058 Erlangen, Germany      *  Ipswich IP1 1HN, England        *
    *  Phone: +49 (0) 9131 691 160     *  Phone: +44 (0) 1473 261 800     *
    *  Fax:   +49 (0) 9131 691 325     *  Fax:   +44 (0) 1473 261 880     *
    *  E-mail: info@opticom.de,        *  E-mail: info@psytechnics.com,   *
    *  www.opticom.de                  *  www.psytechnics.com             *
    ***********************************************************************

    Further information is also available from www.pesq.org

    *****************************************************************************/

    #include <math.h>
    #include <stdio.h>
    #include "pesq.h"
    #include "pesqpar.h"
    #include "dsp.h"

    #define CRITERIUM_FOR_SILENCE_OF_5_SAMPLES 500.

    float Sl, Sp;

    int *nr_of_hz_bands_per_bark_band;
    double *centre_of_band_bark;
    double *centre_of_band_hz;
    double *width_of_band_bark;
    double *width_of_band_hz;
    double *pow_dens_correction_factor;
    double *abs_thresh_power;

    void input_filter( SIGNAL_INFO * ref_info, SIGNAL_INFO * deg_info,
                       float * ftmp )
    {
        DC_block( (*ref_info).data, (*ref_info).Nsamples );
        DC_block( (*deg_info).data, (*deg_info).Nsamples );

        apply_filters( (*ref_info).data, (*ref_info).Nsamples );
        apply_filters( (*deg_info).data, (*deg_info).Nsamples );
    }

    void calc_VAD( SIGNAL_INFO * sinfo )
    {
        apply_VAD( sinfo, sinfo-> data, sinfo-> VAD, sinfo-> logVAD );
    }

    int id_searchwindows( SIGNAL_INFO * ref_info, SIGNAL_INFO * deg_info,
                          ERROR_INFO * err_info )
    {
        long Utt_num = 0;
        long count, VAD_length;
        long this_start;
        int speech_flag = 0;
        float VAD_value;
        long del_deg_start;
        long del_deg_end;

        VAD_length = ref_info-> Nsamples / Downsample;

        del_deg_start = MINUTTLENGTH - err_info-> Crude_DelayEst / Downsample;
        del_deg_end =
            ((*deg_info).Nsamples - err_info-> Crude_DelayEst) / Downsample -
            MINUTTLENGTH;

        for (count = 0; count < VAD_length; count++)
        {
            VAD_value = ref_info-> VAD [count];

            if( (VAD_value > 0.0f) && (speech_flag == 0) )
            {
                speech_flag = 1;
                this_start = count;
                err_info-> UttSearch_Start [Utt_num] = count - SEARCHBUFFER;
                if( err_info-> UttSearch_Start [Utt_num] < 0 )
                    err_info-> UttSearch_Start [Utt_num] = 0;
            }

            if( ((VAD_value == 0.0f) || (count == (VAD_length-1))) &&
                (speech_flag == 1) )
            {
                speech_flag = 0;
                err_info-> UttSearch_End [Utt_num] = count + SEARCHBUFFER;
                if( err_info-> UttSearch_End [Utt_num] > VAD_length - 1 )
                    err_info-> UttSearch_End [Utt_num] = VAD_length -1;

                if( ((count - this_start) >= MINUTTLENGTH) &&
                    (this_start < del_deg_end) &&
                    (count > del_deg_start) )
                    Utt_num++;
            }
        }

        err_info-> Nutterances = Utt_num;
        return Utt_num;
    }

    void id_utterances( SIGNAL_INFO * ref_info, SIGNAL_INFO * deg_info,
                        ERROR_INFO * err_info )
    {
        long Utt_num = 0;
        long Largest_uttsize = 0;
        long count, VAD_length;
        int speech_flag = 0;
        float VAD_value;
        long this_start;
        long last_end;
        long del_deg_start;
        long del_deg_end;

        VAD_length = ref_info-> Nsamples / Downsample;

        del_deg_start = MINUTTLENGTH - err_info-> Crude_DelayEst / Downsample;
        del_deg_end =
            ((*deg_info).Nsamples - err_info-> Crude_DelayEst) / Downsample -
            MINUTTLENGTH;

        for (count = 0; count < VAD_length ; count++)
        {
            VAD_value = ref_info-> VAD [count];

            if( (VAD_value > 0.0f) && (speech_flag == 0) )
            {
                speech_flag = 1;
                this_start = count;
                err_info-> Utt_Start [Utt_num] = count;
            }

            if( ((VAD_value == 0.0f) || (count == (VAD_length-1))) &&
                (speech_flag == 1) )
            {
                speech_flag = 0;
                err_info-> Utt_End [Utt_num] = count;

                if( ((
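The loop shared by `id_searchwindows` and `id_utterances` in the excerpt is a small state machine over the reference signal's VAD track: a frame with a positive VAD value opens an utterance, and a zero frame (or the last frame) closes it. The sketch below isolates that idea in self-contained form. It is a simplified illustration under stated assumptions, not the PESQ implementation: `Utterance`, `find_utterances`, `MIN_UTT_LEN`, and `SEARCH_PAD` are hypothetical names with made-up values, and the delay-based checks against the degraded signal (`del_deg_start`, `del_deg_end`) are left out.

```c
/* Hypothetical constants for illustration only; the real MINUTTLENGTH and
 * SEARCHBUFFER come from the PESQ headers and are not shown in the excerpt. */
#define MIN_UTT_LEN 3   /* minimum run of frames that counts as an utterance */
#define SEARCH_PAD  2   /* padding added on each side for the search window */

typedef struct {
    long start;         /* first frame with VAD > 0 */
    long end;           /* frame at which the run was closed */
    long search_start;  /* start - SEARCH_PAD, clamped to 0 */
    long search_end;    /* end + SEARCH_PAD, clamped to n - 1 */
} Utterance;

/* Scan a VAD track of n frames and store up to max_out utterances in out.
 * Returns the number of utterances stored. */
long find_utterances(const float *vad, long n, Utterance *out, long max_out)
{
    long found = 0;
    long i;
    long this_start = 0;
    int  speech_flag = 0;

    for (i = 0; i < n; i++) {
        /* Rising edge: speech begins. */
        if (vad[i] > 0.0f && speech_flag == 0) {
            speech_flag = 1;
            this_start = i;
        }
        /* Falling edge, or end of the track while still in speech. */
        if ((vad[i] == 0.0f || i == n - 1) && speech_flag == 1) {
            speech_flag = 0;
            /* Keep only runs that are long enough and fit in the output. */
            if (i - this_start >= MIN_UTT_LEN && found < max_out) {
                out[found].start = this_start;
                out[found].end = i;
                out[found].search_start =
                    (this_start - SEARCH_PAD < 0) ? 0 : this_start - SEARCH_PAD;
                out[found].search_end =
                    (i + SEARCH_PAD > n - 1) ? n - 1 : i + SEARCH_PAD;
                found++;
            }
        }
    }
    return found;
}
```

Unlike the excerpt, this sketch takes an explicit `max_out` bound so the output array cannot overflow, and it records the padded search window and the utterance limits in one pass rather than in two separate functions.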
