语音识别说话人识别语音库资源-CSDN文库

共211个文件

wav：207个

pdf：4个

说话人识别

语音识别

5星 · 超过95%的资源需积分: 50 5 浏览量 2011-10-14 11:18:16 上传评论 8 收藏 59.72MB ZIP 举报

资源推荐

资源详情

资源评论

收起资源包目录

语音识别说话人识别语音库（211个子文件）

readme.pdf 126KB

test text.pdf 100KB

phonetic alphabet.pdf 43KB

training text.pdf 11KB

FTEJ_Sd.wav 734KB

FMEV_Sd.wav 674KB

FTEJ_Sb.wav 647KB

FAML_Sb.wav 635KB

FMEV_Sb.wav 626KB

FAML_Sd.wav 625KB

FEAB_Sd.wav 609KB

FEAB_Sb.wav 606KB

FHRO_Sd.wav 594KB

MPRA_Sb.wav 588KB

FJAZ_Sd.wav 586KB

FEAB_Sr6.wav 581KB

MFKC_Sb.wav 578KB

MOEW_Sd.wav 569KB

MFKC_Sd.wav 569KB

MNHP_Sd.wav 566KB

MRKO_Sd.wav 563KB

MMLP_Sd.wav 556KB

MOEW_Sb.wav 544KB

MPRA_Sd.wav 538KB

MMNA_Sd.wav 538KB

FMEV_Sr10.wav 534KB

FHRO_Sb.wav 531KB

FSLJ_Sd.wav 525KB

MLKH_Sd.wav 524KB

MNHP_Sb.wav 522KB

MASM_Sd.wav 520KB

FMEL_Sd.wav 519KB

MREM_Sd.wav 519KB

FDHH_Sd.wav 519KB

FUAN_Sd.wav 519KB

MASM_Sb.wav 514KB

FMEL_Sb.wav 513KB

MMLP_Sb.wav 511KB

FSLJ_Sb.wav 509KB

FUAN_Sb.wav 488KB

MRKO_Sb.wav 484KB

MLKH_Sb.wav 478KB

FJAZ_Sb.wav 472KB

FUAN_Sr42.wav 469KB

MMNA_Sb.wav 469KB

FAML_Sc.wav 466KB

MKBP_Sb.wav 456KB

FTEJ_Sa.wav 456KB

MREM_Sb.wav 453KB

FMEV_Sc.wav 453KB

MKBP_Sd.wav 450KB

MCBR_Sb.wav 450KB

FTEJ_Sc.wav 447KB

FDHH_Sb.wav 444KB

MASM_Sr12.wav 444KB

MCBR_Sd.wav 441KB

MTLS_Sd.wav 438KB

MTLS_Sb.wav 428KB

MOEW_Sr44.wav 428KB

MOEW_Sa.wav 422KB

MOEW_Sc.wav 419KB

MFKC_Sc.wav 416KB

FAML_Sa.wav 416KB

FEAB_Sa.wav 413KB

MNHP_Sc.wav 406KB

FHRO_Sc.wav 403KB

MPRA_Sa.wav 399KB

FMEV_Sa.wav 397KB

MASM_Sc.wav 397KB

MPRA_Sc.wav 394KB

FEAB_Sc.wav 394KB

MMLP_Sc.wav 394KB

MFKC_Sa.wav 391KB

FUAN_Sc.wav 384KB

FHRO_Sr33.wav 384KB

MREM_Sa.wav 381KB

MREM_Sr8.wav 381KB

FAML_Sr3.wav 375KB

FDHH_Sc.wav 372KB

FSLJ_Sc.wav 372KB

MNHP_Sr2.wav 372KB

FHRO_Sa.wav 363KB

MRKO_Sr17.wav 363KB

MLKH_Sc.wav 359KB

MREM_Sc.wav 359KB

FTEJ_Sf.wav 359KB

FJAZ_Sc.wav 356KB

MRKO_Sc.wav 353KB

FMEV_Sf.wav 350KB

FDHH_Sa.wav 350KB

MASM_Sa.wav 350KB

FMEL_Sr24.wav 350KB

FMEL_Sc.wav 349KB

FUAN_Sa.wav 349KB

FAML_Sf.wav 344KB

MRKO_Sa.wav 344KB

MNHP_Sa.wav 341KB

FMEL_Sa.wav 338KB

FJAZ_Sr37.wav 338KB

FSLJ_Sa.wav 331KB

共 211 条

Database description

Side 1 of 1

English Language Speech Database for Speaker Recognition

(ELSDSR)

ELSDSR corpus of read speech has been designed to provide speech data for

the development and evaluation of automatic speaker recognition system.

ELSDSR corpus design was a joint effort of the faculty, Ph. D students and

Master students from department of Informatics and Mathematical Modeling

(IMM) at Technical University of Denmark (DTU). The speech language is

English, and spoken by 21 Dane, one Islander and one Canadian. Due to the

usage of this database and some realistic factors, perfect or even correct

pronunciation is not required and necessary for getting the specific and uniquely

identifiable characteristics for individual.

1. Recording Condition

The recording work has been carried out in a chamber (room 133) in building

321, 1

floor at DTU. The chamber is an 8.82*11.8*3.05 m

(width*length*height)

computer room (classroom), with 22 monitors and 34 tables. The recording is

manipulated in, approximately, the middle of this chamber, with one microphone,

one 70*120*70 cm

table in front of speakers. In order to deflect the reflection,

two deflection boards with measure of 93*211.5*6 cm

were placed at tilted

angles facing each other, and were infront of the table and speakers. For details

please see the setup drawing, drawing of the room and position of recording,

etc., in appendix.

2. Recording equipment and setup

The equipment for recording work is MARANTZ PMD670 portable solid state

recorder. PMD670 can record in a variety of compression algorithm, associated

bit rate, file format, and recording type (channels recorded) parameters. It

supports two kinds of recording format: compressed recording, which includes

MP2 and MP3; uncompressed recording, which includes linear pulse code

modulation (PCM). The recording type can be stereo, mono or digital, and the file

can be recorded into .wav .bwf .mpg or .mp3 format according to user need.

In this database, the voice messages are recorded into the most commonly

used file type--.wav. And the algorithm used is PCM. Table 1 shows the initial

setup for the recorder, for detail see PMD670 user guide.

Table 1: Recorder Setup

Setup

Input

Auto

Mark

Pre

Rec

Analog

Out

MIN

Atten

Repeat ANC

EDL

Play

Level

Cont.

S. Skip

MIC

(MONO)

OFF ON OFF 20dB OFF FLAT OFF

MANUA

20dB

评论收藏

内容反馈

ZLG1012

2014-11-10

下载失败了，不知道是不是CSDN的问题。而且10分太贵了。
rainsbaby

2012-12-31

英文的，挺清晰，希望有用
yanfly

2013-05-16

很清晰，作为识别的样本很好！
邪仔

2016-06-23

很好的资源对于语音识别学习很有帮助
xihongshisdpo

2012-05-15

很清晰，可用于说话人识别的对比测试

前往

页

visual870811

粉丝: 4
资源: 6

语音识别说话人识别语音库

speaker.rar_matlab音频_语音库 MATLAB_语音识别matlab_音频识别_音频识别 MATLAB

语音识别支持库

语音处理说话人识别

语音识别的demo及需要的库

C#讯飞语音识别库

语音库（发音人）-附件资源

中文语音库

短语音说话人识别算法及说话人识别技术应用研究

实用语音识别基础

中文语音库 微软 sdk

中文女声语音库定制版 3.5

NOIZEUS实验室纯净语音库

使用Windows自带语音库语音合成

XP语音库 亲测可用

实用语音识别基础电子版

微软语音识别开发包 Speech SDK 5.1 & 中文语音包

语音软件(讯飞语音库)

微软中文语音库

TTs-中文语音库

语音身份识别c代码

Qt 5实现串口调试助手 （源工程文件、0积分下载）

【SystemVerilog】路科验证V2学习笔记（全600页）.pdf

AutoSAR标准协议4.2.2

光伏-储能并网系统仿真.rar

最新资源

中文语音库微软 sdk

XP语音库亲测可用

Qt 5实现串口调试助手（源工程文件、0积分下载）