VYSOKÉ UČENÍ TECHNICKÉ V BRNĚ
BRNO UNIVERSITY OF TECHNOLOGY

FAKULTA INFORMAČNÍCH TECHNOLOGIÍ
ÚSTAV POČÍTAČOVÉ GRAFIKY A MULTIMÉDIÍ
FACULTY OF INFORMATION TECHNOLOGY
DEPARTMENT OF COMPUTER GRAPHICS AND MULTIMEDIA

STATISTICAL LANGUAGE MODELS BASED ON NEURAL NETWORKS

DISERTAČNÍ PRÁCE
PHD THESIS

AUTOR PRÁCE    Ing. TOMÁŠ MIKOLOV
AUTHOR

BRNO 2012
VYSOKÉ UČENÍ TECHNICKÉ V BRNĚ
BRNO UNIVERSITY OF TECHNOLOGY

FAKULTA INFORMAČNÍCH TECHNOLOGIÍ
ÚSTAV POČÍTAČOVÉ GRAFIKY A MULTIMÉDIÍ
FACULTY OF INFORMATION TECHNOLOGY
DEPARTMENT OF COMPUTER GRAPHICS AND MULTIMEDIA

STATISTICKÉ JAZYKOVÉ MODELY ZALOŽENÉ NA NEURONOVÝCH SÍTÍCH
STATISTICAL LANGUAGE MODELS BASED ON NEURAL NETWORKS

DISERTAČNÍ PRÁCE
PHD THESIS

AUTOR PRÁCE    Ing. TOMÁŠ MIKOLOV
AUTHOR

VEDOUCÍ PRÁCE    Doc. Dr. Ing. JAN ČERNOCKÝ
SUPERVISOR

BRNO 2012
Abstrakt
Statistické jazykové modely jsou důležitou součástí mnoha úspěšných aplikací, mezi něž patří například automatické rozpoznávání řeči a strojový překlad (příkladem je známá aplikace Google Translate). Tradiční techniky pro odhad těchto modelů jsou založeny na tzv. N-gramech. Navzdory známým nedostatkům těchto technik a obrovskému úsilí výzkumných skupin napříč mnoha oblastmi (rozpoznávání řeči, automatický překlad, neuroscience, umělá inteligence, zpracování přirozeného jazyka, komprese dat, psychologie atd.), N-gramy v podstatě zůstaly nejúspěšnější technikou. Cílem této práce je prezentace několika architektur jazykových modelů založených na neuronových sítích. Ačkoliv jsou tyto modely výpočetně náročnější než N-gramové modely, s technikami vyvinutými v této práci je možné jejich efektivní použití v reálných aplikacích. Dosažené snížení počtu chyb při rozpoznávání řeči oproti nejlepším N-gramovým modelům dosahuje 20%. Model založený na rekurentní neuronové síti dosahuje nejlepších publikovaných výsledků na velmi známé datové sadě (Penn Treebank).
Abstract
Statistical language models are a crucial part of many successful applications, such as automatic speech recognition and statistical machine translation (for example, the well-known Google Translate). Traditional techniques for estimating these models are based on N-gram counts. Despite the known weaknesses of N-grams and the huge effort of research communities across many fields (speech recognition, machine translation, neuroscience, artificial intelligence, natural language processing, data compression, psychology, etc.), N-grams have remained essentially the state of the art. The goal of this thesis is to present several architectures of language models based on artificial neural networks. Although these models are computationally more expensive than N-gram models, the techniques presented here make it possible to apply them efficiently in state-of-the-art systems. The achieved reductions in the word error rate of speech recognition systems reach up to 20% relative to a state-of-the-art N-gram model. The presented recurrent neural network based model achieves the best published performance on the well-known Penn Treebank setup.
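As background to the N-gram counting mentioned above, the basic maximum-likelihood estimate can be sketched in a few lines. This is an illustrative example only (not code from the thesis); it estimates bigram probabilities as P(w2 | w1) = count(w1, w2) / count(w1):

```python
from collections import Counter

def train_bigram_lm(tokens):
    """Maximum-likelihood bigram model: P(w2 | w1) = count(w1, w2) / count(w1)."""
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    return {(w1, w2): c / unigrams[w1] for (w1, w2), c in bigrams.items()}

corpus = "the cat sat on the mat".split()
probs = train_bigram_lm(corpus)
# "the" occurs twice and is followed once by "cat", so P("cat" | "the") = 0.5
```

A real N-gram model additionally needs smoothing (e.g. Kneser-Ney) to assign non-zero probability to unseen word sequences, which is exactly one of the weaknesses the thesis discusses.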
Klíčová slova
jazykový model, neuronová síť, rekurentní, maximální entropie, rozpoznávání řeči, komprese dat, umělá inteligence
Keywords
language model, neural network, recurrent, maximum entropy, speech recognition, data
compression, artificial intelligence
Citace
Tomáš Mikolov: Statistical Language Models Based on Neural Networks, disertační práce, Brno, FIT VUT v Brně, 2012
Statistical Language Models Based on Neural Networks
Declaration
I declare that I wrote this doctoral thesis independently under the supervision of Doc. Dr. Ing. Jan Černocký. I have cited all the literature and publications from which I drew. Some experiments were carried out in cooperation with other members of the Speech@FIT group or with students from Johns Hopkins University; this is always stated explicitly in the thesis.

. . . . . . . . . . . . . . . . . . . . . . .
Tomáš Mikolov
May 2012
Acknowledgements
I would like to thank my supervisor Jan Černocký for allowing me to explore new approaches to standard problems, for his support and constructive criticism of my work, and for his ability to quickly organize everything related to my studies. I am grateful to Lukáš Burget for the advice he gave me about speech recognition systems, for long discussions about many technical details, and for his open-minded approach to research. I would also like to thank all members of the Speech@FIT group for their cooperation, especially Stefan Kombrink, Oldřich Plchot, Martin Karafiát, Ondřej Glembek and Jiří Kopecký.
It was a great experience for me to visit Johns Hopkins University during my studies, and I am grateful to Frederick Jelinek and Sanjeev Khudanpur for granting me this opportunity. I always enjoyed discussions with Sanjeev, who was my mentor during my stay there. I also collaborated with other students at JHU, especially Puyang Xu, Scott Novotney and Anoop Deoras. With Anoop, we were able to push the state of the art on several standard tasks to new limits, which was the most exciting part for me.
As my thesis builds on the work of Yoshua Bengio, I was glad that I could spend several months in his machine learning lab at the University of Montreal. I always enjoyed reading Yoshua's papers, and it was a pleasure to discuss my ideas with him in person.
© Tomáš Mikolov, 2012.
This thesis was created as a school work at Brno University of Technology, Faculty of Information Technology. It is protected by copyright law, and its use without permission granted by the author is illegal, except for cases defined by law.
Contents
1 Introduction 4
1.1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.2 Structure of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.3 Claims of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2 Overview of Statistical Language Modeling 9
2.1 Evaluation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.1.1 Perplexity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.1.2 Word Error Rate . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
2.2 N-gram Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
2.3 Advanced Language Modeling Techniques . . . . . . . . . . . . . . . . . . . 17
2.3.1 Cache Language Models . . . . . . . . . . . . . . . . . . . . . . . . . 18
2.3.2 Class Based Models . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
2.3.3 Structured Language Models . . . . . . . . . . . . . . . . . . . . . . 20
2.3.4 Decision Trees and Random Forest Language Models . . . . . . . . . 22
2.3.5 Maximum Entropy Language Models . . . . . . . . . . . . . . . . . . 22
2.3.6 Neural Network Based Language Models . . . . . . . . . . . . . . . . 23
2.4 Introduction to Data Sets and Experimental Setups . . . . . . . . . . . . . 24
3 Neural Network Language Models 26
3.1 Feedforward Neural Network Based Language Model . . . . . . . . . . . . . 27
3.2 Recurrent Neural Network Based Language Model . . . . . . . . . . . . . . 28
3.3 Learning Algorithm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
3.3.1 Backpropagation Through Time . . . . . . . . . . . . . . . . . . . . 33
3.3.2 Practical Advice for Training . . . . . . . . . . . . . . . . . . . . . . 35
3.4 Extensions of NNLMs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37