A Beginner's Guide to the Mathematics of Neural Networks

A.C.C. Coolen
Department of Mathematics, King's College London
Abstract

In this paper I try to describe both the role of mathematics in shaping our understanding of how neural networks operate, and the curious new mathematical concepts generated by our attempts to capture neural networks in equations. My target reader being the non-expert, I will present a biased selection of relatively simple examples of neural network tasks, models and calculations, rather than try to give a full encyclopedic review-like account of the many mathematical developments in this field.
Contents

1 Introduction: Neural Information Processing
2 From Biology to Mathematical Models
  2.1 From Biological Neurons to Model Neurons
  2.2 Universality of Model Neurons
  2.3 Directions and Strategies
3 Neural Networks as Associative Memories
  3.1 Recipes for Storing Patterns and Pattern Sequences
  3.2 Symmetric Networks: the Energy Picture
  3.3 Solving Models of Noisy Attractor Networks
4 Creating Maps of the Outside World
  4.1 Map Formation Through Competitive Learning
  4.2 Solving Models of Map Formation
5 Learning a Rule From an Expert
  5.1 Perceptrons
  5.2 Multi-layer Networks
  5.3 Calculating what is Achievable
  5.4 Solving the Dynamics of Learning for Perceptrons
6 Puzzling Mathematics
  6.1 Complexity due to Frustration, Disorder and Plasticity
  6.2 The World of Replica Theory
7 Further Reading
1 Introduction: Neural Information Processing
Our brains perform sophisticated information processing tasks, using hardware and operation rules which are quite different from the ones on which conventional computers are based. The processors in the brain, the neurons (see figure 1), are rather noisy elements[1] which operate in parallel. They are organised in dense networks, the structure of which can vary from very regular to almost amorphous (see figure 2), and they communicate signals through a huge number of inter-neuron connections (the so-called synapses). These connections represent the `program' of a network. By continuously updating the strengths of the connections, a network as a whole can modify and optimise its `program', `learn' from experience and adapt to changing circumstances.
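To give the non-expert a first concrete picture (the details follow in section 2): a common caricature of a neuron is an element which adds up its weighted input signals and fires only when this sum exceeds a threshold. A minimal sketch, with illustrative weights and threshold of my own choosing rather than biological values:

```python
# Caricature of a neuron: it fires (+1) when the weighted sum of its
# input signals exceeds its threshold, and stays quiet (-1) otherwise.
# The weights (connection strengths) and the threshold together play
# the role of the neuron's 'program'.

def neuron_output(inputs, weights, threshold):
    """Return +1 if the weighted input sum exceeds the threshold, else -1."""
    potential = sum(w * s for w, s in zip(weights, inputs))
    return 1 if potential > threshold else -1

# Two excitatory connections (positive weights), one inhibitory (negative):
print(neuron_output([1, 1, 1], [0.5, 0.5, -0.2], 0.2))  # → 1 (fires)
```

Learning, in this picture, means nothing more mysterious than changing the numbers in `weights` and `threshold`.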
Figure 1: Left: a Purkinje neuron in the human cerebellum. Right: a pyramidal neuron of the rabbit cortex. The black blobs are the neurons; the trees of wires fanning out constitute the input channels (or dendrites) through which signals are received which are sent out by other firing neurons. The lines at the bottom, bifurcating only modestly, are the output channels (or axons).
From an engineering point of view neurons are in fact rather poor processors: they are slow and unreliable (see the table below). In the brain this is overcome by ensuring that a very large number of neurons are always involved in any task, and by having them operate in parallel, with many connections. This is in sharp contrast to conventional computers, where operations are as a rule performed sequentially, so that failure of any part of the chain of operations is usually fatal. Furthermore, conventional computers execute a detailed specification of orders, requiring the programmer to know exactly which data can be expected and how to respond. Subsequent changes in the actual situation, not foreseen by the programmer, lead to trouble. Neural networks, on the other hand, can adapt to changing circumstances. Finally, in our brain large numbers of neurons end their careers each day unnoticed. Compare this to what happens if we randomly cut a few wires in our workstation.
[1] By this we mean that their output signals are to some degree subject to random variation; they exhibit so-called spontaneous activity which appears not to be related to the information processing task they are involved in.
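A standard way to build such noise into a model neuron (variants of this reappear in the noisy attractor networks of section 3) is to make the firing decision probabilistic, with a `temperature' parameter controlling the noise level. The sigmoidal convention below is a common choice, used here purely for illustration:

```python
import math
import random

def noisy_neuron(potential, temperature):
    """Fire (+1) with probability 1/(1+exp(-2*potential/temperature))."""
    p_fire = 1.0 / (1.0 + math.exp(-2.0 * potential / temperature))
    return 1 if random.random() < p_fire else -1

# Low temperature: the neuron follows its input almost deterministically.
# High temperature: it fires essentially at random (spontaneous activity).
```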
Figure 2: Left: a section of the human cerebellum. Right: a section of the human cortex. Note that the staining method used to produce such pictures colours only a reasonably modest fraction of the neurons present, so in reality these networks are far more dense.
Roughly speaking, conventional computers can be seen as the appropriate tools for performing well-defined and rule-based information processing tasks, in stable and safe environments, where all possible situations, as well as how to respond in every situation, are known beforehand. Typical tasks fitting these criteria are e.g. brute-force chess playing, word processing, keeping accounts and rule-based (civil servant) decision making. Neural information processing systems, on the other hand, are superior to conventional computers in dealing with real-world tasks, such as e.g. communication (vision, speech recognition), movement coordination (robotics) and experience-based decision making (classification, prediction, system control), where data are often messy, uncertain or even inconsistent, where the number of possible situations is infinite and where perfect solutions are for all practical purposes non-existent.
One can distinguish three types of motivation for studying neural networks. Biologists, physiologists, psychologists and to some degree also philosophers aim at understanding information processing in real biological nervous tissue. They study models, mathematically and through computer simulations, which are preferably close to what is being observed experimentally, and try to understand the global properties and functioning of brain regions.
    conventional computers             biological neural networks
    ------------------------------     -------------------------------
    processors                         neurons
    operation speed ≈ 10^8 Hz          operation speed ≈ 10^2 Hz
    signal/noise ≫ 1                   signal/noise ∼ 1
    signal velocity ≈ 10^8 m/sec       signal velocity ≈ 1 m/sec
    connections ≤ 10                   connections ≥ 10^4
    sequential operation               parallel operation
    program & data                     connections, neuron thresholds
    external programming               self-programming & adaptation
    hardware failure: fatal            robust against hardware failure
    no unforeseen data                 messy, unforeseen data
Engineers and computer scientists would like to understand the principles behind neural information processing in order to use these for designing adaptive software and artificial information processing systems which can also `learn'. They use highly simplified neuron models, which are again arranged in networks. Like their biological counterparts, these artificial systems are not programmed, their inter-neuron connections are not prescribed, but they are `trained'. They gradually `learn' to perform tasks by being presented with examples of what they are supposed to do. The key question then is to understand the relationships between the network performance for a given type of task, the choice of `learning rule' (the recipe for the modification of the connections) and the network architecture. Secondly, engineers and computer scientists exploit the emerging insight into the way real (biological) neural networks manage to process information efficiently in parallel, by building artificial neural networks in hardware, which also operate in parallel. These systems, in principle, have the potential of being incredibly fast information processing machines.
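As a foretaste of section 5, the simplest instance of such a `learning rule' is the classic perceptron rule: after every mistake, nudge each connection towards the desired answer. The toy task below is my own illustrative choice:

```python
def train_perceptron(examples, n_inputs, rate=0.1, epochs=50):
    """Learn weights from (input vector, desired output) pairs.

    Whenever the network answers wrongly, each weight is nudged:
    w -> w + rate * desired * input.
    """
    weights = [0.0] * n_inputs
    for _ in range(epochs):
        for x, desired in examples:
            output = 1 if sum(w * xi for w, xi in zip(weights, x)) > 0 else -1
            if output != desired:
                weights = [w + rate * desired * xi
                           for w, xi in zip(weights, x)]
    return weights

# Toy task: the desired output is simply the sign of the first input.
examples = [([1, 1], 1), ([1, -1], 1), ([-1, 1], -1), ([-1, -1], -1)]
weights = train_perceptron(examples, 2)
```

For tasks of this (linearly separable) kind the rule provably converges; how far such simple recipes carry, and what happens in multi-layer networks, is precisely the subject of section 5.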
Finally, it will be clear that, due to their complex structure, the large numbers of elements involved, and their dynamic nature, neural network models exhibit highly non-trivial and rich behaviour. This is why theoretical physicists and mathematicians have also become involved, challenged as they are by the many fundamental new mathematical problems posed by neural network models. Studying neural networks as a mathematician is rewarding in two ways. The first reward is to find nice applications for one's tools in biology and engineering. It is fairly easy to come up with ideas about how certain information processing tasks could be performed by (either natural or synthetic) neural networks; by working out the mathematics, however, one can actually
quantify the potential and restrictions of such ideas. Mathematical analysis further allows for a systematic design of new networks, and the discovery of new mechanisms. The second reward is to discover that one's tools, when applied to neural network models, create quite novel and funny mathematical puzzles. The reason for this is the `messy' nature of these systems. Neurons are not at all well-behaved: they are microscopic elements which do not live on a regular lattice, they are noisy, they change their mutual interactions all the time, etc.

Since this paper aims at no more than sketching a biased impression of a research field, I will not give references to research papers along the way, but mention textbooks and review papers in the final section, for those interested.