<!DOCTYPE html>
<!-- saved from url=(0038)https://zhuanlan.zhihu.com/p/690262925 -->
<html lang="zh" data-hairline="true" class="itcauecng" data-theme="light" data-rh="data-theme" style=""><head><meta http-equiv="Content-Type" content="text/html; charset=UTF-8"><title>一文搞懂 LSTM(长短期记忆网络) - 知乎</title><meta name="viewport" content="width=device-width,initial-scale=1,maximum-scale=1"><meta name="renderer" content="webkit"><meta name="force-rendering" content="webkit"><meta http-equiv="X-UA-Compatible" content="IE=10,chrome=1"><meta name="google-site-verification" content="FTeR0c8arOPKh8c5DYh_9uu98_zJbaWw53J-Sch9MTg"><meta data-rh="true" name="keywords" content="LSTM,LSTM Transformers,算法"><meta data-rh="true" name="description" content="本文将从LSTM的本质、LSTM的原理、LSTM的应用三个方面,带您一文搞懂长短期记忆网络Long Short Term Memory | LSTM。 一、LSTM的本质 RNN 面临问题:RNN(递归神经网络)在处理长序列时面临的主要问题:短时记忆和…"><meta data-rh="true" property="og:title" content="一文搞懂 LSTM(长短期记忆网络)"><meta data-rh="true" property="og:url" content="https://zhuanlan.zhihu.com/p/690262925"><meta data-rh="true" property="og:description" content="本文将从LSTM的本质、LSTM的原理、LSTM的应用三个方面,带您一文搞懂长短期记忆网络Long Short Term Memory | LSTM。 一、LSTM的本质 RNN 面临问题:RNN(递归神经网络)在处理长序列时面临的主要问题:短时记忆和…"><meta data-rh="true" property="og:image" content="https://picx.zhimg.com/v2-b7d46f9967732bcd771cc64fc722e88a_720w.jpg?source=172ae18b"><meta data-rh="true" property="og:type" content="article"><meta data-rh="true" property="og:site_name" content="知乎专栏"><link data-rh="true" rel="apple-touch-icon" href="https://static.zhihu.com/heifetz/assets/apple-touch-icon-152.81060cab.png"><link data-rh="true" rel="apple-touch-icon" href="https://static.zhihu.com/heifetz/assets/apple-touch-icon-152.81060cab.png" sizes="152x152"><link data-rh="true" rel="apple-touch-icon" href="https://static.zhihu.com/heifetz/assets/apple-touch-icon-120.d5793cac.png" sizes="120x120"><link data-rh="true" rel="apple-touch-icon" href="https://static.zhihu.com/heifetz/assets/apple-touch-icon-76.7abf3393.png" sizes="76x76"><link data-rh="true" rel="apple-touch-icon" href="https://static.zhihu.com/heifetz/assets/apple-touch-icon-60.362a8eac.png" sizes="60x60"><link crossorigin="" rel="shortcut icon" type="image/x-icon" href="https://static.zhihu.com/heifetz/favicon.ico"><link crossorigin="" rel="search" type="application/opensearchdescription+xml" href="https://static.zhihu.com/heifetz/search.xml" title="知乎"><link rel="dns-prefetch" href="https://static.zhimg.com/"><link rel="dns-prefetch" href="https://pica.zhimg.com/"><link rel="dns-prefetch" href="https://picx.zhimg.com/"><link rel="dns-prefetch" href="https://pic1.zhimg.com/"><link rel="dns-prefetch" href="https://pic2.zhimg.com/"><link rel="dns-prefetch" href="https://pic3.zhimg.com/"><link rel="dns-prefetch" href="https://pic4.zhimg.com/"><link rel="dns-prefetch" href="https://static.zhihu.com/"><style data-emotion-css="9mvwt1">:root{--zhc-padding-horizontal:20px;--zhc-padding-vertical:16px;--zhc-notification-top:75px;--app-padding:16px;--app-header-height:52px;--app-max-width:640px;--app-width:1000px;--app-font-size:15px;}</style><script nonce="" data-web-reporter-config="{"platform":"web","project":"heifetz"}">!function(e,t){"object"==typeof exports&&"undefined"!=typeof module?t(exports):"function"==typeof define&&define.amd?define(["exports"],t):t((e=e||self).webReporter={})}(this,function(e){"use strict";var t={},n=!1,o=function(){var e,o,r,a,i;return n||(e=document.querySelector("script[data-web-reporter-config]"),o=e&&e.dataset.webReporterConfig||"{}",r=JSON.parse(o),a=r.platform,i=r.project,t={platform:a,project:i},n=!0),t};function r(e){return a(function(){return localStorage.getItem(e)})()}function a(e){return function(){try{return e.apply(void 0,arguments)}catch(e){}}}var i=a(function(e,t){var n={platform:"web",project:o().project,clientTimestamp:+new Date};!function(e,t,n){"1"===r("weber:logenabled")&&console.log("[web-reporter]%o",{type:e,base:t,data:n})}(e,n,t),function(e,t){var n=btoa(JSON.stringify(t));if("undefined"!=typeof Blob&&window.navigator&&window.navigator.sendBeacon){var o=new Blob([n],{type:"text/plain"});navigator.sendBeacon(e,o)}else{var r=new XMLHttpRequest;r.open("POST",e),r.withCredentials=!1,r.setRequestHeader("Content-Type","text/plain;charset=UTF-8"),r.send(n)}}(r("weber:api")||"https://apm.zhihu.com/collector/web_json",{type:e,base:n,data:t})});e.report=i,Object.defineProperty(e,"__esModule",{value:!0})});
</script><link href="./一文搞懂 LSTM(长短期记忆网络) _files/1566.216a26f4.cfb4ec8879cc0ffbf7ad.css" crossorigin="" rel="stylesheet"><link href="./一文搞懂 LSTM(长短期记忆网络) _files/column.216a26f4.2c6bc29d03f98e987815.css" crossorigin="" rel="stylesheet"><link rel="stylesheet" type="text/css" href="./一文搞懂 LSTM(长短期记忆网络) _files/user-hover-card.216a26f4.0e3930e2ffc649f7ed8b.css" crossorigin="anonymous"><link rel="stylesheet" type="text/css" href="./一文搞懂 LSTM(长短期记忆网络) _files/GoodsRecommendGoodsCardList.216a26f4.d95ce79191cdf8d7ac28.css" crossorigin="anonymous"><link rel="stylesheet" type="text/css" href="./一文搞懂 LSTM(长短期记忆网络) _files/report_modals.216a26f4.370935343bb8c3bca346.css" crossorigin="anonymous"><link rel="stylesheet" type="text/css" href="./一文搞懂 LSTM(长短期记忆网络) _files/632.216a26f4.763aa082a89da06da92e.css" crossorigin="anonymous"><link rel="stylesheet" type="text/css" href="./一文搞懂 LSTM(长短期记忆网络) _files/3433.216a26f4.1c95267e7d0f4f116db2.css" crossorigin="anonymous"><link rel="stylesheet" type="text/css" href="./一文搞懂 LSTM(长短期记忆网络) _files/ECommerceAd.216a26f4.c90979d08ed65b86fc8f.css" crossorigin="anonymous"><link rel="stylesheet" type="text/css" href="./一文搞懂 LSTM(长短期记忆网络) _files/1468.216a26f4.101bb14a2c1d8d20b744.css" crossorigin="anonymous"><link rel="stylesheet" type="text/css" href="./一文搞懂 LSTM(长短期记忆网络) _files/EditableV2.216a26f4.6c9301eaad8df7e50cb2.css" crossorigin="anonymous"><script nonce="">!function(){"use strict";!function(e,n){var r=[];function t(e){return function(){r.push([e,arguments])}}n.Raven={captureException:t("captureException"),captureMessage:t("captureMessage"),captureBreadcrumb:t("captureBreadcrumb")};var a,o,c,i,s,u="undefined"!=typeof DOMError;function d(e){var n=e instanceof Error||e instanceof ErrorEvent||u&&e instanceof DOMError||e instanceof DOMException;Raven.captureException(n?e:new Error(e.message||e.reason))}n.addEventListener("unhandledrejection",d),n.addEventListener("error",d,!0),a=e.src,o=e,c=function(){r.forEach(function(e){var n;(n=Raven)[e[0]].apply(n,e[1])}),n.removeEventListener("unhandledrejection",d),n.removeEventListener("error",d,!0)},i=document.head||document.getElementsByTagName("head")[0],(s=document.createElement("script")).crossOrigin=o.crossOrigin,s.dataset.sentryConfig=o["data-sentry-config"],s.onload=c,s.src=a,i.appendChild(s)}({"defer":true,"crossOrigin":"anonymous","src":"https://unpkg.zhimg.com/@cfe/sentry-script@1.3.1/dist/init.js","data-sentry-config":"{\"dsn\":\"https://2d8d764432cc4f6fb3bc78ab9528299d@crash2.zhihu.com/1224\",\"sampleRate\":0.1,\"release\":\"1341-9a59882d\",\"ignoreErrorNames\":[\"NetworkError\",\"SecurityError\"],\"ignoreErrorsPreset\":\"ReactApp\",\"tags\":{\"app_name\":\"heifetz\"}}"},window)}();
</script><script crossorigin="anonymous" data-sentry-config="{"dsn":"https://2d8d764432cc4f6fb3bc78ab9528299d@crash2.zhihu.com/1224","sampleRate":0.1,"release":&
没有合适的资源?快使用搜索试试~ 我知道了~
温馨提示
lstm 梯度更新规则 短时记忆 问题描述:RNN在处理长序列时,由于信息的传递是通过隐藏状态进行的,随着时间的推移,较早时间步的信息可能会在传递到后面的时间步时逐渐消失或被覆盖。 影响:这导致RNN难以捕捉和利用序列中的长期依赖关系,从而限制了其在处理复杂任务时的性能。 梯度消失/梯度爆炸 问题描述:在RNN的反向传播过程中,梯度会随着时间步的推移而逐渐消失(变得非常小)或爆炸(变得非常大)。 影响:梯度消失使得RNN在训练时难以学习到长期依赖关系,因为较早时间步的梯度信息在反向传播到初始层时几乎为零。梯度爆炸则可能导致训练过程不稳定,权重更新过大,甚至导致数值溢出。 LSTM解决问题:大脑和LSTM在处理信息时都选择性地保留重要信息,忽略不相关细节,并据此进行后续处理。这种机制使它们能够高效地处理和输出关键信息,解决了RNN(递归神经网络)在处理长序列时面临的问题。
资源推荐
资源详情
资源评论




















收起资源包目录



























































共 56 条
- 1
资源评论


比特流1024
- 粉丝: 2333
上传资源 快速赚钱
我的内容管理 展开
我的资源 快来上传第一个资源
我的收益
登录查看自己的收益我的积分 登录查看自己的积分
我的C币 登录后查看C币余额
我的收藏
我的下载
下载帮助


最新资源
- DB21_T_1545_2007_大米中苄嘧磺隆残留量的测定(高效液相色谱法).pdf
- XX-公司网络解决方案.doc
- 餐饮系统程序设计说明.doc
- 2023年自考计算机软件基础历年真题集.doc
- 2022软件企业杂志新年寄语.docx
- 茶叶(-)表没食子酸儿茶素没食子酸酯抗UVB辐射损伤作用研究.doc
- XX职称计算机考试网络综合试题.doc
- HPLC法测定复方益母口服液中水苏碱的含量.docx
- 2023年天大考试电子商务系统分析与设计离线作业考核试题.doc
- 操作系统考试题答案.doc
- 初级网络管理员培训班学员心得体会.doc
- SAP金融业管理信息化解决方案.doc
- javaOracle数据库课程设计报告.doc
- 2023年全国10月高等教育自学考试电子商务概论试题(及答案).doc
- 办公室网络环境的搭建.pdf
- 财务常识:Excel常见的错误值[会计实务-会计实操].doc
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈



安全验证
文档复制为VIP权益,开通VIP直接复制
