Andrew Ng NLP Course 2
Contents
Andrew Ng NLP Course 2
1. N-gram Models
1.1 Introduction
1.2 What Is an N-gram?
1.3 Computing the Probability of a Text Sequence
1.4 The Start and End of a Sentence
1.5 N-gram Language Models
1.6 Language Model Evaluation
1.7 When the Vocabulary Is Limited
1.8 Smoothing
2. Word Embeddings
2.1 Overview
2.2 One-Hot Vectors
2.3 Word Embeddings
2.4 Introduction to Word Embedding Methods
2.5 Continuous Bag-of-Words Model (CBOW)
2.6 Cleaning the Corpus and Tokenization
2.7 Transforming Words into Vectors