Contents
1 Foreword
2 Scope
3 Normative References
4 Terms and Definitions
4.1 Big Data
4.2 Big Data Platform
4.3 Big Data Development Technology
4.4 Batch Processing
4.5 Ad Hoc Query
5 Overview
6 Big Data Development Process
6.1 Data Collection
6.2 Data Governance
6.2.1 Data Cleansing
6.2.2 Data Comparison
6.2.3 Data Standardization
6.3 Data Storage and Batch Processing
6.4 Big Data OLAP Analysis
6.5 Data Presentation
7 Big Data Development Design Method
7.1 Determining the Business Scenario
7.2 Identifying Data Sources and Determining the Data Scope
7.3 Designing Models and Algorithms
7.4 Defining Big Data Analysis Services
8 Main Technical Requirements for Big Data Development
8.1 Data Integration Tools
8.2 SQL
8.3 Batch Processing Programming