• PySpark_Cookbook操作手册机器学习sparkml

    Combine the power of Apache Spark and Python to build effective big data applications About This Book ? Perform effective data processing, machine learning, and analytics using PySpark ? Overcome challenges in developing and deploying Spark solutions using Python ? Explore recipes for efficiently combining Python and Apache Spark to process data Who This Book Is For The PySpark Cookbook is for you if you are a Python developer looking for hands-on recipes for using the Apache Spark 2.x ecosystem in the best possible way. A thorough understanding of Python (and some familiarity with Spark) will help you get the best out of the book. What You Will Learn ?

    0
    0
    6.58MB
    2018-11-02
    16
  • Apache_Spark_Tutorial__Machine_Learning_with_PySpark_(Article)

    Apache Spark is known as a fast, easy-to-use and general engine for big data processing that has built-in modules for streaming, SQL, Machine Learning (ML) and graph processing. This technology is an in-demand skill for data engineers, but also data scientists can benefit from learning Spark when doing Exploratory Data Analysis (EDA), feature extraction and, of course, ML.

    0
    124
    1.88MB
    2018-11-02
    9
  • excel散点图插件

    在Excel里插入一个散点图之后,却发现这些散点的数据标签设置选项里,可以选择为 序列名称、X值、Y值,但就是无法指定为前面的项目名称。这么一个问题,不知难倒多少初学者

    0
    726
    47.71MB
    2018-10-29
    40
  • 大数据书序

    大数据书序 -胡强 胡强/宏源证券总经理 近年来,互联网与传统产业融合进程加速推进,传统产业的运营模式和游戏规则正在被逐步瓦解并再造。苹果、三星颠覆了传统手机终端,亚马逊、阿里巴巴、京东商城改变了传统零售业,Twitter、Facebook和微信撼动了传统媒体社交……这样的故事正在不断上演。

    5
    55
    21KB
    2013-06-17
    0
  • Serengeti虚拟化你的大数据应用

    Serengeti虚拟化你的大数据应用 Agenda •Today’s big data system •Why virtualize hadoop? •Serengeti introduction •Common questions about virtualization •Serengeti solution •Deep insight into Serengeti •Summary

    0
    84
    2.31MB
    2013-06-17
    0
  • 社会化媒体在英格索兰公司营销策略中的运用

    社会化媒体在英格索兰公司营销策略中的运用 第一个真正的通用社会化网站是 Friendster,它在推出的仅几个月内,就发展了 400万的注册用户,而在高峰时期亦每个星期都有 20 万的新用户加入。Friendster 在全球掀起了交友热潮,甚至掀起了第一波 SNS 浪潮,大批的竞争者继而争相效仿。

    0
    54
    3.12MB
    2013-06-17
    4
  • 如何选择适用企业的NoSQL解决方案

    如何选择适用企业的NoSQL解决方案 Highly Coupled (clustered): refers typically to a cluster of machines that closely work together, running a shared process in parallel. The task is subdivided in parts that are made individually by each one and then put back together to make the final result. Space Based (virtual single address space)

    0
    79
    446KB
    2013-06-17
    10
上传资源赚积分or赚钱