@@ -21,107 +21,46 @@ spark-assembly-1.5.2-hadoop2.6.0.jar(下载地址: http://pan.baidu.com/s/1hrSxi
21
21
SparkLearning项目带有数据,下载会比较慢,如果只想下载部分文件夹,可以使用svn。另外也在20160810弄了一个没有数据的project,方便下载:https://github.com/xubo245/SparkLearning_NoData
22
22
23
23
# 3.具体博客目录: #
24
- ## (1).Spark基本学习篇: ##
25
- spark学习1之examples运行:http://blog.csdn.net/xubo245/article/details/48548079
26
- spark学习2之OutOfMemoryError错误的解决办法:http://blog.csdn.net/xubo245/article/details/48548507
27
- spark学习3之examples中的SparkPi:http://blog.csdn.net/xubo245/article/details/50596227
28
- spark学习4之集群上直接用scalac编译.scala出现的MissingRequirementError问题(已解决):http://blog.csdn.net/xubo245/article/details/50596822
29
- spark学习5之sbt问题:http://blog.csdn.net/xubo245/article/details/50603502
30
- spark学习6之scala版本不同的问题:http://blog.csdn.net/xubo245/article/ details/50609476
31
- spark学习7之IDEA下搭建SPark本地编译环境并上传到集群运行:http://blog.csdn.net/xubo245/article/details/50789983
32
- spark学习8之eclipse安装scala2.10和spark编译环境并上传到集群运行:http://blog.csdn.net/xubo245/article/details/50790463
33
- spark学习9之在window下进行源码编译打包:http://blog.csdn.net/xubo245/article/details/51386564
34
- spark学习10之将spark的AppName设置为自动获取当前类名:http://blog.csdn.net/xubo245/article/details/51428158
35
- spark学习11之在idea中将eclipse导入的java project改成maven project:http://blog.csdn.net/xubo245/article/details/51428502
24
+ ## (1).Spark基本学习篇: ##
25
+ [ SparkBaseLearning] ( docs/spark/SparkBaseLearning )
26
+
36
27
37
28
## (2).Spark代码篇: ##
38
- Spark代码1之RDDparallelizeSaveAsFile:http://blog.csdn.net/xubo245/article/details/50791485
39
- Spark代码2之Transformation:union,distinct,join:http://blog.csdn.net/xubo245/article/details/50792201
40
- Spark代码3之Action:reduce,reduceByKey,sorted,lookup,take,saveAsTextFile:http://blog.csdn.net/xubo245/article/details/50800934
41
- Spark代码4之Spark 文件API及其对搜狗数据的操作:http://blog.csdn.net/xubo245/article/details/50801827
29
+ [ SparkCodeLearning] ( docs/Spark/SparkCodeLearning )
42
30
43
31
44
32
## (3).Spark组件之Mllib学习篇 ##
45
- Spark中组件Mllib的学习1之Kmeans错误解决:http://blog.csdn.net/xubo245/article/details/51007690
46
- Spark中组件Mllib的学习2之MovieLensALS学习(集群run-eaxmples运行):http://blog.csdn.net/xubo245/article/details/51264145
47
- Spark中组件Mllib的学习3之用户相似度计算:http://blog.csdn.net/xubo245/article/details/51428175
48
- Spark中组件Mllib的学习4之examples中的MovieLensALS修改本地运行:http://blog.csdn.net/xubo245/article/details/51429221
49
- Spark中组件Mllib的学习5之ALS测试(apache spark):http://blog.csdn.net/xubo245/article/details/51429365
50
- Spark中组件Mllib的学习6之ALS测试(apache spark 含隐式转换):http://blog.csdn.net/xubo245/article/details/51429391
51
- Spark中组件Mllib的学习7之ALS隐式转换训练的model来预测数据:http://blog.csdn.net/xubo245/article/details/51429490
52
- Spark中组件Mllib的学习8之ALS训练的model来预测数据:http://blog.csdn.net/xubo245/article/details/51429503
53
- Spark中组件Mllib的学习9之ALS训练的model来预测数据的准确率研究:http://blog.csdn.net/xubo245/article/details/51439208
54
- Spark中组件Mllib的学习10之修改MovieLens来对movieLen中的100k数据进行预测:http://blog.csdn.net/xubo245/article/details/51439491
55
- Spark中组件Mllib的学习11之使用ALS对movieLens中一百万条(1M)数据集进行训练,并对输入的新用户数据进行电影推荐:http://blog.csdn.net/xubo245/article/details/51439920
56
- 更多请见:https://github.com/xubo245/SparkLearning/tree/master/docs/Spark%20MLlib%E5%AD%A6%E4%B9%A0
33
+ [ MLlibLearning] ( docs\Spark\MLlibLearning )
57
34
58
35
## (4).Spark组件之SparkSQL学习篇 ##
59
- Spark组件之SparkSQL学习1之问题报错No TypeTag available for Person:http://blog.csdn.net/xubo245/article/details/51153243
60
- SparkSQL在代码库中还有不少,当时没写成博客
36
+ [ SparkSQLLearning] ( docs\Spark\SparkSQLLearning )
61
37
62
38
## (5).Spark组件之SparkR学习篇 ##
63
- Spark组件之SparkR学习1--安装与测试:http://blog.csdn.net/xubo245/article/details/51195287
64
- Spark组件之SparkR学习2--使用spark-submit向集群提交R代码文件dataframe.R:http://blog.csdn.net/xubo245/article/details/51199216
65
- Spark组件之SparkR学习3--使用spark-submit向集群提交R代码文件data-manipulation.R:http://blog.csdn.net/xubo245/article/details/51199813
66
- Spark组件之SparkR学习4--Eclipse下R语言环境搭建:http://blog.csdn.net/xubo245/article/details/51199918
67
- Spark组件之SparkR学习5--R语言函数调用(跨文件调用):http://blog.csdn.net/xubo245/article/details/51205276
39
+ [ SparkRLearning] ( docs\Spark\SparkRLearning )
68
40
69
41
## (6).Spark组件之Spark Streaming学习篇 ##
70
- Spark组件之Spark Streaming学习1--NetworkWordCount学习:http://blog.csdn.net/xubo245/article/details/51251970
71
- Spark组件之Spark Streaming学习2--StatefulNetworkWordCount 学习:http://blog.csdn.net/xubo245/article/details/51252142
72
- Spark组件之Spark Streaming学习3--结合SparkSQL的使用(wordCount):http://blog.csdn.net/xubo245/article/details/51252229
73
- Spark组件之Spark Streaming学习4--HdfsWordCount 学习:http://blog.csdn.net/xubo245/article/details/51254412
42
+ [ SparkStreamingLearning] ( docs\Spark\SparkStreamingLearning )
74
43
75
44
## (7). Spark组件之GraphX学习篇 ##
76
- Spark组件之GraphX学习1--入门实例Property Graph:http://blog.csdn.net/xubo245/article/details/51306975
77
- Spark组件之GraphX学习2--triplets实践:http://blog.csdn.net/xubo245/article/details/51307037
78
- Spark组件之GraphX学习3--Structural Operators:subgraph:http://blog.csdn.net/xubo245/article/details/51307162
79
- Spark组件之GraphX学习4--Structural Operators:mask:http://blog.csdn.net/xubo245/article/details/51307237
80
- Spark组件之GraphX学习5--随机图生成和消息发送aggregateMessages以及mapreduce操作(含源码分析):http://blog.csdn.net/xubo245/article/details/51307386
81
- Spark组件之GraphX学习6--随机图生成和出度入度等信息显示:http://blog.csdn.net/xubo245/article/details/51307641
82
- Spark组件之GraphX学习7--随机图生成和reduce最大或最小出度/入度/度:http://blog.csdn.net/xubo245/article/details/51307774
83
- Spark组件之GraphX学习8--随机图生成和TopK最大入度:http://blog.csdn.net/xubo245/article/details/51308278
84
- Spark组件之GraphX学习8--邻居集合:http://blog.csdn.net/xubo245/article/details/51308337
85
- Spark组件之GraphX学习9--使用pregel函数求单源最短路径:http://blog.csdn.net/xubo245/article/details/51314928
86
- Spark组件之GraphX学习10--PageRank学习和使用(From examples):http://blog.csdn.net/xubo245/article/details/51315240
87
- Spark组件之GraphX学习11--PageRank例子(PageRankAboutBerkeleyWiki):http://blog.csdn.net/xubo245/article/details/51316151
88
- Spark组件之GraphX学习12--GraphX常见操作汇总SimpleGraphX:http://blog.csdn.net/xubo245/article/details/51316317
89
- Spark组件之GraphX学习13--ConnectedComponents操作:http://blog.csdn.net/xubo245/article/details/51316654
90
- Spark组件之GraphX学习14--TriangleCount实例和分析:http://blog.csdn.net/xubo245/article/details/51317245
91
- Spark组件之GraphX学习15--we-Google.txt大图分析:http://blog.csdn.net/xubo245/article/details/51317594
92
- Spark组件之GraphX学习16--最短路径ShortestPaths:http://blog.csdn.net/xubo245/article/details/51317892
93
- Spark组件之GraphX学习20--待学习部分:http://blog.csdn.net/xubo245/article/details/51317710
45
+ [ GraphXLearning] ( docs\Spark\GraphXLearning )
94
46
95
47
96
48
## (8).Spark-Avro学习篇 ##
97
- Spark-Avro学习1之使用SparkSQL读取AVRO文件:http://blog.csdn.net/xubo245/article/details/51295474
98
- Spark-Avro学习2之使用byDatabricksSparkAvroL读取AVRO文件:http://blog.csdn.net/xubo245/article/details/51295593
99
- Spark-Avro学习3之使用AvroCompression存储AVRO文件:http://blog.csdn.net/xubo245/article/details/51295604
100
- Spark-Avro学习4之使用AvroWritePartitioned存储AVRO文件时进行划分:http://blog.csdn.net/xubo245/article/details/51295627
101
- Spark-Avro学习5之使用AvroReadSpecifyName存储AVRO文件时指定name和namespace:http://blog.csdn.net/xubo245/article/details/51295642
102
- Spark-Avro学习6之Ubuntu下安装:http://blog.csdn.net/xubo245/article/details/51295674
103
- Spark-Avro学习7之Java Avro使用(生成code方式):http://blog.csdn.net/xubo245/article/details/51295843
104
- Spark-Avro学习8之Java Avro使用(不生成code方式): Spark-Avro 学习8之Java Avro使用(不生成code方式)
105
- Spark-Avro学习9之SCALA环境下Avro使用(不生成code方式):http://blog.csdn.net/xubo245/article/details/51296717
49
+ [ SparkAvroLearning] ( docs\Spark\SparkAvroLearning )
106
50
107
- ## (9).Spark生态之Tachyon学习篇 ##
108
- Spark生态之Tachyon学习1---单机版搭建和运行(Alluxio):http://blog.csdn.net/xubo245/article/details/51318566
109
- Spark生态之Tachyon学习2---Spark从tachyon中读取文件(Alluxio):http://blog.csdn.net/xubo245/article/details/51318863
110
- Spark生态之Tachyon学习3---机器重启后数据存储位置的变化:http://blog.csdn.net/xubo245/article/details/51322437
111
- Spark生态之Tachyon学习4---下载源码通过maven install安装失败记录:http://blog.csdn.net/xubo245/article/details/51322911
112
- Spark生态之Tachyon学习5--tachyon的几个问题(待解决):http://blog.csdn.net/xubo245/article/details/51323101
113
- Spark生态之Tachyon学习6---集群版搭建和运行(Alluxio):http://blog.csdn.net/xubo245/article/details/51324273
114
- Spark生态之Tachyon学习7--下载源码通过maven安装成功:http://blog.csdn.net/xubo245/article/details/51325776
115
- Spark生态之Tachyon学习6---集群版搭建问题之集群无法全部启动:http://blog.csdn.net/xubo245/article/details/51325834
116
- Spark生态之Tachyon学习7---Tachyon的优点:http://blog.csdn.net/xubo245/article/details/51326644
51
+ ## (9).Spark生态之Alluxio(Tachyon)学习篇 ##
52
+ [ AlluxioLearning] ( docs\Spark\AlluxioLearning )
117
53
118
54
119
55
## (10).Spark生态之spark-csv篇: ##
120
- Spark生态之Spark-csv学习1之安装和简单的examples: http://blog.csdn.net/xubo245/article/details/51184946
56
+ [ SparkCsvLearning ] ( docs\Spark\SparkCsvLearning )
121
57
122
58
## (11).Spark疑问篇 ##
123
- Spark疑问1之如何查看sparkContext没有关闭的sc:http://blog.csdn.net/xubo245/article/details/51173463
124
- Spark疑问2之spark 丢了executor会恢复吗?:http://blog.csdn.net/xubo245/article/details/51173493
59
+ [ SparkQuestion] ( docs\Spark\SparkQuestion )
60
+
61
+ ## (12).MLLearning: ##
62
+
63
+ [ MLLearning] ( docs\Spark\MLLearning )
125
64
126
- ## (12).其他: ##
127
- MLlib学习文档: https://github.com/xubo245/SparkLearning/tree/master/ docs/Spark%20MLlibLearning
65
+ ## (13). Spark源码学习
66
+ [ SparkSourceLearning ] ( docs\SparkSourceLearning )
0 commit comments