搜索资源列表
TYJreng
- 一个基于Hadoop进行分布工LDA模型训练的程序源代码完整,可直接使用。-Distribution of work based on Hadoop for a LDA model training program complete source code can be used directly.
MurmurHash
- mur hash是 Austin Appleby于2008年创立的一种非加密hash算法,适用于基于hash进行查找的场景。murmurhash在多个开源项目中得到应用,包括libstdc、libmemcached、nginx、hadoop等。-MurmurHash is a non-cryptographic hash function suitable for general hash-based lookup. It was created by Austin Appleby in 200
shortestPath
- 采用hadoop,解决海量数据中,图的并行计算,计算图的最短距离,主要用于社区数据分析-Using Hadoop Distributed to solve the shortest path problem, and and hbase do back-end database, and learn hadoop is a very good reference
kmeans-hadoop
- Kmeans hadoop分布式版本实现,用map-reduce将kmeans重新实现。-Kmeans hadoop distributed version achieved with map-reduce will kmeans reimplemented.
pagerank_mr
- pagerank算法在hadoop框架下用mapreduce实现-pagerank althogrithm
dataiku-hive-udf-master
- This dataki hadoop hive udf files , which includes some common functions-This is dataki hadoop hive udf files , which includes some common functions
Fk-menas
- 这个是一个基于hadoop上的模糊kmeans算法的实现。里面有一个测试数据。程序可以使用。里面有使用方法 代码易懂- hadoop mapreduce kmeans FKM FKmeans
mrMeanMapper
- 机器学习算法,分布式均值和方差计算的mapper,用于hadoop流-Machine learning algorithms, mapper distributed computing the mean and variance for hadoop streaming
Hadoop
- 使用hadoop开发,可以对输入文件中出现的关键词统计词频并进行不同文本词频统计高低的排序,本代码需要用户自行定义关键词和输入文件-Use hadoop development, can appear in the input file keyword statistics word frequency and low frequency statistics different sort of text, the code requires a user-defined keywords an
WordCount2
- 基于hadoop1.x的wordcount程序,jar包是全的,只要设置一下即可使用-a word count program depend on hadoop 1.x with all jar files needed,easy to use
hadoop-book-master
- 这个是hadoop权威指南的课后源码,包含书中的全部实例,方便大家学习-This is the authoritative guide to the after-school hadoop source, including all example in the book, it facilitates learning
wordcount
- 并行算法实现wordcount:基于hadoop平台,在Linux上测试: 包括C语言实现的串行算法和Java实现的并行算法。-Parallel algorithm based on hadoop on Linux for wordcount: Including serial algorithm and parallel algorithm based on hadoop.
CH-8.6---PageRank
- 非常著名的pagerank算法曾经在google使用,奠基的google在搜索王国的地位,本算法是hadoop的实现版本-Very famous pagerank algorithm has been used in google, the foundation of the google search in the Kingdom of the position, the algorithm is the realization of hadoop version
genetic-algorithm-based-on-Hadoop
- 基于Hadoop大数据平台实现遗传算法并行化,利用将任务划分为MapReduce过程进行分布式处理。-Based on Hadoop big data platform to achieve parallel genetic algorithm, the use of the task is divided into MapReduce process distributed processing.
Hadoop-data-find
- Hadoop数据挖掘算法 在mapreduce中的实现-Hadoop data mining algorithms implemented in the mapreduce
3个算法
- 同一个算法的Hadoop和spark实现,一共三个算法。(Hadoop and spark implementation of the same algorithm, a total of three algorithms)
recommend
- 该程序可实现电影推荐功能,运用hadoop的map,reduce,功能实现在大量不同热度电影中寻找适合推荐给个人的电影(The program can achieve the film recommended function, the use of hadoop map, reduce, function in a large number of different heat film to find suitable for the individual recommended to the
wordcount3
- hadoop的wordcount程序,去除标点和部分停词(Hadoop's wordcount program, removing punctuation and partial parked words)
wordcount
- 基于eclipse下的hadoop的wordcount程序(Hadoop's wordcount program)
hadoop-core-1.2.1
- 大数据文件,核心库,大数据开发用,要改成jar后缀(bigdata jar pakage,the .jar file for bigdata development)