搜索资源列表
Hadoop_src_fenxi
- 本文档对Hadoop的源代码进行分析。通过学习可以了解Hadoop的具体实现-This document Hadoop source code analysis. By learning to understand the specific implementation Hadoop
pagerank_mr
- pagerank算法在hadoop框架下用mapreduce实现-pagerank althogrithm
ImpalaJdbcClient
- hadoop impala,需要cdh4版本的hadoop+hive+mysql+impala-hadoop impala,cdh4hadoop+hive+mysql+impala
hdfs-replication-management-
- hdfs副本管理,详细介绍了hdfs分布式文件系统的副本维护原理-HDFS replica management, introduces in detail a copy of the hadoop distributed file system maintenance principle
centos6_2_Hadoop_INTSALL_CONFIG
- centos6.2系统在虚拟机中搭建Hadoop文档 建议用libreoffice打开。-Documentation of Install & Config Hadoop under Centos6.2 in a virtual machine. Recommends open this document package using libreoffice.
SequenceFileFlixer
- hadoop mapreduce 程序 重写了hadoop自定义的write函数 对付不好处理的数据格式 重新定义sequencefile的写文件方式-hadoop mapreduce program Rewritten hadoop custom write function Processed data format to deal with bad Redefining sequencefile write papers
HBaseExample
- Hbase是实时、多维数据库,是Hadoop的子项目,技术领先,上传的是Hbase example-Hbase example
LatestWordCount
- Hadoop word count example
matlab--implementation
- matlab 的安装实现,帮助大家成功的实现matlab的安装。-Hadoop installation implementation, to help you successfully hadoop installation.
wordCount
- python代码,利用hadoop分布式框架处理文本内容重的统计词频问题 -python code, use hadoop distributed framework for handling text heavy question word frequency statistics
pageRank
- python代码,利用hadoop分布式框架,处理与搜索引擎相关的pagerank问题 -python code, use hadoop distributed framework for dealing with pagerank search engine-related issues
starfish-0.1.0
- Starfish is a self-tuning system for big data analysis. Starfish equivalent of a performance optimization tool, allows Hadoop users and applications to achieve the best performance, and consists of three components: 1. Profiler What-if Engine Optimiz
newtest
- 在hadoop下实现的一个简单的哈夫曼算法,能对文件进行压缩-In hadoop achieve a simple Huffman algorithm, can compress the file
sqoop-sqlserver-1.0.tar
- 这是一个针对hadoop的hdfs和sqlserver之间互传文件的工具-This is one for hadoop s hdfs and transfer files between sqlserver tools
selected
- Example_Reduce编程模型实现海量数据处理—数字求和-Hadoop学习-Example_Reduce programming model to achieve massive data processing- Digital Sum-Hadoop learning
Project1_cs525
- Hadoop query Map/reduce
FileSystemCat
- Hadoop HDFS文件系统操作例程。功能包括:获取HDFS指定目录下所有文件列表,打印输出 递归遍历目录 上传本地文件到HDFS 在HDFS上Hadoop HDFS文件系统例程。功能包括:创建文件夹 创建HDFS文件 读取HDFS文件内容 重命名HDFS文件 删除HDFS文件及目录 查看HDFS文件是否存在 获取HDFS中指定目录中的文件列表. -Hadoop HDFS file system operations routine. Features include: HDFS to get
NetDisk
- Hadoop分布式文件系统HDFS访问例程,为Java界面程序,通过IOUtils.copyBytes可将 本地上传文件到HDFS;或从HDFS下载文件到本地硬盘。开发环境为Eclipse。-Hadoop Distributed File System HDFS access routines for Java interface program, through IOUtils.copyBytes can upload files to the local HDFS or downloa
preprocessing-data-map_reduce
- 介绍了怎样在window平台上搭建hadoop平台,以及利用map_reduce 进行数据处理的基础知识-Describes how to build a platform in the window hadoop platform for data processing and the use of map_reduce basics
DataGeneration
- hadoop下mapreduce程序,实现对搜索日志文件中关键词的统计,通过并行运算得到结果-Under hadoop mapreduce program to realize the log file keyword search statistics, the results obtained by means of parallel computing