LieBrother

当才华撑不起野心时,应该静下心来学习;当能力驾驭不了目标时,应该沉下心来历练。


  • 首页

  • 归档

  • 分类

  • 标签

  • 关于

Flume集群搭建

发表于 2016-04-21   |   分类于 Flume   |     |   阅读次数

概念

集群的意思是多台机器,最少有2台机器,一台机器从数据源中获取数据,将数据传送到另一台机器上,然后输出。接下来就要实现Flume集群搭建。文中的集群如下图所示。
架构
这里我们需要2台机器,node1作为push推送数据,node2作为pull获取数据后显示出来。

配置pull.conf

【在node2机器上操作】
在conf目录下创建pull.conf文件

1
touch pull.conf

编辑pull.conf

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
#汇总数据代理的配置文件pull.conf
#Name the components on this agent
a1.sources= r1
a1.sinks= k1
a1.channels= c1

#Describe/configure the source
a1.sources.r1.type= avro
a1.sources.r1.channels= c1
a1.sources.r1.bind= node2
a1.sources.r1.port= 44444

#Describe the sink
a1.sinks.k1.type= logger
a1.sinks.k1.channel = c1

#Use a channel which buffers events in memory
a1.channels.c1.type= memory
a1.channels.c1.keep-alive= 10
a1.channels.c1.capacity= 100000
a1.channels.c1.transactionCapacity= 100000

配置push.conf

【在node1机器上操作】
在conf目录下创建push.conf文件

1
touch push.conf

编辑push.conf

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
#推数据代理的配置文件push.conf
#Name the components on this agent
a2.sources= r1
a2.sinks= k1
a2.channels= c1

#Describe/configure the source
a2.sources.r1.type= spooldir
a2.sources.r1.spoolDir= /csh/hadoop/flume/logs
a2.sources.r1.channels= c1

#Use a channel which buffers events in memory
a2.channels.c1.type= memory
a2.channels.c1.keep-alive= 10
a2.channels.c1.capacity= 100000
a2.channels.c1.transactionCapacity= 100000

#Describe/configure the source
a2.sinks.k1.type= avro
a2.sinks.k1.channel= c1
a2.sinks.k1.hostname= node2
a2.sinks.k1.port= 44444

创建spoolDir目录

【在node1中进行该操作】
根据push.conf中的配置 a2.sources.r1.spoolDir参数,创建目录,如果不先创建目录,则启动时会报错

1
mkdir -p /csh/hadoop/flume/logs

启动作为pull的主机

【本文为node2主机】

1
[root@node2 flume]# flume-ng agent -c conf -f conf/pull.conf -n a1 -Dflume.root.logger=INFO,console

显示如下信息则为启动成功

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
2016-04-20 00:08:15,550 (conf-file-poller-0) [INFO - org.apache.flume.node.PollingPropertiesFileConfigurationProvider$FileWatcherRunnable.run(PollingPropertiesFileConfigurationProvider.java:133)] Reloading configuration file:conf/pull.conf
2016-04-20 00:08:15,573 (conf-file-poller-0) [INFO - org.apache.flume.conf.FlumeConfiguration$AgentConfiguration.addProperty(FlumeConfiguration.java:931)] Added sinks: k1 Agent: a1
2016-04-20 00:08:15,573 (conf-file-poller-0) [INFO - org.apache.flume.conf.FlumeConfiguration$AgentConfiguration.addProperty(FlumeConfiguration.java:1017)] Processing:k1
2016-04-20 00:08:15,574 (conf-file-poller-0) [INFO - org.apache.flume.conf.FlumeConfiguration$AgentConfiguration.addProperty(FlumeConfiguration.java:1017)] Processing:k1
2016-04-20 00:08:15,621 (conf-file-poller-0) [INFO - org.apache.flume.conf.FlumeConfiguration.validateConfiguration(FlumeConfiguration.java:141)] Post-validation flume configuration contains configuration for agents: [a1]
2016-04-20 00:08:15,622 (conf-file-poller-0) [INFO - org.apache.flume.node.AbstractConfigurationProvider.loadChannels(AbstractConfigurationProvider.java:145)] Creating channels
2016-04-20 00:08:15,658 (conf-file-poller-0) [INFO - org.apache.flume.channel.DefaultChannelFactory.create(DefaultChannelFactory.java:42)] Creating instance of channel c1 type memory
2016-04-20 00:08:15,672 (conf-file-poller-0) [INFO - org.apache.flume.node.AbstractConfigurationProvider.loadChannels(AbstractConfigurationProvider.java:200)] Created channel c1
2016-04-20 00:08:15,677 (conf-file-poller-0) [INFO - org.apache.flume.source.DefaultSourceFactory.create(DefaultSourceFactory.java:41)] Creating instance of source r1, type avro
2016-04-20 00:08:15,732 (conf-file-poller-0) [INFO - org.apache.flume.sink.DefaultSinkFactory.create(DefaultSinkFactory.java:42)] Creating instance of sink: k1, type: logger
2016-04-20 00:08:15,735 (conf-file-poller-0) [INFO - org.apache.flume.node.AbstractConfigurationProvider.getConfiguration(AbstractConfigurationProvider.java:114)] Channel c1 connected to [r1, k1]
2016-04-20 00:08:15,750 (conf-file-poller-0) [INFO - org.apache.flume.node.Application.startAllComponents(Application.java:138)] Starting new configuration:{ sourceRunners:{r1=EventDrivenSourceRunner: { source:Avro source r1: { bindAddress: node2, port: 44444 } }} sinkRunners:{k1=SinkRunner: { policy:org.apache.flume.sink.DefaultSinkProcessor@ea5ba80 counterGroup:{ name:null counters:{} } }} channels:{c1=org.apache.flume.channel.MemoryChannel{name: c1}} }
2016-04-20 00:08:15,782 (conf-file-poller-0) [INFO - org.apache.flume.node.Application.startAllComponents(Application.java:145)] Starting Channel c1
2016-04-20 00:08:15,784 (conf-file-poller-0) [INFO - org.apache.flume.node.Application.startAllComponents(Application.java:160)] Waiting for channel: c1 to start. Sleeping for 500 ms
2016-04-20 00:08:15,897 (lifecycleSupervisor-1-2) [INFO - org.apache.flume.instrumentation.MonitoredCounterGroup.register(MonitoredCounterGroup.java:120)] Monitored counter group for type: CHANNEL, name: c1: Successfully registered new MBean.
2016-04-20 00:08:15,901 (lifecycleSupervisor-1-2) [INFO - org.apache.flume.instrumentation.MonitoredCounterGroup.start(MonitoredCounterGroup.java:96)] Component type: CHANNEL, name: c1 started
2016-04-20 00:08:16,285 (conf-file-poller-0) [INFO - org.apache.flume.node.Application.startAllComponents(Application.java:173)] Starting Sink k1
2016-04-20 00:08:16,288 (conf-file-poller-0) [INFO - org.apache.flume.node.Application.startAllComponents(Application.java:184)] Starting Source r1
2016-04-20 00:08:16,298 (lifecycleSupervisor-1-2) [INFO - org.apache.flume.source.AvroSource.start(AvroSource.java:228)] Starting Avro source r1: { bindAddress: node2, port: 44444 }...
2016-04-20 00:08:16,951 (lifecycleSupervisor-1-2) [INFO - org.apache.flume.instrumentation.MonitoredCounterGroup.register(MonitoredCounterGroup.java:120)] Monitored counter group for type: SOURCE, name: r1: Successfully registered new MBean.
2016-04-20 00:08:16,952 (lifecycleSupervisor-1-2) [INFO - org.apache.flume.instrumentation.MonitoredCounterGroup.start(MonitoredCounterGroup.java:96)] Component type: SOURCE, name: r1 started
2016-04-20 00:08:16,959 (lifecycleSupervisor-1-2) [INFO - org.apache.flume.source.AvroSource.start(AvroSource.java:253)] Avro source r1 started.

启动作为push的主机

【本文为node1主机】

1
[root@node1 flume]# flume-ng agent -n a2 -c conf -f conf/push.conf -Dflume.root.logger=INFO,console

显示如下信息则为启动成功

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
2016-04-20 00:11:58,196 (conf-file-poller-0) [INFO - org.apache.flume.node.AbstractConfigurationProvider.getConfiguration(AbstractConfigurationProvider.java:114)] Channel c1 connected to [r1, k1]
2016-04-20 00:11:58,226 (conf-file-poller-0) [INFO - org.apache.flume.node.Application.startAllComponents(Application.java:138)] Starting new configuration:{ sourceRunners:{r1=EventDrivenSourceRunner: { source:Spool Directory source r1: { spoolDir: /csh/hadoop/flume/logs } }} sinkRunners:{k1=SinkRunner: { policy:org.apache.flume.sink.DefaultSinkProcessor@6b089e25 counterGroup:{ name:null counters:{} } }} channels:{c1=org.apache.flume.channel.MemoryChannel{name: c1}} }
2016-04-20 00:11:58,236 (conf-file-poller-0) [INFO - org.apache.flume.node.Application.startAllComponents(Application.java:145)] Starting Channel c1
2016-04-20 00:11:58,360 (lifecycleSupervisor-1-1) [INFO - org.apache.flume.instrumentation.MonitoredCounterGroup.register(MonitoredCounterGroup.java:120)] Monitored counter group for type: CHANNEL, name: c1: Successfully registered new MBean.
2016-04-20 00:11:58,361 (lifecycleSupervisor-1-1) [INFO - org.apache.flume.instrumentation.MonitoredCounterGroup.start(MonitoredCounterGroup.java:96)] Component type: CHANNEL, name: c1 started
2016-04-20 00:11:58,362 (conf-file-poller-0) [INFO - org.apache.flume.node.Application.startAllComponents(Application.java:173)] Starting Sink k1
2016-04-20 00:11:58,369 (conf-file-poller-0) [INFO - org.apache.flume.node.Application.startAllComponents(Application.java:184)] Starting Source r1
2016-04-20 00:11:58,372 (lifecycleSupervisor-1-4) [INFO - org.apache.flume.source.SpoolDirectorySource.start(SpoolDirectorySource.java:78)] SpoolDirectorySource source starting with directory: /csh/hadoop/flume/logs
2016-04-20 00:11:58,388 (lifecycleSupervisor-1-0) [INFO - org.apache.flume.sink.AbstractRpcSink.start(AbstractRpcSink.java:289)] Starting RpcSink k1 { host: node2, port: 44444 }...
2016-04-20 00:11:58,409 (lifecycleSupervisor-1-0) [INFO - org.apache.flume.instrumentation.MonitoredCounterGroup.register(MonitoredCounterGroup.java:120)] Monitored counter group for type: SINK, name: k1: Successfully registered new MBean.
2016-04-20 00:11:58,409 (lifecycleSupervisor-1-0) [INFO - org.apache.flume.instrumentation.MonitoredCounterGroup.start(MonitoredCounterGroup.java:96)] Component type: SINK, name: k1 started
2016-04-20 00:11:58,409 (lifecycleSupervisor-1-0) [INFO - org.apache.flume.sink.AbstractRpcSink.createConnection(AbstractRpcSink.java:206)] Rpc sink k1: Building RpcClient with hostname: node2, port: 44444
2016-04-20 00:11:58,410 (lifecycleSupervisor-1-0) [INFO - org.apache.flume.sink.AvroSink.initializeRpcClient(AvroSink.java:126)] Attempting to create Avro Rpc client.
2016-04-20 00:11:58,458 (lifecycleSupervisor-1-0) [WARN - org.apache.flume.api.NettyAvroRpcClient.configure(NettyAvroRpcClient.java:634)] Using default maxIOWorkers
2016-04-20 00:11:58,536 (lifecycleSupervisor-1-4) [INFO - org.apache.flume.instrumentation.MonitoredCounterGroup.register(MonitoredCounterGroup.java:120)] Monitored counter group for type: SOURCE, name: r1: Successfully registered new MBean.
2016-04-20 00:11:58,536 (lifecycleSupervisor-1-4) [INFO - org.apache.flume.instrumentation.MonitoredCounterGroup.start(MonitoredCounterGroup.java:96)] Component type: SOURCE, name: r1 started
2016-04-20 00:11:59,263 (lifecycleSupervisor-1-0) [INFO - org.apache.flume.sink.AbstractRpcSink.start(AbstractRpcSink.java:303)] Rpc sink k1 started.

这时pull主机【本文为node2】输出信息表示连接成功

1
2
3
2016-04-20 00:11:58,875 (New I/O server boss #1 ([id: 0x71ba9ce2, /192.168.161.12:44444])) [INFO - org.apache.avro.ipc.NettyServer$NettyServerAvroHandler.handleUpstream(NettyServer.java:171)] [id: 0x7d9299a9, /192.168.161.11:44003 => /192.168.161.12:44444] OPEN
2016-04-20 00:11:58,880 (New I/O worker #1) [INFO - org.apache.avro.ipc.NettyServer$NettyServerAvroHandler.handleUpstream(NettyServer.java:171)] [id: 0x7d9299a9, /192.168.161.11:44003 => /192.168.161.12:44444] BOUND: /192.168.161.12:44444
2016-04-20 00:11:58,884 (New I/O worker #1) [INFO - org.apache.avro.ipc.NettyServer$NettyServerAvroHandler.handleUpstream(NettyServer.java:171)] [id: 0x7d9299a9, /192.168.161.11:44003 => /192.168.161.12:44444] CONNECTED: /192.168.161.11:44003

测试

在push主机中【本文为node1】的spoolDir目录【本文为/csh/hadoop/flume/logs】中创建test.log

1
2
vi test.log
# 输入内容 hello flume

这时push主机【本文为node1】中命令行输出如下

1
2
2016-04-20 00:13:09,274 (pool-4-thread-1) [INFO - org.apache.flume.client.avro.ReliableSpoolingFileEventReader.readEvents(ReliableSpoolingFileEventReader.java:258)] Last read took us just up to a file boundary. Rolling to the next file, if there is one.
2016-04-20 00:13:09,275 (pool-4-thread-1) [INFO - org.apache.flume.client.avro.ReliableSpoolingFileEventReader.rollCurrentFile(ReliableSpoolingFileEventReader.java:348)] Preparing to move file /csh/hadoop/flume/logs/test.log to /csh/hadoop/flume/logs/test.log.COMPLETED

pull主机【本文为node2】中命令行输出如下

1
2016-04-20 00:13:21,344 (SinkRunner-PollingRunner-DefaultSinkProcessor) [INFO - org.apache.flume.sink.LoggerSink.process(LoggerSink.java:94)] Event: { headers:{} body: 68 65 6C 6C 6F 20 66 6C 75 6D 65                hello flume }

证明Flume集群搭建成功

我们可以发现test.log被改名为test.log.COMPLETED

Flume单机安装与配置

发表于 2016-04-21   |   分类于 Flume   |     |   阅读次数

下载

apache-flume-1.6.0-bin.tar.gz

解压

1
tar -zxvf apache-flume-1.6.0-bin.tar.gz

配置

1
2
cp conf/flume-conf.properties.template conf/flume-conf.properties
cp conf/flume-env.sh.template conf/flume-env.sh

flume-env.sh

1
2
3
4
5
6
7
8
# Enviroment variables can be set here.
export JAVA_HOME=/csh/link/jdk

# Give Flume more memory and pre-allocate, enable remote monitoring via JMX
# export JAVA_OPTS="-Xms100m -Xmx2000m -Dcom.sun.management.jmxremote"

# Note that the Flume conf directory is always included in the classpath.
FLUME_CLASSPATH=/csh/link/flume/lib

单机配置

在conf目录下创建single-node.conf文件,并将下面内容复制粘贴

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
# example.conf: A single-node Flume configuration

# Name the components on this agent
a1.sources = r1
a1.sinks = k1
a1.channels = c1

# Describe/configure the source
a1.sources.r1.type = netcat
a1.sources.r1.bind = localhost
a1.sources.r1.port = 44444

# Describe the sink
a1.sinks.k1.type = logger

# Use a channel which buffers events in memory
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000
a1.channels.c1.transactionCapacity = 100

# Bind the source and sink to the channel
a1.sources.r1.channels = c1
a1.sinks.k1.channel = c1

运行

1
bin/flume-ng agent --conf conf --conf-file conf/single-node.conf --name a1 -Dflume.root.logger=INFO,console

命令行输出:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Info: Sourcing environment configuration script /csh/link/flume/conf/flume-env.sh
Info: Including Hadoop libraries found via (/csh/link/hadoop/bin/hadoop) for HDFS access
Info: Excluding /csh/software/hadoop-2.7.2/share/hadoop/common/lib/slf4j-api-1.7.10.jar from classpath
Info: Excluding /csh/software/hadoop-2.7.2/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar from classpath
Info: Including HBASE libraries found via (/csh/link/hbase/bin/hbase) for HBASE access
Info: Excluding /csh/link/hbase/lib/slf4j-api-1.7.7.jar from classpath
Info: Excluding /csh/link/hbase/lib/slf4j-log4j12-1.7.5.jar from classpath
Info: Excluding /csh/software/hadoop-2.7.2/share/hadoop/common/lib/slf4j-api-1.7.10.jar from classpath
Info: Excluding /csh/software/hadoop-2.7.2/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar from classpath
Info: Including Hive libraries found via (/csh/link/hive) for Hive access
+ exec /csh/link/jdk/bin/java -Xmx20m -Dflume.root.logger=INFO,console -cp '/csh/link/flume/conf:/csh/link/flume/lib/*:/csh/link/flume/lib:/csh/software/hadoop-2.7.2/etc/hadoop:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/activation-1.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/apacheds-i18n-2.0.0-M15.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/apacheds-kerberos-codec-2.0.0-M15.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/api-asn1-api-1.0.0-M20.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/api-util-1.0.0-M20.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/asm-3.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/avro-1.7.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-beanutils-1.7.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-beanutils-core-1.8.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-cli-1.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-codec-1.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-collections-3.2.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-compress-1.4.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-configuration-1.6.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-digester-1.8.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-httpclient-3.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-io-2.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-lang-2.6.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-logging-1.1.3.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-math3-3.1.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-net-3.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/curator-client-2.7.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/curator-framework-2.7.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/curator-recipes-2.7.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/gson-2.2.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/guava-11.0.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/hadoop-annotations-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/hadoop-auth-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/hamcrest-core-1.3.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/htrace-core-3.1.0-incubating.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/httpclient-4.2.5.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/httpcore-4.2.5.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jackson-core-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jackson-jaxrs-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jackson-mapper-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jackson-xc-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/java-xmlbuilder-0.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jaxb-api-2.2.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jaxb-impl-2.2.3-1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jersey-core-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jersey-json-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jersey-server-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jets3t-0.9.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jettison-1.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jetty-6.1.26.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jetty-util-6.1.26.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jsch-0.1.42.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jsp-api-2.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jsr305-3.0.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/junit-4.11.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/log4j-1.2.17.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/mockito-all-1.8.5.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/netty-3.6.2.Final.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/paranamer-2.3.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/protobuf-java-2.5.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/servlet-api-2.5.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/snappy-java-1.0.4.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/stax-api-1.0-2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/xmlenc-0.52.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/xz-1.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/zookeeper-3.4.6.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/hadoop-common-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/hadoop-common-2.7.2-tests.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/hadoop-nfs-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/jdiff:/csh/software/hadoop-2.7.2/share/hadoop/common/lib:/csh/software/hadoop-2.7.2/share/hadoop/common/sources:/csh/software/hadoop-2.7.2/share/hadoop/common/templates:/csh/software/hadoop-2.7.2/share/hadoop/hdfs:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/asm-3.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/commons-cli-1.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/commons-codec-1.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/commons-daemon-1.0.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/commons-io-2.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/commons-lang-2.6.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/commons-logging-1.1.3.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/guava-11.0.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/htrace-core-3.1.0-incubating.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/jackson-core-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/jackson-mapper-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/jersey-core-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/jersey-server-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/jetty-6.1.26.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/jetty-util-6.1.26.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/jsr305-3.0.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/leveldbjni-all-1.8.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/log4j-1.2.17.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/netty-3.6.2.Final.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/netty-all-4.0.23.Final.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/protobuf-java-2.5.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/servlet-api-2.5.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/xercesImpl-2.9.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/xml-apis-1.3.04.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/xmlenc-0.52.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/hadoop-hdfs-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/hadoop-hdfs-2.7.2-tests.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/hadoop-hdfs-nfs-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/jdiff:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/sources:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/templates:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/webapps:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/activation-1.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/aopalliance-1.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/asm-3.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/commons-cli-1.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/commons-codec-1.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/commons-collections-3.2.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/commons-compress-1.4.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/commons-io-2.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/commons-lang-2.6.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/commons-logging-1.1.3.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/guava-11.0.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/guice-3.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/guice-servlet-3.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jackson-core-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jackson-jaxrs-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jackson-mapper-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jackson-xc-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/javax.inject-1.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jaxb-api-2.2.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jaxb-impl-2.2.3-1.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jersey-client-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jersey-core-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jersey-guice-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jersey-json-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jersey-server-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jettison-1.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jetty-6.1.26.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jetty-util-6.1.26.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jsr305-3.0.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/leveldbjni-all-1.8.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/log4j-1.2.17.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/netty-3.6.2.Final.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/protobuf-java-2.5.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/servlet-api-2.5.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/stax-api-1.0-2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/xz-1.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/zookeeper-3.4.6.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/zookeeper-3.4.6-tests.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-api-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-applications-distributedshell-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-applications-unmanaged-am-launcher-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-client-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-common-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-registry-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-server-applicationhistoryservice-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-server-common-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-server-nodemanager-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-server-resourcemanager-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-server-sharedcachemanager-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-server-tests-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-server-web-proxy-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib:/csh/software/hadoop-2.7.2/share/hadoop/yarn/sources:/csh/software/hadoop-2.7.2/share/hadoop/yarn/test:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/aopalliance-1.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/asm-3.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/avro-1.7.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/commons-compress-1.4.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/commons-io-2.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/guice-3.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/guice-servlet-3.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/hadoop-annotations-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/hamcrest-core-1.3.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/jackson-core-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/jackson-mapper-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/javax.inject-1.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/jersey-core-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/jersey-guice-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/jersey-server-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/junit-4.11.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/leveldbjni-all-1.8.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/log4j-1.2.17.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/netty-3.6.2.Final.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/paranamer-2.3.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/protobuf-java-2.5.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/snappy-java-1.0.4.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/xz-1.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-app-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-common-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-core-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-hs-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-hs-plugins-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-2.7.2-tests.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-shuffle-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib-examples:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/sources:/csh/link/hadoop/contrib/capacity-scheduler/*.jar:/csh/link/hbase/conf:/csh/link/jdk/lib/tools.jar:/csh/link/hbase:/csh/link/hbase/lib/activation-1.1.jar:/csh/link/hbase/lib/antisamy-1.4.3.jar:/csh/link/hbase/lib/aopalliance-1.0.jar:/csh/link/hbase/lib/apacheds-i18n-2.0.0-M15.jar:/csh/link/hbase/lib/apacheds-kerberos-codec-2.0.0-M15.jar:/csh/link/hbase/lib/api-asn1-api-1.0.0-M20.jar:/csh/link/hbase/lib/api-util-1.0.0-M20.jar:/csh/link/hbase/lib/asm-3.1.jar:/csh/link/hbase/lib/avro-1.7.4.jar:/csh/link/hbase/lib/batik-css-1.7.jar:/csh/link/hbase/lib/batik-ext-1.7.jar:/csh/link/hbase/lib/batik-util-1.7.jar:/csh/link/hbase/lib/bsh-core-2.0b4.jar:/csh/link/hbase/lib/commons-beanutils-1.7.0.jar:/csh/link/hbase/lib/commons-beanutils-core-1.7.0.jar:/csh/link/hbase/lib/commons-cli-1.2.jar:/csh/link/hbase/lib/commons-codec-1.9.jar:/csh/link/hbase/lib/commons-collections-3.2.2.jar:/csh/link/hbase/lib/commons-compress-1.4.1.jar:/csh/link/hbase/lib/commons-configuration-1.6.jar:/csh/link/hbase/lib/commons-daemon-1.0.13.jar:/csh/link/hbase/lib/commons-digester-1.8.jar:/csh/link/hbase/lib/commons-el-1.0.jar:/csh/link/hbase/lib/commons-fileupload-1.2.jar:/csh/link/hbase/lib/commons-httpclient-3.1.jar:/csh/link/hbase/lib/commons-io-2.4.jar:/csh/link/hbase/lib/commons-lang-2.6.jar:/csh/link/hbase/lib/commons-logging-1.2.jar:/csh/link/hbase/lib/commons-math-2.2.jar:/csh/link/hbase/lib/commons-math3-3.1.1.jar:/csh/link/hbase/lib/commons-net-3.1.jar:/csh/link/hbase/lib/disruptor-3.3.0.jar:/csh/link/hbase/lib/esapi-2.1.0.jar:/csh/link/hbase/lib/findbugs-annotations-1.3.9-1.jar:/csh/link/hbase/lib/guava-12.0.1.jar:/csh/link/hbase/lib/guice-3.0.jar:/csh/link/hbase/lib/guice-servlet-3.0.jar:/csh/link/hbase/lib/hadoop-annotations-2.5.1.jar:/csh/link/hbase/lib/hadoop-auth-2.5.1.jar:/csh/link/hbase/lib/hadoop-client-2.5.1.jar:/csh/link/hbase/lib/hadoop-common-2.5.1.jar:/csh/link/hbase/lib/hadoop-hdfs-2.5.1.jar:/csh/link/hbase/lib/hadoop-mapreduce-client-app-2.5.1.jar:/csh/link/hbase/lib/hadoop-mapreduce-client-common-2.5.1.jar:/csh/link/hbase/lib/hadoop-mapreduce-client-core-2.5.1.jar:/csh/link/hbase/lib/hadoop-mapreduce-client-jobclient-2.5.1.jar:/csh/link/hbase/lib/hadoop-mapreduce-client-shuffle-2.5.1.jar:/csh/link/hbase/lib/hadoop-yarn-api-2.5.1.jar:/csh/link/hbase/lib/hadoop-yarn-client-2.5.1.jar:/csh/link/hbase/lib/hadoop-yarn-common-2.5.1.jar:/csh/link/hbase/lib/hadoop-yarn-server-common-2.5.1.jar:/csh/link/hbase/lib/hbase-annotations-1.1.4.jar:/csh/link/hbase/lib/hbase-annotations-1.1.4-tests.jar:/csh/link/hbase/lib/hbase-client-1.1.4.jar:/csh/link/hbase/lib/hbase-common-1.1.4.jar:/csh/link/hbase/lib/hbase-common-1.1.4-tests.jar:/csh/link/hbase/lib/hbase-examples-1.1.4.jar:/csh/link/hbase/lib/hbase-hadoop2-compat-1.1.4.jar:/csh/link/hbase/lib/hbase-hadoop-compat-1.1.4.jar:/csh/link/hbase/lib/hbase-it-1.1.4.jar:/csh/link/hbase/lib/hbase-it-1.1.4-tests.jar:/csh/link/hbase/lib/hbase-prefix-tree-1.1.4.jar:/csh/link/hbase/lib/hbase-procedure-1.1.4.jar:/csh/link/hbase/lib/hbase-protocol-1.1.4.jar:/csh/link/hbase/lib/hbase-resource-bundle-1.1.4.jar:/csh/link/hbase/lib/hbase-rest-1.1.4.jar:/csh/link/hbase/lib/hbase-server-1.1.4.jar:/csh/link/hbase/lib/hbase-server-1.1.4-tests.jar:/csh/link/hbase/lib/hbase-shell-1.1.4.jar:/csh/link/hbase/lib/hbase-thrift-1.1.4.jar:/csh/link/hbase/lib/htrace-core-3.1.0-incubating.jar:/csh/link/hbase/lib/httpclient-4.2.5.jar:/csh/link/hbase/lib/httpcore-4.1.3.jar:/csh/link/hbase/lib/jackson-core-asl-1.9.13.jar:/csh/link/hbase/lib/jackson-jaxrs-1.9.13.jar:/csh/link/hbase/lib/jackson-mapper-asl-1.9.13.jar:/csh/link/hbase/lib/jackson-xc-1.9.13.jar:/csh/link/hbase/lib/jamon-runtime-2.3.1.jar:/csh/link/hbase/lib/jasper-compiler-5.5.23.jar:/csh/link/hbase/lib/jasper-runtime-5.5.23.jar:/csh/link/hbase/lib/javax.inject-1.jar:/csh/link/hbase/lib/java-xmlbuilder-0.4.jar:/csh/link/hbase/lib/jaxb-api-2.2.2.jar:/csh/link/hbase/lib/jaxb-impl-2.2.3-1.jar:/csh/link/hbase/lib/jcodings-1.0.8.jar:/csh/link/hbase/lib/jersey-client-1.9.jar:/csh/link/hbase/lib/jersey-core-1.9.jar:/csh/link/hbase/lib/jersey-guice-1.9.jar:/csh/link/hbase/lib/jersey-json-1.9.jar:/csh/link/hbase/lib/jersey-server-1.9.jar:/csh/link/hbase/lib/jets3t-0.9.0.jar:/csh/link/hbase/lib/jettison-1.3.3.jar:/csh/link/hbase/lib/jetty-6.1.26.jar:/csh/link/hbase/lib/jetty-sslengine-6.1.26.jar:/csh/link/hbase/lib/jetty-util-6.1.26.jar:/csh/link/hbase/lib/joni-2.1.2.jar:/csh/link/hbase/lib/jruby-complete-1.6.8.jar:/csh/link/hbase/lib/jsch-0.1.42.jar:/csh/link/hbase/lib/jsp-2.1-6.1.14.jar:/csh/link/hbase/lib/jsp-api-2.1-6.1.14.jar:/csh/link/hbase/lib/jsr305-1.3.9.jar:/csh/link/hbase/lib/junit-4.12.jar:/csh/link/hbase/lib/leveldbjni-all-1.8.jar:/csh/link/hbase/lib/libthrift-0.9.0.jar:/csh/link/hbase/lib/log4j-1.2.17.jar:/csh/link/hbase/lib/metrics-core-2.2.0.jar:/csh/link/hbase/lib/nekohtml-1.9.12.jar:/csh/link/hbase/lib/netty-3.2.4.Final.jar:/csh/link/hbase/lib/netty-all-4.0.23.Final.jar:/csh/link/hbase/lib/paranamer-2.3.jar:/csh/link/hbase/lib/protobuf-java-2.5.0.jar:/csh/link/hbase/lib/servlet-api-2.5-6.1.14.jar:/csh/link/hbase/lib/servlet-api-2.5.jar:/csh/link/hbase/lib/snappy-java-1.0.4.1.jar:/csh/link/hbase/lib/spymemcached-2.11.6.jar:/csh/link/hbase/lib/xalan-2.7.0.jar:/csh/link/hbase/lib/xml-apis-1.3.03.jar:/csh/link/hbase/lib/xml-apis-ext-1.3.04.jar:/csh/link/hbase/lib/xmlenc-0.52.jar:/csh/link/hbase/lib/xom-1.2.5.jar:/csh/link/hbase/lib/xz-1.0.jar:/csh/link/hbase/lib/zookeeper-3.4.6.jar:/csh/software/hadoop-2.7.2/etc/hadoop:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/activation-1.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/apacheds-i18n-2.0.0-M15.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/apacheds-kerberos-codec-2.0.0-M15.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/api-asn1-api-1.0.0-M20.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/api-util-1.0.0-M20.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/asm-3.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/avro-1.7.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-beanutils-1.7.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-beanutils-core-1.8.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-cli-1.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-codec-1.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-collections-3.2.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-compress-1.4.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-configuration-1.6.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-digester-1.8.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-httpclient-3.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-io-2.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-lang-2.6.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-logging-1.1.3.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-math3-3.1.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/commons-net-3.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/curator-client-2.7.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/curator-framework-2.7.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/curator-recipes-2.7.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/gson-2.2.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/guava-11.0.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/hadoop-annotations-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/hadoop-auth-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/hamcrest-core-1.3.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/htrace-core-3.1.0-incubating.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/httpclient-4.2.5.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/httpcore-4.2.5.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jackson-core-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jackson-jaxrs-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jackson-mapper-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jackson-xc-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/java-xmlbuilder-0.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jaxb-api-2.2.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jaxb-impl-2.2.3-1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jersey-core-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jersey-json-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jersey-server-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jets3t-0.9.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jettison-1.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jetty-6.1.26.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jetty-util-6.1.26.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jsch-0.1.42.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jsp-api-2.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/jsr305-3.0.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/junit-4.11.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/log4j-1.2.17.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/mockito-all-1.8.5.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/netty-3.6.2.Final.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/paranamer-2.3.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/protobuf-java-2.5.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/servlet-api-2.5.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/snappy-java-1.0.4.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/stax-api-1.0-2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/xmlenc-0.52.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/xz-1.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/zookeeper-3.4.6.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/hadoop-common-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/hadoop-common-2.7.2-tests.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/hadoop-nfs-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/common/jdiff:/csh/software/hadoop-2.7.2/share/hadoop/common/lib:/csh/software/hadoop-2.7.2/share/hadoop/common/sources:/csh/software/hadoop-2.7.2/share/hadoop/common/templates:/csh/software/hadoop-2.7.2/share/hadoop/hdfs:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/asm-3.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/commons-cli-1.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/commons-codec-1.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/commons-daemon-1.0.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/commons-io-2.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/commons-lang-2.6.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/commons-logging-1.1.3.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/guava-11.0.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/htrace-core-3.1.0-incubating.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/jackson-core-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/jackson-mapper-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/jersey-core-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/jersey-server-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/jetty-6.1.26.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/jetty-util-6.1.26.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/jsr305-3.0.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/leveldbjni-all-1.8.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/log4j-1.2.17.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/netty-3.6.2.Final.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/netty-all-4.0.23.Final.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/protobuf-java-2.5.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/servlet-api-2.5.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/xercesImpl-2.9.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/xml-apis-1.3.04.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib/xmlenc-0.52.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/hadoop-hdfs-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/hadoop-hdfs-2.7.2-tests.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/hadoop-hdfs-nfs-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/jdiff:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/lib:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/sources:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/templates:/csh/software/hadoop-2.7.2/share/hadoop/hdfs/webapps:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/activation-1.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/aopalliance-1.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/asm-3.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/commons-cli-1.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/commons-codec-1.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/commons-collections-3.2.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/commons-compress-1.4.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/commons-io-2.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/commons-lang-2.6.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/commons-logging-1.1.3.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/guava-11.0.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/guice-3.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/guice-servlet-3.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jackson-core-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jackson-jaxrs-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jackson-mapper-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jackson-xc-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/javax.inject-1.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jaxb-api-2.2.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jaxb-impl-2.2.3-1.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jersey-client-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jersey-core-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jersey-guice-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jersey-json-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jersey-server-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jettison-1.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jetty-6.1.26.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jetty-util-6.1.26.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/jsr305-3.0.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/leveldbjni-all-1.8.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/log4j-1.2.17.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/netty-3.6.2.Final.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/protobuf-java-2.5.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/servlet-api-2.5.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/stax-api-1.0-2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/xz-1.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/zookeeper-3.4.6.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib/zookeeper-3.4.6-tests.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-api-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-applications-distributedshell-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-applications-unmanaged-am-launcher-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-client-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-common-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-registry-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-server-applicationhistoryservice-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-server-common-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-server-nodemanager-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-server-resourcemanager-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-server-sharedcachemanager-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-server-tests-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/hadoop-yarn-server-web-proxy-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/yarn/lib:/csh/software/hadoop-2.7.2/share/hadoop/yarn/sources:/csh/software/hadoop-2.7.2/share/hadoop/yarn/test:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/aopalliance-1.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/asm-3.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/avro-1.7.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/commons-compress-1.4.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/commons-io-2.4.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/guice-3.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/guice-servlet-3.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/hadoop-annotations-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/hamcrest-core-1.3.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/jackson-core-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/jackson-mapper-asl-1.9.13.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/javax.inject-1.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/jersey-core-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/jersey-guice-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/jersey-server-1.9.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/junit-4.11.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/leveldbjni-all-1.8.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/log4j-1.2.17.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/netty-3.6.2.Final.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/paranamer-2.3.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/protobuf-java-2.5.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/snappy-java-1.0.4.1.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib/xz-1.0.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-app-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-common-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-core-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-hs-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-hs-plugins-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-jobclient-2.7.2-tests.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-client-shuffle-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/hadoop-mapreduce-examples-2.7.2.jar:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/lib-examples:/csh/software/hadoop-2.7.2/share/hadoop/mapreduce/sources:/csh/link/hadoop/contrib/capacity-scheduler/*.jar:/csh/link/hbase/conf:/csh/link/hive/lib/*' -Djava.library.path=:/csh/software/hadoop-2.7.2/lib/native:/csh/software/hadoop-2.7.2/lib/native org.apache.flume.node.Application --conf-file conf/single-node.conf --name a1
2016-04-19 21:44:11,126 (lifecycleSupervisor-1-0) [INFO - org.apache.flume.node.PollingPropertiesFileConfigurationProvider.start(PollingPropertiesFileConfigurationProvider.java:61)] Configuration provider starting
2016-04-19 21:44:11,165 (conf-file-poller-0) [INFO - org.apache.flume.node.PollingPropertiesFileConfigurationProvider$FileWatcherRunnable.run(PollingPropertiesFileConfigurationProvider.java:133)] Reloading configuration file:conf/single-node.conf
2016-04-19 21:44:11,200 (conf-file-poller-0) [INFO - org.apache.flume.conf.FlumeConfiguration$AgentConfiguration.addProperty(FlumeConfiguration.java:931)] Added sinks: k1 Agent: a1
2016-04-19 21:44:11,200 (conf-file-poller-0) [INFO - org.apache.flume.conf.FlumeConfiguration$AgentConfiguration.addProperty(FlumeConfiguration.java:1017)] Processing:k1
2016-04-19 21:44:11,201 (conf-file-poller-0) [INFO - org.apache.flume.conf.FlumeConfiguration$AgentConfiguration.addProperty(FlumeConfiguration.java:1017)] Processing:k1
2016-04-19 21:44:11,285 (conf-file-poller-0) [INFO - org.apache.flume.conf.FlumeConfiguration.validateConfiguration(FlumeConfiguration.java:141)] Post-validation flume configuration contains configuration for agents: [a1]
2016-04-19 21:44:11,285 (conf-file-poller-0) [INFO - org.apache.flume.node.AbstractConfigurationProvider.loadChannels(AbstractConfigurationProvider.java:145)] Creating channels
2016-04-19 21:44:11,341 (conf-file-poller-0) [INFO - org.apache.flume.channel.DefaultChannelFactory.create(DefaultChannelFactory.java:42)] Creating instance of channel c1 type memory
2016-04-19 21:44:11,506 (conf-file-poller-0) [INFO - org.apache.flume.node.AbstractConfigurationProvider.loadChannels(AbstractConfigurationProvider.java:200)] Created channel c1
2016-04-19 21:44:11,507 (conf-file-poller-0) [INFO - org.apache.flume.source.DefaultSourceFactory.create(DefaultSourceFactory.java:41)] Creating instance of source r1, type netcat
2016-04-19 21:44:11,590 (conf-file-poller-0) [INFO - org.apache.flume.sink.DefaultSinkFactory.create(DefaultSinkFactory.java:42)] Creating instance of sink: k1, type: logger
2016-04-19 21:44:11,610 (conf-file-poller-0) [INFO - org.apache.flume.node.AbstractConfigurationProvider.getConfiguration(AbstractConfigurationProvider.java:114)] Channel c1 connected to [r1, k1]
2016-04-19 21:44:11,684 (conf-file-poller-0) [INFO - org.apache.flume.node.Application.startAllComponents(Application.java:138)] Starting new configuration:{ sourceRunners:{r1=EventDrivenSourceRunner: { source:org.apache.flume.source.NetcatSource{name:r1,state:IDLE} }} sinkRunners:{k1=SinkRunner: { policy:org.apache.flume.sink.DefaultSinkProcessor@7ceacc49 counterGroup:{ name:null counters:{} } }} channels:{c1=org.apache.flume.channel.MemoryChannel{name: c1}} }
2016-04-19 21:44:11,713 (conf-file-poller-0) [INFO - org.apache.flume.node.Application.startAllComponents(Application.java:145)] Starting Channel c1
2016-04-19 21:44:17,439 (lifecycleSupervisor-1-0) [INFO - org.apache.flume.instrumentation.MonitoredCounterGroup.register(MonitoredCounterGroup.java:120)] Monitored counter group for type: CHANNEL, name: c1: Successfully registered new MBean.
2016-04-19 21:44:17,440 (lifecycleSupervisor-1-0) [INFO - org.apache.flume.instrumentation.MonitoredCounterGroup.start(MonitoredCounterGroup.java:96)] Component type: CHANNEL, name: c1 started
2016-04-19 21:44:17,452 (conf-file-poller-0) [INFO - org.apache.flume.node.Application.startAllComponents(Application.java:173)] Starting Sink k1
2016-04-19 21:44:17,462 (conf-file-poller-0) [INFO - org.apache.flume.node.Application.startAllComponents(Application.java:184)] Starting Source r1
2016-04-19 21:44:17,474 (lifecycleSupervisor-1-1) [INFO - org.apache.flume.source.NetcatSource.start(NetcatSource.java:150)] Source starting
2016-04-19 21:44:18,012 (lifecycleSupervisor-1-1) [INFO - org.apache.flume.source.NetcatSource.start(NetcatSource.java:164)] Created serverSocket:sun.nio.ch.ServerSocketChannelImpl[/127.0.0.1:44444]

测试

在XShell中打开另一个窗口输入:

1
2
3
4
5
6
7
8
[root@node1 ~]# telnet 127.0.0.1 44444
Trying 127.0.0.1...
Connected to 127.0.0.1.
Escape character is '^]'.
hello flume!
OK
hello big data
OK

在原来启动flume的命令行中可以看到:

1
2
2016-04-19 21:47:35,653 (SinkRunner-PollingRunner-DefaultSinkProcessor) [INFO - org.apache.flume.sink.LoggerSink.process(LoggerSink.java:94)] Event: { headers:{} body: 68 65 6C 6C 6F 20 66 6C 75 6D 65 21 0D          hello flume!. }
2016-04-19 21:49:10,610 (SinkRunner-PollingRunner-DefaultSinkProcessor) [INFO - org.apache.flume.sink.LoggerSink.process(LoggerSink.java:94)] Event: { headers:{} body: 68 65 6C 6C 6F 20 62 69 67 20 64 61 74 61 0D hello big data. }

Sqoop HDFS导入到MySQL

发表于 2016-04-21   |   分类于 Sqoop   |     |   阅读次数

在MySQL中创建表

1
2
3
4
5
6
CREATE TABLE `tree1` (
`id` int(11) NOT NULL AUTO_INCREMENT,
`treeNumber` varchar(100) NOT NULL,
`productinformationId` int(11) NOT NULL,
PRIMARY KEY (`id`)
) DEFAULT CHARSET=utf8;

执行命令

1
sqoop export --connect jdbc:mysql://node1:3306/phx \--username root \--table tree1 -m 1 \--export-dir /sqoop/tree2

结果

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
[root@node1 conf]# sqoop export --connect jdbc:mysql://node1:3306/phx \--username root \--table tree1 -m 1 \--export-dir /sqoop/tree2
Warning: /csh/link/sqoop/../hcatalog does not exist! HCatalog jobs will fail.
Please set $HCAT_HOME to the root of your HCatalog installation.
Warning: /csh/link/sqoop/../accumulo does not exist! Accumulo imports will fail.
Please set $ACCUMULO_HOME to the root of your Accumulo installation.
16/04/19 01:10:23 INFO sqoop.Sqoop: Running Sqoop version: 1.4.6
16/04/19 01:10:23 INFO manager.MySQLManager: Preparing to use a MySQL streaming resultset.
16/04/19 01:10:23 INFO tool.CodeGenTool: Beginning code generation
16/04/19 01:10:24 INFO manager.SqlManager: Executing SQL statement: SELECT t.* FROM `tree1` AS t LIMIT 1
16/04/19 01:10:24 INFO manager.SqlManager: Executing SQL statement: SELECT t.* FROM `tree1` AS t LIMIT 1
16/04/19 01:10:24 INFO orm.CompilationManager: HADOOP_MAPRED_HOME is /csh/link/hadoop
Note: /tmp/sqoop-root/compile/8efdea8893437a1359c77de6d1bd2395/tree1.java uses or overrides a deprecated API.
Note: Recompile with -Xlint:deprecation for details.
16/04/19 01:10:28 INFO orm.CompilationManager: Writing jar file: /tmp/sqoop-root/compile/8efdea8893437a1359c77de6d1bd2395/tree1.jar
16/04/19 01:10:29 INFO mapreduce.ExportJobBase: Beginning export of tree1
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/csh/software/hbase-1.1.4/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
16/04/19 01:10:30 INFO Configuration.deprecation: mapred.jar is deprecated. Instead, use mapreduce.job.jar
16/04/19 01:10:38 INFO Configuration.deprecation: mapred.reduce.tasks.speculative.execution is deprecated. Instead, use mapreduce.reduce.speculative
16/04/19 01:10:38 INFO Configuration.deprecation: mapred.map.tasks.speculative.execution is deprecated. Instead, use mapreduce.map.speculative
16/04/19 01:10:38 INFO Configuration.deprecation: mapred.map.tasks is deprecated. Instead, use mapreduce.job.maps
16/04/19 01:10:38 INFO client.RMProxy: Connecting to ResourceManager at node1/192.168.161.11:8032
16/04/19 01:10:48 INFO input.FileInputFormat: Total input paths to process : 1
16/04/19 01:10:48 INFO input.FileInputFormat: Total input paths to process : 1
16/04/19 01:10:49 INFO mapreduce.JobSubmitter: number of splits:1
16/04/19 01:10:49 INFO Configuration.deprecation: mapred.map.tasks.speculative.execution is deprecated. Instead, use mapreduce.map.speculative
16/04/19 01:10:49 INFO mapreduce.JobSubmitter: Submitting tokens for job: job_1461026006811_0003
16/04/19 01:10:51 INFO impl.YarnClientImpl: Submitted application application_1461026006811_0003
16/04/19 01:10:51 INFO mapreduce.Job: The url to track the job: http://node1:8088/proxy/application_1461026006811_0003/
16/04/19 01:10:51 INFO mapreduce.Job: Running job: job_1461026006811_0003
16/04/19 01:11:33 INFO mapreduce.Job: Job job_1461026006811_0003 running in uber mode : false
16/04/19 01:11:33 INFO mapreduce.Job: map 0% reduce 0%
16/04/19 01:12:24 INFO mapreduce.Job: map 100% reduce 0%
16/04/19 01:12:27 INFO mapreduce.Job: Job job_1461026006811_0003 completed successfully
16/04/19 01:12:27 INFO mapreduce.Job: Counters: 30
File System Counters
FILE: Number of bytes read=0
FILE: Number of bytes written=137080
FILE: Number of read operations=0
FILE: Number of large read operations=0
FILE: Number of write operations=0
HDFS: Number of bytes read=251
HDFS: Number of bytes written=0
HDFS: Number of read operations=4
HDFS: Number of large read operations=0
HDFS: Number of write operations=0
Job Counters
Launched map tasks=1
Rack-local map tasks=1
Total time spent by all maps in occupied slots (ms)=45106
Total time spent by all reduces in occupied slots (ms)=0
Total time spent by all map tasks (ms)=45106
Total vcore-milliseconds taken by all map tasks=45106
Total megabyte-milliseconds taken by all map tasks=46188544
Map-Reduce Framework
Map input records=7
Map output records=7
Input split bytes=122
Spilled Records=0
Failed Shuffles=0
Merged Map outputs=0
GC time elapsed (ms)=258
CPU time spent (ms)=1680
Physical memory (bytes) snapshot=86806528
Virtual memory (bytes) snapshot=2086785024
Total committed heap usage (bytes)=17235968
File Input Format Counters
Bytes Read=0
File Output Format Counters
Bytes Written=0
16/04/19 01:12:27 INFO mapreduce.ExportJobBase: Transferred 251 bytes in 109.7035 seconds (2.288 bytes/sec)
16/04/19 01:12:27 INFO mapreduce.ExportJobBase: Exported 7 records.

在MySQL中查看tree1表中数据

数据库表

Sqoop MySQL 导入到Hive

发表于 2016-04-21   |   分类于 Sqoop   |     |   阅读次数

将数据库phx中的tree表的数据导入到Hive中

命令:

1
sqoop import --connect jdbc:mysql://node1:3306/phx \--username root \--table tree \--hive-import \--hive-overwrite \--create-hive-table \--hive-table tree1 \--target-dir /sqoop/tree3

结果:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
[root@node1 conf]# sqoop import --connect jdbc:mysql://node1:3306/phx \--username root \--table tree \--hive-import \--hive-overwrite \--create-hive-table \--hive-table tree1 \--target-dir /sqoop/tree3
Warning: /csh/link/sqoop/../hcatalog does not exist! HCatalog jobs will fail.
Please set $HCAT_HOME to the root of your HCatalog installation.
Warning: /csh/link/sqoop/../accumulo does not exist! Accumulo imports will fail.
Please set $ACCUMULO_HOME to the root of your Accumulo installation.
16/04/18 21:20:00 INFO sqoop.Sqoop: Running Sqoop version: 1.4.6
16/04/18 21:20:00 INFO tool.BaseSqoopTool: Using Hive-specific delimiters for output. You can override
16/04/18 21:20:00 INFO tool.BaseSqoopTool: delimiters with --fields-terminated-by, etc.
16/04/18 21:20:00 INFO manager.MySQLManager: Preparing to use a MySQL streaming resultset.
16/04/18 21:20:00 INFO tool.CodeGenTool: Beginning code generation
16/04/18 21:20:00 INFO manager.SqlManager: Executing SQL statement: SELECT t.* FROM `tree` AS t LIMIT 1
16/04/18 21:20:00 INFO manager.SqlManager: Executing SQL statement: SELECT t.* FROM `tree` AS t LIMIT 1
16/04/18 21:20:00 INFO orm.CompilationManager: HADOOP_MAPRED_HOME is /csh/link/hadoop
Note: /tmp/sqoop-root/compile/9cff004a8ff405b712c978864c2775df/tree.java uses or overrides a deprecated API.
Note: Recompile with -Xlint:deprecation for details.
16/04/18 21:20:03 INFO orm.CompilationManager: Writing jar file: /tmp/sqoop-root/compile/9cff004a8ff405b712c978864c2775df/tree.jar
16/04/18 21:20:03 WARN manager.MySQLManager: It looks like you are importing from mysql.
16/04/18 21:20:03 WARN manager.MySQLManager: This transfer can be faster! Use the --direct
16/04/18 21:20:03 WARN manager.MySQLManager: option to exercise a MySQL-specific fast path.
16/04/18 21:20:03 INFO manager.MySQLManager: Setting zero DATETIME behavior to convertToNull (mysql)
16/04/18 21:20:03 INFO mapreduce.ImportJobBase: Beginning import of tree
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/csh/software/hbase-1.1.4/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
16/04/18 21:20:04 INFO Configuration.deprecation: mapred.jar is deprecated. Instead, use mapreduce.job.jar
16/04/18 21:20:09 INFO Configuration.deprecation: mapred.map.tasks is deprecated. Instead, use mapreduce.job.maps
16/04/18 21:20:10 INFO client.RMProxy: Connecting to ResourceManager at node1/192.168.161.11:8032
16/04/18 21:20:23 INFO db.DBInputFormat: Using read commited transaction isolation
16/04/18 21:20:23 INFO db.DataDrivenDBInputFormat: BoundingValsQuery: SELECT MIN(`id`), MAX(`id`) FROM `tree`
16/04/18 21:20:24 INFO mapreduce.JobSubmitter: number of splits:4
16/04/18 21:20:25 INFO mapreduce.JobSubmitter: Submitting tokens for job: job_1461026006811_0001
16/04/18 21:20:27 INFO impl.YarnClientImpl: Submitted application application_1461026006811_0001
16/04/18 21:20:28 INFO mapreduce.Job: The url to track the job: http://node1:8088/proxy/application_1461026006811_0001/
16/04/18 21:20:28 INFO mapreduce.Job: Running job: job_1461026006811_0001
16/04/18 21:21:01 INFO mapreduce.Job: Job job_1461026006811_0001 running in uber mode : false
16/04/18 21:21:01 INFO mapreduce.Job: map 0% reduce 0%
16/04/18 21:22:00 INFO mapreduce.Job: map 25% reduce 0%
16/04/18 21:23:13 INFO mapreduce.Job: map 50% reduce 0%
16/04/18 21:23:14 INFO mapreduce.Job: map 100% reduce 0%
16/04/18 21:23:21 INFO mapreduce.Job: Job job_1461026006811_0001 completed successfully
16/04/18 21:23:22 INFO mapreduce.Job: Counters: 31
File System Counters
FILE: Number of bytes read=0
FILE: Number of bytes written=549568
FILE: Number of read operations=0
FILE: Number of large read operations=0
FILE: Number of write operations=0
HDFS: Number of bytes read=398
HDFS: Number of bytes written=373
HDFS: Number of read operations=16
HDFS: Number of large read operations=0
HDFS: Number of write operations=8
Job Counters
Killed map tasks=4
Launched map tasks=7
Other local map tasks=7
Total time spent by all maps in occupied slots (ms)=585524
Total time spent by all reduces in occupied slots (ms)=0
Total time spent by all map tasks (ms)=585524
Total vcore-milliseconds taken by all map tasks=585524
Total megabyte-milliseconds taken by all map tasks=599576576
Map-Reduce Framework
Map input records=21
Map output records=21
Input split bytes=398
Spilled Records=0
Failed Shuffles=0
Merged Map outputs=0
GC time elapsed (ms)=722
CPU time spent (ms)=15670
Physical memory (bytes) snapshot=394702848
Virtual memory (bytes) snapshot=8350515200
Total committed heap usage (bytes)=68792320
File Input Format Counters
Bytes Read=0
File Output Format Counters
Bytes Written=373
16/04/18 21:23:22 INFO mapreduce.ImportJobBase: Transferred 373 bytes in 193.0206 seconds (1.9324 bytes/sec)
16/04/18 21:23:22 INFO mapreduce.ImportJobBase: Retrieved 21 records.
16/04/18 21:23:23 INFO manager.SqlManager: Executing SQL statement: SELECT t.* FROM `tree` AS t LIMIT 1
16/04/18 21:23:23 INFO hive.HiveImport: Loading uploaded data into Hive
16/04/18 21:25:54 INFO hive.HiveImport:
16/04/18 21:25:54 INFO hive.HiveImport: Logging initialized using configuration in file:/csh/software/apache-hive-1.2.1-bin/conf/hive-log4j.properties
16/04/18 21:25:57 INFO hive.HiveImport: SLF4J: Class path contains multiple SLF4J bindings.
16/04/18 21:25:57 INFO hive.HiveImport: SLF4J: Found binding in [jar:file:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class]
16/04/18 21:25:57 INFO hive.HiveImport: SLF4J: Found binding in [jar:file:/csh/software/hbase-1.1.4/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
16/04/18 21:25:57 INFO hive.HiveImport: SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
16/04/18 21:25:57 INFO hive.HiveImport: SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
16/04/18 21:27:27 INFO hive.HiveImport: OK
16/04/18 21:27:28 INFO hive.HiveImport: Time taken: 23.5 seconds
16/04/18 21:27:30 INFO hive.HiveImport: Loading data to table default.tree1
16/04/18 21:27:35 INFO hive.HiveImport: Table default.tree1 stats: [numFiles=4, numRows=0, totalSize=373, rawDataSize=0]
16/04/18 21:27:35 INFO hive.HiveImport: OK
16/04/18 21:27:35 INFO hive.HiveImport: Time taken: 8.27 seconds
16/04/18 21:27:36 INFO hive.HiveImport: Hive import complete.
16/04/18 21:27:40 INFO hive.HiveImport: Export directory is contains the _SUCCESS file only, removing the directory.

浏览HDFS如下图:
HDFS

在Hive中检查是否有数据:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
hive> select * from tree1;
OK
1 466464684640 1
2 466464684641 2
3 466464684642 3
4 466464684643 1
5 466464684644 2
6 466464684645 3
7 466464684646 1
8 466464684647 2
9 466464684648 3
10 466464684649 1
11 4664646846410 2
12 4664646846411 3
13 4664646846412 1
14 4664646846413 2
15 4664646846414 3
16 4664646846415 1
17 4664646846416 2
18 4664646846417 3
19 4664646846418 1
20 4664646846419 2
21 111111 1
Time taken: 2.622 seconds, Fetched: 21 row(s)

参数说明

参数 说明
–hive-home Hive的安装目录,可以通过该参数覆盖掉默认的hive目录
–hive-overwrite 覆盖掉在hive表中已经存在的数据
–create-hive-table 默认是false,如果目标表已经存在了,那么创建任务会失败
–hive-table 后面接要创建的hive表
–table 指定关系数据库表名

Sqoop MySQL 导入到HDFS

发表于 2016-04-21   |   分类于 Sqoop   |     |   阅读次数

从MySQL数据库phx中读取tree表到HDFS

执行命令:

1
sqoop import --connect jdbc:mysql://node1:3306/phx \--username root --table tree --m 1

命令行输出:

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
Warning: /csh/link/sqoop/../hcatalog does not exist! HCatalog jobs will fail.
Please set $HCAT_HOME to the root of your HCatalog installation.
Warning: /csh/link/sqoop/../accumulo does not exist! Accumulo imports will fail.
Please set $ACCUMULO_HOME to the root of your Accumulo installation.
16/04/10 07:14:24 INFO sqoop.Sqoop: Running Sqoop version: 1.4.6
16/04/10 07:14:25 INFO manager.MySQLManager: Preparing to use a MySQL streaming resultset.
16/04/10 07:14:25 INFO tool.CodeGenTool: Beginning code generation
16/04/10 07:14:25 INFO manager.SqlManager: Executing SQL statement: SELECT t.* FROM `tree` AS t LIMIT 1
16/04/10 07:14:26 INFO manager.SqlManager: Executing SQL statement: SELECT t.* FROM `tree` AS t LIMIT 1
16/04/10 07:14:26 INFO orm.CompilationManager: HADOOP_MAPRED_HOME is /csh/link/hadoop
Note: /tmp/sqoop-root/compile/7c9c9a4ffecefa757a4f2073109cc801/tree.java uses or overrides a deprecated API.
Note: Recompile with -Xlint:deprecation for details.
16/04/10 07:14:29 INFO orm.CompilationManager: Writing jar file: /tmp/sqoop-root/compile/7c9c9a4ffecefa757a4f2073109cc801/tree.jar
16/04/10 07:14:30 WARN manager.MySQLManager: It looks like you are importing from mysql.
16/04/10 07:14:30 WARN manager.MySQLManager: This transfer can be faster! Use the --direct
16/04/10 07:14:30 WARN manager.MySQLManager: option to exercise a MySQL-specific fast path.
16/04/10 07:14:30 INFO manager.MySQLManager: Setting zero DATETIME behavior to convertToNull (mysql)
16/04/10 07:14:30 INFO mapreduce.ImportJobBase: Beginning import of tree
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/csh/software/hadoop-2.7.2/share/hadoop/common/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/csh/software/hbase-1.1.4/lib/slf4j-log4j12-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
16/04/10 07:14:30 INFO Configuration.deprecation: mapred.jar is deprecated. Instead, use mapreduce.job.jar
16/04/10 07:14:32 INFO Configuration.deprecation: mapred.map.tasks is deprecated. Instead, use mapreduce.job.maps
16/04/10 07:14:32 INFO client.RMProxy: Connecting to ResourceManager at node1/192.168.161.11:8032
16/04/10 07:14:43 INFO db.DBInputFormat: Using read commited transaction isolation
16/04/10 07:14:43 INFO mapreduce.JobSubmitter: number of splits:1
16/04/10 07:14:44 INFO mapreduce.JobSubmitter: Submitting tokens for job: job_1460265828636_0005
16/04/10 07:14:50 INFO impl.YarnClientImpl: Submitted application application_1460265828636_0005
16/04/10 07:14:50 INFO mapreduce.Job: The url to track the job: http://node1:8088/proxy/application_1460265828636_0005/
16/04/10 07:14:50 INFO mapreduce.Job: Running job: job_1460265828636_0005
16/04/10 07:15:29 INFO mapreduce.Job: Job job_1460265828636_0005 running in uber mode : false
16/04/10 07:15:29 INFO mapreduce.Job: map 0% reduce 0%
16/04/10 07:16:15 INFO mapreduce.Job: map 100% reduce 0%
16/04/10 07:16:17 INFO mapreduce.Job: Job job_1460265828636_0005 completed successfully
16/04/10 07:16:17 INFO mapreduce.Job: Counters: 30
File System Counters
FILE: Number of bytes read=0
FILE: Number of bytes written=137395
FILE: Number of read operations=0
FILE: Number of large read operations=0
FILE: Number of write operations=0
HDFS: Number of bytes read=87
HDFS: Number of bytes written=361
HDFS: Number of read operations=4
HDFS: Number of large read operations=0
HDFS: Number of write operations=2
Job Counters
Launched map tasks=1
Other local map tasks=1
Total time spent by all maps in occupied slots (ms)=42224
Total time spent by all reduces in occupied slots (ms)=0
Total time spent by all map tasks (ms)=42224
Total vcore-milliseconds taken by all map tasks=42224
Total megabyte-milliseconds taken by all map tasks=43237376
Map-Reduce Framework
Map input records=20
Map output records=20
Input split bytes=87
Spilled Records=0
Failed Shuffles=0
Merged Map outputs=0
GC time elapsed (ms)=151
CPU time spent (ms)=3460
Physical memory (bytes) snapshot=86683648
Virtual memory (bytes) snapshot=2083274752
Total committed heap usage (bytes)=16318464
File Input Format Counters
Bytes Read=0
File Output Format Counters
Bytes Written=361
16/04/10 07:16:17 INFO mapreduce.ImportJobBase: Transferred 361 bytes in 105.2373 seconds (3.4303 bytes/sec)
16/04/10 07:16:17 INFO mapreduce.ImportJobBase: Retrieved 20 records.

访问 http://node1:8088 可以看到生成一个MapReduce任务

结果:
这里写图片描述

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
[root@node1 sbin]# hadoop fs -cat /user/root/tree/p*
1,466464684640,1
2,466464684641,2
3,466464684642,3
4,466464684643,1
5,466464684644,2
6,466464684645,3
7,466464684646,1
8,466464684647,2
9,466464684648,3
10,466464684649,1
11,4664646846410,2
12,4664646846411,3
13,4664646846412,1
14,4664646846413,2
15,4664646846414,3
16,4664646846415,1
17,4664646846416,2
18,4664646846417,3
19,4664646846418,1
20,4664646846419,2

导入到指定目录

参数:

1
--target-dir /directory

例如:

1
sqoop import --connect jdbc:mysql://node1:3306/phx \--username root --table tree --m 1 \--target-dir /sqoop/

结果:
这里写图片描述

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
[root@node1 sbin]# hadoop fs -cat /sqoop/p*
1,466464684640,1
2,466464684641,2
3,466464684642,3
4,466464684643,1
5,466464684644,2
6,466464684645,3
7,466464684646,1
8,466464684647,2
9,466464684648,3
10,466464684649,1
11,4664646846410,2
12,4664646846411,3
13,4664646846412,1
14,4664646846413,2
15,4664646846414,3
16,4664646846415,1
17,4664646846416,2
18,4664646846417,3
19,4664646846418,1
20,4664646846419,2

导入表子集

参数:

1
--where <condition>

例如:

1
sqoop import --connect jdbc:mysql://node1:3306/phx \--username root --table tree --m 1 \--where "productinformationId"='1' \--target-dir /sqoop/tree2/

结果:
这里写图片描述

1
2
3
4
5
6
7
8
[root@node1 sbin]# hadoop fs -cat /sqoop/tree2/p*
1,466464684640,1
4,466464684643,1
7,466464684646,1
10,466464684649,1
13,4664646846412,1
16,4664646846415,1
19,4664646846418,1

增量导入(即只导入新添加的那部分数据)
参数:

1
2
3
--incremental <mode> 选择模式,有:append 和 lastmodified
--check-column <column name> 检查那一列的元素,比如 id
--last-value <last check column value> 原来数据最后一条的参数,比如 最后一条id为20,则填20

例如:
在数据库表tree中添加一条记录,如下图
这里写图片描述
运行命令:

1
sqoop import --connect jdbc:mysql://node1:3306/phx \--username root --table tree --m 1 \--target-dir /sqoop/ \--incremental append \--check-column id \--last-value 20

结果:
这里写图片描述

参数列表

参数 说明
–append 将数据追加到hdfs中已经存在的dataset中。使用该参数,sqoop将把数据先导入到一个临时目录中,然后重新给文件命名到一个正式的目录中,以避免和该目录中已存在的文件重名。
–as-avrodatafile 将数据导入到一个Avro数据文件中
–as-sequencefile 将数据导入到一个sequence文件中
–as-textfile 将数据导入到一个普通文本文件中,生成该文本文件后,可以在hive中通过sql语句查询出结果。
–boundary-query 边界查询,也就是在导入前先通过SQL查询得到一个结果集,然后导入的数据就是该结果集内的数据,格式如:–boundary-query ‘select id,no from t where id = 3’,表示导入的数据为id=3的记录,或者 select min(), max() from ,注意查询的字段中不能有数据类型为字符串的字段,否则会报错
–columns 指定要导入的字段值,格式如:–columns id,username
–direct 直接导入模式,使用的是关系数据库自带的导入导出工具。官网上是说这样导入会更快
–direct-split-size 在使用上面direct直接导入的基础上,对导入的流按字节数分块,特别是使用直连模式从PostgreSQL导入数据的时候,可以将一个到达设定大小的文件分为几个独立的文件。
–inline-lob-limit 设定大对象数据类型的最大值
-m,–num-mappers 启动N个map来并行导入数据,默认是4个,最好不要将数字设置为高于集群的节点数
–query,-e 从查询结果中导入数据,该参数使用时必须指定–target-dir、–hive-table,在查询语句中一定要有where条件且在where条件中需要包含 \$CONDITIONS,示例:–query ‘select * from t where \$CONDITIONS ‘ –target-dir /tmp/t –hive-table t
–split-by 表的列名,用来切分工作单元,一般后面跟主键ID
–table 关系数据库表名,数据从该表中获取
–delete-target-dir 删除目标目录
–target-dir 指定hdfs路径
–warehouse-dir 与 –target-dir 不能同时使用,指定数据导入的存放目录,适用于hdfs导入,不适合导入hive目录
–where 从关系数据库导入数据时的查询条件,示例:–where “id = 2”
-z,–compress 压缩参数,默认情况下数据是没被压缩的,通过该参数可以使用gzip压缩算法对数据进行压缩,适用于SequenceFile, text文本文件, 和Avro文件
–compression-codec Hadoop压缩编码,默认是gzip
–null-string 可选参数,如果没有指定,则字符串null将被使用
–null-non-string 可选参数,如果没有指定,则字符串null将被使用
1…192021…24
LieBrother

LieBrother

当才华撑不起野心时,应该静下心来学习;当能力驾驭不了目标时,应该沉下心来历练。

120 日志
38 分类
138 标签
© 2016 - 2019 LieBrother
由 Hexo 强力驱动
主题 - NexT.Mist
本站访客数人次  |  本站总访问量次