1、下载shark-0.9.1-bin-hadoop2
解压缩,配置conf/shark-env.sh
export JAVA_HOME=/usr/lib/jvm/java-7-oracleexport SCALA_HOME=/usr/lib/scalaexport HADOOP_HOME=/root/app/hadoop-2.2.0export SPARK_HOME=/root/app/spark-1.0.0-bin-hadoop2export HIVE_HOME=/root/app/apache-hive-0.13.1-binexport MASTER=spark://192.168.2.215:8088
2、下载apache-hive-0.13.1-bin
解压缩,配置conf/hive-env.sh
HADOOP_HOME=/root/app/hadoop-2.2.0# Hive Configuration Directory can be controlled by:export HIVE_CONF_DIR=/root/app/apache-hive-0.13.1-bin/conf
配置conf/hive-site.xml 以及conf/hive-default.xml(从hive-default.xml.template拷贝过来)
javax.jdo.option.ConnectionURL jdbc:mysql://192.168.2.215:3306/hive?createDatabaseIfNotExist=true JDBC connect string for a JDBC metastore javax.jdo.option.ConnectionDriverName com.mysql.jdbc.Driver Driver class name for a JDBC metastore
配置用户名和密码
javax.jdo.option.ConnectionUserName root username to use against metastore database javax.jdo.option.ConnectionPassword xxx password to use against metastore database
3、启动shark
./shark-withinfo
报错
java.lang.VerifyError: class org.apache.hadoop.hdfs.protocol.proto.ClientNamenodeProtocolProtos$SetOwnerRequestProto overrides final method getUnknownFields.()Lcom/google/protobuf/UnknownFieldSet
修改
root@xxx:~/app/shark-0.9.1-bin-hadoop2/lib_managed/jars/edu.berkeley.cs.shark/hive-exec# jar -xf hive-exec-0.11.0-shark-0.9.1.jar root@xxx:~/app/shark-0.9.1-bin-hadoop2/lib_managed/jars/edu.berkeley.cs.shark/hive-exec# rm -rf com/ hive-exec-0.11.0-shark-0.9.1.jar root@xxx:~/app/shark-0.9.1-bin-hadoop2/lib_managed/jars/edu.berkeley.cs.shark/hive-exec# jar -cf hive-exec-0.11.0-shark-0.9.1.jar .
创建表
~/app/shark-0.9.1-bin-hadoop2/bin# ./shark -f createTable.sql