Java: how to find the JAR /home/hadoop/contrib/streaming/hadoop-streaming.jar

Question

Asked by harshil bhatt

I'm following a Pluralsight video tutorial on Amazon EMR. I'm stuck and cannot proceed because I am getting this error:

Not a valid JAR: /home/hadoop/contrib/streaming/hadoop-streaming.jar

Please note that the tutorial is old and uses an older EMR version. I am using the latest version; is that a problem?

These are the steps I took after entering the credentials in PuTTY:

1) Hadoop
2) mkdir streamingCode
3) wget -o ./streamingCode/wordSplitter.py s3://elasticmapreduce/samples/wordcount/wordSplitter.py
4) hadoop jar contrib/streaming/hadoop-streaming.jar -files streamingCode/wordSplitter.py -mapper wordSplitter.py input s3://elasticmapreduce/samples/wordcount/input -output streamingCode/wordCountOut -reducer aggregate

I cannot execute step 4 because I get the error below:

Not a valid JAR: /home/hadoop/contrib/streaming/hadoop-streaming.jar

Answer 1

Answered by ChristopherB

The Hadoop streaming jar is still available in the latest release of EMR Hadoop. Starting with EMR release 4.0.0 it can be found at /usr/lib/hadoop-mapreduce/hadoop-streaming.jar.

Another good resource for differences between versions can be found at http://docs.aws.amazon.com/ElasticMapReduce/latest/ReleaseGuide/emr-release-differences.html.
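
As a rough sketch of how step 4 of the question might look with the new jar location, the snippet below wraps the call in a small shell function. The function name `run_wordcount` is hypothetical, and `-input` is an assumed correction of the bare `input` in the original command; the remaining arguments are copied from the question.

```shell
#!/bin/sh
# Jar location named in the answer for EMR release 4.0.0 and later.
# It can be overridden from the environment if the jar lives elsewhere.
STREAMING_JAR="${STREAMING_JAR:-/usr/lib/hadoop-mapreduce/hadoop-streaming.jar}"

# run_wordcount is a hypothetical helper name. It checks that the jar exists
# before calling hadoop, so a wrong path fails with a clear message.
run_wordcount() {
    if [ ! -f "$STREAMING_JAR" ]; then
        echo "streaming jar not found: $STREAMING_JAR" >&2
        return 1
    fi
    # Arguments are taken from step 4 of the question; "-input" is an
    # assumed fix for the bare "input" written there.
    hadoop jar "$STREAMING_JAR" \
        -files streamingCode/wordSplitter.py \
        -mapper wordSplitter.py \
        -reducer aggregate \
        -input s3://elasticmapreduce/samples/wordcount/input \
        -output streamingCode/wordCountOut
}
```

After sourcing the snippet on the master node, calling `run_wordcount` would submit the job, or report the missing jar instead of Hadoop's "Not a valid JAR" message.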

Answer 2

Answered by Nikhil B Agarwal

For the HADOOP_STREAMING variable, obtaining the path is a bit more complicated, because it depends on the HDP version you are using.

Search for its location with the command: find / -name 'hadoop-streaming*.jar'
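
A minimal sketch of how that search could be wrapped so its result feeds HADOOP_STREAMING directly. The helper name `find_streaming_jar` is hypothetical; it runs the same `find` expression, hides permission errors, and keeps only the first match.

```shell
#!/bin/sh
# find_streaming_jar is a hypothetical helper name. It searches the given
# root directory (default: /) and prints the first matching jar, if any.
find_streaming_jar() {
    root="${1:-/}"
    # Discard "Permission denied" noise from directories we cannot read.
    find "$root" -name 'hadoop-streaming*.jar' 2>/dev/null | head -n 1
}
```

For example, `HADOOP_STREAMING="$(find_streaming_jar /usr)"` limits the search to `/usr`, which is usually much faster than scanning the whole filesystem. If several jars match, the one printed is simply whichever `find` reports first, so check the result before relying on it.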

Src: http://thecoatlessprofessor.com/programming/installing-r-studio-server-on-hortonworks-virtual-box-image-and-rmr2-a-k-a-rhadoop-r-package/