Original URL: http://stackoverflow.com/questions/15732597/
Warning: this translation is provided under the CC BY-SA 4.0 license. You are free to use and share it, but you must attribute it to the original authors (not me): Stack Overflow.
Hadoop is asking for the input path to be on localhost 9000
Asked by Bohn
I am trying to run Tom White's Chapter 2 example.
When I run the command:
hadoop MaxTemperature input/ncdc/sample.txt output
The error I am getting is this:
11/12/31 18:08:28 INFO mapred.JobClient: Cleaning up the staging area hdfs://localhost:9000/tmp/hadoop-mymac/mapred/staging/mymac/.staging/job_201112311807_0001
11/12/31 18:08:28 ERROR security.UserGroupInformation: PriviledgedActionException as:mymac (auth:SIMPLE) cause:org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does not exist: hdfs://localhost:9000/user/mymac/input/ncdc/sample.txt
Exception in thread "main" org.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does not exist: hdfs://localhost:9000/user/mymac/input/ncdc/sample.txt
What have I set up wrong?
I haven't touched his source code; it can be found here:
Answered by Philippe Signoret
Your core-site.xml and hdfs-site.xml files are configured to use localhost:9000. If this isn't what you expect (which is what I gather from your post's title), what did you expect?
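For reference, that address comes from the default filesystem URI in core-site.xml. A minimal sketch of a pseudo-distributed configuration, assuming a CDH4/Hadoop-1-era setup where the property is called fs.default.name (newer releases call it fs.defaultFS):

<configuration>
  <property>
    <!-- clients resolve all HDFS paths against this NameNode address -->
    <name>fs.default.name</name>
    <value>hdfs://localhost:9000</value>
  </property>
</configuration>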
What version of Hadoop are we talking about? How did you install your Hadoop distribution? From your other question and the config files, I'm guessing you used CDH4. If you look over the instructions from Cloudera, can you see if you missed anything?
Before starting Hadoop, did you format HDFS?
$ hadoop namenode -format
Then, after starting Hadoop, do you get anything other than INFO messages?
Did you copy the input data into HDFS?
$ hadoop dfs -put /tmp/my/input/data input
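The error above says the job expects hdfs://localhost:9000/user/mymac/input/ncdc/sample.txt. Relative HDFS paths resolve against /user/<your-username>, so a sketch of copying the book's sample file into place (assuming it sits in a local input/ncdc directory; older releases create parent directories implicitly, newer ones need -mkdir -p):

$ hadoop dfs -mkdir input
$ hadoop dfs -mkdir input/ncdc
$ hadoop dfs -put input/ncdc/sample.txt input/ncdc/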
Finally, what do you get from simple HDFS commands such as:
$ hadoop dfs -ls /
UPDATE: Run Word Count
- Get HDFS up and running. Running hadoop dfs -ls / should work.
- Copy a folder with text file(s) into HDFS: hadoop dfs -put text_files input_folder
- Run hadoop dfs -ls . to see if your files got copied correctly.
- Find the hadoop-examples-X.Y.Z.jar file on your system, navigate to whatever directory it's in, and run: hadoop jar hadoop-examples-*.jar wordcount input_folder output_folder. You should see the progress of the MapReduce application.
- When it's finished, view the output with hadoop dfs -cat output_folder/*.
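Once word count runs cleanly, the MaxTemperature job from the question should find its input the same way (assuming the sample file was copied into HDFS as shown earlier):

$ hadoop MaxTemperature input/ncdc/sample.txt output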
Answered by user1900344
Forgetting to set JAVA_HOME in etc/hadoop/hadoop-env.sh may also cause this error.
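A minimal sketch of the line in question (the JDK path below is only an example; point it at wherever your JDK actually lives):

# in etc/hadoop/hadoop-env.sh
export JAVA_HOME=/usr/lib/jvm/java-7-openjdk-amd64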