What does Java "Heap Size" mean for the Hadoop Namenode?

Disclaimer: this page is a translation of a popular StackOverflow question and is provided under the CC BY-SA 4.0 license. If you reuse or share it, you must do so under the same CC BY-SA terms and attribute the original authors (not me). Original question: http://stackoverflow.com/questions/22215994/


What does "Heap Size" mean for Hadoop Namenode?

Tags: java, hadoop, mapreduce, heap-memory

Asked by Bohdan

I'm trying to understand whether there is something wrong with my Hadoop cluster. When I go to the cluster summary in the web UI, it says:


Cluster Summary

XXXXXXX files and directories, XXXXXX blocks = 7534776 total.
Heap Size is 1.95 GB / 1.95 GB (100%) 

And I'm concerned about why this Heap Size metric is at 100%.


Could someone please explain how the namenode heap size impacts cluster performance, and whether this needs to be fixed?


Answered by Remus Rusanu

The namenode Web UI shows the values like this:


<h2>Cluster Summary (Heap Size is <%= StringUtils.byteDesc(Runtime.getRuntime().totalMemory()) %>/<%= StringUtils.byteDesc(Runtime.getRuntime().maxMemory()) %>)</h2>

The Runtime documentation describes these as follows (a small sketch using both calls appears after the list):


  • totalMemory(): Returns the total amount of memory in the Java virtual machine.
  • maxMemory(): Returns the maximum amount of memory that the Java virtual machine will attempt to use.
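As a quick illustration, here is a minimal standalone Java sketch (not namenode code; the formatting is a simplified stand-in for Hadoop's StringUtils.byteDesc) that prints the same two figures in the same "total / max (percent)" shape as the cluster summary:

    public class HeapReport {
        public static void main(String[] args) {
            Runtime rt = Runtime.getRuntime();
            long total = rt.totalMemory(); // heap currently committed by the JVM
            long max = rt.maxMemory();     // upper bound, normally set via -Xmx
            double gib = 1L << 30;         // byteDesc uses binary units, shown as "GB"
            System.out.printf("Heap Size is %.2f GB / %.2f GB (%d%%)%n",
                    total / gib, max / gib, Math.round(100.0 * total / max));
        }
    }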

Max is going to be the -Xmx parameter from the service start command. The main factor in total memory is the number of blocks in your HDFS cluster. The namenode requires ~150 bytes for each block, plus 16 bytes for each replica, and all of it must be kept in live memory. So a default replication factor of 3 gives you 182 bytes per block, and with 7534776 blocks that comes to about 1.3GB. Add all the other non-file-related memory in use in the namenode, and 1.95GB sounds about right. I would say that your HDFS cluster size requires a bigger namenode: more RAM. If possible, increase the namenode startup -Xmx. If that is already maxed out, you'll need a bigger VM/physical box.

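A rough sketch of that estimate in Java, using the per-block figures quoted above (the class and variable names are illustrative, and the byte counts are the answer's approximations, not exact values):

    public class NamenodeHeapEstimate {
        public static void main(String[] args) {
            long blocks = 7_534_776L;      // block count from the cluster summary
            int perBlock = 150;            // approx. namenode metadata bytes per block
            int perExtraReplica = 16;      // approx. extra bytes per additional replica
            int replication = 3;           // default HDFS replication factor
            // 150 + 2 * 16 = 182 bytes per block at replication 3, as in the answer
            long bytesPerBlock = perBlock + (long) (replication - 1) * perExtraReplica;
            double gb = blocks * bytesPerBlock / (double) (1L << 30);
            System.out.printf("Block metadata alone: ~%.1f GB of namenode heap%n", gb);
            // Everything else the namenode keeps in memory comes on top of this,
            // so a 1.95 GB -Xmx (set e.g. via HADOOP_NAMENODE_OPTS in hadoop-env.sh)
            // is essentially exhausted.
        }
    }

This prints roughly 1.3 GB, matching the estimate above.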

Read The Small Files Problem and HDFS-5711.
