在 Eclipse 中使用 utf-8 文件

Question

提问by Pablo Cabrera

Quite straight forward question. Is there a way to configure Eclipse to work with text files encoded with utf-8 with and without the BOM?

很直接的问题。有没有办法将 Eclipse 配置为使用带有和不带有 BOM 的 utf-8 编码的文本文件？

So far I've used eclipse with utf-8 encoding and it works, but when I try to edit a file generated by another editor that includes the BOM, Eclipse doesn't handle it properly, it 'shows an invisible character' at the begining of the file (the BOM). Is there a way to make Eclipse understand utf-8 encoded files with BOM?

到目前为止，我已经使用了带有 utf-8 编码的 eclipse 并且它可以工作，但是当我尝试编辑由另一个包含 BOM 的编辑器生成的文件时，Eclipse 无法正确处理它，它在文件的开头（BOM）。有没有办法让 Eclipse 理解带有 BOM 的 utf-8 编码文件？

Answer 1

采纳答案by VonC

Both bug 78455("Provide an option to force writing a BOM to UTF-8 files") and bug 136854don't leave much hope for such an option.

这两个错误78455（“提供一个选项，以力写BOM为UTF-8文件”）和错误136854不要留下太大的希望了这样的选择。

The support for encoding in the workspace is based on what is available from Java.
For any given resource in the workspace, it is possible to obtain a charset string that can be used with any Java APIs that take charset strings.
Examples are:
'US-ASCII',
'UTF-8',
'Cp1252',
'UTF-16' (Big Endian, BOM inserted automatically),
'UTF-16BE' (Big Endian, BOM not inserted automatically),
'UTF-16LE' (Little Endian, BOM not inserted automatically).
For Java encodings, except for the 'UTF-16' encoding, BOMs are not inserted (when writing) or discarded (when reading) for free.
Even if this is puzzling to end users, this is how all Java applications work.
If applications want to support creating UTF-8 files with BOMs to match their users' expectations, they need to provide such capability on their own(as neither Java nor the Resources model will help with that).
Eclipse does provide some improvements towards detecting BOMs, but not with generating or skipping them.

对工作区中编码的支持基于 Java 中的可用内容。
对于工作区中的任何给定资源，都可以获得一个字符集字符串，该字符串可与任何采用字符集字符串的 Java API 一起使用。
例子是：
' US-ASCII',
' UTF-8',
' Cp1252',
' UTF-16' (Big Endian, BOM 自动插入),
' UTF-16BE' (Big Endian, BOM 不自动插入),
' UTF-16LE'（Little Endian，不会自动插入 BOM）。
对于 Java 编码，除“UTF-16”编码外，不会免费插入（写入时）或丢弃（读取时）BOM。
即使这让最终用户感到困惑，但这就是所有 Java 应用程序的工作方式。
如果应用程序想要支持创建带有 BOM 的 UTF-8 文件以满足用户的期望，他们需要自己提供这种功能（因为 Java 和 Resources 模型都无法提供帮助）。
Eclipse 确实在检测 BOM 方面提供了一些改进，但没有生成或跳过它们。

在 Eclipse 中使用 utf-8 文件

提问by Pablo Cabrera

采纳答案by VonC

相关推荐

最近更新

标签

在 Eclipse 中使用 utf-8 文件

提问by Pablo Cabrera

采纳答案by VonC

相关推荐

eclipse Java Build Path中Order和Export选项卡有什么用

Ubuntu 上 Eclipse 中的巨大选项卡

JUnit 不会在 Eclipse 中的断点处停止（使用 JDK 1.6.0.20）

eclipse PyDev 和 Django：如何重新启动开发服务器？

相关推荐

最近更新

标签