Java Windows-1252 编码 - 显示的字符不正确

Question

提问by user2707175

I have a buffer with chars encoded in Windows-1252. However when I create a new String with appropriate encoding, instead of expected result I've get quite often interrogation marks, ex.

我有一个用 Windows-1252 编码的字符的缓冲区。然而，当我用适当的编码创建一个新的字符串时，我经常得到询问标记，而不是预期的结果，例如。

byte[] tmps = new byte[] {(byte) 0xfb};
System.out.println (new String (tmps,0,1,"Windows-1252" ));

As result the system should display "u" char with "^" above it. Instead it displays "?".

结果系统应该显示“u”字符，上面有“^”。相反，它显示“？”。

Any idea?

任何的想法？

Answer 1

回答by Stephen C

First of all Windows-1252 isa supported encoding:

首先，Windows-1252是一种受支持的编码：

If it wasn't you'd get an UnsupportedEncodingExceptionin new String (...,"Windows-1252"). (That's what the javadocsays!)
The Oracle Java documentation say Windows-1252 is in the "Basic Encoding Set" - http://docs.oracle.com/javase/7/docs/technotes/guides/intl/encoding.doc.html, http://docs.oracle.com/javase/6/docs/technotes/guides/intl/encoding.doc.html, etcetera.

如果不是，你会得到一个UnsupportedEncodingExceptionin new String (...,"Windows-1252")。（这就是javadoc所说的！）
Oracle Java 文档说 Windows-1252 位于“基本编码集”中 - http://docs.oracle.com/javase/7/docs/technotes/guides/intl/encoding.doc.html, http://docs .oracle.com/javase/6/docs/technotes/guides/intl/encoding.doc.html等。

I think that the most likely problem here is on the output side. Specifically, Java may think that your locale's default charset is ASCII or something that doesn't support that codepoint.

我认为这里最有可能的问题出在输出端。具体来说，Java 可能认为您的语言环境的默认字符集是 ASCII 或不支持该代码点的东西。

One way to eliminate Windows-1252as the causeof the problem is to write the equivalent string using a Unicode escape; e.g.

消除的一种方法Windows-1252为原因的问题是写使用Unicode转义等价的字符串; 例如

    System.out.println("\u00fb");

Answer 2

回答by user2707175

I've already found this.

我已经找到了这个。

Menu Run/Run configurations/ next Java Application and your own app name/tab common/ next encoding set to UTF-8

菜单运行/运行配置/下一个 Java 应用程序和您自己的应用程序名称/选项卡通用/下一个编码设置为 UTF-8

And since now both windows 1250 and 1252 chars seems to be displayed ok.

从现在开始，Windows 1250 和 1252 个字符似乎都可以正常显示。

Java Windows-1252 编码 - 显示的字符不正确

提问by user2707175

回答by Stephen C

回答by user2707175

相关推荐

最近更新

标签

Java Windows-1252 编码 - 显示的字符不正确

提问by user2707175

回答by Stephen C

回答by user2707175

相关推荐

Java Spring Boot 应用程序 - 任何其余 API 端点的默认超时时间或控制所有端点超时的简单配置是什么

扩展类的 Java 序列化

Java 如何在某个弹簧配置文件中禁用飞路？

在服务器套接字java中从客户端获取数据

相关推荐

最近更新

标签