avatar
xml and chinese character# XML - WWW明日之星
y*o
1
Hi.
I am trying to process an xml-tagged chinese document using Python's minidom.
But my code gets stuck when it hits the first Chinese character in the
xml-tagged document. Python complains that the Chinese character is an invalid
token, and thus not well-formed.
I tried using encoding="UTF-8" and encoding="UTF-16" and encoding="GB2312" and
encoding="GBK" in the xml-tagged chinese document. None of them helped.
Would you please give a hint? Thanks.
avatar
c*r
2

minidom.
invalid
and
try ISO8859-1

【在 y********o 的大作中提到】
: Hi.
: I am trying to process an xml-tagged chinese document using Python's minidom.
: But my code gets stuck when it hits the first Chinese character in the
: xml-tagged document. Python complains that the Chinese character is an invalid
: token, and thus not well-formed.
: I tried using encoding="UTF-8" and encoding="UTF-16" and encoding="GB2312" and
: encoding="GBK" in the xml-tagged chinese document. None of them helped.
: Would you please give a hint? Thanks.

相关阅读
logo
联系我们隐私协议©2024 redian.news
Redian新闻
Redian.news刊载任何文章,不代表同意其说法或描述,仅为提供更多信息,也不构成任何建议。文章信息的合法性及真实性由其作者负责,与Redian.news及其运营公司无关。欢迎投稿,如发现稿件侵权,或作者不愿在本网发表文章,请版权拥有者通知本网处理。