问题描述
我在做一个Android应用程序,做DOM解析的XML文件。我有一个XML文件,它看起来像这样:
I'm making an android application that does DOM parsing on an xml file. I have an xml file that looks like this:
<?xml version="1.0" encoding="utf-8"?>
<family>
<grandparent>
<parent1>
<child1>Foo</child1>
<child2>Bar</child2>
</parent1>
<parent2>
<child1>Raz</child1>
<child2>Mataz</child2>
</parent2>
</grandparent>
</family>
如果我上运行一个DOM解析器,是这样的:
If I run a dom parser on it, like this:
try {
DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
Document doc = builder.parse(input);
doc.getDocumentElement().normalize(); //added in since the edit
NodeList nodd = doc.getElementsByTagName("grandparent");
for (int x = 0; x < nodd.getLength(); x++){
Node node = nodd.item(x);
NodeList nodes = node.getChildNodes();
for(int y = 0; y < nodes.getLength(); y++){
Node n = nodes.item(y);
System.out.println(n.getNodeName());
}
}
}
我的应用程序打印出以下
My application prints out the following
07-20 18:24:28.395:信息/的System.out(491):#text
07-20 18:24:28.395:信息/的System.out(491):parent1
07-20 18:24:28.395: INFO/System.out(491): parent1
07-20 18:24:28.395:信息/的System.out(491):#text
07-20 18:24:28.395: INFO/System.out(491): #text
07-20 18:24:28.395:信息/的System.out(491):parent2
07-20 18:24:28.395: INFO/System.out(491): parent2
07-20 18:24:28.395:信息/的System.out(491):#text
07-20 18:24:28.395: INFO/System.out(491): #text
我的问题是,那些是什么#text领域,更重要的是,我该如何摆脱他们?
My question is, what are those #text fields and more importantly, how do I get rid of them?
编辑:所以,现在我知道它们是什么,我想它正常化。我已经更新了code以反映更改,但同样的结果。
So now that I know what they are, I tried to normalize it. I have updated the code to reflect the changes, but same result.
推荐答案
这是空白(换行,空格,制表符):)
It's whitespace (newlines, spaces, tabs) :)
这篇关于为什么有#text在我的xml文件的节点?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!