java - jsoup从表中提取数据

我有一个html文件，我需要使用jsoup从中提取部门名称。

Document doc = Jsoup.connect("http://directory.binghamton.edu/directory/directory.deptlist").get();
System.out.println(doc);
Elements departments = doc.select("deptlist");

for (Element department : departments) {
    System.out.println(department.text());
}

我已经做了类似的事情，但是没有用。

查看源代码：http：//directory.binghamton.edu/directory/directory.deptlist

谢谢。

最佳答案

开始了！

Document doc = Jsoup.connect("http://directory.binghamton.edu/directory/directory.deptlist").get();

Elements departments = doc.select("table#deptlist a"); // Select all 'a' in a 'table'-tag with id 'deptlist'
String name;


for( Element element : departments ) // Iterate over all Elements available
{
    name = element.text(); // Save the plaintext (no html) of the element
    System.out.println(name); // Simple output (as an example)
}

在代码中，选择一个标签“ deptlist”而不是表格。
如果要选择所有带有id=deptlist的元素（在我的示例中，您仅选择具有该ID的表），则可以使用以下选择器：doc.select("#deptlist")。

在此处查找一些更详细的信息：JSoup selector API