How do I parse a live website's HTML, extract specific information, and store it in a database with Visual Basic?


Question

The info I need to extract is formatted like this:

<TD><A HREF="http://xxxxx.com/xxxxxx/index.html"><IMG SRC="../xxxxx/thumbnails/xxxxx.jpg">    </A></TD>
<TD>=== <B><A HREF="http://xxxxxxxxx.com/xxxxxxxx/index.html">LINE 0</A></B> ===<BR>
<FONT SIZE="2" COLOR="#400080">
Line 1<BR>
Line 2<BR>
Line 3
</FONT>
</TD>

How do I extract the text between `<TD>===` and `</A>` (LINE 0 above), along with Line 1, Line 2, and Line 3, from a live website and store it in a database?

Answer

It seems like you're looking for information about scraping a website. There are a lot of tools that can help you with this; perhaps you could start here: web scraping using visual basic (blogs.msdn.com)
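To make the steps concrete, here is a minimal sketch of the parse-and-store part. It is written in Python rather than Visual Basic, purely as an illustration, and it uses only the standard library. The table name `entries`, its column names, and the helpers `parse_entries`, `store_entries`, and `fetch_html` are all made up for this example, and it assumes every entry on the page follows the same layout as the snippet in the question (a bold link for LINE 0, then a `<FONT>` block with the lines separated by `<BR>`).

```python
import sqlite3
import urllib.request
from html.parser import HTMLParser

# The snippet from the question, used here in place of a live page.
SAMPLE_HTML = """<TD><A HREF="http://xxxxx.com/xxxxxx/index.html"><IMG SRC="../xxxxx/thumbnails/xxxxx.jpg">    </A></TD>
<TD>=== <B><A HREF="http://xxxxxxxxx.com/xxxxxxxx/index.html">LINE 0</A></B> ===<BR>
<FONT SIZE="2" COLOR="#400080">
Line 1<BR>
Line 2<BR>
Line 3
</FONT>
</TD>"""


class EntryParser(HTMLParser):
    """Collects the bold link (title + URL) and the <FONT> lines of each entry."""

    def __init__(self):
        super().__init__()
        self.entries = []  # finished (title, url, lines) tuples
        self._in_bold = False
        self._in_title = False
        self._in_font = False
        self._title = ""
        self._url = ""
        self._text = ""

    def handle_starttag(self, tag, attrs):
        # HTMLParser reports tag and attribute names in lower case.
        if tag == "b":
            self._in_bold = True
        elif tag == "a" and self._in_bold:
            self._in_title = True
            self._url = dict(attrs).get("href", "")
            self._title = ""
        elif tag == "font":
            self._in_font = True
            self._text = ""
        elif tag == "br" and self._in_font:
            self._text += "\n"  # treat <BR> as a line break

    def handle_endtag(self, tag):
        if tag == "b":
            self._in_bold = False
        elif tag == "a":
            self._in_title = False
        elif tag == "font" and self._in_font:
            self._in_font = False
            lines = [s.strip() for s in self._text.split("\n") if s.strip()]
            self.entries.append((self._title.strip(), self._url, lines))

    def handle_data(self, data):
        if self._in_title:
            self._title += data
        elif self._in_font:
            self._text += data


def parse_entries(html):
    """Return a list of (title, url, [line, ...]) tuples found in the HTML."""
    parser = EntryParser()
    parser.feed(html)
    parser.close()
    return parser.entries


def store_entries(conn, entries):
    """Insert the parsed entries into a (hypothetical) 'entries' table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS entries "
        "(title TEXT, url TEXT, line1 TEXT, line2 TEXT, line3 TEXT)"
    )
    for title, url, lines in entries:
        padded = (lines + ["", "", ""])[:3]  # always exactly three columns
        conn.execute(
            "INSERT INTO entries VALUES (?, ?, ?, ?, ?)", (title, url, *padded)
        )
    conn.commit()


def fetch_html(url):
    """Download a live page; the encoding is assumed to be UTF-8 here."""
    with urllib.request.urlopen(url) as resp:
        return resp.read().decode("utf-8", errors="replace")


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")  # use a file path for a real database
    store_entries(conn, parse_entries(SAMPLE_HTML))
    print(conn.execute("SELECT * FROM entries").fetchall())
```

For a live site you would pass the result of `fetch_html(...)` to `parse_entries` instead of `SAMPLE_HTML`. The same three steps (download the page, walk the tags, insert rows with a parameterized query) carry over to Visual Basic with whatever HTTP, HTML-parsing, and database libraries you prefer.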
