Parsing HTML with c#.net [duplicate]

Developer https://www.devze.com 2023-03-07 06:28 Source: Internet
This question already has answers here: What is the best way to parse html in C#? [closed] (15 answers) Closed 9 years ago.

I'm trying to parse the following HTML file and would like to get the value of key. This is being done in Silverlight for Windows Phone.

<HTML>
<link ref="shortcut icon" href="favicon.ico">
<BODY>
<script Language="JavaScript">
location.href="login.html?key=UEFu1EIsgGTgAV7guTRhsgrTQU28TImSZkYhPMLj7BChpBkvlCO11aJU2Alj4jc5"
</script>
<CENTER><a href="login.html?key=UEFu1EIsgGTgAV7guTRhsgrTQU28TImSZkYhPMLj7BChpBkvlCO11aJU2Alj4jc5">Welcome</a></CENTER></BODY></HTML>

Any ideas on where to go from here?

Thanks.


Take a look at the HTML Agility Pack. It's a pretty decent HTML parser.

http://html-agility-pack.net/?z=codeplex

Here's some code to get you started (it still needs more error checking):

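using HtmlAgilityPack;

HtmlDocument document = new HtmlDocument();
string htmlString = "<html>blabla</html>";
document.LoadHtml(htmlString);
// SelectNodes returns null, not an empty collection, when nothing matches.
HtmlNodeCollection collection = document.DocumentNode.SelectNodes("//a");
if (collection != null)
{
    foreach (HtmlNode link in collection)
    {
        // GetAttributeValue falls back to "" when href is missing.
        string target = link.GetAttributeValue("href", "");
    }
}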
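
The loop above stops at the href value, while the question asks for the value of key. One way to get from one to the other is to split the query string by hand. This is only a sketch: HrefQuery and GetQueryValue are hypothetical names, not part of the HTML Agility Pack, and it assumes the href uses an ordinary ?name=value&name=value query.

```csharp
using System;

public static class HrefQuery
{
    // Hypothetical helper: returns the value of the named query parameter
    // from a relative or absolute href, or null when it is absent.
    public static string GetQueryValue(string href, string name)
    {
        int queryStart = href.IndexOf('?');
        if (queryStart < 0) return null;

        // Drop any #fragment before splitting the query into pairs.
        string query = href.Substring(queryStart + 1);
        int fragmentStart = query.IndexOf('#');
        if (fragmentStart >= 0) query = query.Substring(0, fragmentStart);

        foreach (string pair in query.Split('&'))
        {
            int eq = pair.IndexOf('=');
            if (eq < 0) continue;
            if (pair.Substring(0, eq) == name)
                return Uri.UnescapeDataString(pair.Substring(eq + 1));
        }
        return null;
    }
}
```

Calling HrefQuery.GetQueryValue(target, "key") inside the loop would then give the key for each link, or null for links that do not carry one.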


You can use a regular expression (the Regex class) for it. The expression can be something like this: login.html\?key=[^"]*
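
As a rough sketch of that approach, the pattern can be given a capture group so that only the key itself comes back. KeyExtractor and ExtractKey are hypothetical names; the pattern is a slightly tightened variant of the one above (the dot is escaped, and the match also stops at & in case more parameters follow), which is an assumption about the page rather than something it guarantees.

```csharp
using System;
using System.Text.RegularExpressions;

public static class KeyExtractor
{
    // Hypothetical helper: returns the key value from the raw HTML,
    // or null when the pattern does not match.
    public static string ExtractKey(string html)
    {
        // Group 1 captures everything after "key=" up to a quote or '&'.
        Match match = Regex.Match(html, @"login\.html\?key=([^""&]*)");
        return match.Success ? match.Groups[1].Value : null;
    }
}
```

Regex.Match returns only the first occurrence, which is enough here because the script block and the link in the sample page carry the same key.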
