求C＃提取网页正文内容代码解决思路-C#教程-爱易网页

求C＃提取网页正文内容代码解决思路

日期：2014-05-19　浏览次数：21064 次

求C＃提取网页正文内容代码
哪位大虾有C＃提取网页正文内容的代码，可不可以发上来我参考参考。谢谢啦！！

------解决方案--------------------
public static int saveHtmlFile(string url,string filename)
{
int status = -1;
string respHTML = string.Empty;
StreamWriter sw = null;
try
{
if(ReadHttp(url,ref respHTML)== "OK ")
{
if(File.Exists(filename))
{
File.Copy(filename,filename+ ".bak ",true);
}
sw = new StreamWriter(filename,false,Encoding.GetEncoding( "GB2312 "));
sw.WriteLine(respHTML);
sw.Close();
status = 0;
}
else
{
System.Web.HttpContext.Current.Response.Write( "找不到该页或服务器错误 ");
}
}
catch(Exception err)
{
System.Web.HttpContext.Current.Response.Write(err.Message);
status = -1;
}
finally
{
if (sw != null)
{
sw.Close();
}
}
return(status);
}

public static string ReadHttp(string url,ref string content)
{
string status= "ERROR ";
HttpWebRequest Webreq = (HttpWebRequest) WebRequest.Create(url);
HttpWebResponse Webresp=null;
StreamReader strm = null;
try
{
Webresp = (HttpWebResponse) Webreq.GetResponse();
status = Webresp.StatusCode.ToString();
strm = new StreamReader(Webresp.GetResponseStream(),Encoding.GetEncoding( "GB2312 "));
content = strm.ReadToEnd();
}
catch
{
}
finally
{
if(Webresp != null) Webresp.Close();
if(strm != null) strm.Close();
}
return(status);
}

免责声明： 本文仅代表作者个人观点，与爱易网无关。其原创性以及文中陈述文字和内容未经本站证实，对本文以及其中全部或者部分内容、文字的真实性、完整性、及时性本站不作任何保证或承诺，请读者仅作参考，并请自行核实相关内容。

—linq修改数据

101规约遇到的有关问题

如何样对字符串中与数据库中相符字符进行操作

请教如何关闭当前程序，然后自动打开另一个程序

求教:怎么判断.NET Framework 4.0 已经完全安装完毕

关于listview中选定某项的有关问题

ComBobox的有关问题，请帮忙

winform向图片动态平添热点

香港全能空间免费试用15天香港高速云虚拟主机PHP/ASP/NET送MSSQL和MYSQL

求C＃提取网页正文内容代码解决思路

相关资料更多>

推荐阅读更多>