正则的求解,提取html中的内容解决方法-C#教程-爱易网页

正则的求解,提取html中的内容解决方法

日期：2014-05-17　浏览次数：20760 次

正则的求解,提取html中的内容

<div class="content" title="2013-03-26 12:00:42">我需要的内容</div>

正html 是很长的。有很多组div内容我需要全部取出。



            Regex regImg = new Regex(@"<div class=""content"" title="".*"">?<imgUrl>(.*)</div>", RegexOptions.IgnoreCase);

            // 搜索匹配的字符串sHtmlText 为html内容

            MatchCollection matches = regImg.Matches(sHtmlText);



            int i = 0;

            string[] sUrlList = new string[matches.Count];



            // 取得匹配项列表

            foreach (Match match in matches)

                sUrlList[i++] = match.Groups["imgUrl"].Value;



            return sUrlList;

我这样写错了。求指导。

html 正则提取内容

------解决方案--------------------
"(?is)(?<=<div[^<>]>)[^<>]+(?</div>)"
------解决方案--------------------
变量=[\s\S]*?
------解决方案--------------------
内容=(?<TARGET>[\s\S]+)
------解决方案--------------------
string pattern=@"(?<=<div[^>]*?class=""content""[^>]*?>).*?(?=</div>)";

免责声明： 本文仅代表作者个人观点，与爱易网无关。其原创性以及文中陈述文字和内容未经本站证实，对本文以及其中全部或者部分内容、文字的真实性、完整性、及时性本站不作任何保证或承诺，请读者仅作参考，并请自行核实相关内容。

正则的求解,提取html中的内容解决方法

相关资料更多>

推荐阅读更多>