Simple regex question, part 7

I have a string like the following (each page contains only one such string):
string src = @"华夏时报";
To get the text inside it, I used lookarounds:
            string src = @"华夏时报";
            string sourcePattern = @"(?<=)[\s\S]*(?=)";
            Regex regex = new Regex(sourcePattern, RegexOptions.IgnoreCase);
            Match match = regex.Match(src);
            if (match.Success)
            {
                Console.WriteLine(match.Value);
            }
However, some pages contain a link at that spot, for example:
string src = @"<a href=""http://www.chinatimes.cc/"">华夏时报</a>";
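I still want only the inner text "华夏时报". How should I write this? Note: between the two delimiting strings, if there is only text, take the text; if there is a link plus text (there is only ever one link), I still want only the text content. How can this be done with a single regex?
Thanks!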

Solutions »

// Capture the text between <a ...> and </a> in the named group "text".
Regex regExp = new Regex(@"[\s\S]*?<a[^>]+>(?<text>[\s\S]*?)</a>[\s\S]*");
string src = @"<a href=""http://www.chinatimes.cc/"">华夏时报</a>";
Match match = regExp.Match(src);
if (match.Success)
{
    Console.WriteLine(match.Groups["text"].Value);
}
Last edited by lxcnn on 2010-05-31 13:04:59
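Because of some requirements in my project, this has to be done with a single regex. This is what I ended up with; I would appreciate comments from both of you.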
string sourcePattern = @"

(
[\s\S]*<a[^>]+>(?<text>[\s\S]*?)</a>[\s\S]*
|
(?<text>[\s\S]*?)
)
";
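Why did you split the pattern across lines? If you split it, it has to start with (?x) so that the whitespace in the pattern is ignored:
string sourcePattern = @"(?x)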

(
[\s\S]*<a[^>]+>(?<text>[\s\S]*?)</a>[\s\S]*
|
(?<text>[\s\S]*?)
)
";
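As a rough sketch of the single-regex idea discussed above, the link tags can also be made optional instead of using an alternation. The `<span>` and `</span>` delimiters here are purely hypothetical stand-ins for whatever strings actually surround the text on the real page, and `LinkTextExtractor` is just an illustrative name.

```csharp
using System;
using System.Text.RegularExpressions;

public static class LinkTextExtractor
{
    // ASSUMPTION: <span> and </span> are hypothetical delimiters, used only
    // for illustration; substitute the real surrounding strings.
    private static readonly Regex Pattern = new Regex(
        @"(?x)
          (?<=<span>)       # must start right after the opening delimiter
          (?:<a[^>]*>)?     # optional opening link tag
          (?<text>[^<]*)    # the text itself, up to the next tag
          (?:</a>)?         # optional closing link tag
          (?=</span>)       # the closing delimiter must follow",
        RegexOptions.IgnoreCase);

    // Returns the inner text, or null when the pattern does not match.
    public static string Extract(string src)
    {
        Match match = Pattern.Match(src);
        return match.Success ? match.Groups["text"].Value : null;
    }
}

public static class Program
{
    public static void Main()
    {
        string plain = @"<span>华夏时报</span>";
        string linked = @"<span><a href=""http://www.chinatimes.cc/"">华夏时报</a></span>";

        Console.WriteLine(LinkTextExtractor.Extract(plain));   // 华夏时报
        Console.WriteLine(LinkTextExtractor.Extract(linked));  // 华夏时报
    }
}
```

Because both link tags are optional and the text is restricted to `[^<]*`, the same named group `text` is filled in either case, so no alternation with a duplicated group name is needed. This assumes the text itself never contains a `<` character.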