从描述中删除img路径
本文关键字:img 路径 删除 描述 | 更新日期: 2023-09-27 17:58:09
我需要rss提要中的描述,但总是得到带有路径图像的描述。
RSS提要是:
<description>
<![CDATA[<img src="http://example.com/img/1/title/1967304.jpg"/> Ukrainian forces launch an "anti-terrorist operation" after pro-Russian gunmen seize buildings in the eastern part of the country.]]>
</description>
我有代码:
if (this._groups.Count != 0)
return;
SyndicationClient client = new SyndicationClient();
Uri feedUri = new Uri("url_feed");
var feed = await client.RetrieveFeedAsync(feedUri);
foreach (SyndicationItem item in feed.Items)
{
string data = string.Empty;
if (feed.SourceFormat == SyndicationFormat.Rss20)
{
// Get description
data = item.Summary.Text;
}
Regex regx = new Regex("http://([''w+?''.''w+])+([a-zA-Z0-9''~''!''@''#''$''%''^''&''*''('')_''-''=''+''''''/''?''.'':'';''''',]*)?.(?:jpg|bmp|gif|png)"
, RegexOptions.IgnoreCase);
string filePath = regx.Match(data).Value;
DataGroup group = new DataGroup(item.Id,
item.Title.Text,
item.Links[0].Uri.ToString(),
filePath.Replace("small", "large"),
data.Split(new string[] { "<br>" }, StringSplitOptions.None)[0].ToString());
this.Groups.Add(group);
}
输出为(在文本块中):
<img src="http://example.com/img/1/title/1967304.jpg">亲俄枪手占领了乌克兰东部的建筑后,乌克兰军队发起了一场"反恐行动"。
我只需要文本,不需要带有img路径的文本。
如果您的文本不包含'<'或者'>'你可能会发现这个usfull:
使用:
using System.Text.RegularExpressions;
代码段:
// Do your stuff to get the description
string description = "< img src='"http://example.com/img/1/title/1967304.jpg'"> Ukrainian forces launch an '"anti-terrorist operation'" after pro-Russian gunmen seize buildings in the eastern part of the country.";
string cleaned = Regex.Replace(description, @"<[^>]*>", String.Empty, RegexOptions.IgnoreCase).Trim();
Console.WriteLine(cleaned);