Remove img path from description

Keywords: img, path, remove, description | Updated: 2023-09-27 17:58:09

I need the description from an RSS feed, but the text I get always includes the image tag and its path.

The RSS feed is:

<description>
<![CDATA[<img src="http://example.com/img/1/title/1967304.jpg"/> Ukrainian forces launch an "anti-terrorist operation" after pro-Russian gunmen seize buildings in the eastern part of the country.]]>
</description>

I have this code:

if (this._groups.Count != 0)
   return;
SyndicationClient client = new SyndicationClient();
Uri feedUri = new Uri("url_feed");
var feed = await client.RetrieveFeedAsync(feedUri);
foreach (SyndicationItem item in feed.Items)
{
    string data = string.Empty;
    if (feed.SourceFormat == SyndicationFormat.Rss20)
    {
       // Get description
       data = item.Summary.Text;
    }
    Regex regx = new Regex("http://([\\w+?\\.\\w+])+([a-zA-Z0-9\\~\\!\\@\\#\\$\\%\\^\\&\\*\\(\\)_\\-\\=\\+\\\\\\/\\?\\.\\:\\;\\'\\,]*)?.(?:jpg|bmp|gif|png)"
                            , RegexOptions.IgnoreCase);
    string filePath = regx.Match(data).Value;
    DataGroup group = new DataGroup(item.Id,
                                       item.Title.Text,
                                       item.Links[0].Uri.ToString(),
                                       filePath.Replace("small", "large"),
                                       data.Split(new string[] { "<br>" }, StringSplitOptions.None)[0].ToString());
    this.Groups.Add(group);
}

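The output (in a TextBlock) is:

<img src="http://example.com/img/1/title/1967304.jpg"> Ukrainian forces launch an "anti-terrorist operation" after pro-Russian gunmen seize buildings in the eastern part of the country.

I need only the text, without the img tag and its path.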


If the text itself does not contain '<' or '>', you might find this useful:

Using directive:

using System.Text.RegularExpressions;

Code snippet:

// Do your stuff to get the description
string description = "<img src=\"http://example.com/img/1/title/1967304.jpg\"> Ukrainian forces launch an \"anti-terrorist operation\" after pro-Russian gunmen seize buildings in the eastern part of the country.";
string cleaned = Regex.Replace(description, @"<[^>]*>", String.Empty, RegexOptions.IgnoreCase).Trim();
Console.WriteLine(cleaned);
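
To connect this to the code in the question, the same replacement could be wrapped in a small helper and applied to item.Summary.Text before the value is passed to DataGroup. The following is only a sketch: FeedText, StripTags, and ExtractImageUrl are hypothetical names rather than part of any library, the entity decoding via WebUtility.HtmlDecode is an extra step the answer above does not include, and the src pattern assumes the attribute is double-quoted as in the sample feed.

```csharp
using System;
using System.Net;
using System.Text.RegularExpressions;

// Hypothetical helper class; the names are illustrative only.
public static class FeedText
{
    // Removes anything that looks like an HTML tag, then decodes
    // entities such as &amp; and trims surrounding whitespace.
    // Assumption: the plain text itself contains no '<' or '>'.
    public static string StripTags(string html)
    {
        if (string.IsNullOrEmpty(html))
            return string.Empty;

        string withoutTags = Regex.Replace(html, @"<[^>]*>", string.Empty);
        return WebUtility.HtmlDecode(withoutTags).Trim();
    }

    // Returns the value of the first double-quoted src attribute
    // inside an <img> tag, or an empty string if none is found.
    public static string ExtractImageUrl(string html)
    {
        if (string.IsNullOrEmpty(html))
            return string.Empty;

        Match match = Regex.Match(
            html,
            @"<img[^>]*\bsrc\s*=\s*""([^""]*)""",
            RegexOptions.IgnoreCase);
        return match.Success ? match.Groups[1].Value : string.Empty;
    }
}
```

In the loop from the question, FeedText.StripTags(data) could then be passed as the description argument of DataGroup, and FeedText.ExtractImageUrl(data) could stand in for the longer URL regex if only the src value is needed.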