
regex to trap img tag, both versions

Developer https://www.devze.com 2022-12-11 03:49 Source: Internet

I need to remove image tags from text, so both versions of the tag:

<img src="" ... ></img>


<img src="" ... />


In C#, you can use the following code:

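            // Requires: using System.Text.RegularExpressions;
            // In a verbatim string, one literal quote is written as "", so src="" becomes src="""".
            string html = @"
<h1>
<img src="""" ... >
</img>
<img></img>-bad
<img/>-bad
<img src="""" ... />
</h1>";
            // The pattern requires whitespace after "<img", so the two "-bad" lines are left untouched.
            string result = Regex.Replace(html, @"<img\s[^>]*>(?:\s*?</img>)?", "", RegexOptions.IgnoreCase);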

But be aware that modifying HTML with regular expressions isn't a good approach; it's better to use Html Agility Pack or another HTML parser.
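
To make the warning above concrete, here is a small sketch that runs the same pattern over a few inputs. The class name ImgRegexDemo, the helper Strip, and the sample strings are made up for illustration and are not part of the original answer. The last sample shows one way the pattern can go wrong: a ">" inside an attribute value ends the match early.

```csharp
using System;
using System.Text.RegularExpressions;

public static class ImgRegexDemo
{
    // Same pattern and options as in the answer above.
    private static readonly Regex ImgTag = new Regex(
        @"<img\s[^>]*>(?:\s*?</img>)?", RegexOptions.IgnoreCase);

    // Hypothetical helper: returns the input with every match removed.
    public static string Strip(string html) => ImgTag.Replace(html, "");

    public static void Main()
    {
        string[] samples =
        {
            "<img src=\"\" ... ></img>",         // removed entirely
            "<img src=\"\" ... />",              // removed entirely
            "<IMG SRC=\"a.png\">",               // removed (IgnoreCase)
            "<img></img>",                       // kept: no whitespace after "<img"
            "<img/>",                            // kept: no whitespace after "<img"
            "<img alt=\"a > b\" src=\"x.png\">"  // match stops at the '>' inside alt, leaving a fragment
        };

        foreach (string sample in samples)
        {
            Console.WriteLine($"{sample}  =>  [{Strip(sample)}]");
        }
    }
}
```

The fragment left behind by the last sample is the kind of breakage that a real HTML parser avoids.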


This is not something you do with regular expressions. Use the XPath query //img instead.
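
A minimal sketch of the //img approach, assuming the Html Agility Pack NuGet package mentioned in the other answer is installed. The class name ImgStripper and the method RemoveImages are hypothetical names chosen for this example; the answer itself does not say which library to run the XPath query with.

```csharp
using System.Linq;
using HtmlAgilityPack;

public static class ImgStripper
{
    // Hypothetical helper: parses the HTML, removes every <img> element,
    // and returns the remaining markup.
    public static string RemoveImages(string html)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html);

        // SelectNodes returns null (not an empty collection) when nothing matches.
        var images = doc.DocumentNode.SelectNodes("//img");
        if (images != null)
        {
            // Copy to a list first so nodes are not removed from a collection being enumerated.
            foreach (var img in images.ToList())
            {
                img.Remove();
            }
        }

        return doc.DocumentNode.OuterHtml;
    }
}
```

Usage would look like string cleaned = ImgStripper.RemoveImages(html);. How a stray </img> closing tag is serialized depends on the parser, so it is worth checking the output on real input before relying on it.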
