开发者

HTML errors when truncating?

开发者 https://www.devze.com 2023-03-29 20:59 出处:网络
I have the following function: function truncate($string, $limit, $break=\".\", $pad=\"...\") { if(strlen($string) <= $limit) return $string;

I have the following function:

function truncate($string, $limit, $break=".", $pad="...") { 
  if(strlen($string) <= $limit) return $string; 
    if(false !== ($breakpoint = strpos($string, $break, $limit))) { 
      if($breakpoint < strlen($string) - 1) { 
        $string = substr($string, 0, $breakpoint) . $pad; 
      } 
    } 

    return $string; 
开发者_运维技巧}

If i have the following code:

$html = '<div style="bla: bla;">somet30ext</div> <div id="bla">MORE AND MORE TEXT</div>';
print truncate($html, 30);

It will cut off at somet..... In other words, we end with:

<div style="bla: bla;">somet 

I.e. an unclosed div tag. How do I go about solving this?

Update:

I don't want to truncate only when I get to the . I want something that can automatically add divs. In this case, the output should be:

     <div style="bla: bla;">somet</div>

I.e. It actually added the because it knew it was unclosed? Am I correct in assuming I must use something like html purifier for this?


Don't truncate the $html but rather the real text where it's appropriate. To get hold of the text, you could use php's xml functions (DOM, SimpleXml) or regular expression. Though I'd advice the first.

Example using Dom:

$html = '<div style="bla: bla;">somet30ext</div> <div id="bla">MORE AND MORE TEXT</div>';

$dom = new DomDocument;
$dom->loadHtml($html);

$xpath = new DomXpath($dom);

// example of getting a div with id=bla
$bla = $xpath->query('//div[@id="bla"]')->item(0);
if ($bla instanceof DomNode) {

  // truncate here
  if (strlen($bla->nodeValue) > 10) {
    $bla->nodeValue = substr($bla->nodeValue, 0, 10) . '...';
  }
}

// collect result, this is needed due to dom->loadhtml wrapping the loaded string
// with html/body if not present
$result = '';
foreach ($xpath->query('//body/*') as $childNode) {
  $result .= $dom->saveHtml($childNode);
}

echo $result;


Well, in truncate function just look for opening tags, count them, and look for these tags to close, when all are closed that's is your string.

0

精彩评论

暂无评论...
验证码 换一张
取 消