Re: Issue with Matching UTF-8 Anchor Text in URIDetail plugin

2025-02-01 Thread Jimmy
*When adding debug to source like this: * * if (exists $rule->{text}) { next unless $info->{anchor_text}; my($op,$patt,$neg) = @{$rule->{text}}; my $match; for my $text (@{ $info->{anchor_text} }) {if ( ($op eq '=~' && $text =~ $patt) || ($op

Re: Issue with Matching UTF-8 Anchor Text in URIDetail plugin

2025-02-01 Thread John Hardin
On Sun, 2 Feb 2025, Jimmy wrote: Hello, I am experiencing difficulties creating a rule to match UTF-8 anchor text using the plugin, and I suspect there might be a bug related to UTF-8 matching. For example, I attempted to use the following rule: uri_detail UNICODE_LINK_TEXT text =~ /\\x{E0}\\

Issue with Matching UTF-8 Anchor Text in URIDetail plugin

2025-02-01 Thread Jimmy
Hello, I am experiencing difficulties creating a rule to match UTF-8 anchor text using the plugin, and I suspect there might be a bug related to UTF-8 matching. For example, I attempted to use the following rule: uri_detail UNICODE_LINK_TEXT text =~ /\\x{E0}\\x{B8}\\x{97}\\x{E0}\\x{B8}\\x{B1}\\x