Warren Togami wrote:
On 10/01/2009 02:36 PM, John Hardin wrote:
On Thu, 1 Oct 2009, Warren Togami wrote:

The "Oddity" I was pointing out at the beginning of the thread is not
prevalence of .cn URI's, but rather most of them appear to be exactly
8 characters long. Could someone please commit my T_CN_8_URL rule to
the sandbox so we can see if that trend holds beyond my own corpa?

I've put a .CN 8 URI rule into my sandbox file but it may be a few days
before it gets committed, my stuff is in flux right now...


# 8-letter .cn domain, per Warren Togami
uri            CN_EIGHT            m;^https?://(?:[^./]+\.)*[^./]{8}\.cn/;
describe       CN_EIGHT            .CN uri with eight-letter domain name
score          CN_EIGHT            0.10

Possible bug here... Do all URI's necessarily have a trailing slash?

Warren Togami
wtog...@redhat.com


\b might be better?


Reply via email to