Tightening parsing constraints on HTTP request method

Susan Hinrichs Mon, 17 Nov 2014 12:43:51 -0800

I'm tracking down a deployment issue that is incorrectly classifying aseries of bytes as correctly parsed HTTP request.

Walking through http_parser_parse_req(), it seems that to be marked as acorrectly formatted request you need

<bytes excluding white space>+ <white space>+ <bytes excluding whitespace>+\n

In my case this matches the method as a bunch of control characters andsome ascii characters and the URI as a set of ascii characters. Insteadof failing in the parsing, this request fails in the DNS lookup since myURI isn't a valid domain name. But the resulting error sent back talksabout DNS resolution and is misleading.

Looking at the W3 specs, it looks like HTTP 1.1 has the most lax rulesfor what characters can form a method token. From my reading, a methodcan be any token(http://www.w3.org/Protocols/rfc2616/rfc2616-sec5.html#sec5.1.1), andany character but white space and control characters are allowed to bein a token (http://www.w3.org/Protocols/rfc2616/rfc2616-sec2.html#sec2.2).

I'd like to change the parsing of the method token inhttp_parser_parse_req() to restrict control characters from the methodtoken as well as the white space characters. Since this seems like arather key part of the ATS processing and has been this way for quitesometime, I wanted to get confirmation from folks that this is areasonable change.


Thanks,
Susan

Tightening parsing constraints on HTTP request method

Reply via email to