[ 
https://issues.apache.org/jira/browse/HTTPCORE-778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17934271#comment-17934271
 ] 

Oleg Kalnichevski edited comment on HTTPCORE-778 at 3/11/25 7:04 PM:
---------------------------------------------------------------------

> As per RFC3986, quite a bit more characters should not be encoded:

[~peterhalicky] What statement in RFC 3986 actually supports this assertion? 
This section of the specification defines what characters are valid for 
fragment component. There is nothing that I can see that makes encoding of 
reserved characters illegal or not recommended.

Oleg 


was (Author: olegk):
> As per RFC3986, quite a bit more characters should not be encoded:

[~peterhalicky] What statement in RFC 3986 actually supports this assertion? 
This session of the specification defines what characters are valid for 
fragment component. There is nothing that I can see that makes encoding of 
reservesd characters illegal or not recommended.

Oleg 

> URIBuilder uses incorrect encoding method for URI fragment
> ----------------------------------------------------------
>
>                 Key: HTTPCORE-778
>                 URL: https://issues.apache.org/jira/browse/HTTPCORE-778
>             Project: HttpComponents HttpCore
>          Issue Type: Bug
>          Components: HttpCore
>    Affects Versions: 5.3.3
>            Reporter: Peter Halicky
>            Priority: Major
>
> URI fragment is encoded in URIBuilder using:
> {code:java}
> PercentCodec.encode(sb, this.fragment, this.charset); {code}
> (line 401, end of buildString method)
> This encodes all characters except UNRESERVED using the percent-format.
> As per (obsoleted) RFC2396, URI fragment should use URIC safe-chars.
> As per RFC3986, quite a bit more characters should not be encoded:
> {code:java}
> pct-encoded   = "%" HEXDIG HEXDIG
> unreserved    = ALPHA / DIGIT / "-" / "." / "_" / "~"
> sub-delims    = "!" / "$" / "&" / "'" / "(" / ")" / "*" / "+" / "," / ";" / 
> "="
> pchar         = unreserved / pct-encoded / sub-delims / ":" / "@"
> fragment    = *( pchar / "/" / "?" ) {code}
> Note that URIBuilder in httpclient 4.5.13 conforms to at least the old 
> RFC2396, as it uses URIC set of safe characters (i.e. this is in fact a 
> regression).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@hc.apache.org
For additional commands, e-mail: dev-h...@hc.apache.org

Reply via email to