Re: RFR 8177552: Compact Number Formatting support

naoto . sato Mon, 26 Nov 2018 07:32:53 -0800

Hi Nishit,

On 11/26/18 12:41 AM, Nishit Jain wrote:

Hi Naoto,
To add to my previous mail comment, the DecimalFormat spec also says that
/*"DecimalFormat can be instructed to format and parse scientificnotation only via a pattern; there is currently no factory method thatcreates a scientific notation format. In a pattern, the exponentcharacter immediately followed by one or more digit characters indicatesscientific notation. "
*/That is, exponent formatting and parsing is instructed only via ascientific notation pattern and I think should not be there with*general number* formatting.

I am not sure the quoted sentence should be interpreted that way. Myunderstanding is that the section means there is no publicNumberFormat.getScientificInstance() method (cf. line 601 atNumberFormat.java), so that users will have to use 'E' in their patternstring.

Anyway, my point is that if you prefer to treat the scientific notationdifferently between DecimalFormat and CompactDecimalFormat, then it willneed to be clarified in the spec. Personally I agree that it is notpractical to interpret E in the CNF.


Naoto

Updated webrev based on the other comments

http://cr.openjdk.java.net/~nishjain/8177552/webrevs/webrev.02/

 > Some more comments (all in CompactNumberFormat.java)
> line 807: expandAffix() seems to treat localizable special patterncharacters, but currently the implementation only cares for the minussign. Should other localizable pattern chars be taken care of, such aspercent sign?- Other special characters like '%' percent sign are not allowed as perCNF compact pattern spec
 > line 869, 888: Define what -1 means as a ret value.
- OK.
> line 897: iterMultiplier be better all capitalized as it is aconstant. And it could be statically defined in the class to be sharedwith other locations that use "10" for arithmetic operation.
- OK, made it static final and renamed it as RANGE_MULTIPLIER

 > line 1531: Any possibility this could lead to divide-by-zero?
- None which I am aware of, unless you are pointing to the issue likeJDK-8211161, which we know is not an issue.
Regards,
Nishit Jain
On 23-11-2018 15:55, Nishit Jain wrote:
Hi Naoto,
> I think DecimalFormat and CNF should behave the same, ie. 'E' shouldbe treated as the exponent without a quote.
Personally I don't think that the exponential parsing should besupported by CompactNumberFormat, because the objective of compactnumbers is to represent numbers in short form. So, parsing of numberformat like "1.05E4K" should not be expected from CompactNumberFormat,I am even doubtful that such forms ("1.05E4K") are used anywhere whereexponential and compact form are together used. If formatting andparsing of exponential numbers are needed it should be done byDecimalFormat scientific instance *not *with the general numberinstance.So, I don't think that we should allow parsing of exponentialnumbers.Comments welcome.
Regards,
Nishit Jain
On 22-11-2018 02:02, [email protected] wrote:
Hi Nishit,

On 11/21/18 12:53 AM, Nishit Jain wrote:
Hi Naoto,

Updated the webrev based on suggestions

http://cr.openjdk.java.net/~nishjain/8177552/webrevs/webrev.01/

Changes made:
- Replaced List<String> with String[] to be added to the theresource bundles
Good.
- refactored DecimalFormat.subparse() to be used by the CNF.parse(),to reduce code duplication.
I presume CNF is calling package-private methods in DF to share thesame code. Some comments noting the sharing would be helpful.
- Also updated it with other changes as suggested in the comments
Sorry I missed your question the last time:
>>> Do you think this is an issue with DecimalFormat.parse() and CNF
>>> should avoid parsing exponential numbers? Or, should CNF.parse() be
>>> modified to be consistent with DecimalFormat.parse() in this aspect?
I think DecimalFormat and CNF should behave the same, ie. 'E' shouldbe treated as the exponent without a quote.
Some more comments (all in CompactNumberFormat.java)
line 807: expandAffix() seems to treat localizable special patterncharacters, but currently the implementation only cares for the minussign. Should other localizable pattern chars be taken care of, suchas percent sign?
line 869, 888: Define what -1 means as a ret value.
line 897: iterMultiplier be better all capitalized as it is aconstant. And it could be statically defined in the class to beshared with other locations that use "10" for arithmetic operation.
line 1531: Any possibility this could lead to divide-by-zero?

Naoto
Regards,
Nishit Jain
On 20-11-2018 00:33, [email protected] wrote:
Hi Nishit,

On 11/18/18 10:29 PM, Nishit Jain wrote:
Hi Naoto,

Please check my comments inline.

On 17-11-2018 04:52, [email protected] wrote:
Hi Nishit,

Here are my comments:
- CLDRConverter: As the compact pattern no more employsList<String>, can we eliminate stringListEntry/Element, and useArray equivalent instead?
Since the CNF design does not put any limit on the size of compactpattern, so at the time of parsing the CLDR xmls using SAX parser,it becomes difficult to identify the size of array when the parentelement of compact pattern is encountered, so I think it is betterto keep the List<String> while extracting the resources.
OK. However I'd not keep the List<String> format on generating theresource bundle, as there is no reason to introduce yet anotherbundle format other than the existing array of String.
- CompactNumberFormat.java

Multiple locations: Use StringBuilder instead of StringBuffer.
OK
line 268: The link points toNumberFormat.getNumberInstance(Locale) instead of DecimalFormat
OK. Changed it at line 165 also.
line 855: no need to do toString(). length() can detect whetherit's empty or not.
line 884: "Overloaded method" reads odd here. I'd preferspecializing in the "given number" into either long or biginteger.
OK
line 1500: subparseNumber() pretty much shares the same code withDecimalFormat.subparse(). can they be merged?
The existing CNF.subParseNumber differs in the wayparseIntegerOnly is handled, DecimalFormat.parse()/subparse()behaviour is unpredictable with parseIntegeronly = true whenmultipliers are involved (Please see JDK-8199223).
Also, I had thought that the CNF.parse()/subparseNumber() should*not *parse the exponential notation e.g. while parsing "1.05E4K"the parsing should break at 'E' and returns 1.05, because 'E'should be considered as unparseable character for general numberformat pattern or compact number pattern, but this is not the casewith DecimalFormat.parse(). The below DecimalFormat general numberformat instance
NumberFormat nf =  NumberFormat.getNumberInstance();
nf.parse("1.05E4")
Successfully parse the string and returns 10500. The samebehaviour is there with other DecimalFormat instances also e.g.currency instance.
Do you think this is an issue with DecimalFormat.parse() and CNFshould avoid parsing exponential numbers? Or, should CNF.parse()be modified to be consistent with DecimalFormat.parse() in thisaspect?
No, I understand there are differences. But I see a lot ofduplicated piece of code which I would like to eliminate.
line 1913-1923, 1950-1960, 1987-1997, 2024-2034: It simply callssuper. No need to override them.
Since setters are overridden, I think that it is better tooverride getters also (even if they are just calling super andhave same javadoc) to keep them at same level. But, if you see nopoint in keeping them in CNF, I will remove them. Does that needCSR change?
I don't see any point for override. I don't think there needs aCSR, but better ask Joe about it.
line 2231: You need to test the type before cast. OtherwiseClassCastException may be thrown.
The type is checked in the superclass equals method getClass() !=obj.getClass(), so I think there is no need to check the type here.
OK.

Naoto
Regards,
Nishit Jain
Naoto

On 11/16/18 9:54 AM, Nishit Jain wrote:
Hi,
Please review this non trivial feature addition to NumberFormatAPI.
The existing NumberFormat API provides locale based support forformatting and parsing numbers which includes formattingdecimal, percent, currency etc, but the support for formatting anumber into a human readable or compact form is missing. ThisRFE adds that feature to format a decimal number in a compactformat (e.g. 1000 -> 1K, 1000000 -> 1M in en_US locale) , whichis useful for the environment where display space is limited, sothat the formatted string can be displayed in that limitedspace. It is defined by LDML's specification for Compact NumberFormats.
http://unicode.org/reports/tr35/tr35-numbers.html#Compact_Number_Formats
RFE: https://bugs.openjdk.java.net/browse/JDK-8177552
Webrev:http://cr.openjdk.java.net/~nishjain/8177552/webrevs/webrev.00/
CSR: https://bugs.openjdk.java.net/browse/JDK-8188147

Request to please help review the the change.

Regards,
Nishit Jain

Re: RFR 8177552: Compact Number Formatting support

Reply via email to