[ 
https://issues.apache.org/jira/browse/LUCENE-7179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15226801#comment-15226801
 ] 

Robert Muir commented on LUCENE-7179:
-------------------------------------

{quote}
But you have to remember that GeoPointField works by way of a quad tree 
represented in unsigned long space. This isn't "quantization" for memory/disk 
purposes, it's a dimensionality reduction technique. 
{quote}

This is unrelated to what I am talking about. I am only talking about the 
truncation of the data provided by the user. 

All encoding around this stuff needs to be stable, and I cannot compromise 
on this: we must be able to encode/decode 1 time or 10,000 times and get the 
same result. If we don't handle the rounding and overflow issues, then 
encoded integers aren't sorted in a meaningful way in integer or binary space, 
which means things are buggy. 
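
A minimal sketch of the stability property described above, assuming a 
hypothetical 32-bit, floor-based latitude encoding. The class name, method 
names, and scale constant are illustrative assumptions and not Lucene's API; 
the point is only that re-encoding an already decoded value should be a fixed 
point, and that the maximum value needs explicit overflow handling.

```java
final class StableLatEncoding {
    // Assumed scale: 180 degrees spread over 2^32 integer steps.
    // Dividing by a power of two keeps this constant exact in double arithmetic.
    static final double DECODE = 180.0 / (1L << 32);

    // Quantize a latitude in [-90, 90] to a signed 32-bit integer.
    static int encode(double lat) {
        if (Double.isNaN(lat) || lat < -90.0 || lat > 90.0) {
            throw new IllegalArgumentException("invalid latitude: " + lat);
        }
        // 90.0 would map to 2^31, one past Integer.MAX_VALUE, so step just below it.
        if (lat == 90.0) {
            lat = Math.nextDown(lat);
        }
        return (int) Math.floor(lat / DECODE);
    }

    // Map an encoded integer back to the lower edge of its quantization cell.
    static double decode(int encoded) {
        return encoded * DECODE;
    }

    // True if repeated decode/encode round trips never change the encoded value.
    static boolean isStable(double lat, int rounds) {
        int first = encode(lat);
        int current = first;
        for (int i = 0; i < rounds; i++) {
            current = encode(decode(current));
            if (current != first) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        double[] samples = {-90.0, -45.123456789, 0.0, 37.7749, 90.0};
        for (double lat : samples) {
            System.out.println(lat + " stable=" + isStable(lat, 10_000));
        }
    }
}
```

In this sketch the round trip is stable because the decoded value is an exact 
multiple of the scale constant, so dividing by that constant recovers the 
integer exactly. An encoding whose scale is not exact, or that truncates 
instead of flooring, would need the same check to confirm it.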

> GeoPoint and LatLonPoint test data should quantize once
> -------------------------------------------------------
>
>                 Key: LUCENE-7179
>                 URL: https://issues.apache.org/jira/browse/LUCENE-7179
>             Project: Lucene - Core
>          Issue Type: Bug
>            Reporter: Nicholas Knize
>         Attachments: LUCENE-7179.patch
>
>
> {{LatLonPoint}} and {{GeoPointField}} tests pre-quantize test data to ensure 
> consistency with indexed (encoded) data. The pre-quantized data is then 
> indexed, undergoing a second quantization. To guarantee numerical stability, 
> this should be changed so that the test data is quantized after indexing.



