Hi Erick,
Thinking some more about the differences between the two sort orders has
suggested another possibility. We also have a geospatial field defined in the
index:
echo "$(date) Creating geoLocation field"
curl -X POST -H 'Content-type:application/json' --data-binary '{
  "add-field":{
    "name":"geoLocation",
    "type":"location",
    "stored":true,
    "indexed":true
  }
}' http://localhost:8983/solr/address/schema
One difference between the two sort orders is that when the data is sorted by
locality and postcode, addresses that are close to each other are sorted
together, as both locality and postcode have geographic meaning. So when they
are indexed, they are indexed in groups of addresses that are quite near to
each other.
When the data is sorted by DPID, the order is near random, as the DPID has no
meaning at all, so the geo location sequence should be near random as well.
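For anyone who wants to check this idea before re-indexing, here is a rough
sketch that measures how geographically clustered an input file is. It
assumes a headerless CSV of "lat,lon" pairs in index order, which is a
hypothetical export format and not something we currently produce, and it
uses the sum of absolute degree differences as a crude stand-in for real
distance. The function name mean_step and the sample coordinates are made up
for illustration.

```shell
# Mean step between consecutive points read from stdin.
# Input: one "lat,lon" pair per line, no header (assumed format).
# Metric: |dlat| + |dlon| in degrees, a crude proxy, not a true distance.
mean_step() {
  awk -F',' '
    NR > 1 {
      dlat = $1 - plat; if (dlat < 0) dlat = -dlat
      dlon = $2 - plon; if (dlon < 0) dlon = -dlon
      total += dlat + dlon
      steps++
    }
    { plat = $1; plon = $2 }   # remember this point for the next line
    END {
      if (steps > 0) printf "%.4f\n", total / steps
      else print "0.0000"
    }
  '
}

# Made-up sample: three neighbouring points, as a locality sort might give.
printf '%s\n' '-37.8100,144.9600' '-37.8110,144.9610' '-37.8120,144.9620' | mean_step

# Made-up sample: points that jump between cities, as a DPID sort might give.
printf '%s\n' '-37.81,144.96' '-33.87,151.21' '-37.82,144.97' | mean_step
```

If the hypothesis holds, the file sorted by locality and postcode should give
a much smaller number than the file sorted by DPID. That would only show that
the input order differs geographically, not that it explains the performance
difference.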
I don't have time to test this at the moment, as I need to get my project back
on track after chasing this performance issue, but it might ring a bell with
somebody.
Regards,
David
David Howe
Java Domain Architect
Postal Systems