Hi,

There are mails about slow parsing of JML in July 2017 and I experienced slow 
parsing of dates from Spatialite in February 2019, discussion is also on the 
mailing list. It seems there is a need to test your fixes also with JML and 
SpatiaLite, not only with shapefiles.

-Jukka-

Lähettäjä: ede <e...@users.sourceforge.net>
Lähetetty: keskiviikko 19. elokuuta 2020 11.26
Vastaanottaja: [jump-pilot:bugs] <4...@bugs.jump-pilot.p.re.sourceforge.net>
Aihe: [jump-pilot:bugs] Re: #497 Shapefile export slowed down because of 
FlexibleDateParser


On 19.08.2020 09:25, michael michaud wrote:

Indeed, from my tests, things are getting worst :

well. it shouldn't after all i'm trying to improve things :)

I save my dataset to a shapefile after it has been freshly extracted from a 
database (it contains a geometry and a timestamp)
1.15 : 1st save : 55s
2nd save < 1s> r6382 : 1st save : 188s!
2nd save < 1s

let me check. actually r6382 should be faster now. in my tests it is.

what i do is.
load your data as JML (exported earlier), as it is text based and needs to be 
parsed. then save it as SHP and compare times and FlexdateParser behaviour.

what's important is if the loader creates FlexibleFeatures or not. and even 
then, if then lazy conversion is optional, as the loader may of course set the 
attribute as an object of the Date class.

Another thing about your explanation : I remember that you introduced the 
LazyConversion of attributes with the json parser. I think the reason was 
related to the type system of json (which has only strings, numbers and 
boolean).

wrt. GeoJSON that's true. but later on someone noticed that opening JML was 
very slow and we found date parsing to be the root cause, so lazy conversion 
was introduced.

In the case of shapefile or database where data is precisely typed, I think the 
lazy conversion is finally very costly.

that's incorrect ;) brute forcing date formats is what is very_ costly here. 
lazy conversion is merely distributing cost from time A to time B. if we do not 
need to convert at some place we should remove conversion there.

Data should be stored with the right type at the first place. It would avoid 
all these useless conversions of dates. What do you think ? Seems that lazy 
conversion made for the json should stay the exception, not the general way to 
handle data types.

the concept is sound. there is something else goin on.

here check out the commit
https://sourceforge.net/p/jump-pilot/code/5482/

for testing purposes you could simply replace FlexibleFeature with the 
BasicFeature again in your local copy to disable lazy conversion and observe 
the difference.

what i can't find right now where JML/GML parses/parsed the date which made it 
slow. i'll have a look in the bug description again. i think it was Jukka via 
mailing list.

so far.. ede

________________________________

[bugs:#497]<https://sourceforge.net/p/jump-pilot/bugs/497/> Shapefile export 
slowed down because of FlexibleDateParser

Status: open
Created: Mon Aug 17, 2020 11:47 AM UTC by michael michaud
Last Updated: Tue Aug 18, 2020 09:42 PM UTC
Owner: michael michaud

Exporting a big dataset to shapefile (1 300 000 objets - around 225+115 Mb) 
with version 1.15 lasted more than 20 minutes.
Most of the time is used by FlexibleDateParser.parse().

________________________________

Sent from sourceforge.net because you indicated interest in 
https://sourceforge.net/p/jump-pilot/bugs/497/

To unsubscribe from further messages, please visit 
https://sourceforge.net/auth/subscriptions/

________________________________

[bugs:#497]<https://sourceforge.net/p/jump-pilot/bugs/497/> Shapefile export 
slowed down because of FlexibleDateParser

Status: open
Created: Mon Aug 17, 2020 11:47 AM UTC by michael michaud
Last Updated: Wed Aug 19, 2020 07:39 AM UTC
Owner: michael michaud

Exporting a big dataset to shapefile (1 300 000 objets - around 225+115 Mb) 
with version 1.15 lasted more than 20 minutes.
Most of the time is used by FlexibleDateParser.parse().

________________________________

Sent from sourceforge.net because you indicated interest in 
https://sourceforge.net/p/jump-pilot/bugs/497/

To unsubscribe from further messages, please visit 
https://sourceforge.net/auth/subscriptions/



---

** [bugs:#497] Shapefile export slowed down because of FlexibleDateParser**

**Status:** open
**Created:** Mon Aug 17, 2020 11:47 AM UTC by michael michaud
**Last Updated:** Wed Aug 19, 2020 10:29 AM UTC
**Owner:** michael michaud


Exporting a big dataset to shapefile (1 300 000 objets - around 225+115 Mb) 
with version 1.15 lasted more than 20 minutes.
Most of the time is used by FlexibleDateParser.parse().


---

Sent from sourceforge.net because jump-pilot-devel@lists.sourceforge.net is 
subscribed to https://sourceforge.net/p/jump-pilot/bugs/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/jump-pilot/admin/bugs/options.  Or, if this is a 
mailing list, you can unsubscribe from the mailing list.
_______________________________________________
Jump-pilot-devel mailing list
Jump-pilot-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/jump-pilot-devel

Reply via email to