Re: Parsing a user-entered localized datetime

Denis Steckelmacher Fri, 12 Apr 2013 12:24:01 -0700

On 11/04/2013 19:17, John Layt wrote :


We do support a "FancyDate" parsing style in QLocale::readDate(), but
it is very limited to things like "Yesterday" and "Monday".  There are
no plans to extend our fancy date support at this time as it would be
very hard to get right in a generic way, besides which kdelibs is
frozen until KF5.  In the future (Qt5/KF5) we may move localization to
using ICU which doesn't offer any such feature so we would need a new
one class for this.

A new class for parsing "Relative Dates" separate from the existing
date parsing code would make the most sense.  This would just take
strings and guess a rough time period.  I do think it will be very
hard writing generic code that works for every language that we
support, you should talk to the translators about this, especially
Chusselove.  I know the Fuzzy Clock tried hard to find a way to output
dates in a similar way but it ended up requireing lots fo manual work
for each new language.

As Kevin mentions, we store our default locale settings in the
entry-desktop files at
http://quickgit.kde.org/?p=kde-runtime.git&a=tree&f=l10n [1] .  You
can have a default value for a setting that is used by all languages,
but then also language specific versions of each setting if needed. 
Alternatively you can use the standard i18n() calls.

Good luck :-)

John.

I've looked at KCalendarSystem and it seems that every calendar systemis builtaround some sorts of days, months and years. It simplifies things abit, it wouldhave been difficult to handle things like "two seasons ago" in specialcalendar

systems.

I like your idea of a dedicated "relative date" class. In fact, Ithought about aHumanDateParser class, that reads locale-specific parser rules (Iimagined themto be stored in XML files, as they are very easy to read using Qt, andsomethingmore rich that i18nc calls is required, except if you want translatorsto have totranslate things like"day(s)[1],week(s)[7],month(s)[31:(January,...)"), and use

them to parse strings.

Yesterday, I tried to note down what I consider are the strings that aparsershould be able to parse. If <period> is any word in day, week, month,year andtheir plural forms, and <day of week> is the name of a day of the week,it shouldbe feasible to parse "<number> <period> ago" (3 weeks ago), "next<period>"(next week), "last <period>|<day of week>" (last week, last year, lastMonday), orsomething more fancy like "first Thursday of May". Shortcuts can begiven, forinstance "tomorrow". I don't know of these rules have to be regularexpressions,as some languages may separate words differently or use complexexpression rules.

The parser rules will list the rules recognized by a given language ina givencalendar system, and provide parsing clues. For instance, somesentences typicallyrefer to a future event (next Friday, or even "in May"), while otherscan beunderstood as a past tense or a future tense, depending on theapplication's context(Dolphin is used to search files that exist, not that will exist in twoweeks).

Finally, the parsing would consists of finding parts of the string thatmatch onerule. The first match would be taken. When a date has been found, itsmatchingportion of the string is removed and a time is looked for. I hope thiscould makeit possible to parse strings like "Last Monday on 8 pm", without havingto worryabout the "on" word, that every user will place differently or replacewith a

comma or any other thing.

Denis Steckelmacher.

(on a side note, I have already written a parser matching only parts ofhuman-writtencontent. It extracted quantity information from strings like "2 bottlesof 1 l of milk"and was able to guess nearly 90% of the quantities. The Human likes towrite valuableinformation in recognizable ways, even if there are words between them.For instance,the "2" in my example is the only number not followed by a unit, and "1l" can only

mean "one liter". So, the algorithm found 2x 1 liter)

Re: Parsing a user-entered localized datetime

Reply via email to