bruns added inline comments.

INLINE COMMENTS

> michaelh wrote in epubextractor.cpp:85
> I think we should port away from libepub. Multiple titles result in one 
> ';'-joined string.
> Also it seems to be unmaintained.

The joined titles is the fault of this epubextractor AFAICS - see 
fetchMetadataString

> michaelh wrote in epubextractor.cpp:97
> Right, this inconsistency is intentional, and it needs discussion. That's why 
> I added a comment in D12197 <https://phabricator.kde.org/D12197> which was 
> probably overlooked.
> DC and IDPF aren't very clear on how to use `dc:subject`. Calibre interprets 
> it as tags, My impression is, that most provider also do. Hence I prefer to 
> use `Property::Keywords` only because it comes closest imo. That change would 
> not really be breaking as currently `Property::Subject` is one large string 
> joined with ';'.

Distinction between Subject and Keywords typically is keywords are just a bunch 
of words without further specification, while subject, as specified by DC, and 
as used by e.g. libraries, are taken from filed specific catalogs.

Baloos properties documentation specifically mentions dc:subject for 
Properties:Subject.

One of the file formats which has both, keywords and subject, is ODF, which 
uses dc:subject and meta:keywords.

DC specifies for //any// property:

> Recommendation 5. Multiple property values should be encoded by repeating the 
> XML element for that property.

My opinion is to **always** use Properties::Subject for dc:subject (as 
documented for baloo), and add each property instance individually. If 
properties are already messed up in the originating document, there is nothing 
we can do, but we should not make things worse.

REPOSITORY
  R286 KFileMetaData

BRANCH
  multi-subject

REVISION DETAIL
  https://phabricator.kde.org/D10694

To: michaelh, mgallien, dfaure
Cc: bruns, astippich, #frameworks, ashaposhnikov, michaelh, spoorun, 
navarromorales, isidorov, firef, andrebarros, emmanuelp

Reply via email to