I tested the same example sentence with Ubuntu 18.04 and LibreOffice
6.0.3.2. Here’s the output from pdftotext:

ه‬
اشترى للا خمسة آفا كتاب وَأنَا اشْ ت َ َريْتُهَا ِ‬
من ْ ُ‬

Here four out of the eight words are intact, so it’s an improvement over
5.4.6 but still leaves a lot to be desired. The last word of the sentence
(مِنْهُ) is broken into pieces so that the last full character ه is
found on the first line and the other two on the last. Diacritical
marks are sometimes placed where they are supposed to be (such as the first
and the last three diacritics in the word اشْتَرَيْتُهَا) but sometimes
not (the middle of the same word and the last word of the sentence
مِنْهُ). This time ى is visible but the first letter of the following
word ب is not.

Here’s what pdftotext extracts from a PDF created with MS Word 2007
(12.0.6787.5000, SP3 MSO 12.0.6785.5000) on Windows 8.1:

اشترى بالل خمسة آالف كتاب وأنا اشتريتها منه‬

So Word 2007 drops all the diacritics, and mixes up the order of the
letters in the combination ل (U+0644) + ا (U+0627) producing ال instead
of لا. Otherwise the output is intact and definitely much better than
LibreOffice's. I don't have any newer versions of MS Word at my disposal,
so I can't test this further.
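
The two Word 2007 observations (dropped diacritics, swapped lam and alef)
can be checked mechanically. The following is only a minimal Python sketch
and was not part of the test above: the helper names strip_diacritics and
codepoints are made up for illustration, and the vocalized word is written
as escapes on the assumption that it uses the usual Arabic code points.

```python
import unicodedata


def strip_diacritics(text):
    """Drop combining marks; Arabic short vowels and sukun are category Mn."""
    return "".join(ch for ch in text if unicodedata.category(ch) != "Mn")


def codepoints(text):
    """List the code points of a string in logical (stored) order."""
    return ["U+%04X" % ord(ch) for ch in text]


LAM, ALEF = "\u0644", "\u0627"

# Assumed spelling of the vocalized word discussed above (hypothetical
# transcription into escapes, not copied from the PDF itself).
vocalized = ("\u0627\u0634\u0652\u062A\u064E\u0631\u064E"
             "\u064A\u0652\u062A\u064F\u0647\u064E\u0627")

# What remains when every diacritic is dropped, as in the Word output.
bare = strip_diacritics(vocalized)
print(codepoints(bare))

# Expected order: lam, then alef.
print(codepoints(LAM + ALEF))   # ['U+0644', 'U+0627']
# Order seen in the Word output: alef, then lam.
print(codepoints(ALEF + LAM))   # ['U+0627', 'U+0644']
```

Comparing code points rather than rendered glyphs avoids being misled by
the terminal's own bidirectional reordering and ligature shaping.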

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1772439

Title:
  Arabic text gets deformed when creating a PDF in LibreOffice Writer
