Hi Karen,
I had described my setup already a couple of weeks ago I think. I used
my BNS emulator, emubns, and connected it to a virtual machine running
SvarDOS. The SvarDOS system has a screen reader that outputs speech in
the BNS format. This BNS speech is received by emubns, who relays it for
actual speech generation to Piper. All this runs virtualized at this
moment, so it is likely not a setup that is desirable for blind people.
However, the exact same result can be obtained by running emubns and
Piper on a Raspberry Pi. In such setup, the DOS system would run on a
hardware PC and output speech to the Raspberry Pi connected via a
standard serial cable. In essence the Raspberry Pi would act as a dumb
speech generator. This, I believe, would be much more user-friendly
since no exotic configuration would be required other than flashing the
Raspberry Pi with a suitable image. I plan to prepare such image, but
have yet to find the time to look into this. Not something very high on
my list of priorities since I am almost sure that I would be the only
user of such system, but still something I'd like to get done eventually.
I believe you use a hardware DECTalk speech generator to make your DOS
talk. To make it simpler to understand, just imagine that instead of
this DECTalk device there is a Raspberry Pi box. The RPi would run
Linux, emubns and piper, but all this would be hidden from "normal"
users, who would consider the Raspberry as a black box that talks when
connected to a DOS PC.
Mateusz
On 19/11/2024 06:18, Karen Lewellen wrote:
Hi Mateusz,
Finally had a moment to check out your piper in DOS sample.
If I am forthright, that was frankly stunning.
granted, I imagine many factors impacted that sound quality, but I have
questions about your process, both how it differs because Piper is in
Linux <what does that mean exactly?> and what you used for the output
for that sample, sound card for example.
There is a discussion of Piper for Linux accessibility, one thing
profoundly missing, speaking personally, is the ease of adjusting basic
things like rate, pitch, inflection, my understanding from command line
Linux users is that those options are not solid. certainly not like the
many DOS screen reader packages.
before I get exacted about a possible safe for my use tts, I am
interested in your process?
Thanks,
Karen
On Wed, 16 Oct 2024, Mateusz Viste via Freedos-user wrote:
On 15/10/2024 17:12, Karen Lewellen via Freedos-user wrote:
Other factor speaking personally with espeak is the largely poor speech
quality.
I experimented a bit with emubns today. Ended up using "piper" instead
of espeak. The speech quality is outstanding now, almost lifelike. I
was able to install SvarDOS using only my ears and fingers and with
the natural voice generated by Piper it was almost a pleasant experience.
Piper is Linux-only, but that's not an issue in the context of running
emubns as a DOS-compatible hardware synthesizer on a Raspberry Pi.
Another problem is that Provox pronounces every period, comma, colon
and parenthesis, which becomes quickly annoying. Perhaps it can be
configured somehow, I have yet to find out. It's all new to me, I have
never used a screen reader before.
I have no idea what the pi would provide speech wise, nor the cost
factor.
With piper integration the speech quality is excellent. The cost
factor is the cost of a Raspberry Pi 3 device and a serial-to-USB
cable. That is, in theory, because I did not test it on real hardware
yet.
Mateusz
_______________________________________________
Freedos-user mailing list
Freedos-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/freedos-user
_______________________________________________
Freedos-user mailing list
Freedos-user@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/freedos-user