Rocksolid Light

Welcome to novaBBS (click a section below)

mail  files  register  newsreader  groups  login

Message-ID:  

Being overloaded is the sign of a true Debian maintainer. -- JHM on #Debian


computers / comp.text.pdf / Convert book pdf to an audio podcast to send as an email attachment

SubjectAuthor
* Convert book pdf to an audio podcast to send as an email attachmentPeter
+* Re: Convert book pdf to an audio podcast to send as an emailChris
|`* Re: Convert book pdf to an audio podcast to send as an email attachmentPeter
| `* Re: Convert book pdf to an audio podcast to send as an emailChris
|  `* Re: Convert book pdf to an audio podcast to send as an email attachmentPeter
|   +* Re: Convert book pdf to an audio podcast to send as an emailknuttle
|   |+- Re: Convert book pdf to an audio podcast to send as an emailPaul
|   |`* Re: Convert book pdf to an audio podcast to send as an email attachmentPeter
|   | `- Re: Convert book pdf to an audio podcast to send as an emailPaul
|   +- Re: Convert book pdf to an audio podcast to send as an emailMikeS
|   `- Re: Convert book pdf to an audio podcast to send as an emailkelown
+* Re: Convert book pdf to an audio podcast to send as an emailPaul
|`* Re: Convert book pdf to an audio podcast to send as an email attachmentPeter
| `* Re: Convert book pdf to an audio podcast to send as an emailRetirednoguilt
|  `* Re: Convert book pdf to an audio podcast to send as an email attachmentPeter
|   `- Re: Convert book pdf to an audio podcast to send as an emailPaul
+- Re: Convert book pdf to an audio podcast to send as an emailknuttle
+* Re: Convert book pdf to an audio podcast to send as an email attachmentTheo
|`- Re: Convert book pdf to an audio podcast to send as an emailChris
`- Re: Convert book pdf to an audio podcast to send as an email attachmentmilsabords

1
Convert book pdf to an audio podcast to send as an email attachment

<u675kg$34cnq$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=163&group=comp.text.pdf#163

  copy link   Newsgroups: alt.comp.os.windows-10 comp.mobile.android comp.text.pdf
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: occassio...@nospam.co.uk (Peter)
Newsgroups: alt.comp.os.windows-10,comp.mobile.android,comp.text.pdf
Subject: Convert book pdf to an audio podcast to send as an email attachment
Date: Mon, 12 Jun 2023 14:13:20 +0100
Organization: -
Lines: 11
Message-ID: <u675kg$34cnq$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Injection-Date: Mon, 12 Jun 2023 13:12:48 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="568e3dbed67d641ccf968b8be76dfbc7";
logging-data="3289850"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+shhm+nGUrDVQC8lyWQ8BD"
Cancel-Lock: sha1:fAdJ/A6r8uETdZhxTzaCu8/2/RY=
X-Newsreader: Forte Agent 3.3/32.846
X-No-Archive: yes
 by: Peter - Mon, 12 Jun 2023 13:13 UTC

I want to help an older person with severe macular degeneration from
thousands of miles afar where she could use an AUDIO file of long text
(like of a book or short story, normally, as in Great Books stories).

I'm on Windows 10 and Android 12. She's on the iPhone and iPads.

I can easily save a web page (like a news story) to text and sometimes I
can convert a PDF of a book to text (depending on how the PDF was made).

But how can I turn the text into a "podcast" for her to listen to?
And would it be too large to email to her in a typical email message?

Re: Convert book pdf to an audio podcast to send as an email attachment

<u67kqs$365rm$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=164&group=comp.text.pdf#164

  copy link   Newsgroups: alt.comp.os.windows-10 comp.mobile.android comp.text.pdf
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: ithink...@gmail.com (Chris)
Newsgroups: alt.comp.os.windows-10,comp.mobile.android,comp.text.pdf
Subject: Re: Convert book pdf to an audio podcast to send as an email
attachment
Date: Mon, 12 Jun 2023 17:32:12 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 17
Message-ID: <u67kqs$365rm$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Mon, 12 Jun 2023 17:32:12 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="e9e90064be48674c95590e393ea25f21";
logging-data="3348342"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+lOXKVRAE1AWQ7pEDzqXyQtaSowtQwa6I="
User-Agent: NewsTap/5.5 (iPhone/iPod Touch)
Cancel-Lock: sha1:tKYtAOXN+xCdKH+FX1+f6FGwjZE=
sha1:yMcFVnlHC0DzrDMAdG2u6Lv3ChU=
 by: Chris - Mon, 12 Jun 2023 17:32 UTC

Peter <occassionally-confused@nospam.co.uk> wrote:
> I want to help an older person with severe macular degeneration from
> thousands of miles afar where she could use an AUDIO file of long text
> (like of a book or short story, normally, as in Great Books stories).
>
> I'm on Windows 10 and Android 12. She's on the iPhone and iPads.
>
> I can easily save a web page (like a news story) to text and sometimes I
> can convert a PDF of a book to text (depending on how the PDF was made).
>
> But how can I turn the text into a "podcast" for her to listen to?
> And would it be too large to email to her in a typical email message?

iOS has very good accessibility features. I suspect it already can read any
pdf you provide.

https://support.apple.com/en-gb/guide/iphone/iph3e2e415f/ios

Re: Convert book pdf to an audio podcast to send as an email attachment

<u67lce$367o7$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=165&group=comp.text.pdf#165

  copy link   Newsgroups: alt.comp.os.windows-10 comp.mobile.android comp.text.pdf
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: nos...@needed.invalid (Paul)
Newsgroups: alt.comp.os.windows-10,comp.mobile.android,comp.text.pdf
Subject: Re: Convert book pdf to an audio podcast to send as an email
attachment
Date: Mon, 12 Jun 2023 13:41:33 -0400
Organization: A noiseless patient Spider
Lines: 59
Message-ID: <u67lce$367o7$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Mon, 12 Jun 2023 17:41:34 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="5bbf700f6acacfbb3cc542aa90dc69c4";
logging-data="3350279"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19oCSV4rrwinaWMFVS27t/RmbasBwwuhSE="
User-Agent: Ratcatcher/2.0.0.25 (Windows/20130802)
Cancel-Lock: sha1:7tcgX9eeikPyqoeZsBOghf9nskg=
Content-Language: en-US
In-Reply-To: <u675kg$34cnq$1@dont-email.me>
 by: Paul - Mon, 12 Jun 2023 17:41 UTC

On 6/12/2023 9:13 AM, Peter wrote:
> I want to help an older person with severe macular degeneration from
> thousands of miles afar where she could use an AUDIO file of long text
> (like of a book or short story, normally, as in Great Books stories).
>
> I'm on Windows 10 and Android 12. She's on the iPhone and iPads.
>
> I can easily save a web page (like a news story) to text and sometimes I
> can convert a PDF of a book to text (depending on how the PDF was made).
>
> But how can I turn the text into a "podcast" for her to listen to?
> And would it be too large to email to her in a typical email message?
>

You will need:

1) Copy of Audacity or a similar audio recorder.
2) Male to male audio cable. Connect Line Out to Line In.
This step, is to avoid fighting with "What You Hear" setup,
which could waste half the afternoon.
3) Run Firefox.
4) In Windows Mixer, set Windows Sounds (beeps and boops) to zero.
This is to avoid the Windows Defender toast notification or a
You Have Mail ding, from ruining your recording.

Now, use Firefox Reader View, open your text file.

[Picture]

https://i.postimg.cc/DzhXyywq/firefox-reader-view-TTS.gif

After listening to a half dozen other TTS implementations
that suck, that one is almost bearable.

With some programming, as Newyana2 describes, you can do better
than that, but then it's a real long project. This method
may not be efficient, but you also don't have to be a programmer.

The Firefox page should auto-scroll, as it plays the text. I do not
know how long a web page (or text file) can be, and whether your attempt to
convert War & Peace to sound, is going to work.

I may have recorded for 16 hours before with Audacity, but that
was on the other, smaller machine. Make sure all the steps of
recording and saving are working on Audacity, before your run.
You would not want to generate all those tiny sound files,
and have it croak before saving.

I think Windows also has an "improved" sound recorder, which
you might want to research. That's if you can't get Audacity
to work (which... happens sometimes). The namespace for audio
is a bitch. An "improved" sound recorder, is one for which
you don't have to "define a fixed size audio file" before
you begin recording. Some of the Windows demo apps have
been absurd that way. The damn thing should just record,
and then you do the Save As, and done. Check that it works
that way.

Paul

Re: Convert book pdf to an audio podcast to send as an email attachment

<u6843a$384c4$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=166&group=comp.text.pdf#166

  copy link   Newsgroups: alt.comp.os.windows-10 comp.mobile.android comp.text.pdf
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: keith_nu...@yahoo.com (knuttle)
Newsgroups: alt.comp.os.windows-10,comp.mobile.android,comp.text.pdf
Subject: Re: Convert book pdf to an audio podcast to send as an email
attachment
Date: Mon, 12 Jun 2023 17:52:41 -0400
Organization: A noiseless patient Spider
Lines: 26
Message-ID: <u6843a$384c4$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Mon, 12 Jun 2023 21:52:42 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="7cfa44cdc3a5712124d7accf9bb90687";
logging-data="3412356"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/5biPX2dd4S902qZJPsOoZ"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.12.0
Cancel-Lock: sha1:RuuZIgIwiFN4IhE+PhqsxQzVFWY=
Content-Language: en-US
In-Reply-To: <u675kg$34cnq$1@dont-email.me>
 by: knuttle - Mon, 12 Jun 2023 21:52 UTC

On 06/12/2023 9:13 AM, Peter wrote:
> I want to help an older person with severe macular degeneration from
> thousands of miles afar where she could use an AUDIO file of long text
> (like of a book or short story, normally, as in Great Books stories).
>
> I'm on Windows 10 and Android 12. She's on the iPhone and iPads.
>
> I can easily save a web page (like a news story) to text and sometimes I
> can convert a PDF of a book to text (depending on how the PDF was made).
>
> But how can I turn the text into a "podcast" for her to listen to?
> And would it be too large to email to her in a typical email message?
This is a feature of Adobe Reader that I have never used but know it
exist.

Open the document that you want in the Adobe Reader (including the Free
version.) In View, Select "Read Out Aloud". Activate the Read Aloud
function and select the option you want.

There should be a way to capture the verbal document to an audio file.
Possibly by using the record feature of the Cellphone.

I may not be the best way, but it could be used to convert the write
document to an audio documents. May take some Googling.

Re: Convert book pdf to an audio podcast to send as an email attachment

<u68ng6$3fr13$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=167&group=comp.text.pdf#167

  copy link   Newsgroups: alt.comp.os.windows-10 comp.mobile.android comp.text.pdf alt.comp.software.firefox
Followup: alt.comp.os.windows-10,comp.text.pdf,alt.comp.software.firefox
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: occassio...@nospam.co.uk (Peter)
Newsgroups: alt.comp.os.windows-10,comp.mobile.android,comp.text.pdf,alt.comp.software.firefox
Subject: Re: Convert book pdf to an audio podcast to send as an email attachment
Followup-To: alt.comp.os.windows-10,comp.text.pdf,alt.comp.software.firefox
Date: Tue, 13 Jun 2023 04:24:23 +0100
Organization: -
Lines: 31
Message-ID: <u68ng6$3fr13$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me> <u67kqs$365rm$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Injection-Date: Tue, 13 Jun 2023 03:23:50 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="d42b271e7ade67348151a7fed51c793d";
logging-data="3664931"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/4f9RFLF9e20AxFWpzUYB0"
Cancel-Lock: sha1:gFJyGYz2pRB7FCWDpa9RT+69aII=
X-No-Archive: yes
X-Newsreader: Forte Agent 3.3/32.846
 by: Peter - Tue, 13 Jun 2023 03:24 UTC

Chris <ithinkiam@gmail.com> wrote:
>> But how can I turn the text into a "podcast" for her to listen to?
>> And would it be too large to email to her in a typical email message?
>
> iOS has very good accessibility features. I suspect it already can read any
> pdf you provide.

Thanks for that suggestion where the main goal is one way for everything.
She will get super confused if there isn't one way for everything.

Not one way for this kind of PDF. Another for that kind of PDF.
And yet another for text. And another for HTML. And so on.

It has to be ONE way for EVERYTHING, where there's no way any program, no
matter how smart it is, can read out ALL PDFs (since many are bitmaps).

For the bitmaps, I was thinking of looking up OCR or, if it's short, just
reading it aloud to myself and recording it on my phone's voice recorder.

> https://support.apple.com/en-gb/guide/iphone/iph3e2e415f/ios

Thanks for that suggestion since she only has iOS iPhones & iPads.
And I don't.

I looked at that page which seems to be much like what Windows does.
Which means it will drive any normal person nuts with extraneous talking.

It has to be an audio of the text only and nothing else for it to work.
The firefox text-only speaking suggestion from Paul is along those lines.
--
Removed android and replaced it with Firefox since the solution is there.

Re: Convert book pdf to an audio podcast to send as an email attachment

<u68ni2$3fr4i$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=168&group=comp.text.pdf#168

  copy link   Newsgroups: alt.comp.os.windows-10 comp.mobile.android comp.text.pdf alt.comp.software.firefox
Followup: alt.comp.os.windows-10,comp.text.pdf,alt.comp.software.firefox
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: occassio...@nospam.co.uk (Peter)
Newsgroups: alt.comp.os.windows-10,comp.mobile.android,comp.text.pdf,alt.comp.software.firefox
Subject: Re: Convert book pdf to an audio podcast to send as an email attachment
Followup-To: alt.comp.os.windows-10,comp.text.pdf,alt.comp.software.firefox
Date: Tue, 13 Jun 2023 04:25:23 +0100
Organization: -
Lines: 103
Message-ID: <u68ni2$3fr4i$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me> <u67lce$367o7$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Injection-Date: Tue, 13 Jun 2023 03:24:50 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="d42b271e7ade67348151a7fed51c793d";
logging-data="3665042"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+jleSSMLT5VYsggtxcIMv6"
Cancel-Lock: sha1:eVpZJfsKH3EtKrU5Z4gpcH7V9Mg=
X-No-Archive: yes
X-Newsreader: Forte Agent 3.3/32.846
 by: Peter - Tue, 13 Jun 2023 03:25 UTC

Paul <nospam@needed.invalid> wrote:
> You will need:
>
> 1) Copy of Audacity or a similar audio recorder.

Absolutely no problem. I have Audacity on both Android & Windows.

> 2) Male to male audio cable. Connect Line Out to Line In.
> This step, is to avoid fighting with "What You Hear" setup,
> which could waste half the afternoon.

I can make one of those.

> 3) Run Firefox.

I can run that.

> 4) In Windows Mixer, set Windows Sounds (beeps and boops) to zero.
> This is to avoid the Windows Defender toast notification or a
> You Have Mail ding, from ruining your recording.

Windows Mixer? https://www.google.com/search?q=what+is+windows+mixer

OK. It's explained here for both Windows 10 and Windows 11.
https://www.asus.com/support/FAQ/1044702/
>
> Now, use Firefox Reader View, open your text file.
>
> [Picture]
>
> https://i.postimg.cc/DzhXyywq/firefox-reader-view-TTS.gif

Firefox reader view?
https://www.google.com/search?q=what+is+firefox+reader+view

OK. It's explained here.
https://support.mozilla.org/en-US/kb/firefox-reader-view-clutter-free-web-page
"Reader View is a Firefox feature that strips away clutter like buttons,
ads, background images, and videos while also letting you customize the
layout and theme to fit your reading preferences."
> After listening to a half dozen other TTS implementations
> that suck, that one is almost bearable.

Whatever Windows sound plays when I am in MerriamWebster seems fine to me.
https://www.merriam-webster.com/dictionary/otorhinolaryngologist

> With some programming, as Newyana2 describes, you can do better
> than that, but then it's a real long project. This method
> may not be efficient, but you also don't have to be a programmer.

I don't want to be a programmer since I can already convert most "things"
to text so that's why I wanted to turn text into something she can play.

> The Firefox page should auto-scroll, as it plays the text. I do not
> know how long a web page (or text file) can be, and whether your attempt to
> convert War & Peace to sound, is going to work.

Most things are about twenty pages as I've been converting them to text
already for her where she selects the text inside an email and plays it.

> I may have recorded for 16 hours before with Audacity, but that
> was on the other, smaller machine. Make sure all the steps of
> recording and saving are working on Audacity, before your run.
> You would not want to generate all those tiny sound files,
> and have it croak before saving.

I'm familiar with audacity. It can output MP3 which I assume is the
standard file format for all computers as it is for my phone & PC.
> I think Windows also has an "improved" sound recorder, which
> you might want to research. That's if you can't get Audacity
> to work (which... happens sometimes).

I can handle audacity. It's on both Android & Windows.
If it's on the iPad, maybe she can play the file in iOS Audacity?

When I looked up Audacity on the iPad I realized it's not on it.
But it's not on Android either.
https://www.audacityteam.org/download/

The app called Audacity on Android doesn't seem to be the "real" one.
https://play.google.com/store/search?q=audacity&c=apps

> The namespace for audio is a bitch.

Can you clarify what "namespace" means in this context?

> An "improved" sound recorder, is one for which
> you don't have to "define a fixed size audio file" before
> you begin recording.

Absolutely. I can't imagine defining the size ahead of time.
What program requires that? I've never run into it myself.

> Some of the Windows demo apps have
> been absurd that way. The damn thing should just record,
> and then you do the Save As, and done. Check that it works
> that way.

Yes. Agree. If it requires me to guess, I don't want to use it.
--
Removing Android & replacing with Firefox as Audacity isn't there.

Re: Convert book pdf to an audio podcast to send as an email attachment

<u69qne$3jmg7$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=169&group=comp.text.pdf#169

  copy link   Newsgroups: alt.comp.os.windows-10 comp.text.pdf alt.comp.software.firefox
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: HapilyRe...@fakeaddress.com (Retirednoguilt)
Newsgroups: alt.comp.os.windows-10,comp.text.pdf,alt.comp.software.firefox
Subject: Re: Convert book pdf to an audio podcast to send as an email
attachment
Date: Tue, 13 Jun 2023 09:25:00 -0400
Organization: A noiseless patient Spider
Lines: 107
Message-ID: <u69qne$3jmg7$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me> <u67lce$367o7$1@dont-email.me>
<u68ni2$3fr4i$1@dont-email.me>
Reply-To: HapilyRetired@fakeaddress.com
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Tue, 13 Jun 2023 13:25:06 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="9e209cb552f864e605add9cf36f47c62";
logging-data="3791367"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+URcX4cyLDQKkOnH+757+COcniaQuR/Ks="
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.12.0
Cancel-Lock: sha1:k3zlAJrkmHeAoVJ9RnyPqY+KEYY=
Content-Language: en-US
In-Reply-To: <u68ni2$3fr4i$1@dont-email.me>
 by: Retirednoguilt - Tue, 13 Jun 2023 13:25 UTC

On 6/12/2023 11:25 PM, Peter wrote:
> Paul <nospam@needed.invalid> wrote:
>> You will need:
>>
>> 1) Copy of Audacity or a similar audio recorder.
>
> Absolutely no problem. I have Audacity on both Android & Windows.
>
>> 2) Male to male audio cable. Connect Line Out to Line In.
>> This step, is to avoid fighting with "What You Hear" setup,
>> which could waste half the afternoon.
>
> I can make one of those.
>
>> 3) Run Firefox.
>
> I can run that.
>
>> 4) In Windows Mixer, set Windows Sounds (beeps and boops) to zero.
>> This is to avoid the Windows Defender toast notification or a
>> You Have Mail ding, from ruining your recording.
>
> Windows Mixer? https://www.google.com/search?q=what+is+windows+mixer
>
> OK. It's explained here for both Windows 10 and Windows 11.
> https://www.asus.com/support/FAQ/1044702/
>>
>> Now, use Firefox Reader View, open your text file.
>>
>> [Picture]
>>
>> https://i.postimg.cc/DzhXyywq/firefox-reader-view-TTS.gif
>
> Firefox reader view?
> https://www.google.com/search?q=what+is+firefox+reader+view
>
> OK. It's explained here.
> https://support.mozilla.org/en-US/kb/firefox-reader-view-clutter-free-web-page
> "Reader View is a Firefox feature that strips away clutter like buttons,
> ads, background images, and videos while also letting you customize the
> layout and theme to fit your reading preferences."
>
>> After listening to a half dozen other TTS implementations
>> that suck, that one is almost bearable.
>
> Whatever Windows sound plays when I am in MerriamWebster seems fine to me.
> https://www.merriam-webster.com/dictionary/otorhinolaryngologist
>
>> With some programming, as Newyana2 describes, you can do better
>> than that, but then it's a real long project. This method
>> may not be efficient, but you also don't have to be a programmer.
>
> I don't want to be a programmer since I can already convert most "things"
> to text so that's why I wanted to turn text into something she can play.
>
>> The Firefox page should auto-scroll, as it plays the text. I do not
>> know how long a web page (or text file) can be, and whether your attempt to
>> convert War & Peace to sound, is going to work.
>
> Most things are about twenty pages as I've been converting them to text
> already for her where she selects the text inside an email and plays it.
>
>> I may have recorded for 16 hours before with Audacity, but that
>> was on the other, smaller machine. Make sure all the steps of
>> recording and saving are working on Audacity, before your run.
>> You would not want to generate all those tiny sound files,
>> and have it croak before saving.
>
> I'm familiar with audacity. It can output MP3 which I assume is the
> standard file format for all computers as it is for my phone & PC.
>
>> I think Windows also has an "improved" sound recorder, which
>> you might want to research. That's if you can't get Audacity
>> to work (which... happens sometimes).
>
> I can handle audacity. It's on both Android & Windows.
> If it's on the iPad, maybe she can play the file in iOS Audacity?
>
> When I looked up Audacity on the iPad I realized it's not on it.
> But it's not on Android either.
> https://www.audacityteam.org/download/
>
> The app called Audacity on Android doesn't seem to be the "real" one.
> https://play.google.com/store/search?q=audacity&c=apps
>
>> The namespace for audio is a bitch.
>
> Can you clarify what "namespace" means in this context?
>
>> An "improved" sound recorder, is one for which
>> you don't have to "define a fixed size audio file" before
>> you begin recording.
>
> Absolutely. I can't imagine defining the size ahead of time.
> What program requires that? I've never run into it myself.
>
>> Some of the Windows demo apps have
>> been absurd that way. The damn thing should just record,
>> and then you do the Save As, and done. Check that it works
>> that way.
>

I haven't tried any of them, but I did a web search for "convert pdf to
mp3" and there are a whole bunch of free on line converters listed.
Have you explored that option?
> Yes. Agree. If it requires me to guess, I don't want to use it.

Re: Convert book pdf to an audio podcast to send as an email attachment

<u6f8rg$eh3c$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=171&group=comp.text.pdf#171

  copy link   Newsgroups: alt.comp.os.windows-10 comp.text.pdf alt.comp.software.firefox
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: occassio...@nospam.co.uk (Peter)
Newsgroups: alt.comp.os.windows-10,comp.text.pdf,alt.comp.software.firefox
Subject: Re: Convert book pdf to an audio podcast to send as an email attachment
Date: Thu, 15 Jun 2023 15:57:20 +0100
Organization: -
Lines: 112
Message-ID: <u6f8rg$eh3c$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me> <u67lce$367o7$1@dont-email.me> <u68ni2$3fr4i$1@dont-email.me> <u69qne$3jmg7$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Injection-Date: Thu, 15 Jun 2023 14:56:48 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="6d90d765b4cc8ef325624d7bbb4d7fb0";
logging-data="476268"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18phIiyiiOccUpYgrMYhRG6"
Cancel-Lock: sha1:kj1ypHX8E5xRpTXATM+uqFxodh8=
X-No-Archive: yes
X-Newsreader: Forte Agent 3.3/32.846
 by: Peter - Thu, 15 Jun 2023 14:57 UTC

In article <news:u69qne$3jmg7$1@dont-email.me>, Retirednoguilt
<HapilyRetired@fakeaddress.com> says...

>>> You will need:
>>>
>>> 1) Copy of Audacity or a similar audio recorder.
>>
>> Absolutely no problem. I have Audacity on both Android & Windows.
>>
>>> 2) Male to male audio cable. Connect Line Out to Line In.
>>> This step, is to avoid fighting with "What You Hear" setup,
>>> which could waste half the afternoon.
>>
>> I can make one of those.
>>
>>> 3) Run Firefox.
>>
>> I can run that.
>>
>>> 4) In Windows Mixer, set Windows Sounds (beeps and boops) to zero.
>>> This is to avoid the Windows Defender toast notification or a
>>> You Have Mail ding, from ruining your recording.
>>
>> Windows Mixer? https://www.google.com/search?q=what+is+windows+mixer
>>
>> OK. It's explained here for both Windows 10 and Windows 11.
>> https://www.asus.com/support/FAQ/1044702/
>>>
>>> Now, use Firefox Reader View, open your text file.
>>>
>>> [Picture]
>>>
>>> https://i.postimg.cc/DzhXyywq/firefox-reader-view-TTS.gif
>>
>> Firefox reader view?
>> https://www.google.com/search?q=what+is+firefox+reader+view
>>
>> OK. It's explained here.
>> https://support.mozilla.org/en-US/kb/firefox-reader-view-clutter-free-web-page
>> "Reader View is a Firefox feature that strips away clutter like buttons,
>> ads, background images, and videos while also letting you customize the
>> layout and theme to fit your reading preferences."
>>
>>> After listening to a half dozen other TTS implementations
>>> that suck, that one is almost bearable.
>>
>> Whatever Windows sound plays when I am in MerriamWebster seems fine to me.
>> https://www.merriam-webster.com/dictionary/otorhinolaryngologist
>>
>>> With some programming, as Newyana2 describes, you can do better
>>> than that, but then it's a real long project. This method
>>> may not be efficient, but you also don't have to be a programmer.
>>
>> I don't want to be a programmer since I can already convert most "things"
>> to text so that's why I wanted to turn text into something she can play.
>>
>>> The Firefox page should auto-scroll, as it plays the text. I do not
>>> know how long a web page (or text file) can be, and whether your attempt to
>>> convert War & Peace to sound, is going to work.
>>
>> Most things are about twenty pages as I've been converting them to text
>> already for her where she selects the text inside an email and plays it.
>>
>>> I may have recorded for 16 hours before with Audacity, but that
>>> was on the other, smaller machine. Make sure all the steps of
>>> recording and saving are working on Audacity, before your run.
>>> You would not want to generate all those tiny sound files,
>>> and have it croak before saving.
>>
>> I'm familiar with audacity. It can output MP3 which I assume is the
>> standard file format for all computers as it is for my phone & PC.
>>
>>> I think Windows also has an "improved" sound recorder, which
>>> you might want to research. That's if you can't get Audacity
>>> to work (which... happens sometimes).
>>
>> I can handle audacity. It's on both Android & Windows.
>> If it's on the iPad, maybe she can play the file in iOS Audacity?
>>
>> When I looked up Audacity on the iPad I realized it's not on it.
>> But it's not on Android either.
>> https://www.audacityteam.org/download/
>>
>> The app called Audacity on Android doesn't seem to be the "real" one.
>> https://play.google.com/store/search?q=audacity&c=apps
>>
>>> The namespace for audio is a bitch.
>>
>> Can you clarify what "namespace" means in this context?
>>
>>> An "improved" sound recorder, is one for which
>>> you don't have to "define a fixed size audio file" before
>>> you begin recording.
>>
>> Absolutely. I can't imagine defining the size ahead of time.
>> What program requires that? I've never run into it myself.
>>
>>> Some of the Windows demo apps have
>>> been absurd that way. The damn thing should just record,
>>> and then you do the Save As, and done. Check that it works
>>> that way.
>>
>
> I haven't tried any of them, but I did a web search for "convert pdf to
> mp3" and there are a whole bunch of free on line converters listed.
> Have you explored that option?
>> Yes. Agree. If it requires me to guess, I don't want to use it.

I have tried them in the past but most PDFs don't contain any text.
The text looks like text but try selecting it and you can't.

Is there an electronic OCR for PDFs that can reveal the true text?

Re: Convert book pdf to an audio podcast to send as an email attachment

<u6fdam$f678$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=172&group=comp.text.pdf#172

  copy link   Newsgroups: alt.comp.os.windows-10 comp.mobile.android comp.text.pdf alt.comp.software.firefox
Followup: alt.comp.os.windows-10,comp.text.pdf,alt.comp.software.firefox
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: ithink...@gmail.com (Chris)
Newsgroups: alt.comp.os.windows-10,comp.mobile.android,comp.text.pdf,alt.comp.software.firefox
Subject: Re: Convert book pdf to an audio podcast to send as an email
attachment
Followup-To: alt.comp.os.windows-10,comp.text.pdf,alt.comp.software.firefox
Date: Thu, 15 Jun 2023 16:13:10 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 43
Message-ID: <u6fdam$f678$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me>
<u67kqs$365rm$1@dont-email.me>
<u68ng6$3fr13$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Thu, 15 Jun 2023 16:13:10 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="78ace2f7a498f43bdca60552672e1006";
logging-data="497896"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/o8nWT6wnrdLdo/uJ1IUuDyj+6NwSYYuk="
User-Agent: NewsTap/5.5 (iPhone/iPod Touch)
Cancel-Lock: sha1:bcsr7HP07tBI9meVHz5ArblfpVw=
sha1:0Qx9G+lSzLDaqM8B5liWx7gmTdA=
 by: Chris - Thu, 15 Jun 2023 16:13 UTC

Peter <occassionally-confused@nospam.co.uk> wrote:
> Chris <ithinkiam@gmail.com> wrote:
>>> But how can I turn the text into a "podcast" for her to listen to?
>>> And would it be too large to email to her in a typical email message?
>>
>> iOS has very good accessibility features. I suspect it already can read any
>> pdf you provide.
>
> Thanks for that suggestion where the main goal is one way for everything.
> She will get super confused if there isn't one way for everything.

Don't think that's possible. There will always be exceptions, but I guess
you/her need to decide which option has the fewest exceptions.

> Not one way for this kind of PDF. Another for that kind of PDF.
> And yet another for text. And another for HTML. And so on.
>
> It has to be ONE way for EVERYTHING, where there's no way any program, no
> matter how smart it is, can read out ALL PDFs (since many are bitmaps).

Unless it's very old pdfs the usual is certainly not bitmap pdfs

> For the bitmaps, I was thinking of looking up OCR or, if it's short, just
> reading it aloud to myself and recording it on my phone's voice recorder.

How is that "one way for everything"? A screen reader would be far more
consistent and quicker for you.

>> https://support.apple.com/en-gb/guide/iphone/iph3e2e415f/ios
>
> Thanks for that suggestion since she only has iOS iPhones & iPads.
> And I don't.
>
> I looked at that page which seems to be much like what Windows does.
> Which means it will drive any normal person nuts with extraneous talking.

People with visual impairment need that extraneous talking.

> It has to be an audio of the text only and nothing else for it to work.
> The firefox text-only speaking suggestion from Paul is along those lines.

Re: Convert book pdf to an audio podcast to send as an email attachment

<u6fdmc$f835$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=173&group=comp.text.pdf#173

  copy link   Newsgroups: alt.comp.os.windows-10 comp.text.pdf alt.comp.software.firefox
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: occassio...@nospam.co.uk (Peter)
Newsgroups: alt.comp.os.windows-10,comp.text.pdf,alt.comp.software.firefox
Subject: Re: Convert book pdf to an audio podcast to send as an email attachment
Date: Thu, 15 Jun 2023 17:19:56 +0100
Organization: -
Lines: 8
Message-ID: <u6fdmc$f835$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me> <u67kqs$365rm$1@dont-email.me> <u68ng6$3fr13$1@dont-email.me> <u6fdam$f678$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Injection-Date: Thu, 15 Jun 2023 16:19:24 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="6d90d765b4cc8ef325624d7bbb4d7fb0";
logging-data="499813"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+EIkxMRC3h+71Cu8OTJtU8"
Cancel-Lock: sha1:3vGFAkLeQBahpwZxzwX5V3sJiPc=
X-No-Archive: yes
X-Newsreader: Forte Agent 3.3/32.846
 by: Peter - Thu, 15 Jun 2023 16:19 UTC

Chris <ithinkiam@gmail.com> wrote:
>> It has to be ONE way for EVERYTHING, where there's no way any program, no
>> matter how smart it is, can read out ALL PDFs (since many are bitmaps).
>
> Unless it's very old pdfs the usual is certainly not bitmap pdfs

There is probably a program to automatically OCR a file that is really an
image of the text. It's not really a PDF. It's more like an image of a PDF.

Re: Convert book pdf to an audio podcast to send as an email attachment

<u6fei4$fab4$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=174&group=comp.text.pdf#174

  copy link   Newsgroups: alt.comp.os.windows-10 comp.text.pdf alt.comp.software.firefox
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: keith_nu...@yahoo.com (knuttle)
Newsgroups: alt.comp.os.windows-10,comp.text.pdf,alt.comp.software.firefox
Subject: Re: Convert book pdf to an audio podcast to send as an email
attachment
Date: Thu, 15 Jun 2023 12:34:11 -0400
Organization: A noiseless patient Spider
Lines: 11
Message-ID: <u6fei4$fab4$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me> <u67kqs$365rm$1@dont-email.me>
<u68ng6$3fr13$1@dont-email.me> <u6fdam$f678$1@dont-email.me>
<u6fdmc$f835$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Thu, 15 Jun 2023 16:34:12 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="ddb16b1c116fc47f637a56ef3f84ff77";
logging-data="502116"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+mZmgkep87JubdasW22E2V"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:102.0) Gecko/20100101
Thunderbird/102.12.0
Cancel-Lock: sha1:suXJPkJfK5uH7MIeadkx4x7nHBY=
In-Reply-To: <u6fdmc$f835$1@dont-email.me>
Content-Language: en-US
 by: knuttle - Thu, 15 Jun 2023 16:34 UTC

On 06/15/2023 12:19 PM, Peter wrote:
> Chris <ithinkiam@gmail.com> wrote:
>>> It has to be ONE way for EVERYTHING, where there's no way any program, no
>>> matter how smart it is, can read out ALL PDFs (since many are bitmaps).
>>
>> Unless it's very old pdfs the usual is certainly not bitmap pdfs
>
> There is probably a program to automatically OCR a file that is really an
> image of the text. It's not really a PDF. It's more like an image of a PDF.
A quick Google, seems to indicate that OCR is part of the professional
reader, ot a paid subscription

Re: Convert book pdf to an audio podcast to send as an email attachment

<u6fkp4$g534$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=177&group=comp.text.pdf#177

  copy link   Newsgroups: alt.comp.os.windows-10 comp.text.pdf alt.comp.software.firefox
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: nos...@needed.invalid (Paul)
Newsgroups: alt.comp.os.windows-10,comp.text.pdf,alt.comp.software.firefox
Subject: Re: Convert book pdf to an audio podcast to send as an email
attachment
Date: Thu, 15 Jun 2023 14:20:18 -0400
Organization: A noiseless patient Spider
Lines: 104
Message-ID: <u6fkp4$g534$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me> <u67kqs$365rm$1@dont-email.me>
<u68ng6$3fr13$1@dont-email.me> <u6fdam$f678$1@dont-email.me>
<u6fdmc$f835$1@dont-email.me> <u6fei4$fab4$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Thu, 15 Jun 2023 18:20:21 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="db128ef98a0e91225aca9e4f43aea9ba";
logging-data="529508"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+DIdcGZsqfQ/19al7D2AJXiFEsxRfyO3g="
User-Agent: Ratcatcher/2.0.0.25 (Windows/20130802)
Cancel-Lock: sha1:jGM4gXa3cVm5v7IycwwWsJ/zlW0=
Content-Language: en-US
In-Reply-To: <u6fei4$fab4$1@dont-email.me>
 by: Paul - Thu, 15 Jun 2023 18:20 UTC

On 6/15/2023 12:34 PM, knuttle wrote:
> On 06/15/2023 12:19 PM, Peter wrote:
>> Chris <ithinkiam@gmail.com> wrote:
>>>> It has to be ONE way for EVERYTHING, where there's no way any program, no
>>>> matter how smart it is, can read out ALL PDFs (since many are bitmaps).
>>>
>>> Unless it's very old pdfs the usual is certainly not bitmap pdfs
>>
>> There is probably a program to automatically OCR a file that is really an
>> image of the text. It's not really a PDF. It's more like an image of a PDF.
>
> A quick Google, seems to indicate that OCR is part of the professional reader, ot a paid subscription

A PDF can be OCR-processed and have a "character overlay" done
by the OCR software. You can for example, wipe over the
new overlay with a mouse, then copy and paste the text, elsewhere.
This could also cause a font to be defined in the PDF, which will
show in the Properties of the document, where previously the
document had absolutely no fonts referenced.

The other kind of OCR, would be runtime OCR of some sort.

Bitmap-only PDFs come from cheap scanners. A scanner with
more software capability, can do the text overlay thing. An Adobe PDF
product can do OCR. My Acrobat Reader 4 or so (ancient), had OCR, but
the resolution of the bitmap PDF had to be between 200DPI and 400 DPI
or so. In fact, it did not like higher resolution scans to be offered
to it, which was a major nuisance.

Linux has Tesseract, and that would be an example of a keyword to use
when researching various PDF softwares there.

*******

You can deconstruct a PDF various ways.

mutool.exe extract example.pdf

would draw out the bitmaps for you, and save as image files.

That's not particularly useful as some sort of workflow.
It's mainly for the user to have a better understanding
of the PDF type in front of them.

In this example, the PDF file has no fonts. The document is
ten pages. There happen to be ten output images.

[Picture]

https://i.postimg.cc/WbY3MkBR/bitmap-pdf-sample.gif

Now, I process it:

D:\>mutool extract sony_srs-t1_t1pc_sm.pdf
extracting image-0001.png
extracting image-0014.png
extracting image-0019.png
extracting image-0024.png
extracting image-0029.png
extracting image-0034.png
extracting image-0039.png
extracting image-0044.png
extracting image-0049.png
extracting image-0054.png

Properties show:

image-0044.png (document page 8)

dimensions 6600 x 4529 17"x11" est. 388 dpi
8 bit depth (likely grayscale in this case)
421 KB (some decent compression)

On Ubuntu, using the keyword, I can find:

ocrmypdf - add an OCR text layer to PDF files

We give it a try.

ocrmypdf -l eng --image-dpi 400 --output-type pdf image-0044.png image-0044.pdf
INFO - Input file is not a PDF, checking if it is an image...
INFO - Input file is an image
INFO - Image seems valid. Try converting to PDF...
INFO - Successfully converted to PDF, processing...
Scan: 100% 1/1 [00:00<00:00, 625.83page/s]
INFO - Using Tesseract OpenMP thread limit 3
OCR: 100% 1.0/1.0 [00:07<00:00, 7.01s/page]
JPEGs: 0image [00:00, ?image/s]
JBIG2: 0item [00:00, ?item/s]
INFO - Optimize ratio: 1.00 savings: 0.0%

Wiping over the output PDF file image-0044.pdf reveals text is there.

[Picture]

https://i.postimg.cc/76gvZFkP/bitmap-pdf-output.gif

The OCR step in this case, was done in bash shell in Win11.
So I didn't even need a VM like VBox to do it.

Paul

Re: Convert book pdf to an audio podcast to send as an email attachment

<u6fldi$g7jg$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=178&group=comp.text.pdf#178

  copy link   Newsgroups: alt.comp.os.windows-10 comp.text.pdf alt.comp.software.firefox
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Mik...@fred.com (MikeS)
Newsgroups: alt.comp.os.windows-10,comp.text.pdf,alt.comp.software.firefox
Subject: Re: Convert book pdf to an audio podcast to send as an email
attachment
Date: Thu, 15 Jun 2023 19:31:13 +0100
Organization: A noiseless patient Spider
Lines: 14
Message-ID: <u6fldi$g7jg$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me> <u67kqs$365rm$1@dont-email.me>
<u68ng6$3fr13$1@dont-email.me> <u6fdam$f678$1@dont-email.me>
<u6fdmc$f835$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Thu, 15 Jun 2023 18:31:14 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="fe2877762d9e10a95d2864957c5b90d8";
logging-data="532080"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19gGLYI9FTqdPB2qpDWW9J9"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101
Thunderbird/78.14.0
Cancel-Lock: sha1:YiK4LKo0R25CFc4ugu30An7tfGg=
Content-Language: en-GB
In-Reply-To: <u6fdmc$f835$1@dont-email.me>
 by: MikeS - Thu, 15 Jun 2023 18:31 UTC

On 15/06/2023 17:19, Peter wrote:
> Chris <ithinkiam@gmail.com> wrote:
>>> It has to be ONE way for EVERYTHING, where there's no way any program, no
>>> matter how smart it is, can read out ALL PDFs (since many are bitmaps).
>>
>> Unless it's very old pdfs the usual is certainly not bitmap pdfs
>
> There is probably a program to automatically OCR a file that is really an
> image of the text. It's not really a PDF. It's more like an image of a PDF.
>
Actually its a pdf of an image (usually a scan of a printed page). There
are hundreds of programs to OCR text images in pdf or image files, many
of them freeware. Many pdf readers include an OCR function - I use
PDF-XChange Viewer.

Re: Convert book pdf to an audio podcast to send as an email attachment

<u6flon$g8qv$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=179&group=comp.text.pdf#179

  copy link   Newsgroups: alt.comp.os.windows-10 comp.text.pdf alt.comp.software.firefox
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: nos...@needed.invalid (Paul)
Newsgroups: alt.comp.os.windows-10,comp.text.pdf,alt.comp.software.firefox
Subject: Re: Convert book pdf to an audio podcast to send as an email
attachment
Date: Thu, 15 Jun 2023 14:37:09 -0400
Organization: A noiseless patient Spider
Lines: 60
Message-ID: <u6flon$g8qv$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me> <u67lce$367o7$1@dont-email.me>
<u68ni2$3fr4i$1@dont-email.me> <u69qne$3jmg7$1@dont-email.me>
<u6f8rg$eh3c$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Thu, 15 Jun 2023 18:37:12 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="db128ef98a0e91225aca9e4f43aea9ba";
logging-data="533343"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+7LeKc/vvojRvDfVhjbp5DatpymdqBnLk="
User-Agent: Ratcatcher/2.0.0.25 (Windows/20130802)
Cancel-Lock: sha1:sTVehoiUM1nTPeipZ4mhqO/fIEI=
Content-Language: en-US
In-Reply-To: <u6f8rg$eh3c$1@dont-email.me>
 by: Paul - Thu, 15 Jun 2023 18:37 UTC

On 6/15/2023 10:57 AM, Peter wrote:
> In article <news:u69qne$3jmg7$1@dont-email.me>, Retirednoguilt
> <HapilyRetired@fakeaddress.com> says...
>
>> I haven't tried any of them, but I did a web search for "convert pdf to
>> mp3" and there are a whole bunch of free on line converters listed.
>> Have you explored that option?
>>> Yes. Agree. If it requires me to guess, I don't want to use it.
>
> I have tried them in the past but most PDFs don't contain any text.
> The text looks like text but try selecting it and you can't.
>
> Is there an electronic OCR for PDFs that can reveal the true text?
>

In Win10, you can run Bash shell and use "ocrmypdf", a Ubuntu package.

It uses Tesseract. Tesseract is a free OCR, quite old now. The
latest innovation, is it has a new module trained on a neural net
or the like, which offers a second conversion process, and potentially
improves the quality.

In the past, 0 and O were easily confused, and that's one
of the simpler tests of whether an OCR is any good or not.

One Windows OCR, used three converters, and used majority voter
logic and grammar checks, in an attempt to get closer to 99%
accuracy. Such accuracy still isn't really good enough. You would
not predict Tesseract would be at that level.

OCRs work best, if "font training" was applied, you get lucky,
and the document font is one that it was trained on.

OCR is generally an annoying topic. Don't drink too much coffee
and then OCR, or you could punch the LCD monitor on the computer :-)
I gave up long ago, on my Windows OCR options (I have two $99 packages
and the Adobe one, as representations of my frustration). The OCR that
comes with some scanner-to-PDF products now, is probably better
than anything I have in the room here.

OCR really excels on closed glyphs. If you had a film negative with
characters drawn on it, and the thing had "never seen a printing press",
the accuracy would be 100% (the grain size on film is tiny compared
to printing presses and paper). But real samples, and real letters, are
pock-marked along the edges of the characters, and this is what
drives the OCR nuts. It's getting 99% accuracy on "noisy" samples.

Newspaper text is done with little dots of ink. The edges of letters are
porous and not particularly suited to OCR. Again, you would not
expect even the highest resolution photo of newsprint, to yield
good text extraction.

For a person needing vision assistance, this is their world,
a world filled with noise that sighted people don't have to
deal with. Your diet is filled with 0's and O's, spoken
by robots.

And this is what inspires some people to write software. To fix that.

Paul

Re: Convert book pdf to an audio podcast to send as an email attachment

<u6gir2$ngdo$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=183&group=comp.text.pdf#183

  copy link   Newsgroups: alt.comp.os.windows-10 comp.text.pdf alt.comp.software.firefox rec.photo.digital
Followup: alt.comp.os.windows-10,comp.text.pdf,rec.photo.digital
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: occassio...@nospam.co.uk (Peter)
Newsgroups: alt.comp.os.windows-10,comp.text.pdf,alt.comp.software.firefox,rec.photo.digital
Subject: Re: Convert book pdf to an audio podcast to send as an email attachment
Followup-To: alt.comp.os.windows-10,comp.text.pdf,rec.photo.digital
Date: Fri, 16 Jun 2023 03:53:54 +0100
Organization: -
Lines: 66
Message-ID: <u6gir2$ngdo$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me> <u67kqs$365rm$1@dont-email.me> <u68ng6$3fr13$1@dont-email.me> <u6fdam$f678$1@dont-email.me> <u6fdmc$f835$1@dont-email.me> <u6fei4$fab4$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Injection-Date: Fri, 16 Jun 2023 02:53:22 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="138b9b70fefa1097b4278351920597b6";
logging-data="770488"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19m61OqP2DuR5u/NHsTxfLr"
Cancel-Lock: sha1:GFpFnMjyyPixGf8A67ZFQYrGREc=
X-No-Archive: yes
X-Newsreader: Forte Agent 3.3/32.846
 by: Peter - Fri, 16 Jun 2023 02:53 UTC

knuttle <keith_nuttle@yahoo.com> wrote
> A quick Google, seems to indicate that OCR is part of the professional
> reader, ot a paid subscription

I have Adobe Acrobat 6, which is a decade or more old but I tried that
after seeing your suggestion about the "professional reader" (which doesn't
really make sense to me as the professional is a writer and not a reader).

Adobe Acrobat 6 Standard has a line item menu for
"Document > Paper Capture > Start Capture" and then a choice of
All pages, Current page, and From Page x to y

The default "Paper Capture Settings" language is
"Primary OCR Language = English"
(although you can set any language in their long list).

The default "Paper Capture Settings" "PDF Output Style" is set to
"Searchable Image (Exact)"
But there were options of "Searchable Image (Compact)" & an option for
"Formatted Text and Graphics".

The default "Paper Capture Settings" "Downsample Images" dpi is set to
"None" but there were options for "Low (300 dpi)", "Medium (150 dpi)",
and "High (72 dpi)".
I thought I was heading toward the finish line when I tried it on a sample
full text page of a 200 page document whose text I couldn't select - but
the professional Adobe Acrobat 6 Paper Capture feature erred out saying
"Acrobat could not run Paper Capture on the page because
of the following error: This page contains renderable text."

I thought I'd be sneaky and run the Adobe Acrobat Professional feature of
"File > Reduce File Size > Compatible with Acrobat 4.0 and later"
But it erred out in trying to convert the PDF to Acrobat 4 saying
"This PDF cannot be made compatible with Acrobat 4.0 because it uses
transparency.", however it did convert to Acrobat 5.0 and 6.0 versions.

Unfortunately I wasn't sneaky enough as it still complained that
"This page contains renderable text."

Only did it slowly dawn on me that if the file contains "renderable text",
then maybe I could just save the entire file to plain text, which worked.

Which means there must be at least 3 kinds of PDF text documents at least
Real PDF (with selectable text which we can presume can be OCR'd)
Renderable text (without selectable text which can't be OCR'd apparently)
Bitmap text (without selectable text which we can presume can be OCR'd)

Being confused how to tell which is which, I dug up a PDF where I could
select the text, and THAT was able to be OCR'd by the professional Acrobat.
Paper Capture
"Rasterizing page and sending to Paper Capture..."
Performing page recognition
Converting to indexed color
Thresholding image
Deskewing image
Finding rules and frames
Reading characters
Forming words
Grouping characters and words
Writing ACP file
Do you want to save the changes to "file.pdf" before closing?

But where's the OCR results?
--
fup set to replace firefox with r.p.d instead.

Re: Convert book pdf to an audio podcast to send as an email attachment

<u6hf4o$qc6g$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=186&group=comp.text.pdf#186

  copy link   Newsgroups: alt.comp.os.windows-10 comp.text.pdf rec.photo.digital
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: nos...@needed.invalid (Paul)
Newsgroups: alt.comp.os.windows-10,comp.text.pdf,rec.photo.digital
Subject: Re: Convert book pdf to an audio podcast to send as an email
attachment
Date: Fri, 16 Jun 2023 06:56:23 -0400
Organization: A noiseless patient Spider
Lines: 46
Message-ID: <u6hf4o$qc6g$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me> <u67kqs$365rm$1@dont-email.me>
<u68ng6$3fr13$1@dont-email.me> <u6fdam$f678$1@dont-email.me>
<u6fdmc$f835$1@dont-email.me> <u6fei4$fab4$1@dont-email.me>
<u6gir2$ngdo$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Fri, 16 Jun 2023 10:56:25 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="e9e635255937d9c5b290d92db16de619";
logging-data="864464"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19aV0i8RHaSlulMHXis0Ok1vgOF/7QJVeY="
User-Agent: Ratcatcher/2.0.0.25 (Windows/20130802)
Cancel-Lock: sha1:GlOkAfSx85w0Z4IbL6TZokn553E=
Content-Language: en-US
In-Reply-To: <u6gir2$ngdo$1@dont-email.me>
 by: Paul - Fri, 16 Jun 2023 10:56 UTC

On 6/15/2023 10:53 PM, Peter wrote:

> Adobe Acrobat Professional

> Paper Capture
> "Rasterizing page and sending to Paper Capture..."
> Performing page recognition
> Converting to indexed color
> Thresholding image
> Deskewing image
> Finding rules and frames
> Reading characters
> Forming words
> Grouping characters and words
> Writing ACP file
> Do you want to save the changes to "file.pdf" before closing?
>
> But where's the OCR results?
>
The text precisely overlays the scanned characters underneath.

If any of the bitmap below the letter "sticks out", that means
an OCR error has been made.

In Acrobat Free Reader, the File : Properties : Fonts,
is your friend. The richness and type of fonts listed
in there, tells you the kind of document you're dealing with.

The Sony service manual, which is just a scan of some paper and
stored in a PDF, the fonts list for that is: blank.

Finding no fonts, means you need to run OCR.

When I run the tool that uses Tesseract, well, Tesseract does
not add the PDF text, the thing running Tesseract on your
behalf, adds overlay text on the bitmap images. The Font
properties after it is finished ? Just one font is listed,
and it is a magical font which does not render in the reader.
You can only detect it is present, by doing a "select all",
and then a copy/paste.

When a document has a dozen fonts listed, then that has "rendered text"
in it, and you're not allowed by the tools, to be adding OCR overlay text
to a document that already has "rendered text".

Paul

Re: Convert book pdf to an audio podcast to send as an email attachment

<DkC*xiXiz@news.chiark.greenend.org.uk>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=187&group=comp.text.pdf#187

  copy link   Newsgroups: alt.comp.os.windows-10 comp.mobile.android comp.text.pdf
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!news.szaf.org!nntp-feed.chiark.greenend.org.uk!ewrotcd!.POSTED.chiark.greenend.org.uk!not-for-mail
From: theom+n...@chiark.greenend.org.uk (Theo)
Newsgroups: alt.comp.os.windows-10,comp.mobile.android,comp.text.pdf
Subject: Re: Convert book pdf to an audio podcast to send as an email attachment
Date: 16 Jun 2023 12:32:45 +0100 (BST)
Organization: University of Cambridge, England
Message-ID: <DkC*xiXiz@news.chiark.greenend.org.uk>
References: <u675kg$34cnq$1@dont-email.me>
Injection-Info: chiark.greenend.org.uk; posting-host="chiark.greenend.org.uk:212.13.197.229";
logging-data="11550"; mail-complaints-to="abuse@chiark.greenend.org.uk"
User-Agent: tin/1.8.3-20070201 ("Scotasay") (UNIX) (Linux/5.10.0-22-amd64 (x86_64))
Originator: theom@chiark.greenend.org.uk ([212.13.197.229])
 by: Theo - Fri, 16 Jun 2023 11:32 UTC

In comp.mobile.android Peter <occassionally-confused@nospam.co.uk> wrote:
> But how can I turn the text into a "podcast" for her to listen to?
> And would it be too large to email to her in a typical email message?

AWS has a text to speech service called 'Amazon Polly'. There's a site for
testing it here:
https://ai-service-demos.go-aws.com/polly

You send Polly text and it generates you a URL with the speech, which you
could perhaps send to your correspondent.

AWS requires an account and payment over a certain number of characters.

Polly is an API rather than an app that you can use, but maybe somebody has
written a frontend for it that makes it easier to upload text (the demo is a
bit limited) and either pay them or use your own AWS account.

Theo

Re: Convert book pdf to an audio podcast to send as an email attachment

<u6i3ab$sisk$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=189&group=comp.text.pdf#189

  copy link   Newsgroups: comp.mobile.android alt.comp.os.windows-10 comp.text.pdf
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: ithink...@gmail.com (Chris)
Newsgroups: comp.mobile.android,alt.comp.os.windows-10,comp.text.pdf
Subject: Re: Convert book pdf to an audio podcast to send as an email
attachment
Date: Fri, 16 Jun 2023 16:40:43 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 22
Message-ID: <u6i3ab$sisk$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me>
<DkC*xiXiz@news.chiark.greenend.org.uk>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Fri, 16 Jun 2023 16:40:43 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="59badebb87e6ec4aedda1eafb59e8210";
logging-data="936852"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19/e8hv1JZiQINSOBa5PhcjBkhvsicexL4="
User-Agent: NewsTap/5.5 (iPhone/iPod Touch)
Cancel-Lock: sha1:4DD6lLYsUPngUJVsEO/nTjLHiNk=
sha1:mEB/ICrKJTc2cCbqf5YpSDObmuU=
 by: Chris - Fri, 16 Jun 2023 16:40 UTC

Theo <theom+news@chiark.greenend.org.uk> wrote:
> In comp.mobile.android Peter <occassionally-confused@nospam.co.uk> wrote:
>> But how can I turn the text into a "podcast" for her to listen to?
>> And would it be too large to email to her in a typical email message?
>
> AWS has a text to speech service called 'Amazon Polly'. There's a site for
> testing it here:
> https://ai-service-demos.go-aws.com/polly
>
> You send Polly text and it generates you a URL with the speech, which you
> could perhaps send to your correspondent.
>
> AWS requires an account and payment over a certain number of characters.
>
> Polly is an API rather than an app that you can use, but maybe somebody has
> written a frontend for it that makes it easier to upload text (the demo is a
> bit limited) and either pay them or use your own AWS account.

This thread has gone from the ridiculous to the sublime. All because the OP
doesn't want to use the in-built iOS functions due to his unfamiliarity and
perhaps unconscious ableism.

Re: Convert book pdf to an audio podcast to send as an email attachment

<u6ph8q$22kd4$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=195&group=comp.text.pdf#195

  copy link   Newsgroups: alt.comp.os.windows-10 comp.mobile.android comp.text.pdf
Followup: alt.comp.os.windows-10
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: nos...@nospam.org (milsabords)
Newsgroups: alt.comp.os.windows-10,comp.mobile.android,comp.text.pdf
Subject: Re: Convert book pdf to an audio podcast to send as an email attachment
Followup-To: alt.comp.os.windows-10
Date: Mon, 19 Jun 2023 14:21:45 +0200
Organization: A noiseless patient Spider
Lines: 15
Message-ID: <u6ph8q$22kd4$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset="iso-8859-15"; format=flowed
Content-Transfer-Encoding: 8bit
Injection-Date: Mon, 19 Jun 2023 12:21:47 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="2bff302f80e5bbe6155c772efc337c27";
logging-data="2183588"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19EJO0l2w2vqlf5SlEqsvTK3vzSmDtOQqU="
Cancel-Lock: sha1:QKRFW7HWV4y7QFQgzUVa+E0e95c=
X-Newsreader: MesNews/1.08.06.00-fr
 by: milsabords - Mon, 19 Jun 2023 12:21 UTC

Peter a exprimé avec précision :
> I want to help an older person with severe macular degeneration from
> thousands of miles afar where she could use an AUDIO file of long text
> (like of a book or short story, normally, as in Great Books stories).

> I'm on Windows 10 and Android 12. She's on the iPhone and iPads.

> I can easily save a web page (like a news story) to text and sometimes I
> can convert a PDF of a book to text (depending on how the PDF was made).

> But how can I turn the text into a "podcast" for her to listen to?
> And would it be too large to email to her in a typical email message?

Try Balabolka.
http://www.cross-plus-a.com/balabolka.htm

Re: Convert book pdf to an audio podcast to send as an email attachment

<u7a4g3$ire7$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=204&group=comp.text.pdf#204

  copy link   Newsgroups: alt.comp.os.windows-10 comp.text.pdf alt.comp.software.firefox
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: kel...@privacy.invalid (kelown)
Newsgroups: alt.comp.os.windows-10,comp.text.pdf,alt.comp.software.firefox
Subject: Re: Convert book pdf to an audio podcast to send as an email
attachment
Date: Sun, 25 Jun 2023 14:28:01 -0500
Organization: A noiseless patient Spider
Lines: 9
Message-ID: <u7a4g3$ire7$1@dont-email.me>
References: <u675kg$34cnq$1@dont-email.me> <u67kqs$365rm$1@dont-email.me>
<u68ng6$3fr13$1@dont-email.me> <u6fdam$f678$1@dont-email.me>
<u6fdmc$f835$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Sun, 25 Jun 2023 19:28:03 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="60cae3afccdf76fa3f8de7262ac3a038";
logging-data="617927"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/xUYR4vPQIBAzuantToLYP"
User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:5.0) Aura/20220509
Interlink/52.9.8165
Cancel-Lock: sha1:xoh4JA8ju2yR8uoD9091wYB4NcY=
Content-Language: en-US
In-Reply-To: <u6fdmc$f835$1@dont-email.me>
 by: kelown - Sun, 25 Jun 2023 19:28 UTC

> ...a program to automatically OCR a file that is really an
> image of the text.

PDF-XChange Editor Portable (free)
https://portableapps.com/apps/office/pdf-xchange-editor-portable

Convert -> OCR Page(s)-> All

1
server_pubkey.txt

rocksolid light 0.9.81
clearnet tor