Rocksolid Light

Welcome to novaBBS (click a section below)

mail  files  register  newsreader  groups  login

Message-ID:  

You can't go home again, unless you set $HOME.


tech / sci.lang / Right-to-left mark, ‎

SubjectAuthor
* Right-to-left mark, ‎Ruud Harmsen
`* Re: Right-to-left mark, ‎Peter T. Daniels
 `* Re: Right-to-left mark, ‎Ruud Harmsen
  +* Re: Right-to-left mark, ‎Helmut Richter
  |`* Re: Right-to-left mark, ‎Ruud Harmsen via Google Groups
  | `* Re: Right-to-left mark, ‎Ruud Harmsen
  |  `- Re: Right-to-left mark, ‎Ruud Harmsen via Google Groups
  `* Re: Right-to-left mark, ‎Peter T. Daniels
   `* Re: Right-to-left mark, ‎Ruud Harmsen via Google Groups
    `* Re: Right-to-left mark, ‎Peter T. Daniels
     `- Re: Right-to-left mark, ‎Ruud Harmsen

1
Right-to-left mark, ‎

<3fbe2ihr5t85dvr57o7but7d6apgpsdesf@4ax.com>

  copy mid

https://www.novabbs.com/tech/article-flat.php?id=16019&group=sci.lang#16019

  copy link   Newsgroups: sci.lang
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder.eternal-september.org!.POSTED!not-for-mail
From: rh...@rudhar.com (Ruud Harmsen)
Newsgroups: sci.lang
Subject: Right-to-left mark, &lrm;
Date: Fri, 31 Mar 2023 21:04:41 +0200
Organization: A noiseless patient Spider
Lines: 8
Message-ID: <3fbe2ihr5t85dvr57o7but7d6apgpsdesf@4ax.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Injection-Info: dont-email.me; posting-host="fd40fa370ba1c332d53b761192ab409e";
logging-data="1625262"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/7U1GCdTcFpN+hYEolF1iz"
Cancel-Lock: sha1:eKWS0cIsKe6gAnIEEMxA1Mf0y2A=
X-Newsreader: Forte Agent 1.93/32.576 English (American)
 by: Ruud Harmsen - Fri, 31 Mar 2023 19:04 UTC

Examples of using the right-to-left mark and left-to-right mark
(Unicode) to solve direction problems with occasional Arabic or
Yiddish embedded in left-to-right text.

https://rudhar.com/sfreview/rtolmark/en.htm

--
Ruud Harmsen, http://rudhar.com

Re: Right-to-left mark, &lrm;

<f1459bff-e07f-4b27-85b4-fc85682f2c15n@googlegroups.com>

  copy mid

https://www.novabbs.com/tech/article-flat.php?id=16020&group=sci.lang#16020

  copy link   Newsgroups: sci.lang
X-Received: by 2002:ad4:5502:0:b0:56e:a05a:2d92 with SMTP id pz2-20020ad45502000000b0056ea05a2d92mr5766744qvb.2.1680290851379;
Fri, 31 Mar 2023 12:27:31 -0700 (PDT)
X-Received: by 2002:aca:3bc1:0:b0:384:65af:e554 with SMTP id
i184-20020aca3bc1000000b0038465afe554mr3730812oia.1.1680290851020; Fri, 31
Mar 2023 12:27:31 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder6.news.weretis.net!newsfeed.hasname.com!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: sci.lang
Date: Fri, 31 Mar 2023 12:27:30 -0700 (PDT)
In-Reply-To: <3fbe2ihr5t85dvr57o7but7d6apgpsdesf@4ax.com>
Injection-Info: google-groups.googlegroups.com; posting-host=108.53.39.189; posting-account=tXYReAoAAABbl0njRzivyU02EBLaX9OF
NNTP-Posting-Host: 108.53.39.189
References: <3fbe2ihr5t85dvr57o7but7d6apgpsdesf@4ax.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <f1459bff-e07f-4b27-85b4-fc85682f2c15n@googlegroups.com>
Subject: Re: Right-to-left mark, &lrm;
From: gramma...@verizon.net (Peter T. Daniels)
Injection-Date: Fri, 31 Mar 2023 19:27:31 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 1589
 by: Peter T. Daniels - Fri, 31 Mar 2023 19:27 UTC

On Friday, March 31, 2023 at 3:04:45 PM UTC-4, Ruud Harmsen wrote:
> Examples of using the right-to-left mark and left-to-right mark
> (Unicode) to solve direction problems with occasional Arabic or
> Yiddish embedded in left-to-right text.
>
> https://rudhar.com/sfreview/rtolmark/en.htm

MSWord handles that with no problems at all.

InDesign, OTOH, which is what the vast majority of typesetters use,
does not.

Re: Right-to-left mark, &lrm;

<ubje2i93i5lr3tfqfn88i7uuenrj304ith@4ax.com>

  copy mid

https://www.novabbs.com/tech/article-flat.php?id=16021&group=sci.lang#16021

  copy link   Newsgroups: sci.lang
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder.eternal-september.org!.POSTED!not-for-mail
From: rh...@rudhar.com (Ruud Harmsen)
Newsgroups: sci.lang
Subject: Re: Right-to-left mark, &lrm;
Date: Fri, 31 Mar 2023 23:23:25 +0200
Organization: A noiseless patient Spider
Lines: 32
Message-ID: <ubje2i93i5lr3tfqfn88i7uuenrj304ith@4ax.com>
References: <3fbe2ihr5t85dvr57o7but7d6apgpsdesf@4ax.com> <f1459bff-e07f-4b27-85b4-fc85682f2c15n@googlegroups.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 8bit
Injection-Info: dont-email.me; posting-host="fd40fa370ba1c332d53b761192ab409e";
logging-data="1667960"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18tO3VW8jEY30J+9apnTITw"
Cancel-Lock: sha1:tSr0CJyvp7jlIBLCsg+glj1f2Qc=
X-Newsreader: Forte Agent 1.93/32.576 English (American)
 by: Ruud Harmsen - Fri, 31 Mar 2023 21:23 UTC

Fri, 31 Mar 2023 12:27:30 -0700 (PDT): "Peter T. Daniels"
<grammatim@verizon.net> scribeva:

>On Friday, March 31, 2023 at 3:04:45?PM UTC-4, Ruud Harmsen wrote:
>> Examples of using the right-to-left mark and left-to-right mark
>> (Unicode) to solve direction problems with occasional Arabic or
>> Yiddish embedded in left-to-right text.
>>
>> https://rudhar.com/sfreview/rtolmark/en.htm
>
>MSWord handles that with no problems at all.

1) I read a report in Facebook group "Learn Yiddish" that a copy-paste
from web to Word still goes wrong, reversed all the letters. Don’t
know which version that was.
https://www.facebook.com/groups/learnyiddish/posts/10159039924571056/

2) How can Word distinguish between a question mark that is part of an
English sentence with some embedded Hebrew or Yiddish, and the same
where the question mark belongs to the right-to-left part? As in the
examples I gave. The correct decision is only possible by
understanding the text.

In other words, in your "MSWord handles that with no problems at all",
what exactly do you mean by "that"?

>InDesign, OTOH, which is what the vast majority of typesetters use,
>does not.

That is bad.
--
Ruud Harmsen, http://rudhar.com

Re: Right-to-left mark, &lrm;

<87ae1ba9-7011-e88d-5d3c-93303a292a8b@email.de>

  copy mid

https://www.novabbs.com/tech/article-flat.php?id=16022&group=sci.lang#16022

  copy link   Newsgroups: sci.lang
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!news.szaf.org!fu-berlin.de!uni-berlin.de!individual.net!not-for-mail
From: hr.use...@email.de (Helmut Richter)
Newsgroups: sci.lang
Subject: Re: Right-to-left mark, &lrm;
Date: Sat, 1 Apr 2023 10:57:27 +0200
Lines: 45
Message-ID: <87ae1ba9-7011-e88d-5d3c-93303a292a8b@email.de>
References: <3fbe2ihr5t85dvr57o7but7d6apgpsdesf@4ax.com> <f1459bff-e07f-4b27-85b4-fc85682f2c15n@googlegroups.com> <ubje2i93i5lr3tfqfn88i7uuenrj304ith@4ax.com>
Mime-Version: 1.0
Content-Type: multipart/mixed; boundary="8323329-271049192-1680339448=:2068"
X-Trace: individual.net HR6THbQlsAdkDFW3toBEvw3CAzn20+UbnbbHBsOmLdIEbJkP1k
X-Orig-Path: kiboko2!hr.usenet
Cancel-Lock: sha1:mRhoiRD2+kAqTN9iK/lHemJ168A=
In-Reply-To: <ubje2i93i5lr3tfqfn88i7uuenrj304ith@4ax.com>
 by: Helmut Richter - Sat, 1 Apr 2023 08:57 UTC

On Fri, 31 Mar 2023, Ruud Harmsen wrote:

> 2) How can Word distinguish between a question mark that is part of an
> English sentence with some embedded Hebrew or Yiddish, and the same
> where the question mark belongs to the right-to-left part? As in the
> examples I gave. The correct decision is only possible by
> understanding the text.

The Unicode Bidi algorithm does not understand but just handles the most
frequent cases correctly. IIRC the next character with unique writing
direction after the question mark decides on the assumed writing direction of
the question mark. The following examples do not contain any LRM or RLM:

Are the first letters in the Hebrew alphabet א, ב, ג? Yes.

Are the first letters in the Hebrew alphabet א, ב, ג? כן.

A more logical sentence would be for the first:
Are the first letters in the Hebrew alphabet א‎, ב‎, ג? Yes.

It requires left-to-right marks before the two commas but none before the
question mark, as this is determined by the following Y.

If the second sentence is meant as an English question with a Hebrew answer,
it reqires an additional left-to-right marks before the question mark and a
right-to-left mark after the final period.

Are the first letters in the Hebrew alphabet א‎, ב‎, ג‎? כן.‏

I don’t know why it does not work with RTL before the period as in the
following line:

Are the first letters in the Hebrew alphabet א‎, ב‎, ג‎? כן‏.

--
Helmut Richter

Re: Right-to-left mark, &lrm;

<249af7f0-7dfe-478b-889f-318dee535ae1n@googlegroups.com>

  copy mid

https://www.novabbs.com/tech/article-flat.php?id=16023&group=sci.lang#16023

  copy link   Newsgroups: sci.lang
X-Received: by 2002:a05:620a:990:b0:733:4e2d:7834 with SMTP id x16-20020a05620a099000b007334e2d7834mr7419646qkx.4.1680345190978;
Sat, 01 Apr 2023 03:33:10 -0700 (PDT)
X-Received: by 2002:a05:6830:138c:b0:6a1:24e3:6607 with SMTP id
d12-20020a056830138c00b006a124e36607mr8000585otq.7.1680345190739; Sat, 01 Apr
2023 03:33:10 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!proxad.net!feeder1-2.proxad.net!209.85.160.216.MISMATCH!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: sci.lang
Date: Sat, 1 Apr 2023 03:33:10 -0700 (PDT)
In-Reply-To: <87ae1ba9-7011-e88d-5d3c-93303a292a8b@email.de>
Injection-Info: google-groups.googlegroups.com; posting-host=83.128.179.213; posting-account=XRxOkAoAAADRVgwnTokwatZ7Qx-JVtyI
NNTP-Posting-Host: 83.128.179.213
References: <3fbe2ihr5t85dvr57o7but7d6apgpsdesf@4ax.com> <f1459bff-e07f-4b27-85b4-fc85682f2c15n@googlegroups.com>
<ubje2i93i5lr3tfqfn88i7uuenrj304ith@4ax.com> <87ae1ba9-7011-e88d-5d3c-93303a292a8b@email.de>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <249af7f0-7dfe-478b-889f-318dee535ae1n@googlegroups.com>
Subject: Re: Right-to-left mark, &lrm;
From: goo...@rudhar.com (Ruud Harmsen via Google Groups)
Injection-Date: Sat, 01 Apr 2023 10:33:10 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
 by: Ruud Harmsen via Goo - Sat, 1 Apr 2023 10:33 UTC

On Saturday, April 1, 2023 at 10:57:31 AM UTC+2, Helmut Richter wrote:
> Are the first letters in the Hebrew alphabet א‎, ב‎, ג‎? כן.‏
>
> I don’t know why it does not work with RTL before the period as in the
> following line:
>
> Are the first letters in the Hebrew alphabet א‎, ב‎, ג‎? כן‏.

I think the full stop character, although it is neutral in matters of direction, defaults to being LTR, unless it is enclosed between two inherently RTL characters.
In the first of your examples that I quote here, you have:
question mark, space, kaf, nun, stop, RTL-mark, newline.
So the stop or dot character has neighbours on both sides that are clearly RTL.

In the second example:
question mark, space, kaf, nun, RTL-mark, stop, newline.
Now the stop character has an RTL neighbour and a neutral neighbour (the CR or LF, or similar control character dependeding on platform). Apparently RTL-neutral causes the interpretation LTR. But that is strange, because in longer running Hebrew text, you could have the sequence comma-space, so the the comma is between an RTL Hebrew character and the neutral space, and should be interpreted as RTL, not LTR.

Re: Right-to-left mark, &lrm;

<m52g2il9u81d57odrfkdb8h734k50l16rr@4ax.com>

  copy mid

https://www.novabbs.com/tech/article-flat.php?id=16024&group=sci.lang#16024

  copy link   Newsgroups: sci.lang
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder.eternal-september.org!.POSTED!not-for-mail
From: rh...@rudhar.com (Ruud Harmsen)
Newsgroups: sci.lang
Subject: Re: Right-to-left mark, &lrm;
Date: Sat, 01 Apr 2023 12:34:54 +0200
Organization: A noiseless patient Spider
Lines: 14
Message-ID: <m52g2il9u81d57odrfkdb8h734k50l16rr@4ax.com>
References: <3fbe2ihr5t85dvr57o7but7d6apgpsdesf@4ax.com> <f1459bff-e07f-4b27-85b4-fc85682f2c15n@googlegroups.com> <ubje2i93i5lr3tfqfn88i7uuenrj304ith@4ax.com> <87ae1ba9-7011-e88d-5d3c-93303a292a8b@email.de> <249af7f0-7dfe-478b-889f-318dee535ae1n@googlegroups.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Injection-Info: dont-email.me; posting-host="dac00bd970e96907849cb39197d89ff4";
logging-data="1978522"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/RyiU40aAU3bPhV0GpsLPc"
Cancel-Lock: sha1:zKEdZSVm/mHvAs2e8deoyA7t+GA=
X-Newsreader: Forte Agent 1.93/32.576 English (American)
 by: Ruud Harmsen - Sat, 1 Apr 2023 10:34 UTC

Sat, 1 Apr 2023 03:33:10 -0700 (PDT): "Ruud Harmsen via Google Groups
<google@rudhar.com>" <google@rudhar.com> scribeva:
>I think the full stop character, although it is neutral in matters of direction, defaults to being LTR, unless it is enclosed between two inherently RTL characters.
>In the first of your examples that I quote here, you have:
>question mark, space, kaf, nun, stop, RTL-mark, newline.
>So the stop or dot character has neighbours on both sides that are clearly RTL.
>
>In the second example:
>question mark, space, kaf, nun, RTL-mark, stop, newline.
>Now the stop character has an RTL neighbour and a neutral neighbour (the CR or LF, or similar control character dependeding on platform). Apparently RTL-neutral causes the interpretation LTR. But that is strange, because in longer running Hebrew text, you could have the sequence comma-space, so the the comma is between an RTL Hebrew character and the neutral space, and should be interpreted as RTL, not LTR.

Which of course is exactly your point.
--
Ruud Harmsen, http://rudhar.com

Re: Right-to-left mark, &lrm;

<cc75c27f-f25b-4fc4-915b-1913b8a94e24n@googlegroups.com>

  copy mid

https://www.novabbs.com/tech/article-flat.php?id=16025&group=sci.lang#16025

  copy link   Newsgroups: sci.lang
X-Received: by 2002:ad4:559c:0:b0:56e:a203:5d1f with SMTP id f28-20020ad4559c000000b0056ea2035d1fmr6049086qvx.5.1680346213138;
Sat, 01 Apr 2023 03:50:13 -0700 (PDT)
X-Received: by 2002:a05:6870:1017:b0:17e:3fe:6711 with SMTP id
23-20020a056870101700b0017e03fe6711mr4906066oai.2.1680346212843; Sat, 01 Apr
2023 03:50:12 -0700 (PDT)
Path: i2pn2.org!i2pn.org!usenet.goja.nl.eu.org!weretis.net!feeder8.news.weretis.net!proxad.net!feeder1-2.proxad.net!209.85.160.216.MISMATCH!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: sci.lang
Date: Sat, 1 Apr 2023 03:50:12 -0700 (PDT)
In-Reply-To: <m52g2il9u81d57odrfkdb8h734k50l16rr@4ax.com>
Injection-Info: google-groups.googlegroups.com; posting-host=83.128.179.213; posting-account=XRxOkAoAAADRVgwnTokwatZ7Qx-JVtyI
NNTP-Posting-Host: 83.128.179.213
References: <3fbe2ihr5t85dvr57o7but7d6apgpsdesf@4ax.com> <f1459bff-e07f-4b27-85b4-fc85682f2c15n@googlegroups.com>
<ubje2i93i5lr3tfqfn88i7uuenrj304ith@4ax.com> <87ae1ba9-7011-e88d-5d3c-93303a292a8b@email.de>
<249af7f0-7dfe-478b-889f-318dee535ae1n@googlegroups.com> <m52g2il9u81d57odrfkdb8h734k50l16rr@4ax.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <cc75c27f-f25b-4fc4-915b-1913b8a94e24n@googlegroups.com>
Subject: Re: Right-to-left mark, &lrm;
From: goo...@rudhar.com (Ruud Harmsen via Google Groups)
Injection-Date: Sat, 01 Apr 2023 10:50:13 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
 by: Ruud Harmsen via Goo - Sat, 1 Apr 2023 10:50 UTC

On Saturday, April 1, 2023 at 12:34:57 PM UTC+2, Ruud Harmsen wrote:
>>Now the stop character has an RTL neighbour and a neutral neighbour (the CR or LF, or similar control character dependeding on platform). Apparently RTL-neutral causes the interpretation LTR. But that is strange, because in longer running Hebrew text, you could have the sequence comma-space, so the the comma is between an RTL Hebrew character and the neutral space, and should be interpreted as RTL, not LTR.
> Which of course is exactly your point.

It’s more complicated than that, that’s why:
https://unicode.org/reports/tr9/#Bidirectional_Character_Types
Comma and full stop are weak, a space is neutral.

Re: Right-to-left mark, &lrm;

<e9966359-0aa1-479c-afc0-78edf65cd69bn@googlegroups.com>

  copy mid

https://www.novabbs.com/tech/article-flat.php?id=16026&group=sci.lang#16026

  copy link   Newsgroups: sci.lang
X-Received: by 2002:a05:620a:3186:b0:745:a35f:ad71 with SMTP id bi6-20020a05620a318600b00745a35fad71mr8212510qkb.13.1680355384976;
Sat, 01 Apr 2023 06:23:04 -0700 (PDT)
X-Received: by 2002:a54:4716:0:b0:384:6607:401 with SMTP id
k22-20020a544716000000b0038466070401mr9115874oik.0.1680355384632; Sat, 01 Apr
2023 06:23:04 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!proxad.net!feeder1-2.proxad.net!209.85.160.216.MISMATCH!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: sci.lang
Date: Sat, 1 Apr 2023 06:23:04 -0700 (PDT)
In-Reply-To: <ubje2i93i5lr3tfqfn88i7uuenrj304ith@4ax.com>
Injection-Info: google-groups.googlegroups.com; posting-host=108.53.39.189; posting-account=tXYReAoAAABbl0njRzivyU02EBLaX9OF
NNTP-Posting-Host: 108.53.39.189
References: <3fbe2ihr5t85dvr57o7but7d6apgpsdesf@4ax.com> <f1459bff-e07f-4b27-85b4-fc85682f2c15n@googlegroups.com>
<ubje2i93i5lr3tfqfn88i7uuenrj304ith@4ax.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <e9966359-0aa1-479c-afc0-78edf65cd69bn@googlegroups.com>
Subject: Re: Right-to-left mark, &lrm;
From: gramma...@verizon.net (Peter T. Daniels)
Injection-Date: Sat, 01 Apr 2023 13:23:04 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
 by: Peter T. Daniels - Sat, 1 Apr 2023 13:23 UTC

On Friday, March 31, 2023 at 5:23:27 PM UTC-4, Ruud Harmsen wrote:
> Fri, 31 Mar 2023 12:27:30 -0700 (PDT): "Peter T. Daniels"
> <gram...@verizon.net> scribeva:
> >On Friday, March 31, 2023 at 3:04:45?PM UTC-4, Ruud Harmsen wrote:
> >> Examples of using the right-to-left mark and left-to-right mark
> >> (Unicode) to solve direction problems with occasional Arabic or
> >> Yiddish embedded in left-to-right text.
> >>
> >> https://rudhar.com/sfreview/rtolmark/en.htm
> >
> >MSWord handles that with no problems at all.
> 1) I read a report in Facebook group "Learn Yiddish" that a copy-paste
> from web to Word still goes wrong, reversed all the letters. Don’t
> know which version that was.

Irrelevant. If the web site had screwed it up, that's not MSWord's
problem. You feed a perfect Word file into InDesign, it screws jt up.
I'm about to have THIRD proofs of a tiny article in *Journal of Semitic
Studies* because it couldn't handle THREE WORDS in Hebrew.

> https://www.facebook.com/groups/learnyiddish/posts/10159039924571056/
>
> 2) How can Word distinguish between a question mark that is part of an
> English sentence with some embedded Hebrew or Yiddish, and the same
> where the question mark belongs to the right-to-left part? As in the
> examples I gave. The correct decision is only possible by
> understanding the text.

Because the writer switches between scripts at the appropriate point.
Word has an option to "detect language" and change keyboards when
you change the font to a language-specific one. For Roman alphabet
when you switch to German or French, it automatically does the quotation
marks right, for instance. Presumably it does the same for Cyrillic languages
and hopefully for Arabic-script ones.

> In other words, in your "MSWord handles that with no problems at all",
> what exactly do you mean by "that"?

EXACTLY WHAT YOU SAID. R-to-l and l-to-r marks.

> >InDesign, OTOH, which is what the vast majority of typesetters use,
> >does not.
>
> That is bad.

Re: Right-to-left mark, &lrm;

<9ab16c4b-9f36-4087-aa9c-f2312d3cdef3n@googlegroups.com>

  copy mid

https://www.novabbs.com/tech/article-flat.php?id=16027&group=sci.lang#16027

  copy link   Newsgroups: sci.lang
X-Received: by 2002:a37:b94:0:b0:745:6fda:f04e with SMTP id 142-20020a370b94000000b007456fdaf04emr6389021qkl.9.1680381435292;
Sat, 01 Apr 2023 13:37:15 -0700 (PDT)
X-Received: by 2002:a9d:7301:0:b0:698:6b65:f563 with SMTP id
e1-20020a9d7301000000b006986b65f563mr9814612otk.4.1680381434991; Sat, 01 Apr
2023 13:37:14 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!news.uzoreto.com!peer01.ams4!peer.am4.highwinds-media.com!peer03.iad!feed-me.highwinds-media.com!news.highwinds-media.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: sci.lang
Date: Sat, 1 Apr 2023 13:37:14 -0700 (PDT)
In-Reply-To: <e9966359-0aa1-479c-afc0-78edf65cd69bn@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=83.128.179.213; posting-account=XRxOkAoAAADRVgwnTokwatZ7Qx-JVtyI
NNTP-Posting-Host: 83.128.179.213
References: <3fbe2ihr5t85dvr57o7but7d6apgpsdesf@4ax.com> <f1459bff-e07f-4b27-85b4-fc85682f2c15n@googlegroups.com>
<ubje2i93i5lr3tfqfn88i7uuenrj304ith@4ax.com> <e9966359-0aa1-479c-afc0-78edf65cd69bn@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <9ab16c4b-9f36-4087-aa9c-f2312d3cdef3n@googlegroups.com>
Subject: Re: Right-to-left mark, &lrm;
From: goo...@rudhar.com (Ruud Harmsen via Google Groups)
Injection-Date: Sat, 01 Apr 2023 20:37:15 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
X-Received-Bytes: 4352
 by: Ruud Harmsen via Goo - Sat, 1 Apr 2023 20:37 UTC

On Saturday, April 1, 2023 at 3:23:05 PM UTC+2, Peter T. Daniels wrote:
> On Friday, March 31, 2023 at 5:23:27 PM UTC-4, Ruud Harmsen wrote:
> > Fri, 31 Mar 2023 12:27:30 -0700 (PDT): "Peter T. Daniels"
> > <gram...@verizon.net> scribeva:
> > >On Friday, March 31, 2023 at 3:04:45?PM UTC-4, Ruud Harmsen wrote:
> > >> Examples of using the right-to-left mark and left-to-right mark
> > >> (Unicode) to solve direction problems with occasional Arabic or
> > >> Yiddish embedded in left-to-right text.
> > >>
> > >> https://rudhar.com/sfreview/rtolmark/en.htm
> > >
> > >MSWord handles that with no problems at all.
> > 1) I read a report in Facebook group "Learn Yiddish" that a copy-paste
> > from web to Word still goes wrong, reversed all the letters. Don’t
> > know which version that was.
> Irrelevant. If the web site had screwed it up, that's not MSWord's
> problem.

Website do not screw up, because there are standards, HTML is simple, and all major browsers now adhere to standards. And there are largely only two engines: Gecko and Chromium. Even Edge now use Chromium.

https://forward.com/yiddish/ does not screw up.

So it anything goes wrong, it is Word’s and Microsoft’s fault. But I expect newer Words to do it right too.

>You feed a perfect Word file into InDesign, it screws jt up.
> I'm about to have THIRD proofs of a tiny article in *Journal of Semitic
> Studies* because it couldn't handle THREE WORDS in Hebrew.

So why still use such defective software?

> > https://www.facebook.com/groups/learnyiddish/posts/10159039924571056/
> >
> > 2) How can Word distinguish between a question mark that is part of an
> > English sentence with some embedded Hebrew or Yiddish, and the same
> > where the question mark belongs to the right-to-left part? As in the
> > examples I gave. The correct decision is only possible by
> > understanding the text.
> Because the writer switches between scripts at the appropriate point.
> Word has an option to "detect language" and change keyboards when
> you change the font to a language-specific one. For Roman alphabet
> when you switch to German or French, it automatically does the quotation
> marks right, for instance. Presumably it does the same for Cyrillic languages
> and hopefully for Arabic-script ones.

When I set the language to English at the appropriate point in my last example, it didn’t help.

> > In other words, in your "MSWord handles that with no problems at all",
> > what exactly do you mean by "that"?
> EXACTLY WHAT YOU SAID. R-to-l and l-to-r marks.

OK. So we agree. Not the first time.

> > >InDesign, OTOH, which is what the vast majority of typesetters use,
> > >does not.
> >
> > That is bad.

Re: Right-to-left mark, &lrm;

<ac6d6c2b-3e3f-4999-879e-e42343cebe6bn@googlegroups.com>

  copy mid

https://www.novabbs.com/tech/article-flat.php?id=16029&group=sci.lang#16029

  copy link   Newsgroups: sci.lang
X-Received: by 2002:a05:620a:2238:b0:742:412d:1dc6 with SMTP id n24-20020a05620a223800b00742412d1dc6mr8459806qkh.14.1680442485034;
Sun, 02 Apr 2023 06:34:45 -0700 (PDT)
X-Received: by 2002:a9d:6c10:0:b0:688:cf52:6e18 with SMTP id
f16-20020a9d6c10000000b00688cf526e18mr10691459otq.4.1680442484787; Sun, 02
Apr 2023 06:34:44 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!proxad.net!feeder1-2.proxad.net!209.85.160.216.MISMATCH!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: sci.lang
Date: Sun, 2 Apr 2023 06:34:44 -0700 (PDT)
In-Reply-To: <9ab16c4b-9f36-4087-aa9c-f2312d3cdef3n@googlegroups.com>
Injection-Info: google-groups.googlegroups.com; posting-host=108.53.39.189; posting-account=tXYReAoAAABbl0njRzivyU02EBLaX9OF
NNTP-Posting-Host: 108.53.39.189
References: <3fbe2ihr5t85dvr57o7but7d6apgpsdesf@4ax.com> <f1459bff-e07f-4b27-85b4-fc85682f2c15n@googlegroups.com>
<ubje2i93i5lr3tfqfn88i7uuenrj304ith@4ax.com> <e9966359-0aa1-479c-afc0-78edf65cd69bn@googlegroups.com>
<9ab16c4b-9f36-4087-aa9c-f2312d3cdef3n@googlegroups.com>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <ac6d6c2b-3e3f-4999-879e-e42343cebe6bn@googlegroups.com>
Subject: Re: Right-to-left mark, &lrm;
From: gramma...@verizon.net (Peter T. Daniels)
Injection-Date: Sun, 02 Apr 2023 13:34:45 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
 by: Peter T. Daniels - Sun, 2 Apr 2023 13:34 UTC

On Saturday, April 1, 2023 at 4:37:16 PM UTC-4, Ruud Harmsen via Google Groups <google@rudhar.com> wrote:
> On Saturday, April 1, 2023 at 3:23:05 PM UTC+2, Peter T. Daniels wrote:

> Website do not screw up, because there are standards,

That should appear, with attribution, in someone's .sig

> >You feed a perfect Word file into InDesign, it screws jt up.
> > I'm about to have THIRD proofs of a tiny article in *Journal of Semitic
> > Studies* because it couldn't handle THREE WORDS in Hebrew.
>
> So why still use such defective software?

Do you ever pay attention?

Authors in journals have no say over what those contracted by the
publisher4 use.

Re: Right-to-left mark, &lrm;

<26uk2iln8ggpmsrdtdedk5o9d0ctioldru@4ax.com>

  copy mid

https://www.novabbs.com/tech/article-flat.php?id=16030&group=sci.lang#16030

  copy link   Newsgroups: sci.lang
Path: i2pn2.org!i2pn.org!eternal-september.org!feeder.eternal-september.org!.POSTED!not-for-mail
From: rh...@rudhar.com (Ruud Harmsen)
Newsgroups: sci.lang
Subject: Re: Right-to-left mark, &lrm;
Date: Mon, 03 Apr 2023 08:58:08 +0200
Organization: A noiseless patient Spider
Lines: 28
Message-ID: <26uk2iln8ggpmsrdtdedk5o9d0ctioldru@4ax.com>
References: <3fbe2ihr5t85dvr57o7but7d6apgpsdesf@4ax.com> <f1459bff-e07f-4b27-85b4-fc85682f2c15n@googlegroups.com> <ubje2i93i5lr3tfqfn88i7uuenrj304ith@4ax.com> <e9966359-0aa1-479c-afc0-78edf65cd69bn@googlegroups.com> <9ab16c4b-9f36-4087-aa9c-f2312d3cdef3n@googlegroups.com> <ac6d6c2b-3e3f-4999-879e-e42343cebe6bn@googlegroups.com>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit
Injection-Info: dont-email.me; posting-host="a84718a4ed6936ac5a43cf6f5717542f";
logging-data="3032323"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/6Bu08ppF9KHlJRK9ZnISE"
Cancel-Lock: sha1:nyx3HfnK6oPw2QQaJzclboheU+A=
X-Newsreader: Forte Agent 1.93/32.576 English (American)
 by: Ruud Harmsen - Mon, 3 Apr 2023 06:58 UTC

Sun, 2 Apr 2023 06:34:44 -0700 (PDT): "Peter T. Daniels"
<grammatim@verizon.net> scribeva:

>On Saturday, April 1, 2023 at 4:37:16?PM UTC-4, Ruud Harmsen via Google Groups <google@rudhar.com> wrote:
>> On Saturday, April 1, 2023 at 3:23:05?PM UTC+2, Peter T. Daniels wrote:
>
>> Website do not screw up, because there are standards,
>
>That should appear, with attribution, in someone's .sig

???

>> >You feed a perfect Word file into InDesign, it screws jt up.
>> > I'm about to have THIRD proofs of a tiny article in *Journal of Semitic
>> > Studies* because it couldn't handle THREE WORDS in Hebrew.
>>
>> So why still use such defective software?
>
>Do you ever pay attention?

Always AFAIK.

>Authors in journals have no say over what those contracted by the
>publisher4 use.

Then self-publish. Neem de teugels in eigen hand.
--
Ruud Harmsen, http://rudhar.com

1
server_pubkey.txt

rocksolid light 0.9.81
clearnet tor