Rocksolid Light

Welcome to novaBBS (click a section below)

mail  files  register  newsreader  groups  login

Message-ID:  

The world is coming to an end--save your buffers!


devel / comp.infosystems.gopher / Simple conversions from HTML to simple markups are disappointing

SubjectAuthor
* Simple conversions from HTML to simple markups are disappointingLuca Saiu
+- Re: Simple conversions from HTML to simple markups arertr
+* Re: Simple conversions from HTML to simple markups are disappointingbunburya
|`* Re: Simple conversions from HTML to simple markups are disappointingLuca Saiu
| `* Re: Simple conversions from HTML to simple markups are disappointingbunburya
|  `- Re: Simple conversions from HTML to simple markups are disappointingLuca Saiu
`- Re: Simple conversions from HTML to simple markups are disappointingmeff

1
Simple conversions from HTML to simple markups are disappointing

<ssjhge$k74$1@dont-email.me>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=40&group=comp.infosystems.gopher#40

  copy link   Newsgroups: comp.infosystems.gemini comp.infosystems.gopher
Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: luc...@ageinghacker.net (Luca Saiu)
Newsgroups: comp.infosystems.gemini,comp.infosystems.gopher
Subject: Simple conversions from HTML to simple markups are disappointing
Date: Sun, 23 Jan 2022 13:25:29 +0100
Organization: I only speak for myself
Lines: 145
Message-ID: <ssjhge$k74$1@dont-email.me>
Mime-Version: 1.0
Content-Type: multipart/mixed; boundary="=-=-="
Injection-Info: reader02.eternal-september.org; posting-host="22811730905c3a2549de53a3af78e6e0";
logging-data="20708"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18KcvA5dfbI8h8mohmIHRqrI4hWhqxOMjk="
User-Agent: Gnus (Gnus v5.13), GNU Emacs 28.0.50, x86_64-pc-linux-gnu
Cancel-Lock: sha1:4b4bibgXTAwco4ZGNgwPeG8vksI=
Face: iVBORw0KGgoAAAANSUhEUgAAADAAAAAwCAMAAABg3Am1AAAAP1BMVEUDBgINBwUVEBAPExMa
EA8ZGhsiGBQxJiIqLC45KSJLPDRCR0xZSEBuW1FrXVlcYml1e4GKe3KNlp2ik4nGzNAAkK8TAAAC
RklEQVRIx92V2Y7DIAxFwWDMmkDS///WsSGpmmlVMq/jqovUe7CNlyj1H03Lm00p8xc9OIRbhB4G
zgWn73no8hBiDHRX74jlOefoburd0JcS7uiVBrAdKSXD9IbOlEPsSJg6GNcKIRcJqUQ9r4ECUJqW
xvKS4zxro4ECAcUYCQHnAHAeKLkj+ZwJzSxpuSSQZDyCj0HhLGkdiVRKXI1ASyQN07IRovbsyeX8
eEQJ8TvAXQe9Fi5vj8cKX4FRMhesAKG0tldvZgDrKTgb3NLY4qRq0qjSetzcjuV7c3PAySxYa0sr
DMybFcQkNC5a28vt+STu1CXv8d5Ec+I8c9KsoPSN/gZiB3ynrURj4M6EBhk2uSM0b4R/7y2+pZxD
ZmAxgL/1y4fLBakFA42MQXUh8GNhGOE8liWC4dVnXgcipd/ao9X4aE+mo5eA/Nvp3Ehj3Rkw4+u7
Xjm+zwWJq32E8hIRfho+AfbepiI0597nWK/JPHNAAcQKYSf0aXBJxxyZAj2BvHLVUEsn9ib2L6uc
/zDjbSgfQGs1Ge87wMfzQS9bED2iwQ55khIPoHKaejgAKwObn9l7z4gw3qczprZVj6cD66zNbTnP
HyZAWte0DKBu29odSL5O1gJEeAWSePErWz30W0V1Pu/6wpL56EAa5vsPBrZaRb9VPRJw3YGL3L8d
SGvXjc+0di2rty2JXnECro9h2cdS8H4Vea3r8b2dhuMB1h0AX98T6Dp+rVd9HevDWjgCOkLqAEd9
AvUCHHpLXIeyS+1+AI2+H3h3fKHnAAAAAElFTkSuQmCC
X-Home-Page: http://ageinghacker.net
OpenPGP: id=14DC72EB19F7D2E6C12E113CBF339ABE26C5D286 (old key, to be revoked); url=http://ageinghacker.net/public-keys/gpg-b.asc (new key)
Accept-Language: en, fr, it
 by: Luca Saiu - Sun, 23 Jan 2022 12:25 UTC
Attachments: html2simple (application/octet-stream)

Disgusted by the web with its anti-features, its enormous gratuitous
complexity and its essentially proprietary nature (the effort of
re-implementing a significant component from scratch is unrealistic for
single developers), I have recently opened Gemini and Gopher services
on ageinghacker.net .
I plan to publish new information on the simpler systems as well, rather
than only on my web site.

About existing web pages I had thought about machine-converting HTML to
Gopher and Gemini markup, using lynx -dump along with some simple Unix
shell scripting. The existing HTML on my web site is hand-written and
very simple; I had at least the good taste of not using JavaScript.

After experimenting for one day or two I have to admit that the result
is disappointing. The conversion is unnatural and I find that at the
same time some important information is lost (<tt> and <pre>) while some
which is irrelevant is preserved (icons). Having out-of-line links does
not help readability when references are numerous.

You can find my experiment at
gopher://ageinghacker.net
and
gemini://ageinghacker.net
, and my crude script attached to this message.

Now, it is possible to obtain a better conversion by spending more
effort: in particular lynx (which of course was never designed for this
task) is inadequate in preserving markup information. It is possible to
parse HTML instead, and start from an AST. On the other hand some fault
lies in the HTML source document as well: The document could have used,
for example, CSS for icons instead of <img> elements when the content
was not significant enough to deserve translation. However some style
information only encoded in CSS would be significant for translation:
had I used CSS in the place of old-style <tt> elements, recognising
“code”-type elements would have been an issue.
My html-to-gemini or html-to-gopher conversion would need a lot of the
complexity I want to avoid.

I have come to believe that the only really practical solution is
translating in the opposite direction: starting from a simple and clean
markup (I would say Gemini) and from that generating other simple
markups (Gopher) and the legacy system (HTML). This can and should
handle relative, intra-server links.

--
Luca Saiu -- http://ageinghacker.net
I support everyone's freedom of mocking any opinion or belief, no
matter how deeply held, with open disrespect and the same unrelented
enthusiasm of a toddler who has just learned the word "poo".

Attachments: html2simple (application/octet-stream)
Re: Simple conversions from HTML to simple markups are disappointing

<20220123203750.06a1ed14@haraya.local.net>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=41&group=comp.infosystems.gopher#41

  copy link   Newsgroups: comp.infosystems.gemini comp.infosystems.gopher
Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: rtr...@nospam.invalid (rtr)
Newsgroups: comp.infosystems.gemini,comp.infosystems.gopher
Subject: Re: Simple conversions from HTML to simple markups are
disappointing
Date: Sun, 23 Jan 2022 20:37:50 +0800
Organization: A noiseless patient Spider
Lines: 39
Message-ID: <20220123203750.06a1ed14@haraya.local.net>
References: <ssjhge$k74$1@dont-email.me>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable
Injection-Info: reader02.eternal-september.org; posting-host="ffd4035e756882386ad9e4e3d48f2fa0";
logging-data="26303"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+3G6AIVfI23DiP3JOlYKMt"
Cancel-Lock: sha1:cy//TZf4DLHUmwiKIIxXWo24w7I=
X-Newsreader: Claws Mail 4.0.0 (GTK+ 3.24.31; x86_64-unknown-openbsd7.0)
 by: rtr - Sun, 23 Jan 2022 12:37 UTC

On Sun, 23 Jan 2022 13:25:29 +0100
Luca Saiu <luca@ageinghacker.net> wrote:

>
> [...]
>
> Now, it is possible to obtain a better conversion by spending more
> effort: in particular lynx (which of course was never designed for
> this task) is inadequate in preserving markup information. It is
> possible to parse HTML instead, and start from an AST. On the other
> hand some fault lies in the HTML source document as well: The
> document could have used, for example, CSS for icons instead of <img>
> elements when the content was not significant enough to deserve
> translation. However some style information only encoded in CSS
> would be significant for translation: had I used CSS in the place of
> old-style <tt> elements, recognising “code”-type elements would have
> been an issue. My html-to-gemini or html-to-gopher conversion would
> need a lot of the complexity I want to avoid.
>
> I have come to believe that the only really practical solution is
> translating in the opposite direction: starting from a simple and
> clean markup (I would say Gemini) and from that generating other
> simple markups (Gopher) and the legacy system (HTML). This can and
> should handle relative, intra-server links.
>

Interesting. I also do think that gemini/gopher -> html is easier to
deal with rather than the other way around. When I was first starting to
get into gemini I also dabbled with the idea of just converting my HTML
pages to gemtext. I figured that it's just easier to strip everything
of formatting and starting with plaintext and convert that gemtext.

Granted I don't have that much posts to mess with so that probably
played into my decision making process.

--
Give them an inch and they will take a mile.

Re: Simple conversions from HTML to simple markups are disappointing

<ssjn7f$1o0t$1@gioia.aioe.org>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=42&group=comp.infosystems.gopher#42

  copy link   Newsgroups: comp.infosystems.gemini comp.infosystems.gopher
Path: i2pn2.org!i2pn.org!aioe.org!vxNR6Z9z8DAL1HspykBuuw.user.46.165.242.75.POSTED!not-for-mail
From: bunbu...@tilde.club (bunburya)
Newsgroups: comp.infosystems.gemini,comp.infosystems.gopher
Subject: Re: Simple conversions from HTML to simple markups are disappointing
Date: Sun, 23 Jan 2022 14:03:27 +0000
Organization: Aioe.org NNTP Server
Message-ID: <ssjn7f$1o0t$1@gioia.aioe.org>
References: <ssjhge$k74$1@dont-email.me>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Info: gioia.aioe.org; logging-data="57373"; posting-host="vxNR6Z9z8DAL1HspykBuuw.user.gioia.aioe.org"; mail-complaints-to="abuse@aioe.org";
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101
Thunderbird/91.5.0
Content-Language: en-US
X-Notice: Filtered by postfilter v. 0.9.2
 by: bunburya - Sun, 23 Jan 2022 14:03 UTC

On 23/01/2022 12:25, Luca Saiu wrote:
> I have come to believe that the only really practical solution is
> translating in the opposite direction: starting from a simple and clean
> markup (I would say Gemini) and from that generating other simple
> markups (Gopher) and the legacy system (HTML). This can and should
> handle relative, intra-server links.

I believe this is correct, because the features supported by HTML are a
superset of those supported by gemtext. So going from HTML -> gemtext
almost always results in a loss of some information, which means a
choice must be made as to how to handle the loss of information. I
suspect the optimal solution to the problem will depend heavily on the
context, so it is hard to create a perfect, generalised HTML -> gemtext
converter.

Alternatively, you could consider starting in markdown, which lies
somewhere between gemtext and HTML in terms of features. Markdown ->
HTML is easy and commonly done. In principle, markdown -> gemtext
suffers from the same issues as HTML -> gemtext (loss of information due
to moving to a format with fewer supported features), but much less
information is lost as markdown is much closer to gemtext to begin with.
There are some tools out there already to convert from markdown to
gemtext, such as https://pypi.org/project/md2gemini/

I'm guessing the main thing you are missing in gemtext is inline links.
If you start from markdown, these can be preserved perfectly when
converting to HTML, and handled sensibly when converting to gemtext
(there are a few common ways to do this, such as converting to
footnotes, which the above tool supports).

Re: Simple conversions from HTML to simple markups are disappointing

<ssk1fd$4lj$1@dont-email.me>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=43&group=comp.infosystems.gopher#43

  copy link   Newsgroups: comp.infosystems.gemini comp.infosystems.gopher
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: luc...@ageinghacker.net (Luca Saiu)
Newsgroups: comp.infosystems.gemini,comp.infosystems.gopher
Subject: Re: Simple conversions from HTML to simple markups are disappointing
Date: Sun, 23 Jan 2022 17:58:18 +0100
Organization: I only speak for myself
Lines: 72
Message-ID: <ssk1fd$4lj$1@dont-email.me>
References: <ssjhge$k74$1@dont-email.me> <ssjn7f$1o0t$1@gioia.aioe.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 8bit
Injection-Info: reader02.eternal-september.org; posting-host="22811730905c3a2549de53a3af78e6e0";
logging-data="4787"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19I78e9KpVJMEx34ZioC+4Ebwal5j2Ioc0="
User-Agent: Gnus (Gnus v5.13), GNU Emacs 28.0.50, x86_64-pc-linux-gnu
Cancel-Lock: sha1:t0o/8FAYL9I483OWJGNRejHKbb8=
Face: iVBORw0KGgoAAAANSUhEUgAAADAAAAAwCAMAAABg3Am1AAAAP1BMVEUDBgINBwUVEBAPExMa
EA8ZGhsiGBQxJiIqLC45KSJLPDRCR0xZSEBuW1FrXVlcYml1e4GKe3KNlp2ik4nGzNAAkK8TAAAC
RklEQVRIx92V2Y7DIAxFwWDMmkDS///WsSGpmmlVMq/jqovUe7CNlyj1H03Lm00p8xc9OIRbhB4G
zgWn73no8hBiDHRX74jlOefoburd0JcS7uiVBrAdKSXD9IbOlEPsSJg6GNcKIRcJqUQ9r4ECUJqW
xvKS4zxro4ECAcUYCQHnAHAeKLkj+ZwJzSxpuSSQZDyCj0HhLGkdiVRKXI1ASyQN07IRovbsyeX8
eEQJ8TvAXQe9Fi5vj8cKX4FRMhesAKG0tldvZgDrKTgb3NLY4qRq0qjSetzcjuV7c3PAySxYa0sr
DMybFcQkNC5a28vt+STu1CXv8d5Ec+I8c9KsoPSN/gZiB3ynrURj4M6EBhk2uSM0b4R/7y2+pZxD
ZmAxgL/1y4fLBakFA42MQXUh8GNhGOE8liWC4dVnXgcipd/ao9X4aE+mo5eA/Nvp3Ehj3Rkw4+u7
Xjm+zwWJq32E8hIRfho+AfbepiI0597nWK/JPHNAAcQKYSf0aXBJxxyZAj2BvHLVUEsn9ib2L6uc
/zDjbSgfQGs1Ge87wMfzQS9bED2iwQ55khIPoHKaejgAKwObn9l7z4gw3qczprZVj6cD66zNbTnP
HyZAWte0DKBu29odSL5O1gJEeAWSePErWz30W0V1Pu/6wpL56EAa5vsPBrZaRb9VPRJw3YGL3L8d
SGvXjc+0di2rty2JXnECro9h2cdS8H4Vea3r8b2dhuMB1h0AX98T6Dp+rVd9HevDWjgCOkLqAEd9
AvUCHHpLXIeyS+1+AI2+H3h3fKHnAAAAAElFTkSuQmCC
X-Home-Page: http://ageinghacker.net
OpenPGP: id=14DC72EB19F7D2E6C12E113CBF339ABE26C5D286 (old key, to be revoked); url=http://ageinghacker.net/public-keys/gpg-b.asc (new key)
Accept-Language: en, fr, it
 by: Luca Saiu - Sun, 23 Jan 2022 16:58 UTC

Hello bunburya.

On 2022-01-23 at 14:03 +0000, bunburya wrote:

> In principle, markdown -> gemtext suffers from the same issues as HTML
> -> gemtext (loss of information due to moving to a format with fewer
> supported features), but much less information is lost as markdown is
> much closer to gemtext to begin with.

Agreed.

> I'm guessing the main thing you are missing in gemtext is inline links.

To me the lack of control on preformatted text is more serious than the
lack of inline links, possibly because of the technical topics I
normally write about.

Not being able to display source code clearly is a fatal flaw for me;
and notice how Gopher is less flawed than Gemini in this sense, by
virtue of being less abstract.

I do not particularly mind non-inline links; in fact they may promote a
clear style. However numbered footnote-style links obtained by
conversion, without descriptive labels, are difficult to follow without
interrupting the flow of reading: see the end of my conversion example,
which I believe is representative in terms of clutter.

=> gemini://ageinghacker.net/test-conversion.gmi

One has to write, from the beginning, in the new “minimal markup” style.

Incidentally I am not saying that Gemini is perfect. In fact I miss a
comment syntax, which I would use to encode my own information
(examples: list of keywords, tags, priority in a site-wide page map).
But what I am thinking is a set of semantic extensions, the kind which
Gemini is designed to prevent; that is fair.

In the same way I also miss italic and bold -- call them “emphasis” if
you will -- but here it might be healthy to let them go altogether.
Part of this entire exercise is detoxing from the overabundance of
irrelevant information. For decades I have been planning never to use
smileys again and write text a dignified style, with meaning conveyed
through words instead of some flashy semi-literate replacement for them.
In the end once in a while laziness wins.

The more I consider the issue the more I lean towards defining my own
source format from which to machine-generate even simple formats like
Gemini. In the longer term it would be something powerful and
extensible, like M4 without the awful quoting mechanism. For the time
being it can be a trivial system.

I would keep my public site source tree, with markup files identified by
a specific extension linking each other, along with data files such as
images. A script would generate copies of the entire tree, with
symbolic links where appropriate, with notes translated into Gemini,
Gopher or HTML.

If I get to write this tool and make it even vaguely usable by others I
will announce it here.

How do you, and other people here, solve the problem? Do you write
sites accessible only to Gemini or only to Gopher?

Thanks for the conversation.

--
Luca Saiu -- http://ageinghacker.net
I support everyone's freedom of mocking any opinion or belief, no
matter how deeply held, with open disrespect and the same unrelented
enthusiasm of a toddler who has just learned the word "poo".

Re: Simple conversions from HTML to simple markups are disappointing

<sskc9c$962$1@dont-email.me>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=44&group=comp.infosystems.gopher#44

  copy link   Newsgroups: comp.infosystems.gemini comp.infosystems.gopher
Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: ema...@example.com (meff)
Newsgroups: comp.infosystems.gemini,comp.infosystems.gopher
Subject: Re: Simple conversions from HTML to simple markups are disappointing
Date: Sun, 23 Jan 2022 20:02:52 -0000 (UTC)
Organization: That of fools
Lines: 34
Message-ID: <sskc9c$962$1@dont-email.me>
References: <ssjhge$k74$1@dont-email.me>
Injection-Date: Sun, 23 Jan 2022 20:02:52 -0000 (UTC)
Injection-Info: reader02.eternal-september.org; posting-host="bdf0d64a15586e91bc2d03387e0ac518";
logging-data="9410"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19CzFDZuPatM7ksdSYq4lQ1"
User-Agent: slrn/1.0.3 (Linux)
Cancel-Lock: sha1:+gFJ5SrRZGHO2KP8n++15yGZXkA=
 by: meff - Sun, 23 Jan 2022 20:02 UTC

On 2022-01-23, Luca Saiu <luca@ageinghacker.net> wrote:
> Disgusted by the web with its anti-features, its enormous gratuitous
> complexity and its essentially proprietary nature (the effort of
> re-implementing a significant component from scratch is unrealistic for
> single developers), I have recently opened Gemini and Gopher services

Hm I didn't think Jehova's Witnesses made it from the Web onto the net
also...

> After experimenting for one day or two I have to admit that the result
> is disappointing. The conversion is unnatural and I find that at the
> same time some important information is lost (<tt> and <pre>) while some
> which is irrelevant is preserved (icons). Having out-of-line links does
> not help readability when references are numerous.

Indeed it's hard to move from HTML to more "lean" markup formats when
you are, at least somewhat, relying on the semantic information that
HTML is providing.

> I have come to believe that the only really practical solution is
> translating in the opposite direction: starting from a simple and clean
> markup (I would say Gemini) and from that generating other simple
> markups (Gopher) and the legacy system (HTML). This can and should
> handle relative, intra-server links.

HTML offers semantic information in its markup and browsers can take
that semantic information and make sense of it. There's obviously a
lot of conflation between visual and semantic information in HTML, but
the semantic information present makes it hard to translate to
non-semantic markup formats. Markdown, Gemtext, etc are mostly just
visual formats (where the browser is usually fairly "dumb" about how
to display the format.) It might make sense to format your information
in a non-semantic way first and then add semantic niceities, like
<pre> afterword.

Re: Simple conversions from HTML to simple markups are disappointing

<ssn5du$8k1$1@gioia.aioe.org>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=45&group=comp.infosystems.gopher#45

  copy link   Newsgroups: comp.infosystems.gemini comp.infosystems.gopher
Path: i2pn2.org!i2pn.org!aioe.org!6VA5CMh1oTC1UjxJVMibGA.user.46.165.242.75.POSTED!not-for-mail
From: bunbu...@tilde.club (bunburya)
Newsgroups: comp.infosystems.gemini,comp.infosystems.gopher
Subject: Re: Simple conversions from HTML to simple markups are disappointing
Date: Mon, 24 Jan 2022 21:24:14 +0000
Organization: Aioe.org NNTP Server
Message-ID: <ssn5du$8k1$1@gioia.aioe.org>
References: <ssjhge$k74$1@dont-email.me> <ssjn7f$1o0t$1@gioia.aioe.org>
<ssk1fd$4lj$1@dont-email.me>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Info: gioia.aioe.org; logging-data="8833"; posting-host="6VA5CMh1oTC1UjxJVMibGA.user.gioia.aioe.org"; mail-complaints-to="abuse@aioe.org";
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101
Thunderbird/91.5.0
Content-Language: en-US
X-Notice: Filtered by postfilter v. 0.9.2
 by: bunburya - Mon, 24 Jan 2022 21:24 UTC

On 23/01/2022 16:58, Luca Saiu wrote:
> To me the lack of control on preformatted text is more serious than the
> lack of inline links, possibly because of the technical topics I
> normally write about.
>
> Not being able to display source code clearly is a fatal flaw for me;
> and notice how Gopher is less flawed than Gemini in this sense, by
> virtue of being less abstract.

What is the issue you are having with pre-formatted text? Line numbering
and syntax highlighting are the main things that come to mind - I think
these could be achieved on the client side, though I'm not aware of any
client that currently does so. (I know there was some discussion of
syntax highlighting in pre-formatted text on the mailing list a while
ago; the majority view seemed to be that the alt text part of the
pre-formatted text block could indicate the language, though some
disagreed with using alt text in that way).

> How do you, and other people here, solve the problem? Do you write
> sites accessible only to Gemini or only to Gopher?

Personally I publish only to Gemini; however, I write very little anyway
(really just my gemlog, which is not updated all that often) so I don't
claim to be any kind of example to follow.

Re: Simple conversions from HTML to simple markups are disappointing

<ssq3im$ugh$1@dont-email.me>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=46&group=comp.infosystems.gopher#46

  copy link   Newsgroups: comp.infosystems.gemini comp.infosystems.gopher
Path: i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: luc...@ageinghacker.net (Luca Saiu)
Newsgroups: comp.infosystems.gemini,comp.infosystems.gopher
Subject: Re: Simple conversions from HTML to simple markups are disappointing
Date: Wed, 26 Jan 2022 01:10:59 +0100
Organization: I only speak for myself
Lines: 58
Message-ID: <ssq3im$ugh$1@dont-email.me>
References: <ssjhge$k74$1@dont-email.me> <ssjn7f$1o0t$1@gioia.aioe.org>
<ssk1fd$4lj$1@dont-email.me> <ssn5du$8k1$1@gioia.aioe.org>
Mime-Version: 1.0
Content-Type: text/plain
Injection-Info: reader02.eternal-september.org; posting-host="b1febdbc3f6a6ac7c12527df93836a19";
logging-data="31249"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18rsvs1O5LruXBlIDUxQlAzuVE80Nz1XzM="
User-Agent: Gnus (Gnus v5.13), GNU Emacs 28.0.50, x86_64-pc-linux-gnu
Cancel-Lock: sha1:WcKDzyyzkLRfdOLCy21aKv3zQ24=
Face: iVBORw0KGgoAAAANSUhEUgAAADAAAAAwCAMAAABg3Am1AAAAP1BMVEUDBgINBwUVEBAPExMa
EA8ZGhsiGBQxJiIqLC45KSJLPDRCR0xZSEBuW1FrXVlcYml1e4GKe3KNlp2ik4nGzNAAkK8TAAAC
RklEQVRIx92V2Y7DIAxFwWDMmkDS///WsSGpmmlVMq/jqovUe7CNlyj1H03Lm00p8xc9OIRbhB4G
zgWn73no8hBiDHRX74jlOefoburd0JcS7uiVBrAdKSXD9IbOlEPsSJg6GNcKIRcJqUQ9r4ECUJqW
xvKS4zxro4ECAcUYCQHnAHAeKLkj+ZwJzSxpuSSQZDyCj0HhLGkdiVRKXI1ASyQN07IRovbsyeX8
eEQJ8TvAXQe9Fi5vj8cKX4FRMhesAKG0tldvZgDrKTgb3NLY4qRq0qjSetzcjuV7c3PAySxYa0sr
DMybFcQkNC5a28vt+STu1CXv8d5Ec+I8c9KsoPSN/gZiB3ynrURj4M6EBhk2uSM0b4R/7y2+pZxD
ZmAxgL/1y4fLBakFA42MQXUh8GNhGOE8liWC4dVnXgcipd/ao9X4aE+mo5eA/Nvp3Ehj3Rkw4+u7
Xjm+zwWJq32E8hIRfho+AfbepiI0597nWK/JPHNAAcQKYSf0aXBJxxyZAj2BvHLVUEsn9ib2L6uc
/zDjbSgfQGs1Ge87wMfzQS9bED2iwQ55khIPoHKaejgAKwObn9l7z4gw3qczprZVj6cD66zNbTnP
HyZAWte0DKBu29odSL5O1gJEeAWSePErWz30W0V1Pu/6wpL56EAa5vsPBrZaRb9VPRJw3YGL3L8d
SGvXjc+0di2rty2JXnECro9h2cdS8H4Vea3r8b2dhuMB1h0AX98T6Dp+rVd9HevDWjgCOkLqAEd9
AvUCHHpLXIeyS+1+AI2+H3h3fKHnAAAAAElFTkSuQmCC
X-Home-Page: http://ageinghacker.net
OpenPGP: id=14DC72EB19F7D2E6C12E113CBF339ABE26C5D286 (old key, to be revoked); url=http://ageinghacker.net/public-keys/gpg-b.asc (new key)
Accept-Language: en, fr, it
 by: Luca Saiu - Wed, 26 Jan 2022 00:10 UTC

On 2022-01-24 at 21:24 +0000, bunburya wrote:

> On 23/01/2022 16:58, Luca Saiu wrote:
>> To me the lack of control on preformatted text is more serious than the
>> lack of inline links, possibly because of the technical topics I
>> normally write about.
>> Not being able to display source code clearly is a fatal flaw for me;
>> and notice how Gopher is less flawed than Gemini in this sense, by
>> virtue of being less abstract.
>
> What is the issue you are having with pre-formatted text?

On Gemini we have to clearly indicate what is pre-formatted and what is
not, because the default is that whitespace can be congealed and lines
broken and moved in order to fill paragraphs; Gopher does not do it, but
that means that paragraphs may end up displayed too narrow or too wide
for the client.

If I convert from HTML my quick hack based on Lynx fails because the
information on what was pre-formatted is lost. Converting *well* from
HTML requires analysing CSS as well.

For new text, not obtained by conversion, the Gemini solution works
well.

> Line numbering and syntax highlighting are the main things that come
> to mind - I think these could be achieved on the client side, though
> I'm not aware of any client that currently does so.

Yes. I am not against these features as long as line numbering does not
interfere with cut-and-paste.

> (I know there was some discussion of syntax highlighting in
> pre-formatted text on the mailing list a while ago; the majority view seemed to
> be that the alt text part of the pre-formatted text block could indicate the
> language, though some disagreed with using alt text in that way).

The alt text is a good feature by itself (example: this kind of
colour-coding for different programming languages or abstraction layers:
http://ageinghacker.net/projects/jitter-tutorial/ ), but in my opinion
not very philosophically coherent with the rest of Gemini which is
otherwise so minimalistic. And the alt text, again, lends itself to
semantic extension.

>> How do you, and other people here, solve the problem? Do you write
>> sites accessible only to Gemini or only to Gopher?
>
> Personally I publish only to Gemini; however, I write very little
> anyway (really just my gemlog, which is not updated all that often) so
> I don't claim to be any kind of example to follow.

I see. Thanks.

I think I will write some simple translator to generate Gemini, Gopher
and HTML from the same source.

--
Luca Saiu -- http://ageinghacker.net

1
server_pubkey.txt

rocksolid light 0.9.8
clearnet tor