Rocksolid Light

Welcome to novaBBS (click a section below)

mail  files  register  newsreader  groups  login

Message-ID:  

Help stamp out Mickey-Mouse computer interfaces -- Menus are for Restaurants!


devel / comp.lang.forth / Re: An HTML parser in 8th

SubjectAuthor
* An HTML parser in 8thRon AARON
+* Re: An HTML parser in 8thHans Bezemer
|`- Re: An HTML parser in 8thRon AARON
`* Re: An HTML parser in 8thBrad Eckert
 `- Re: An HTML parser in 8thRon AARON

1
An HTML parser in 8th

<sjhn1l$d0v$1@dont-email.me>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=14860&group=comp.lang.forth#14860

  copy link   Newsgroups: comp.lang.forth
Path: rocksolid2!news.neodome.net!news.mixmin.net!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: clf...@8th-dev.com (Ron AARON)
Newsgroups: comp.lang.forth
Subject: An HTML parser in 8th
Date: Tue, 5 Oct 2021 17:21:41 +0300
Organization: A noiseless patient Spider
Lines: 9
Message-ID: <sjhn1l$d0v$1@dont-email.me>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Tue, 5 Oct 2021 14:21:41 -0000 (UTC)
Injection-Info: reader02.eternal-september.org; posting-host="4e9564f0b7fe5233bc7812c90b28f9af";
logging-data="13343"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/ffs8Fh/KD5yu8vxET3WZE"
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101
Thunderbird/91.1.2
Cancel-Lock: sha1:PbX5LRnsX1n1lkiB9y1JU3FIo7M=
Content-Language: en-US
 by: Ron AARON - Tue, 5 Oct 2021 14:21 UTC

This is a pretty cool parser IMO, and it works reasonably well.

It relies on using regex to snip off tags, and builds a DOM structure
from it.

Details:
https://8th-dev.com/forum/index.php/topic,2385.0.html

Re: An HTML parser in 8th

<8a772e9d-cac7-4f2e-b756-ffc666937cbdn@googlegroups.com>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=14861&group=comp.lang.forth#14861

  copy link   Newsgroups: comp.lang.forth
X-Received: by 2002:ac8:4089:: with SMTP id p9mr19983798qtl.363.1633447527742;
Tue, 05 Oct 2021 08:25:27 -0700 (PDT)
X-Received: by 2002:ac8:5788:: with SMTP id v8mr20941422qta.372.1633447527331;
Tue, 05 Oct 2021 08:25:27 -0700 (PDT)
Path: rocksolid2!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!border2.nntp.dca1.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.lang.forth
Date: Tue, 5 Oct 2021 08:25:27 -0700 (PDT)
In-Reply-To: <sjhn1l$d0v$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=82.95.228.79; posting-account=Ebqe4AoAAABfjCRL4ZqOHWv4jv5ZU4Cs
NNTP-Posting-Host: 82.95.228.79
References: <sjhn1l$d0v$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <8a772e9d-cac7-4f2e-b756-ffc666937cbdn@googlegroups.com>
Subject: Re: An HTML parser in 8th
From: the.beez...@gmail.com (Hans Bezemer)
Injection-Date: Tue, 05 Oct 2021 15:25:27 +0000
Content-Type: text/plain; charset="UTF-8"
Lines: 9
 by: Hans Bezemer - Tue, 5 Oct 2021 15:25 UTC

On Tuesday, October 5, 2021 at 4:21:43 PM UTC+2, Ron AARON wrote:
> This is a pretty cool parser IMO, and it works reasonably well.
I did something similar, although with an entirely different philosophy. https://sourceforge.net/p/forth-4th/code/HEAD/tree/trunk/4th.src/lib/parsexml.4th
And example of its use can be found here. https://sourceforge.net/p/forth-4th/code/HEAD/tree/trunk/4th.src/kjots2gn.4th
> It relies on using regex to snip off tags, and builds a DOM structure
> from it.
Mine obviously doesn't, although I think one could set up something like that. At the moment, I do not have much use for it.

Hans Bezemer

Re: An HTML parser in 8th

<sjhr98$f7r$1@dont-email.me>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=14862&group=comp.lang.forth#14862

  copy link   Newsgroups: comp.lang.forth
Path: rocksolid2!i2pn2.org!i2pn.org!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: clf...@8th-dev.com (Ron AARON)
Newsgroups: comp.lang.forth
Subject: Re: An HTML parser in 8th
Date: Tue, 5 Oct 2021 18:34:00 +0300
Organization: A noiseless patient Spider
Lines: 21
Message-ID: <sjhr98$f7r$1@dont-email.me>
References: <sjhn1l$d0v$1@dont-email.me>
<8a772e9d-cac7-4f2e-b756-ffc666937cbdn@googlegroups.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Tue, 5 Oct 2021 15:34:00 -0000 (UTC)
Injection-Info: reader02.eternal-september.org; posting-host="4e9564f0b7fe5233bc7812c90b28f9af";
logging-data="15611"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18PfhrTGjd/o7QTJkq0cj57"
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101
Thunderbird/91.1.2
Cancel-Lock: sha1:sKQug9PB3sqowfBANw9PQVKgd6c=
In-Reply-To: <8a772e9d-cac7-4f2e-b756-ffc666937cbdn@googlegroups.com>
Content-Language: en-US
 by: Ron AARON - Tue, 5 Oct 2021 15:34 UTC

On 05/10/2021 18:25, Hans Bezemer wrote:
> On Tuesday, October 5, 2021 at 4:21:43 PM UTC+2, Ron AARON wrote:
>> This is a pretty cool parser IMO, and it works reasonably well.
> I did something similar, although with an entirely different philosophy. https://sourceforge.net/p/forth-4th/code/HEAD/tree/trunk/4th.src/lib/parsexml.4th
> And example of its use can be found here. https://sourceforge.net/p/forth-4th/code/HEAD/tree/trunk/4th.src/kjots2gn.4th

Nice.

>> It relies on using regex to snip off tags, and builds a DOM structure
>> from it.
> Mine obviously doesn't, although I think one could set up something like that. At the moment, I do not have much use for it.

I'm planning on creating an HTML display widget for the GUI. In the
(fairly) distant past, I did have one which embedded the OS-specific
browser. That was very heavy, and very painful to make cross-platform.
And then I stopped using some support library needed for that.

So now I'm planning on writing a simple HTML renderer just to have a
functional widget where HTML is useful.

Re: An HTML parser in 8th

<d37279f5-a460-4609-833a-b04c568f6208n@googlegroups.com>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=14868&group=comp.lang.forth#14868

  copy link   Newsgroups: comp.lang.forth
X-Received: by 2002:ac8:98:: with SMTP id c24mr28487707qtg.267.1633538395264;
Wed, 06 Oct 2021 09:39:55 -0700 (PDT)
X-Received: by 2002:a05:6214:cce:: with SMTP id 14mr6672980qvx.49.1633538395146;
Wed, 06 Oct 2021 09:39:55 -0700 (PDT)
Path: rocksolid2!i2pn.org!weretis.net!feeder6.news.weretis.net!news.misty.com!border2.nntp.dca1.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.lang.forth
Date: Wed, 6 Oct 2021 09:39:54 -0700 (PDT)
In-Reply-To: <sjhn1l$d0v$1@dont-email.me>
Injection-Info: google-groups.googlegroups.com; posting-host=2600:8800:1600:ae2:e1f8:84c0:3a7e:4e9e;
posting-account=6GCGIQoAAAAyO8IjR_VTwqiqLwx0Q_G8
NNTP-Posting-Host: 2600:8800:1600:ae2:e1f8:84c0:3a7e:4e9e
References: <sjhn1l$d0v$1@dont-email.me>
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <d37279f5-a460-4609-833a-b04c568f6208n@googlegroups.com>
Subject: Re: An HTML parser in 8th
From: hwfw...@gmail.com (Brad Eckert)
Injection-Date: Wed, 06 Oct 2021 16:39:55 +0000
Content-Type: text/plain; charset="UTF-8"
Lines: 11
 by: Brad Eckert - Wed, 6 Oct 2021 16:39 UTC

On Tuesday, October 5, 2021 at 7:21:43 AM UTC-7, Ron AARON wrote:
> This is a pretty cool parser IMO, and it works reasonably well.
>
> It relies on using regex to snip off tags, and builds a DOM structure
> from it.
>
>
>
> Details:
> https://8th-dev.com/forum/index.php/topic,2385.0.html

Is 8th like Docker? Except one or two orders of magnitude smaller.

Re: An HTML parser in 8th

<sjknho$c0t$1@dont-email.me>

  copy mid

https://www.novabbs.com/devel/article-flat.php?id=14869&group=comp.lang.forth#14869

  copy link   Newsgroups: comp.lang.forth
Path: rocksolid2!news.neodome.net!news.mixmin.net!eternal-september.org!reader02.eternal-september.org!.POSTED!not-for-mail
From: clf...@8th-dev.com (Ron AARON)
Newsgroups: comp.lang.forth
Subject: Re: An HTML parser in 8th
Date: Wed, 6 Oct 2021 20:48:40 +0300
Organization: A noiseless patient Spider
Lines: 17
Message-ID: <sjknho$c0t$1@dont-email.me>
References: <sjhn1l$d0v$1@dont-email.me>
<d37279f5-a460-4609-833a-b04c568f6208n@googlegroups.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit
Injection-Date: Wed, 6 Oct 2021 17:48:41 -0000 (UTC)
Injection-Info: reader02.eternal-september.org; posting-host="b5713f25a3d41da9926e2e51ad0cdd56";
logging-data="12317"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+e588/19o8vgjHrP/xP6DU"
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:78.0)
Gecko/20100101 Thunderbird/78.14.0
Cancel-Lock: sha1:260bkVIN7WVcqyVyXAwuu18zayY=
In-Reply-To: <d37279f5-a460-4609-833a-b04c568f6208n@googlegroups.com>
Content-Language: en-US
 by: Ron AARON - Wed, 6 Oct 2021 17:48 UTC

On 2021-10-06 19:39, Brad Eckert wrote:
> On Tuesday, October 5, 2021 at 7:21:43 AM UTC-7, Ron AARON wrote:
>> This is a pretty cool parser IMO, and it works reasonably well.
>>
>> It relies on using regex to snip off tags, and builds a DOM structure
>> from it.
>>
>>
>>
>> Details:
>> https://8th-dev.com/forum/index.php/topic,2385.0.html
>
> Is 8th like Docker? Except one or two orders of magnitude smaller.

Like Docker? No, not in any way I can think of.

1
server_pubkey.txt

rocksolid light 0.9.8
clearnet tor