Rocksolid Light

Welcome to novaBBS (click a section below)

mail  files  register  newsreader  groups  login

Message-ID:  

If I have not seen so far it is because I stood in giant's footsteps.


computers / comp.text.tex / Thu 11 May: TeX Hour: Using LaTeXML to access audit arXiv LaTeX source files

SubjectAuthor
o Thu 11 May: TeX Hour: Using LaTeXML to access audit arXiv LaTeXJonathan Fine

1
Thu 11 May: TeX Hour: Using LaTeXML to access audit arXiv LaTeX source files

<302fa9a1-641b-49cd-826f-e4f1fa6eca21n@googlegroups.com>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=6565&group=comp.text.tex#6565

  copy link   Newsgroups: comp.text.tex
X-Received: by 2002:a05:620a:28cf:b0:74e:17da:5d7d with SMTP id l15-20020a05620a28cf00b0074e17da5d7dmr6958299qkp.13.1683749682129;
Wed, 10 May 2023 13:14:42 -0700 (PDT)
X-Received: by 2002:a25:38c:0:b0:b99:f14b:53c1 with SMTP id
134-20020a25038c000000b00b99f14b53c1mr8517676ybd.6.1683749681721; Wed, 10 May
2023 13:14:41 -0700 (PDT)
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!3.eu.feeder.erje.net!1.us.feeder.erje.net!feeder.erje.net!border-1.nntp.ord.giganews.com!nntp.giganews.com!news-out.google.com!nntp.google.com!postnews.google.com!google-groups.googlegroups.com!not-for-mail
Newsgroups: comp.text.tex
Date: Wed, 10 May 2023 13:14:41 -0700 (PDT)
Injection-Info: google-groups.googlegroups.com; posting-host=146.199.238.158; posting-account=1n5iOQoAAAAdoKmXR0eD8Li08uSD4aUd
NNTP-Posting-Host: 146.199.238.158
User-Agent: G2/1.0
MIME-Version: 1.0
Message-ID: <302fa9a1-641b-49cd-826f-e4f1fa6eca21n@googlegroups.com>
Subject: Thu 11 May: TeX Hour: Using LaTeXML to access audit arXiv LaTeX
source files
From: jfine2...@gmail.com (Jonathan Fine)
Injection-Date: Wed, 10 May 2023 20:14:42 +0000
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
Lines: 29
 by: Jonathan Fine - Wed, 10 May 2023 20:14 UTC

Hi

The arXiv has about 2.5 million articles, most of which have been processed with LaTeX to produce PDF. In addition, most of these LaTeX articles have been processed with LaTeXML, to produce HTML. Recently the arXix has announced it will be making this HTML available, to improve accessibility. Tomorrow's TeX Hour is about using LaTeXML to audit accessibility of the arXiv LaTeX source.

TeX Hour: Thursday 11 May, 6:30 to 7:30pm BST
More information: https://texhour.github.io/2023/05/11/latex-access-audit-latex/
Zoom URL: https://us02web.zoom.us/j/78551255396?pwd=cHdJN0pTTXRlRCtSd1lCTHpuWmNIUT09

LaTeXML produces a log file, containing warnings and errors. It provides to some degree an accessibility audit of the LaTeX source files on the arXiv. Tomorrow's TeX Hour is an informal preliminary report on my efforts to use thes log files to audit arXiv source for accessibility. Results so far are outnumbered by problems, but it's early days.

Going to https://ar5iv.labs.arxiv.org/feeling_lucky will send you to a random arXiv article in HTML. At the bottom of that page there is a link to the LaTeX-to-HTML conversion report (the log file), and also the arXiv PDF. Getting the LaTeX source is more work. Automating all this is one of the early problems.

wishing you safe and accessible TeXing

Jonathan


computers / comp.text.tex / Thu 11 May: TeX Hour: Using LaTeXML to access audit arXiv LaTeX source files

1
server_pubkey.txt

rocksolid light 0.9.81
clearnet tor