Rocksolid Light

Welcome to novaBBS (click a section below)

mail  files  register  newsreader  groups  login

Message-ID:  

You are false data.


computers / alt.comp.os.windows-10 / Re: Curious email anomaly

SubjectAuthor
* Curious email anomalyNewyana2
`* Re: Curious email anomalyApd
 `* Re: Curious email anomalyNewyana2
  +* Re: Curious email anomalyApd
  |`* Re: Curious email anomalyNewyana2
  | +* Re: Curious email anomalyApd
  | |`* Re: Curious email anomalyNewyana2
  | | `- Re: Curious email anomalyApd
  | `* Re: Curious email anomalyFrank Slootweg
  |  `* Re: Curious email anomalyNewyana2
  |   `- Re: Curious email anomalyFrank Slootweg
  `* Re: Curious email anomalyPaul
   `- Re: Curious email anomalyNewyana2

1
Curious email anomaly

<udfcqk$3gcv2$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=73303&group=alt.comp.os.windows-10#73303

  copy link   Newsgroups: alt.comp.os.windows-10
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Newya...@invalid.nospam (Newyana2)
Newsgroups: alt.comp.os.windows-10
Subject: Curious email anomaly
Date: Fri, 8 Sep 2023 10:57:41 -0400
Organization: A noiseless patient Spider
Lines: 17
Message-ID: <udfcqk$3gcv2$1@dont-email.me>
Injection-Date: Fri, 8 Sep 2023 14:58:28 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="0ac670a8e0481a3f6add915b063bdae7";
logging-data="3683298"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+hBiGRm0u8KLm8tuqUjt+x5UnpJnVZpPc="
Cancel-Lock: sha1:fr35aVPBRXfALV4EnKmVRXIDrCA=
X-Priority: 3
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5512
X-Newsreader: Microsoft Outlook Express 6.00.2900.5512
X-MSMail-Priority: Normal
 by: Newyana2 - Fri, 8 Sep 2023 14:57 UTC

Just wondering if anyone's ever seen this before.

=?utf-8?Q?IMG=5F0506.PNG?=

That was a filename in email attachments. I saved
them and then couldn't rename or delete them! I've
never run into anything like this. Some kind of unicode
corruption? They saved like so: 5F0506.PNG I
edited the email source code like so: 5F0506.jpg.

The sender was using gmail, no program listed in
the header, and he mistakenly named the files PNG
when they were actually JPG. I'm guessing he was
probably doing gmail through Safari on a Mac, but
I'm not sure.

Re: Curious email anomaly

<udffh5$3gr2v$1@apd.eternal-september.org>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=73305&group=alt.comp.os.windows-10#73305

  copy link   Newsgroups: alt.comp.os.windows-10
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!apd.eternal-september.org!.POSTED!not-for-mail
From: not...@all.invalid (Apd)
Newsgroups: alt.comp.os.windows-10
Subject: Re: Curious email anomaly
Date: Fri, 8 Sep 2023 16:44:30 +0100
Organization: ad hoc
Lines: 18
Message-ID: <udffh5$3gr2v$1@apd.eternal-september.org>
References: <udfcqk$3gcv2$1@dont-email.me>
Injection-Date: Fri, 8 Sep 2023 15:44:37 -0000 (UTC)
Injection-Info: apd.eternal-september.org; posting-host="297bb790685362b48f666672eae73744";
logging-data="3697759"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+6egS37FnootvtV81HjRz1"
Cancel-Lock: sha1:XTqoVbSIdpy9QR6vqDUYJ3dfdM0=
X-Newsreader: Microsoft Outlook Express 6.00.2900.5512
X-RFC2646: Format=Flowed; Original
X-Priority: 3
X-MSMail-Priority: Normal
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5512
 by: Apd - Fri, 8 Sep 2023 15:44 UTC

"Newyana2" wrote:
> Just wondering if anyone's ever seen this before.

Yes, particularly fields in Usenet message headers where UTF-8 gets
hex-encoded.

> =?utf-8?Q?IMG=5F0506.PNG?=

5F = hex for underscore.

Without encoding: IMG_0506.png

> That was a filename in email attachments. I saved
> them and then couldn't rename or delete them!

Strange.

Re: Curious email anomaly

<udgb0a$3ocis$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=73311&group=alt.comp.os.windows-10#73311

  copy link   Newsgroups: alt.comp.os.windows-10
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Newya...@invalid.nospam (Newyana2)
Newsgroups: alt.comp.os.windows-10
Subject: Re: Curious email anomaly
Date: Fri, 8 Sep 2023 19:32:43 -0400
Organization: A noiseless patient Spider
Lines: 19
Message-ID: <udgb0a$3ocis$1@dont-email.me>
References: <udfcqk$3gcv2$1@dont-email.me> <udffh5$3gr2v$1@apd.eternal-september.org>
Injection-Date: Fri, 8 Sep 2023 23:33:31 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="b3908bffee29f817572ff2d899e6d9bb";
logging-data="3945052"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+s5uHEGi+oPOl9G05mdqGGt/GFGufB83I="
Cancel-Lock: sha1:LbJVMQSudKHg2f7lT0Oid8U7nyc=
X-Newsreader: Microsoft Outlook Express 6.00.2900.5512
X-MSMail-Priority: Normal
X-Priority: 3
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5512
 by: Newyana2 - Fri, 8 Sep 2023 23:32 UTC

"Apd" <not@all.invalid> wrote

| > =?utf-8?Q?IMG=5F0506.PNG?=
| | 5F = hex for underscore.
| | Without encoding: IMG_0506.png
|

Ah. Thanks. I didn't think of that. But it makes no sense,
since _ is within ASCII, so it's also proper UTF-8. and the
whole thing still doesn't make sense. UTF-8 is not valid
in the Windows file system as far as I know. The sender said he
used something called "Spark". I figured it was probably
some kinf of Apple shennanigans, but it seems to be a
Windows program.

Re: Curious email anomaly

<udhc2t$hob$1@apd.eternal-september.org>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=73313&group=alt.comp.os.windows-10#73313

  copy link   Newsgroups: alt.comp.os.windows-10
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!apd.eternal-september.org!.POSTED!not-for-mail
From: not...@all.invalid (Apd)
Newsgroups: alt.comp.os.windows-10
Subject: Re: Curious email anomaly
Date: Sat, 9 Sep 2023 09:56:56 +0100
Organization: ad hoc
Lines: 29
Message-ID: <udhc2t$hob$1@apd.eternal-september.org>
References: <udfcqk$3gcv2$1@dont-email.me> <udffh5$3gr2v$1@apd.eternal-september.org> <udgb0a$3ocis$1@dont-email.me>
Injection-Date: Sat, 9 Sep 2023 08:58:05 -0000 (UTC)
Injection-Info: apd.eternal-september.org; posting-host="1e6290c836147ab8e05774c96979e6e6";
logging-data="18187"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/L6GSLhjjdkxS3X4NfUH1i"
Cancel-Lock: sha1:+0goW+b4Yle5Fkp39QG66tDEnFE=
X-MSMail-Priority: Normal
X-Newsreader: Microsoft Outlook Express 6.00.2900.5512
X-RFC2646: Format=Flowed; Original
X-Priority: 3
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5512
 by: Apd - Sat, 9 Sep 2023 08:56 UTC

"Newyana2" wrote:
> "Apd" wrote
> | > =?utf-8?Q?IMG=5F0506.PNG?=
> |
> | 5F = hex for underscore.
> |
> | Without encoding: IMG_0506.png
>
> Ah. Thanks. I didn't think of that. But it makes no sense,
> since _ is within ASCII, so it's also proper UTF-8.

Indeed. Just recently I saw an x-face in a Usenet message header that
partially encoded some ASCII like this. Only a few non-alphabetic
chars and not consistently. Of course, it completely broke it.

> and the whole thing still doesn't make sense. UTF-8 is not valid
> in the Windows file system as far as I know. The sender said he
> used something called "Spark". I figured it was probably
> some kinf of Apple shennanigans, but it seems to be a
> Windows program.

It's not unusual to see internet messages (which include email) using
UTF-8 when non-ASCII really is present. I think Windows tries to
convert it to 1252 (or whatever codepage charset it's using these
days). If it fails and inserts a '?' for a char it doesn't understand,
that's going to cause problems with file names. It shouldn't have been
an issue here.

Re: Curious email anomaly

<udhn91$2cmc$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=73314&group=alt.comp.os.windows-10#73314

  copy link   Newsgroups: alt.comp.os.windows-10
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Newya...@invalid.nospam (Newyana2)
Newsgroups: alt.comp.os.windows-10
Subject: Re: Curious email anomaly
Date: Sat, 9 Sep 2023 08:08:17 -0400
Organization: A noiseless patient Spider
Lines: 35
Message-ID: <udhn91$2cmc$1@dont-email.me>
References: <udfcqk$3gcv2$1@dont-email.me> <udffh5$3gr2v$1@apd.eternal-september.org> <udgb0a$3ocis$1@dont-email.me> <udhc2t$hob$1@apd.eternal-september.org>
Injection-Date: Sat, 9 Sep 2023 12:09:05 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="b3908bffee29f817572ff2d899e6d9bb";
logging-data="78540"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1986xKxyWVHe9R5ny2s34qCCOGSDnIoizM="
Cancel-Lock: sha1:3KOGViVXdjuZY6ST0Ltfvl6q5F0=
X-Newsreader: Microsoft Outlook Express 6.00.2900.5512
X-Priority: 3
X-MSMail-Priority: Normal
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5512
 by: Newyana2 - Sat, 9 Sep 2023 12:08 UTC

"Apd" <not@all.invalid> wrote

| > | > =?utf-8?Q?IMG=5F0506.PNG?=
| > |

| It's not unusual to see internet messages (which include email) using
| UTF-8 when non-ASCII really is present. I think Windows tries to
| convert it to 1252 (or whatever codepage charset it's using these
| days). If it fails and inserts a '?' for a char it doesn't understand,
| that's going to cause problems with file names. It shouldn't have been
| an issue here.
|

Found it: https://en.wikipedia.org/wiki/MIME

It's called Q-encoding. I'd never heard of this. Bizarre.
It includes the text encoding designation within the filename
field, and all those = and ? are part of the required format!

So it seems there were two problems. Spark email mistakenly
encoded _ and had to use Q-encoding, while my TBird seems
to only partially recognize Q-encoding. It dropped everything
except 0506.png, but it must have written a corrupt filename
to disk, perhaps including = and ?, resulting in saving file names
with "illegal" characters. (I've noticed that's often feasible. For
example I can create a .htaccess file with VBScript but Explorer
won't let me start a file name with a period.)

So that might explain why I couldn't rename or delete the files.
They were recorded with corrupt file names. I was able to delete
them with File Assassin, but that program had failed to let me
rename them. Which also makes sense, I guess, because the
files were never locked -- only corrupted as file system entries.

Re: Curious email anomaly

<udhtdk$3bof$1@apd.eternal-september.org>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=73315&group=alt.comp.os.windows-10#73315

  copy link   Newsgroups: alt.comp.os.windows-10
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!apd.eternal-september.org!.POSTED!not-for-mail
From: not...@all.invalid (Apd)
Newsgroups: alt.comp.os.windows-10
Subject: Re: Curious email anomaly
Date: Sat, 9 Sep 2023 14:53:48 +0100
Organization: ad hoc
Lines: 27
Message-ID: <udhtdk$3bof$1@apd.eternal-september.org>
References: <udfcqk$3gcv2$1@dont-email.me> <udffh5$3gr2v$1@apd.eternal-september.org> <udgb0a$3ocis$1@dont-email.me> <udhc2t$hob$1@apd.eternal-september.org> <udhn91$2cmc$1@dont-email.me>
Injection-Date: Sat, 9 Sep 2023 13:53:56 -0000 (UTC)
Injection-Info: apd.eternal-september.org; posting-host="1e6290c836147ab8e05774c96979e6e6";
logging-data="110351"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/dZvAdTbPUAkYtkcpn43KL"
Cancel-Lock: sha1:Dl4JEBtTbyGvvwvrZTtI8NGwsos=
X-Newsreader: Microsoft Outlook Express 6.00.2900.5512
X-Priority: 3
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5512
X-MSMail-Priority: Normal
X-RFC2646: Format=Flowed; Original
 by: Apd - Sat, 9 Sep 2023 13:53 UTC

"Newyana2" wrote:
> Found it: https://en.wikipedia.org/wiki/MIME
>
> It's called Q-encoding. I'd never heard of this. Bizarre.
> It includes the text encoding designation within the filename
> field, and all those = and ? are part of the required format!

Yes, RFC 2047 refers.

> So it seems there were two problems. Spark email mistakenly
> encoded _ and had to use Q-encoding,

Something I didn't know is that an underscore represents a space and
so needs to be encoded. Normally, a space would be "=20" but they say
it's for readability. There was mention in the RFC about underscores
not passing through some mail gateways (I don't know how true that is
nowadays), so perhaps the email program was encoding just to be safe.

> while my TBird seems
> to only partially recognize Q-encoding. It dropped everything
> except 0506.png, but it must have written a corrupt filename
> to disk, perhaps including = and ?, resulting in saving file names
> with "illegal" characters.

Bad Tbird!

Re: Curious email anomaly

<udi4mb$4p21$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=73316&group=alt.comp.os.windows-10#73316

  copy link   Newsgroups: alt.comp.os.windows-10
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: nos...@needed.invalid (Paul)
Newsgroups: alt.comp.os.windows-10
Subject: Re: Curious email anomaly
Date: Sat, 9 Sep 2023 11:58:01 -0400
Organization: A noiseless patient Spider
Lines: 138
Message-ID: <udi4mb$4p21$1@dont-email.me>
References: <udfcqk$3gcv2$1@dont-email.me>
<udffh5$3gr2v$1@apd.eternal-september.org> <udgb0a$3ocis$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
Injection-Date: Sat, 9 Sep 2023 15:58:03 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="cd66d5844cdb0d3403284065d8cb4f35";
logging-data="156737"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/3PMxB79XbldifiOkWh6+1C21e/VbTIeI="
User-Agent: Ratcatcher/2.0.0.25 (Windows/20130802)
Cancel-Lock: sha1:wz4rszEbesJD62BIkTeg4I8+0QU=
Content-Language: en-US
In-Reply-To: <udgb0a$3ocis$1@dont-email.me>
 by: Paul - Sat, 9 Sep 2023 15:58 UTC

On 9/8/2023 7:32 PM, Newyana2 wrote:
> "Apd" <not@all.invalid> wrote
>
> | > =?utf-8?Q?IMG=5F0506.PNG?=
> |
> | 5F = hex for underscore.
> |
> | Without encoding: IMG_0506.png
> |
>
> Ah. Thanks. I didn't think of that. But it makes no sense,
> since _ is within ASCII, so it's also proper UTF-8. and the
> whole thing still doesn't make sense. UTF-8 is not valid
> in the Windows file system as far as I know. The sender said he
> used something called "Spark". I figured it was probably
> some kinf of Apple shennanigans, but it seems to be a
> Windows program.

There is likely to be more than one program named Spark.

https://spark.apache.org/

*******

I knew NTFS accepted wide characters, but not the details of what
you stuff in them. The reason I have to know about this stuff,
is when analyzing Registry entries, as some file system paths are stored in 16-bit mode
in the Registry. The practice makes it damn hard to search for stuff.

https://stackoverflow.com/questions/2050973/what-encoding-are-filenames-in-ntfs-stored-as

"NTFS stores filenames in UTF-16, however fopen is using ANSI (not UTF-8).

In order to use an UTF16-encoded file name you will need to use the Unicode versions
of the file open calls. Do this by defining UNICODE and _UNICODE in your project.
Then use the CreateFile call or the wfopen call."

I saw in a ProcMon trace once, the usage of a file opening option,
which seemed to be "open the file but *delete* the file when you close it".
Basically a "read and delete" kind of semantic. A properly crafted wfopen
command, just might be enough to delete it :-) Deleting in NTFS is not difficult,
and a single byte in the 1024 byte $MFT carries the info that the file is deleted
and that the $MFT entry can be "reused, any time it is convenient for you". There
is no procedure in NTFS, to shrink or consolidate the $MFT, so filenames remain
visible until you "create" enough files, to reuse all the unused $MFT entries.

And I don't think changing languages would help. For one filesystem issue,
I was able to use Perl to make a correction. But the handling of anything
other than ANSI, is likely to be just as convoluted as the StackOverflow description.

It seems the NTFS file system calls, just don't have enough sanitization in them.
At a guess. Someone else recently had a problem, where a filename definitely violated
a cardinal NTFS rule, and of course, the user could not rename or delete either,
because as soon as the illegal filename was presented to file explorer, file explorer
said "here, let me fix this for you, by removing the illegal portion", and then
of course the result is "file not found". And that's why you're not able to
rename or delete, is it *does* do the sanitizing when it is inconvenient to do it.
But *does not* do the sanitizing, for the "browser wedges file system" cases :-/
Some kind of subroutine call browsers are using, seems to be bad for your situation.

*******

You can try deleting the file in question, using the short file name
in a Command Prompt windows. As that name may have fewer representation issues.

del somename.ext

You would need to look up, how to get the short names to show (if they exist).
The short name is effectively an alias.

*******

And I'm still chuckling here, as I DID find a way to make illegal filenames
on NTFS :-) (Removing an illegal file, may still have its challenges,
but I cannot reproduce your issue exactly, unless I can find a way to
duplicate it.)

It turns out, that in Linux,

[fuse filesystem ntfs.3g]

sudo mount -o windows_names,rw /dev/sda1 /mnt # This passes a mount option to sanitize
# filenames. This prevents "mistakes".

sudo mount -o rw /dev/sda1 /mnt # This is UNPROTECTED naming.
# Used to make the following picture

You can see I had fun, by putting a "dot" on the end of a filename.

I ran CHKDSK in Windows, and it does not do a damn thing about that file.

[Picture]

https://i.postimg.cc/DZS8LbY4/illegal-filename-via-knoppix531.gif

https://linux.die.net/man/8/ntfs-3g

"windows_names
This option prevents files, directories and extended attributes
to be created with a name not allowed by windows
"

*******

Why did I use Knoppix-531 DVD ?

There was no kernel level NTFS driver back then. Only NTFS-3G
existed, and it was ready to use from the DVD.

You click the disk icon on the desktop. From Terminal (icon on taskbar)

cat /etc/mtab

and that will show the options list for an ordinary mount. The context
menu has an option for "mount read/write" as normally Knoppix 531 safe-mounts
disks in read-only mode. In any case, you will notice that Knoppix
does not have "windows_names" in the options list.

At some point, the developers made "windows_names" the default, and,
they did not provide a "windows_names=No" option or similar. This means
the in-kernel NTFS mount on a modern (Ubuntu 23.04), would already be
enforcing valid NTFS filenames.

But, the ntfs-3g still exists, and on Ubuntu, it is already installed

gnome-disks (or just "disks" maybe) # This utility allows discovering names for things

sudo /sbin/mount.ntfs-3g -o rw /dev/sda1 /mnt

cd /mnt
ls
...
rm "funny-named-thing.ext" # This is the challenging part.

cd ~
sudo umount /mnt # Put away partition, before shutdown.

Paul

Re: Curious email anomaly

<udiahm$5ima$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=73319&group=alt.comp.os.windows-10#73319

  copy link   Newsgroups: alt.comp.os.windows-10
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Newya...@invalid.nospam (Newyana2)
Newsgroups: alt.comp.os.windows-10
Subject: Re: Curious email anomaly
Date: Sat, 9 Sep 2023 13:37:10 -0400
Organization: A noiseless patient Spider
Lines: 49
Message-ID: <udiahm$5ima$1@dont-email.me>
References: <udfcqk$3gcv2$1@dont-email.me> <udffh5$3gr2v$1@apd.eternal-september.org> <udgb0a$3ocis$1@dont-email.me> <udi4mb$4p21$1@dont-email.me>
Injection-Date: Sat, 9 Sep 2023 17:37:58 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="b3908bffee29f817572ff2d899e6d9bb";
logging-data="182986"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19z1sAskjDidjlrPhNwyqb1AxDx1V00lOk="
Cancel-Lock: sha1:AgXbVkL51QZzRJ+JRCSQdmqYw/4=
X-MSMail-Priority: Normal
X-Newsreader: Microsoft Outlook Express 6.00.2900.5512
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5512
X-Priority: 3
 by: Newyana2 - Sat, 9 Sep 2023 17:37 UTC

"Paul" <nospam@needed.invalid> wrote

| There is likely to be more than one program named Spark.
| | https://spark.apache.org/
| That's something else. I only find one Spark for
email, made by Readdle. There's no X-Mailer or
UserAgent field in the header. Looking around I see
that identifying the sending program has become rare.

| I knew NTFS accepted wide characters, but not the details of what
| you stuff in them. The reason I have to know about this stuff,
| is when analyzing Registry entries, as some file system paths are stored
in 16-bit mode
| in the Registry. The practice makes it damn hard to search for stuff.
| |
https://stackoverflow.com/questions/2050973/what-encoding-are-filenames-in-ntfs-stored-as
| | "NTFS stores filenames in UTF-16, however fopen is using ANSI (not
UTF-8).
|

Are you sitting down?... I'm on FAT32. Cuts down on the
nonsense and complications. Permissions are impossible to
enforce.

I didn't know that about NTFS file names. I've never run
into problems. But Windows has been mainly unicode for
a long time. I mostly work with VB6, which converts it
automatically. And Windows Script Host? I can't think of
any software that doesn't transfer seamlessly between
FAT32 and NTFS. I would have thought that Windows
would just manage that.

|
| You can try deleting the file in question, using the short file name
| in a Command Prompt windows. As that name may have fewer representation
issues.
|

File Assassin did it. I think the problem was just
that the stored file name in the file system probably
didn't match what Explorer saw.

Re: Curious email anomaly

<udiap1$5jq7$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=73320&group=alt.comp.os.windows-10#73320

  copy link   Newsgroups: alt.comp.os.windows-10
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Newya...@invalid.nospam (Newyana2)
Newsgroups: alt.comp.os.windows-10
Subject: Re: Curious email anomaly
Date: Sat, 9 Sep 2023 13:41:05 -0400
Organization: A noiseless patient Spider
Lines: 21
Message-ID: <udiap1$5jq7$1@dont-email.me>
References: <udfcqk$3gcv2$1@dont-email.me> <udffh5$3gr2v$1@apd.eternal-september.org> <udgb0a$3ocis$1@dont-email.me> <udhc2t$hob$1@apd.eternal-september.org> <udhn91$2cmc$1@dont-email.me> <udhtdk$3bof$1@apd.eternal-september.org>
Injection-Date: Sat, 9 Sep 2023 17:41:53 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="b3908bffee29f817572ff2d899e6d9bb";
logging-data="184135"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19zQAg5jtY7mBHd7tBHOKfLcxzDRa3SX8I="
Cancel-Lock: sha1:C6TL3P7+WkAGpWdvcndwWz9FpMg=
X-Priority: 3
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5512
X-Newsreader: Microsoft Outlook Express 6.00.2900.5512
X-MSMail-Priority: Normal
 by: Newyana2 - Sat, 9 Sep 2023 17:41 UTC

"Apd" <not@all.invalid> wrote

| Something I didn't know is that an underscore represents a space and
| so needs to be encoded. Normally, a space would be "=20" but they say
| it's for readability. There was mention in the RFC about underscores
| not passing through some mail gateways (I don't know how true that is
| nowadays), so perhaps the email program was encoding just to be safe.
|

I think it's just a bug. An underscore is often
used instead of a space, where a space can't be
used, like a URL. But it's not a space character.
And there's no problem with space characters in
ASCII. It's not necessary in email for a file name.
The name is in quotes. So filename: "kids at beach.jpg"
would be no problem. There's no reason at all to
be encoding the file name. On the other hand, there's
also no reason that TBird couldn't handle it. So it's
a screw-up on both ends.

Re: Curious email anomaly

<udipls$7mmf$1@apd.eternal-september.org>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=73322&group=alt.comp.os.windows-10#73322

  copy link   Newsgroups: alt.comp.os.windows-10
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!apd.eternal-september.org!.POSTED!not-for-mail
From: not...@all.invalid (Apd)
Newsgroups: alt.comp.os.windows-10
Subject: Re: Curious email anomaly
Date: Sat, 9 Sep 2023 22:56:09 +0100
Organization: ad hoc
Lines: 19
Message-ID: <udipls$7mmf$1@apd.eternal-september.org>
References: <udfcqk$3gcv2$1@dont-email.me> <udffh5$3gr2v$1@apd.eternal-september.org> <udgb0a$3ocis$1@dont-email.me> <udhc2t$hob$1@apd.eternal-september.org> <udhn91$2cmc$1@dont-email.me> <udhtdk$3bof$1@apd.eternal-september.org> <udiap1$5jq7$1@dont-email.me>
Injection-Date: Sat, 9 Sep 2023 21:56:12 -0000 (UTC)
Injection-Info: apd.eternal-september.org; posting-host="1e6290c836147ab8e05774c96979e6e6";
logging-data="252623"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+qD1YZEQhsWUh2IxWlS1Vk"
Cancel-Lock: sha1:ip8DZj1CC2nHRELqRmXQb17Evn8=
X-Newsreader: Microsoft Outlook Express 6.00.2900.5512
X-MSMail-Priority: Normal
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5512
X-RFC2646: Format=Flowed; Original
X-Priority: 3
 by: Apd - Sat, 9 Sep 2023 21:56 UTC

"Newyana2" wrote:
> "Apd" wrote
> | Something I didn't know is that an underscore represents a space and
> | so needs to be encoded. Normally, a space would be "=20" but they say
> | it's for readability. There was mention in the RFC about underscores
> | not passing through some mail gateways (I don't know how true that is
> | nowadays), so perhaps the email program was encoding just to be safe.
>
> I think it's just a bug. An underscore is often
> used instead of a space, where a space can't be
> used, like a URL. But it's not a space character.
> And there's no problem with space characters in
> ASCII.

Sure, but I was thinking it started out as an underscore, not a space,
so the mail agent decided to encode it as per RFC comments about
gateways.

Re: Curious email anomaly

<udkto3.rvc.1@ID-201911.user.individual.net>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=73334&group=alt.comp.os.windows-10#73334

  copy link   Newsgroups: alt.comp.os.windows-10
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!fu-berlin.de!uni-berlin.de!individual.net!not-for-mail
From: thi...@ddress.is.invalid (Frank Slootweg)
Newsgroups: alt.comp.os.windows-10
Subject: Re: Curious email anomaly
Date: 10 Sep 2023 15:18:21 GMT
Organization: NOYB
Lines: 49
Message-ID: <udkto3.rvc.1@ID-201911.user.individual.net>
References: <udfcqk$3gcv2$1@dont-email.me> <udffh5$3gr2v$1@apd.eternal-september.org> <udgb0a$3ocis$1@dont-email.me> <udhc2t$hob$1@apd.eternal-september.org> <udhn91$2cmc$1@dont-email.me>
X-Trace: individual.net xVMhSo5tlTEw/h1UcN75pQK/EIMuGibWDk7+aoZ8qxIZ+kgrcj
X-Orig-Path: not-for-mail
Cancel-Lock: sha1:t2dpB0UqsaEBOLCVWxX3TlG79QI= sha256:oYEgm2aV9M9XqdDLVYuUEY3QVeN+2tBrVjXaF5ygk4E=
User-Agent: tin/1.6.2-20030910 ("Pabbay") (UNIX) (CYGWIN_NT-10.0-WOW/2.8.0(0.309/5/3) (i686)) Hamster/2.0.2.2
 by: Frank Slootweg - Sun, 10 Sep 2023 15:18 UTC

Newyana2 <Newyana2@invalid.nospam> wrote:
> "Apd" <not@all.invalid> wrote
>
> | > | > =?utf-8?Q?IMG=5F0506.PNG?=
> | > |
>
> | It's not unusual to see internet messages (which include email) using
> | UTF-8 when non-ASCII really is present. I think Windows tries to
> | convert it to 1252 (or whatever codepage charset it's using these
> | days). If it fails and inserts a '?' for a char it doesn't understand,
> | that's going to cause problems with file names. It shouldn't have been
> | an issue here.
> |
>
> Found it: https://en.wikipedia.org/wiki/MIME
>
> It's called Q-encoding. I'd never heard of this. Bizarre.
> It includes the text encoding designation within the filename
> field, and all those = and ? are part of the required format!
>
> So it seems there were two problems. Spark email mistakenly
> encoded _ and had to use Q-encoding, while my TBird seems
> to only partially recognize Q-encoding. It dropped everything
> except 0506.png, but it must have written a corrupt filename
> to disk, perhaps including = and ?, resulting in saving file names
> with "illegal" characters. (I've noticed that's often feasible. For
> example I can create a .htaccess file with VBScript but Explorer
> won't let me start a file name with a period.)
>
> So that might explain why I couldn't rename or delete the files.
> They were recorded with corrupt file names. I was able to delete
> them with File Assassin, but that program had failed to let me
> rename them. Which also makes sense, I guess, because the
> files were never locked -- only corrupted as file system entries.

You later mentioned that you use FAT32. The 'corrupt file names' issue
is probably related to that, as on my (Windows 11) system, with NTFS,
File Explorer *can* delete and rename the example (Q-encoded) file name.

As to why the file name got Q-encoded in the first place:

I suspect that at some time, the file name was used in some e-mail
header - probably in Subject; - and for some reason (see Apd's
responses), some mailer somewhere thought the file name should be
Q-encoded (perhaps indeed because of the underscore). Once encoded,
nobody should see the encoded form, only the decoded form. BUT if somone
would copy and paste the file name from the header, the clipboard would
probably contain the encoded name, resulting in the havoc you
experienced.

Re: Curious email anomaly

<udku0m$ldea$1@dont-email.me>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=73335&group=alt.comp.os.windows-10#73335

  copy link   Newsgroups: alt.comp.os.windows-10
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Newya...@invalid.nospam (Newyana2)
Newsgroups: alt.comp.os.windows-10
Subject: Re: Curious email anomaly
Date: Sun, 10 Sep 2023 13:21:41 -0400
Organization: A noiseless patient Spider
Lines: 66
Message-ID: <udku0m$ldea$1@dont-email.me>
References: <udfcqk$3gcv2$1@dont-email.me> <udffh5$3gr2v$1@apd.eternal-september.org> <udgb0a$3ocis$1@dont-email.me> <udhc2t$hob$1@apd.eternal-september.org> <udhn91$2cmc$1@dont-email.me> <udkto3.rvc.1@ID-201911.user.individual.net>
Injection-Date: Sun, 10 Sep 2023 17:22:30 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="9e051a005910b1e30ed13c8862e72f95";
logging-data="701898"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19oCKt50h3tt++JZ7BogHF4b0nFsmpE1a0="
Cancel-Lock: sha1:LRpVgoaXFfBexu1LdYG+Z12AJUs=
X-Priority: 3
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.5512
X-MSMail-Priority: Normal
X-Newsreader: Microsoft Outlook Express 6.00.2900.5512
 by: Newyana2 - Sun, 10 Sep 2023 17:21 UTC

"Frank Slootweg" <this@ddress.is.invalid> wrote

| You later mentioned that you use FAT32. The 'corrupt file names' issue
| is probably related to that, as on my (Windows 11) system, with NTFS,
| File Explorer *can* delete and rename the example (Q-encoded) file name.
| Yes. Paul mentioned that NTFS is more sophisticated.
I didn't test to see what the file might have been named
on another system. There were 3 different issues: Spark
writing an unnecessary, arguably corrupted file name.
TBird apparently not properly parsing that name. Then
Windows allowing the name to be recorded differently
from what Explorer saw.

| As to why the file name got Q-encoded in the first place:
| | I suspect that at some time, the file name was used in some e-mail
| header - probably in Subject; - and for some reason (see Apd's
| responses), some mailer somewhere thought the file name should be
| Q-encoded (perhaps indeed because of the underscore). Once encoded,
| nobody should see the encoded form, only the decoded form. BUT if somone
| would copy and paste the file name from the header, the clipboard would
| probably contain the encoded name, resulting in the havoc you
| experienced.

There was no excuse for the encoding, except that Spark
was composing in UTF-8 and it's "legal" to encode it. The
mystery is why TBird, or something, dropped out the underscore.
An underscore is a perfectly legit ASCII character. But somehow
Explorer ended up not showing it. That made me curious whether
there's a way to directly read the file system, but I'm not
aware of such a tool. I'm curious how the file was recorded.
Since File Assassin could delete it but enable me to rename or
delete, I'm guessing the name was corrupted between the file
system and Explorer. Maybe it was recorded as including an = sign,
for example.

I just sent myself an image with underscore and a space. It
came through normally:

Content-Type: image/jpeg;
name="_e-device spying.jpg"
Content-Transfer-Encoding: base64
Content-Disposition: attachment;
filename="_e-device spying.jpg"

I'm curious how common this Q-encoding is. I've never
seen it before. I'd never heard of it. It's clearly "legal",
but I've even written email software and never saw such
a thing. There's no possible reason for it except to transmit
characters that don't exist in ASCII. Even then, it would
be converted on most systems. That is, if you send me
something like a file with a Chinese character then I'd
probably receive something like ~1/4.jpg if it worked at
all.

Which raises the question of unicode on Windows. Windows
has been unicode-16 for many years, but that's different
from UTF-8, using 2 bytes for all characters. I'm not sure
Explorer is capable, or Windows itself capable, of handling
a UTF-8 file name if there are characters not allowed in
Explorer.

It's funny how quickly character encoding gets confusing.

Re: Curious email anomaly

<udleis.shg.1@ID-201911.user.individual.net>

  copy mid

https://www.novabbs.com/computers/article-flat.php?id=73338&group=alt.comp.os.windows-10#73338

  copy link   Newsgroups: alt.comp.os.windows-10
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!fu-berlin.de!uni-berlin.de!individual.net!not-for-mail
From: thi...@ddress.is.invalid (Frank Slootweg)
Newsgroups: alt.comp.os.windows-10
Subject: Re: Curious email anomaly
Date: 10 Sep 2023 20:05:25 GMT
Organization: NOYB
Lines: 83
Message-ID: <udleis.shg.1@ID-201911.user.individual.net>
References: <udfcqk$3gcv2$1@dont-email.me> <udffh5$3gr2v$1@apd.eternal-september.org> <udgb0a$3ocis$1@dont-email.me> <udhc2t$hob$1@apd.eternal-september.org> <udhn91$2cmc$1@dont-email.me> <udkto3.rvc.1@ID-201911.user.individual.net> <udku0m$ldea$1@dont-email.me>
X-Trace: individual.net b+Nz5jhoLbYKy+T7K+dEpwXZ/8lHpJCUvV5AZaw5f104CS74qz
X-Orig-Path: not-for-mail
Cancel-Lock: sha1:AZwTsOvu6HUcm3e7iQmKAOcl4R0= sha256:S+UnfuIAZuTWqJt7CpDs/DHRnxWc9VoFqfSlbK/VHTA=
User-Agent: tin/1.6.2-20030910 ("Pabbay") (UNIX) (CYGWIN_NT-10.0-WOW/2.8.0(0.309/5/3) (i686)) Hamster/2.0.2.2
 by: Frank Slootweg - Sun, 10 Sep 2023 20:05 UTC

Newyana2 <Newyana2@invalid.nospam> wrote:
> "Frank Slootweg" <this@ddress.is.invalid> wrote
[...]
> | As to why the file name got Q-encoded in the first place:
> |
> | I suspect that at some time, the file name was used in some e-mail
> | header - probably in Subject; - and for some reason (see Apd's
> | responses), some mailer somewhere thought the file name should be
> | Q-encoded (perhaps indeed because of the underscore). Once encoded,
> | nobody should see the encoded form, only the decoded form. BUT if somone
> | would copy and paste the file name from the header, the clipboard would
> | probably contain the encoded name, resulting in the havoc you
> | experienced.
>
> There was no excuse for the encoding, except that Spark
> was composing in UTF-8 and it's "legal" to encode it. The
> mystery is why TBird, or something, dropped out the underscore.
> An underscore is a perfectly legit ASCII character. But somehow
> Explorer ended up not showing it. That made me curious whether
> there's a way to directly read the file system, but I'm not
> aware of such a tool. I'm curious how the file was recorded.
> Since File Assassin could delete it but enable me to rename or
> delete, I'm guessing the name was corrupted between the file
> system and Explorer. Maybe it was recorded as including an = sign,
> for example.
>
> I just sent myself an image with underscore and a space. It
> came through normally:
>
> Content-Type: image/jpeg;
> name="_e-device spying.jpg"
> Content-Transfer-Encoding: base64
> Content-Disposition: attachment;
> filename="_e-device spying.jpg"
>
> I'm curious how common this Q-encoding is. I've never
> seen it before. I'd never heard of it. It's clearly "legal",
> but I've even written email software and never saw such
> a thing. There's no possible reason for it except to transmit
> characters that don't exist in ASCII. Even then, it would
> be converted on most systems. That is, if you send me
> something like a file with a Chinese character then I'd
> probably receive something like ~1/4.jpg if it worked at
> all.

As I said, the Q-encoding is relevant to and possibly justified in
e-mail *headers*, for example in 'Subject:'. A header must be ASCII,
because any MIME headers define the encoding and charset of the *body*,
not of the headers.

This is nicely explained in the MIME page you referenced:

<https://en.wikipedia.org/wiki/MIME#Encoded-Word>

and specifically the example in

<https://en.wikipedia.org/wiki/MIME#Difference_between_Q-encoding_and_quoted-printable>

So as I mentioned, my suspicion is that the Q-encoded file name was
probably in some header, probably the 'Subject:' header.

Remains the question, *why* it was Q-encoded as all the characters in
the file name are normal printing characters? But as Apd mentioned,
perhaps the underscore ('_') is a printable character, but still an
exception on some systems, so it was encoded, just to be on the safe
side.

Just for kicks, I used Thunderbird to send myself a message with
"Subject: IMG_0506.PNG", but when viewing the Message Source, I saw that
the name in the message was *not* encoded. So I could not confirm my
suspicion (but also not disprove it).

> Which raises the question of unicode on Windows. Windows
> has been unicode-16 for many years, but that's different
> from UTF-8, using 2 bytes for all characters. I'm not sure
> Explorer is capable, or Windows itself capable, of handling
> a UTF-8 file name if there are characters not allowed in
> Explorer.
>
> It's funny how quickly character encoding gets confusing.

As your reference says, Q-encoding is similar to 'quoted-printable'.
That latter term was often qualified as 'quoted-unreadable'! :-)

1
server_pubkey.txt

rocksolid light 0.9.81
clearnet tor