Rocksolid Light

Welcome to novaBBS (click a section below)

mail  files  register  newsreader  groups  login

Message-ID:  

But maybe we don't really need that... -- Larry Wall in <199709011851.LAA07101@wall.org>


computers / news.software.readers / Re: [tin] Filtering for Message-ID

SubjectAuthor
* [tin] Filtering for Message-IDRene Kita
+* Re: [tin] Filtering for Message-IDRalph Fox
|`* Re: [tin] Filtering for Message-IDRene Kita
| `- Re: [tin] Filtering for Message-IDDrunkenThon
+- Re: [tin] Filtering for Message-IDissdr
+* Re: [tin] Filtering for Message-IDissdr
|`* Re: [tin] Filtering for Message-IDissdr
| `- Re: [tin] Filtering for Message-IDRene Kita
+- Re: [tin] Filtering for Message-IDDV
+* Re: [tin] Filtering for Message-IDRene Kita
|`* Re: [tin] Filtering for Message-IDUrs Janßen
| `- Re: [tin] Filtering for Message-IDRene Kita
+- Re: [tin] Filtering for Message-IDSn!pe
`* Re: [tin] Filtering for Message-IDJohn McCue
 `* Re: [tin] Filtering for Message-IDRene Kita
  `* Re: [tin] Filtering for Message-IDUrs Janßen
   `* Re: [tin] Filtering for Message-IDRene Kita
    `- Re: [tin] Filtering for Message-IDUrs Janßen

1
[tin] Filtering for Message-ID

<ug0bkv$3olc5$1@dont-email.me>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1494&group=news.software.readers#1494

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: mai...@rkta.de (Rene Kita)
Newsgroups: news.software.readers
Subject: [tin] Filtering for Message-ID
Date: Mon, 9 Oct 2023 07:55:13 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 19
Message-ID: <ug0bkv$3olc5$1@dont-email.me>
Injection-Date: Mon, 9 Oct 2023 07:55:13 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="92f22d5fa9d1e963a65f4f32234fe931";
logging-data="3954053"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+wN4HTanpK+/TUl+0ftOOH"
User-Agent: tin/2.4.5-20201224 ("Glen Albyn") (Linux/5.10.0-24-amd64 (x86_64))
Cancel-Lock: sha1:9I6hT+oH0ztBE7KacMFw8cwILJw=
 by: Rene Kita - Mon, 9 Oct 2023 07:55 UTC

Due to recent events I finally tried to filter Google Groups users with
tin. I do not want to filter followups to posts from GG users.

AFAIU GG posters have a Message-ID which always ends with
'@googlegroups.com'.

I have the following regex in my filter file:
group=*
case=0
score=-25
msgid=.*@googlegroups.com>

The above filter entry does not match initial articles. I tried multiple
versions, including using msgid_last and msgid_only, but could not
achieve what I wanted. I did not spot a pattern what is matched.

Can I somehow see what tin filters to debug this?

Does someone here has a working filter and would share it?

Re: [tin] Filtering for Message-ID

<r2e7iip7d7gqf2h2brkekrdtk2ta2lcb3m@4ax.com>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1495&group=news.software.readers#1495

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!usenet.blueworldhosting.com!diablo1.usenet.blueworldhosting.com!peer01.iad!feed-me.highwinds-media.com!news.highwinds-media.com!fx47.iad.POSTED!not-for-mail
From: -rf-...@-.invalid (Ralph Fox)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Message-ID: <r2e7iip7d7gqf2h2brkekrdtk2ta2lcb3m@4ax.com>
References: <ug0bkv$3olc5$1@dont-email.me>
User-Agent: ForteAgent/8.00.32.1272
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
X-Face: 5gSW~"1=jGDo(BXfTrgL2BnC3tUB_\d0u@mP~wA1fvK`z8I[>1jXVVZ!N6ittQ.K<5!i3l> ==jcyAk.[B>kLg8TY{+8%edZ(le:ncPt%s8Pr?]QXNXO]0RC#V_zt|%>=bt>rZ2iCI^-yl7Be(]Ep> OfyI!3Bf|e
Lines: 33
X-Complaints-To: abuse@easynews.com
Organization: Forte - www.forteinc.com
X-Complaints-Info: Please be sure to forward a copy of ALL headers otherwise we will be unable to process your complaint properly.
Date: Mon, 09 Oct 2023 21:25:13 +1300
X-Received-Bytes: 1824
 by: Ralph Fox - Mon, 9 Oct 2023 08:25 UTC

On Mon, 9 Oct 2023 07:55:13 -0000 (UTC), Rene Kita wrote:

> Due to recent events I finally tried to filter Google Groups users with
> tin. I do not want to filter followups to posts from GG users.
>
> AFAIU GG posters have a Message-ID which always ends with
> '@googlegroups.com'.
>
> I have the following regex in my filter file:
> group=*
> case=0
> score=-25
> msgid=.*@googlegroups.com>
>
> The above filter entry does not match initial articles. I tried multiple
> versions, including using msgid_last and msgid_only, but could not
> achieve what I wanted. I did not spot a pattern what is matched.
>
> Can I somehow see what tin filters to debug this?
>
> Does someone here has a working filter and would share it?

This site is now a few years old, but its tin filter may still work.

"The Usenet Improvement Project - Filters Examples 3"
<http://twovoyagers.com/improve-usenet.org/filters_ex3.html>

--
Kind regards
Ralph Fox
🦊

Re: [tin] Filtering for Message-ID

<87pm1omccy.fsf@ID-313311.news.uni-berlin.de>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1496&group=news.software.readers#1496

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!fu-berlin.de!uni-berlin.de!individual.net!not-for-mail
From: p_u_n_k_...@yahoo.it (issdr)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Mon, 09 Oct 2023 10:25:17 +0200
Lines: 7
Message-ID: <87pm1omccy.fsf@ID-313311.news.uni-berlin.de>
References: <ug0bkv$3olc5$1@dont-email.me>
Mime-Version: 1.0
Content-Type: text/plain
X-Trace: individual.net GW7eG+vF/135d5WCyNXXvAeqolm0hmBg9tHdwtIpyYsNujJP6C
X-Orig-Path: ID-313311.news.uni-berlin.de!not-for-mail
Cancel-Lock: sha1:Bbhqa3bWJn0Bx6CVCtJOPtca2fA= sha256:tHWP4mMcj3qXoLZXKmqPzTdTYQAU2+mQ1zb52+AKPhM=
User-Agent: Gnus/5.13 Emacs/30.0.50 (gnu/linux)
X-Face: Zw,R[}.rn5!7_4>.U1j!zNkszle\GP]4G+xPe`%F=i$PMP)8W/c57#P6B_TFI@1-LR6?a9
T@A$^Vh,|cG:_4p59{<cR\^_^r9<1Iq/8?4,7|-PhJpEyE/v42omF=F]M<p?9+;~(nPqwCCm*IaHu
N4S.pGVU=E!6R=0!PVz1`vb&zc`.X<jw
 by: issdr - Mon, 9 Oct 2023 08:25 UTC

Rene Kita wrote:

> AFAIU GG posters have a Message-ID which always ends with
> '@googlegroups.com'.

don't know specifically about tin, but... i'd use `$' (end of line) on
my regex

Re: [tin] Filtering for Message-ID

<87il7gmbx8.fsf@ID-313311.news.uni-berlin.de>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1497&group=news.software.readers#1497

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!fu-berlin.de!uni-berlin.de!individual.net!not-for-mail
From: p_u_n_k_...@yahoo.it (issdr)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Mon, 09 Oct 2023 10:34:43 +0200
Lines: 8
Message-ID: <87il7gmbx8.fsf@ID-313311.news.uni-berlin.de>
References: <ug0bkv$3olc5$1@dont-email.me>
Mime-Version: 1.0
Content-Type: text/plain
X-Trace: individual.net pKenJHNjmedUcc154KK8BQkdlxufu0k5plNT/ijA2OpcTVbZLD
X-Orig-Path: ID-313311.news.uni-berlin.de!not-for-mail
Cancel-Lock: sha1:gzhRIX+GNECVFMCeZZ5CpXTxdmY= sha256:ZSL3yN5XMuYM5A8WwfR5DA//hqScEdxtI1k+2O11/GI=
User-Agent: Gnus/5.13 Emacs/30.0.50 (gnu/linux)
X-Face: Zw,R[}.rn5!7_4>.U1j!zNkszle\GP]4G+xPe`%F=i$PMP)8W/c57#P6B_TFI@1-LR6?a9
T@A$^Vh,|cG:_4p59{<cR\^_^r9<1Iq/8?4,7|-PhJpEyE/v42omF=F]M<p?9+;~(nPqwCCm*IaHu
N4S.pGVU=E!6R=0!PVz1`vb&zc`.X<jw
 by: issdr - Mon, 9 Oct 2023 08:34 UTC

Rene Kita wrote:

> msgid=.*@googlegroups.com>

try this:

msgid=.*@googlegroups\.com

Re: [tin] Filtering for Message-ID

<87edi4mbea.fsf@ID-313311.news.uni-berlin.de>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1498&group=news.software.readers#1498

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!fu-berlin.de!uni-berlin.de!individual.net!not-for-mail
From: p_u_n_k_...@yahoo.it (issdr)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Mon, 09 Oct 2023 10:46:05 +0200
Lines: 15
Message-ID: <87edi4mbea.fsf@ID-313311.news.uni-berlin.de>
References: <ug0bkv$3olc5$1@dont-email.me>
<87il7gmbx8.fsf@ID-313311.news.uni-berlin.de>
Mime-Version: 1.0
Content-Type: text/plain
X-Trace: individual.net eEQodd8/4JrB4UxNwZYEiw5ce0wFAcWmOTFFWj6OF1srxvOOhw
X-Orig-Path: ID-313311.news.uni-berlin.de!not-for-mail
Cancel-Lock: sha1:VEqJRtc+Zbp9OHdXO8awiRvknp8= sha256:93dmVxd0Ie706DtqSx8j+d3rIrj/EmmhlyuX/Fv5Bzk=
User-Agent: Gnus/5.13 Emacs/30.0.50 (gnu/linux)
X-Face: Zw,R[}.rn5!7_4>.U1j!zNkszle\GP]4G+xPe`%F=i$PMP)8W/c57#P6B_TFI@1-LR6?a9
T@A$^Vh,|cG:_4p59{<cR\^_^r9<1Iq/8?4,7|-PhJpEyE/v42omF=F]M<p?9+;~(nPqwCCm*IaHu
N4S.pGVU=E!6R=0!PVz1`vb&zc`.X<jw
 by: issdr - Mon, 9 Oct 2023 08:46 UTC

issdr wrote:

> Rene Kita wrote:
>
>> msgid=.*@googlegroups.com>
>
> try this:
>
> msgid=.*@googlegroups\.com

nevermind. you should check tin's wildcard variable; however, this
should work in any case:

msgid_only:@googlegroups.com

Re: [tin] Filtering for Message-ID

<ug0euk$3olc5$2@dont-email.me>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1499&group=news.software.readers#1499

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: mai...@rkta.de (Rene Kita)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Mon, 9 Oct 2023 08:51:34 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 37
Message-ID: <ug0euk$3olc5$2@dont-email.me>
References: <ug0bkv$3olc5$1@dont-email.me> <r2e7iip7d7gqf2h2brkekrdtk2ta2lcb3m@4ax.com>
Injection-Date: Mon, 9 Oct 2023 08:51:34 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="92f22d5fa9d1e963a65f4f32234fe931";
logging-data="3954053"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19SAIE1npupF1zfd0OHaWyy"
User-Agent: tin/2.4.5-20201224 ("Glen Albyn") (Linux/5.10.0-24-amd64 (x86_64))
Cancel-Lock: sha1:tXDfo3+ExsFc+vz9yNHtw2T8mmg=
 by: Rene Kita - Mon, 9 Oct 2023 08:51 UTC

Ralph Fox <-rf-nz-@-.invalid> wrote:
> On Mon, 9 Oct 2023 07:55:13 -0000 (UTC), Rene Kita wrote:
>
>> Due to recent events I finally tried to filter Google Groups users with
>> tin. I do not want to filter followups to posts from GG users.
>>
>> AFAIU GG posters have a Message-ID which always ends with
>> '@googlegroups.com'.
>>
>> I have the following regex in my filter file:
>> group=*
>> case=0
>> score=-25
>> msgid=.*@googlegroups.com>
>>
>> The above filter entry does not match initial articles. I tried multiple
>> versions, including using msgid_last and msgid_only, but could not
>> achieve what I wanted. I did not spot a pattern what is matched.
>>
>> Can I somehow see what tin filters to debug this?
>>
>> Does someone here has a working filter and would share it?
>
>
> This site is now a few years old, but its tin filter may still work.
>
> "The Usenet Improvement Project - Filters Examples 3"
> <http://twovoyagers.com/improve-usenet.org/filters_ex3.html>

To quote the important part from that site:
group=*
case=0
score=kill
msgid_only=googlegroups

I tried that and articles with a Message-Id containing googlegroups are
not killed.

Re: [tin] Filtering for Message-ID

<ug0f7f$3olc5$3@dont-email.me>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1500&group=news.software.readers#1500

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: mai...@rkta.de (Rene Kita)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Mon, 9 Oct 2023 08:56:17 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 26
Message-ID: <ug0f7f$3olc5$3@dont-email.me>
References: <ug0bkv$3olc5$1@dont-email.me> <87il7gmbx8.fsf@ID-313311.news.uni-berlin.de> <87edi4mbea.fsf@ID-313311.news.uni-berlin.de>
Injection-Date: Mon, 9 Oct 2023 08:56:17 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="92f22d5fa9d1e963a65f4f32234fe931";
logging-data="3954053"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18hS2SmmWGelEDS08EFCaCk"
User-Agent: tin/2.4.5-20201224 ("Glen Albyn") (Linux/5.10.0-24-amd64 (x86_64))
Cancel-Lock: sha1:RdwSBbmgZgwJzQZ99rD0FvP7Tx8=
 by: Rene Kita - Mon, 9 Oct 2023 08:56 UTC

issdr <p_u_n_k_i_n_d@yahoo.it> wrote:
> issdr wrote:
>
>> Rene Kita wrote:
>>
>>> msgid=.*@googlegroups.com>
>>
>> try this:
>>
>> msgid=.*@googlegroups\.com
>
> nevermind. you should check tin's wildcard variable;

wildcard is set to 1 in my tinrc.

> however, this should work in any case:
>
> msgid_only:@googlegroups.com

I suppose you meant
msgid_only=@googlegroups.com

Does not work.

E.g. Message-ID: <ab3059fc-cab2-4048-b901-f43283285324n@googlegroups.com>
still has a score of 0 here.

Re: [tin] Filtering for Message-ID

<ug0fs2$km9$1@rasp.pasdenom.info>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1501&group=news.software.readers#1501

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!paganini.bofh.team!pasdenom.info!.POSTED.aorleans-651-1-231-78.w90-20.abo.wanadoo.fr!not-for-mail
From: dv...@reply-to.not.invalid (DV)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Mon, 9 Oct 2023 09:07:14 -0000 (UTC)
Organization: Yakakwatik !
Message-ID: <ug0fs2$km9$1@rasp.pasdenom.info>
References: <ug0bkv$3olc5$1@dont-email.me>
Reply-To: dv@s173327841.onlinehome.fr
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Mon, 9 Oct 2023 09:07:14 -0000 (UTC)
Injection-Info: rasp.pasdenom.info; posting-account="dv@usenet"; posting-host="aorleans-651-1-231-78.w90-20.abo.wanadoo.fr:90.20.166.78";
logging-data="21193"; mail-complaints-to="abuse@pasdenom.info"
User-Agent: MacCafe/3.01b3a (macOS 10.14.6 (18G9323) - iMac13,1)
Cancel-Lock: sha1:YirXaZg/UGfrhCk2SCJvoe8ZyJg= sha256:6e+8yhMFjIq1tC7Bi6izm9+854m2yraEFtvE5iJiQZE= sha256:9DIniujkMZgitLGX4m6QKQVpXBUt1J4iDrv92u6jE+E=
Face: iVBORw0KGgoAAAANSUhEUgAAADAAAAAwCAMAAABg3Am1AAAAMFBMVEX7+/vKzs6CcW2alZS
Ii4tmaWpVS0h7UkdVMiasr64oKC0LCgxMFwioPDyMDAzEAwM1qCwtAAAACXBIWXMAAAsTAAALEw
EAmpwYAAACN0lEQVRIx81W25biMAxr4kvipDP8/9+u7DS0hXLo05w1DwNTKbJj2bAsfx9pxj10J
gRzRtAtgmimgJdS8h0Rypp54EstdxjMuuG13mJYMAb+FiOJkZZJuMOoHIyBv8PIzZwxCbV8zakr
GBO+rutXhvacjBugKxha8td76p0SRMjsm0v8GfLvvT2h/h/7REhkborawdjR6Hv6aAsq3mUQunI
Wrc3f9tU+5K7NckGTB2yPD/eUcm8cTY6cEBNfLyUSASUm7gon7AJoxpWENZyqxHWTOGSEeJdIGl
lkU4UpTgreP7soIAhquRbkMIEl89a/iwIi2OowhHCE2GIXjNQ2PMomHIoAiFVcgOzVHgkFTAWf6
Diapjvw1xbJ5yGbcJdIlEGx0Egs8AUIxkeJ8NuTkhLBrLi0cHoYcbyOBNj/mRMxTkQOPDLtXcwq
hupAwKoz20S6cE9xLfDp6IPGM94JadwKhUhHDRien19UPQRwBJ4onVYRxXyJP4aY8c/jsWYaDml
ohuK2z30DHHm4iL+T9fGL+x1ZShaBX1q+YBgu2JcMRgc9y1mbI6etmp035GAQppNF/FBHPsHx4e
wO3kSOh2IYFOQWSi+EPS3a4A0ywt72SA2MV8sirRTWlEFAHV6xLwOPqvy6PtJWiEXWYNQx2UhJY
K3lfUElchGvfO4A96/bcPEVR+8LKtJKPFPyD0tgh98vt41rqDJtUJuz8XndD+Vx7Jjob18QSMu/
evkG9kkxu/tT4D+If9qAHl2x5idYAAAAAElFTkSuQmCC
X-Face: %#Y5+;k{8(i&083qlE0NiJXlA_f8@&4C~Wht.04?,h)ZC+1hXWem+cti:fHN`SdK~('$Fs+
_l\5t-n,/Tw,$UT>kS!5(g^LBT"qGP;?lU_@s,Rf+gk4.+%n*uMC%[PohI*J'//hyDdE
n3&=E;nc,A FM(pmGD8h3vksKInM/(_{uRqe586mT||H3\UaNcAq?0(+`!QtpiN:?
In-Reply-To: <ug0bkv$3olc5$1@dont-email.me>
 by: DV - Mon, 9 Oct 2023 09:07 UTC

Rene Kita wrote:

> AFAIU GG posters have a Message-ID which always ends with
> '@googlegroups.com'.

I found a message posted via Google Groups whose M-ID doesn't end with
@googlegroups.com (but it's from 2012):

<5593130.935.1330589899531.JavaMail.geo-discussion-forums@ynjc20>

I think the best header field to filter Google Groups messages is
Injection-Info, which always contains the string ’googlegroups’.

--
Denis

Serveurs de news et passerelles web : <http://usenet-fr.yakakwatik.org>
Lecteurs de news : <http://usenet-fr.yakakwatik.org/lecteurs-de-news.html>

Re: [tin] Filtering for Message-ID

<ug0fst$3pn94$1@dont-email.me>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1502&group=news.software.readers#1502

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: drunkent...@gmail.com (DrunkenThon)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Mon, 9 Oct 2023 09:07:42 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 22
Message-ID: <ug0fst$3pn94$1@dont-email.me>
References: <ug0bkv$3olc5$1@dont-email.me> <r2e7iip7d7gqf2h2brkekrdtk2ta2lcb3m@4ax.com> <ug0euk$3olc5$2@dont-email.me>
Injection-Date: Mon, 9 Oct 2023 09:07:42 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="918c57e6d4425ec969e91ac42221bc64";
logging-data="3988772"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+m1+LAPDpONLjkdCeVkuqiV7iK9Xam+GY="
User-Agent: tin/2.6.2-20221225 ("Pittyvaich") (Linux/5.15.0-86-generic (x86_64))
Cancel-Lock: sha1:DwZ30IuOPriVNVIUyXiyZBiuAEk=
 by: DrunkenThon - Mon, 9 Oct 2023 09:07 UTC

Rene Kita <mail@rkta.de> wrote:
> To quote the important part from that site:
> group=*
> case=0
> score=kill
> msgid_only=googlegroups
>
> I tried that and articles with a Message-Id containing googlegroups are
> not killed.

For *wildmat* this will work:
msgid_only=*@googlegroups.com*

For *rexexp* use this:
msgid_only=@googlegroups\.com

You can set this option (wildcard) to either 0 or 1 (0 for wildmat
and 1 for regex) in your tinrc file (tipically its ~/.tin/tinrc).

--
Best regards,
DrunkenThon.

Re: [tin] Filtering for Message-ID

<ug0gb0$3prht$1@dont-email.me>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1503&group=news.software.readers#1503

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!news.nntp4.net!news.gegeweb.eu!gegeweb.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: mai...@rkta.de (Rene Kita)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Mon, 9 Oct 2023 09:15:14 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 26
Message-ID: <ug0gb0$3prht$1@dont-email.me>
References: <ug0bkv$3olc5$1@dont-email.me>
Injection-Date: Mon, 9 Oct 2023 09:15:14 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="92f22d5fa9d1e963a65f4f32234fe931";
logging-data="3993149"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1/dyGGL3imcRJ2xUUE8GaI3"
User-Agent: tin/2.4.5-20201224 ("Glen Albyn") (Linux/5.10.0-24-amd64 (x86_64))
Cancel-Lock: sha1:zjkYaZWeFHKXeVg/w9uBzXRMC+U=
 by: Rene Kita - Mon, 9 Oct 2023 09:15 UTC

Rene Kita <mail@rkta.de> wrote:
> Due to recent events I finally tried to filter Google Groups users with
> tin. I do not want to filter followups to posts from GG users.
>
> AFAIU GG posters have a Message-ID which always ends with
> '@googlegroups.com'.
>
> I have the following regex in my filter file:
> group=*
> case=0
> score=-25
> msgid=.*@googlegroups.com>
>
> The above filter entry does not match initial articles. I tried multiple
> versions, including using msgid_last and msgid_only, but could not
> achieve what I wanted. I did not spot a pattern what is matched.
>
> Can I somehow see what tin filters to debug this?
>
> Does someone here has a working filter and would share it?

As always, it was a PEBKAC. I had another rule that also matched.
Apparently, if you have a msgid and a from in one rule those are not
ANDed but ORed.

Sorry for the noise and thanks for rubber-ducking.

Re: [tin] Filtering for Message-ID

<1qibxph.183zkkmt7nnvfN%snipeco.2@gmail.com>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1504&group=news.software.readers#1504

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!snipe.eternal-september.org!.POSTED!not-for-mail
From: snipec...@gmail.com (Sn!pe)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Mon, 9 Oct 2023 10:44:41 +0100
Organization: Sn!peCo World Wide Wading Birds
Lines: 30
Message-ID: <1qibxph.183zkkmt7nnvfN%snipeco.2@gmail.com>
References: <ug0bkv$3olc5$1@dont-email.me>
Reply-To: snipeco.1@gmail.com (Sn!pe)
MIME-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: 8bit
Injection-Info: snipe.eternal-september.org; posting-host="793274499575ffa440b25d9b71b13fa4";
logging-data="4008965"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+Ht+zqqdPrsS/8PCVqGIrt"
User-Agent: MacSOUP/2.8.6b1 (ed136d9b90) (Mac OS 10.13.6)
Cancel-Lock: sha1:Isqj3XO+ZnpSG3SvRFSTu+XF0ds=
X-Copyright: Copyright (c) 2023 Sn!peCo WWWB, All Rights Reserved.
This article may be reproduced for the purposes of propagation and
personal use only, no commercial use without express permission.
X-Disclaimer: Any advice that I may give is worth only what I paid for it.
This article comprises only my personal opinions unless otherwise stated.
May contain traces of nuts.
X-Tongue-In-Cheek: Always
X-Validate: All genuine Sn!peCo articles contain the header:
"Injection-Info: snipe.eternal-september.org;" my registered FQDN.
X-Clacks-Overhead: GNU Terry Pratchett; WonK; Large Enid
X-Face: 5<x+vv{"AHN,F~/dhf,X*~1zNv[TF/WUe(Uw.*ZOw\P'Ju]C6].T~7Z5cVjV\xTO6&)1#VQ
iZ4vFDG
 by: Sn!pe - Mon, 9 Oct 2023 09:44 UTC

Rene Kita <mail@rkta.de> wrote:

> Due to recent events I finally tried to filter Google Groups users with
> tin. I do not want to filter followups to posts from GG users.
>
> AFAIU GG posters have a Message-ID which always ends with
> '@googlegroups.com'.
>
> I have the following regex in my filter file:
> group=*
> case=0
> score=-25
> msgid=.*@googlegroups.com>
>
> The above filter entry does not match initial articles. I tried multiple
> versions, including using msgid_last and msgid_only, but could not
> achieve what I wanted. I did not spot a pattern what is matched.
>
> Can I somehow see what tin filters to debug this?
>
> Does someone here has a working filter and would share it?
>

In MacSOUP I kill Google posts with 'User-Agent contains "G2/1.0"'
Perhaps that could work for tin too.

--
^Ï^. Sn!pe <https://youtu.be/_kqytf31a8E>

My pet rock Gordon just is.

Re: [tin] Filtering for Message-ID

<ug0lur$8at$1@nntp.de>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1505&group=news.software.readers#1505

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!news.cgarbs.de!nntp.de!.POSTED.akk21-int.akk.kit.edu!not-for-mail
From: urs...@buil.tin.org (Urs Janßen)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Mon, 9 Oct 2023 10:51:07 -0000 (UTC)
Organization: tin.org
Archive: no
Message-ID: <ug0lur$8at$1@nntp.de>
References: <ug0bkv$3olc5$1@dont-email.me> <ug0gb0$3prht$1@dont-email.me>
Injection-Date: Mon, 9 Oct 2023 10:51:07 -0000 (UTC)
Injection-Info: nntp.de; posting-host="akk21-int.akk.kit.edu:2a00:1398:5:f602:cafe:cafe:cafe:21";
logging-data="8541"; mail-complaints-to="abuse@nntp.de"
User-Agent: tin/2.6.3-20231002 ("Pittyvaich") (Linux/5.10.0-25-amd64 (x86_64))
Cancel-Lock: sha1:oNbl5zsj4Ap8q4xg13so3zLoEY8=
X-No-Archive: yes
X-No-HTML: yes
 by: Urs Janßen - Mon, 9 Oct 2023 10:51 UTC

>> I do not want to filter followups to posts from GG users.

then you should use msgid_only instead of msgid

>> I have the following regex in my filter file:
>> group=*
>> case=0
>> score=-25
>> msgid=.*@googlegroups.com>

the leading .* is unnecessary when using regex-filters, but the dot should
be escaped:

msgid_only=@googlegroups\.com>

and I would make the search case insensitive via either

case=1

or by prefixing the regexp with (?i):

case=0
msgid_only=(?i)@googlegroups\.com>

>> Can I somehow see what tin filters to debug this?

If tin is compiled with debugging support and started with "-D 2"
it will create $TMPDIR/FILTER (details about the filter rules) and
$TMPDIR/ARTS (status of the articles after applying the rules), but
those files are hard to read without having some knowledge about the
data-structures used. there is no simply "article killed due to the
following rule" output available.

> As always, it was a PEBKAC. I had another rule that also matched.
> Apparently, if you have a msgid and a from in one rule those are not
> ANDed but ORed.

yes, there is currently no AND (but you could use scores with sum up to
score_limit_kill/score_limit_select only if all the rules are matched as
some kind of a workaroud.

Re: [tin] Filtering for Message-ID

<ug1fpn$1r58$2@dont-email.me>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1506&group=news.software.readers#1506

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: jmc...@fuzzball.jmcunx.com (John McCue)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Mon, 9 Oct 2023 18:12:08 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 26
Message-ID: <ug1fpn$1r58$2@dont-email.me>
References: <ug0bkv$3olc5$1@dont-email.me>
Reply-To: jmclnx@SPAMisBADgmail.com
Injection-Date: Mon, 9 Oct 2023 18:12:08 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="c242de84164a98c64e87a0d2c93f1933";
logging-data="60584"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX18Ni0iQ2oMtN5hfCKjpAFhA"
User-Agent: tin/2.4.4-20191224 ("Millburn") (OpenBSD/7.3 (amd64))
Cancel-Lock: sha1:onwiotIkDsGJuLTrMRSJowS30vg=
X-OS-Version: OpenBSD 7.3 amd64
 by: John McCue - Mon, 9 Oct 2023 18:12 UTC

Rene Kita <mail@rkta.de> wrote:
> Due to recent events I finally tried to filter Google Groups users with
> tin. I do not want to filter followups to posts from GG users.
>
> AFAIU GG posters have a Message-ID which always ends with
> '@googlegroups.com'.
>
> I have the following regex in my filter file:
> group=*
> case=0
> score=-25
> msgid=.*@googlegroups.com>

A few years ago tin made changes to their filter logic and I
found this works for me:

group=*
case=0
score=kill
msgid_last=*<*@googlegroups.com>*

> Does someone here has a working filter and would share it?

just did :)

HTH

Re: [tin] Filtering for Message-ID

<ug2ugu$10f6d$1@dont-email.me>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1507&group=news.software.readers#1507

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: mai...@rkta.de (Rene Kita)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Tue, 10 Oct 2023 07:29:36 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 26
Message-ID: <ug2ugu$10f6d$1@dont-email.me>
References: <ug0bkv$3olc5$1@dont-email.me> <ug0gb0$3prht$1@dont-email.me> <ug0lur$8at$1@nntp.de>
MIME-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-15
Content-Transfer-Encoding: 8bit
Injection-Date: Tue, 10 Oct 2023 07:29:36 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="c21e5fb4a2740df9a4a1b14a0f7495b9";
logging-data="1064141"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX19ainDov77Y+KG19ga9rjBy"
User-Agent: tin/2.4.5-20201224 ("Glen Albyn") (Linux/5.10.0-24-amd64 (x86_64))
Cancel-Lock: sha1:BqULuKIW0sHRobXsCUKiRR31DjE=
 by: Rene Kita - Tue, 10 Oct 2023 07:29 UTC

Urs Janßen <urs@buil.tin.org> wrote:
>>> Can I somehow see what tin filters to debug this?
>
> If tin is compiled with debugging support and started with "-D 2"
> it will create $TMPDIR/FILTER (details about the filter rules) and
> $TMPDIR/ARTS (status of the articles after applying the rules), but
> those files are hard to read without having some knowledge about the
> data-structures used. there is no simply "article killed due to the
> following rule" output available.

I will try this out. Seeing that an article is matched by multiple rules
would have been enough in my case.

>> As always, it was a PEBKAC. I had another rule that also matched.
>> Apparently, if you have a msgid and a from in one rule those are not
>> ANDed but ORed.
>
> yes, there is currently no AND (but you could use scores with sum up to
> score_limit_kill/score_limit_select only if all the rules are matched as
> some kind of a workaroud.

This was what I was trying to do. There is a user who sometimes posts
via Google Groups. My idea was to apply a positive score when he posts
via GG, but not when he uses a proper news reader.

Thanks for the detailed answer.

Re: [tin] Filtering for Message-ID

<ug2utl$10f6d$2@dont-email.me>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1508&group=news.software.readers#1508

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: mai...@rkta.de (Rene Kita)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Tue, 10 Oct 2023 07:36:23 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 26
Message-ID: <ug2utl$10f6d$2@dont-email.me>
References: <ug0bkv$3olc5$1@dont-email.me> <ug1fpn$1r58$2@dont-email.me>
Injection-Date: Tue, 10 Oct 2023 07:36:23 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="c21e5fb4a2740df9a4a1b14a0f7495b9";
logging-data="1064141"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1982GaXA959XvUl5h3QWC4V"
User-Agent: tin/2.4.5-20201224 ("Glen Albyn") (Linux/5.10.0-24-amd64 (x86_64))
Cancel-Lock: sha1:lFHYdWjdrsgM1rMH38bw5nzS+AM=
 by: Rene Kita - Tue, 10 Oct 2023 07:36 UTC

John McCue <jmccue@fuzzball.jmcunx.com> wrote:
> Rene Kita <mail@rkta.de> wrote:
>> Due to recent events I finally tried to filter Google Groups users with
>> tin. I do not want to filter followups to posts from GG users.
>>
>> AFAIU GG posters have a Message-ID which always ends with
>> '@googlegroups.com'.
>>
>> I have the following regex in my filter file:
>> group=*
>> case=0
>> score=-25
>> msgid=.*@googlegroups.com>
>
> A few years ago tin made changes to their filter logic and I
> found this works for me:
>
> group=*
> case=0
> score=kill
> msgid_last=*<*@googlegroups.com>*

Thanks for sharing.

Urs recommended to use msgid_only. What difference does msgid_last make
here? I guess some replies will be killed, too.

Re: [tin] Filtering for Message-ID

<ug3b7n$o3m$1@nntp.de>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1509&group=news.software.readers#1509

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!news.cgarbs.de!nntp.de!.POSTED.akk21-int.akk.kit.edu!not-for-mail
From: urs...@buil.tin.org (Urs Janßen)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Tue, 10 Oct 2023 11:06:31 -0000 (UTC)
Organization: tin.org
Archive: no
Message-ID: <ug3b7n$o3m$1@nntp.de>
References: <ug1fpn$1r58$2@dont-email.me> <ug2utl$10f6d$2@dont-email.me>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Tue, 10 Oct 2023 11:06:31 -0000 (UTC)
Injection-Info: nntp.de; posting-host="akk21-int.akk.kit.edu:2a00:1398:5:f602:cafe:cafe:cafe:21";
logging-data="24694"; mail-complaints-to="abuse@nntp.de"
User-Agent: tin/2.6.3-20231002 ("Pittyvaich") (Linux/5.10.0-25-amd64 (x86_64))
Cancel-Lock: sha1:jkMZi8jr7P5MLgvletkisRfwmbE=
X-No-Archive: yes
X-No-HTML: yes
 by: Urs Janßen - Tue, 10 Oct 2023 11:06 UTC

In <ug2utl$10f6d$2@dont-email.me> on Tue, 10 Oct 2023 09:36:23,
Rene Kita wrote:
> Urs recommended to use msgid_only. What difference does msgid_last make
> here? I guess some replies will be killed, too.

[tin(5)]
| ${TIN_HOMEDIR:-"$HOME"}/.tin/filter
[...]
| msgid_last
| Match against ’’Message-ID:’’ and last ’’References:’’
| entry only. The matching type used (wildmat or regex) de‐
| pends on the setting of the tinrc variable wildcard.
| | msgid_only
| Match against ’’Message-ID:’’. The matching type used
| (wildmat or regex) depends on the setting of the tinrc
| variable wildcard.

so with msgid_last direct responses would be matched too.

JFTR I've added the following to tin(5) to clarify things:

#v+
== modified file 'doc/tin.5'
--- old/doc/tin.5 2023-07-31 06:03:39 +0000
+++ new/doc/tin.5 2023-10-09 19:57:53 +0000
@@ -759 +759,3 @@
-lines beginning with a '#' are ignored.
+lines beginning with a '#' are ignored. Multiple matching-rules in the same
+scope are all applied individually and not only when they all hit at the
+same time (logical OR not AND).
@@ -879,0 +882,9 @@
+ +comment=downscore postings in nsr from google base on
+comment=message-id or path, if both hit they end up with
+comment=a score of -66 which is enough to mark them killed
+group=news.software.readers
+case=1
+score=-33
+path=\\.googlegroups\\.com!not-for-mail$
+msgid_only=@googlegroups\\.com>
#v-

Re: [tin] Filtering for Message-ID

<ug5ijp$1n5i2$1@dont-email.me>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1510&group=news.software.readers#1510

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: mai...@rkta.de (Rene Kita)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Wed, 11 Oct 2023 07:24:43 -0000 (UTC)
Organization: A noiseless patient Spider
Lines: 48
Message-ID: <ug5ijp$1n5i2$1@dont-email.me>
References: <ug1fpn$1r58$2@dont-email.me> <ug2utl$10f6d$2@dont-email.me> <ug3b7n$o3m$1@nntp.de>
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
Injection-Date: Wed, 11 Oct 2023 07:24:43 -0000 (UTC)
Injection-Info: dont-email.me; posting-host="c8c297b19092cd889acb318031dcc70f";
logging-data="1807938"; mail-complaints-to="abuse@eternal-september.org"; posting-account="U2FsdGVkX1+fnZWl3qljgtFF6pfId+c7"
User-Agent: tin/2.4.5-20201224 ("Glen Albyn") (Linux/5.10.0-24-amd64 (x86_64))
Cancel-Lock: sha1:FiPg9FNG/DzOkO2S/ImeJYGKfwI=
 by: Rene Kita - Wed, 11 Oct 2023 07:24 UTC

Urs Janßen <urs@buil.tin.org> wrote:
> In <ug2utl$10f6d$2@dont-email.me> on Tue, 10 Oct 2023 09:36:23,
> Rene Kita wrote:
>> Urs recommended to use msgid_only. What difference does msgid_last make
>> here? I guess some replies will be killed, too.
>
> [tin(5)]
> | ${TIN_HOMEDIR:-"$HOME"}/.tin/filter
> [...]
> | msgid_last
> | Match against ’’Message-ID:’’ and last ’’References:’’
> | entry only. The matching type used (wildmat or regex) de‐
> | pends on the setting of the tinrc variable wildcard.
> |
> | msgid_only
> | Match against ’’Message-ID:’’. The matching type used
> | (wildmat or regex) depends on the setting of the tinrc
> | variable wildcard.
>
> so with msgid_last direct responses would be matched too.

Maybe you could clarify this too. For me it was not obvious that 'last
References entry' means direct responses.

> JFTR I've added the following to tin(5) to clarify things:
>
> #v+
> == modified file 'doc/tin.5'
> --- old/doc/tin.5 2023-07-31 06:03:39 +0000
> +++ new/doc/tin.5 2023-10-09 19:57:53 +0000
> @@ -759 +759,3 @@
> -lines beginning with a '#' are ignored.
> +lines beginning with a '#' are ignored. Multiple matching-rules in the same
> +scope are all applied individually and not only when they all hit at the
> +same time (logical OR not AND).
> @@ -879,0 +882,9 @@
> +
> +comment=downscore postings in nsr from google base on
> +comment=message-id or path, if both hit they end up with
> +comment=a score of -66 which is enough to mark them killed
> +group=news.software.readers
> +case=1
> +score=-33
> +path=\\.googlegroups\\.com!not-for-mail$
> +msgid_only=@googlegroups\\.com>
> #v-

Thanks for this.

Re: [tin] Filtering for Message-ID

<ug5lkp$ci$1@nntp.de>

 copy mid

https://www.novabbs.com/computers/article-flat.php?id=1511&group=news.software.readers#1511

 copy link   Newsgroups: news.software.readers
Path: i2pn2.org!i2pn.org!weretis.net!feeder8.news.weretis.net!news.cgarbs.de!nntp.de!.POSTED.akk21-int.akk.kit.edu!not-for-mail
From: urs...@buil.tin.org (Urs Janßen)
Newsgroups: news.software.readers
Subject: Re: [tin] Filtering for Message-ID
Date: Wed, 11 Oct 2023 08:16:25 -0000 (UTC)
Organization: tin.org
Archive: no
Message-ID: <ug5lkp$ci$1@nntp.de>
References: <ug3b7n$o3m$1@nntp.de> <ug5ijp$1n5i2$1@dont-email.me>
Injection-Date: Wed, 11 Oct 2023 08:16:25 -0000 (UTC)
Injection-Info: nntp.de; posting-host="akk21-int.akk.kit.edu:2a00:1398:5:f602:cafe:cafe:cafe:21";
logging-data="402"; mail-complaints-to="abuse@nntp.de"
User-Agent: tin/2.6.3-20231002 ("Pittyvaich") (Linux/5.10.0-25-amd64 (x86_64))
Cancel-Lock: sha1:DLH+gjSmY0eWH3WxJYfa2c1XBOg=
X-No-Archive: yes
X-No-HTML: yes
 by: Urs Janßen - Wed, 11 Oct 2023 08:16 UTC

Rene Kita wrote:
> Maybe you could clarify this too. For me it was not obvious that 'last
> References entry' means direct responses.

I've updated it to

| msgid_last
| Match against ''Message-ID:'' and last ''References:''
| entry (direct responses) only. The matching type used
| (wildmat or regex) depends on the setting of the tinrc
| variable wildcard.

keep in mind that this is the man page about the various file-formats
used by tin and not to explain usenet (headers) in details.

1
server_pubkey.txt

rocksolid light 0.9.7
clearnet tor