GNU bug report logs - #75998
[guile-lib] html->sxml does not decode entities in attributes

Please note: This is a static page, with minimal formatting, updated once a day.
Click here to see this page with the latest information and nicer formatting.

Package: guile; Reported by: Tomas Volf <~@wolfsden.cz>; Done: Tomas Volf <~@wolfsden.cz>; Maintainer for guile is bug-guile@HIDDEN.

Message received at 75998 <at> debbugs.gnu.org:


Received: (at 75998) by debbugs.gnu.org; 15 Feb 2025 15:29:34 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Sat Feb 15 10:29:34 2025
Received: from localhost ([127.0.0.1]:57394 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1tjK6n-0004OP-TN
	for submit <at> debbugs.gnu.org; Sat, 15 Feb 2025 10:29:34 -0500
Received: from mail-pl1-x636.google.com ([2607:f8b0:4864:20::636]:52632)
 by debbugs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128)
 (Exim 4.84_2) (envelope-from <maxim.cournoyer@HIDDEN>)
 id 1tjK6l-0004O9-2N
 for 75998 <at> debbugs.gnu.org; Sat, 15 Feb 2025 10:29:31 -0500
Received: by mail-pl1-x636.google.com with SMTP id
 d9443c01a7336-220d28c215eso42505535ad.1
 for <75998 <at> debbugs.gnu.org>; Sat, 15 Feb 2025 07:29:31 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=gmail.com; s=20230601; t=1739633365; x=1740238165; darn=debbugs.gnu.org;
 h=mime-version:user-agent:message-id:date:references:in-reply-to
 :subject:cc:to:from:from:to:cc:subject:date:message-id:reply-to;
 bh=10tuQQYu1cjmsUxvKdx05pSxw0mLviyPyTqxJa+SlBk=;
 b=VqY7Mv85bp1ZSdDs0lZe+fapvoos932fbv4/OhjZ52IseJXQtFXjxJUJLN2iqP+SyD
 rjXdbxfeHiGe2PFTN+GmTmh23sMDmJVXjdLAskV+ZT26KW6TuIXxmXqOUpQW/Ewc4dD2
 0Zldi6ucFfze0Z0mk7xBAucg3TktokRVmAQTtsQVyc3pLpndbQqSjJoDDbuv/nNFXNrQ
 FHKYVywlIwyP1nPNY3WOQFXCU+hicZmgMbaraqA27eTWvvgHivhUK1cTG5MoNiZ4fEoz
 n5Uird0Bye84q3nkqjPWsYio+G5rSS7EPmkFOnkBGqwqjeBtE1Jw6KtVqZUhdUr4CBc0
 TF2A==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=1e100.net; s=20230601; t=1739633365; x=1740238165;
 h=mime-version:user-agent:message-id:date:references:in-reply-to
 :subject:cc:to:from:x-gm-message-state:from:to:cc:subject:date
 :message-id:reply-to;
 bh=10tuQQYu1cjmsUxvKdx05pSxw0mLviyPyTqxJa+SlBk=;
 b=oaAIK3J6F90J+E79/Nce7ot5Tpv9dOs50OXpol7Qi445IlyLXSPRQnhXSE73cNwVXy
 MHhqWHqQ5ClDeCC3GvEAXpURHRQYnpHlpDwYv2LEQT0FjWYXr4uEdiigsJ+pEIFO7H99
 CRsEYeeViLiIeZEiQTVWXOa94EsD+qHTudi8NT5uqNkhYGpCGBpeu24/WQsaD16LeDsA
 ZUaMUynK26nsrHE+hRlVC0av+6dPHSEzgdDGRvUaivt7BUayJblaFBTK0ZlO82olJw9z
 hITC0S9ZZQaH9BGz1gVaeY9Jv1rJ4aRFNfPLeTQXMkTnJZYDbmSD8qnbzobDQS39UKRT
 eCUw==
X-Forwarded-Encrypted: i=1;
 AJvYcCX8PBkfQMnlLBGN5nSp2lUPkVb1bdbxbz19P8tZwBkaaRwWFGUYS76Ml+y2nulmeS2013aR4A==@debbugs.gnu.org
X-Gm-Message-State: AOJu0YwrzW85YpaOLrwz54YI6JonDlrJ6sRBKEii3pGkdAn+Kc6I9y+s
 OfqdaaXAva28uK3MzqV33PmSpWdQh3J6A09P3K3o/tJKn1FPtiOV
X-Gm-Gg: ASbGncuK3fbQmqfvzcIeglY0XBzROYTa0bE8HnljMn9Ypcpt3jgEKODyGIrXmMutO3S
 XELUda7i5ZEdYZlminFcVpy60J82py0Ej0CflaQ7Lv7vOypnFG142BnSW3LDYvZZlAYZ0piWOjR
 g3YkRMivQ8Yx/Uvvi5PTTCL1Yeji52Y9i3oqplLexdaIEu/Jl7+d8GHKB1j3JXDxwg2g6LZAec3
 eivnhd6j63H/ZDR0eeeT5VlAjfJ+7vVpxyzqBc9zA18V231Zt3RBOw+4UlMmGMzbP4Vexf8YxiJ
 rY/OQgKUBThR
X-Google-Smtp-Source: AGHT+IHNCa7KOk1UWVLiaWVRfEbtP3MW/MRecQk4Hhs4IwUyKEXS/UQgjlBWZ5ASFJEMnCy7doGbzA==
X-Received: by 2002:a05:6a00:4f95:b0:730:794e:7aca with SMTP id
 d2e1a72fcca58-732618c1c26mr5964430b3a.16.1739633364844; 
 Sat, 15 Feb 2025 07:29:24 -0800 (PST)
Received: from terra ([2405:6586:be0:0:c8ff:1707:9b9:af89])
 by smtp.gmail.com with ESMTPSA id
 d2e1a72fcca58-73256fb09e1sm2964932b3a.65.2025.02.15.07.29.21
 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256);
 Sat, 15 Feb 2025 07:29:23 -0800 (PST)
From: Maxim Cournoyer <maxim.cournoyer@HIDDEN>
To: Tomas Volf <~@wolfsden.cz>
Subject: Re: bug#75998: [guile-lib] html->sxml does not decode entities in
 attributes
In-Reply-To: <874j13gwv1.fsf@HIDDEN> (Tomas Volf's message of "Sun, 09
 Feb 2025 12:50:58 +0100")
References: <87zfj58m03.fsf@HIDDEN> <Z58U6pXCY+tlHPMK@HIDDEN>
 <87seow8ya8.fsf@HIDDEN> <87ikprdrs0.fsf@HIDDEN>
 <871pwd76or.fsf@HIDDEN> <87ikpn6svv.fsf@HIDDEN>
 <20250206193458.1dc34a91@tintin> <87tt9553c9.fsf@HIDDEN>
 <874j13gwv1.fsf@HIDDEN>
Date: Sun, 16 Feb 2025 00:29:10 +0900
Message-ID: <87r03z6xbt.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13)
MIME-Version: 1.0
Content-Type: text/plain
X-Spam-Score: 0.0 (/)
X-Debbugs-Envelope-To: 75998
Cc: 75998 <at> debbugs.gnu.org, tomas@HIDDEN, David Pirotte <david@HIDDEN>
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.0 (-)

Hi Tomas,

Tomas Volf <~@wolfsden.cz> writes:

> Hi,
>
> Maxim Cournoyer <maxim.cournoyer@HIDDEN> writes:
>
>> Tomas, is it a fix you'd be interested in contributing?  Otherwise, I'll
>> get to it but my hands are rather full at the moment :-).
>
> To quote myself from the other thread:
>
>> Probably not.  I have spent 20 minutes staring into the file and do not
>> really have any idea where to start (ok, probably somewhere around
>> `scan-attr').  So I cannot really promise I will be able to work on this
>> (at least not soon), since I assume it will take me long time to figure
>> out.
>
> So I do not have any immediate plans to start working on this. :/

OK, no worries.  I'll look into it when I have a good chunk of time
ahead.

-- 
Thanks,
Maxim




Information forwarded to bug-guile@HIDDEN:
bug#75998; Package guile. Full text available.

Message received at 75998 <at> debbugs.gnu.org:


Received: (at 75998) by debbugs.gnu.org; 9 Feb 2025 11:51:09 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Sun Feb 09 06:51:09 2025
Received: from localhost ([127.0.0.1]:43419 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1th5q7-0005vd-VS
	for submit <at> debbugs.gnu.org; Sun, 09 Feb 2025 06:51:09 -0500
Received: from wolfsden.cz ([37.205.8.62]:35710)
 by debbugs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.84_2) (envelope-from <~@wolfsden.cz>) id 1th5q1-0005v1-IY
 for 75998 <at> debbugs.gnu.org; Sun, 09 Feb 2025 06:51:06 -0500
Received: by wolfsden.cz (Postfix, from userid 104)
 id A0F563251B4; Sun,  9 Feb 2025 11:50:59 +0000 (UTC)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=wolfsden.cz; s=mail;
 t=1739101859; bh=WB+gNJS+z2K7JhVoUNz/0ggRlauvs+vVk/ElLKKEvNw=;
 h=From:To:Cc:Subject:In-Reply-To:References:Date;
 b=mIP483DpVPWgB9JFLF8Smv92b4DprKrQ9M/9X+aHi2mTS9F0PZApCFU3u/Czd/n7M
 SEnkC8oLQMnjxON2xi8d862UyBrHoTXtiXJ0YKdxzHQktjpz0ufNJU7Qofvh0RQDQM
 YbczEtxBFWyNJ+5SZbKhP6n29RvJ68Z4FTU+YAb4Biga+JWSAyOSArjo/ZN5LAuywc
 qqakFrnd/Rem3JZ7RseDGQIGw+2wZT7qCE6qj/7TJgtCAQAXoFwHgPF5vbijcjPeEr
 LyDMKswe5oFotIw81CrYVN3+8dNWLkgrTWbmQxlr3mHJcXOhnTFEQ+lWjn5PhQZMp2
 lS4HMaem4eKDT9vSYwByOozLcDsL5OwuNQKl7hQuVS5enq8rlWxmwgoJ/P5dniC7pu
 +yd6gzoOtLA6eElZaN47VsN9KUY+6mQ3eUyojLK5zeC3Yc+Qho3U2gmGWRnfUfAZ2z
 srVk/AM8AqkaXlaVoYIGJZ8tswyYzDZarQ78sNMiBbTdEDsRBXgT22qKy/E3AZzSEd
 TlizJVF8j55I0MRatyg6Wq3nweXgeF44X01mY47V2GlQDJ1JO93ddkBSGYCdSyyz8y
 PJMpmtTKQG/ggiKke4vf3jp9KiABS+a4ISu0yV1ZYRcj5T6GXvseJElNNbUJyb0wri
 c5QuWULwawOOMiJ3laZnFkjc=
X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on wolfsden
X-Spam-Level: 
X-Spam-Status: No, score=-3.1 required=5.0 tests=ALL_TRUSTED,BAYES_00,
 DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,URIBL_BLOCKED
 autolearn=ham autolearn_force=no version=3.4.6
Received: from localhost (unknown [128.0.188.242])
 by wolfsden.cz (Postfix) with ESMTPSA id 4B789324D34;
 Sun,  9 Feb 2025 11:50:59 +0000 (UTC)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=wolfsden.cz; s=mail;
 t=1739101859; bh=WB+gNJS+z2K7JhVoUNz/0ggRlauvs+vVk/ElLKKEvNw=;
 h=From:To:Cc:Subject:In-Reply-To:References:Date;
 b=mIP483DpVPWgB9JFLF8Smv92b4DprKrQ9M/9X+aHi2mTS9F0PZApCFU3u/Czd/n7M
 SEnkC8oLQMnjxON2xi8d862UyBrHoTXtiXJ0YKdxzHQktjpz0ufNJU7Qofvh0RQDQM
 YbczEtxBFWyNJ+5SZbKhP6n29RvJ68Z4FTU+YAb4Biga+JWSAyOSArjo/ZN5LAuywc
 qqakFrnd/Rem3JZ7RseDGQIGw+2wZT7qCE6qj/7TJgtCAQAXoFwHgPF5vbijcjPeEr
 LyDMKswe5oFotIw81CrYVN3+8dNWLkgrTWbmQxlr3mHJcXOhnTFEQ+lWjn5PhQZMp2
 lS4HMaem4eKDT9vSYwByOozLcDsL5OwuNQKl7hQuVS5enq8rlWxmwgoJ/P5dniC7pu
 +yd6gzoOtLA6eElZaN47VsN9KUY+6mQ3eUyojLK5zeC3Yc+Qho3U2gmGWRnfUfAZ2z
 srVk/AM8AqkaXlaVoYIGJZ8tswyYzDZarQ78sNMiBbTdEDsRBXgT22qKy/E3AZzSEd
 TlizJVF8j55I0MRatyg6Wq3nweXgeF44X01mY47V2GlQDJ1JO93ddkBSGYCdSyyz8y
 PJMpmtTKQG/ggiKke4vf3jp9KiABS+a4ISu0yV1ZYRcj5T6GXvseJElNNbUJyb0wri
 c5QuWULwawOOMiJ3laZnFkjc=
From: Tomas Volf <~@wolfsden.cz>
To: Maxim Cournoyer <maxim.cournoyer@HIDDEN>
Subject: Re: bug#75998: [guile-lib] html->sxml does not decode entities in
 attributes
In-Reply-To: <87tt9553c9.fsf@HIDDEN> (Maxim Cournoyer's message of "Fri, 07
 Feb 2025 21:47:18 +0900")
References: <87zfj58m03.fsf@HIDDEN> <Z58U6pXCY+tlHPMK@HIDDEN>
 <87seow8ya8.fsf@HIDDEN> <87ikprdrs0.fsf@HIDDEN>
 <871pwd76or.fsf@HIDDEN> <87ikpn6svv.fsf@HIDDEN>
 <20250206193458.1dc34a91@tintin> <87tt9553c9.fsf@HIDDEN>
Date: Sun, 09 Feb 2025 12:50:58 +0100
Message-ID: <874j13gwv1.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13)
MIME-Version: 1.0
Content-Type: multipart/signed; boundary="=-=-=";
 micalg=pgp-sha512; protocol="application/pgp-signature"
X-Spam-Score: 0.0 (/)
X-Debbugs-Envelope-To: 75998
Cc: 75998 <at> debbugs.gnu.org, tomas@HIDDEN, David Pirotte <david@HIDDEN>
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.0 (-)

--=-=-=
Content-Type: text/plain
Content-Transfer-Encoding: quoted-printable


Hi,

Maxim Cournoyer <maxim.cournoyer@HIDDEN> writes:

> Tomas, is it a fix you'd be interested in contributing?  Otherwise, I'll
> get to it but my hands are rather full at the moment :-).

To quote myself from the other thread:

> Probably not.  I have spent 20 minutes staring into the file and do not
> really have any idea where to start (ok, probably somewhere around
> `scan-attr').  So I cannot really promise I will be able to work on this
> (at least not soon), since I assume it will take me long time to figure
> out.

So I do not have any immediate plans to start working on this. :/

Tomas

=2D-=20
There are only two hard things in Computer Science:
cache invalidation, naming things and off-by-one errors.

--=-=-=
Content-Type: application/pgp-signature; name="signature.asc"

-----BEGIN PGP SIGNATURE-----

iQJCBAEBCgAsFiEEt4NJs4wUfTYpiGikL7/ufbZ/wakFAmeolqIOHH5Ad29sZnNk
ZW4uY3oACgkQL7/ufbZ/wamoWg//ZTTdp0St4gS3tW0bHNkBmrpxZhXGo9VIyhLt
rp9P2UylIuVYttoGUuzdEozG/RFDkdmhwhKj3r6vzDtKNwp42w0iDHPmgQGBA1vQ
0JEMZCIc0xtLPlc+UI+vCm9C0lXBAngJO1EQv7iIszbjTehd41p1340IAYsB1ttK
cyxYFX5g6GYEyRPYQYiSyWOa2eaXCqW+XbDSqHimsczT6CIZai32ISSAnTKGZp13
BUAL8nMTiMPrhCHI7wtdd0f/LYy1CZRK4BDgwJdqvchzTj+wQZu5VlyGYU5l0NLA
umrbEaeK8oOZ2SjY5S1z151JXCWg8cpsUc0mIUaID5cTRbnQuZOA2XBMggxeUGuO
SNfsDNhWPYetzQ2TBIi+nXJDRDot1RvBoFEDqQyfAedtCFGl39nBLN0qYgUIgiKO
T86GTozWLz6Cc3Zb/Gp3/MaOgAOm4LLgayzQt6mM6hCPCZeNteWHpWCtApjXxrxl
R985Pm9+/fvMbofukyMeJObq0oJkpyrdYXYBHNwwcpkQWwYUjj/mDBsrw5/sD105
uW0bDLWoKXc6YcIe9axzquHeChZBjFDajGifwlysObhicjyky1iJG0Nu+Mt8jVp9
wgBpl8HxWWxwRwY352ha8wTVYhv90+GqBVVUuPXK1O5HUOv0rZpo9bG8HhbCZmsM
k+sW9yE=
=UWF6
-----END PGP SIGNATURE-----
--=-=-=--




Information forwarded to bug-guile@HIDDEN:
bug#75998; Package guile. Full text available.

Message received at 75998 <at> debbugs.gnu.org:


Received: (at 75998) by debbugs.gnu.org; 7 Feb 2025 12:47:41 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Fri Feb 07 07:47:41 2025
Received: from localhost ([127.0.0.1]:33327 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1tgNlk-0001hK-Vf
	for submit <at> debbugs.gnu.org; Fri, 07 Feb 2025 07:47:41 -0500
Received: from mail-pl1-x62a.google.com ([2607:f8b0:4864:20::62a]:59747)
 by debbugs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128)
 (Exim 4.84_2) (envelope-from <maxim.cournoyer@HIDDEN>)
 id 1tgNlh-0001h3-Qi
 for 75998 <at> debbugs.gnu.org; Fri, 07 Feb 2025 07:47:38 -0500
Received: by mail-pl1-x62a.google.com with SMTP id
 d9443c01a7336-2166651f752so44305105ad.3
 for <75998 <at> debbugs.gnu.org>; Fri, 07 Feb 2025 04:47:37 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=gmail.com; s=20230601; t=1738932451; x=1739537251; darn=debbugs.gnu.org;
 h=mime-version:user-agent:message-id:date:references:in-reply-to
 :subject:cc:to:from:from:to:cc:subject:date:message-id:reply-to;
 bh=Gl/daOcOfB24F9NUxAhbGSRsfMn3MTWr9dpJUnxh41E=;
 b=XcMmLa32rq/w+FNgNchgtj6Lhio9Y/R6NbN2dTnstmXTp47J+ASrFn7A9eHnoAfk4+
 oEYRiCowSW8oOlahrb7/RSqY+Pc9LVhxRxlvi01fa7Y79/3gWuaQjZ/e8hMpYc/ul4HC
 iVBCQQ6eZAamlU+fLqfEQ4GJgU4/a/cfrr75+bM8jlY75tKRiFKeR3noq7jixs1f+/Yo
 hvuAwqYyb3q54likZUpXpTuKVJLtHJRjDA2/qSGYnETfXdmhuOF7PW5z8jXHH5iG9QcU
 ssu3TdkA3Di1Edwe0o6uTv6SWyAnL6Jm3bHXq4AR4q9UnYJ1HcW+SL8LozycSn+bwHDv
 BwvA==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=1e100.net; s=20230601; t=1738932451; x=1739537251;
 h=mime-version:user-agent:message-id:date:references:in-reply-to
 :subject:cc:to:from:x-gm-message-state:from:to:cc:subject:date
 :message-id:reply-to;
 bh=Gl/daOcOfB24F9NUxAhbGSRsfMn3MTWr9dpJUnxh41E=;
 b=v2/qWA4AJFhT+Bb9LebtO77TdLQXlgc/XzOHa4q7DvLkG6wBOQ2iFjNjirbcf0h028
 vq+UeIEB1xvnihieCkqQBln/mh9C8UxWS8iaCJOudIFFZbkndym3vFJhGNrSLqe1T0Wm
 05JkfZFKSCae86/1EzRaVYnawXRT3vSHotYYTXU3/kn59InOEuDa+hFoNgWG7C0Pc151
 Fp7zzG4IX6FSYLGosIZZpQcKUejIFdJISCw3i/mKgkoRiGvwDxL5xRYsc9h13j7Okqka
 Z25SQfMWCT6/wxRqpwM19+z4mctiXt7tQEtpIxpJAvQxPWqDekVyDohgpRl1cvIdIcGx
 aUPw==
X-Forwarded-Encrypted: i=1;
 AJvYcCUs1TBngexcZTbZCjcvx9C8avr+TZYIUqn4KKIoaYA0wuw8XFDychDlcKqT00dquzn4/8mCfQ==@debbugs.gnu.org
X-Gm-Message-State: AOJu0YxiHtENv8N2b0j86xZdu6wjxy4ptYWgbVyx14k5D6pqQoz7GSIv
 pCRtw2FM+o+F0QcxX0BlD/aOn5w2dUCdkL6f29OuniPm2BmGUUjnrXmBn/ZX
X-Gm-Gg: ASbGncuo8woXw7FpLM/Fm46XIKu4s9wShDsvlbhyn9ySjudG7S6WxIf9OKX9PnWn5WL
 mWvFpd0bUfJlL7abKLU1Nb9OYjPr02dsCiy1KbFcRcU0jMJOwJ9PeJmTGR36fQyNHJy9Y7BUnht
 IbQHpT90ItdP7FodZGULYiwnX2AdytS/34vWDwDpFkRKzpnxCDwhzikFrI9m/Vv1CMpFzl/UiFp
 i4voSSzeD187az55VLbVFe7sCj/BXxZk7p2pLuWwnDTaYLfPib596nw/97/zLw4aPhvmScADQhW
 3oxtj/sjxnHR
X-Google-Smtp-Source: AGHT+IHi4SsZKaMDixU/YmaHXZatH/1o38NVwEOOUbWRIwRSGtG9VeyLRRINU1LZk4Ns90IT8sFnYA==
X-Received: by 2002:a05:6a20:394e:b0:1e5:f930:c710 with SMTP id
 adf61e73a8af0-1ee03a966d8mr5305160637.20.1738932451594; 
 Fri, 07 Feb 2025 04:47:31 -0800 (PST)
Received: from terra ([2405:6586:be0:0:c8ff:1707:9b9:af89])
 by smtp.gmail.com with ESMTPSA id
 d2e1a72fcca58-73048a9d4b4sm3002312b3a.19.2025.02.07.04.47.29
 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256);
 Fri, 07 Feb 2025 04:47:30 -0800 (PST)
From: Maxim Cournoyer <maxim.cournoyer@HIDDEN>
To: David Pirotte <david@HIDDEN>
Subject: Re: bug#75998: [guile-lib] html->sxml does not decode entities in
 attributes
In-Reply-To: <20250206193458.1dc34a91@tintin> (David Pirotte's message of
 "Thu, 6 Feb 2025 19:34:58 -0300")
References: <87zfj58m03.fsf@HIDDEN> <Z58U6pXCY+tlHPMK@HIDDEN>
 <87seow8ya8.fsf@HIDDEN> <87ikprdrs0.fsf@HIDDEN>
 <871pwd76or.fsf@HIDDEN> <87ikpn6svv.fsf@HIDDEN>
 <20250206193458.1dc34a91@tintin>
Date: Fri, 07 Feb 2025 21:47:18 +0900
Message-ID: <87tt9553c9.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13)
MIME-Version: 1.0
Content-Type: text/plain
X-Spam-Score: 0.0 (/)
X-Debbugs-Envelope-To: 75998
Cc: 75998 <at> debbugs.gnu.org, Tomas Volf <~@wolfsden.cz>, tomas@HIDDEN
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.0 (-)

Hi,

David Pirotte <david@HIDDEN> writes:

> Hi Maxim,
> Thomas,
>
>> But this here could well be considered a bug.  So perhaps something
>> we could do is fix this correctly, and bump at least the minor digit
>> in our version (we're still in an unstable 0 version (last one was
>> 0.2.8.1), so technically we don't promise stability yet (perhaps
>> never, as this guile-lib project aims to be a lab for components that
>> could later be included in Guile).  But we should communicate this
>> change well in the NEWS file.
>
> 1+ for
>
> 	a proper fix
> 	bump the version to 0.3.0
> 	well written NEWS entry(ies)
> 		clearly state that the htmlprag module was fixed, in a
> 		way that users who locally applied their own work
> 		around to the fixed problem/bug will have to review
> 		their code and adpat to this new version ...

Thanks for weighing in.

Tomas, is it a fix you'd be interested in contributing?  Otherwise, I'll
get to it but my hands are rather full at the moment :-).

-- 
Thanks,
Maxim




Information forwarded to bug-guile@HIDDEN:
bug#75998; Package guile. Full text available.

Message received at 75998 <at> debbugs.gnu.org:


Received: (at 75998) by debbugs.gnu.org; 6 Feb 2025 22:35:13 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Thu Feb 06 17:35:13 2025
Received: from localhost ([127.0.0.1]:59721 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1tgASn-00024P-0u
	for submit <at> debbugs.gnu.org; Thu, 06 Feb 2025 17:35:13 -0500
Received: from moses.all2all.org ([79.99.200.14]:48102)
 by debbugs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.84_2) (envelope-from <david@HIDDEN>) id 1tgASk-00022h-Ui
 for 75998 <at> debbugs.gnu.org; Thu, 06 Feb 2025 17:35:12 -0500
Received: from localhost (localhost [127.0.0.1])
 by moses.all2all.org (Postfix) with ESMTP id 0C1BA67C009B;
 Thu,  6 Feb 2025 23:35:07 +0100 (CET)
X-Virus-Scanned: Debian amavisd-new at moses.all2all.org
Received: from moses.all2all.org ([127.0.0.1])
 by localhost (moses.all2all.org [127.0.0.1]) (amavisd-new, port 10024)
 with ESMTP id RKbYS4QGbTLH; Thu,  6 Feb 2025 23:35:06 +0100 (CET)
Received: from tintin (unknown [168.227.184.188])
 by moses.all2all.org (Postfix) with ESMTPSA id 6F14E67C0091;
 Thu,  6 Feb 2025 23:35:05 +0100 (CET)
Date: Thu, 6 Feb 2025 19:34:58 -0300
From: David Pirotte <david@HIDDEN>
To: Maxim Cournoyer <maxim.cournoyer@HIDDEN>
Subject: Re: bug#75998: [guile-lib] html->sxml does not decode entities in
 attributes
Message-ID: <20250206193458.1dc34a91@tintin>
In-Reply-To: <87ikpn6svv.fsf@HIDDEN>
References: <87zfj58m03.fsf@HIDDEN> <Z58U6pXCY+tlHPMK@HIDDEN>
 <87seow8ya8.fsf@HIDDEN> <87ikprdrs0.fsf@HIDDEN>
 <871pwd76or.fsf@HIDDEN> <87ikpn6svv.fsf@HIDDEN>
X-Mailer: Claws Mail 4.3.0 (GTK 3.24.43; x86_64-pc-linux-gnu)
MIME-Version: 1.0
Content-Type: multipart/signed; boundary="Sig_/c7RIzrVi4oDdqkah+62=T/C";
 protocol="application/pgp-signature"; micalg=pgp-sha512
X-Spam-Score: -0.7 (/)
X-Debbugs-Envelope-To: 75998
Cc: 75998 <at> debbugs.gnu.org, Tomas Volf <~@wolfsden.cz>, tomas@HIDDEN
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.7 (-)

--Sig_/c7RIzrVi4oDdqkah+62=T/C
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: quoted-printable

Hi Maxim,
Thomas,

> But this here could well be considered a bug.  So perhaps something
> we could do is fix this correctly, and bump at least the minor digit
> in our version (we're still in an unstable 0 version (last one was
> 0.2.8.1), so technically we don't promise stability yet (perhaps
> never, as this guile-lib project aims to be a lab for components that
> could later be included in Guile).  But we should communicate this
> change well in the NEWS file.

1+ for

	a proper fix
	bump the version to 0.3.0
	well written NEWS entry(ies)
		clearly state that the htmlprag module was fixed, in a
		way that users who locally applied their own work
		around to the fixed problem/bug will have to review
		their code and adpat to this new version ...

David

--Sig_/c7RIzrVi4oDdqkah+62=T/C
Content-Type: application/pgp-signature
Content-Description: OpenPGP digital signature

-----BEGIN PGP SIGNATURE-----

iQEzBAEBCgAdFiEEhCJlRZtBM3furJHe83T9k6MFetcFAmelORIACgkQ83T9k6MF
etdXVgf/VMr8JKgRplOH2bz31BLUhxwoPvq+l/qtMCkMk0Bwo1dEom8cRJuVFMAc
wN0kPq8sQ3pzI+lorIh+18t79fW5VDePXtk6wmDVJ9D/zdA5UQBBLPZ39Kij418Z
uoPgTktp0scgMC8JJpy/b5fSpSIS+X852kTZxrAeTBJDt49OX/NXd8vJ4cWD/A3k
LTr189j5X2H7oAJxINe+Fab0ee376kobzZ5wzHGAkIGG/ZFo6akeLpeJIdcAKDID
zBv0rU/8Oq16pW18SPBesv/pbPLVnzBH8nKi8xbOzO6slFUXyqxMo/4P1HgxxS+j
gqkv8EX5rjdwSmiEgJ6NL7eTI4ZgGg==
=BbiG
-----END PGP SIGNATURE-----

--Sig_/c7RIzrVi4oDdqkah+62=T/C--




Information forwarded to bug-guile@HIDDEN:
bug#75998; Package guile. Full text available.

Message received at 75998 <at> debbugs.gnu.org:


Received: (at 75998) by debbugs.gnu.org; 6 Feb 2025 14:38:17 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Thu Feb 06 09:38:17 2025
Received: from localhost ([127.0.0.1]:55080 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1tg31E-00060F-Qs
	for submit <at> debbugs.gnu.org; Thu, 06 Feb 2025 09:38:17 -0500
Received: from mail-pl1-x632.google.com ([2607:f8b0:4864:20::632]:49326)
 by debbugs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128)
 (Exim 4.84_2) (envelope-from <maxim.cournoyer@HIDDEN>)
 id 1tg31C-0005zx-MM
 for 75998 <at> debbugs.gnu.org; Thu, 06 Feb 2025 09:38:15 -0500
Received: by mail-pl1-x632.google.com with SMTP id
 d9443c01a7336-21634338cfdso23441615ad.2
 for <75998 <at> debbugs.gnu.org>; Thu, 06 Feb 2025 06:38:14 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=gmail.com; s=20230601; t=1738852688; x=1739457488; darn=debbugs.gnu.org;
 h=mime-version:user-agent:message-id:date:references:in-reply-to
 :subject:cc:to:from:from:to:cc:subject:date:message-id:reply-to;
 bh=rehPyN2pm6zU3djZUHZSdoYAioXu1ht9yFVTY7FGRec=;
 b=bVMMLsLKuF0ZaalKVrFneurBPAAiQcih45aI3tfCS1x0NJCzm3hkCnNIIAYxRjf4iB
 J5ItBYb03iorf9hoRBkj1Vk4l7aEFDAugFsvxAKYsXsAAe1xqYczY5ZIJSnQXSw/CTQc
 lO3CMsiDKkDMt2eO1xcmiDaWd4nEBltNUR9YoGLq5+twppYg94mdiLYfMKd6LC2o7Fsc
 WxBokf0mKKY0hmutt0YZHDGtHuAH5+YRnLtuYCP6WeMoXnHPBsD94PAz1yKK1NKjojXO
 5+plWP73YauPxRqSNzUJnl6mKlBwSdxuwoQ7LupKcu9XuPbEizy2libJ3G+uILqGpE08
 r6lg==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=1e100.net; s=20230601; t=1738852688; x=1739457488;
 h=mime-version:user-agent:message-id:date:references:in-reply-to
 :subject:cc:to:from:x-gm-message-state:from:to:cc:subject:date
 :message-id:reply-to;
 bh=rehPyN2pm6zU3djZUHZSdoYAioXu1ht9yFVTY7FGRec=;
 b=eFeaaUBVzqqb36C8OmlDOh4/MZxucmntaUNmUPWNfCzw+JnV1zZzd0Sn/tUb5YOP3s
 MNY4w5bJSXJ5XsNWbECkIJtW2hTcbS1EmXpU94Y6Cpf+v1dKeyBmVt1pbLK1QYl9g7kF
 7jAIth8Hf8Aig2SW6SAek506KndLcylQizhfPhFGhL90j2FDqwN56ghWGKazt5rTJxYi
 SEUGl5qYUMzJMrZeZmZ+MNCMebZXbeIkBj3RCCmS+gEsVtFuLlM66et/+Qpf7RVPYDqD
 egI3FoP9eZCRvyCB3HdQznef+3s7N+qRRA3nrCm8agkPXpMEpnc1GVrIcUo7nm8f8qeq
 krmw==
X-Forwarded-Encrypted: i=1;
 AJvYcCXDVKFN6RFfv/dWLgjja5DlcKJE3QrUU6zdpv105y6Clt65lcbRsuqVhHn82FQKj/DvXavvfw==@debbugs.gnu.org
X-Gm-Message-State: AOJu0Yw9c/9VCr7rUYt0i6hPQ0teFqDUyR/r9mkvZvLyGDAZRWUixw/6
 dbxtL3qfHgeDMqSpzpNhhbavSV45m+xycSYK3LY8UjG+YMroXd72QUdIKKEj
X-Gm-Gg: ASbGncviPWhGuRJfbScnAawlTCGwghs1XCERe/BnhoqTlXX3v5XN9bWnzTw7LEGkuZ8
 m1puOvMWdsDmhutwqqI/t/cKZJ/b1x2ViRN7jIxNIPJey0i+AwI9TRoCiFAF+H25LekWGvkBaZ0
 r77TbkkGk2OqEDRT+UDsCI7Zlaf7lOxwdOmH0UOH0nN7qYWmdgqYSVyzayZU8a7tbLpEduOzrby
 rl3eNaf1OAbJGK01/pp8Sggb548RNPjQeD9waip72k7jT8MWHalH2mGTx7jICqhdTGfyaGcnT1e
 z5R1NBLJHBdO
X-Google-Smtp-Source: AGHT+IFNumWDPGyyZZr5BOXL7YZD8SA3fqJXFPoNAkf0TwmmUq7658zkm8rQeBt72BJ3MASTgJuK/g==
X-Received: by 2002:a05:6a21:648c:b0:1e1:d26:6657 with SMTP id
 adf61e73a8af0-1ede88b887emr11467696637.42.1738852687817; 
 Thu, 06 Feb 2025 06:38:07 -0800 (PST)
Received: from terra ([2405:6586:be0:0:c8ff:1707:9b9:af89])
 by smtp.gmail.com with ESMTPSA id
 d2e1a72fcca58-73048e2046fsm1399157b3a.173.2025.02.06.06.38.05
 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256);
 Thu, 06 Feb 2025 06:38:07 -0800 (PST)
From: Maxim Cournoyer <maxim.cournoyer@HIDDEN>
To: Tomas Volf <~@wolfsden.cz>
Subject: Re: bug#75998: [guile-lib] html->sxml does not decode entities in
 attributes
In-Reply-To: <871pwd76or.fsf@HIDDEN> (Tomas Volf's message of "Tue, 04
 Feb 2025 22:15:16 +0100")
References: <87zfj58m03.fsf@HIDDEN> <Z58U6pXCY+tlHPMK@HIDDEN>
 <87seow8ya8.fsf@HIDDEN> <87ikprdrs0.fsf@HIDDEN>
 <871pwd76or.fsf@HIDDEN>
Date: Thu, 06 Feb 2025 23:37:56 +0900
Message-ID: <87ikpn6svv.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13)
MIME-Version: 1.0
Content-Type: text/plain
X-Spam-Score: 0.0 (/)
X-Debbugs-Envelope-To: 75998
Cc: 75998 <at> debbugs.gnu.org, tomas@HIDDEN
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.0 (-)

Hi Tomas,

[...]

> It already modifies the raw value for regular HTML text:
>
> scheme@(htmlprag)> (html->sxml "a&amp;b")
> $10 = (*TOP* "a&b")
> scheme@(htmlprag)> (sxml->html '(*TOP* "a&b"))
> $13 = "a&amp;b"
>
>
> I now noticed this also affect encoding:
>
> scheme@(htmlprag)> (sxml->html '(*TOP* (a (@ (href "a&b")))))
> $12 = "<a href=\"a&b\"></a>"
>
>
> I am not sure why attributes should be special here.
>
> For what it is worth, (sxml simple) itself decodes even attributes:
>
> scheme@(htmlprag)> (xml->sxml "<a href=\"a&amp;b\"></a>")
> $11 = (*TOP* (a (@ (href "a&b"))))
>
> For comparison, Firefox seems to decode the attributes as well even in
> HTML.  That is actually how I discovered this issue, links I extracted
> from <a href=".."> using html->sxml were not working until I ran a
> decoding pass on them.

Good points.  Thanks for these.

>> Users may haev different use cases requiring to apply different
>> transformation themselves?
>
> I agree in the abstract, but do you have any specific use case in mind
> when you would want to use the raw content of attributes (especially
> since you already cannot get raw content of text nodes).

>> If we hard-code a decoding scheme ourselves, then force that choice
>> onto users, no?
>
> I agree we cannot hard-code or change it now due to compatibility
> concerns, but adding #:decode-attributes to html->sxml,
> #:encode-attributes to sxml->html and possibly %deencode-attributes?
> parameter, in the spirit of %strict-tokenizer? would seem reasonable.

I see this situation and %strict-tokenizer as a bit different; the
htmlprag module was designed to be lenient, so being lenient could not
really be considered a bug :-).  But this here could well be considered
a bug.  So perhaps something we could do is fix this correctly, and bump
at least the minor digit in our version (we're still in an unstable 0
version (last one was 0.2.8.1), so technically we don't promise
stability yet (perhaps never, as this guile-lib project aims to be a lab
for components that could later be included in Guile).  But we should
communicate this change well in the NEWS file.

-- 
Thanks,
Maxim




Information forwarded to bug-guile@HIDDEN:
bug#75998; Package guile. Full text available.

Message received at 75998 <at> debbugs.gnu.org:


Received: (at 75998) by debbugs.gnu.org; 4 Feb 2025 21:15:22 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Tue Feb 04 16:15:22 2025
Received: from localhost ([127.0.0.1]:46932 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1tfQGQ-0002R4-4l
	for submit <at> debbugs.gnu.org; Tue, 04 Feb 2025 16:15:22 -0500
Received: from wolfsden.cz ([37.205.8.62]:34214)
 by debbugs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.84_2) (envelope-from <~@wolfsden.cz>) id 1tfQGN-0002Qn-HM
 for 75998 <at> debbugs.gnu.org; Tue, 04 Feb 2025 16:15:21 -0500
Received: by wolfsden.cz (Postfix, from userid 104)
 id 64CC31EA0CC; Tue,  4 Feb 2025 21:15:17 +0000 (UTC)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=wolfsden.cz; s=mail;
 t=1738703717; bh=71b8ZHq6CmKJontf8iDumUkCtqtI3jEIqx/iWo5yJq0=;
 h=From:To:Cc:Subject:In-Reply-To:References:Date;
 b=V/UH0SOlrXe1ByodvuV5k1eucPegfPmnI5ViCR4fCOggDK+hYkhQJ67UE63uf2GgL
 1e9ru1BHnOvn5xp3pLXxEiXqioM77h5aqbmdoseAr5m4ai7GHgx+xUCK6GpijHmRSg
 CnlHRt3XQBhIsapwDtYPH0gG47W6gmxgLZxnT+IuEJoufmezo8q+fRYkSPhpJi92+L
 D7HCcZLKENmg9wilxiJ9m5EaPWobuXi/rsTMlnRVKSPumZhA1RRGApTOZJdOe4pRZ+
 ZUJtNBv+sLBNnHOnVjl+U5UMuwxDDI9RO0z3xcM4m4z/mWewvh3TBqxzsP4+lxlv2u
 SQ+7WwyQJtSkeR4j9fZ5Vg+mC2L3KbRaWzGeHJW2W3KhxEfX40RQ57zI/X0xYJ/vIF
 3rNUgrbtbbj+YBjfFRifNrNUCGjS5M+vHBuILm47+Az3hqNKRT9eDQsI6uBErf2wMi
 zSGzCnvX9qJyp3/pTvQpEYmQdZa/jFzUWaQfmbhKH1A5VDSDUFQA84+ES9kX57lofC
 ZXuDtCKx93qfwjMRvG9sQFsgx68An5GThsY4k166HgSZUpRx1XQWy22ZiAbA7M0WPK
 vvqVKF5OOAh0rov51nE4WP8mws2ZkOoLaT4cLDjYxZlYBSk2XAQh02bMCpvQs8BCR7
 wyYMpI7aTAOf4+PJEEq48mLU=
X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on wolfsden
X-Spam-Level: 
X-Spam-Status: No, score=-3.1 required=5.0 tests=ALL_TRUSTED,BAYES_00,
 DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,URIBL_BLOCKED
 autolearn=ham autolearn_force=no version=3.4.6
Received: from localhost (unknown [128.0.188.242])
 by wolfsden.cz (Postfix) with ESMTPSA id 816B81E9D37;
 Tue,  4 Feb 2025 21:15:16 +0000 (UTC)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=wolfsden.cz; s=mail;
 t=1738703716; bh=71b8ZHq6CmKJontf8iDumUkCtqtI3jEIqx/iWo5yJq0=;
 h=From:To:Cc:Subject:In-Reply-To:References:Date;
 b=GhN7owzFEmDt+xVxbLekJfChFHZi7kG5RdNdBWSX2OZtSYnYXaNnE/y/KPr4bJGWL
 qznWHfLR0+mXAHbkshjkKBiG9BB4OvajlHU4QeLySuwDGilBCNWsHSUxmI+0qoaxZw
 XuLYzjhObTBFqWtd3bLZrUCt5evCcZ+6UNuOb9A97hzPYj8ajpdZzne8faNH0gKXF/
 mg9PH+Fv05Uu2u21VPedmhSFWM3VO4aEBDWnYm1YdUzViTucJiMrnD4ajQqHAwfnVf
 fdWrGVVG2GWhaPmZFF5ODDtrF8P5A/q6xJqCXKEnBH31oKM2DDsAEt9EtOj/k+FYIE
 QDOebFr6BDPZYDv9wPKfPl9aD35hAjrsh4zfKCa6eYtBp22isAy4LP2eCCQRPskada
 bRVyVLicrmYBS25OKGkdq19fGwixT5N0TghU7GiRuwT5a0DWzLaIUEXI7+lCdYRAoy
 PXyMDtFNOyCASnVB3OoNZ/rHoLB6SbGsGg7mudDeFRhBjXcUACM98HmxrTLUWyozyZ
 8Dm9Tn1+2Zwnz00DwIl8FbTZvxjiChHr2/G1SWR6RYZxegE0G2xGrbIfDWmZR/ls1O
 o9eNme+DWvWkpKqzSmDZUhuBfA+YRqIDbOXL/qZghhihxk3z386aaxoeUyKdCzPDwd
 D/YX1YagP3umKjzBgUi7DkWY=
From: Tomas Volf <~@wolfsden.cz>
To: Maxim Cournoyer <maxim.cournoyer@HIDDEN>
Subject: Re: bug#75998: [guile-lib] html->sxml does not decode entities in
 attributes
In-Reply-To: <87ikprdrs0.fsf@HIDDEN> (Maxim Cournoyer's message of "Mon, 03
 Feb 2025 23:30:55 +0900")
References: <87zfj58m03.fsf@HIDDEN> <Z58U6pXCY+tlHPMK@HIDDEN>
 <87seow8ya8.fsf@HIDDEN> <87ikprdrs0.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13)
Date: Tue, 04 Feb 2025 22:15:16 +0100
Message-ID: <871pwd76or.fsf@HIDDEN>
MIME-Version: 1.0
Content-Type: multipart/signed; boundary="=-=-=";
 micalg=pgp-sha512; protocol="application/pgp-signature"
X-Spam-Score: 0.0 (/)
X-Debbugs-Envelope-To: 75998
Cc: 75998 <at> debbugs.gnu.org, tomas@HIDDEN
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.0 (-)

--=-=-=
Content-Type: text/plain
Content-Transfer-Encoding: quoted-printable

Maxim Cournoyer <maxim.cournoyer@HIDDEN> writes:

> Hi Tomas,
>
> Thank you for reporting this issue.
>
> Tomas Volf <~@wolfsden.cz> writes:
>
>> <tomas@HIDDEN> writes:
>>
>>> On Sat, Feb 01, 2025 at 09:10:04PM +0100, Tomas Volf wrote:
>>>>=20
>>>> Hello,
>>>>=20
>>>> I think I found a bug in the htmlprag module in guile-lib.  When parsi=
ng
>>>> attributes, the values are not properly decoded:
>>>>=20
>>>> --8<---------------cut here---------------start------------->8---
>>>> scheme@(guile-user)> ,use (htmlprag)
>>>> scheme@(guile-user)> (html->sxml "<hr aaa=3D\"bbb&quot;ccc'ddd\" />")
>>>> $1 =3D (*TOP* (hr (@ (aaa "bbb&quot;ccc'ddd"))))
>>>> scheme@(guile-user)> (html->sxml "<a href=3D\"a&amp;b\" />")
>>>> $2 =3D (*TOP* (a (@ (href "a&amp;b"))))
>>>> --8<---------------cut here---------------end--------------->8---
>>>>=20
>>>> I think that $1 should be "bbb\"ccc'ddd" and $2 should be "a&b".
>>>
>>> Ouch. Have you contacted Oleg Kiselyov about it? He's usually pretty
>>> responsive and very friendly.
>>
>> I did not.  I did not find a "how to report bugs" section on guile-lib's
>> website, and on the (htmlprag) documentation section Oleg Kiselyov is
>> mentioned only in one sentence as a "Thanks".
>>
>> I think I have managed to find his email in one Haskell paper of his, so
>> I will CC him on the bug report, as suggested.
>
> And also for containing Oleg.  I hope they can provide us with their
> opinion on whether this is an actual bug or was designed that way.  To
> me, it's not clear whether html->sxml should alterate the raw value of
> attributes in any way.

It already modifies the raw value for regular HTML text:

=2D-8<---------------cut here---------------start------------->8---
scheme@(htmlprag)> (html->sxml "a&amp;b")
$10 =3D (*TOP* "a&b")
scheme@(htmlprag)> (sxml->html '(*TOP* "a&b"))
$13 =3D "a&amp;b"
=2D-8<---------------cut here---------------end--------------->8---

I now noticed this also affect encoding:

=2D-8<---------------cut here---------------start------------->8---
scheme@(htmlprag)> (sxml->html '(*TOP* (a (@ (href "a&b")))))
$12 =3D "<a href=3D\"a&b\"></a>"
=2D-8<---------------cut here---------------end--------------->8---

I am not sure why attributes should be special here.

For what it is worth, (sxml simple) itself decodes even attributes:

=2D-8<---------------cut here---------------start------------->8---
scheme@(htmlprag)> (xml->sxml "<a href=3D\"a&amp;b\"></a>")
$11 =3D (*TOP* (a (@ (href "a&b"))))
=2D-8<---------------cut here---------------end--------------->8---

For comparison, Firefox seems to decode the attributes as well even in
HTML.  That is actually how I discovered this issue, links I extracted
from <a href=3D".."> using html->sxml were not working until I ran a
decoding pass on them.

> Users may haev different use cases requiring to apply different
> transformation themselves?

I agree in the abstract, but do you have any specific use case in mind
when you would want to use the raw content of attributes (especially
since you already cannot get raw content of text nodes).

> If we hard-code a decoding scheme ourselves, then force that choice
> onto users, no?

I agree we cannot hard-code or change it now due to compatibility
concerns, but adding #:decode-attributes to html->sxml,
#:encode-attributes to sxml->html and possibly %deencode-attributes?
parameter, in the spirit of %strict-tokenizer? would seem reasonable.

Tomas

=2D-=20
There are only two hard things in Computer Science:
cache invalidation, naming things and off-by-one errors.

--=-=-=
Content-Type: application/pgp-signature; name="signature.asc"

-----BEGIN PGP SIGNATURE-----

iQJCBAEBCgAsFiEEt4NJs4wUfTYpiGikL7/ufbZ/wakFAmeig2QOHH5Ad29sZnNk
ZW4uY3oACgkQL7/ufbZ/wanxHxAAs3PKplqcaclgmySzVDymHdgkGoSoHU9CDOKT
EGT40VmG0jglWxerH9hFja1JInZHWRD4I5rOLEsmnNHzlcnKEVLrE6W5Ja/goYu7
deSuFkU/IOmSbQWc9ULSS/ETCacgaENUVSARwdDnpEowAjAFsP0rDLXESnIZgQfV
Zbl3TDsPTo6r30cgBgpHHN/95ubRDvBzT1bFOlyN3sUDh1jMLGOqH+5zcGBB0OE1
rsGrvLNG8hp53j9xafDADOLCQeNX6VlZ4oDFv8HOgM6oI33GyqFS00KSqAujB9lX
aPrzL8iTC6DVrF7toqyv0DRpLHohyjWJB1E2clIn1WRafe/UeYxOtupmpBB3HDZ6
fgTL9fRRtKAoN07wYSVx+yH+LBVqVYbzHA1osU3lrh+5T9s0Fb/M3kR7fGsE5vj6
CmYPzfYqz10mcGogdDEHQNj1Rz5aHKFZd8qtbVvsVTjgwY7yxuUcLueu+SsvnBbC
vX6gp3DO595Nc00frM2csSvTdDw+NeLW1yNE1vPrzlTIMjKpUA9BUAgpdG30o9ks
Tkm7F9DUIbgJ8IwDzneQWqyLA9NOvF2ytTESjL9AC1AfqHs/eWczJUYoyUn19wMn
PzjEYoa2IUEcg5jiqmssTpwImBQyNHVHJhW7EOD5Xr8je50F/UbYd6CgZJCZZuS4
6YxLoRA=
=JUjj
-----END PGP SIGNATURE-----
--=-=-=--




Information forwarded to bug-guile@HIDDEN:
bug#75998; Package guile. Full text available.
bug closed, send any further explanations to 75998 <at> debbugs.gnu.org and Tomas Volf <~@wolfsden.cz> Request was from Tomas Volf <~@wolfsden.cz> to control <at> debbugs.gnu.org. Full text available.

Message received at 75998 <at> debbugs.gnu.org:


Received: (at 75998) by debbugs.gnu.org; 4 Feb 2025 20:55:17 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Tue Feb 04 15:55:17 2025
Received: from localhost ([127.0.0.1]:46898 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1tfPwy-0006h2-QU
	for submit <at> debbugs.gnu.org; Tue, 04 Feb 2025 15:55:17 -0500
Received: from wolfsden.cz ([37.205.8.62]:43332)
 by debbugs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.84_2) (envelope-from <~@wolfsden.cz>) id 1tfPwv-0006gl-Bd
 for 75998 <at> debbugs.gnu.org; Tue, 04 Feb 2025 15:55:15 -0500
Received: by wolfsden.cz (Postfix, from userid 104)
 id 2A4581E95C4; Tue,  4 Feb 2025 20:55:11 +0000 (UTC)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=wolfsden.cz; s=mail;
 t=1738702511; bh=ztZXTaefdByegpupU82GUsuBszOPRWWRQKHvNVwSeqw=;
 h=From:To:Cc:Subject:In-Reply-To:References:Date;
 b=kUAUyS/9B2EEK/xeebiMdYsGh4czDBn7ZUm5sFRm6iuoMI7pklBzLe4bMPTN0mnBy
 VLCfcwy2FfxCKVOqnCk0+5Q0l8uGoqb5Wy3aQ9wTPSyB/WUuV/4KkCep+pt4rO7U1T
 fPQrwQPkxT3JgdpQSR/LBKHcj9jsRKVca0Gn+ijHlFGypHzbs4AS+7hgjejLvLNkLt
 f/FJlEycItXPJ1sObik2M4n5IMPorgIsbxEAhOPleCInPvFf+TLwabGIzKsihwdWUc
 kCORyik417VBe27BHmhCvtiGAJygjKgdHsdC+qMCGQHvR6JspJt88C8l5DxS/QqYzt
 dcTQVVyJDaLZyacyEbNwqRabz7VK7ukhpAS8lbmrN3BTKpYv9RvRhMJ5P0oDEvQaKt
 YAFjiGvUwqLRRG3shHCHeQQMhqaz9mYjwEBcQcAhQ+Re2NlUf4oSUYWuh7QJ6v7sR9
 tsPoxFErKwPvqM+Bk+gyPIQjAfJ1qdjEgVDD1yXATWwrF6StbZy0//olfwvbKQJp81
 4DwHH2FvSxYDpPi8VHOuhnwiOOrgpBqE6FKedSRYi8N62UaxqAWN1m/6JXh3RZrp7C
 +Hs0w6yu/Bo6DLDb0AoEEqwSVsbUeOPE5HL93rTMiBlpDCh83zrZnZ2HdlOB3V6oht
 1k7vRzLZ1IdRwEQzUW4TpYH4=
X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on wolfsden
X-Spam-Level: 
X-Spam-Status: No, score=-3.1 required=5.0 tests=ALL_TRUSTED,BAYES_00,
 DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,URIBL_BLOCKED
 autolearn=unavailable autolearn_force=no version=3.4.6
Received: from localhost (unknown [128.0.188.242])
 by wolfsden.cz (Postfix) with ESMTPSA id D99271E98A4;
 Tue,  4 Feb 2025 20:55:08 +0000 (UTC)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=wolfsden.cz; s=mail;
 t=1738702508; bh=ztZXTaefdByegpupU82GUsuBszOPRWWRQKHvNVwSeqw=;
 h=From:To:Cc:Subject:In-Reply-To:References:Date;
 b=gh3ZhFU7Qq2+NEhhA8aZ2vG/LL9tUBFMy9XrrLX77X/sPxucxetWisjXBQ3JfYDcG
 EcjKaLY7gndqXXsaRUw/hOD+vRjv8NwGUDFKs5M+0aYHnEfIt8hrBwfEirqbfS6zj9
 z50gx9YcCtjPfMMk074b5dygynGS9nKK53EKSfpoQgn26k+dNmWBz6GWjtjyPZPxVy
 /1WkmiWmZfXapmoxiWgjF2e+MjbIbD4PzIzOAWjbOV5dXO5saTNJcS8VqHg4wf4wZ7
 N2aYTwnvQuClPs40piEj3ZTMG6mCk38ZMY1ptREEIdn4kkEjIhfHHeoylQEIH45zkB
 UKM8pCuliJ2L+l/KL/sqZzggpSwX1KmuUkGY/0oUADv7WYO3XARx/gpPhhkgFRdO0c
 /T7p+dRCqt9jeDSHDEDnTji7FkaX0pGlJ+tZ5l9+Uw8F7/DpPDu19eyGFuXcG0RlZQ
 VwUV2uYldsD6r9Sb6HnsCXAvhWNE06k7pbc8ttVuMe3Sij5NyXX2VXlAJodRaJIVrU
 Boa2dbL8FL3vcIvNKojohIgg79/JBxIA7U5r8rXPW6XT8eoy9IsuoiR8HMVDQ4Z18u
 RYxxCDJzEeNsERD+c3MqhTjh/DPB/ujX4TX1G5i6w1PYrYn/ctYj0FQ3P9wtfUhG4A
 LIReTje7LZbL6gv+czLBjBHI=
From: Tomas Volf <~@wolfsden.cz>
To: David Pirotte <david@HIDDEN>
Subject: Re: bug#75998: [guile-lib] html->sxml does not decode entities in
 attributes
In-Reply-To: <20250202184813.788926b0@tintin> (David Pirotte's message of
 "Sun, 2 Feb 2025 18:48:13 -0300")
References: <87zfj58m03.fsf@HIDDEN> <Z58U6pXCY+tlHPMK@HIDDEN>
 <87seow8ya8.fsf@HIDDEN> <20250202184813.788926b0@tintin>
Date: Tue, 04 Feb 2025 21:55:08 +0100
Message-ID: <87r04d77mb.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13)
MIME-Version: 1.0
Content-Type: multipart/signed; boundary="=-=-=";
 micalg=pgp-sha512; protocol="application/pgp-signature"
X-Spam-Score: 0.0 (/)
X-Debbugs-Envelope-To: 75998
Cc: 75998 <at> debbugs.gnu.org, tomas@HIDDEN
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.0 (-)

--=-=-=
Content-Type: text/plain
Content-Transfer-Encoding: quoted-printable

David Pirotte <david@HIDDEN> writes:

> 	HACKING
> 	INSTALL
> 	NEWS
> 	README	http://git.savannah.nongnu.org/cgit/guile-lib.git/tree/README
>
> 	all do mention, in their header [HACKING as an example]:
>
>                           Guile-Lib - HACKING
>               =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D
>=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=20=
=20=20=20=20=20=20=20=20=20=20=20=20=20
>                   Please send Guile-Lib bug reports to
>                            guile-devel@HIDDEN
>
> I'd recommend to close this bug report here saying 'not a guile bug' and
> repost on guile-devel.

Ah, I see.  I admit I was checking only the website, and then I asked on
IRC.  Will re-post on guile-devel as instructed.

Tomas

=2D-=20
There are only two hard things in Computer Science:
cache invalidation, naming things and off-by-one errors.

--=-=-=
Content-Type: application/pgp-signature; name="signature.asc"

-----BEGIN PGP SIGNATURE-----

iQJCBAEBCgAsFiEEt4NJs4wUfTYpiGikL7/ufbZ/wakFAmeifqwOHH5Ad29sZnNk
ZW4uY3oACgkQL7/ufbZ/wakqtA//Y2V1YCIZpK1S1M9uW3b2kbmL4Psf5rJkytvQ
3/GiNquxaSdMFfc/2xHawYWLtFWd/9leesOKozCaTt4KQafh1ScYlAdD2dmVuL6q
R7rx6w53LAvMkiIQ/z6lvm4G2S/aULCTHH+pnn/8bVPY6wD/OtHlNahRZh+KuZ2u
yHB5kyhLCqPmToYfxhT+sfZTXav4OJPVosvo3e+830nNToWhScGDq9SV/JBzQ37R
fdWTP7KjWFiLnRmqzVxckIZZQZXAKTXFN4a0F5VoCLIE8czTPr7txFHpJp4koOxH
scej5qDeEnHCR0/fyXqN4di0QLBXG6WD02Ju2b7tVoOMemXonxG5Q50VjngZPw2l
bVV5Rup5xf53CvqjxFYnXEmLV5JiIyZuKbu2Fgc3CcE7gVUbLAuOsc7kPevZYumh
qgkR45ow9HgGJZOn3SWZNehu3woWaRH5ZeZBj0JqtVABKEJQbKPPsk9s/4tFwLoJ
Cow73H5QaTuEnLxWK4BD8Blgqkl00Y5s3wR1bmx3rOUHTbYNgD2eIysRXol14wor
87IqsJFrQfYQZt5qVt5+XgHvaAgznFhKFzdWKkN1F2iy7PWRVhBfQsNTNQHvyqkH
AzXUWOB6n1rwdZnOnxXyzXca1ZWk5/j93Pf+v+pzhTnaFXY3C17/HLfZANz+6FG6
hM2fIfI=
=6hgU
-----END PGP SIGNATURE-----
--=-=-=--




Information forwarded to bug-guile@HIDDEN:
bug#75998; Package guile. Full text available.

Message received at 75998 <at> debbugs.gnu.org:


Received: (at 75998) by debbugs.gnu.org; 3 Feb 2025 14:31:20 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Mon Feb 03 09:31:19 2025
Received: from localhost ([127.0.0.1]:38461 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1texTr-0002Go-Cf
	for submit <at> debbugs.gnu.org; Mon, 03 Feb 2025 09:31:19 -0500
Received: from mail-pl1-x62d.google.com ([2607:f8b0:4864:20::62d]:56743)
 by debbugs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128)
 (Exim 4.84_2) (envelope-from <maxim.cournoyer@HIDDEN>)
 id 1texTn-0002GP-G0
 for 75998 <at> debbugs.gnu.org; Mon, 03 Feb 2025 09:31:17 -0500
Received: by mail-pl1-x62d.google.com with SMTP id
 d9443c01a7336-2166360285dso72943555ad.1
 for <75998 <at> debbugs.gnu.org>; Mon, 03 Feb 2025 06:31:15 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=gmail.com; s=20230601; t=1738593069; x=1739197869; darn=debbugs.gnu.org;
 h=mime-version:user-agent:message-id:date:references:in-reply-to
 :subject:cc:to:from:from:to:cc:subject:date:message-id:reply-to;
 bh=oSbE8oJGkXdyS+Pk8rlsSdriRakH1QMxtl15dfYVaH0=;
 b=ASXfRcfib/JEnZUHh+y2pp6+3m+5nqLKEewYal/6FNFzDjTuYayOgVWw6wAwBpzVX4
 PrArX75NDAHRenvkNujFilEDroWfinok+PdZ0yUsaCPI7BEuiAhJ6UUTywawZMZeCQdo
 zhqW/PjC8QEqWnqXrPld5bW5UN6Vr63uMg2e4J+phTvsFqpHP0eptZ4VS1lZqYtIvT9/
 Y8wJGEQ1Q5xvLkPRToCcfmo26IPKtH5ILswdX+HqmvOPDf1PfJ+fVoM6+1iWwYQBu6JL
 0RV9AwAlNHaE/3gFm93+xcHUkTSIVLcHSbObTMtCKugbjmSsk7IVie4n8Goj+flM1GAz
 +UPA==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
 d=1e100.net; s=20230601; t=1738593069; x=1739197869;
 h=mime-version:user-agent:message-id:date:references:in-reply-to
 :subject:cc:to:from:x-gm-message-state:from:to:cc:subject:date
 :message-id:reply-to;
 bh=oSbE8oJGkXdyS+Pk8rlsSdriRakH1QMxtl15dfYVaH0=;
 b=tYyr9TqIYpclXYl+Ky4TiVr+otfDbV3jzp3gJEvv/BO78iymjAiK3mrC9cbenjczQT
 XRCWWxsWqWMOuWe+EMPBZajzfdBQ4zQk6ZS0maOFBXYVs1H1rrMYjBJ2fn2/oCcXbV6W
 5PI4+G8Bd3i+sMPSg4ifrFtAxyStYVSzOP0WHxq9E2o6TV8MlsFxiofak+R+SZZxxbfI
 V7xK5vqzqtXrb6F2R1a8v7el8FJREDaSx0qoS4w6M20rv50JM1FcnsCgOibV7CK365n1
 SJLpC+CtEFJFP0Y/RlrI/Qdi4hW5ez21gNqOf9/cMy7oJP+aCmKpR7ZJrE0F2lpEMXMw
 9Mmw==
X-Forwarded-Encrypted: i=1;
 AJvYcCXvpuiiiujDEg6Mo8Sl7t+QldcMGdH63oX0haG0bzKJqUQpaS8/ETkSMITPcocuYPXzDm2lWA==@debbugs.gnu.org
X-Gm-Message-State: AOJu0YxgoJU5jvLarkR0QNyBJjTwbnA4rkfbd4bIsUziLDoOS7L4vp7U
 N8tAgM00QhN/BxZrKaHLsV9Hr0dIYF3cWnDpZZMnollRufGO91PizynSKh7c
X-Gm-Gg: ASbGncsy92fKGmUyj36/mAueV3gI0F263CaEBZjXSFFDkMTRx/jWbg49wAzSTrzMDzV
 +aCl/fYBJ5qKY20FGL44kloiolKGuiyojxlu7IcRZlrZJu59AP0DE/LWDnJPr9oTxWNn1xYcFJu
 6DlXVL3AqmG/FQ0xIOKri/vaxjF85HK8Qq45TLiltXRO6ZL4CVbuO0WCOLOVYQtWbIRnT5nTprl
 fzzPz+VuGsitmN6LfWCCfAtwZCs15lxDHZAui2A7lzVEbCaGdckMgStGV7U8dxzFDWAdz82nhyw
 w9lvuR+Ic41/
X-Google-Smtp-Source: AGHT+IGmU2dfCr7PiPoyVNaZksqk0eFTOtC51+96NkwS2YeGzDmbfglrvvJs6/3bF6/svZoekgBxdw==
X-Received: by 2002:a05:6a00:35c5:b0:728:e745:23cd with SMTP id
 d2e1a72fcca58-72fd0bce22bmr39188024b3a.3.1738593068225; 
 Mon, 03 Feb 2025 06:31:08 -0800 (PST)
Received: from terra ([2405:6586:be0:0:c8ff:1707:9b9:af89])
 by smtp.gmail.com with ESMTPSA id
 d2e1a72fcca58-72fe631beb1sm8453255b3a.5.2025.02.03.06.31.05
 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256);
 Mon, 03 Feb 2025 06:31:07 -0800 (PST)
From: Maxim Cournoyer <maxim.cournoyer@HIDDEN>
To: Tomas Volf <~@wolfsden.cz>
Subject: Re: bug#75998: [guile-lib] html->sxml does not decode entities in
 attributes
In-Reply-To: <87seow8ya8.fsf@HIDDEN> (Tomas Volf's message of "Sun, 02
 Feb 2025 10:57:03 +0100")
References: <87zfj58m03.fsf@HIDDEN> <Z58U6pXCY+tlHPMK@HIDDEN>
 <87seow8ya8.fsf@HIDDEN>
Date: Mon, 03 Feb 2025 23:30:55 +0900
Message-ID: <87ikprdrs0.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13)
MIME-Version: 1.0
Content-Type: text/plain
X-Spam-Score: 0.0 (/)
X-Debbugs-Envelope-To: 75998
Cc: 75998 <at> debbugs.gnu.org, tomas@HIDDEN
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.0 (-)

Hi Tomas,

Thank you for reporting this issue.

Tomas Volf <~@wolfsden.cz> writes:

> <tomas@HIDDEN> writes:
>
>> On Sat, Feb 01, 2025 at 09:10:04PM +0100, Tomas Volf wrote:
>>> 
>>> Hello,
>>> 
>>> I think I found a bug in the htmlprag module in guile-lib.  When parsing
>>> attributes, the values are not properly decoded:
>>> 
>>> --8<---------------cut here---------------start------------->8---
>>> scheme@(guile-user)> ,use (htmlprag)
>>> scheme@(guile-user)> (html->sxml "<hr aaa=\"bbb&quot;ccc'ddd\" />")
>>> $1 = (*TOP* (hr (@ (aaa "bbb&quot;ccc'ddd"))))
>>> scheme@(guile-user)> (html->sxml "<a href=\"a&amp;b\" />")
>>> $2 = (*TOP* (a (@ (href "a&amp;b"))))
>>> --8<---------------cut here---------------end--------------->8---
>>> 
>>> I think that $1 should be "bbb\"ccc'ddd" and $2 should be "a&b".
>>
>> Ouch. Have you contacted Oleg Kiselyov about it? He's usually pretty
>> responsive and very friendly.
>
> I did not.  I did not find a "how to report bugs" section on guile-lib's
> website, and on the (htmlprag) documentation section Oleg Kiselyov is
> mentioned only in one sentence as a "Thanks".
>
> I think I have managed to find his email in one Haskell paper of his, so
> I will CC him on the bug report, as suggested.

And also for containing Oleg.  I hope they can provide us with their
opinion on whether this is an actual bug or was designed that way.  To
me, it's not clear whether html->sxml should alterate the raw value of
attributes in any way.  Users may haev different use cases requiring to
apply different transformation themselves?  If we hard-code a decoding
scheme ourselves, then force that choice onto users, no?

-- 
Thanks,
Maxim




Information forwarded to bug-guile@HIDDEN:
bug#75998; Package guile. Full text available.

Message received at 75998 <at> debbugs.gnu.org:


Received: (at 75998) by debbugs.gnu.org; 2 Feb 2025 21:48:27 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Sun Feb 02 16:48:27 2025
Received: from localhost ([127.0.0.1]:36521 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1tehpK-0000CK-Qa
	for submit <at> debbugs.gnu.org; Sun, 02 Feb 2025 16:48:27 -0500
Received: from moses.all2all.org ([79.99.200.14]:60448)
 by debbugs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.84_2) (envelope-from <david@HIDDEN>) id 1tehpI-0000CA-4S
 for 75998 <at> debbugs.gnu.org; Sun, 02 Feb 2025 16:48:25 -0500
Received: from localhost (localhost [127.0.0.1])
 by moses.all2all.org (Postfix) with ESMTP id 602BC67C007E;
 Sun,  2 Feb 2025 22:48:21 +0100 (CET)
X-Virus-Scanned: Debian amavisd-new at moses.all2all.org
Received: from moses.all2all.org ([127.0.0.1])
 by localhost (moses.all2all.org [127.0.0.1]) (amavisd-new, port 10024)
 with ESMTP id evYYvvFoXM3V; Sun,  2 Feb 2025 22:48:20 +0100 (CET)
Received: from tintin (unknown [168.227.187.186])
 by moses.all2all.org (Postfix) with ESMTPSA id 56A9867C007B;
 Sun,  2 Feb 2025 22:48:19 +0100 (CET)
Date: Sun, 2 Feb 2025 18:48:13 -0300
From: David Pirotte <david@HIDDEN>
To: Tomas Volf <~@wolfsden.cz>
Subject: Re: bug#75998: [guile-lib] html->sxml does not decode entities in
 attributes
Message-ID: <20250202184813.788926b0@tintin>
In-Reply-To: <87seow8ya8.fsf@HIDDEN>
References: <87zfj58m03.fsf@HIDDEN> <Z58U6pXCY+tlHPMK@HIDDEN>
 <87seow8ya8.fsf@HIDDEN>
X-Mailer: Claws Mail 4.3.0 (GTK 3.24.43; x86_64-pc-linux-gnu)
MIME-Version: 1.0
Content-Type: multipart/signed; boundary="Sig_/dRlHhH8PJS.xy0/m8n_FfDW";
 protocol="application/pgp-signature"; micalg=pgp-sha512
X-Spam-Score: -0.7 (/)
X-Debbugs-Envelope-To: 75998
Cc: 75998 <at> debbugs.gnu.org, tomas@HIDDEN
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.7 (-)

--Sig_/dRlHhH8PJS.xy0/m8n_FfDW
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: quoted-printable

Hello Thomas,

> I did not.  I did not find a "how to report bugs" section on
> guile-lib's website

	HACKING
	INSTALL
	NEWS
	README	http://git.savannah.nongnu.org/cgit/guile-lib.git/tree/README

	all do mention, in their header [HACKING as an example]:

                          Guile-Lib - HACKING
              =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=
=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D
                                   =20
                  Please send Guile-Lib bug reports to
                           guile-devel@HIDDEN

I'd recommend to close this bug report here saying 'not a guile bug' and
repost on guile-devel.

> and on the (htmlprag) documentation section Oleg Kiselyov is
> mentioned only in one sentence as a "Thanks". I think I have managed
> to find his email in one Haskell paper of his, so I will CC him on
> the bug report, as suggested.

Note and be aware that ther version in guile-lib has been patched
'recently', see commit 84c420769, i Pushed on behalf of Maxim Cournoyer
<maxim.cournoyer@HIDDEN>, who's the actual guile-lib maintainer.

David



--Sig_/dRlHhH8PJS.xy0/m8n_FfDW
Content-Type: application/pgp-signature
Content-Description: OpenPGP digital signature

-----BEGIN PGP SIGNATURE-----

iQEzBAEBCgAdFiEEhCJlRZtBM3furJHe83T9k6MFetcFAmef6B0ACgkQ83T9k6MF
etf09ggAkpP9lEe7606cFfzZ07tyFl8/IZXkYhROgBWqTum4Chx4/1ArQm5RV7u5
uJpPbRbWHATzyeec7/tDmwMA0evLdez/DHjewLC2HFHNA3kzfSxZszJXgHGMAPV1
Uqp9TgZN+xsCmdtX/vOTXe5C1P+WRgJvEcmJjnXCLWjIEbDpXTKb6JJ9TLNvmmJ1
CEGQkaMMe2SW5TCFeMjG+fGiUxNyx8a3zXDD+n2sPiA4is8ryPH9zanGzdsy3lcc
fsFAsaKUABJ637yiJ1xDR8jyTWBoGHQk1ihmzA/KS1ijOSUT05/HYdw9dKlBuW2z
ia2Ky6y/9BuTghz98xIwxmZKTXIiaQ==
=ulcC
-----END PGP SIGNATURE-----

--Sig_/dRlHhH8PJS.xy0/m8n_FfDW--




Information forwarded to bug-guile@HIDDEN:
bug#75998; Package guile. Full text available.
Set bug forwarded-to-address to 'oleg@HIDDEN'. Request was from Tomas Volf <~@wolfsden.cz> to control <at> debbugs.gnu.org. Full text available.

Message received at 75998 <at> debbugs.gnu.org:


Received: (at 75998) by debbugs.gnu.org; 2 Feb 2025 09:57:11 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Sun Feb 02 04:57:11 2025
Received: from localhost ([127.0.0.1]:33179 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1teWj0-0007G5-RK
	for submit <at> debbugs.gnu.org; Sun, 02 Feb 2025 04:57:11 -0500
Received: from wolfsden.cz ([37.205.8.62]:55698)
 by debbugs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.84_2) (envelope-from <~@wolfsden.cz>) id 1teWix-0007Ft-34
 for 75998 <at> debbugs.gnu.org; Sun, 02 Feb 2025 04:57:08 -0500
Received: by wolfsden.cz (Postfix, from userid 104)
 id F26D836F3D2; Sun,  2 Feb 2025 09:57:04 +0000 (UTC)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=wolfsden.cz; s=mail;
 t=1738490224; bh=7yT3p6CuJTfHE+4s3Wt5BZv36fhedH3wj1gqJHrjD2w=;
 h=From:To:Cc:Subject:In-Reply-To:References:Date;
 b=H3ORW8JoOpm2vLvsm8/9OLt0ZV6IdUyZiWVuU+tALeY8iZtTKVyepNmjk37zUJSjh
 80SgrBzy0cK53PllnY2MnZiQvjg0iHujiHqZQ3OCmEGqIs4KnOv6kPkdoZ0iAX0QHP
 Im7z+CuVoMx8k8WbT67hEeIOUsmxlKJwHeNeSqYmz8uuK1ijz+rKwEwn/syl26y2Tw
 ekiV/8NYQ1P/tmGicW2UUhqn/+QfOuWs2QsDUw3dF8XmSHFktWRiGRC+xiNA7k7oyV
 pLz2FVrD7XjejGD5l294uVVemhEtQQh/9GTwwfN3SQ3LK9Mn/4j3CRmmJY+vpFTbUF
 XQ9flj0qOImoNHBzjyYKOOhhEGxdq0titaP2xdcvDiNa0HWZ+WqNZuONogsDesnK7h
 PR0yYjeXkwPl18Sq5tRwsmDbUN6loHD/0rs2FFakpy/d8hmGyUVxdNAoQXNjC1pUjD
 5X/G7L6fqcK1AD/nY/Uhr7FY45qrm8dctoSGIzSBqlNz2RVAG+vwBBfkcIXegtidKA
 a/2q9c001t0EoaczWxaASuujfb69/fQxisSp/7KAD9n1KX98DTNUOCVraXT6dEdFWc
 EjAo5Gh9mYmhMqZiBsXiQEUmbGedGgRhMnX/n98NfC1u1pZMbbglS7vSC7C81ANCvV
 gv7CUVrrSdNZMGNOkxhLGdG0=
X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on wolfsden
X-Spam-Level: 
X-Spam-Status: No, score=-3.1 required=5.0 tests=ALL_TRUSTED,BAYES_00,
 DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,URIBL_BLOCKED
 autolearn=ham autolearn_force=no version=3.4.6
Received: from localhost (unknown [128.0.188.242])
 by wolfsden.cz (Postfix) with ESMTPSA id 1F95736FE22;
 Sun,  2 Feb 2025 09:57:04 +0000 (UTC)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=wolfsden.cz; s=mail;
 t=1738490224; bh=7yT3p6CuJTfHE+4s3Wt5BZv36fhedH3wj1gqJHrjD2w=;
 h=From:To:Cc:Subject:In-Reply-To:References:Date;
 b=H3ORW8JoOpm2vLvsm8/9OLt0ZV6IdUyZiWVuU+tALeY8iZtTKVyepNmjk37zUJSjh
 80SgrBzy0cK53PllnY2MnZiQvjg0iHujiHqZQ3OCmEGqIs4KnOv6kPkdoZ0iAX0QHP
 Im7z+CuVoMx8k8WbT67hEeIOUsmxlKJwHeNeSqYmz8uuK1ijz+rKwEwn/syl26y2Tw
 ekiV/8NYQ1P/tmGicW2UUhqn/+QfOuWs2QsDUw3dF8XmSHFktWRiGRC+xiNA7k7oyV
 pLz2FVrD7XjejGD5l294uVVemhEtQQh/9GTwwfN3SQ3LK9Mn/4j3CRmmJY+vpFTbUF
 XQ9flj0qOImoNHBzjyYKOOhhEGxdq0titaP2xdcvDiNa0HWZ+WqNZuONogsDesnK7h
 PR0yYjeXkwPl18Sq5tRwsmDbUN6loHD/0rs2FFakpy/d8hmGyUVxdNAoQXNjC1pUjD
 5X/G7L6fqcK1AD/nY/Uhr7FY45qrm8dctoSGIzSBqlNz2RVAG+vwBBfkcIXegtidKA
 a/2q9c001t0EoaczWxaASuujfb69/fQxisSp/7KAD9n1KX98DTNUOCVraXT6dEdFWc
 EjAo5Gh9mYmhMqZiBsXiQEUmbGedGgRhMnX/n98NfC1u1pZMbbglS7vSC7C81ANCvV
 gv7CUVrrSdNZMGNOkxhLGdG0=
From: Tomas Volf <~@wolfsden.cz>
To: <tomas@HIDDEN>
Subject: Re: bug#75998: [guile-lib] html->sxml does not decode entities in
 attributes
In-Reply-To: <Z58U6pXCY+tlHPMK@HIDDEN> (tomas@HIDDEN's message of
 "Sun, 2 Feb 2025 07:47:06 +0100")
References: <87zfj58m03.fsf@HIDDEN> <Z58U6pXCY+tlHPMK@HIDDEN>
Date: Sun, 02 Feb 2025 10:57:03 +0100
Message-ID: <87seow8ya8.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13)
MIME-Version: 1.0
Content-Type: multipart/signed; boundary="=-=-=";
 micalg=pgp-sha512; protocol="application/pgp-signature"
X-Spam-Score: 0.0 (/)
X-Debbugs-Envelope-To: 75998
Cc: 75998 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.0 (-)

--=-=-=
Content-Type: text/plain
Content-Transfer-Encoding: quoted-printable

<tomas@HIDDEN> writes:

> On Sat, Feb 01, 2025 at 09:10:04PM +0100, Tomas Volf wrote:
>>=20
>> Hello,
>>=20
>> I think I found a bug in the htmlprag module in guile-lib.  When parsing
>> attributes, the values are not properly decoded:
>>=20
>> --8<---------------cut here---------------start------------->8---
>> scheme@(guile-user)> ,use (htmlprag)
>> scheme@(guile-user)> (html->sxml "<hr aaa=3D\"bbb&quot;ccc'ddd\" />")
>> $1 =3D (*TOP* (hr (@ (aaa "bbb&quot;ccc'ddd"))))
>> scheme@(guile-user)> (html->sxml "<a href=3D\"a&amp;b\" />")
>> $2 =3D (*TOP* (a (@ (href "a&amp;b"))))
>> --8<---------------cut here---------------end--------------->8---
>>=20
>> I think that $1 should be "bbb\"ccc'ddd" and $2 should be "a&b".
>
> Ouch. Have you contacted Oleg Kiselyov about it? He's usually pretty
> responsive and very friendly.

I did not.  I did not find a "how to report bugs" section on guile-lib's
website, and on the (htmlprag) documentation section Oleg Kiselyov is
mentioned only in one sentence as a "Thanks".

I think I have managed to find his email in one Haskell paper of his, so
I will CC him on the bug report, as suggested.

Thanks,
Tomas

=2D-=20
There are only two hard things in Computer Science:
cache invalidation, naming things and off-by-one errors.

--=-=-=
Content-Type: application/pgp-signature; name="signature.asc"

-----BEGIN PGP SIGNATURE-----

iQJCBAEBCgAsFiEEt4NJs4wUfTYpiGikL7/ufbZ/wakFAmefQW8OHH5Ad29sZnNk
ZW4uY3oACgkQL7/ufbZ/wak4+A//ZX91aovYDMfdhC9gzQootS3pPeKtSOns5dYj
W8aeLayEDJJA8hlXKT1KQPqDkNSLIT4fUwgTaGmqTBfE80+5KAwnDFYCsn+qQlIJ
5MQhjPyjHX9NA84ZtBGKN8o4QGMOOxHUIJwP35nlxLsHrQwXPd3IVKFyDl1TubSh
s3Uj33BeK2zDQiBXvqKmBIXZ85pWsFVd/w7/D3lZSRkrNjhMNmk0OoDzouqAQSCw
FO50Zt2MG17jrhjGutbYAmMYn3gny8yneJmfG9EmUZpvGU3OH8w8HUVWZYkLzSeO
LNG+zLWtMZmZOX8q+PVefbTcxj4hcIylafpNuZd5gd16UdXtW35cDMzIc275mekJ
ZSH0aY/GDRc69UCecPBWKts+XVQia/+PaGzTBnVuoutzB1Mq1WdenC9GgHFRBRVp
adw1XpoGQohfHBdNF7J7/C1Pn9fsxxUveG7BTgzzTPKygz9jYMK0Ob7TG2JMfdYR
lUITdkIc/56YT23pFtzZxc6BIkSxyJ0bTFTqkdUYFxHrkhyar6QyI8CqR/izubUp
pBZuv7t7q3b/cPQvLmDcD0JqErGs+WT4smeuAHg0AteU9Tiq2KnAjEvmIX14frKr
DkpP6tyhslTfvKPyFSAaR6dPtfT5xNn1mKtuALULziZsSTsEWCXsxdDI8/4iJXx7
ZBWa8L0=
=NDGY
-----END PGP SIGNATURE-----
--=-=-=--




Information forwarded to bug-guile@HIDDEN:
bug#75998; Package guile. Full text available.

Message received at 75998 <at> debbugs.gnu.org:


Received: (at 75998) by debbugs.gnu.org; 2 Feb 2025 06:47:12 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Sun Feb 02 01:47:12 2025
Received: from localhost ([127.0.0.1]:32789 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1teTl9-0006YP-Nu
	for submit <at> debbugs.gnu.org; Sun, 02 Feb 2025 01:47:12 -0500
Received: from mail.tuxteam.de ([5.199.139.25]:47392)
 by debbugs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.84_2) (envelope-from <tomas@HIDDEN>) id 1teTl6-0006YA-O0
 for 75998 <at> debbugs.gnu.org; Sun, 02 Feb 2025 01:47:09 -0500
DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=tuxteam.de; 
 s=mail;
 h=From:In-Reply-To:Content-Type:MIME-Version:References:Message-ID:
 Subject:Cc:To:Date:Sender:Reply-To:Content-Transfer-Encoding:Content-ID:
 Content-Description:Resent-Date:Resent-From:Resent-Sender:Resent-To:Resent-Cc
 :Resent-Message-ID:List-Id:List-Help:List-Unsubscribe:List-Subscribe:
 List-Post:List-Owner:List-Archive;
 bh=dHh50arL3KmU1zLiUD2+6MSrFzufbUbiFk0D9PEV6WI=; b=e1g0M22zZlouSlNkea4kM8GGeS
 NtkeFM2QbfHd2HcuVonxqv+L5DNCXFqX/28Wpn4TJt7SRon6m08fDH0PPCTiuLXuWQwzBOHVWiPDf
 Z1fr3AAnsZmtmV1M7uPMzGxX89hXy2C6Pv79FsIfPzGoSLpspbnkcCFda+i25Xv/3IsesWtTG0jv6
 wdVSJAiAMiSOirQZjB/2//c8ATopF+bY6TIs3Z2HgnzvCIJPc/ByRUs+UDPWsoeZPF5P+SMqCdWeo
 L8VU2fNj99fL7n4J4oemMnrauRFASbhgc8PKQmfDC7VH0vB3hTJL8rISRfJJb+70fp/Xxd1vXJqvl
 rPWoxkUA==;
Received: from tomas by mail.tuxteam.de with local (Exim 4.94.2)
 (envelope-from <tomas@HIDDEN>)
 id 1teTl4-0007ZV-D3; Sun, 02 Feb 2025 07:47:06 +0100
Date: Sun, 2 Feb 2025 07:47:06 +0100
To: Tomas Volf <~@wolfsden.cz>
Subject: Re: bug#75998: [guile-lib] html->sxml does not decode entities in
 attributes
Message-ID: <Z58U6pXCY+tlHPMK@HIDDEN>
References: <87zfj58m03.fsf@HIDDEN>
MIME-Version: 1.0
Content-Type: multipart/signed; micalg=pgp-sha1;
 protocol="application/pgp-signature"; boundary="BWrhbJLXxvPqfVbG"
Content-Disposition: inline
In-Reply-To: <87zfj58m03.fsf@HIDDEN>
From: <tomas@HIDDEN>
X-Spam-Score: 0.0 (/)
X-Debbugs-Envelope-To: 75998
Cc: 75998 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.0 (-)


--BWrhbJLXxvPqfVbG
Content-Type: text/plain; charset=utf-8
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Sat, Feb 01, 2025 at 09:10:04PM +0100, Tomas Volf wrote:
>=20
> Hello,
>=20
> I think I found a bug in the htmlprag module in guile-lib.  When parsing
> attributes, the values are not properly decoded:
>=20
> --8<---------------cut here---------------start------------->8---
> scheme@(guile-user)> ,use (htmlprag)
> scheme@(guile-user)> (html->sxml "<hr aaa=3D\"bbb&quot;ccc'ddd\" />")
> $1 =3D (*TOP* (hr (@ (aaa "bbb&quot;ccc'ddd"))))
> scheme@(guile-user)> (html->sxml "<a href=3D\"a&amp;b\" />")
> $2 =3D (*TOP* (a (@ (href "a&amp;b"))))
> --8<---------------cut here---------------end--------------->8---
>=20
> I think that $1 should be "bbb\"ccc'ddd" and $2 should be "a&b".

Ouch. Have you contacted Oleg Kiselyov about it? He's usually pretty
responsive and very friendly.

> The annoying part is that this cannot really be changed now, because
> people (me included) already have workarounds in place, and
> automatically decoding now would lead to double decoding.
>=20
> I see few ways forward:
>=20
> 1. Document the current behavior and keep it as it is.
> 2. Add argument #:decode-attributes, defaulting to #f, to the relevant
>    procedures, so that people can opt into the fixed behavior.
> 3. Introduce parameter %decode-attributes, so that people can opt into
>    the fixed behavior.
>=20
> I am sure there are also other approaches possible.

If it were me, I'd take 2.

Cheers
--=20
tom=C3=A1s

--BWrhbJLXxvPqfVbG
Content-Type: application/pgp-signature; name="signature.asc"

-----BEGIN PGP SIGNATURE-----

iF0EABECAB0WIQRp53liolZD6iXhAoIFyCz1etHaRgUCZ58U4gAKCRAFyCz1etHa
RgB5AJ4m7u2MxVWwOjPy1544CGVzH+QAowCdEsYbR86oyEQDS8Z1DicLyeXfbeM=
=0hZ0
-----END PGP SIGNATURE-----

--BWrhbJLXxvPqfVbG--




Information forwarded to bug-guile@HIDDEN:
bug#75998; Package guile. Full text available.

Message received at submit <at> debbugs.gnu.org:


Received: (at submit) by debbugs.gnu.org; 1 Feb 2025 20:10:17 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Sat Feb 01 15:10:17 2025
Received: from localhost ([127.0.0.1]:60136 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1teJom-0004a7-PK
	for submit <at> debbugs.gnu.org; Sat, 01 Feb 2025 15:10:17 -0500
Received: from lists.gnu.org ([2001:470:142::17]:50496)
 by debbugs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.84_2) (envelope-from <~@wolfsden.cz>) id 1teJol-0004YX-D7
 for submit <at> debbugs.gnu.org; Sat, 01 Feb 2025 15:10:15 -0500
Received: from eggs.gnu.org ([2001:470:142:3::10])
 by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1) (envelope-from <~@wolfsden.cz>) id 1teJof-0007sA-JX
 for bug-guile@HIDDEN; Sat, 01 Feb 2025 15:10:09 -0500
Received: from wolfsden.cz ([37.205.8.62])
 by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1) (envelope-from <~@wolfsden.cz>) id 1teJoe-0007UU-1S
 for bug-guile@HIDDEN; Sat, 01 Feb 2025 15:10:09 -0500
Received: by wolfsden.cz (Postfix, from userid 104)
 id 32B1636F810; Sat,  1 Feb 2025 20:10:05 +0000 (UTC)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=wolfsden.cz; s=mail;
 t=1738440605; bh=0v6MJJH4VZmGsUEAR9AnUqR5FEoc9yOl/n9EZasEVw8=;
 h=From:To:Subject:Date;
 b=a50OyYDY/jlj82QBHOyacy/Ekqqe3lVn/G+8iqsonr0gLEF3JVS0z0ElrSkwo7u9X
 jXt34akLbOrdn5bnEHIFlYeD25QRSIdON59zaQ6fSDTdzti+JNc/GRPkFDkfQXAzgV
 KVX4upNdepb1Nz3GOXuCCUHErP8HGvFO4medxSHSz3BwQiS+0i2RhfW0s2WCveEtI1
 ljC0jesNWjp1IzhL0zIkwd2INxk29fFPl6uqUr0jf3LaS5rOq5ZSqsWmzM70BQBvwK
 dTjZGqzQ3FhmKyxktKGKvqlHYAiQsE2M4+fBKmZthMop3g3DozxBBlole2TaKci0VT
 GNwOXiU2ih/BppQk1qOx4sOJgjHdHNIFUrVgvtvkOtT8L0mG1h3pJlSsqgDvSAp2VF
 vSTiLUA6j7rRbXNMB1Ru92MoAbhUak3HfOxsjW+7lbOeckf1dcUZvdh9iJbsmLi/vl
 +81BHnAu/iCd90yyslLQwcorYKjz1C9iVfP2Bq/xG3K1zujZQhfSVMvvDNE3b8/NWT
 Px7oWtO+1bwhGpdY8yFXctoM0+bnZvL8u7zKqKveex3gl7qtxzyg6mFU79vvR+J71S
 aPYrbjJKOT7nBoLYwG19CTmN8YWil+Wgz/kJ028L703gt059nZ0+46YqyLtdMcIb8I
 Vmo0m34h38F6+bNJjftzzSmI=
X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on wolfsden
X-Spam-Level: 
X-Spam-Status: No, score=-3.1 required=5.0 tests=ALL_TRUSTED,BAYES_00,
 DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,URIBL_BLOCKED
 autolearn=ham autolearn_force=no version=3.4.6
Received: from localhost (unknown [128.0.188.242])
 by wolfsden.cz (Postfix) with ESMTPSA id D462836EA24
 for <bug-guile@HIDDEN>; Sat,  1 Feb 2025 20:10:04 +0000 (UTC)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=wolfsden.cz; s=mail;
 t=1738440604; bh=0v6MJJH4VZmGsUEAR9AnUqR5FEoc9yOl/n9EZasEVw8=;
 h=From:To:Subject:Date;
 b=PrzD7q6+YWGdgt/yV0Jm+Bo5NjYezQeTkOgn4Re86oNR0kw3JzEUG/EHb023BYaYY
 dvyDVEnTfgK6NTnCEdpDdGn58AE5KtuW9fqTQ7pL9hcOOKOntvAmVsTvGiD6X4Zx2F
 BdjYKKqzW54ta7UxSAZwuIZZvSTWAjwBF75bWE130ZPvsSTRcO1ZbmKO3LnBPulfiQ
 hrGYJUz/zk2Ex9ugN6PY+juvKlahYXpLa2W4TBDuCFj+agD3HY5IGbQ+ECA7KxgaCX
 Pzw7uVWumywFPjQgunFIcQLmvDRIhQQVDnhF0LV1fh9RtgKifBz7WlyQzCqhxcLtxJ
 jpLXSfJvXWUaNABTKZu+bhWA5Mt7euu8o0uXVU9ewuHthWsgFW722RHUY0baehvOav
 tmfAT2//621IPMuntIFRaReGwrG6EXzaubz5ibzcdNuVR+soo4F3PdYhFTa0IKXIa8
 8LP4oFjjpA+pEkDabLqbdlcBozzVU2ELh8KyP1TS8R2H5DovEQMUKrTXZU4S5NlqVO
 41AAh27hWS9yw6U0jzCMGDfO43R4InCqu74vmQfOF40E8I7nxzax1RBGrmMKGLGeEc
 wOwBiklJebvoulxUy0g8mTB4AVD6OSPYJhnjz2bYpvPl17acw5MDXNiAREm1HvTK22
 88AiJN1NhHqLbW+2IzzZktp0=
From: Tomas Volf <~@wolfsden.cz>
To: bug-guile@HIDDEN
Subject: [guile-lib] html->sxml does not decode entities in attributes
Date: Sat, 01 Feb 2025 21:10:04 +0100
Message-ID: <87zfj58m03.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13)
MIME-Version: 1.0
Content-Type: text/plain
Received-SPF: pass client-ip=37.205.8.62; envelope-from=~@wolfsden.cz;
 helo=wolfsden.cz
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
 DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
 RCVD_IN_VALIDITY_CERTIFIED_BLOCKED=0.001, RCVD_IN_VALIDITY_RPBL_BLOCKED=0.001,
 SPF_HELO_PASS=-0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-Spam-Score: 1.0 (+)
X-Debbugs-Envelope-To: submit
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -0.0 (/)


Hello,

I think I found a bug in the htmlprag module in guile-lib.  When parsing
attributes, the values are not properly decoded:

--8<---------------cut here---------------start------------->8---
scheme@(guile-user)> ,use (htmlprag)
scheme@(guile-user)> (html->sxml "<hr aaa=\"bbb&quot;ccc'ddd\" />")
$1 = (*TOP* (hr (@ (aaa "bbb&quot;ccc'ddd"))))
scheme@(guile-user)> (html->sxml "<a href=\"a&amp;b\" />")
$2 = (*TOP* (a (@ (href "a&amp;b"))))
--8<---------------cut here---------------end--------------->8---

I think that $1 should be "bbb\"ccc'ddd" and $2 should be "a&b".

The annoying part is that this cannot really be changed now, because
people (me included) already have workarounds in place, and
automatically decoding now would lead to double decoding.

I see few ways forward:

1. Document the current behavior and keep it as it is.
2. Add argument #:decode-attributes, defaulting to #f, to the relevant
   procedures, so that people can opt into the fixed behavior.
3. Introduce parameter %decode-attributes, so that people can opt into
   the fixed behavior.

I am sure there are also other approaches possible.

Have a nice day,
Tomas

-- 
There are only two hard things in Computer Science:
cache invalidation, naming things and off-by-one errors.




Acknowledgement sent to Tomas Volf <~@wolfsden.cz>:
New bug report received and forwarded. Copy sent to bug-guile@HIDDEN. Full text available.
Report forwarded to bug-guile@HIDDEN:
bug#75998; Package guile. Full text available.
Please note: This is a static page, with minimal formatting, updated once a day.
Click here to see this page with the latest information and nicer formatting.
Last modified: Sat, 15 Feb 2025 15:45:02 UTC

GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997 nCipher Corporation Ltd, 1994-97 Ian Jackson.