GNU bug report logs - #72166
Shepherd periodically goes unresponsive on one of my machines

Please note: This is a static page, with minimal formatting, updated once a day.
Click here to see this page with the latest information and nicer formatting.

Package: guix; Reported by: "Jonathan Frederickson" <jonathan@HIDDEN>; dated Thu, 18 Jul 2024 00:44:01 UTC; Maintainer for guix is bug-guix@HIDDEN.

Message received at 72166 <at> debbugs.gnu.org:


Received: (at 72166) by debbugs.gnu.org; 4 Sep 2024 09:28:06 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Wed Sep 04 05:28:06 2024
Received: from localhost ([127.0.0.1]:33580 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1slmJ3-00034j-Ut
	for submit <at> debbugs.gnu.org; Wed, 04 Sep 2024 05:28:06 -0400
Received: from eggs.gnu.org ([209.51.188.92]:47646)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <ludo@HIDDEN>) id 1slmJ1-00034C-92
 for 72166 <at> debbugs.gnu.org; Wed, 04 Sep 2024 05:28:04 -0400
Received: from fencepost.gnu.org ([2001:470:142:3::e])
 by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1) (envelope-from <ludo@HIDDEN>)
 id 1slmFn-0002D5-MH; Wed, 04 Sep 2024 05:24:43 -0400
DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnu.org;
 s=fencepost-gnu-org; h=MIME-Version:Date:References:In-Reply-To:Subject:To:
 From; bh=SHYhRSwJuP91fnW9q/Phg7J7ubTif/qO5azAqHm7dLA=; b=Ql0Ceku4+bO0opT7i6mZ
 mjlWonVnID3cYaxGHktcumpDgTIt7m19NipMnyb0FHOUIZj57OQXUNDE1ajRpYFJUL2rESJ7rWY75
 hLvL5K0S0Jtn8Z8XD5WEATQC8E7nlWvyYxufSyPNoUN3NSEKY2AphtlMnkVfkuTHx/AZ3Ii/Jqgvq
 Ot6kbghGVIP0oW+gnsuNhSDVK8B/BqvdEP/0Q54BcsVVjsdgvzHYeGGidR92GrxY2cIvvHEtXjVw3
 VaDYf3xU8+eR0ahRTS2j712wzD7buae3YIcl3Ybonb0hEYb0ki0Un6woydQhDIEQX4OO0j7QRQcpb
 GKLlZe0Z8GpWnQ==;
From: =?utf-8?Q?Ludovic_Court=C3=A8s?= <ludo@HIDDEN>
To: Jonathan Frederickson <jonathan@HIDDEN>
Subject: Re: bug#72166: Shepherd periodically goes unresponsive on one of my
 machines
In-Reply-To: <6f4711b4-3092-4bbe-91d9-8cb2ce44ed1b@HIDDEN> (Jonathan
 Frederickson's message of "Thu, 22 Aug 2024 13:52:54 -0400 (EDT)")
References: <df6e8894-fd84-446f-a67f-50cdcc9de5b3@HIDDEN>
 <878qxxtmwu.fsf@HIDDEN>
 <7974c622-e7d8-48b3-9948-14e8d7654793@HIDDEN>
 <87zfq9kiei.fsf@HIDDEN>
 <5477099d-fbc5-4acd-8320-f88ed3107de7@HIDDEN>
 <877ccg78gy.fsf@HIDDEN>
 <b559c5a9-1c87-43f7-a294-b5248e117b2d@HIDDEN>
 <eec39e18-8a2b-440f-ad97-4779e56362af@HIDDEN>
 <8734mztl9u.fsf@HIDDEN>
 <b86349b4-4c6f-4fef-b29b-95db86065a85@HIDDEN>
 <87zfp4nbn0.fsf@HIDDEN>
 <6f4711b4-3092-4bbe-91d9-8cb2ce44ed1b@HIDDEN>
X-URL: http://www.fdn.fr/~lcourtes/
X-Revolutionary-Date: Nonidi 19 Fructidor an 232 de la =?utf-8?Q?R=C3=A9vo?=
 =?utf-8?Q?lution=2C?= jour du Tagette
X-PGP-Key-ID: 0x090B11993D9AEBB5
X-PGP-Key: http://www.fdn.fr/~lcourtes/ludovic.asc
X-PGP-Fingerprint: 3CE4 6455 8A84 FDC6 9DB4  0CFB 090B 1199 3D9A EBB5
X-OS: x86_64-pc-linux-gnu
Date: Wed, 04 Sep 2024 11:24:40 +0200
Message-ID: <87bk13g49z.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13)
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable
X-Spam-Score: 1.3 (+)
X-Spam-Report: Spam detection software, running on the system "debbugs.gnu.org",
 has NOT identified this incoming email as spam.  The original
 message has been attached to this so you can view it or label
 similar future email.  If you have any questions, see
 the administrator of that system for details.
 Content preview:  Hi, Jonathan Frederickson <jonathan@HIDDEN> skribis:
 > I've also run into what looks like the same thing on a laptop that does
 have an RTC, but it also has an NTP daemon running on it. (I have also noticed
 that this is most common on that laptop after a [...] 
 Content analysis details:   (1.3 points, 10.0 required)
 pts rule name              description
 ---- ---------------------- --------------------------------------------------
 3.6 RCVD_IN_SBL_CSS        RBL: Received via a relay in Spamhaus SBL-CSS
 [209.51.188.92 listed in zen.spamhaus.org]
 -2.3 RCVD_IN_DNSWL_MED      RBL: Sender listed at https://www.dnswl.org/,
 medium trust [209.51.188.92 listed in list.dnswl.org]
 -0.0 SPF_PASS               SPF: sender matches SPF record
 -0.0 SPF_HELO_PASS          SPF: HELO matches SPF record
X-Debbugs-Envelope-To: 72166
Cc: 72166 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: 0.3 (/)

Hi,

Jonathan Frederickson <jonathan@HIDDEN> skribis:

> I've also run into what looks like the same thing on a laptop that does h=
ave an RTC, but it also has an NTP daemon running on it. (I have also notic=
ed that this is most common on that laptop after a suspend/resume cycle, so=
 maybe that's triggering the bug as well?)

Suspend/resume does not trigger the bug no (fortunately), but a clock
jump due to NTP sync can cause it (specifically, shepherd will consume
CPU time proportional to the drift).

Ludo=E2=80=99.




Information forwarded to bug-guix@HIDDEN:
bug#72166; Package guix. Full text available.

Message received at 72166 <at> debbugs.gnu.org:


Received: (at 72166) by debbugs.gnu.org; 22 Aug 2024 17:54:02 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Thu Aug 22 13:54:02 2024
Received: from localhost ([127.0.0.1]:38316 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1shC0X-0001aM-St
	for submit <at> debbugs.gnu.org; Thu, 22 Aug 2024 13:54:02 -0400
Received: from fout5-smtp.messagingengine.com ([103.168.172.148]:54633)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <jonathan@HIDDEN>) id 1shC0U-0001Zu-Rm
 for 72166 <at> debbugs.gnu.org; Thu, 22 Aug 2024 13:53:59 -0400
Received: from phl-compute-08.internal (phl-compute-08.nyi.internal
 [10.202.2.48])
 by mailfout.nyi.internal (Postfix) with ESMTP id 3E131138DE75;
 Thu, 22 Aug 2024 13:53:07 -0400 (EDT)
Received: from phl-mailfrontend-01 ([10.202.2.162])
 by phl-compute-08.internal (MEProxy); Thu, 22 Aug 2024 13:53:07 -0400
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=terracrypt.net;
 h=cc:cc:content-type:content-type:date:date:from:from
 :in-reply-to:in-reply-to:message-id:mime-version:references
 :reply-to:subject:subject:to:to; s=fm2; t=1724349187; x=
 1724435587; bh=o6UAXdsR+Ba4yd0hwjpcm9HVAW5H1rpOMj3DuDhlkbU=; b=3
 bi/4S+S5TINJZnQfkHhENNrokOSykFGg+jJ61zkOb2yWuYEqC46hrj8ij0TWtrn1
 jM/52+WA8kGNJDDFKRYN0yVJob5iGv8/WY3v/JexG936KJ5Oyq3qVK+oMtIKGqqF
 Kgz7618kbWFUFULc4R5mDv82m15BLMv1q6fUjadVhSlHsTxPOXHxOaQdkDSQ9E6V
 ZKey/isIy+mLxYyGXwVQCRFvuB1Yxm6DBlQgqcprl+1kgXRW5ZNHRZK/THGs2/P/
 H0tCqWblUEGdew8d25n6x2V3NHK2txkc20QXZ/3jv2FujpQIjbGEI7/uYsL/Wshm
 QZ6iA58qlNPBmPhUMDsDg==
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=
 messagingengine.com; h=cc:cc:content-type:content-type:date:date
 :feedback-id:feedback-id:from:from:in-reply-to:in-reply-to
 :message-id:mime-version:references:reply-to:subject:subject:to
 :to:x-me-proxy:x-me-proxy:x-me-sender:x-me-sender:x-sasl-enc; s=
 fm1; t=1724349187; x=1724435587; bh=o6UAXdsR+Ba4yd0hwjpcm9HVAW5H
 1rpOMj3DuDhlkbU=; b=fLvx1HXLmRjpfaOhJUM/XJpod4Aitj9pttkwKKDve/47
 Z0b3SrNOz7LnV+/ENtZwwupiIvTEKZvcdv+u4N73gr+sFu4IqGm3/jV3w307ypD8
 XYjZHscau2tM9wqnyOGiLmfghou3TXOUfrCFd1gD+BH02UyAA0coUE66hP0mmdlr
 eK/Y2X9s/O9Kfzb1LGL1Y1O5BnqvON0HIsS/IIueXHtE++dGCO0KMyNWwF7eB5iI
 q8hazpJvcIKlJjeOALOfJfP+w9BLYV62WwCsFK0cc/iOoTJW1bFp+AJdSWL3mZ6u
 kiNxvTLUgJ/bCaw/XCCdUz/NYfmDY0F8Ys2ZfijoAA==
X-ME-Sender: <xms:A3vHZiyGHUvUWrv7BNSYNr-A1DuVcXqreaX25BhSJ2yHhHfIFPkykw>
 <xme:A3vHZuTEo6I8uUWYh7gAzoS8bW4CmM7lE4tNGnT36ph5sFDudrAEkYZ3IpCQCzYwo
 y2p0w6rRrvSKqhRfA>
X-ME-Received: <xmr:A3vHZkUFMJQjLqLixSAJuMbuOlhs31ZlfzzCUmjPf8M85w6Wp9ULIpklxeuJ4Uw>
X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgeeftddruddvtddgudduiecutefuodetggdotefrod
 ftvfcurfhrohhfihhlvgemucfhrghsthforghilhdpggftfghnshhusghstghrihgsvgdp
 uffrtefokffrpgfnqfghnecuuegrihhlohhuthemuceftddtnecusecvtfgvtghiphhivg
 hnthhsucdlqddutddtmdenucfjughrpeffhffvvefkjghfufggtgesrgdtregstddtjeen
 ucfhrhhomheplfhonhgrthhhrghnucfhrhgvuggvrhhitghkshhonhcuoehjohhnrghthh
 grnhesthgvrhhrrggtrhihphhtrdhnvghtqeenucggtffrrghtthgvrhhnpeetleehffef
 ffegffetfeegtdduffduteejiefhvdfhueeugfefjeehudeuleevheenucffohhmrghinh
 epghhnuhdrohhrghenucevlhhushhtvghrufhiiigvpedtnecurfgrrhgrmhepmhgrihhl
 fhhrohhmpehjohhnrghthhgrnhesthgvrhhrrggtrhihphhtrdhnvghtpdhnsggprhgtph
 htthhopedvpdhmohguvgepshhmthhpohhuthdprhgtphhtthhopehluhguohesghhnuhdr
 ohhrghdprhgtphhtthhopeejvdduieeiseguvggssghughhsrdhgnhhurdhorhhg
X-ME-Proxy: <xmx:A3vHZoiorTOZt67FdLKUu15hiheET1aSzvJY9mUTVc0p3KJkvemisA>
 <xmx:A3vHZkC2_o9I45rXCbkY0qY16uY7eaRTnV66oKf9_ND_yvzw2zzY9g>
 <xmx:A3vHZpK1FyvEHcau-8NXM29HL7vGECNU4G5nQ3bghEMN-821LMTDPQ>
 <xmx:A3vHZrAMbxfaAQB7MCBChRUCs12w_ZMmw4389IvpjMy3OdXF4hTNyw>
 <xmx:A3vHZnNLEpeCESIAvSTzcboQl19F8n4h7oTSU3_VCuBiMQ7dzzo9D_BG>
Feedback-ID: if4194509:Fastmail
Received: by mail.messagingengine.com (Postfix) with ESMTPA; Thu,
 22 Aug 2024 13:53:05 -0400 (EDT)
Date: Thu, 22 Aug 2024 13:52:54 -0400 (EDT)
From: Jonathan Frederickson <jonathan@HIDDEN>
To: =?UTF-8?Q?Ludovic_Court=C3=A8s?= <ludo@HIDDEN>
Message-ID: <6f4711b4-3092-4bbe-91d9-8cb2ce44ed1b@HIDDEN>
In-Reply-To: <87zfp4nbn0.fsf@HIDDEN>
References: <df6e8894-fd84-446f-a67f-50cdcc9de5b3@HIDDEN>
 <878qxxtmwu.fsf@HIDDEN>
 <7974c622-e7d8-48b3-9948-14e8d7654793@HIDDEN>
 <87zfq9kiei.fsf@HIDDEN>
 <5477099d-fbc5-4acd-8320-f88ed3107de7@HIDDEN>
 <877ccg78gy.fsf@HIDDEN>
 <b559c5a9-1c87-43f7-a294-b5248e117b2d@HIDDEN>
 <eec39e18-8a2b-440f-ad97-4779e56362af@HIDDEN>
 <8734mztl9u.fsf@HIDDEN>
 <b86349b4-4c6f-4fef-b29b-95db86065a85@HIDDEN>
 <87zfp4nbn0.fsf@HIDDEN>
Subject: Re: bug#72166: Shepherd periodically goes unresponsive on one of my
 machines
MIME-Version: 1.0
Content-Type: multipart/alternative; 
 boundary="----=_Part_35_113027044.1724349174214"
X-Correlation-ID: <6f4711b4-3092-4bbe-91d9-8cb2ce44ed1b@HIDDEN>
X-Spam-Score: -0.7 (/)
X-Debbugs-Envelope-To: 72166
Cc: 72166 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.7 (-)

------=_Part_35_113027044.1724349174214
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Aug 22, 2024 05:35:47 Ludovic Court=C3=A8s <ludo@HIDDEN>:

> Hi,
>=20
> You wrote:
>=20
>> The machine in question does not have a battery-backed RTC, so it
>> loses time when it loses power, but notice the time changes once ntpd
>> starts up.
>=20
> So you=E2=80=99re hitting the Fibers bug described here:
>=20
> =C2=A0 https://issues.guix.gnu.org/70848
>=20
> There is still no fix upstream unfortunately, but I hope we=E2=80=99ll ge=
t there
> in time for Shepherd 1.0.
>=20
> The only workaround so far is to build shepherd against
> =E2=80=98guile-fibers-1.1=E2=80=99, which is what is done in Guix on AArc=
h64 and RISC-V
> since these platforms are likely to be used with single-board computers
> lacking a battery-backed RTC.
>=20
> Thanks,
> Ludo=E2=80=99.
Ah! Yes, that seems likely.

I've also run into what looks like the same thing on a laptop that does hav=
e an RTC, but it also has an NTP daemon running on it. (I have also noticed=
 that this is most common on that laptop after a suspend/resume cycle, so m=
aybe that's triggering the bug as well?)

I'll check to see if I've recently had an NTP sync the next time this happe=
ns, because that would be a definitive confirmation that I'm running into t=
hat bug.

------=_Part_35_113027044.1724349174214
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<html>
 <head>
  <meta name=3D"viewport" content=3D"width=3Ddevice-width, initial-scale=3D=
1.0">
 </head>
 <body>
  <div class=3D"fairemail_quote">
   <div dir=3D"ltr">
    <p>Aug 22, 2024 05:35:47 Ludovic Court=C3=A8s &lt;ludo@HIDDEN&gt;:</p>
   </div>
   <blockquote style=3D"margin:0;border-left:3px solid #ccc; padding-left:1=
0px;">
    <div>
     Hi,=20
     <br>
     <br>
      You wrote:=20
     <br>
     <br>
     <blockquote style=3D"margin:0;border-left:3px solid #ccc; padding-left=
:10px;">
      The machine in question does not have a battery-backed RTC, so it=20
      <br>
       loses time when it loses power, but notice the time changes once ntp=
d=20
      <br>
       starts up.=20
      <br>
     </blockquote>
     <br>
      So you=E2=80=99re hitting the Fibers bug described here:=20
     <br>
     <br>
      &nbsp; https://issues.guix.gnu.org/70848=20
     <br>
     <br>
      There is still no fix upstream unfortunately, but I hope we=E2=80=99l=
l get there=20
     <br>
      in time for Shepherd 1.0.=20
     <br>
     <br>
      The only workaround so far is to build shepherd against=20
     <br>
      =E2=80=98guile-fibers-1.1=E2=80=99, which is what is done in Guix on =
AArch64 and RISC-V=20
     <br>
      since these platforms are likely to be used with single-board compute=
rs=20
     <br>
      lacking a battery-backed RTC.=20
     <br>
     <br>
      Thanks,=20
     <br>
      Ludo=E2=80=99.=20
     <br>
    </div>
   </blockquote>
  </div><span dir=3D"ltr" style=3D"margin-top:0; margin-bottom:0;">Ah! Yes,=
 that seems likely.</span>
  <br>
  <br><span dir=3D"ltr" style=3D"margin-top:0; margin-bottom:0;">I've also =
run into what looks like the same thing on a laptop that does have an RTC, =
but it also has an NTP daemon running on it. (I have also noticed that this=
 is most common on that laptop after a suspend/resume cycle, so maybe that'=
s triggering the bug as well?)</span>
  <br>
  <br><span dir=3D"ltr" style=3D"margin-top:0; margin-bottom:0;">I'll check=
 to see if I've recently had an NTP sync the next time this happens, becaus=
e that would be a definitive confirmation that I'm running into that bug.</=
span>
  <br>
 </body>
</html>
------=_Part_35_113027044.1724349174214--




Information forwarded to bug-guix@HIDDEN:
bug#72166; Package guix. Full text available.

Message received at 72166 <at> debbugs.gnu.org:


Received: (at 72166) by debbugs.gnu.org; 22 Aug 2024 09:38:49 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Thu Aug 22 05:38:49 2024
Received: from localhost ([127.0.0.1]:36733 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1sh4HJ-0003yB-5d
	for submit <at> debbugs.gnu.org; Thu, 22 Aug 2024 05:38:49 -0400
Received: from eggs.gnu.org ([209.51.188.92]:45172)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <ludo@HIDDEN>) id 1sh4HF-0003xt-UU
 for 72166 <at> debbugs.gnu.org; Thu, 22 Aug 2024 05:38:47 -0400
Received: from fencepost.gnu.org ([2001:470:142:3::e])
 by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1) (envelope-from <ludo@HIDDEN>)
 id 1sh4EK-0008SQ-Ta; Thu, 22 Aug 2024 05:35:44 -0400
DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnu.org;
 s=fencepost-gnu-org; h=MIME-Version:Date:References:In-Reply-To:Subject:To:
 From; bh=auYJ0ke8JTDXan3Z9VimM9dKbRTbkSIUKv97C5JA7MM=; b=SKTdeiC1kRv1I3pvCQVO
 TZBf8AshjieVgKgVu8Qzm3XU854r70VzhdTmJP16X+ksZJwMyliLD9wW/hsGy+hRpqIdgUkw9D++v
 1JYJVa00O05lA/mOF6AC1i+pWBD6qx8Zxmnn7zGg83IMDG3D69EZ7TspDWA5rbfdjoTkgvMtyu3R8
 BCEi+QCa42kAQU0q1aUPxCsjiRIMTCJ0qq4QgsWlDSv/5qTHmqsWsy+UNRAgMC1XmLlq2i1spdJqe
 NIyAcUf+YXYSDvOiSK9vKra9re9c9yphDwoYiFrrvxhxKZ5YBHehneXoDS3yZgMIXqcQYMy7zEAWp
 +8TgWh9M4S/hzA==;
From: =?utf-8?Q?Ludovic_Court=C3=A8s?= <ludo@HIDDEN>
To: Jonathan Frederickson <jonathan@HIDDEN>
Subject: Re: bug#72166: Shepherd periodically goes unresponsive on one of my
 machines
In-Reply-To: <b86349b4-4c6f-4fef-b29b-95db86065a85@HIDDEN> (Jonathan
 Frederickson's message of "Tue, 20 Aug 2024 09:53:21 -0400 (EDT)")
References: <df6e8894-fd84-446f-a67f-50cdcc9de5b3@HIDDEN>
 <878qxxtmwu.fsf@HIDDEN>
 <7974c622-e7d8-48b3-9948-14e8d7654793@HIDDEN>
 <87zfq9kiei.fsf@HIDDEN>
 <5477099d-fbc5-4acd-8320-f88ed3107de7@HIDDEN>
 <877ccg78gy.fsf@HIDDEN>
 <b559c5a9-1c87-43f7-a294-b5248e117b2d@HIDDEN>
 <eec39e18-8a2b-440f-ad97-4779e56362af@HIDDEN>
 <8734mztl9u.fsf@HIDDEN>
 <b86349b4-4c6f-4fef-b29b-95db86065a85@HIDDEN>
X-URL: http://www.fdn.fr/~lcourtes/
X-Revolutionary-Date: Sextidi 6 Fructidor an 232 de la =?utf-8?Q?R=C3=A9vo?=
 =?utf-8?Q?lution=2C?= jour de
 la =?utf-8?Q?Tub=C3=A9reuse?=
X-PGP-Key-ID: 0x090B11993D9AEBB5
X-PGP-Key: http://www.fdn.fr/~lcourtes/ludovic.asc
X-PGP-Fingerprint: 3CE4 6455 8A84 FDC6 9DB4  0CFB 090B 1199 3D9A EBB5
X-OS: x86_64-pc-linux-gnu
Date: Thu, 22 Aug 2024 11:35:15 +0200
Message-ID: <87zfp4nbn0.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13)
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable
X-Spam-Score: -2.3 (--)
X-Debbugs-Envelope-To: 72166
Cc: 72166 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -3.3 (---)

Hi,

You wrote:

> The machine in question does not have a battery-backed RTC, so it
> loses time when it loses power, but notice the time changes once ntpd
> starts up.

So you=E2=80=99re hitting the Fibers bug described here:

  https://issues.guix.gnu.org/70848

There is still no fix upstream unfortunately, but I hope we=E2=80=99ll get =
there
in time for Shepherd 1.0.

The only workaround so far is to build shepherd against
=E2=80=98guile-fibers-1.1=E2=80=99, which is what is done in Guix on AArch6=
4 and RISC-V
since these platforms are likely to be used with single-board computers
lacking a battery-backed RTC.

Thanks,
Ludo=E2=80=99.




Information forwarded to bug-guix@HIDDEN:
bug#72166; Package guix. Full text available.

Message received at 72166 <at> debbugs.gnu.org:


Received: (at 72166) by debbugs.gnu.org; 19 Aug 2024 20:03:06 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Mon Aug 19 16:03:06 2024
Received: from localhost ([127.0.0.1]:59288 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1sg8ao-0007CA-4n
	for submit <at> debbugs.gnu.org; Mon, 19 Aug 2024 16:03:06 -0400
Received: from fhigh5-smtp.messagingengine.com ([103.168.172.156]:54933)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <jonathan@HIDDEN>) id 1sg8al-0007Bd-JR
 for 72166 <at> debbugs.gnu.org; Mon, 19 Aug 2024 16:03:04 -0400
Received: from phl-compute-08.internal (phl-compute-08.nyi.internal
 [10.202.2.48])
 by mailfhigh.nyi.internal (Postfix) with ESMTP id 5F9651146FF0;
 Mon, 19 Aug 2024 16:02:16 -0400 (EDT)
Received: from phl-imap-02 ([10.202.2.81])
 by phl-compute-08.internal (MEProxy); Mon, 19 Aug 2024 16:02:16 -0400
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=terracrypt.net;
 h=cc:cc:content-transfer-encoding:content-type:content-type
 :date:date:from:from:in-reply-to:in-reply-to:message-id
 :mime-version:references:reply-to:subject:subject:to:to; s=fm2;
 t=1724097736; x=1724184136; bh=J2ZnUtwa7FSSM/ukKbQhteUb9qPBqkAp
 5vBmQvfPj8w=; b=FB2q8+02mVQUpWeVfrQIS2Y+hacKF667O1LnLCdYWFc8+ell
 N0MZX2jSizRWCUBn46y75q+ajGGp8WmutefGjD45aT93YBBPYwQBHXfd2KJQq8F7
 qlOHb/T8esv+itESEJelcZPulULy4tTNTxbFmDW1pAUWzmTfCgBnzOLKRaRfCiQy
 kYIBifLCTTuEgXty1yBtxo+s5Ms9xyqgWXvTDVgS9jVzjN3Hrtj32JnPcOnIlQYV
 UL33JB1JKBRodsR+ow0TDPL4hB++FeWLa0NJBC3kHPkg+82irvaQbLNPUyfjQMzR
 ZIcNoMH1NNoJWrhb4jpg+HVyIyT0KutCgrMpyA==
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=
 messagingengine.com; h=cc:cc:content-transfer-encoding
 :content-type:content-type:date:date:feedback-id:feedback-id
 :from:from:in-reply-to:in-reply-to:message-id:mime-version
 :references:reply-to:subject:subject:to:to:x-me-proxy:x-me-proxy
 :x-me-sender:x-me-sender:x-sasl-enc; s=fm1; t=1724097736; x=
 1724184136; bh=J2ZnUtwa7FSSM/ukKbQhteUb9qPBqkAp5vBmQvfPj8w=; b=a
 Jp9TQaITFoUeYbMvHlSirgOuAu1nfto6Q1L1KpNEjI4fcl3boGaHHACva1K5biA1
 IVYgR7liV+Pq0fIhvBNM0PDFAH0Nyx/7A4h1oac1i4hav2qBAMoUp0iEJOr4chYU
 vPiQEouFsxhfDXzjOtLy81QIk/9VuEjj0+ePjAR9jM+utfGCH+XPe9qF7X5Q4r1Q
 npIIiZg0dIRTei/uz9JvYvsPkLzPns+7ZByQDslC2uQJXPBSm9EZFTWoRLg+l+kZ
 OFhGytKcMp8ncs0gkY8buYM2t5Pq9d6eaDnI4C2Lca+wJi93KgTxnfTmaR9ppbnX
 zjlhlR9Ivvu2LccepfOng==
X-ME-Sender: <xms:x6TDZgnAV21ukh4vovk-0jOrDwFSZXV3f7eusjsOxHDiGLbliZ68Ew>
 <xme:x6TDZv31JoOz51dwb-SbDVZEr7GMJkGbHX8Ne-8iZEBPckx78bVEaVXhuxwfDB6d_
 4r1eBi8g4M_GRhJMA>
X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgeeftddruddugedgudeghecutefuodetggdotefrod
 ftvfcurfhrohhfihhlvgemucfhrghsthforghilhdpggftfghnshhusghstghrihgsvgdp
 uffrtefokffrpgfnqfghnecuuegrihhlohhuthemuceftddtnecusecvtfgvtghiphhivg
 hnthhsucdlqddutddtmdenucfjughrpefoggffhffvvefkjghfufgtgfesthejredtredt
 tdenucfhrhhomhepfdflohhnrghthhgrnhcuhfhrvgguvghrihgtkhhsohhnfdcuoehjoh
 hnrghthhgrnhesthgvrhhrrggtrhihphhtrdhnvghtqeenucggtffrrghtthgvrhhnpeej
 vdeuieehhfffffdttedvteelgfethfdtieeiteevgfeuvefgkedvleeuudejjeenucevlh
 hushhtvghrufhiiigvpedtnecurfgrrhgrmhepmhgrihhlfhhrohhmpehjohhnrghthhgr
 nhesthgvrhhrrggtrhihphhtrdhnvghtpdhnsggprhgtphhtthhopedvpdhmohguvgepsh
 hmthhpohhuthdprhgtphhtthhopeejvdduieeiseguvggssghughhsrdhgnhhurdhorhhg
 pdhrtghpthhtoheplhhuughosehgnhhurdhorhhg
X-ME-Proxy: <xmx:x6TDZupiNPvU3ngHaqPPSQ8Yg1gNfzYzpMe4VqeIypZp1yU2NO9Cmw>
 <xmx:x6TDZsnFx2aTiQLun6CNyYS6HCABc0iR-EU-JvecFhjbxeXWcdxKeQ>
 <xmx:x6TDZu2QUR7zYMP-Ma1qA2vNzBgiMRNL-nioli9m_Ol8n7JBa4PCZw>
 <xmx:x6TDZjs_UtLcX9QV6BPDWUcLB8w6FcfaQDKJDTeVYt-OBWMFYdCFUw>
 <xmx:yKTDZj9h_NudcYbI0_hi43yLCUfxgKNLSj_ASZkXTJWGaVjG6IUeZuIh>
Feedback-ID: if4194509:Fastmail
Received: by mailuser.nyi.internal (Postfix, from userid 501)
 id A9BCE5E0139; Mon, 19 Aug 2024 16:02:15 -0400 (EDT)
X-Mailer: MessagingEngine.com Webmail Interface
MIME-Version: 1.0
Date: Mon, 19 Aug 2024 16:01:55 -0400
From: "Jonathan Frederickson" <jonathan@HIDDEN>
To: =?UTF-8?Q?Ludovic_Court=C3=A8s?= <ludo@HIDDEN>
Message-Id: <8f52c269-dea8-4856-aa6e-740d99cc72e6@HIDDEN>
In-Reply-To: <b559c5a9-1c87-43f7-a294-b5248e117b2d@HIDDEN>
References: <df6e8894-fd84-446f-a67f-50cdcc9de5b3@HIDDEN>
 <878qxxtmwu.fsf@HIDDEN>
 <7974c622-e7d8-48b3-9948-14e8d7654793@HIDDEN>
 <87zfq9kiei.fsf@HIDDEN>
 <5477099d-fbc5-4acd-8320-f88ed3107de7@HIDDEN>
 <877ccg78gy.fsf@HIDDEN>
 <b559c5a9-1c87-43f7-a294-b5248e117b2d@HIDDEN>
Subject: Re: bug#72166: Shepherd periodically goes unresponsive on one of my
 machines
Content-Type: text/plain
Content-Transfer-Encoding: 7bit
X-Spam-Score: -0.7 (/)
X-Debbugs-Envelope-To: 72166
Cc: 72166 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.7 (-)

No news yet on reproduction, but I did spot something else interesting when running 'guix home switch-generation' on one of my other machines that's been having similar issues:

> SLoading /gnu/store/b7sbff937mm32lwv1a0dbh4mqhwkvpfn-shepherd.conf.
> herd: error: exception caught while executing 'load' on service 'root':
> In procedure fport_write: Input/output error
> Comparing /gnu/store/69shsgavrzkb408hvrfi4yjs26z7n112-home/profile/share/fonts and
> /gnu/store/dn7kk3d6hhmgjm89y2p3mrlwz0dgkzpy-home/profile/share/fonts... done (same)
> Evaluating on-change gexps.
> 
> On-change gexps evaluation finished.

This is another machine whose user shepherd instance is currently in this unresponsive state.




Information forwarded to bug-guix@HIDDEN:
bug#72166; Package guix. Full text available.

Message received at 72166 <at> debbugs.gnu.org:


Received: (at 72166) by debbugs.gnu.org; 18 Aug 2024 22:55:47 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Sun Aug 18 18:55:47 2024
Received: from localhost ([127.0.0.1]:56963 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1sfooN-0005QI-E5
	for submit <at> debbugs.gnu.org; Sun, 18 Aug 2024 18:55:47 -0400
Received: from fhigh8-smtp.messagingengine.com ([103.168.172.159]:55207)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <jonathan@HIDDEN>) id 1sfooL-0005Q2-CQ
 for 72166 <at> debbugs.gnu.org; Sun, 18 Aug 2024 18:55:45 -0400
Received: from phl-compute-08.internal (phl-compute-08.nyi.internal
 [10.202.2.48])
 by mailfhigh.nyi.internal (Postfix) with ESMTP id 138801146D52;
 Sun, 18 Aug 2024 18:54:59 -0400 (EDT)
Received: from phl-imap-02 ([10.202.2.81])
 by phl-compute-08.internal (MEProxy); Sun, 18 Aug 2024 18:54:59 -0400
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=terracrypt.net;
 h=cc:cc:content-type:content-type:date:date:from:from
 :in-reply-to:in-reply-to:message-id:mime-version:references
 :reply-to:subject:subject:to:to; s=fm1; t=1724021699; x=
 1724108099; bh=c0iK79uZzJOUWsfTQtlbHke8k+iP37uHJ8KhsLGo+70=; b=y
 /RPx79GJLi4ZLpm679RcftSE1MihPOxbWqwiseIW4Cjb0AS2Mx7qpUO1oo1PMADI
 E60UVzPU6ZiIqxxMQ73FGjuPYVH7ZohfmBTFLMZSRMKpHRrcmVkaPVZ16QpYEk5d
 cTCIyn9EYAzgpiB4Lz1hVK0zjKB/YxQvMGQfOILJxAfN16vjR8JqElq+FAQ/7N10
 re7UnJt7lJ+FfOlKaaPQNyfPnIO3WdDX18jgkgXaQFj76XcOgZ9nxKWek23eIPc/
 5qJ9vZl3Y3xdhtjXwVNyLX6YlQ+JquBgfDMebH6vopXt45Ws/qJs19U/K0V023wY
 oHr9FfuCznqE8YD5yolFA==
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=
 messagingengine.com; h=cc:cc:content-type:content-type:date:date
 :feedback-id:feedback-id:from:from:in-reply-to:in-reply-to
 :message-id:mime-version:references:reply-to:subject:subject:to
 :to:x-me-proxy:x-me-proxy:x-me-sender:x-me-sender:x-sasl-enc; s=
 fm3; t=1724021699; x=1724108099; bh=c0iK79uZzJOUWsfTQtlbHke8k+iP
 37uHJ8KhsLGo+70=; b=mWl1erGbUZxbnBd1vkd0tFDLtASMrWTMKg0LlXfoCozb
 RvA6xwx8dYd36P2zgj7e9b6f+fPbZXPIkByTct71kvG6evENksDQZbZepgkvAmo0
 v/VzB5cm/4fTNGR4MJZFnrQlh0LRA7Ekz3e/bkq1AQtnbWndcyMRJ8F8SdbtjkEe
 aqevdcsFKeoXIgf6/lg61zWilVTNjpmBauxQdQS0ISnUkW7FIzN7u9fe8pulPkpP
 8EwM20B+VoZTflf7A3aj0pkDa/MbWn6XmoZRd3S0VdWakJLIAnpGjyjdzMo7mB0n
 1VjOTmDRSFAVUDQOsRGAS3ezngQkzdjsjSNF47FRlA==
X-ME-Sender: <xms:wnvCZi9mK3g21RKgwJyriyugHRiq9PtgAqtbwDGQMEUQQnwn1GCN-g>
 <xme:wnvCZivYkmXj5Y11vyIvJnjEopIkwFh25On0t1pxxTtT0pnlQxRWeLEzuB34Syl21
 SE4woVt4GvcLCEg3Q>
X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgeeftddruddufedgudejucetufdoteggodetrfdotf
 fvucfrrhhofhhilhgvmecuhfgrshhtofgrihhlpdggtfgfnhhsuhgsshgtrhhisggvpdfu
 rfetoffkrfgpnffqhgenuceurghilhhouhhtmecufedttdenucesvcftvggtihhpihgvnh
 htshculddquddttddmnecujfgurhepofggfffhvfevkfgjfhfutgesrgdtreerredtjeen
 ucfhrhhomhepfdflohhnrghthhgrnhcuhfhrvgguvghrihgtkhhsohhnfdcuoehjohhnrg
 hthhgrnhesthgvrhhrrggtrhihphhtrdhnvghtqeenucggtffrrghtthgvrhhnpeeutdei
 keejieeggffgtdelfeefhfekffevvdfgfefgfeejudejveduudeihfeigeenucevlhhush
 htvghrufhiiigvpedtnecurfgrrhgrmhepmhgrihhlfhhrohhmpehjohhnrghthhgrnhes
 thgvrhhrrggtrhihphhtrdhnvghtpdhnsggprhgtphhtthhopedvpdhmohguvgepshhmth
 hpohhuthdprhgtphhtthhopeejvdduieeiseguvggssghughhsrdhgnhhurdhorhhgpdhr
 tghpthhtoheplhhuughosehgnhhurdhorhhg
X-ME-Proxy: <xmx:wnvCZoAEwbatDFhMzMgOad6GslMEushvcmfmBPpM5YOp4XniSDUMEg>
 <xmx:wnvCZqdWHaWYWIIxrlJPdQ4QQDlyP_5lfBzF6zruB_V2GrGFNQ0w4w>
 <xmx:wnvCZnOzj5keP9ZVqI3BOIYcg5YDXnZFbIsKU3p6vFbEy0GQNKhHbg>
 <xmx:wnvCZklWUyatadCXUB8KTmZgAo1RSV43MZPUcxjOfLL03G7XS_CBUw>
 <xmx:w3vCZt3vE9wtJD9XZsNgdt2eMB0HxMh5Ffnbek39_PYAqRfzI3MiWqJs>
Feedback-ID: if4194509:Fastmail
Received: by mailuser.nyi.internal (Postfix, from userid 501)
 id 921FE5E013B; Sun, 18 Aug 2024 18:54:58 -0400 (EDT)
X-Mailer: MessagingEngine.com Webmail Interface
MIME-Version: 1.0
Date: Sun, 18 Aug 2024 18:54:38 -0400
From: "Jonathan Frederickson" <jonathan@HIDDEN>
To: =?UTF-8?Q?Ludovic_Court=C3=A8s?= <ludo@HIDDEN>
Message-Id: <b559c5a9-1c87-43f7-a294-b5248e117b2d@HIDDEN>
In-Reply-To: <877ccg78gy.fsf@HIDDEN>
References: <df6e8894-fd84-446f-a67f-50cdcc9de5b3@HIDDEN>
 <878qxxtmwu.fsf@HIDDEN>
 <7974c622-e7d8-48b3-9948-14e8d7654793@HIDDEN>
 <87zfq9kiei.fsf@HIDDEN>
 <5477099d-fbc5-4acd-8320-f88ed3107de7@HIDDEN>
 <877ccg78gy.fsf@HIDDEN>
Subject: Re: bug#72166: Shepherd periodically goes unresponsive on one of my
 machines
Content-Type: multipart/alternative; boundary=95c41e12620f4db096e01778165e6242
X-Spam-Score: -0.7 (/)
X-Debbugs-Envelope-To: 72166
Cc: 72166 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.7 (-)

--95c41e12620f4db096e01778165e6242
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable

On Fri, Aug 16, 2024, at 12:12 PM, Ludovic Court=C3=A8s wrote:
> Could you share (maybe privately) the relevant excerpt of
> /var/log/messages?
>=20
> Could you also share (ideally) a minimum Guix System config and a
> sequence of commands to reproduce it?
>=20
> Are you able to reproduce it in =E2=80=98guix system vm=E2=80=99?

I'll send some logs to you privately; I don't actually see any logs that=
 look relevant in /var/log/messages but maybe you'll spot something I ha=
ven't.

I'll try to come up with a reproduction and a minimal system config to r=
eproduce the issue, though I will say that I haven't yet tracked down th=
e exact situation that triggers it.

(Perhaps notably, both this machine running Guix System and another mach=
ine running Guix Home on a foreign distro where I've experienced the sam=
e issue are running Sway, and I think swaylock may be related somehow.)

I'll also try reproducing it with 'guix system vm' if I can, though that=
 may be challenging as this machine only has 4 GB of RAM.
--95c41e12620f4db096e01778165e6242
Content-Type: text/html; charset=utf-8
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE html><html><head><title></title><style type=3D"text/css">p.Mso=
Normal,p.MsoNoSpacing{margin:0}</style></head><body><div>On Fri, Aug 16,=
 2024, at 12:12 PM, Ludovic Court=C3=A8s wrote:<br></div><blockquote typ=
e=3D"cite" id=3D"qt" style=3D""><div>Could you share (maybe privately) t=
he relevant excerpt of<br></div><div>/var/log/messages?<br></div><div><b=
r></div><div>Could you also share (ideally) a minimum Guix System config=
 and a<br></div><div>sequence of commands to reproduce it?<br></div><div=
><br></div><div>Are you able to reproduce it in =E2=80=98guix system vm=E2=
=80=99?<br></div></blockquote><div><br></div><div>I'll send some logs to=
 you privately; I don't actually see any logs that look relevant in /var=
/log/messages but maybe you'll spot something I haven't.<br></div><div><=
br></div><div>I'll try to come up with a reproduction and a minimal syst=
em config to reproduce the issue, though I will say that I haven't yet t=
racked down the exact situation that triggers it.<br></div><div><br></di=
v><div>(Perhaps notably, both this machine running Guix System and anoth=
er machine running Guix Home on a foreign distro where I've experienced =
the same issue are running Sway, and I think swaylock may be related som=
ehow.)<br></div><div><br></div><div>I'll also try reproducing it with 'g=
uix system vm' if I can, though that may be challenging as this machine =
only has 4 GB of RAM.<br></div></body></html>
--95c41e12620f4db096e01778165e6242--




Information forwarded to bug-guix@HIDDEN:
bug#72166; Package guix. Full text available.

Message received at 72166 <at> debbugs.gnu.org:


Received: (at 72166) by debbugs.gnu.org; 16 Aug 2024 16:13:01 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Fri Aug 16 12:13:01 2024
Received: from localhost ([127.0.0.1]:52749 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1sezZV-00085E-8D
	for submit <at> debbugs.gnu.org; Fri, 16 Aug 2024 12:13:01 -0400
Received: from eggs.gnu.org ([209.51.188.92]:47998)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <ludo@HIDDEN>) id 1sezZT-00084x-DE
 for 72166 <at> debbugs.gnu.org; Fri, 16 Aug 2024 12:13:00 -0400
Received: from fencepost.gnu.org ([2001:470:142:3::e])
 by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1) (envelope-from <ludo@HIDDEN>)
 id 1sezYl-0007Bf-L5; Fri, 16 Aug 2024 12:12:16 -0400
DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=gnu.org;
 s=fencepost-gnu-org; h=MIME-Version:Date:References:In-Reply-To:Subject:To:
 From; bh=jJ1IM6SsD0D2uKp02IJ5/Z79mDDw0febsqAApVOMXRk=; b=mdD1BEMOgIFchhYjXSCJ
 9GyoEF2AV74T7xtiVqCYdcSssSwsY35GusQXB3D3bOQ1EtI8PKXXPYdBf+UKG5RPCCS/oFUVDnfIy
 eGwO98iHD+kAxVNRKYjiOAaah2dsS+1beZ2eDjjtevKD9yPCAjeucMCuB47Xj/priHWUGneVN6C2i
 CFvOurp3bnNFGZ3msxsurwKyqcGbewWwylCPz/xgaCPUWGAWR9kBLms73L/gRSF8zoj95uyjbaMQX
 xP1gCLwvcU+BWMw1KSP3OTZRwDQQe8EDMCIgpGowkMw/H6kVtHpIMHqCYM0NYn0yYO/rDt2tETIW6
 BOtHYWJXlfC5MQ==;
From: =?utf-8?Q?Ludovic_Court=C3=A8s?= <ludo@HIDDEN>
To: "Jonathan Frederickson" <jonathan@HIDDEN>
Subject: Re: bug#72166: Shepherd periodically goes unresponsive on one of my
 machines
In-Reply-To: <5477099d-fbc5-4acd-8320-f88ed3107de7@HIDDEN> (Jonathan
 Frederickson's message of "Wed, 24 Jul 2024 20:08:38 -0400")
References: <df6e8894-fd84-446f-a67f-50cdcc9de5b3@HIDDEN>
 <878qxxtmwu.fsf@HIDDEN>
 <7974c622-e7d8-48b3-9948-14e8d7654793@HIDDEN>
 <87zfq9kiei.fsf@HIDDEN>
 <5477099d-fbc5-4acd-8320-f88ed3107de7@HIDDEN>
X-URL: http://www.fdn.fr/~lcourtes/
X-Revolutionary-Date: =?utf-8?Q?D=C3=A9cadi?= 30 Thermidor an 232 de la
 =?utf-8?Q?R=C3=A9volution=2C?= jour du Moulin
X-PGP-Key-ID: 0x090B11993D9AEBB5
X-PGP-Key: http://www.fdn.fr/~lcourtes/ludovic.asc
X-PGP-Fingerprint: 3CE4 6455 8A84 FDC6 9DB4  0CFB 090B 1199 3D9A EBB5
X-OS: x86_64-pc-linux-gnu
Date: Fri, 16 Aug 2024 18:12:13 +0200
Message-ID: <877ccg78gy.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13)
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable
X-Spam-Score: -2.3 (--)
X-Debbugs-Envelope-To: 72166
Cc: 72166 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -3.3 (---)

Hi,

"Jonathan Frederickson" <jonathan@HIDDEN> skribis:

> I've gotten this machine upgraded to 0.10.5 and just experienced the same=
 thing again:

Argh.

> jfred@terracard ~$ ps aux | grep swaylo
> jfred      544  0.0  0.0   3700  2432 ?        S    19:02   0:00 swayidle=
 -w timeout 300 swaylock -f -i ~/.wallpapers/user-manual.jpg timeout 10 if =
pgrep swaylock; then swaymsg "output * dpms off"; fi resume swaymsg "output=
 * dpms on" before-sleep swaylock -f -i ~/.wallpapers/user-manual.jpg
> jfred     1956  0.0  0.0      0     0 ?        Z    19:22   0:00 [swayloc=
k] <defunct>
> jfred     1957  0.0  0.0      0     0 ?        Zs   19:22   0:00 [swayloc=
k] <defunct>
> jfred     2162  0.0  0.0      0     0 ?        Z    19:38   0:00 [swayloc=
k] <defunct>
> jfred     2163  0.0  0.0      0     0 ?        Zs   19:38   0:00 [swayloc=
k] <defunct>
> jfred     2604  0.0  0.0   6116  2432 pts/2    S+   20:04   0:00 grep --c=
olor=3Dauto swaylo
> jfred@terracard ~$ cat /proc/1/cmdline=20
> /gnu/store/bhynhk0c6ssq3fqqc59fvhxjzwywsjbb-guile-3.0.9/bin/guile--no-aut=
o-compile/gnu/store/wrmyav254ydjn9cad3q169fxg7x6p80b-shepherd-0.10.5/bin/sh=
epherd--config/gnu/store/sfjww12mipyx4zxa6i9x8nxxfyb7h3y4-shepherd.conf
>
> Of note, I haven't run 'guix system reconfigure' or any manual 'herd' com=
mands on this machine since boot.

Could you share (maybe privately) the relevant excerpt of
/var/log/messages?

Could you also share (ideally) a minimum Guix System config and a
sequence of commands to reproduce it?

Are you able to reproduce it in =E2=80=98guix system vm=E2=80=99?

Thanks,
Ludo=E2=80=99.




Information forwarded to bug-guix@HIDDEN:
bug#72166; Package guix. Full text available.

Message received at 72166 <at> debbugs.gnu.org:


Received: (at 72166) by debbugs.gnu.org; 25 Jul 2024 00:09:16 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Wed Jul 24 20:09:16 2024
Received: from localhost ([127.0.0.1]:34693 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1sWm2l-00026x-QG
	for submit <at> debbugs.gnu.org; Wed, 24 Jul 2024 20:09:16 -0400
Received: from fhigh2-smtp.messagingengine.com ([103.168.172.153]:41859)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <jonathan@HIDDEN>) id 1sWm2i-00026i-6G
 for 72166 <at> debbugs.gnu.org; Wed, 24 Jul 2024 20:09:13 -0400
Received: from compute8.internal (compute8.nyi.internal [10.202.2.227])
 by mailfhigh.nyi.internal (Postfix) with ESMTP id 034B9114010C;
 Wed, 24 Jul 2024 20:09:00 -0400 (EDT)
Received: from wimap21 ([10.202.2.81])
 by compute8.internal (MEProxy); Wed, 24 Jul 2024 20:09:00 -0400
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=terracrypt.net;
 h=cc:cc:content-transfer-encoding:content-type:content-type
 :date:date:from:from:in-reply-to:in-reply-to:message-id
 :mime-version:references:reply-to:subject:subject:to:to; s=fm1;
 t=1721866139; x=1721952539; bh=QNpn52zwfVj/lAGK1ieR5r8Ih1r/8F7o
 VZK4MjfeENQ=; b=2eyWduy5IZmEGVLWG1v/3IOflq1Z4ylPT5xXAsoK2Xb6ZMd4
 0tL9AAwmU4+PimQZMaqnSXJOX/fPMoTa+/rtKWFflFOGjambauhU1MRKIiktHslK
 7MVh76j3vlQEVbF0P/13+qfFhoH4P1YiPSIfqJ9bivp+31cKBMtHx6iQakb2te2K
 Y9MhOJzUxzkNvLrcaHWMuRzAvPB+s0X1fKgfOI4WROLHnyAIF243o/yZBJwpcuRr
 x5f9yPDhMBVkH/xaebzwOOSIVux6RKaF4pixROKoencsEgs7jjmvjyDbv8lUXZGo
 mO58QUQ9bRw2BEvrRTitXW+q2B1WPFQQsVh7eA==
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=
 messagingengine.com; h=cc:cc:content-transfer-encoding
 :content-type:content-type:date:date:feedback-id:feedback-id
 :from:from:in-reply-to:in-reply-to:message-id:mime-version
 :references:reply-to:subject:subject:to:to:x-me-proxy:x-me-proxy
 :x-me-sender:x-me-sender:x-sasl-enc; s=fm3; t=1721866139; x=
 1721952539; bh=QNpn52zwfVj/lAGK1ieR5r8Ih1r/8F7oVZK4MjfeENQ=; b=O
 jVQD1HBwqxnfhXSKLJwiuoYVjAQl4a/wciPS8MYkW8Y/ZJ1Bu1y2LuIoOxN4F9+I
 FSjuqCZ5fU2TTZ4jbIim13Yle7YydP+BA3UiUF866zsIJQEVBA0ZCN7whwoED+jg
 HIfmOi8vK31pNQBZD0a0ZEdQ6zxDgR1XduU0kzX7oO2Ct5yWc91sNcgzj2fUM9A2
 nJo2jfWcbeEvP9PcQm3wp+7xjzp4nQYgvo/5yhkcjVUuAC8EhdB4w+t9Zzn/JkG1
 VXAHbkxDruV4IQ+nPTeP6oZBoS56hpFGmAK6mVsbSYzV5Y4blwd22S1XonLmPmrE
 zzLKAnV/wqsZLGBFQNczQ==
X-ME-Sender: <xms:m5ehZuLmtm7HBFKx4hl2rBNJWa_CY_cIrvyYzaO3psF-apjC4-PL2w>
 <xme:m5ehZmJ2nZMIOwLGY9BR_6OaMj_v8w840wQ_iH3L8-tTPsdl3pwbznCsQRk9NK5t3
 817bD1kf9KlVqFxKQ>
X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgeeftddriedvgdefvdcutefuodetggdotefrodftvf
 curfhrohhfihhlvgemucfhrghsthforghilhdpqfgfvfdpuffrtefokffrpgfnqfghnecu
 uegrihhlohhuthemuceftddtnecusecvtfgvtghiphhivghnthhsucdlqddutddtmdenuc
 fjughrpefofgggkfgjfhffhffvvefutgfgsehtqhertderreejnecuhfhrohhmpedflfho
 nhgrthhhrghnucfhrhgvuggvrhhitghkshhonhdfuceojhhonhgrthhhrghnsehtvghrrh
 grtghrhihpthdrnhgvtheqnecuggftrfgrthhtvghrnhepfffhgfeujedvffdtveejvdeh
 geekgfetudevtdfgleekleetheeffedufeejveefnecuffhomhgrihhnpehgnhhurdhorh
 hgnecuvehluhhsthgvrhfuihiivgeptdenucfrrghrrghmpehmrghilhhfrhhomhepjhho
 nhgrthhhrghnsehtvghrrhgrtghrhihpthdrnhgvthdpnhgspghrtghpthhtoheptd
X-ME-Proxy: <xmx:m5ehZuvLH2gGEpxtfTkUC6IxcZIgEiLG5wqDpz5G31CRpjIwwmclPA>
 <xmx:m5ehZjZhRvLMH6pbv-I0QrMeKmQw6nkl0Z5vV68CIKIZsZriFFQaFQ>
 <xmx:m5ehZlasWRGZnSSdUIjCkcVBpMz1QOyW89NIq57X2DbaxhWvyvRutA>
 <xmx:m5ehZvBAMujb2THk_H3N9zGYe3tat5GbFeLmnLj--T47JW42Gctqgw>
 <xmx:m5ehZkwdmKMDhP6OWHU3JPQceaahyLQIwKKuMTmEfP4nTvLjkatO5-mP>
Feedback-ID: if4194509:Fastmail
Received: by mailuser.nyi.internal (Postfix, from userid 501)
 id AB4CF37A0084; Wed, 24 Jul 2024 20:08:59 -0400 (EDT)
X-Mailer: MessagingEngine.com Webmail Interface
User-Agent: Cyrus-JMAP/3.11.0-alpha0-582-g5a02f8850-fm-20240719.002-g5a02f885
MIME-Version: 1.0
Message-Id: <5477099d-fbc5-4acd-8320-f88ed3107de7@HIDDEN>
In-Reply-To: <87zfq9kiei.fsf@HIDDEN>
References: <df6e8894-fd84-446f-a67f-50cdcc9de5b3@HIDDEN>
 <878qxxtmwu.fsf@HIDDEN>
 <7974c622-e7d8-48b3-9948-14e8d7654793@HIDDEN>
 <87zfq9kiei.fsf@HIDDEN>
Date: Wed, 24 Jul 2024 20:08:38 -0400
From: "Jonathan Frederickson" <jonathan@HIDDEN>
To: =?UTF-8?Q?Ludovic_Court=C3=A8s?= <ludo@HIDDEN>
Subject: Re: bug#72166: Shepherd periodically goes unresponsive on one of my
 machines
Content-Type: text/plain;charset=utf-8
Content-Transfer-Encoding: quoted-printable
X-Spam-Score: -0.7 (/)
X-Debbugs-Envelope-To: 72166
Cc: 72166 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.7 (-)

On Mon, Jul 22, 2024, at 3:14 AM, Ludovic Court=C3=A8s wrote:
> Hi,
>=20
> "Jonathan Frederickson" <jonathan@HIDDEN> skribis:
>=20
> > Hi Ludo, thanks for the troubleshooting help. Looks like I'm running=
 0.10.4:
> >
> > jfred@terracard ~$ cat /proc/1/cmdline | xargs -0
> > /gnu/store/bhynhk0c6ssq3fqqc59fvhxjzwywsjbb-guile-3.0.9/bin/guile --=
no-auto-compile /gnu/store/39li5qpiaj1lx89xgahlbgvfnjhpcpwg-shepherd-0.1=
0.4/bin/shepherd --config /gnu/store/hfyri6ygfdjq4w3nkha2ypa2k98hhfxj-sh=
epherd.conf
> >
> > I see now that 0.10.5 was released a few weeks ago, does that have a=
 fix that could be related?
>=20
> Yes, it could be related.  Per the =E2=80=98NEWS=E2=80=99 file of Shep=
herd:
>=20
>   ** =E2=80=98herd unload root SERVICE=E2=80=99 no longer hands when t=
here=E2=80=99s a replacement
>      (<https://issues.guix.gnu.org/71478>)
>=20
>   It used to be that, for a running service S that has a replacement r=
egistered,
>   =E2=80=98herd unload root S=E2=80=99 would hang shepherd, making it =
totally unresponsive=E2=80=94=E2=80=98herd
>   status=E2=80=99, =E2=80=98halt=E2=80=99, etc. would hang forever, an=
d inetd-style services would no
>   longer start, etc.  This is now fixed.
>=20
> Depending on previous =E2=80=98guix system reconfigure=E2=80=99 invoca=
tions on these
> machines, it=E2=80=99s possible that you ended up in this state.
>=20
> Would be great if you could upgrade and see if the problem still occur=
s.
>=20
> Thanks,
> Ludo=E2=80=99.

I've gotten this machine upgraded to 0.10.5 and just experienced the sam=
e thing again:

jfred@terracard ~$ ps aux | grep swaylo
jfred      544  0.0  0.0   3700  2432 ?        S    19:02   0:00 swayidl=
e -w timeout 300 swaylock -f -i ~/.wallpapers/user-manual.jpg timeout 10=
 if pgrep swaylock; then swaymsg "output * dpms off"; fi resume swaymsg =
"output * dpms on" before-sleep swaylock -f -i ~/.wallpapers/user-manual=
.jpg
jfred     1956  0.0  0.0      0     0 ?        Z    19:22   0:00 [swaylo=
ck] <defunct>
jfred     1957  0.0  0.0      0     0 ?        Zs   19:22   0:00 [swaylo=
ck] <defunct>
jfred     2162  0.0  0.0      0     0 ?        Z    19:38   0:00 [swaylo=
ck] <defunct>
jfred     2163  0.0  0.0      0     0 ?        Zs   19:38   0:00 [swaylo=
ck] <defunct>
jfred     2604  0.0  0.0   6116  2432 pts/2    S+   20:04   0:00 grep --=
color=3Dauto swaylo
jfred@terracard ~$ cat /proc/1/cmdline=20
/gnu/store/bhynhk0c6ssq3fqqc59fvhxjzwywsjbb-guile-3.0.9/bin/guile--no-au=
to-compile/gnu/store/wrmyav254ydjn9cad3q169fxg7x6p80b-shepherd-0.10.5/bi=
n/shepherd--config/gnu/store/sfjww12mipyx4zxa6i9x8nxxfyb7h3y4-shepherd.c=
onf

Of note, I haven't run 'guix system reconfigure' or any manual 'herd' co=
mmands on this machine since boot.




Information forwarded to bug-guix@HIDDEN:
bug#72166; Package guix. Full text available.

Message received at 72166 <at> debbugs.gnu.org:


Received: (at 72166) by debbugs.gnu.org; 22 Jul 2024 07:14:42 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Mon Jul 22 03:14:42 2024
Received: from localhost ([127.0.0.1]:56741 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1sVnFq-0006A4-F6
	for submit <at> debbugs.gnu.org; Mon, 22 Jul 2024 03:14:42 -0400
Received: from hera.aquilenet.fr ([185.233.100.1]:43500)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <ludo@HIDDEN>) id 1sVnFo-00069o-7Z
 for 72166 <at> debbugs.gnu.org; Mon, 22 Jul 2024 03:14:41 -0400
Received: from localhost (localhost [127.0.0.1])
 by hera.aquilenet.fr (Postfix) with ESMTP id D09A4207;
 Mon, 22 Jul 2024 09:14:31 +0200 (CEST)
X-Virus-Scanned: Debian amavisd-new at hera.aquilenet.fr
Received: from hera.aquilenet.fr ([127.0.0.1])
 by localhost (hera.aquilenet.fr [127.0.0.1]) (amavisd-new, port 10024)
 with ESMTP id LtT8vylRAwr0; Mon, 22 Jul 2024 09:14:31 +0200 (CEST)
Received: from ribbon (unknown [193.50.110.239])
 by hera.aquilenet.fr (Postfix) with ESMTPSA id 59E803C;
 Mon, 22 Jul 2024 09:14:31 +0200 (CEST)
From: =?utf-8?Q?Ludovic_Court=C3=A8s?= <ludo@HIDDEN>
To: "Jonathan Frederickson" <jonathan@HIDDEN>
Subject: Re: bug#72166: Shepherd periodically goes unresponsive on one of my
 machines
In-Reply-To: <7974c622-e7d8-48b3-9948-14e8d7654793@HIDDEN> (Jonathan
 Frederickson's message of "Fri, 19 Jul 2024 12:25:37 -0400")
References: <df6e8894-fd84-446f-a67f-50cdcc9de5b3@HIDDEN>
 <878qxxtmwu.fsf@HIDDEN>
 <7974c622-e7d8-48b3-9948-14e8d7654793@HIDDEN>
Date: Mon, 22 Jul 2024 09:14:29 +0200
Message-ID: <87zfq9kiei.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13)
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable
X-Spam-Score: 1.0 (+)
X-Debbugs-Envelope-To: 72166
Cc: 72166 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -0.0 (/)

Hi,

"Jonathan Frederickson" <jonathan@HIDDEN> skribis:

> Hi Ludo, thanks for the troubleshooting help. Looks like I'm running 0.10=
.4:
>
> jfred@terracard ~$ cat /proc/1/cmdline | xargs -0
> /gnu/store/bhynhk0c6ssq3fqqc59fvhxjzwywsjbb-guile-3.0.9/bin/guile --no-au=
to-compile /gnu/store/39li5qpiaj1lx89xgahlbgvfnjhpcpwg-shepherd-0.10.4/bin/=
shepherd --config /gnu/store/hfyri6ygfdjq4w3nkha2ypa2k98hhfxj-shepherd.conf
>
> I see now that 0.10.5 was released a few weeks ago, does that have a fix =
that could be related?

Yes, it could be related.  Per the =E2=80=98NEWS=E2=80=99 file of Shepherd:

  ** =E2=80=98herd unload root SERVICE=E2=80=99 no longer hands when there=
=E2=80=99s a replacement
     (<https://issues.guix.gnu.org/71478>)

  It used to be that, for a running service S that has a replacement regist=
ered,
  =E2=80=98herd unload root S=E2=80=99 would hang shepherd, making it total=
ly unresponsive=E2=80=94=E2=80=98herd
  status=E2=80=99, =E2=80=98halt=E2=80=99, etc. would hang forever, and ine=
td-style services would no
  longer start, etc.  This is now fixed.

Depending on previous =E2=80=98guix system reconfigure=E2=80=99 invocations=
 on these
machines, it=E2=80=99s possible that you ended up in this state.

Would be great if you could upgrade and see if the problem still occurs.

Thanks,
Ludo=E2=80=99.




Information forwarded to bug-guix@HIDDEN:
bug#72166; Package guix. Full text available.

Message received at 72166 <at> debbugs.gnu.org:


Received: (at 72166) by debbugs.gnu.org; 19 Jul 2024 16:26:06 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Fri Jul 19 12:26:06 2024
Received: from localhost ([127.0.0.1]:50261 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1sUqQn-0002to-R3
	for submit <at> debbugs.gnu.org; Fri, 19 Jul 2024 12:26:06 -0400
Received: from fhigh3-smtp.messagingengine.com ([103.168.172.154]:56121)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <jonathan@HIDDEN>) id 1sUqQk-0002tI-G1
 for 72166 <at> debbugs.gnu.org; Fri, 19 Jul 2024 12:26:04 -0400
Received: from compute8.internal (compute8.nyi.internal [10.202.2.227])
 by mailfhigh.nyi.internal (Postfix) with ESMTP id CD1B7114031F;
 Fri, 19 Jul 2024 12:25:57 -0400 (EDT)
Received: from wimap21 ([10.202.2.81])
 by compute8.internal (MEProxy); Fri, 19 Jul 2024 12:25:57 -0400
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=terracrypt.net;
 h=cc:cc:content-transfer-encoding:content-type:content-type
 :date:date:from:from:in-reply-to:in-reply-to:message-id
 :mime-version:references:reply-to:subject:subject:to:to; s=fm1;
 t=1721406357; x=1721492757; bh=0GMiKGNSEMi5xNIcqKzUjE/xdKeG68LX
 QcR8g/cUOVo=; b=DxIAPcC54vOaIT02ri7S7NOi2xxBR3xPl5xcAtJaf3mC1kpd
 dC56Xba4veS1x0/28A5fdSilbmJvIrkG1Hx+wx13kQRp+DAs0Zq9dkDcZzDPk2Hc
 qvolJQRHFYgWp8hkE0dwG1pvNvWCiJ/RzVUK7VitQioheLtCpP1AslKTR+hmW5Nd
 zTYSLhZZ1mbPR2doyZQV4pNy4UtcKnDaCMm483LZsQwa0f46UW16jgU9tipsy00T
 0gxQi2eyM4+itMp+psf1MzSzGiIf0JxEN1HYHBg8OpPIo//SsXHzcW/J5btV0poF
 qb6+DihKf/zLp7vvg7zpHQwvWxawcola7MyAyg==
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=
 messagingengine.com; h=cc:cc:content-transfer-encoding
 :content-type:content-type:date:date:feedback-id:feedback-id
 :from:from:in-reply-to:in-reply-to:message-id:mime-version
 :references:reply-to:subject:subject:to:to:x-me-proxy:x-me-proxy
 :x-me-sender:x-me-sender:x-sasl-enc; s=fm3; t=1721406357; x=
 1721492757; bh=0GMiKGNSEMi5xNIcqKzUjE/xdKeG68LXQcR8g/cUOVo=; b=P
 gEuAUM6HJEGn6tLHiG8W40xJbzN7DgA1rYztnnvvHK4RmSe9fz3FC5wpV8hxvVgv
 jd2q0z7WRtC0PZnpuhAkVtZqaezy+bYLUjuWF38pE+niic1y2KCgtoJcnnAVRWmP
 wM/RyqjME76LubiX53dKcx0heN8dLxlUnPNbDZe3reYQjr4vJMMHOPbr8BbNo65f
 oWibhk1uE+v98WktnQf+O7mxxBc8cCHVntE60hYdjT+CcOUvDWq3KFawoAj8SCCV
 vz8S4R7X6FuC+QhVHuZO3VHisl3lwdwoigGHxyiROsjsqIQk306mFGBU26KFBaNy
 eJF/j9puZMZEBS2y+3W/Q==
X-ME-Sender: <xms:lZOaZnunoJYC_1mOFhYrBl6M-XVV7DBIUADfLpcTp4Siu5fNymPMFg>
 <xme:lZOaZoftOdDqe2jZYLXNmrVERxfcN8wzG9m1yHQvtE1HH0dXBILweAy7G_MClRMvg
 pAwD8K9exwp1imGPg>
X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgeeftddrhedugddutddtucetufdoteggodetrfdotf
 fvucfrrhhofhhilhgvmecuhfgrshhtofgrihhlpdfqfgfvpdfurfetoffkrfgpnffqhgen
 uceurghilhhouhhtmecufedttdenucesvcftvggtihhpihgvnhhtshculddquddttddmne
 cujfgurhepofgfggfkjghffffhvfevufgtgfesthhqredtreerjeenucfhrhhomhepfdfl
 ohhnrghthhgrnhcuhfhrvgguvghrihgtkhhsohhnfdcuoehjohhnrghthhgrnhesthgvrh
 hrrggtrhihphhtrdhnvghtqeenucggtffrrghtthgvrhhnpeejudelledvueetgfetleel
 vdelheefhefhgfdthfffhfelkeevgeekvdeffeekjeenucevlhhushhtvghrufhiiigvpe
 dtnecurfgrrhgrmhepmhgrihhlfhhrohhmpehjohhnrghthhgrnhesthgvrhhrrggtrhih
 phhtrdhnvght
X-ME-Proxy: <xmx:lZOaZqzE5bWwhAbd4YlFnziTSu-7TNV29qXcv7cMqrzDTgr2Zm3xbA>
 <xmx:lZOaZmOx-pz4dvQqZkM8As4UZE-8v_I-Udsdn8VES0x_98wDTMpIpQ>
 <xmx:lZOaZn89pK-0DSnMbNkvrS5OnwLCeJJSuiGO7MaN_zI2YHyy9NjwNQ>
 <xmx:lZOaZmXlKKymEafTT1GxtaLTLUto4W-KKwbH4qsnu4Nka1XYzvZNLQ>
 <xmx:lZOaZhlmIyw-L4y7-cmhc1TAKNXnafHGkCx6fx5bGGjKWgvuCytxPRDN>
Feedback-ID: if4194509:Fastmail
Received: by mailuser.nyi.internal (Postfix, from userid 501)
 id 8433A37A0084; Fri, 19 Jul 2024 12:25:57 -0400 (EDT)
X-Mailer: MessagingEngine.com Webmail Interface
User-Agent: Cyrus-JMAP/3.11.0-alpha0-568-g843fbadbe-fm-20240701.003-g843fbadb
MIME-Version: 1.0
Message-Id: <7974c622-e7d8-48b3-9948-14e8d7654793@HIDDEN>
In-Reply-To: <878qxxtmwu.fsf@HIDDEN>
References: <df6e8894-fd84-446f-a67f-50cdcc9de5b3@HIDDEN>
 <878qxxtmwu.fsf@HIDDEN>
Date: Fri, 19 Jul 2024 12:25:37 -0400
From: "Jonathan Frederickson" <jonathan@HIDDEN>
To: =?UTF-8?Q?Ludovic_Court=C3=A8s?= <ludo@HIDDEN>
Subject: Re: bug#72166: Shepherd periodically goes unresponsive on one of my
 machines
Content-Type: text/plain;charset=utf-8
Content-Transfer-Encoding: quoted-printable
X-Spam-Score: -0.7 (/)
X-Debbugs-Envelope-To: 72166
Cc: 72166 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.7 (-)

On Fri, Jul 19, 2024, at 11:35 AM, Ludovic Court=C3=A8s wrote:
> Hi Jonathan,
>=20
> "Jonathan Frederickson" <jonathan@HIDDEN> skribis:
>=20
> > I've been running into an issue with Shepherd on one of my machines.=
 Every so often (and I haven't figured out what conditions trigger it), =
my Shepherd instances (both home and PID 1) will go unresponsive. I thou=
ght I had tracked it down to a misbehaving home service that I had confi=
gured, but it's just happened again without that service running.
> >
> > 'herd status' hangs indefinitely:
> >
> > jfred@terracard ~$ sudo herd status
> > Password:=20
> > <never returns>
> >
> > ...on both instances:
> >
> > jfred@terracard ~$ herd status
> > <never returns>
>=20
> Ouch.  What version of shepherd is running?  (You can view it with
> =E2=80=9Ccat /proc/1/cmdline | xargs -0=E2=80=9D.)
>=20
> > The PID 1 shepherd instance isn't reaping defunct processes:
> >
> > jfred@terracard ~$ ps aux | grep -i lock
> > jfred      541  0.0  0.0   3700  2304 ?        S    18:30   0:00 swa=
yidle -w timeout 300 swaylock -f -i ~/.wallpapers/user-manual.jpg timeou=
t 10 if pgrep swaylock; then swaymsg "output * dpms off"; fi resume sway=
msg "output * dpms on" before-sleep swaylock -f -i ~/.wallpapers/user-ma=
nual.jpg
> > jfred     3111  0.0  0.0      0     0 ?        Z    18:53   0:00 [sw=
aylock] <defunct>
> > jfred     3112  0.0  0.0      0     0 ?        Zs   18:53   0:00 [sw=
aylock] <defunct>
> >
> > Some further troubleshooting... strace indicates that it's waiting o=
n a read() on its fd 9:
>=20
> Interesting.  There were bugs in earlier 0.10.x version that could cau=
se
> this sort of thing; let=E2=80=99s see what version you have, first.
>=20
> Ludo=E2=80=99.
>=20

Hi Ludo, thanks for the troubleshooting help. Looks like I'm running 0.1=
0.4:

jfred@terracard ~$ cat /proc/1/cmdline | xargs -0
/gnu/store/bhynhk0c6ssq3fqqc59fvhxjzwywsjbb-guile-3.0.9/bin/guile --no-a=
uto-compile /gnu/store/39li5qpiaj1lx89xgahlbgvfnjhpcpwg-shepherd-0.10.4/=
bin/shepherd --config /gnu/store/hfyri6ygfdjq4w3nkha2ypa2k98hhfxj-shephe=
rd.conf

I see now that 0.10.5 was released a few weeks ago, does that have a fix=
 that could be related?




Information forwarded to bug-guix@HIDDEN:
bug#72166; Package guix. Full text available.

Message received at 72166 <at> debbugs.gnu.org:


Received: (at 72166) by debbugs.gnu.org; 19 Jul 2024 15:36:12 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Fri Jul 19 11:36:11 2024
Received: from localhost ([127.0.0.1]:50213 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1sUpeV-0001fH-J4
	for submit <at> debbugs.gnu.org; Fri, 19 Jul 2024 11:36:11 -0400
Received: from hera.aquilenet.fr ([185.233.100.1]:60734)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <ludo@HIDDEN>) id 1sUpeQ-0001ej-3y
 for 72166 <at> debbugs.gnu.org; Fri, 19 Jul 2024 11:36:10 -0400
Received: from localhost (localhost [127.0.0.1])
 by hera.aquilenet.fr (Postfix) with ESMTP id A8B301F24;
 Fri, 19 Jul 2024 17:35:30 +0200 (CEST)
X-Virus-Scanned: Debian amavisd-new at hera.aquilenet.fr
Received: from hera.aquilenet.fr ([127.0.0.1])
 by localhost (hera.aquilenet.fr [127.0.0.1]) (amavisd-new, port 10024)
 with ESMTP id wa4uM9Ge4WZP; Fri, 19 Jul 2024 17:35:30 +0200 (CEST)
Received: from ribbon (91-160-117-201.subs.proxad.net [91.160.117.201])
 by hera.aquilenet.fr (Postfix) with ESMTPSA id 1FAE21EE6;
 Fri, 19 Jul 2024 17:35:30 +0200 (CEST)
From: =?utf-8?Q?Ludovic_Court=C3=A8s?= <ludo@HIDDEN>
To: "Jonathan Frederickson" <jonathan@HIDDEN>
Subject: Re: bug#72166: Shepherd periodically goes unresponsive on one of my
 machines
In-Reply-To: <df6e8894-fd84-446f-a67f-50cdcc9de5b3@HIDDEN> (Jonathan
 Frederickson's message of "Wed, 17 Jul 2024 20:43:15 -0400")
References: <df6e8894-fd84-446f-a67f-50cdcc9de5b3@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13)
Date: Fri, 19 Jul 2024 17:35:29 +0200
Message-ID: <878qxxtmwu.fsf@HIDDEN>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable
X-Spam-Score: 1.0 (+)
X-Debbugs-Envelope-To: 72166
Cc: 72166 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -0.0 (/)

Hi Jonathan,

"Jonathan Frederickson" <jonathan@HIDDEN> skribis:

> I've been running into an issue with Shepherd on one of my machines. Ever=
y so often (and I haven't figured out what conditions trigger it), my Sheph=
erd instances (both home and PID 1) will go unresponsive. I thought I had t=
racked it down to a misbehaving home service that I had configured, but it'=
s just happened again without that service running.
>
> 'herd status' hangs indefinitely:
>
> jfred@terracard ~$ sudo herd status
> Password:=20
> <never returns>
>
> ...on both instances:
>
> jfred@terracard ~$ herd status
> <never returns>

Ouch.  What version of shepherd is running?  (You can view it with
=E2=80=9Ccat /proc/1/cmdline | xargs -0=E2=80=9D.)

> The PID 1 shepherd instance isn't reaping defunct processes:
>
> jfred@terracard ~$ ps aux | grep -i lock
> jfred      541  0.0  0.0   3700  2304 ?        S    18:30   0:00 swayidle=
 -w timeout 300 swaylock -f -i ~/.wallpapers/user-manual.jpg timeout 10 if =
pgrep swaylock; then swaymsg "output * dpms off"; fi resume swaymsg "output=
 * dpms on" before-sleep swaylock -f -i ~/.wallpapers/user-manual.jpg
> jfred     3111  0.0  0.0      0     0 ?        Z    18:53   0:00 [swayloc=
k] <defunct>
> jfred     3112  0.0  0.0      0     0 ?        Zs   18:53   0:00 [swayloc=
k] <defunct>
>
> Some further troubleshooting... strace indicates that it's waiting on a r=
ead() on its fd 9:

Interesting.  There were bugs in earlier 0.10.x version that could cause
this sort of thing; let=E2=80=99s see what version you have, first.

Ludo=E2=80=99.




Information forwarded to bug-guix@HIDDEN:
bug#72166; Package guix. Full text available.

Message received at submit <at> debbugs.gnu.org:


Received: (at submit) by debbugs.gnu.org; 18 Jul 2024 00:43:47 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Wed Jul 17 20:43:47 2024
Received: from localhost ([127.0.0.1]:36414 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1sUFFL-0001UP-CN
	for submit <at> debbugs.gnu.org; Wed, 17 Jul 2024 20:43:47 -0400
Received: from lists.gnu.org ([209.51.188.17]:33914)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <jonathan@HIDDEN>) id 1sUFFJ-0001UG-Mp
 for submit <at> debbugs.gnu.org; Wed, 17 Jul 2024 20:43:46 -0400
Received: from eggs.gnu.org ([2001:470:142:3::10])
 by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1) (envelope-from <jonathan@HIDDEN>)
 id 1sUFFF-0004vs-UU
 for bug-guix@HIDDEN; Wed, 17 Jul 2024 20:43:42 -0400
Received: from fhigh2-smtp.messagingengine.com ([103.168.172.153])
 by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1) (envelope-from <jonathan@HIDDEN>)
 id 1sUFFE-00083z-1Q
 for bug-guix@HIDDEN; Wed, 17 Jul 2024 20:43:41 -0400
Received: from compute4.internal (compute4.nyi.internal [10.202.2.44])
 by mailfhigh.nyi.internal (Postfix) with ESMTP id 93CAC1140114
 for <bug-guix@HIDDEN>; Wed, 17 Jul 2024 20:43:37 -0400 (EDT)
Received: from imap48 ([10.202.2.98])
 by compute4.internal (MEProxy); Wed, 17 Jul 2024 20:43:37 -0400
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=terracrypt.net;
 h=cc:content-type:content-type:date:date:from:from:in-reply-to
 :message-id:mime-version:reply-to:subject:subject:to:to; s=fm3;
 t=1721263417; x=1721349817; bh=21PgET032XXcC31BSCIzr6mXXXHIOXlx
 MqF6j8eZsrg=; b=06KHpmcXv7WGtMBtVSmtaslZPubW1UEPqrz5wCbJxtdP3w2t
 RyOC0G7EAeYpt1ZMrGbVIJxer/2UtgHfb8GMnV1Rl/H6vPKSK7JOAXQ7v8a/+Ny+
 iSmYp/meJRdpUZlW/pvSIe4VxnTLao6L5RgeDxYoOluTbFTB5+sOjyLxMaUM4UbS
 J9jEleOQkiAv15i88MSl+JnpN0umQsd2hhuMKufOTtXmxttFvT9kaNdT0J5pxQKs
 VDnEGCUUDKPFRp0zCILXJKUReIcsLuzO7e7VD77G0+0Xru5nR0EfU7xE72QRTZf0
 FbZLb8089PHRjK7JVM4bhUyrsTENTAqgg1/F1A==
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=
 messagingengine.com; h=cc:content-type:content-type:date:date
 :feedback-id:feedback-id:from:from:in-reply-to:message-id
 :mime-version:reply-to:subject:subject:to:to:x-me-proxy
 :x-me-proxy:x-me-sender:x-me-sender:x-sasl-enc; s=fm2; t=
 1721263417; x=1721349817; bh=21PgET032XXcC31BSCIzr6mXXXHIOXlxMqF
 6j8eZsrg=; b=Wdb256i65zONehXIr0PNO19QACsxDd+Z3jtu3DZYC3FUWP9m8Zy
 DF7MLqZSVnOx6FISlZUFZBdDrz8i1kre0aFXEpjik4jISxGlAnn/ZZRFZz7yNRsB
 H4VYTlC4k9vCh0BL5oAA6jALr8NvonLurW+00ITl8iMLwZJKri/39UA2q51J4vrm
 z56z+VY9QrYA5ovFUR11hyfyWeuSAO7uDylxKAsk5ruCXql6vlrl7G4E7Cl1bajt
 IT+GhLXEUrfX6eamfH+P4pZKPaQWyTFlUfeg9VpqzBn0jH5P+7WB3omAktPvNQKR
 Yl0TQ7NP6TYTH6I+k6Ohs67+X7hqlELJgPA==
X-ME-Sender: <xms:OGWYZkuujESNJvJbFbVtwfFI8ZzK3ExHs5thnJ1V5WPTE1zpW4JMlg>
 <xme:OGWYZhe0l5OWBIBxZMAAgxJ1WcFMr42yZHDhO7azf8o8FY9NXuD8lHK78pG7Ex0I7
 76-Tfj5BMjkyLQq6Q>
X-ME-Proxy-Cause: gggruggvucftvghtrhhoucdtuddrgeeftddrgeekgdegtdcutefuodetggdotefrodftvf
 curfhrohhfihhlvgemucfhrghsthforghilhdpqfgfvfdpuffrtefokffrpgfnqfghnecu
 uegrihhlohhuthemuceftddtnecunecujfgurhepofgfggfkfffhvffutgesthdtredtre
 ertdenucfhrhhomhepfdflohhnrghthhgrnhcuhfhrvgguvghrihgtkhhsohhnfdcuoehj
 ohhnrghthhgrnhesthgvrhhrrggtrhihphhtrdhnvghtqeenucggtffrrghtthgvrhhnpe
 dvffeugfetgfelleevfeevuefhudejtdfgfeejfeehjeegkefhjefgueeuffekffenucff
 ohhmrghinhepghhithhhuhgsrdgtohhmnecuvehluhhsthgvrhfuihiivgeptdenucfrrg
 hrrghmpehmrghilhhfrhhomhepjhhonhgrthhhrghnsehtvghrrhgrtghrhihpthdrnhgv
 th
X-ME-Proxy: <xmx:OGWYZvyv-BGRIXq3h8UxLZYxwNRDMafeODvbilulK8b9ILEgO8q52g>
 <xmx:OGWYZnPCMh5pg2dHDmpoRTbg2p8sTH17NOxZnOC9tgu-Ol_Wv53oNA>
 <xmx:OGWYZk_hl65U_OZYRmMDAukaP6xfml9hyOwuk1oedqpTsVoXmOHrTg>
 <xmx:OGWYZvWE0oIuMP_Sb5iQUiF345VVElXATlzIN4EtbbFfTEqJx9EB0Q>
 <xmx:OWWYZpFQFeJRqz3kReq1vTbn3bPRbyq-Zk0jEWg-jD-sveFH80aku4ge>
Feedback-ID: if4194509:Fastmail
Received: by mailuser.nyi.internal (Postfix, from userid 501)
 id 3B79731A0065; Wed, 17 Jul 2024 20:43:36 -0400 (EDT)
X-Mailer: MessagingEngine.com Webmail Interface
User-Agent: Cyrus-JMAP/3.11.0-alpha0-568-g843fbadbe-fm-20240701.003-g843fbadb
MIME-Version: 1.0
Message-Id: <df6e8894-fd84-446f-a67f-50cdcc9de5b3@HIDDEN>
Date: Wed, 17 Jul 2024 20:43:15 -0400
From: "Jonathan Frederickson" <jonathan@HIDDEN>
To: bug-guix@HIDDEN
Subject: Shepherd periodically goes unresponsive on one of my machines
Content-Type: text/plain
Received-SPF: pass client-ip=103.168.172.153;
 envelope-from=jonathan@HIDDEN; helo=fhigh2-smtp.messagingengine.com
X-Spam_score_int: -27
X-Spam_score: -2.8
X-Spam_bar: --
X-Spam_report: (-2.8 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
 DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
 RCVD_IN_DNSWL_LOW=-0.7, RCVD_IN_MSPIKE_H4=0.001, RCVD_IN_MSPIKE_WL=0.001,
 SPF_HELO_PASS=-0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-Spam-Score: -1.6 (-)
X-Debbugs-Envelope-To: submit
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -2.6 (--)

I've been running into an issue with Shepherd on one of my machines. Every so often (and I haven't figured out what conditions trigger it), my Shepherd instances (both home and PID 1) will go unresponsive. I thought I had tracked it down to a misbehaving home service that I had configured, but it's just happened again without that service running.

'herd status' hangs indefinitely:

jfred@terracard ~$ sudo herd status
Password: 
<never returns>

...on both instances:

jfred@terracard ~$ herd status
<never returns>

The PID 1 shepherd instance isn't reaping defunct processes:

jfred@terracard ~$ ps aux | grep -i lock
jfred      541  0.0  0.0   3700  2304 ?        S    18:30   0:00 swayidle -w timeout 300 swaylock -f -i ~/.wallpapers/user-manual.jpg timeout 10 if pgrep swaylock; then swaymsg "output * dpms off"; fi resume swaymsg "output * dpms on" before-sleep swaylock -f -i ~/.wallpapers/user-manual.jpg
jfred     3111  0.0  0.0      0     0 ?        Z    18:53   0:00 [swaylock] <defunct>
jfred     3112  0.0  0.0      0     0 ?        Zs   18:53   0:00 [swaylock] <defunct>

Some further troubleshooting... strace indicates that it's waiting on a read() on its fd 9:

jfred@terracard ~ [env]$ sudo strace -fp 1
Password: 
strace: Process 1 attached with 5 threads
[pid   144] read(9,  <unfinished ...>
[pid   142] futex(0x7fa43892abe8, FUTEX_WAIT_BITSET_PRIVATE|FUTEX_CLOCK_REALTIME, 0, NULL, FUTEX_BITSET_MATCH_ANY <unfinished ...>
[pid   141] futex(0x7fa43892abe8, FUTEX_WAIT_BITSET_PRIVATE|FUTEX_CLOCK_REALTIME, 0, NULL, FUTEX_BITSET_MATCH_ANY <unfinished ...>
[pid   140] futex(0x7fa43892abe8, FUTEX_WAIT_BITSET_PRIVATE|FUTEX_CLOCK_REALTIME, 0, NULL, FUTEX_BITSET_MATCH_ANY^

...which seems to be:

jfred@terracard ~ [env]$ sudo ls -l /proc/1/fd/9
lr-x------ 1 root root 64 Jul 17 20:39 /proc/1/fd/9 -> 'pipe:[4015]'
jfred@terracard ~ [env]$ sudo lsof -n | grep 4015
lsof: WARNING: can't stat() fuse.portal file system /run/user/1000/doc
      Output information may be incomplete.
shepherd     1                      root    9r     FIFO               0,15       0t0       4015 pipe
shepherd     1                      root   11w     FIFO               0,15       0t0       4015 pipe
shepherd     1  140 GC-marker       root    9r     FIFO               0,15       0t0       4015 pipe
shepherd     1  140 GC-marker       root   11w     FIFO               0,15       0t0       4015 pipe
shepherd     1  141 GC-marker       root    9r     FIFO               0,15       0t0       4015 pipe
shepherd     1  141 GC-marker       root   11w     FIFO               0,15       0t0       4015 pipe
shepherd     1  142 GC-marker       root    9r     FIFO               0,15       0t0       4015 pipe
shepherd     1  142 GC-marker       root   11w     FIFO               0,15       0t0       4015 pipe
shepherd     1  144 shepherd        root    9r     FIFO               0,15       0t0       4015 pipe
shepherd     1  144 shepherd        root   11w     FIFO               0,15       0t0       4015 pipe

My system configuration for this machine can be found here, and I last ran a 'guix pull' on June 21: https://github.com/jfrederickson/dotfiles/blob/master/guix/guix/system/machines/terracard/config.scm

Has anyone else run into this?




Acknowledgement sent to "Jonathan Frederickson" <jonathan@HIDDEN>:
New bug report received and forwarded. Copy sent to bug-guix@HIDDEN. Full text available.
Report forwarded to bug-guix@HIDDEN:
bug#72166; Package guix. Full text available.
Please note: This is a static page, with minimal formatting, updated once a day.
Click here to see this page with the latest information and nicer formatting.
Last modified: Sun, 12 Jan 2025 05:45:02 UTC

GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997 nCipher Corporation Ltd, 1994-97 Ian Jackson.