Stefan Kangas <stefankangas@HIDDEN> to control <at> debbugs.gnu.org.
From: Stefan Kangas <stefankangas@HIDDEN>
To: Lars Ingebrigtsen <larsi@HIDDEN>
Cc: Eric Abrahamsen <eric@HIDDEN>, Eli Zaretskii <eliz@HIDDEN>, 34862 <at> debbugs.gnu.org, Richard Stallman <rms@HIDDEN>
Subject: Re: bug#34862: 27.0.50; Trying to update pinyin.map
Date: Sat, 15 Feb 2025 02:22:50 +0000

Lars Ingebrigtsen <larsi@HIDDEN> writes:

> Eric Abrahamsen <eric@HIDDEN> writes:
>
>> I guess I still didn't know if I should be writing the script as a part
>> of the make process or not...
>
> Yes, I think it would be natural to have it be part of the make process.

(That was three years ago.)

Eric, did you get anywhere with this?
bug-gnu-emacs@HIDDEN: bug#34862; Package emacs.
From: Lars Ingebrigtsen <larsi@HIDDEN>
To: Eric Abrahamsen <eric@HIDDEN>
Cc: Eli Zaretskii <eliz@HIDDEN>, 34862 <at> debbugs.gnu.org, Richard Stallman <rms@HIDDEN>
Subject: Re: bug#34862: 27.0.50; Trying to update pinyin.map
Date: Tue, 08 Feb 2022 07:12:28 +0100

Eric Abrahamsen <eric@HIDDEN> writes:

> I guess I still didn't know if I should be writing the script as a part
> of the make process or not...

Yes, I think it would be natural to have it be part of the make process.

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no
bug-gnu-emacs@HIDDEN: bug#34862; Package emacs.
From: Eric Abrahamsen <eric@HIDDEN>
To: Lars Ingebrigtsen <larsi@HIDDEN>
Cc: Eli Zaretskii <eliz@HIDDEN>, 34862 <at> debbugs.gnu.org, Richard Stallman <rms@HIDDEN>
Subject: Re: bug#34862: 27.0.50; Trying to update pinyin.map
Date: Mon, 07 Feb 2022 16:26:25 -0800

On 02/02/22 19:59 PM, Lars Ingebrigtsen wrote:
> Eric Abrahamsen <eric@HIDDEN> writes:
>
>>> I think this should be done with a script, and that script should be
>>> in our repository. The easiest kind of a script is a Lisp program, of
>>> course, but we can also use other kinds, such as Awk scripts.
>>
>> Awk seems just right for the problem, but I haven't written much in it;
>> I did the original munging in elisp. Would this be a script written for
>> use with -batch and a custom make target?
>
> It's fine to parse the files with Lisp instead of awk (unless they're
> needed to boot Emacs, which I don't think is the case here).
>
> Did you get any further with this?

I guess I still didn't know if I should be writing the script as a part
of the make process or not...
bug-gnu-emacs@HIDDEN: bug#34862; Package emacs.
From: Lars Ingebrigtsen <larsi@HIDDEN>
To: Eric Abrahamsen <eric@HIDDEN>
Cc: Eli Zaretskii <eliz@HIDDEN>, 34862 <at> debbugs.gnu.org, Richard Stallman <rms@HIDDEN>
Subject: Re: bug#34862: 27.0.50; Trying to update pinyin.map
Date: Wed, 02 Feb 2022 19:59:25 +0100

Eric Abrahamsen <eric@HIDDEN> writes:

>> I think this should be done with a script, and that script should be
>> in our repository. The easiest kind of a script is a Lisp program, of
>> course, but we can also use other kinds, such as Awk scripts.
>
> Awk seems just right for the problem, but I haven't written much in it;
> I did the original munging in elisp. Would this be a script written for
> use with -batch and a custom make target?

It's fine to parse the files with Lisp instead of awk (unless they're
needed to boot Emacs, which I don't think is the case here).

Did you get any further with this?

-- 
(domestic pets only, the antidote for overdose, milk.)
   bloggy blog: http://lars.ingebrigtsen.no
bug-gnu-emacs@HIDDEN: bug#34862; Package emacs.
From: Eric Abrahamsen <eric@HIDDEN>
To: Eli Zaretskii <eliz@HIDDEN>
Cc: 34862 <at> debbugs.gnu.org, rms@HIDDEN
Subject: Re: bug#34862: 27.0.50; Trying to update pinyin.map
Date: Wed, 20 Mar 2019 12:41:45 -0700

Eli Zaretskii <eliz@HIDDEN> writes:

>> From: Eric Abrahamsen <eric@HIDDEN>
>> Cc: Richard Stallman <rms@HIDDEN>, 34862 <at> debbugs.gnu.org
>> Date: Wed, 20 Mar 2019 12:30:22 -0700
>>
>> > I think this should be done with a script, and that script should be
>> > in our repository. The easiest kind of a script is a Lisp program, of
>> > course, but we can also use other kinds, such as Awk scripts.
>>
>> Awk seems just right for the problem, but I haven't written much in it;
>> I did the original munging in elisp. Would this be a script written for
>> use with -batch and a custom make target?
>
> Yes.
>
>> should it also be responsible for downloading a recent copy of the
>> source file, or should that be done first, and the function pointed
>> at the file?
>
> The latter, I think. That's what we do with the other data files we
> use from external sources, e.g. see admin/unidata/.

Understood -- thanks for this.
bug-gnu-emacs@HIDDEN: bug#34862; Package emacs.
From: Eli Zaretskii <eliz@HIDDEN>
To: Eric Abrahamsen <eric@HIDDEN>
Cc: 34862 <at> debbugs.gnu.org, rms@HIDDEN
Subject: Re: bug#34862: 27.0.50; Trying to update pinyin.map
Date: Wed, 20 Mar 2019 21:39:11 +0200

> From: Eric Abrahamsen <eric@HIDDEN>
> Cc: Richard Stallman <rms@HIDDEN>, 34862 <at> debbugs.gnu.org
> Date: Wed, 20 Mar 2019 12:30:22 -0700
>
> > I think this should be done with a script, and that script should be
> > in our repository. The easiest kind of a script is a Lisp program, of
> > course, but we can also use other kinds, such as Awk scripts.
>
> Awk seems just right for the problem, but I haven't written much in it;
> I did the original munging in elisp. Would this be a script written for
> use with -batch and a custom make target?

Yes.

> should it also be responsible for downloading a recent copy of the
> source file, or should that be done first, and the function pointed
> at the file?

The latter, I think. That's what we do with the other data files we
use from external sources, e.g. see admin/unidata/.
bug-gnu-emacs@HIDDEN: bug#34862; Package emacs.
From: Eric Abrahamsen <eric@HIDDEN>
To: Eli Zaretskii <eliz@HIDDEN>
Cc: 34862 <at> debbugs.gnu.org, Richard Stallman <rms@HIDDEN>
Subject: Re: bug#34862: 27.0.50; Trying to update pinyin.map
Date: Wed, 20 Mar 2019 12:30:22 -0700

On 03/20/19 11:45 AM, Eli Zaretskii wrote:

[...]

>> > Btw, I understand that the Google pinyin method is Apache licensed,
>> > but does this mean we can freely use its data for updating pinyin.map?
>> > IANAL. Could you perhaps describe how you intend to extract the data
>> > from the Google input method for the purpose of updating our file? I
>> > think someone will have to audit that process for being legal and
>> > compatible with both the Apache license and the GPL.
>>
>> This[2] is the source file I used. I chopped off all the
>> multiple-character dictionary entries, and munged the remaining data
>> into the format we need. Ie, lines like this:
>>
>> 八 6677.54934466 0 ba
>> 把 165484.231697 0 ba
>> 吧 385205.434615 0 ba
>>
>> Became this:
>>
>> ba 吧把八
>>
>> A straight rearrangement, with frequency of use translated into simple
>> ordering of the characters. While this is obviously pretty manual, and a
>> bit of work, a file like this really only needs to be updated every five
>> years or so -- if that. Whenever someone thinks of it.
>
> I think this should be done with a script, and that script should be
> in our repository. The easiest kind of a script is a Lisp program, of
> course, but we can also use other kinds, such as Awk scripts.

Awk seems just right for the problem, but I haven't written much in it;
I did the original munging in elisp. Would this be a script written for
use with -batch and a custom make target? Or something to be loaded into
a running Emacs and called interactively?

In either case, should it also be responsible for downloading a recent
copy of the source file, or should that be done first, and the function
pointed at the file?

>> Regarding the license, I'm even less of a lawyer than you, but these[3]
>> are the terms that cover this data.
>
> Richard, could you please look at that license and tell if we can use
> this data file?
>
>> > (Also, I'm somewhat surprised that gbk isn't capable of covering the
>> > characters you want to add. Or did you not try using it?)
>>
>> I did not try using it! Mostly because the error message suggested
>> gb18030 first. gbk also works. I don't have any opinion about encoding,
>> apart from assuming utf8 unless there's a good reason not to.
>
> I see no good reason to use anything other than UTF-8.

Excellent. I will think about the script, and look forward to word from
Richard.

Eric
bug-gnu-emacs@HIDDEN: bug#34862; Package emacs.
Full text available.Received: (at 34862) by debbugs.gnu.org; 20 Mar 2019 09:45:50 +0000 From debbugs-submit-bounces <at> debbugs.gnu.org Wed Mar 20 05:45:50 2019 Received: from localhost ([127.0.0.1]:51217 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>) id 1h6XnC-0001BO-CY for submit <at> debbugs.gnu.org; Wed, 20 Mar 2019 05:45:50 -0400 Received: from eggs.gnu.org ([209.51.188.92]:42701) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from <eliz@HIDDEN>) id 1h6XnA-0001BB-Hf for 34862 <at> debbugs.gnu.org; Wed, 20 Mar 2019 05:45:49 -0400 Received: from fencepost.gnu.org ([2001:470:142:3::e]:52358) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from <eliz@HIDDEN>) id 1h6Xn4-0001c7-QM; Wed, 20 Mar 2019 05:45:43 -0400 Received: from [176.228.60.248] (port=1428 helo=home-c4e4a596f7) by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256) (Exim 4.82) (envelope-from <eliz@HIDDEN>) id 1h6Xmx-0001kO-FU; Wed, 20 Mar 2019 05:45:35 -0400 Date: Wed, 20 Mar 2019 11:45:30 +0200 Message-Id: <83woktswud.fsf@HIDDEN> From: Eli Zaretskii <eliz@HIDDEN> To: Eric Abrahamsen <eric@HIDDEN>, Richard Stallman <rms@HIDDEN> In-reply-to: <871s38at0z.fsf@HIDDEN> (message from Eric Abrahamsen on Fri, 15 Mar 2019 11:31:40 -0700) Subject: Re: bug#34862: 27.0.50; Trying to update pinyin.map References: <87zhpxyvls.fsf@HIDDEN> <83ftro20gt.fsf@HIDDEN> <87o96cbrwp.fsf@HIDDEN> <83ef781uuh.fsf@HIDDEN> <871s38at0z.fsf@HIDDEN> MIME-version: 1.0 Content-type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic] X-Spam-Score: 0.0 (/) X-Debbugs-Envelope-To: 34862 Cc: 34862 <at> debbugs.gnu.org X-BeenThere: debbugs-submit <at> debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list List-Id: <debbugs-submit.debbugs.gnu.org> List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
<mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe> List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/> List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org> List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help> List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe> Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org> X-Spam-Score: -1.0 (-) > From: Eric Abrahamsen <eric@HIDDEN> > Date: Fri, 15 Mar 2019 11:31:40 -0700 > > > That file is imported from an external source, isn't it? Are you > > saying we should stop synchronizing it with that source, and instead > > fork it, maintain our own separate copy, and never resync with that > > source again? If so, then I see no reason not to recode it in UTF-8. > > Near as I can tell that file was imported into Emacs in 2001 and not > touched since (apart from copyright and encoding stuff). The Debian > package from which it comes seems to have been orphaned in 2003[1]. So > there's not much to either synchronize or fork! OK, sounds reasonable. > > Btw, I understand that the Google pinyin method is Apache licensed, > > but does this mean we can freely use its data for updating pinyin.map? > > IANAL. Could you perhaps describe how you intend to extract the data > > from the Google input method for the purpose of updating our file? I > > think someone will have to audit that process for being legal and > > compatible with both the Apache license and the GPL. > > This[2] is the source file I used. I chopped off all the > multiple-character dictionary entries, and munged the remaining data > into the format we need. 
Ie, lines like this: > > 八 6677.54934466 0 ba > 把 165484.231697 0 ba > 吧 385205.434615 0 ba > > Became this: > > ba 吧把八 > > A straight rearrangement, with frequency of use translated into simple > ordering of the characters. While this is obviously pretty manual, and a > bit of work, a file like this really only needs to be updated every five > years or so -- if that. Whenever someone thinks of it. I think this should be done with a script, and that script should be in our repository. The easiest kind of a script is a Lisp program, of course, but we can also use other kinds, such as Awk scripts. > Regarding the license, I'm even less of a lawyer than you, but these[3] > are the terms that cover this data. Richard, could you please look at that license and tell if we can use this data file? > > (Also, I'm somewhat surprised that gbk isn't capable of covering the > > characters you want to add. Or did you not try using it?) > > I did not try using it! Mostly because the error message suggested > gb18030 first. gbk also works. I don't have any opinion about encoding, > apart from assuming utf8 unless there's a good reason not to. I see no good reason to use anything other than UTF-8. > [1] https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=189523;msg=18 > > [2] https://android.googlesource.com/platform/packages/inputmethods/PinyinIME/+/refs/heads/master/jni/data/rawdict_utf16_65105_freq.txt > > [3] https://android.googlesource.com/platform/packages/inputmethods/PinyinIME/+/refs/heads/master/NOTICE Thanks.
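
As an illustration of the kind of script discussed above, here is a minimal sketch in Python (not the Lisp or Awk script that was proposed, and not code from the Emacs repository). It assumes each input line has the shape shown in the thread: a word, a frequency, a third column that is ignored here, and then one pinyin syllable per character. The function name `build_pinyin_map` and the multi-character sample line in the demo are hypothetical.

```python
from collections import defaultdict


def build_pinyin_map(lines):
    """Turn 'char freq flag pinyin' lines into 'pinyin chars' lines.

    Characters under each syllable are ordered from most to least frequent.
    The column layout is an assumption based on the three sample lines
    quoted in the thread.
    """
    by_syllable = defaultdict(list)
    for line in lines:
        fields = line.split()
        if len(fields) < 4:
            continue  # blank or malformed line
        word, freq, syllables = fields[0], fields[1], fields[3:]
        # Drop the multiple-character dictionary entries.
        if len(word) != 1 or len(syllables) != 1:
            continue
        by_syllable[syllables[0]].append((float(freq), word))

    result = []
    for syllable in sorted(by_syllable):
        # Highest frequency first; the sort is stable, so ties keep input order.
        ranked = sorted(by_syllable[syllable], key=lambda entry: -entry[0])
        result.append(syllable + " " + "".join(ch for _, ch in ranked))
    return result


if __name__ == "__main__":
    sample = [
        "八 6677.54934466 0 ba",
        "把 165484.231697 0 ba",
        "吧 385205.434615 0 ba",
        "爸爸 100.0 0 ba ba",  # hypothetical multi-character entry, skipped
    ]
    for entry in build_pinyin_map(sample):
        print(entry)
```

The function takes any iterable of strings, so the caller decides how to open and decode the source file; the header and license text of pinyin.map are not handled here.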
bug-gnu-emacs@HIDDEN: bug#34862; Package emacs.

Received: (at submit) by debbugs.gnu.org; 15 Mar 2019 18:32:06 +0000
To: bug-gnu-emacs@HIDDEN
From: Eric Abrahamsen <eric@HIDDEN>
Subject: Re: bug#34862: 27.0.50; Trying to update pinyin.map
Date: Fri, 15 Mar 2019 11:31:40 -0700
Message-ID: <871s38at0z.fsf@HIDDEN>
References: <87zhpxyvls.fsf@HIDDEN> <83ftro20gt.fsf@HIDDEN> <87o96cbrwp.fsf@HIDDEN> <83ef781uuh.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.0.50 (gnu/linux)

Eli Zaretskii <eliz@HIDDEN> writes:

>> From: Eric Abrahamsen <eric@HIDDEN>
>> Cc: 34862 <at> debbugs.gnu.org
>> Date: Thu, 14 Mar 2019 22:58:14 -0700
>>
>> > I'm not sure I understand the encoding of which file would you like to
>> > change? Could you please clarify?
>>
>> Sorry, I'm trying to add more characters to ./leim/MISC-DIC/pinyin.map,
>> which is encoded as chinese-iso-8bit-dos, and it can't accept the new
>> characters with that current encoding. That's the file I'd like to
>> change.
>
> That file is imported from an external source, isn't it? Are you
> saying we should stop synchronizing it with that source, and instead
> fork it, maintain our own separate copy, and never resync with that
> source again? If so, then I see no reason not to recode it in UTF-8.

Near as I can tell that file was imported into Emacs in 2001 and not
touched since (apart from copyright and encoding stuff). The Debian
package from which it comes seems to have been orphaned in 2003[1]. So
there's not much to either synchronize or fork!

> Btw, I understand that the Google pinyin method is Apache licensed,
> but does this mean we can freely use its data for updating pinyin.map?
> IANAL. Could you perhaps describe how you intend to extract the data
> from the Google input method for the purpose of updating our file? I
> think someone will have to audit that process for being legal and
> compatible with both the Apache license and the GPL.

This[2] is the source file I used. I chopped off all the
multiple-character dictionary entries, and munged the remaining data
into the format we need. Ie, lines like this:

八 6677.54934466 0 ba
把 165484.231697 0 ba
吧 385205.434615 0 ba

Became this:

ba 吧把八

A straight rearrangement, with frequency of use translated into simple
ordering of the characters. While this is obviously pretty manual, and a
bit of work, a file like this really only needs to be updated every five
years or so -- if that. Whenever someone thinks of it.

Regarding the license, I'm even less of a lawyer than you, but these[3]
are the terms that cover this data.

> (Also, I'm somewhat surprised that gbk isn't capable of covering the
> characters you want to add. Or did you not try using it?)

I did not try using it! Mostly because the error message suggested
gb18030 first. gbk also works. I don't have any opinion about encoding,
apart from assuming utf8 unless there's a good reason not to.

Thanks,
Eric

[1] https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=189523;msg=18

[2] https://android.googlesource.com/platform/packages/inputmethods/PinyinIME/+/refs/heads/master/jni/data/rawdict_utf16_65105_freq.txt

[3] https://android.googlesource.com/platform/packages/inputmethods/PinyinIME/+/refs/heads/master/NOTICE
bug-gnu-emacs@HIDDEN: bug#34862; Package emacs.

Received: (at 34862) by debbugs.gnu.org; 15 Mar 2019 07:05:21 +0000
Date: Fri, 15 Mar 2019 09:04:54 +0200
Message-Id: <83ef781uuh.fsf@HIDDEN>
From: Eli Zaretskii <eliz@HIDDEN>
To: Eric Abrahamsen <eric@HIDDEN>
Cc: 34862 <at> debbugs.gnu.org
In-reply-to: <87o96cbrwp.fsf@HIDDEN> (message from Eric Abrahamsen on Thu, 14 Mar 2019 22:58:14 -0700)
Subject: Re: bug#34862: 27.0.50; Trying to update pinyin.map
References: <87zhpxyvls.fsf@HIDDEN> <83ftro20gt.fsf@HIDDEN> <87o96cbrwp.fsf@HIDDEN>

> From: Eric Abrahamsen <eric@HIDDEN>
> Cc: 34862 <at> debbugs.gnu.org
> Date: Thu, 14 Mar 2019 22:58:14 -0700
>
> > I'm not sure I understand the encoding of which file would you like to
> > change? Could you please clarify?
>
> Sorry, I'm trying to add more characters to ./leim/MISC-DIC/pinyin.map,
> which is encoded as chinese-iso-8bit-dos, and it can't accept the new
> characters with that current encoding. That's the file I'd like to
> change.

That file is imported from an external source, isn't it? Are you
saying we should stop synchronizing it with that source, and instead
fork it, maintain our own separate copy, and never resync with that
source again? If so, then I see no reason not to recode it in UTF-8.

Btw, I understand that the Google pinyin method is Apache licensed,
but does this mean we can freely use its data for updating pinyin.map?
IANAL. Could you perhaps describe how you intend to extract the data
from the Google input method for the purpose of updating our file? I
think someone will have to audit that process for being legal and
compatible with both the Apache license and the GPL.

(Also, I'm somewhat surprised that gbk isn't capable of covering the
characters you want to add. Or did you not try using it?)

Thanks.
bug-gnu-emacs@HIDDEN: bug#34862; Package emacs.

Received: (at 34862) by debbugs.gnu.org; 15 Mar 2019 05:58:24 +0000
From: Eric Abrahamsen <eric@HIDDEN>
To: Eli Zaretskii <eliz@HIDDEN>
Cc: 34862 <at> debbugs.gnu.org
Subject: Re: bug#34862: 27.0.50; Trying to update pinyin.map
References: <87zhpxyvls.fsf@HIDDEN> <83ftro20gt.fsf@HIDDEN>
Date: Thu, 14 Mar 2019 22:58:14 -0700
In-Reply-To: <83ftro20gt.fsf@HIDDEN> (Eli Zaretskii's message of "Fri, 15 Mar 2019 07:03:30 +0200")
Message-ID: <87o96cbrwp.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.0.50 (gnu/linux)

On 03/15/19 07:03 AM, Eli Zaretskii wrote:
>> From: Eric Abrahamsen <eric@HIDDEN>
>> Date: Thu, 14 Mar 2019 14:49:51 -0700
>>
>>
>> As discussed in bug#34215, I'm trying to update the
>> romanization-to-Chinese-character mapping in the
>> file ./leim/MISC-DIC/pinyin.map to use the more complete mapping
>> provided by the Google pinyin input method, licensed under Apache 2.0.
>> This expands the number of characters recognized by Emacs from around
>> 7,000 to around 17,000. (And increases the size of the mapping file from
>> 18K to 53K.)
>>
>> I'm running into encoding problems when adding the new characters --
>> Emacs says some of the characters can't be written using the existing
>> coding system. The original file has an encoding cookie reading coding:
>> cn-gb-2312, and describing the coding system gives me:
>>
>> chinese-iso-8bit-dos (alias: cn-gb-2312-dos euc-china-dos euc-cn-dos
>> cn-gb-dos gb2312-dos)
>>
>> The characters *can* be encoded using gb18030, and of course utf8. The
>> wikipedia page for gb18030 describes gb2312 as "legacy"[1], and says
>> gb18030 is a superset of 2312.
>>
>> Is there any reason not to go straight to utf8 for this file? If that's
>> not okay, would gb18030 be acceptable?
>
> I'm not sure I understand the encoding of which file would you like to
> change? Could you please clarify?

Sorry, I'm trying to add more characters to ./leim/MISC-DIC/pinyin.map,
which is encoded as chinese-iso-8bit-dos, and it can't accept the new
characters with that current encoding. That's the file I'd like to
change.

Thanks,
Eric
bug-gnu-emacs@HIDDEN: bug#34862; Package emacs.

Received: (at 34862) by debbugs.gnu.org; 15 Mar 2019 05:03:57 +0000
Date: Fri, 15 Mar 2019 07:03:30 +0200
Message-Id: <83ftro20gt.fsf@HIDDEN>
From: Eli Zaretskii <eliz@HIDDEN>
To: Eric Abrahamsen <eric@HIDDEN>
Cc: 34862 <at> debbugs.gnu.org
In-reply-to: <87zhpxyvls.fsf@HIDDEN> (message from Eric Abrahamsen on Thu, 14 Mar 2019 14:49:51 -0700)
Subject: Re: bug#34862: 27.0.50; Trying to update pinyin.map
References: <87zhpxyvls.fsf@HIDDEN>

> From: Eric Abrahamsen <eric@HIDDEN>
> Date: Thu, 14 Mar 2019 14:49:51 -0700
>
>
> As discussed in bug#34215, I'm trying to update the
> romanization-to-Chinese-character mapping in the
> file ./leim/MISC-DIC/pinyin.map to use the more complete mapping
> provided by the Google pinyin input method, licensed under Apache 2.0.
> This expands the number of characters recognized by Emacs from around
> 7,000 to around 17,000. (And increases the size of the mapping file from
> 18K to 53K.)
>
> I'm running into encoding problems when adding the new characters --
> Emacs says some of the characters can't be written using the existing
> coding system. The original file has an encoding cookie reading coding:
> cn-gb-2312, and describing the coding system gives me:
>
> chinese-iso-8bit-dos (alias: cn-gb-2312-dos euc-china-dos euc-cn-dos
> cn-gb-dos gb2312-dos)
>
> The characters *can* be encoded using gb18030, and of course utf8. The
> wikipedia page for gb18030 describes gb2312 as "legacy"[1], and says
> gb18030 is a superset of 2312.
>
> Is there any reason not to go straight to utf8 for this file? If that's
> not okay, would gb18030 be acceptable?

I'm not sure I understand the encoding of which file would you like to
change? Could you please clarify?
bug-gnu-emacs@HIDDEN: bug#34862; Package emacs.

Received: (at submit) by debbugs.gnu.org; 14 Mar 2019 21:51:19 +0000
From: Eric Abrahamsen <eric@HIDDEN>
To: bug-gnu-emacs@HIDDEN
Subject: 27.0.50; Trying to update pinyin.map
Date: Thu, 14 Mar 2019 14:49:51 -0700
Message-ID: <87zhpxyvls.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.0.50 (gnu/linux)

As discussed in bug#34215, I'm trying to update the
romanization-to-Chinese-character mapping in the
file ./leim/MISC-DIC/pinyin.map to use the more complete mapping
provided by the Google pinyin input method, licensed under Apache 2.0.
This expands the number of characters recognized by Emacs from around
7,000 to around 17,000. (And increases the size of the mapping file from
18K to 53K.)

I'm running into encoding problems when adding the new characters --
Emacs says some of the characters can't be written using the existing
coding system. The original file has an encoding cookie reading coding:
cn-gb-2312, and describing the coding system gives me:

chinese-iso-8bit-dos (alias: cn-gb-2312-dos euc-china-dos euc-cn-dos
cn-gb-dos gb2312-dos)

The characters *can* be encoded using gb18030, and of course utf8. The
wikipedia page for gb18030 describes gb2312 as "legacy"[1], and says
gb18030 is a superset of 2312.

Is there any reason not to go straight to utf8 for this file? If that's
not okay, would gb18030 be acceptable?

Codepoint 23744 is an example of a character that can be encoded with
18030 but not 2312. It also exercises my font engine.

I have two other questions, about reducing vc churn, and how to insert
the license at the top of the file, but I figured I'd ask this first.

Thanks,
Eric

[1] https://en.wikipedia.org/wiki/GB_18030
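
The encoding question in this report can be checked outside Emacs with a small probe. The sketch below is in Python and is only an approximation: it assumes Python's `gb2312`, `gbk`, `gb18030` and `utf-8` codecs are close enough to the Emacs coding systems of similar name, which the thread does not confirm. The helper name `can_encode` is hypothetical.

```python
def can_encode(text, codec):
    """Return True if every character of `text` is representable in `codec`."""
    try:
        text.encode(codec)
    except UnicodeEncodeError:
        return False
    return True


if __name__ == "__main__":
    ch = chr(23744)  # the codepoint named in the report
    for codec in ("gb2312", "gbk", "gb18030", "utf-8"):
        print(codec, can_encode(ch, codec))
```

Running the same check over every character of a candidate pinyin.map would show which characters force a move away from the existing coding system.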
Eric Abrahamsen <eric@HIDDEN>: bug-gnu-emacs@HIDDEN.
bug-gnu-emacs@HIDDEN: bug#34862; Package emacs.
GNU bug tracking system
Copyright (C) 1999 Darren O. Benham,
1997 nCipher Corporation Ltd,
1994-97 Ian Jackson.