GNU bug report logs - #73115
[PATCH] gnu: Add python-sentence-transformers.

Please note: This is a static page, with minimal formatting, updated once a day.
Click here to see this page with the latest information and nicer formatting.

Package: guix-patches; Reported by: Nicolas Graves <ngraves@HIDDEN>; Keywords: patch; dated Sun, 8 Sep 2024 00:10:02 UTC; Maintainer for guix-patches is guix-patches@HIDDEN.
Added indication that bug 73115 blocks73266 Request was from Nicolas Graves <ngraves@HIDDEN> to control <at> debbugs.gnu.org. Full text available.
Added blocking bug(s) 73109 Request was from Nicolas Graves <ngraves@HIDDEN> to control <at> debbugs.gnu.org. Full text available.

Message received at submit <at> debbugs.gnu.org:


Received: (at submit) by debbugs.gnu.org; 8 Sep 2024 00:09:54 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Sat Sep 07 20:09:54 2024
Received: from localhost ([127.0.0.1]:57755 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1sn5V3-0005Gr-PB
	for submit <at> debbugs.gnu.org; Sat, 07 Sep 2024 20:09:54 -0400
Received: from lists.gnu.org ([209.51.188.17]:33626)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <ngraves@HIDDEN>) id 1sn5V0-0005Gf-Qc
 for submit <at> debbugs.gnu.org; Sat, 07 Sep 2024 20:09:52 -0400
Received: from eggs.gnu.org ([2001:470:142:3::10])
 by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1) (envelope-from <ngraves@HIDDEN>)
 id 1sn5Uy-00062H-KH
 for guix-patches@HIDDEN; Sat, 07 Sep 2024 20:09:48 -0400
Received: from 4.mo576.mail-out.ovh.net ([46.105.42.102])
 by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1) (envelope-from <ngraves@HIDDEN>)
 id 1sn5Us-0001nL-Fx
 for guix-patches@HIDDEN; Sat, 07 Sep 2024 20:09:48 -0400
Received: from director8.ghost.mail-out.ovh.net (unknown [10.109.139.3])
 by mo576.mail-out.ovh.net (Postfix) with ESMTP id 4X1VhJ4FRFz1nht
 for <guix-patches@HIDDEN>; Sun,  8 Sep 2024 00:09:36 +0000 (UTC)
Received: from ghost-submission-55b549bf7b-mnlw7 (unknown [10.108.42.240])
 by director8.ghost.mail-out.ovh.net (Postfix) with ESMTPS id 0A7611FD58;
 Sun,  8 Sep 2024 00:09:35 +0000 (UTC)
Received: from ngraves.fr ([37.59.142.110])
 by ghost-submission-55b549bf7b-mnlw7 with ESMTPSA
 id djCSJz/r3GYUwQAAPR9d2Q
 (envelope-from <ngraves@HIDDEN>); Sun, 08 Sep 2024 00:09:35 +0000
Authentication-Results: garm.ovh; auth=pass
 (GARM-110S004d9bc809f-b723-4a19-abe8-4f3d86fdad2c,
 1B24502937BE6AAC558E984F82BC0F282F1693D3) smtp.auth=ngraves@HIDDEN
X-OVh-ClientIp: 81.67.146.208
From: Nicolas Graves <ngraves@HIDDEN>
To: guix-patches@HIDDEN
Subject: [PATCH] gnu: Add python-sentence-transformers.
Date: Sun,  8 Sep 2024 02:09:24 +0200
Message-ID: <20240908000927.29091-1-ngraves@HIDDEN>
X-Mailer: git-send-email 2.45.2
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
X-Ovh-Tracer-Id: 16951548997830566626
X-VR-SPAMSTATE: OK
X-VR-SPAMSCORE: 0
X-VR-SPAMCAUSE: gggruggvucftvghtrhhoucdtuddrgeeftddrudeigedgfedtucetufdoteggodetrfdotffvucfrrhhofhhilhgvmecuqfggjfdpvefjgfevmfevgfenuceurghilhhouhhtmecuhedttdenucenucfjughrpefhvfevufffkffoggfgsedtkeertdertddtnecuhfhrohhmpefpihgtohhlrghsucfirhgrvhgvshcuoehnghhrrghvvghssehnghhrrghvvghsrdhfrheqnecuggftrfgrthhtvghrnhepvdffvdfghffffedtvefftdetkeetueejuedvtdekgfffffehhedulefhkeevtdehnecuffhomhgrihhnpehssggvrhhtrdhnvghtnecukfhppeduvdejrddtrddtrddupdekuddrieejrddugeeirddvtdekpdefjedrheelrddugedvrdduuddtnecuvehluhhsthgvrhfuihiivgeptdenucfrrghrrghmpehinhgvthepuddvjedrtddrtddruddpmhgrihhlfhhrohhmpehnghhrrghvvghssehnghhrrghvvghsrdhfrhdpnhgspghrtghpthhtohepuddprhgtphhtthhopehguhhigidqphgrthgthhgvshesghhnuhdrohhrghdpoffvtefjohhsthepmhhoheejiedpmhhouggvpehsmhhtphhouhht
DKIM-Signature: a=rsa-sha256; bh=PvuTKGFqA5+jNvVIiou4FFNmN9QhyLWPLkTSuGlYyG4=; 
 c=relaxed/relaxed; d=ngraves.fr; h=From;
 s=ovhmo4487190-selector1; t=1725754176; v=1;
 b=hRBqDAWe91/HMUXAZn4ri1ZOju9NJn1RQ/pEpllkheM0Z8vprao8hegvxZO4H0jVZtZTg61g
 1yC5labkLR92hEcBn3wfrTLHAt48QDDBrZCWvPlu4Gkbx9i4Dtn/IkhQQEMBPtjV6ycNnUD+4AO
 A8Mb9dEVHW3gFGA8CxD3QqRdkNsDbgLqxYwPwTI+q9TuS2xI4JKOSzZo5GQ7AIyf5vBkKUXHZNQ
 pSUxM93EnsN0P5Q99xhQwo8g0cvRmWSV7DBxV6JXVeubop46qu90JejClTJ3CD6jMYcxPLCBArM
 AO3uESKnGoVfUP1pOfC5CoMOvqFrRLp0t/kVxJcr+c4Rg==
Received-SPF: pass client-ip=46.105.42.102; envelope-from=ngraves@HIDDEN;
 helo=4.mo576.mail-out.ovh.net
X-Spam_score_int: -20
X-Spam_score: -2.1
X-Spam_bar: --
X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1,
 DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1,
 RCVD_IN_DNSWL_NONE=-0.0001, RCVD_IN_MSPIKE_H3=0.001, RCVD_IN_MSPIKE_WL=0.001,
 RCVD_IN_VALIDITY_RPBL_BLOCKED=0.001, RCVD_IN_VALIDITY_SAFE_BLOCKED=0.001,
 SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no
X-Spam_action: no action
X-Spam-Score: -1.3 (-)
X-Debbugs-Envelope-To: submit
Cc: ngraves@HIDDEN
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -2.3 (--)

* gnu/packages/machine-learning.scm (python-sentence-transformers): New variable.

Change-Id: Iedab56f6c2bdde12e654ba67695cd996122bdb0b
---
 gnu/packages/machine-learning.scm | 54 +++++++++++++++++++++++++++++++
 1 file changed, 54 insertions(+)

diff --git a/gnu/packages/machine-learning.scm b/gnu/packages/machine-learning.scm
index 42842d7d61..b2da07e8f0 100644
--- a/gnu/packages/machine-learning.scm
+++ b/gnu/packages/machine-learning.scm
@@ -1239,6 +1239,60 @@ (define-public python-sentencepiece
 unsupervised text tokenizer.")
     (license license:asl2.0)))
 
+(define-public python-sentence-transformers
+  (package
+    (name "python-sentence-transformers")
+    (version "3.0.1")
+    (source
+     (origin
+       (method url-fetch)
+       (uri (pypi-uri "sentence_transformers" version))
+       (sha256
+        (base32 "1xmzbyrlp6wa7adf42n67c544db17nz95b10ri603lf4gi9jqgca"))))
+    (build-system pyproject-build-system)
+    (arguments
+     (list
+      #:test-flags `(list
+                     ;; Missing fixture / train or test data.
+                     ;; Requires internet access.
+                     "--ignore=tests/test_sentence_transformer.py"
+                     "--ignore=tests/test_train_stsb.py"
+                     "--ignore=tests/test_compute_embeddings.py"
+                     "--ignore=tests/test_cross_encoder.py"
+                     "--ignore=tests/test_model_card_data.py"
+                     "--ignore=tests/test_multi_process.py"
+                     "--ignore=tests/test_pretrained_stsb.py"
+                     "-k" ,(string-append
+                            "not test_LabelAccuracyEvaluator"
+                            " and not test_ParaphraseMiningEvaluator"
+                            " and not test_cmnrl_same_grad"
+                            " and not test_paraphrase_mining"
+                            " and not test_simple_encode"))))
+    (propagated-inputs (list python-huggingface-hub
+                             python-numpy
+                             python-pillow
+                             python-scikit-learn
+                             python-scipy
+                             python-pytorch
+                             python-tqdm
+                             python-transformers))
+    (native-inputs (list python-pytest))
+    (home-page "https://www.SBERT.net")
+    (synopsis "Multilingual text embeddings")
+    (description "This framework provides an easy method to compute dense
+vector representations for sentences, paragraphs, and images.  The models are
+based on transformer networks like BERT / RoBERTa / XLM-RoBERTa and achieve
+state-of-the-art performance in various tasks. Text is embedded in vector
+space such that similar text are closer and can efficiently be found using
+cosine similarity.
+
+This package provides easy access to pretrained models for more than 100
+languages, fine-tuned for various use-cases.
+
+Further, this framework allows an easy fine-tuning of custom embeddings
+models, to achieve maximal performance on your specific task.")
+    (license license:asl2.0)))
+
 (define-public python-spacy-legacy
   (package
     (name "python-spacy-legacy")
-- 
2.45.2





Acknowledgement sent to Nicolas Graves <ngraves@HIDDEN>:
New bug report received and forwarded. Copy sent to guix-patches@HIDDEN. Full text available.
Report forwarded to guix-patches@HIDDEN:
bug#73115; Package guix-patches. Full text available.
Please note: This is a static page, with minimal formatting, updated once a day.
Click here to see this page with the latest information and nicer formatting.
Last modified: Sun, 12 Jan 2025 05:45:02 UTC

GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997 nCipher Corporation Ltd, 1994-97 Ian Jackson.