Received: from mail-qk0-f188.google.com ([209.85.220.188]:36841) by stodi.digitalkingdom.org with esmtps (TLSv1.2:ECDHE-RSA-AES128-GCM-SHA256:128) (Exim 4.87) (envelope-from ) id 1db7Gf-0007VC-VK for lojban-list-archive@lojban.org; Fri, 28 Jul 2017 08:33:38 -0700 Received: by mail-qk0-f188.google.com with SMTP id d145sf11383001qkc.3 for ; Fri, 28 Jul 2017 08:33:32 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1501256005; cv=pass; d=google.com; s=arc-20160816; b=ctXRPArwM77Bngg4vyiM701NdarA1zEpAf3Jm9Aw1wRyzSjqUlPox6Hs8rrAZwdnoX FbxVTuH6rBiY/As+XPUgQPOPkiHEAJTZnLrBcCY2bXOBfDJqq2U65RHghV2obnx0Pip2 9XthiRQtMzIrGCpkN4V+0E/dm/8XePh9FTcc2Q3CjElZuZl/mZ5PsiQ5KhJrMuKn2KAD /B8qSc8zMpvTsOxa0CcOF0S8i5NbltiNYWwJNhLVWjm4TIzUfc/ADXAp6LcSBM9RTtus WmyLlTM1pKccL2lF15K+K3/KsWuafGG2ai46Yx6xKIUkIlijzKuufhpHyeU7LN+kDJGh XIWg== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:to:subject:message-id:date :from:in-reply-to:references:mime-version:arc-authentication-results :arc-message-signature:sender:dkim-signature:dkim-signature :arc-authentication-results; bh=ytB1HodJ3uOL4k6WEq9590BMCr/RiQOTAyFGP7blElc=; b=Oyl1YwPZ3YaLiT+carDOWDoWWuvDCWj7NJ+AOheU+sbbR1q6tRZXdH3T9Cv/Q4sC4U p9NxDhTv4t7zpLQVzn2bi2cSvtmPRCoUOeDv4ew7tT3hZQdqvIprWy7794zksJiIvPRt NaNxQlJbF4uBiC7JnTsO4xavbKXrALGn2tXyve/lWClU/wajRE5w08p5Or8WxRa/Kb94 pngarIugMuaBussw05tDJzRAL8IJS4oR7bZqPS0RLQpMKrtZatKsG2NyjgsuqCXW0R75 uZOGodFXbjRxoK9ZzrNs+0Ro+8NtebtD/ZaOStJlzWOpPxsQU5TYm3Ll9Dj9DcRZoxKd h7yA== ARC-Authentication-Results: i=2; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.b=NyvsbxN/; spf=pass (google.com: domain of adamlopresto@gmail.com designates 2607:f8b0:400c:c08::22f as permitted sender) smtp.mailfrom=adamlopresto@gmail.com; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:mime-version:references:in-reply-to:from:date:message-id :subject:to:x-original-sender:x-original-authentication-results :reply-to:precedence:mailing-list:list-id:list-post:list-help :list-archive:list-subscribe:list-unsubscribe; bh=ytB1HodJ3uOL4k6WEq9590BMCr/RiQOTAyFGP7blElc=; b=CpYF4h0vQTGiGwWBTI/PE/ylGbta/y7NKMfy4XHWKMJG0WKInCUt07e+AVOxB6lU1L 3pFJTc7mg4jfWXzkIhVvl9i59zgzZqoeFrinszk5vGcO6txuN3tCQlxRUdTjz7hEw/Es 1kTQgXXlsw0cDIPJgY5dEpQ+mSD23UtqgE4so4MsCNCEZ462qecCqvJ4CtW1RmU8WWfL DmTOLn3N/G2csAiC0BXfIyFpCUZvBdhYwhPSnm8ilAnBVbwQDrHkSyz7u2Hk/rQAQd0x i64o5CXlZT+pehjMN8giuZtN4M2xmyms1CeCLegwGSbGlRJwkQxyphSVDvWM9g+zWuqo TmXQ== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=ytB1HodJ3uOL4k6WEq9590BMCr/RiQOTAyFGP7blElc=; b=hoiqqASgN1V9HchdHhs/Y1WzK16gAUDpKuaHRzfh1B/Ufd0gI6T0J0B24ojCOnUNOD v42LVZiG5uuBEKF9AyqpQNpPbsZqrLhtrnYlQRlWbDKE6iNe+JsdwF4FqxnZ+FC/b3Wc /Ve34VLHj+gil4t2k3jlCgUyqDAEslkZW+VlaTMyKoH4OBElMrknlihjvuZRuCEAzLIW crsjcriKazAhlUVEUY7Ho9jQt1xZm5Jc3B59GgxwZm2+E+IHRhLlTH0Hu4GZi7T5ad4C vsAwNud1EEcufgAikYOlropAKU55tYCi6nx/SwJYuDghjysuFdGgIRE21uNl6O2PjEv+ vUAA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:mime-version:references:in-reply-to:from :date:message-id:subject:to:x-original-sender :x-original-authentication-results:reply-to:precedence:mailing-list :list-id:x-spam-checked-in-group:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=ytB1HodJ3uOL4k6WEq9590BMCr/RiQOTAyFGP7blElc=; b=oFbu90f2jB0w+2pHuwDtJKeYcVS+znP0XRdLSiXIfyxk66bw64ckQLNlZX+a0etsDQ OfjzlBAV6sIiQ3l9pVycoMNRh3cWRhTt3o+3MmHe2M7n1GdHmv0kHVL9IINoYWfq62i9 WaTW6Non1wMc3rY6BTKCDfmJa5Q9pfV+hxreoiAp/uPUToDnizuJawngg4QIblmbV7uk kzuVO2NVkccyWJSDVw3yCr/efkEyJpn6Gg3EG6acIYUUjLH8DTcBstxY67E89C4x9PYs oT9HKtAN+8h05xAKXDzezg9wphoynRU5wWUf0C+lxPkUvJMf0+Nvt9b93XW2dfmkJD47 ZDng== Sender: lojban@googlegroups.com X-Gm-Message-State: AIVw112kHL0oDMMlsYbrpDzekd/k7eo8/CqAxb2MMxV9InX2zcASnIZ1 ZHIDV9qprLxEew== X-Received: by 10.36.93.142 with SMTP id w136mr317423ita.10.1501256005618; Fri, 28 Jul 2017 08:33:25 -0700 (PDT) X-BeenThere: lojban@googlegroups.com Received: by 10.36.65.21 with SMTP id x21ls4322699ita.21.gmail; Fri, 28 Jul 2017 08:33:25 -0700 (PDT) X-Received: by 10.129.75.216 with SMTP id y207mr6063227ywa.118.1501256005168; Fri, 28 Jul 2017 08:33:25 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1501256005; cv=none; d=google.com; s=arc-20160816; b=s14BpOyKN1VYlwlRanUhOpa9j7g3CdVeffaVjCDpRtO/0OlwInF2zFLNaUi4hRbA2o NIma7DROLbrvL/75g8EjIbxqIN8XKW3+jk+FJFHbY+2ryrvKuVaYwp7e9e+vteNRhw1D hsu0ZlLdnJtjbK08WmOKnFXxCaXflDIqVdp8058U1/lIGbKEdOLEDz4k7VPGPS2S7za5 ZejYmfZ3y88yqdO5lYtZ8EtoffpnwZ5M71KslmPvjPX3Wh0vVocx5OK+E7t9Sb/LvE4U DvHg6yzPZVsEpnsH7AduQiurqu1cO/3OMNhnoeIAssubBJG2iKQG31r2CPcaxX0xqiJ+ 9p2g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=to:subject:message-id:date:from:in-reply-to:references:mime-version :dkim-signature:arc-authentication-results; bh=9JDyKWbEN33WXu4ViedMaINfM4/tV042EvG/gdijicM=; b=bjuo0U+Sd7QxaSEAKxA91RSp4g9Ywt3zB/dJDfiMARDEyzZAxUzVaYnWlRIhcwRMef jalANfgrfHO0pZhnzIJlaDLB2kjUtl4NxDz6DT6LyDR6m5xRLVfQbtxnS24kKZxyi8ir Bs/GWdvj4lAPZSVe2swg46CMLbqIUPAOMZlQBAhkrkibBo3D1RqSHFOnWl+Mi+gM7OfU njvyJxZ1+symf3ta7xl0kdaDtVBUr+n6ULCjXgK2vZ9YgruldI5dGkipBDBvFfHIlZDv 4HojXEqH/M3rLHwKz2+YpcjRbfYg2crTb1Y/MJf93RbZVzL93qYb5eBM7mY/DPA/Qzq0 lSoA== ARC-Authentication-Results: i=1; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.b=NyvsbxN/; spf=pass (google.com: domain of adamlopresto@gmail.com designates 2607:f8b0:400c:c08::22f as permitted sender) smtp.mailfrom=adamlopresto@gmail.com; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com Received: from mail-ua0-x22f.google.com (mail-ua0-x22f.google.com. [2607:f8b0:400c:c08::22f]) by gmr-mx.google.com with ESMTPS id n79si378952vke.3.2017.07.28.08.33.25 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Fri, 28 Jul 2017 08:33:25 -0700 (PDT) Received-SPF: pass (google.com: domain of adamlopresto@gmail.com designates 2607:f8b0:400c:c08::22f as permitted sender) client-ip=2607:f8b0:400c:c08::22f; Received: by mail-ua0-x22f.google.com with SMTP id k43so129375150uaf.3 for ; Fri, 28 Jul 2017 08:33:25 -0700 (PDT) X-Received: by 10.176.81.93 with SMTP id f29mr5955602uaa.113.1501256004644; Fri, 28 Jul 2017 08:33:24 -0700 (PDT) MIME-Version: 1.0 References: <3c86d96b-e0ea-af6b-2ee8-51d4e0741fe5@gmail.com> In-Reply-To: From: Adam Lopresto Date: Fri, 28 Jul 2017 15:33:14 +0000 Message-ID: Subject: Re: [lojban] Spaces in jbovlaste To: lojban Content-Type: multipart/alternative; boundary="94eb2c190c2adfb4cb05556266a3" X-Original-Sender: adamlopresto@gmail.com X-Original-Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@gmail.com header.b=NyvsbxN/; spf=pass (google.com: domain of adamlopresto@gmail.com designates 2607:f8b0:400c:c08::22f as permitted sender) smtp.mailfrom=adamlopresto@gmail.com; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Spam-Checked-In-Group: lojban@googlegroups.com X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , X-Spam-Score: -1.3 (-) X-Spam_score: -1.3 X-Spam_score_int: -12 X-Spam_bar: - --94eb2c190c2adfb4cb05556266a3 Content-Type: text/plain; charset="UTF-8" If you're going to allow cmavo to be combined arbitrarily (which is probably appropriate), then there's no reason for {lonu} to have its own entry. So I'd suggest not adding any cmavo clusters. And {lonulonucalo} can be grammatical, you just need the right text after it. {lonulonucalo nu jamna kei mi damba cu nandu mi cu se zungi mi}, "I feel guilty that it was hard for me to fight during the war." As you said, a fully grammar checker would be needed to really get things right, and that's a separate problem. On Fri, Jul 28, 2017 at 6:54 AM wrote: > coi la .ilmen. > > I just applied your idea (added split entries) and added merged entries... > And I also found a very simple way to add compound cmavo! > Indeed: > > - I created a script that splits jbovlaste entries into cmavo and > non-cmavo, by using a simple regex (using rules listed in the CLL, chapter > 4.2) > - Then I tagged all cmavo with a flag "C", and added the Hunspell rule > "CCC*" (~= "CC+"), which means you can "glue" 2 or more cmavo together. > > Of course, this will allow un-grammatical things such as "lonulonucalo", > but once again this is not the spell-checker role. > > I tried your example "calonu". It seems the "lonu" entry exists, so my > dictionary inteprets that as a "normal word" (= non-simple-cmavo) instead > of a "compound cmavo". But all following combinations are now valid : > > - ca, lo, nu > - lo nu, lonu, ca lo, calo > - ca lonu, calo nu, calonu > > Only calo & calonu are detected as a compound (remember "lonu" is an > entry), but anyway that works as expected. > Experimental cmavo support will be added soon. > > Do you know other rules that could be great integrating? > Please test ( https://github.com/Sukender/lojban-spell-check-dist ) and > give feedback! ki'e > > I still have issues with dots in LibreOffice (.i .a and such)... And some > words of "le cmalu noltru" are not recognized yet. Is there any other word > source I can use? > > co'o > > -- > Sukender > > -- > You received this message because you are subscribed to the Google Groups > "lojban" group. > To unsubscribe from this group and stop receiving emails from it, send an > email to lojban+unsubscribe@googlegroups.com. > To post to this group, send email to lojban@googlegroups.com. > Visit this group at https://groups.google.com/group/lojban. > For more options, visit https://groups.google.com/d/optout. > -- You received this message because you are subscribed to the Google Groups "lojban" group. To unsubscribe from this group and stop receiving emails from it, send an email to lojban+unsubscribe@googlegroups.com. To post to this group, send email to lojban@googlegroups.com. Visit this group at https://groups.google.com/group/lojban. For more options, visit https://groups.google.com/d/optout. --94eb2c190c2adfb4cb05556266a3 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
If you're going to allow cmavo to be combined arbitrar= ily (which is probably appropriate), then there's no reason for {lonu} = to have its own entry. So I'd suggest not adding any cmavo clusters.
And {lonulonucalo} can be grammatical, you just need the r= ight text after it. {lonulonucalo nu jamna kei mi damba cu nandu mi cu se z= ungi mi}, "I feel guilty that it was hard for me to fight during the w= ar." As you said, a fully grammar checker would be needed to really ge= t things right, and that's a separate problem.

On Fri, Jul 28, 2017 at 6:54 AM <sukender1@gmail.com> wrote:
coi la .ilmen.

=
I just applied your idea (added split entries) and added merged = entries... And I also found a very simple way to add compound cmavo!
<= div>Indeed:
  • I created a script that splits jbovlaste entr= ies into cmavo and non-cmavo, by using a simple regex (using rules listed i= n the CLL, chapter 4.2)
  • Then I tagged all cmavo with a flag "C= ", and added the Hunspell rule "CCC*" (~=3D "CC+")= , which means you can "glue" 2 or more cmavo together.
Of course, this will allow un-grammatical things such as &quo= t;lonulonucalo", but once again this is not the spell-checker role.

I tried your example "calonu". It seems th= e "lonu" entry exists, so my dictionary inteprets that as a "= ;normal word" (=3D non-simple-cmavo) instead of a "compound cmavo= ". But all following combinations are now valid :
  • ca= , lo, nu
  • lo nu, lonu, ca lo, calo
  • ca lonu, calo nu, cal= onu
Only calo & calonu are detected as a compound (remembe= r "lonu" is an entry), but anyway that works as expected.
Expe= rimental cmavo support will be added soon.

Do = you know other rules that could be great integrating?
Please test= (=C2=A0https://github.com/Sukender/lojban-spell-check-dist=C2= =A0) and give feedback! ki'e

I still have issu= es with dots in LibreOffice (.i .a and such)... And some words of "le = cmalu noltru" are not recognized yet. Is there any other word source I= can use?

co'o

--=C2=A0
Sukender

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsubscribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at https://groups.google.com/group/lojban.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsub= scribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http= s://groups.google.com/group/lojban.
For more options, visit http= s://groups.google.com/d/optout.
--94eb2c190c2adfb4cb05556266a3--