Received: from mail-oi0-f62.google.com ([209.85.218.62]:36378) by stodi.digitalkingdom.org with esmtps (TLSv1.2:ECDHE-RSA-AES128-GCM-SHA256:128) (Exim 4.87) (envelope-from ) id 1db3qt-0004ny-9R for lojban-list-archive@lojban.org; Fri, 28 Jul 2017 04:54:47 -0700 Received: by mail-oi0-f62.google.com with SMTP id e124sf8347209oig.3 for ; Fri, 28 Jul 2017 04:54:41 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:date:from:to:message-id:in-reply-to:references:subject :mime-version:x-original-sender:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:list-subscribe :list-unsubscribe; bh=pz9XLAynaToqE+1Tb1NtiC4Q0mz83mzNgKSkurztXG0=; b=NiMLdsbae0gA1QUHakBxOn88iEelBwAzEvbe3g0E+s7IL71l5+cM89+IK5rWOPBXqt mhgNaVyqEhym/VCquQulvXKnl3trvIAPxnMBQrkH6X1+J222vNDjxxWi67BKaZMy8eR6 GMom02/XxrJZsKD5fEH6U5GCDw0t864mvNPKE8sBH/5+UjzTwqiUurVOJaw26GQwo58z B0g4Pluh4ku+6VplxZ+/rMRm22QogPxH9hDJiH07aiOSIgopjrAjS+vunQ/wicUqV6aY Bi53+ED4FO2Xb/v2Xi7S4cI0b3cHD2dhwEUPfjLP/Oid9Jk2ow5DYsynWO1PzxW1PXfo lnQQ== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:to:message-id:in-reply-to:references:subject:mime-version :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:list-subscribe:list-unsubscribe; bh=pz9XLAynaToqE+1Tb1NtiC4Q0mz83mzNgKSkurztXG0=; b=h2UCcvy8ELF6wmPjfBrjy7bU+oNO6Ddg11QjufO5gNesDuRJCgnNzLqbbSlE+JMH5r Dyug/atkWoUDJub3cl6DqYYt6QjgBitXBE/nvHdo+S+QsSL0sIY+XT6Rrqt3fxsaXRGC KqoeMMTFh+TIzX4HS/zxDJjy9cnVYkOvhD/dJu4oFQiTSYT8dJdohwXYYG/4ns4cTCrI ZfKACEdZPI3+pWHVJ1btcaQq0+GjpJmEcgiHO2goEMIu8iYbKbF/5FiARFgWjwunkYNW 9oCSCS1/1a4Z+rDSXxsJJAqYLkNH1dRiFHzFCoBfrxyOAkYNWgHUU3XxZgnSGpydKNUs o11A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=sender:x-gm-message-state:date:from:to:message-id:in-reply-to :references:subject:mime-version:x-original-sender:reply-to :precedence:mailing-list:list-id:x-spam-checked-in-group:list-post :list-help:list-archive:list-subscribe:list-unsubscribe; bh=pz9XLAynaToqE+1Tb1NtiC4Q0mz83mzNgKSkurztXG0=; b=OM2B/szy1Dsb2u2qtaykPcRDhrC5sSiUbzopVHFZ/9W3PBUw2DOmnjgjMaepnxc03l PIsR/5DAtyQCizbWGKfbVi4W+0iTxjXRApCJVWzm8La8dgTBPwhhOA6cWI0Pq8FqqFFH ugjjei1066Rpyw49BB/eftbYSTwl6zT3zn114bTQ1r4ahWW39lRU6G0CSRqD+DcG1Nvv E0r/aGRp1b3duAsUKyoE0gZPaL5ztP6NgYT3YOnC/EihWjJ6Buk+2yxT22xh+t1otOGk aSLkW8aiYkJyml9LLj9tMxzSo9M+bMnHbzp8rOayu8yZsXVNu+MQJJLh6t1EaQR5egZ1 Ztyg== Sender: lojban@googlegroups.com X-Gm-Message-State: AIVw112W3doDNCJjEmOj9EW603w5NrntqPc/pdYgIoawWHI87vzlXNOh lC53zdxqNUBhUg== X-Received: by 10.36.80.193 with SMTP id m184mr284941itb.6.1501242874045; Fri, 28 Jul 2017 04:54:34 -0700 (PDT) X-BeenThere: lojban@googlegroups.com Received: by 10.107.21.2 with SMTP id 2ls3668772iov.21.gmail; Fri, 28 Jul 2017 04:54:33 -0700 (PDT) X-Received: by 10.31.54.206 with SMTP id d197mr34155vka.26.1501242873633; Fri, 28 Jul 2017 04:54:33 -0700 (PDT) Date: Fri, 28 Jul 2017 04:54:33 -0700 (PDT) From: sukender1@gmail.com To: lojban Message-Id: In-Reply-To: References: <3c86d96b-e0ea-af6b-2ee8-51d4e0741fe5@gmail.com> Subject: Re: [lojban] Spaces in jbovlaste MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="----=_Part_1637_493675176.1501242873399" X-Original-Sender: sukender1@gmail.com Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Spam-Checked-In-Group: lojban@googlegroups.com X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: , List-Unsubscribe: , X-Spam-Score: -1.8 (-) X-Spam_score: -1.8 X-Spam_score_int: -17 X-Spam_bar: - ------=_Part_1637_493675176.1501242873399 Content-Type: multipart/alternative; boundary="----=_Part_1638_1573052964.1501242873399" ------=_Part_1638_1573052964.1501242873399 Content-Type: text/plain; charset="UTF-8" coi la .ilmen. I just applied your idea (added split entries) and added merged entries... And I also found a very simple way to add compound cmavo! Indeed: - I created a script that splits jbovlaste entries into cmavo and non-cmavo, by using a simple regex (using rules listed in the CLL, chapter 4.2) - Then I tagged all cmavo with a flag "C", and added the Hunspell rule "CCC*" (~= "CC+"), which means you can "glue" 2 or more cmavo together. Of course, this will allow un-grammatical things such as "lonulonucalo", but once again this is not the spell-checker role. I tried your example "calonu". It seems the "lonu" entry exists, so my dictionary inteprets that as a "normal word" (= non-simple-cmavo) instead of a "compound cmavo". But all following combinations are now valid : - ca, lo, nu - lo nu, lonu, ca lo, calo - ca lonu, calo nu, calonu Only calo & calonu are detected as a compound (remember "lonu" is an entry), but anyway that works as expected. Experimental cmavo support will be added soon. Do you know other rules that could be great integrating? Please test ( https://github.com/Sukender/lojban-spell-check-dist ) and give feedback! ki'e I still have issues with dots in LibreOffice (.i .a and such)... And some words of "le cmalu noltru" are not recognized yet. Is there any other word source I can use? co'o -- Sukender -- You received this message because you are subscribed to the Google Groups "lojban" group. To unsubscribe from this group and stop receiving emails from it, send an email to lojban+unsubscribe@googlegroups.com. To post to this group, send email to lojban@googlegroups.com. Visit this group at https://groups.google.com/group/lojban. For more options, visit https://groups.google.com/d/optout. ------=_Part_1638_1573052964.1501242873399 Content-Type: text/html; charset="UTF-8" Content-Transfer-Encoding: quoted-printable
coi la .ilmen.

I just applied your idea= (added split entries) and added merged entries... And I also found a very = simple way to add compound cmavo!
Indeed:
  • I cre= ated a script that splits jbovlaste entries into cmavo and non-cmavo, by us= ing a simple regex (using rules listed in the CLL, chapter 4.2)
  • The= n I tagged all cmavo with a flag "C", and added the Hunspell rule= "CCC*" (~=3D "CC+"), which means you can "glue&qu= ot; 2 or more cmavo together.
Of course, this will = allow un-grammatical things such as "lonulonucalo", but once agai= n this is not the spell-checker role.

I tried your= example "calonu". It seems the "lonu" entry exists, so= my dictionary inteprets that as a "normal word" (=3D non-simple-= cmavo) instead of a "compound cmavo". But all following combinati= ons are now valid :
  • ca, lo, nu
  • lo nu, lonu, c= a lo, calo
  • ca lonu, calo nu, calonu
Only calo & ca= lonu are detected as a compound (remember "lonu" is an entry), bu= t anyway that works as expected.
Experimental cmavo support will be adde= d soon.

Do you know other rules that could be = great integrating?
Please test (=C2=A0https://github.com/Sukender/lojban-spe= ll-check-dist=C2=A0) and give feedback! ki'e

I still have issues with dots in LibreOffice (.i .a and such)... And som= e words of "le cmalu noltru" are not recognized yet. Is there any= other word source I can use?

co'o
<= br>
--=C2=A0
Sukender

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsub= scribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http= s://groups.google.com/group/lojban.
For more options, visit http= s://groups.google.com/d/optout.
------=_Part_1638_1573052964.1501242873399-- ------=_Part_1637_493675176.1501242873399--