Received: from mail-pa0-f58.google.com ([209.85.220.58]:59605) by stodi.digitalkingdom.org with esmtps (TLSv1:RC4-SHA:128) (Exim 4.80.1) (envelope-from ) id 1WbCU4-0008AE-Ct for lojban-list-archive@lojban.org; Fri, 18 Apr 2014 10:22:05 -0700 Received: by mail-pa0-f58.google.com with SMTP id fa1sf453777pad.3 for ; Fri, 18 Apr 2014 10:21:46 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=date:from:to:message-id:subject:mime-version:x-original-sender :reply-to:precedence:mailing-list:list-id:list-post:list-help :list-archive:sender:list-subscribe:list-unsubscribe:content-type; bh=HMNkNDTWcidMtqNAuA4mOvOC6BA8VD/3rsuXCk/2q88=; b=wI+NFBhNTyYne6mCxuGsT4LG5m+3Lj6stP0Fp84hj3sa6WijIa7fTSUSztFY7z8bUl 9Czl8RPw5Gi/VyN5XU5DZmZPGlD6/EvdKoH34Gd2hZ3e5mZVBhupq03wuifMco80OsJd UjwSw/Um3EDZWHSMsFCiddgQhOBq3rFqu8X28c/SVGjG9NDdpc8X5cCNWYBMNMC/ufT8 jrH435sB3NSuSJ4bilksmQ6ZecIMqEn2CYeVr3cd5YD4XwxbXKKtS1C/3Z7lBm9YYMQA 14E9LIagdBDR8rryaEVj3pWuGAQcV9Bio3ZrOmOj4Qk4k6oXdyDHG/OZGh5YCsfJtuh/ cl5Q== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=date:from:to:message-id:subject:mime-version:x-original-sender :reply-to:precedence:mailing-list:list-id:list-post:list-help :list-archive:sender:list-subscribe:list-unsubscribe:content-type; bh=HMNkNDTWcidMtqNAuA4mOvOC6BA8VD/3rsuXCk/2q88=; b=v49OJLbNSjmtn1C7+iLfLcJvDrCVpMxv2lOIJNQ7ialgtgNF/iFLzpkvNqJfBZSHov bdnEJfTnWFLhtWqKn7xgFPj46xqiojtTQUTX6afjiHpugJqwspY+luqlucXdSo3osqIq XvGuEVQ5mHVgfT93TXONbyQqEOn9qNEBBeGYj4cpa1CQtxa7wgdmWtPLN0dXuHcKu3w1 PdTRiIrAL+C/zkVvi4tvf9Gbdp5KIb5g0oRnMPKJZKkBiiqEMd4l1P7w+bG1grpADhtr M/RBn5GuRtXOr+w+Yw9VSgyh44RZ/x57/QqSnHiUhjTxAXdcaWjQMkP8QL01BD9ulFpJ mcvQ== X-Received: by 10.50.131.201 with SMTP id oo9mr113318igb.4.1397841706299; Fri, 18 Apr 2014 10:21:46 -0700 (PDT) X-BeenThere: lojban@googlegroups.com Received: by 10.50.119.100 with SMTP id kt4ls358007igb.5.gmail; Fri, 18 Apr 2014 10:21:45 -0700 (PDT) X-Received: by 10.51.18.101 with SMTP id gl5mr113905igd.2.1397841705857; Fri, 18 Apr 2014 10:21:45 -0700 (PDT) Date: Fri, 18 Apr 2014 10:21:44 -0700 (PDT) From: Riley Martinez-Lynch To: lojban@googlegroups.com Message-Id: Subject: [lojban] jbovlaste, vlatai, camxes and morphology MIME-Version: 1.0 X-Original-Sender: shunpiker@gmail.com Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: Sender: lojban@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: multipart/alternative; boundary="----=_Part_2485_23146501.1397841704691" X-Spam-Score: -0.1 (/) X-Spam_score: -0.1 X-Spam_score_int: 0 X-Spam_bar: / ------=_Part_2485_23146501.1397841704691 Content-Type: text/plain; charset=UTF-8 A couple of issues (#26 , #37) have been recently raised against jbovlaste which can be traced to "vlatai", a tool which is used to verify and classify the morphology of new words. vlatai is a tool which is built as part of jbofi'e: Accordingly it has not been substantially updated for some time, and is known to exhibit bugs, including failing to parse some valid words. One suggestion has been to replace vlatai with camxes. I've taken some initial steps in that direction, but need some help verifying what the correct behavior should be for the cases where camxes and jbofi'e return different results. I ran all of the words in jbovlaste through camxes and filed issues #38, #39 , #40and #41 to record issues encountered with (respectively) cmavo, cmene, fu'ivla and lujvo. I will include a few examples: - camxes parses {y} as "initialSpaces", but it is considered a cmavo in jbovlaste. - camxes doesn't parse {bu} compounds like {denpa bu} as cmavo - {ybu} is a "cmavo cluster" in jbovlaste but a single cmavo per camxes - {aierne} is a fu'ivla in jbovlaste but a "cmavo + fu'ivla" per camxes - {selda'ergau} is a fu'ivla in jbovlaste but a lujvo per camxes - {zei} compounds aren't recognized as lujvo by camxes (nor in vlatai: jbovlaste has a workaround) If you can offer verifications or corrections for any of these issues, please respond here or add comments to the issues in github. I'd also like to know if there's consensus on which is more correct or current: "cmavo cluster" (jbovlaste) or "compound cmavo" (CLL), or if there's a distinction between these terms. Thank you! --Riley mi'e la mukti mu'o -- You received this message because you are subscribed to the Google Groups "lojban" group. To unsubscribe from this group and stop receiving emails from it, send an email to lojban+unsubscribe@googlegroups.com. To post to this group, send email to lojban@googlegroups.com. Visit this group at http://groups.google.com/group/lojban. For more options, visit https://groups.google.com/d/optout. ------=_Part_2485_23146501.1397841704691 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
A couple of issues (#26, #37) have been recently raised against jbovlaste which= can be traced to "vlatai", a tool which is used to verify and classify the= morphology of new words. vlatai is a tool which is built as part of <= a href=3D"https://github.com/lojban/jbofihe">jbofi'e: Accordingly it ha= s not been substantially updated for some time, and is known to exhibit bug= s, including failing to parse some valid words.

On= e suggestion has been to replace vlatai with camxes. I've taken some initia= l steps in that direction, but need some help verifying what the correct be= havior should be for the cases where camxes and jbofi'e return different re= sults.

I ran all of the words in jbovlaste through= camxes and filed issues #38, = #39, #40 = and #41 = to record issues encountered with (respectively) cmavo, cmene, fu'ivla and = lujvo. I will include a few examples:
  • camxes parses {y} a= s "initialSpaces", but it is considered a cmavo in jbovlaste.
  • c= amxes doesn't parse {bu} compounds like {denpa bu} as cmavo
  • {ybu} i= s a "cmavo cluster" in jbovlaste but a single cmavo per camxes
  • {a= ierne} is a fu'ivla in jbovlaste but a "cmavo + fu'ivla" per camxes<= br>
  • {selda'ergau} is a fu'ivla in jbovlaste but a lujvo per camxe= s
  • {zei} compounds aren't recognized as lujvo by camxes= (nor in vlatai: jbovlaste has a workaround)
If = you can offer verifications or corrections for any of these issues, please = respond here or add comments to the issues in github.

<= div>I'd also like to know if there's consensus on which is more correct or = current: "cmavo cluster" (jbovlaste) or "compound cmavo" (CLL), or if there= 's a distinction between these terms.

Thank you!

--Riley
mi'e la mukti mu'o

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsub= scribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http:= //groups.google.com/group/lojban.
For more options, visit http= s://groups.google.com/d/optout.
------=_Part_2485_23146501.1397841704691--