Date: Fri, 24 Oct 2014 03:39:57 -0700 (PDT)
From: mukti
To: bpfk-list@googlegroups.com
Message-Id: <9c2066d4-8da6-48ec-9cfb-63f79ca42187@googlegroups.com>
References: <33A9DB5129C54FFF85FCDD708B6909D8@gmail.com>
Subject: Re: [bpfk] camxes and syllabification in zi'evla

On Thursday, October 23, 2014 6:01:20 PM UTC-3, xorxes wrote:
>
> That's a bug in the morphology, well found! I've changed the
> consonantal-syllable rule to:
>   consonantal-syllable <- consonant syllabic !nucleus (consonant &spaces)?
>

Using this rule change, I reverified the current contents of jbovlaste and the camxes test corpus.
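
For readers less familiar with PEG notation, the revised rule quoted above can be read roughly as in the sketch below. This is only an illustration under loud assumptions: the character classes are simplified stand-ins for the real camxes definitions (no commas, diphthongs, or glides), "spaces" is approximated as whitespace or end of input, and the function names are hypothetical rather than taken from camxes.

```python
# Simplified stand-ins for the camxes character classes. These are
# assumptions for illustration, not the definitions in the real grammar.
CONSONANTS = set("bcdfgjklmnprstvxz")
SYLLABICS = set("lmnr")
VOWELS = set("aeiouy")


def at_spaces(text, pos):
    """&spaces, approximated as: end of input or a whitespace character."""
    return pos >= len(text) or text[pos].isspace()


def at_nucleus(text, pos):
    """nucleus, approximated as: the next character is a vowel or y."""
    return pos < len(text) and text[pos] in VOWELS


def match_consonantal_syllable(text, pos=0):
    """Return the end index of a match starting at pos, or None on failure."""
    # consonant
    if pos >= len(text) or text[pos] not in CONSONANTS:
        return None
    pos += 1
    # syllabic
    if pos >= len(text) or text[pos] not in SYLLABICS:
        return None
    pos += 1
    # !nucleus: fail if a nucleus starts here (consumes nothing)
    if at_nucleus(text, pos):
        return None
    # (consonant &spaces)?: optionally take one more consonant, but only
    # if that consonant is immediately followed by spaces
    if pos < len(text) and text[pos] in CONSONANTS and at_spaces(text, pos + 1):
        pos += 1
    return pos
```

Under these simplifications, `match_consonantal_syllable("tr")` returns 2, `match_consonantal_syllable("tra")` returns None because a nucleus follows the syllabic, `match_consonantal_syllable("trn")` returns 3, and `match_consonantal_syllable("trnk")` returns 2 because the trailing consonant is only consumed when spaces follow it.
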
The following words in jbovlaste are no longer considered morphologically valid according to this rule:

*cmevla:*
klivlynd
smanyjinkytoldu'evir

*fu'ivla:*
artmozaiko
asnrlatna
asnrtarbi
badnrgrute
baknrzebu
bilmrtuberkulosi
catnrpepiskopo
ciblrmoru
cipnrdodo
cipnrdromai
cipnrfalko
cipnrfasani
cipnrkanario
cipnrkorvo
cipnrkuku
cipnrlaridei
cipnrlori
cipnrpaseru
cipnrpika
cipnrsagitariidai
cipnrsikonia
cipnrstrutio
cipnrxirundo
cipnrxuazine
cirlnrokforte
cirlnxiogluto
cirlrbri
cirlrceda
cirlrfeta
cirlrgorgonzola
cirlrkamumberti
cirlrmozarela
cirlrpanira
cirlrparmaregio
cirlrpreste
cirlrstilto
cirlrxalumi
cirlrxauda
dasrngeko
datnrzbaselpla
finprsinxnatfidai
finprsinxnatfinai
fipnrpetoikti
fipnrprotopteru
fiprntosfenu
gurnrbulguru
gurnrtefi
guzmrkukurbita
jatnrpapa
jinmrberilo
jinmrplati
jinmrtitani
jinmrtuli
jinmrxafni
jisrnxananase
jivnlragbi
jivnrfarzu'e
juknrfalangida
kamrngogolo
klaktno
koblrsinapi
kulnrfarsi
kulnrnorge
kulnrnorgo
kulnrsfe'enska
kulnrtai
kulnrturkie
kulnrturko
kulnrxirani
latmrbizanto
mabrnfuru
mastla
matnrmiristika
maxrnspelta
mivrlge
mudrnsia
mustlei
navnlrado
navnrkripto
navnrxeno
nimrnlatifolia
nimrnlimone
nimrnxaurantifolia
pipnrpiano
postmo
purmrderi
ranmrdrakono
rartni
ricrlbizi
ricrnsia
runtngasnrproni
sodnlrubidi
sodnrcesi
sodnrfransi
sodnrkali
sodnrlito
sparknipofia
srasrnrupia
tabrntromba
tabrnvuvuzela
venzla
vibnrbarpinji
xagrnklarineto
xagrnsaksofono
xaslrkianga
xipfne
xubrnre'u
xubrnrumeksa

*zei-lujvo:*
pipnrpiano zei konceto

Additionally, the following words from the camxes test corpus, which includes examples from CLL and text from "Alice", no longer parse:

bangrtlingana
banrtlingana
banrtlinganu
bilmrmautisma
bilmrdisleksia
bongnanba
cipnrparota
cipnrpisitako
cipnrxakuila
cirlrkotadja
danlrxelefanta
danlrkoralo
datnrselecti
gudjrati
ingmeme
kulnrmerka
kulnrperu
kulnrsu,omi
kulnrtcosena
mablrbastarda
mabrnmustela
natmrnorge
xukmrkokeina
xukmrkokeini

If these changes are approved, I will update jbovlaste, reclassifying the words as obsolete. I will also mark as unparseable the sentences in the test corpus which contain the problematic words.

mi'e la mukti mu'o