Date: Sat, 15 Apr 2017 02:15:36 -0700 (PDT)
From: gleki.is.my.name@gmail.com
To: lojban
Reply-To: lojban@googlegroups.com
Message-Id: <8f7cc5e1-e77e-47a1-862d-e3127c8016be@googlegroups.com>
Subject: [lojban] Perfect taxonomy of live beings (April 1 aftershocks)

Let's take the taxonomy run by NCBI.
Approach 1.

The .ence- pseudoprefix for the id of a taxon provides us with around 800,000 taxa.

.encesoxanoxa is a live being of the species Homo sapiens (id 9606).
.encesomusoze is the bonobo, earlier known in Lojban as jbonobo.

Definitely, {.encesoxabimu} (Felis catus, id 9685) is easier to use than {zdani mlatu}.

jbonobo happily got a new name; jboremna can start using it.
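
The three words above all follow one pattern: the taxon id is spelled out digit by digit with the Lojban digit words (no, pa, re, ci, vo, mu, xa, ze, bi, so) after the .ence- pseudoprefix. A minimal Python sketch of that reading follows. The function names are made up for illustration, and the assumption that every id is spelled one digit at a time is inferred from the three examples only.

```python
# Lojban digit words for 0-9; each one is exactly two letters long.
DIGITS = ["no", "pa", "re", "ci", "vo", "mu", "xa", "ze", "bi", "so"]
PREFIX = ".ence"


def taxid_to_word(taxid: int) -> str:
    """Spell an NCBI taxon id digit by digit after the .ence- pseudoprefix."""
    if taxid < 0:
        raise ValueError("taxon ids are non-negative")
    return PREFIX + "".join(DIGITS[int(d)] for d in str(taxid))


def word_to_taxid(word: str) -> int:
    """Invert taxid_to_word by reading the body two letters at a time."""
    if not word.startswith(PREFIX):
        raise ValueError(f"missing {PREFIX} prefix: {word!r}")
    body = word[len(PREFIX):]
    if not body or len(body) % 2:
        raise ValueError(f"body is not a sequence of digit words: {word!r}")
    # DIGITS.index raises ValueError for any pair that is not a digit word.
    digits = [str(DIGITS.index(body[i:i + 2])) for i in range(0, len(body), 2)]
    return int("".join(digits))


if __name__ == "__main__":
    for taxid in (9606, 9597, 9685):
        print(taxid, taxid_to_word(taxid))
```

Because every digit word has the same length, the back conversion needs no separators between digits.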

Approach 2.
Another approach is an algorithm that adapts Latin names to Lojban morphology so that back conversion is unambiguous.

We've successfully converted >99% of species names.
You can see a table with all those names and a special column telling whether back conversion from Lojban correctly restores the original Latin name. We've ignored the distinction between upper and lower case since it's of no importance:

Open table

A few names haven't been converted:

Opisthoteuthis sp. B-PCHH2001
Gautieria sp. HH2221Swiss
and only 2,600 other names out of ~800,000. Those are mostly names of subspecies (which can be handled additionally when capital-letter abbreviations are found) or ad hoc names that are better quoted with zoi. ... zoi. as provided by the Lojban language.
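
The back-conversion column in the table can be thought of as a round-trip check: convert a name, convert it back, and compare while ignoring case. Here is a rough Python sketch of such a check. The toy_encode and toy_decode pair is a hypothetical stand-in and not the conversion algorithm described in this message; it only accepts names made of plain letters and single spaces, so the harness has something to run against.

```python
import re
from typing import Callable, Iterable, List, Optional

Converter = Callable[[str], Optional[str]]


def toy_encode(name: str) -> Optional[str]:
    """Hypothetical stand-in converter, NOT the real Latin-to-Lojban algorithm.

    Accepts only letters separated by single spaces; returns None otherwise.
    """
    if not re.fullmatch(r"[A-Za-z]+( [A-Za-z]+)*", name):
        return None
    return name.lower().replace(" ", ".")


def toy_decode(word: str) -> Optional[str]:
    """Inverse of toy_encode (the original letter case is not recovered)."""
    return word.replace(".", " ")


def round_trips(name: str, encode: Converter, decode: Converter) -> bool:
    """True if decode(encode(name)) restores the name, ignoring case."""
    encoded = encode(name)
    if encoded is None:
        return False
    decoded = decode(encoded)
    return decoded is not None and decoded.lower() == name.lower()


def unconverted(names: Iterable[str], encode: Converter, decode: Converter) -> List[str]:
    """Collect the names that fail the round trip."""
    return [n for n in names if not round_trips(n, encode, decode)]


if __name__ == "__main__":
    sample = [
        "Homo sapiens",
        "Felis catus",
        "Opisthoteuthis sp. B-PCHH2001",
        "Gautieria sp. HH2221Swiss",
    ]
    print(unconverted(sample, toy_encode, toy_decode))
```

Any real converter pair could be passed to unconverted in place of the toy one; the comparison itself stays the same.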

la jbovlaste is the main database of Lojban words and soon it will get 800,000 new words.


</aftershock>
