[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [lojban] Spaces in jbovlaste
- To: lojban@googlegroups.com
- Subject: Re: [lojban] Spaces in jbovlaste
- From: Ilmen <ilmen.pokebip@gmail.com>
- Date: Thu, 27 Jul 2017 13:04:33 +0200
- Arc-authentication-results: i=2; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.b=H5u+qGMa; spf=pass (google.com: domain of ilmen.pokebip@gmail.com designates 2a00:1450:400c:c0c::22e as permitted sender) smtp.mailfrom=ilmen.pokebip@gmail.com; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com
- Arc-authentication-results: i=1; gmr-mx.google.com; dkim=pass header.i=@gmail.com header.b=H5u+qGMa; spf=pass (google.com: domain of ilmen.pokebip@gmail.com designates 2a00:1450:400c:c0c::22e as permitted sender) smtp.mailfrom=ilmen.pokebip@gmail.com; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com
- Arc-message-signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-unsubscribe:list-subscribe:list-archive:list-help:list-post :list-id:mailing-list:precedence:reply-to:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:to:subject:arc-authentication-results :arc-message-signature:sender:dkim-signature:dkim-signature :arc-authentication-results; bh=DWREXaPKtBYj11GLvOG0ChGITKDSR09/RxVxptjlt8Y=; b=alFItCFfjMmNJr+DfZu1rx2YKm9O9eX0nPs0nR/JUnLVIXdj/ApzxaifIjWyJ+pKvQ 9XYhJuX2kZjsD6eJL+Nbg9HcjFi2ZDNSwBxFnGiQPG7VKQicuN/iL5FKMekSCCjoe0bT xDe5ujcNuqNK5gkRg5VbBWkcm39dvN0l157jRwef1SJT1ZaKlEcuLJ38bysHhtrAHG1m aOk/qvgCWj6nDfBD2sckrnBuErdH3mLo1Ra8i0J5MCcyDzPRwDmWH2UBAq5kjZbH0bBd HH2/zrT9tQYYPxZ9mWE+jrw404bamQxDRsVEylfgKd6V4c6xxEgHDfG1zp14SUMZT9Kx 45tA==
- Arc-message-signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-language:content-transfer-encoding:in-reply-to:mime-version :user-agent:date:message-id:from:references:to:subject :dkim-signature:arc-authentication-results; bh=ZOsx4ViHvLN+Wa5TsNLVDJFz4I7O+syTLUGpYttDMKc=; b=S9ZvuLnFaMn8gQEagbPqeUMKlMmzi78jqnaWC0K8iyyYmph6YRwHXh6eiJRpfwR8mO QRTYMdrk3/x9ONtpITMi9EhZL3X++TJSGhiU2dvGm9FKF8SnqaTodwGdLKQIuBj9keGx tQfhBL64/GTEqP3hMEqsPy0x9pzzLaSvgiZwZMgMx6kx0iRoTp6P3Xz3Z9GRg4s+045l nALcpExoWFYKQXVEDGonMU2qzzet1K6O85nVI0tAWmEFWFve2DabpcW5GPRZ4WinEaqA d3oaTeZD/uRWZudalt4m5o7efq+BzHDIdsBpqq+hyOon3VYrHcQy++84Vs71EatmV4c+ lBhw==
- Arc-seal: i=2; a=rsa-sha256; t=1501153493; cv=pass; d=google.com; s=arc-20160816; b=lZE6NIM+VPEPEGzpqmjPaxCQJkUQ9Zs3CbMawLHjlY7eXqpA7Qg6ANMd1mGEQ47V/1 x4ka7Of91cw7FWUHfJ578WS8JaAYouCuoHkaIblR/iryLfxm3qHaMyuH9HJABQ1HbeHW 6u/y4gKJpJBjy7YzdGXucosY1NFvt7YyFFVejYCYSsP1+ZHWAUXuxpt3V9OUnZ8Bp6L2 x+LlSdTBGidj0OAwfp3YADQwizDq4sIwzlMIoChmXbRhtZ2q9C+xWDYsz/MaxlgwcooP IV6f3AgxYW3fYazvQHHlbBb4/yyVR7Ma0v5aRqsdkXeOQn2jVd/gGdgCDZw3YnfdEpgq NayA==
- Arc-seal: i=1; a=rsa-sha256; t=1501153491; cv=none; d=google.com; s=arc-20160816; b=dn0+M6uOwgYrB/jZo2k0C4flYX4a8aMv725yCa916vy+SDQP3EUIeIMkqL2cmG8cik 9KnrKAnPrNFz9UBksbY6EBY0YDxYvTQ7sD/BOyx91FidEb6sRdT5HmfzHhFo9DlmneOW zWCB0kvyo/tZPv2tK8n6oiZc+sWBnfYk67+2PidftLeeSY+vNMdcp1vCqB0jQ4vkKg8h iQrUC4zEllIkVIEL59xhJpKnUJo6PR517kiVxI+eoHkpKb/ASPTBoW7Km2xxp/HmVhuJ sABDXKXwRewknXx1temkfILdRhXyt2YNl19f0cx26M2rlx5sA6EH+Hpbr1I9iWQXDC/5 KoXQ==
- Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20161025; h=sender:subject:to:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-transfer-encoding:content-language :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=DWREXaPKtBYj11GLvOG0ChGITKDSR09/RxVxptjlt8Y=; b=TW+s374t1JQccurxgs26bBIiKYEp8/G4PjBl4FP4eW8TaCVSlsTaOoDBl7Q+mPNmeX aZ7aeQA2C0jjYQ2OueSDvif+LMMsnA0XLyR3g68ZSqDeBnaDF9mAM4Tciv6VrArmb9qI cKeW42nQbsMe9B7qEzDrviJ/7wvSYWiEiQiRBlS97D5eR0psJomxt0j2wlbcbdpJBoj7 X43ZtDgcW7t9pKFq+9wB0QLNf03HQJLPOTMPUmVhwnpwo8T3wLpQQqUA6/Vm40O6YsbJ BHZF6opV5YBDw+OJOtf9Vvbi8OroDBXbmMs4Gg5b0iVFf3Vuqxz+QxGKIYozN3BcIf7B SNiw==
- Dkim-signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:to:references:from:message-id:date:user-agent:mime-version :in-reply-to:content-transfer-encoding:content-language :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:list-post:list-help:list-archive :list-subscribe:list-unsubscribe; bh=DWREXaPKtBYj11GLvOG0ChGITKDSR09/RxVxptjlt8Y=; b=N+U1F8M5bsBiSLJc101aANndMKaCCuTUmtfagQ52nkqEsFq5jSsLAGXe8l4j36XcXb Qx3aR/FIq4tZxKWNR466g/TlzQ2KrckCYu5+kvRoRFj7NT0dEFThV/c+lNEMrzEcVu5m KaZiuUIAchHsAneTuhjncsKIuoEfRU9R+KfF4tsZRmVjdkxcpg1PPmgjyHdFsAvgMUww OJKOs87Uobf9zV339pTUiNbd8o6uwqEih/Wu4fO9jBIw2dtn1sZjQXVipDtz5bYvNiot Oei1V0ZQQmiM5XivEIwxBQPDgFIBUQVB5HDE+vruBc8hEFopyS4N33moHC5pvkGw4FJU DhwQ==
- In-reply-to: <b8da37b0-bc48-417f-ad27-6ba85424a312@googlegroups.com>
- List-archive: <https://groups.google.com/group/lojba>
- List-help: <https://groups.google.com/support/>, <mailto:lojban+help@googlegroups.com>
- List-id: <lojban.googlegroups.com>
- List-post: <https://groups.google.com/group/lojban/post>, <mailto:lojban@googlegroups.com>
- List-subscribe: <https://groups.google.com/group/lojban/subscribe>, <mailto:lojban+subscribe@googlegroups.com>
- List-unsubscribe: <mailto:googlegroups-manage+1004133512417+unsubscribe@googlegroups.com>, <https://groups.google.com/group/lojban/subscribe>
- Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com
- References: <b8da37b0-bc48-417f-ad27-6ba85424a312@googlegroups.com>
- Reply-to: lojban@googlegroups.com
- Sender: lojban@googlegroups.com
- User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.2.1
If spell checkers are only concerned with identifying what is a correct
word and what isn't, then you should disregard Jbovlaste entries
containing whitespace (they are multi-words lexemes), or even better,
check all the words that compose them to see if any of them is missing
from your spell-check whitelist (I strongly suspect there exists bu and
zei compounds containing words that appears nowhere else in the
dictionary…).
"re zei zgabube" is indeed a sequence of three words. It is present in
the dictionary because it is an independent lexeme, you cannot
accurately derive its meaning from its parts. This occurs all the times
in natlangs, think for example to the English "take off".
As for cmavo sequences, people are allowed to chain them up without
whitespaces in between (this causes no ambiguity), although nowadays it
seems more common to always separate them with whitespaces. For a
spell-checker, two strategy are possible: the lazy one would be to
enforce the style of putting whitespaces between every cmavo, thus
marking e.g. "lonu" as incorrect; the second strategy, more involved,
would be to check any unknown letter string to see if it matchs a
sequence of cmavo, and allow it if it does (e.g. if the program hits
"calonu" and is able to find it can be a sequence of cmavo ca+lo+nu,
only then it would allow it). But I don't know if the software you're
using is able to do that without an explicit and systematic list of all
allowable cmavo strings…
If the software were to need an explicit and exhaustive list of allowed
words, I guess it wouldn't be very handy to use for very synthetic
languages (e.g. Turkish, Quechua, Greenlandic…), which might have an
infinite number of valid words.
—Ilmen.
On 27/07/2017 10:49, sukender1@gmail.com wrote:
coi ro do
I found entries with spaces in jbovlaste. This is an issue for spell
checking dictionaries (actually in "aspell"). I know that spaces are
not relevant when parsing Lojban, but they're still important for
human reading. This is why I would not like a rule like "import every
entry and remove spaces everywhere"...
So, I understand that it may be normal for compound cmavo, like "tai
da'i", but can't these be written without space ("taida'i") without
breaking the reading flow?
However, some entries seem very strange to me, such as "re zei
zgabube". Aren't these 3 separated words??
Thank you for your explanations.
co'o
--
You received this message because you are subscribed to the Google Groups "lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an email to lojban+unsubscribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at https://groups.google.com/group/lojban.
For more options, visit https://groups.google.com/d/optout.