Return-Path: Received: by snark.thyrsus.com (/\==/\ Smail3.1.21.1 #21.19) id ; Fri, 20 Mar 92 10:02 EST Received: by cbmvax.cbm.commodore.com (5.57/UUCP-Project/Commodore 2/8/91) id AA10344; Fri, 20 Mar 92 06:11:02 EST Received: from rutgers.edu by relay1.UU.NET with SMTP (5.61/UUNET-internet-primary) id AA12446; Fri, 20 Mar 92 05:42:16 -0500 Received: from cbmvax.UUCP by rutgers.edu (5.59/SMI4.0/RU1.4/3.08) with UUCP id AA19097; Fri, 20 Mar 92 05:02:45 EST Received: by cbmvax.cbm.commodore.com (5.57/UUCP-Project/Commodore 2/8/91) id AA07283; Fri, 20 Mar 92 04:58:44 EST Received: from pucc.Princeton.EDU by relay1.UU.NET with SMTP (5.61/UUNET-internet-primary) id AA20977; Fri, 20 Mar 92 03:06:24 -0500 Message-Id: <9203200806.AA20977@relay1.UU.NET> Received: from PUCC.PRINCETON.EDU by pucc.Princeton.EDU (IBM VM SMTP V2R2) with BSMTP id 2863; Fri, 20 Mar 92 03:05:55 EST Received: by PUCC (Mailer R2.08 ptf012) id 5979; Fri, 20 Mar 92 03:05:38 EST Date: Thu, 19 Mar 1992 23:52:30 -0500 Reply-To: Logical Language Group Sender: Lojban list From: Logical Language Group Subject: corpus of Lojban text X-To: lojban@cuvmb.cc.columbia.edu To: John Cowan Status: RO X-From-Space-Date: Fri Mar 20 10:02:05 1992 X-From-Space-Address: cbmvax!uunet!cuvma.bitnet!LOJBAN >Dump your text into some corpus. I don't know if the PLS is suitable. Given what Mark and Jerry have told me, that prefer to keep PLS from dealing with open-ended accumulations. All text posted to Lojban-List, as well as all text sent to me, is archived, whether or not we analyze it here. It will eventually be analyzable by those building dictionaries, or looking at the evolution of the language (I do also keep the drafts and the comments when I get them, since these often reveal WHY a text ended up the way it did.) The people here are analyzing text at a truly slow rate (Tuesday nite group is only doiung a paragraph or two, and that group will be switching to class mode woithin a couple of months). I hope this will pick up when we have Nora's version of Shoebox, or if Mark gets his version in a shape that someone else vcan use it. When anlyzing text (as opposed to merely having fun practicing reading it, both Nora and I feel obliged to check every word's place structures and a bunch of other things, as well as not to trust our memories on either glosses or place structures. Thus, I can read Lojban text MUCH faster than I 'analyze' it. Also, both Nora and I analyze text on paper much better than on-line, which presents the problem of getting the results typoed in. Thus, our mechanism for reviewing text should have no relation to what people do with text they have written. I would still like a copy sent here for archival purposes. That archive can serve as the 'corpus' Nick talks about. I'm not sure what can be done to make more old texts available on-line for others not in DC to access. lojbab