Received: from mail-qe0-f57.google.com ([209.85.128.57]:64047) by stodi.digitalkingdom.org with esmtps (TLSv1:RC4-SHA:128) (Exim 4.80.1) (envelope-from ) id 1VdtRZ-0001xY-1P for lojban-list-archive@lojban.org; Tue, 05 Nov 2013 19:06:14 -0800 Received: by mail-qe0-f57.google.com with SMTP id 6sf2042555qeb.22 for ; Tue, 05 Nov 2013 19:06:00 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=date:from:to:cc:message-id:in-reply-to:references:subject :mime-version:x-original-sender:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe:content-type; bh=B+S4bL3YJez/uW811Gc/uqES9fuxhrwXAwiGn2cMBJY=; b=Ik2iuk9cnBwz9dZ1wiqAAKSraV6aG/JX369TpEvixTZ6qhkuBTDr00ZGOABRdL1wRs ppdYKRma2D1XsW4LB4IeCEP8cx0qwSmEIXqFECjR4eCkWpnD9BOhvfqjU26huRMylh9n MJaLaZQRBtt+ZoIKkK+rMP843hcy9+8Pk9q0DONdf/2bTAUcNqVL0kkDl7MKpFMmge3H 6CZxRqdQH5o+v7akN7VzMoi2tqtv56pkrS7SdTinqrXZ7sKcOFXC8X4Fhdr9dePkPS68 25Dxz0hsdyPIJa2HFTJVjCdMAtYyB+wepLE/pcpqp1fkjMlYmiG5Dg3M/PNM2sE/Rse2 rVTQ== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=date:from:to:cc:message-id:in-reply-to:references:subject :mime-version:x-original-sender:reply-to:precedence:mailing-list :list-id:list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe:content-type; bh=B+S4bL3YJez/uW811Gc/uqES9fuxhrwXAwiGn2cMBJY=; b=XdWQC6tyAtJ2zU+DwAExZ4ss8GpjpQQY7Zklg7tfuDaB5qZNwXumuZbcrK26K/trmW 2oE95xcNJKFzZDqEc2FWNpva0dP89unnrTF3VvoXed01fQ06AgRYXPU14JckXYa6SQVw VCyvE07WkODLhV3YI8XFYt+Sf/2RHLhgL7KUTzOPUUXtyClBFrItmTslnRBWj2O/QP3C b6kjsqKA69v5970HPr6WvYQlQxKVhAuoymkl40ssmwcFQGvx1t5BeHeyg5XSyUVmBhmA SM2o5imfdn/uyzMYg39vxXr+QqKbkfW6j8dpdZhsdXm1p5/34TDVPsBcjUtU2Hd6ijiw Koig== X-Received: by 10.182.61.11 with SMTP id l11mr8774obr.5.1383707160678; Tue, 05 Nov 2013 19:06:00 -0800 (PST) X-BeenThere: lojban@googlegroups.com Received: by 10.182.148.106 with SMTP id tr10ls313318obb.77.gmail; Tue, 05 Nov 2013 19:05:59 -0800 (PST) X-Received: by 10.182.213.41 with SMTP id np9mr8807obc.3.1383707159127; Tue, 05 Nov 2013 19:05:59 -0800 (PST) Date: Tue, 5 Nov 2013 19:05:58 -0800 (PST) From: qx4096@gmail.com To: lojban@googlegroups.com Cc: qx4096@gmail.com Message-Id: <57ebe6f6-1063-4a6b-ae14-f8ebdb54e58c@googlegroups.com> In-Reply-To: <3534bf0f-d0a8-4b25-a0c8-52945fda2b4b@googlegroups.com> References: <3534bf0f-d0a8-4b25-a0c8-52945fda2b4b@googlegroups.com> Subject: [lojban] Re: jboselkei is back MIME-Version: 1.0 X-Original-Sender: qx4096@gmail.com Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: Sender: lojban@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: multipart/alternative; boundary="----=_Part_177_3822125.1383707158444" X-Spam-Score: -0.1 (/) X-Spam_score: -0.1 X-Spam_score_int: 0 X-Spam_bar: / ------=_Part_177_3822125.1383707158444 Content-Type: text/plain; charset=ISO-8859-1 I added some language support to the test jboselkei so that people can select a language when posting and searching. I don't know how to fix the problem of unicode characters being converted into ASCII characters. I also had some other problems with unicode. I had a PHP file that contained literal strings with unicode characters in it and saved it in unicode format, but when I displayed a file that included that file, the PHP code itself was displayed followed by a large amount of Chinese characters. I resaved the file in ANSI format and now names of some other languages appear as question marks on the jboselkei test. I'm also wondering about the database efficiency of some of the new features I added to jboselkei. I have queries like "SELECT translations.id,translations.entry trentry,translations.postid,posts.entry ptentry,posts.difficulty,translations.date,posts.language FROM translations INNER JOIN posts ON translations.postid=posts.id WHERE (translations.authorid<>1) AND 1 NOT IN (SELECT userid FROM reviews WHERE translations.id=reviews.transid) AND (difficulty BETWEEN 1 AND 5) AND language='en' ORDER BY difficulty DESC LIMIT 0,25;". I don't know much about writing efficient queries. I hope this doesn't just go through the computing resources to retrieve the whole database and then return 25 with the highest difficulty level. Also, I don't believe there is an index on the difficulty field. Should there be? The difficulty field changes a lot so I don't know if it is a good idea to add an index on it. I was also wondering if I should index the languages field since people will be searching by language. However, that field will have a small number of values that repeat many times. I know that Oracle SQL has a bitwise index which is ideal for this type of data, but I believe that MySQL does not. -- You received this message because you are subscribed to the Google Groups "lojban" group. To unsubscribe from this group and stop receiving emails from it, send an email to lojban+unsubscribe@googlegroups.com. To post to this group, send email to lojban@googlegroups.com. Visit this group at http://groups.google.com/group/lojban. For more options, visit https://groups.google.com/groups/opt_out. ------=_Part_177_3822125.1383707158444 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
I added some language support to the test jboselkei so tha= t people can select a language when posting and searching. I don't know how= to fix the problem of unicode characters being converted into ASCII charac= ters. I also had some other problems with unicode. I had a PHP file that co= ntained literal strings with unicode characters in it and saved it in unico= de format, but when I displayed a file that included that file, the PHP cod= e itself was displayed followed by a large amount of Chinese characters. I = resaved the file in ANSI format and now names of some other languages appea= r as question marks on the jboselkei test.

I'm also wondering about = the database efficiency of some of the new features I added to jboselkei. I= have queries like "SELECT translations.id,translations.entry trentry,trans= lations.postid,posts.entry ptentry,posts.difficulty,translations.date,posts= .language FROM translations INNER JOIN posts ON translations.postid=3Dposts= .id WHERE (translations.authorid<>1) AND 1 NOT IN (SELECT userid FROM= reviews WHERE translations.id=3Dreviews.transid) AND (difficulty BETWEEN 1= AND 5) AND language=3D'en' ORDER BY difficulty DESC LIMIT 0,25;". I don't = know much about writing efficient queries. I hope this doesn't just go thro= ugh the computing resources to retrieve the whole database and then return = 25 with the highest difficulty level. Also, I don't believe there is an ind= ex on the difficulty field. Should there be? The difficulty field changes a= lot so I don't know if it is a good idea to add an index on it.

I w= as also wondering if I should index the languages field since people will b= e searching by language. However, that field will have a small number of va= lues that repeat many times. I know that Oracle SQL has a bitwise index whi= ch is ideal for this type of data, but I believe that MySQL does not.

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsubscribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http:= //groups.google.com/group/lojban.
For more options, visit https://groups.google.com/groups/opt_out.
------=_Part_177_3822125.1383707158444--