Received: from mail-qc0-f190.google.com ([209.85.216.190]:59332) by stodi.digitalkingdom.org with esmtps (TLSv1:RC4-SHA:128) (Exim 4.80.1) (envelope-from ) id 1ViDFu-0002TH-5g for lojban-list-archive@lojban.org; Sun, 17 Nov 2013 17:04:05 -0800 Received: by mail-qc0-f190.google.com with SMTP id n4sf972382qcx.17 for ; Sun, 17 Nov 2013 17:03:51 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=date:from:to:message-id:in-reply-to:references:subject:mime-version :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe:content-type; bh=MI5swTkA2dUPMmWGQ+mtMIigdF/UTiStVjwI47K3pAo=; b=dvcOYquWdpYLNPKWEOqr03df0oS7OZ01PcivX4fNO7ggEqM5mx/Z872WDVr9PE0QX0 i5eYCFW3avsGlYhyvN8eFketfBjd3DTT8SAXqgI156iGvx92uB53W68tzzWekvFIECqX j2r2yRFwr0DAQdftIyObeoZjBuvJF9+mQtBOQttk2VMD7njZHxVAeqvFBvVl0dzDy9L4 V3u/arM55qUGyWy6FjTTyEryvKpvfhSKu4gElxWBfKmYImhbA5g9k3Qol5j7zF0Y8z5j brmJfxhRmY7FzqlzmMMhvkpIHpiDrnwKE+3hnYYijJaJShq61Sat4jWc4fwbmNphDY4g zdrw== DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=date:from:to:message-id:in-reply-to:references:subject:mime-version :x-original-sender:reply-to:precedence:mailing-list:list-id :list-post:list-help:list-archive:sender:list-subscribe :list-unsubscribe:content-type; bh=MI5swTkA2dUPMmWGQ+mtMIigdF/UTiStVjwI47K3pAo=; b=obWoZUp2aIUoL0e7PX9vZFvbIiEoUd6bcoN6saaDtSYJrrKVurwgEjULrnCG4EjxK0 GZeD1A36nNpy3AMEt1eGcvKQrdzxgG/+K141KYmaRC9eKFE6+AX75MzodznLytV2p3TT d81teJZA/4CeexHHMu7a2CRLEDj2If9HojS+QseJXoQ3vYaVkZliXH5Y3GD+04XVZMv6 mrrTZ07+Vn69XEVBrOMfMzDteSBWeEDDlQB5vRPxyk2dMdxTGyFsTEexcZymgzfd8nKf wmuhw74/fcs6mT94AHw1s3Nqqj4yBzg0ug8S3+KtMBigK68cSp84KJWQuNg7wKJJq4Q7 5+5w== X-Received: by 10.50.87.71 with SMTP id v7mr291338igz.11.1384736631744; Sun, 17 Nov 2013 17:03:51 -0800 (PST) X-BeenThere: lojban@googlegroups.com Received: by 10.50.61.168 with SMTP id q8ls600216igr.26.gmail; Sun, 17 Nov 2013 17:03:51 -0800 (PST) X-Received: by 10.50.83.6 with SMTP id m6mr252858igy.1.1384736631329; Sun, 17 Nov 2013 17:03:51 -0800 (PST) Date: Sun, 17 Nov 2013 17:03:50 -0800 (PST) From: qx4096@gmail.com To: lojban@googlegroups.com Message-Id: In-Reply-To: <2189420.Qb1DWKTUXO@caracal> References: <3534bf0f-d0a8-4b25-a0c8-52945fda2b4b@googlegroups.com> <2189420.Qb1DWKTUXO@caracal> Subject: Re: [lojban] Re: jboselkei is back MIME-Version: 1.0 X-Original-Sender: qx4096@gmail.com Reply-To: lojban@googlegroups.com Precedence: list Mailing-list: list lojban@googlegroups.com; contact lojban+owners@googlegroups.com List-ID: X-Google-Group-Id: 1004133512417 List-Post: , List-Help: , List-Archive: Sender: lojban@googlegroups.com List-Subscribe: , List-Unsubscribe: , Content-Type: multipart/alternative; boundary="----=_Part_1510_5599420.1384736630229" X-Spam-Score: -0.1 (/) X-Spam_score: -0.1 X-Spam_score_int: 0 X-Spam_bar: / ------=_Part_1510_5599420.1384736630229 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Most of the characters appear as question marks. I tried "$b =3D=20 iconv("UTF-8","ISO-8859-1",$b);" but I got an error saying, "Detected an=20 illegal character in input string".=20 On Sunday, 17 November 2013 16:19:45 UTC-8, Pierre Abbat wrote: > > On Sunday, November 17, 2013 15:02:38 qx4...@gmail.com wrote= :=20 > > Sorry, but it doesn't look like a keyword search will work. From the=20 > > research that I have done, full text indices can only be added to text= =20 > > columns and not to blob columns, and the data for the posts and=20 > > translations are blobs not text. When I changed the column from a blob= =20 > to a=20 > > text on a test database on my local computer, it converted the unicode= =20 > > characters to random looking ASCII characters, so I had to change it=20 > back.=20 > > If you take the random-looking characters and run them through iconv, do= =20 > you=20 > get Unicode? Are they actually ASCII, or do they have accents?=20 > > $ iconv -f latin1 -t utf-8=20 > A vonaton egy =C5=91r=C3=BClt, mellette egy =C5=91r =C3=BClt. =C3=96r=C3= =BClt az =C5=91r=C3=BClt, hogy mellette=20 > egy =C5=91r=20 > =C3=BClt.=20 > A vonaton egy =C3=85r=C3=83=C2=BClt, mellette egy =C3=85r =C3=83=C2=BClt.= =C3=83r=C3=83=C2=BClt az =C3=85r=C3=83=C2=BClt, hogy=20 > mellette=20 > egy =C3=85r =C3=83=C2=BClt.=20 > > The second byte of "=C5=91", 0x91, is invisible because it's a control=20 > character.=20 > > (lo fenki cu zvati lo trene .i lo zgaku'i cu mlana ra .i lo fenki cu glek= i=20 > le=20 > nu lo zgaku'i cu mlana ra)=20 > > mu'omi'e .pier.=20 > --=20 > loi mintu se ckaji danlu cu jmaji=20 > > --=20 You received this message because you are subscribed to the Google Groups "= lojban" group. To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsubscribe@googlegroups.com. To post to this group, send email to lojban@googlegroups.com. Visit this group at http://groups.google.com/group/lojban. For more options, visit https://groups.google.com/groups/opt_out. ------=_Part_1510_5599420.1384736630229 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Most of the characters appear as question marks. I tried "= $b =3D iconv("UTF-8","ISO-8859-1",$b);" but I got an error saying, "Detected an illegal character in input string".

On= Sunday, 17 November 2013 16:19:45 UTC-8, Pierre Abbat wrote:
On Sunday, November 17, 2013 15:02:38 qx4...@gmail.com wrote:
> Sorry, but it doesn't look like a keyword search will work. From t= he
> research that I have done, full text indices can only be added to = text
> columns and not to blob columns, and the data for the posts and
> translations are blobs not text. When I changed the column from a = blob to a
> text on a test database on my local computer, it converted the uni= code
> characters to random looking ASCII characters, so I had to change = it back.

If you take the random-looking characters and run them through iconv, d= o you=20
get Unicode? Are they actually ASCII, or do they have accents?

$ iconv -f latin1 -t utf-8
A vonaton egy =C5=91r=C3=BClt, mellette egy =C5=91r =C3=BClt. =C3=96r= =C3=BClt az =C5=91r=C3=BClt, hogy mellette egy =C5=91r=20
=C3=BClt.
A vonaton egy =C3=85r=C3=83=C2=BClt, mellette egy =C3=85r =C3=83=C2=BCl= t. =C3=83r=C3=83=C2=BClt az =C3=85r=C3=83=C2=BClt, hogy mellette=20
egy =C3=85r =C3=83=C2=BClt.

The second byte of "=C5=91", 0x91, is invisible because it's a control = character.

(lo fenki cu zvati lo trene .i lo zgaku'i cu mlana ra .i lo fenki cu gl= eki le=20
nu lo zgaku'i cu mlana ra)

mu'omi'e .pier.
--=20
loi mintu se ckaji danlu cu jmaji

--
You received this message because you are subscribed to the Google Groups &= quot;lojban" group.
To unsubscribe from this group and stop receiving emails from it, send an e= mail to lojban+unsubscribe@googlegroups.com.
To post to this group, send email to lojban@googlegroups.com.
Visit this group at http:= //groups.google.com/group/lojban.
For more options, visit https://groups.google.com/groups/opt_out.
------=_Part_1510_5599420.1384736630229--