Page 1 of 1

Lithuanian ISO to UTF8

Posted: Sun May 24, 2009 11:01 am
by Sonya
Sorry for English in this board. I can read Lithuanian a bit but not able to write it at all  ::)

I have a Lithuanian database that I have to transfer from cp1521(!) to utf8 :) Latin characters are not the problem, but special Lithuanian characters is something that has to be replaced in the database dump.

What I am looking for is the table like this http://forumtreff.pytalhost.de/viewtopic.php?p=147#p147 where the second and the fourth column for special characters to be replaced are shown. Example of Lithuanian phrases from database:
Jeigu norit raЕЎykit ДЇ skype maglytux ir prisistatykit....
Kas ДЇ 4tД… pamaina vaЕѕiuosite?
So I need to know what Lithuanian characters correspond to ЕЎ and ДЇ and 4tД in these example. I know that there are not so many special characters in Lithuanian, so that I can replace them manually in the sql file.

Thank you!

Re: Lithuanian ISO to UTF8

Posted: Sun May 24, 2009 11:10 am
by Augustas
As far as I understood you need matches only for the texts you have quoted.
Here it is:
raЕЎykit - rašykit
ДЇ skype - į skype
Kas ДЇ 4tД… pamaina vaЕѕiuosite? - [not sure but maybe:] Kas į 4tą pamaina važiuosite

Re: Lithuanian ISO to UTF8

Posted: Sun May 24, 2009 11:12 am
by Augustas
there are only 9 special characters ąčęėįšųūž - ĄČĘĖĮŠŲŪŽ

So if you could post (or send me personaly) more words from your database dump, I can try to gather all the spec chars.

Re: Lithuanian ISO to UTF8

Posted: Sun May 24, 2009 11:15 am
by Sonya
Augustas wrote: As far as I understood you need matches only for the texts you have quoted.
Here it is:
raЕЎykit - rašykit
ДЇ skype - į skype
Kas ДЇ 4tД… pamaina vaЕѕiuosite? - [not sure but maybe:] Kas į 4tą pamaina važiuosite
Thank you for you quick reply. The text quoted is just an example. I need full table of Lithuanian special characters to convert the whole database. It would be very helpful if you had a link to the table where all the characters are shown, if such table exists. Thank you again!

Re: Lithuanian ISO to UTF8

Posted: Sun May 24, 2009 11:20 am
by Sonya
Augustas wrote: there are only 9 special characters ąčęėįšųūž - ĄČĘĖĮŠŲŪŽ

So if you could post (or send me personaly) more words from your database dump, I can try to gather all the spec chars.
Thank you. I try to make the table here so that it can be used later if someone needs it:

ЕЎš
ДЇį
Д—ė
Д™ę
Д…ą
Е«ū
Еіų
„"
“"
ЕЅŽ
ДЌč




In the next post I give more examples from database and then edit this post to make the table complete :)

Re: Lithuanian ISO to UTF8

Posted: Sun May 24, 2009 11:21 am
by Augustas
to tell the truth I have never seen ЕЎ ДЇ ДЇ Д… characters as incorrect coding for Lithuanian letters.
I think those "hieroglyphs" can always be different depending on the source encoding, I guess.
For example in one of my database dumps I have lots of "Å" things.

So better if you send a bunch of words and I will check them to find all 9 letters and their matches to your hieroglyphs.

And I am not aware of such table online you can refer to.

Re: Lithuanian ISO to UTF8

Posted: Sun May 24, 2009 11:23 am
by Sonya
skipped

Re: Lithuanian ISO to UTF8

Posted: Sun May 24, 2009 11:26 am
by Augustas
Here is the correct coding for the text you have posted:

http://sigov.de/web/forum/index.php?act ... topic=30.0

Re: Lithuanian ISO to UTF8

Posted: Sun May 24, 2009 11:32 am
by Sonya
Augustas wrote: Here is the correct coding for the text you have posted:

http://sigov.de/web/forum/index.php?act ... topic=30.0
Thank you! I was not able to find the corresponding post in forum, since the search does not work on the site. Happy :)

Re: Lithuanian ISO to UTF8

Posted: Sun May 24, 2009 11:33 am
by Augustas
I used google for that :)

Re: Lithuanian ISO to UTF8

Posted: Sun May 24, 2009 11:46 am
by Sonya
Augustas wrote: I used google for that :)
You have replaced the special characters before searching. :) Thank you, I have replaced the characters in the text above and compared to the link you have found. It seems to be OK. Now I can search myself, since a lot of characters are replaced and it looks very Lithuanian now :)