[Evangelism] Help to improve text search for East-Asia languages

Xavier Heymans xavier at zeapartners.org
Thu Nov 6 12:22:16 UTC 2008


Hi,

You are welcome to join the new Plone Asia Pacific user group mailing  
list:
http://lists.plone.org/mailman/listinfo/plone-asiapacific

An official announcement will follow as soon more information will be  
published on the group webpage:
http://plone.org/countries/asia-pacific

Xavier


On 06 Nov 2008, at 05:33, Takeshi Yamamoto wrote:

> Let me post this other than Plone-AsiaPacific ML for the people
> who uses Plone with non-English/Latin languages.
> Some languages need to be handled differently for better text  
> searching.
>
> Help to improve text search for East-Asia languages
>
> Let me post the first initiative (requesting help in other word)  
> for Asia Pacific area.
> Some of you may know Japanese Plone community is working on improving
> text search feature of plone for East Asian languages.  For  
> example, Japanese
> words can not be distinguished by space, as well as Chinese and  
> Korean languages.
> Mr. Terada, CEO of CMSCOM has stood up and worked on google summer  
> of code
> as one of Plone foundation-supported project this year.   
> Unfortunately, the student
> has gave up and it was not complete.  Terada-san has decided to  
> make it completed
> and started it again as his company's project.  Since that feature  
> is valuable for many
> people(1.5 billion people are living in Kanji region), and it is  
> open source, and
> we hope it could be built into out-of-the-box Plone, Japanese  
> community  is
> supporting this project.  We will have a sprint event for this in  
> the World Plone Day 2008 Tokyo.
>
> The software current status is BETA version and you can download  
> and try, or
> just access to the test and play with it.  We appreciate any of  
> your bug report
> or suggestions.  We do not have enough testers for "non-Japanese"  
> languages.
>
> Languages what we would like to cover with that bigramsplitter are:
>
> Japanese
> Mandarin Chinese (Beijing)
> Cantonese (Canton)
> Taiwanese (Taiwan)
> Korean (Korea)
> Mongolian (Mongol)
> Thai (Thailand)
> Vietnamese (Viet Nam)
> Jawi (Malaysia)
> Bahasa Indonesia (Indonesia)
> Hebrew (Israel)
> Arabic (Middle-East)
> etc.
>
> The languages which are not used in Asia, but different from  
> English/Latin
> languages are welcome, of course.
>
> The project site is here:
> http://code.google.com/p/bigramsplitter/
>
> You can download the code from here:
> http://code.google.com/p/bigramsplitter/downloads/list
>
> The test site is here to play with.
> http://c2search.cmscom.jp/
>
> You may need an account to put some text to be searched in your own  
> language.
> Request your login account here.
> http://c2search.cmscom.jp/contact-info
>
> Sorry for the test site is not well internationalized, but there is  
> no problem if you
> write your request in English.
>
> Thanks a lot in advance.
> Takeshi Yamamoto / retsu
>
> _______________________________________________
> Evangelism mailing list
> Evangelism at lists.plone.org
> http://lists.plone.org/mailman/listinfo/evangelism
>





More information about the Evangelism mailing list