CMS MADE SIMPLE FORGE

CMS Made Simple Core

 

[#11456] Bad encoding UTF8 in WYSIVYG editor, words are'nt searchable

avatar
Created By: Petr N. (esmiran)
Date Submitted: Tue Jul 04 05:17:39 -0400 2017

Assigned To:
Version: 2.2.1
CMSMS Version: 2.2.1
Severity: Major
Resolution: Fixed
State: Closed
Summary:
Bad encoding UTF8 in WYSIVYG editor, words are'nt searchable
Detailed Description:
Some UTF8 characters are misspelled in the WYSIVYG editor for entities. Although
the characters appear correctly in the result, but the words with these
characters are not searchable.  Words are badly indexed by the Search module,
because they contain entities.

I found some characters that are changed by the WYSIVYG editor to entities:
- š - š
- ý - ý
- á - á
- í - í
- é - é
- ú - ú
- ó - ó
and so similar capitals...

For example: if the Czech word "vývoj" is to be indexed, it changes to the
"vývoj" and Search module indexes it as two words:
1. vý
2. voj
This way they can not be found at all.


History

Comments
avatar
Date: 2017-07-04 12:09
Posted By: Robert Campbell (calguy1000)

Fixed in svn for 2.2.2

      
avatar
Date: 2017-07-12 04:23
Posted By: Petr N. (esmiran)

Fixed in the Search Module, but the character encoding error, as I wrote above,
still remained in the MicroTiny module.
      
Updates

Updated: 2017-07-08 18:35
state: Open => Closed

Updated: 2017-07-04 12:09
resolution_id: => 7