View Issue Details

IDProjectCategoryView StatusLast Update
0001461Blogpublic2021-08-25 13:41
ReporterL10N Assigned To 
PrioritynormalSeveritytextReproducibilityalways
Status newResolutionopen 
PlatformIiggg 
Summary0001461: Hatchek, etc. not displayed
Descriptionzkouškou" is not displayed. Letters with hatchek or a cercle about an u are replaced with "?".
Steps To ReproducePut some czech text in the text window. Press the preview button or the publish button. Accents will be replaced by "?".
Additional InformationThe example shown in the uploaded pictures was written in LibreOffice, saved as png-picture and published as picture, not as text.
Tagsblog, unicode

Relationships

related to 0000769 needs workTed Main CAcert Website Client certificate broken with unicode 

Activities

L10N

2019-04-26 20:53

reporter  

CAcertCATSCzech.png (223,664 bytes)   
CAcertCATSCzech.png (223,664 bytes)   

jandd

2020-05-12 20:00

administrator   ~0005883

An idea: maybe the charset of some database tables is not utf8mb4. The database has been migrated from older versions of Wordpress. Needs to be checked by an admin (Dirk or me) we should also check whether wordpress sends the correct encoding in the Content-Type header.

egal

2020-05-16 20:35

administrator   ~0005885

The "older" tables are "latin1_swedish_ci":

| wp_commentmeta | utf8mb4_unicode_ci |
| wp_postmeta | latin1_swedish_ci |
| wp_terms | latin1_swedish_ci |
| wp_term_relationships | utf8mb4_unicode_ci |
| wp_usermeta | latin1_swedish_ci |
| wp_users | latin1_swedish_ci |
| wp_termmeta | utf8mb4_unicode_ci |
| wp_comments | latin1_swedish_ci |
| wp_posts | latin1_swedish_ci |
| wp_term_taxonomy | latin1_swedish_ci |
| wp_links | latin1_swedish_ci |
| wp_options | latin1_swedish_ci |

Wordpress itself uses utf-8:
define('DB_CHARSET', 'utf8');
define('DB_COLLATE', '');

as wordpress was updated in april 2020, please test again ...

L10N

2020-08-07 22:11

reporter   ~0005899

I tested with an article of the main page of cs.wikipedia.org (pic 1 Wikipedia) and copy pasted it into the blog (pic 2 Ad New Post), then clicked on preview (pic 3 lánek). The result is as follows: In Wikipedia all accents/hatcheks are displayed. In the "new post section", they are displayed as well - only in the title not. In the preview are still "?".

You can check in the 2nd line "Arpodvocu" (u with circle on it -> ?), "peceneszke" (c and e with hatchek -> ? [s and z with hatchek is displayed]),

Issue History

Date Modified Username Field Change
2019-04-26 20:53 L10N New Issue
2019-04-26 20:53 L10N Tag Attached: blog
2019-04-26 20:53 L10N Tag Attached: unicode
2019-04-26 20:53 L10N File Added: CAcertCATSCzech.png
2020-05-12 18:26 Adakah Priority low => normal
2020-05-12 18:26 Adakah Status new => needs feedback
2020-05-12 18:26 Adakah Category website content =>
2020-05-12 18:26 Adakah Platform => Iiggg
2020-05-12 18:26 Adakah Summary Blog does not support Hacek accent => Hhhj
2020-05-12 18:26 Adakah Description Updated
2020-05-12 20:00 jandd Note Added: 0005883
2020-05-16 20:35 egal Note Added: 0005885
2020-08-07 22:11 L10N Note Added: 0005899
2020-08-07 22:11 L10N File Added: Screenshot_2020-08-08 Wikipedie, otevřená encyklopedie.png
2020-08-07 22:11 L10N File Added: Screenshot_2020-08-08 Add New Post ‹ CAcert Blog — WordPress.png
2020-08-07 22:11 L10N File Added: Screenshot_2020-08-08 lánek týdne.png
2020-08-07 22:11 L10N Status needs feedback => new
2020-09-16 21:25 L10N Summary Hhhj => Hatchek, etc. not displayed
2020-09-16 21:25 L10N Description Updated
2021-08-25 13:41 bdmc Relationship added related to 0000769