In cases where we're operating on text data (and not binary data),
use e.g. "\u{00A0}" to refer directly to the Unicode character
'NO-BREAK SPACE' instead of "\xc2\xa0" to specify the bytes C2h A0h
(which correspond to the UTF-8 encoding of that character). This
makes it easier to look up those mysterious sequences, as not all
are as recognizable as the no-break space.

PHP does not enforce this, but I think we should write the code points
in uppercase and zero-padded to at least four digits, like the
Unicode standard does.

Note that not all "\xNN" escapes can be automatically replaced:
* We can't use Unicode escapes for binary data that is not UTF-8
  (e.g. in code converting from legacy encodings or testing the
  handling of invalid UTF-8 byte sequences).
* '\xNN' escapes in regular expressions in single-quoted strings
  are actually handled by PCRE and have to be dealt with carefully
  (those regexps should probably be changed to use the /u modifier).
* "\xNN" referring to ASCII characters ("\x7F" and lower) should
  probably be left as-is.
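
The byte-level equivalence and the first exception in the list can be checked with a short sketch. It is written in Ruby only because Ruby string literals accept the same "\u{...}" and "\xNN" escapes; the variable names are illustrative and not taken from the changed code.

```ruby
# Both literals produce the same two bytes, C2h A0h.
nbsp_unicode = "\u{00A0}"
nbsp_bytes   = "\xC2\xA0".force_encoding('UTF-8')
puts nbsp_unicode == nbsp_bytes                                   # true
puts nbsp_unicode.bytes.map { |b| format('%02X', b) }.join(' ')  # C2 A0

# A lone A0h byte (the no-break space in windows-1251) is not valid
# UTF-8, so it has no "\u{...}" equivalent and must stay a "\xNN" escape.
legacy = "\xA0".force_encoding('UTF-8')
puts legacy.valid_encoding?                                       # false
```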

The replacements in this commit were done semi-manually by piping
the existing "\xNN" escapes through the following terrible Ruby
script I devised:

  chars = eval('"' + ARGV[0] + '"').force_encoding('utf-8')
  puts chars.split('').map{|char|
    '\\u{' + char.ord.to_s(16).upcase.rjust(4, '0') + '}'
  }.join('')

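
For comparison, the same conversion can be sketched without `eval`. This is not the script used for this commit: the function name `xnn_to_unicode_escapes` is hypothetical, and the sketch assumes the input consists only of "\xNN" escapes (any other text is ignored) that together form valid UTF-8.

```ruby
# Hypothetical helper, not part of the commit: turns a string of
# literal "\xNN" escapes into "\u{XXXX}" escapes.
def xnn_to_unicode_escapes(escaped)
  # Decode each \xNN pair into a raw byte instead of eval'ing the input.
  bytes = escaped.scan(/\\x(\h{2})/).flatten.map { |hex| hex.to_i(16) }
  text = bytes.pack('C*').force_encoding('UTF-8')
  # Binary data that is not UTF-8 has no code points to emit.
  raise ArgumentError, 'input is not valid UTF-8' unless text.valid_encoding?

  # Uppercase hex, zero-padded to at least four digits.
  text.each_char.map { |char| format('\\u{%04X}', char.ord) }.join
end

puts xnn_to_unicode_escapes('\xc2\xa0')   # \u{00A0}
```

Rejecting invalid UTF-8 up front makes the first exception in the list above explicit rather than silently producing wrong code points.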
Change-Id: Idc3dee3a7fb5ebfaef395754d8859b18f1f8769a
47 lines
1.5 KiB
PHP
<?php
/** Udmurt (удмурт)
 *
 * To improve a translation please visit https://translatewiki.net
 *
 * @ingroup Language
 * @file
 *
 * @author Andrewboltachev
 * @author Kaganer
 * @author Udmwiki
 * @author ОйЛ
 * @author לערי ריינהארט
 */

$fallback = 'ru';

$namespaceNames = [
	NS_MEDIA => 'Медиа',
	NS_SPECIAL => 'Панель',
	NS_TALK => 'Вераськон',
	NS_USER => 'Викиавтор',
	NS_USER_TALK => 'Викиавтор_сярысь_вераськон',
	NS_PROJECT_TALK => '$1_сярысь_вераськон',
	NS_FILE => 'Файл',
	NS_FILE_TALK => 'Файл_сярысь_вераськон',
	NS_MEDIAWIKI => 'MediaWiki',
	NS_MEDIAWIKI_TALK => 'MediaWiki_сярысь_вераськон',
	NS_TEMPLATE => 'Шаблон',
	NS_TEMPLATE_TALK => 'Шаблон_сярысь_вераськон',
	NS_HELP => 'Валэктон',
	NS_HELP_TALK => 'Валэктон_сярысь_вераськон',
	NS_CATEGORY => 'Категория',
	NS_CATEGORY_TALK => 'Категория_сярысь_вераськон',
];

$namespaceAliases = [
	'Суред' => NS_FILE,
	'Суред_сярысь_вераськон' => NS_FILE_TALK,
];

// Remove Russian aliases
$namespaceGenderAliases = [];

$linkTrail = '/^([a-zа-яёӝӟӥӧӵ]+)(.*)$/sDu';
$fallback8bitEncoding = 'windows-1251';
$separatorTransformTable = [ ',' => "\u{00A0}", '.' => ',' ];