Thijs/wiki.techinc.nl

Author	SHA1	Message	Date
C. Scott Ananian	103a4f76dc	Deprecate $wgFixArabicUnicode / $wgFixMalayalamUnicode These were introduced in MW 1.17 and are always true in production. They were useful to allow folks to defer title conversion, but it's been a long time now. We don't need to make this optional any more. Change-Id: I65dcfe80dc3e1dfeb4d63924a8928655e012a20c	2018-10-21 21:55:39 -04:00
jenkins-bot	690f563edc	Merge "Accept BCP 47 codes as aliases for nonstandard variants"	2018-10-11 20:46:42 +00:00
jenkins-bot	64ef09d6a8	Merge "Ensure LanguageCode::bcp47() returns a valid BCP 47 language code"	2018-10-11 20:46:35 +00:00
C. Scott Ananian	d59f27aeab	Accept BCP 47 codes as aliases for nonstandard variants The browser Accept-Language header uses BCP 47 codes, which don't precisely match our internal mediawiki variant names in a number of places. Allow proper BCP 47 codes to alias our internal variants for: Accept-Language parsing, URL parsing, user preferences, and explicit enumeration of codes in LanguageConverter rules. This is a replay of an earlier merged patch, `0818070c59`, which had to be reverted because it was based on `8380f0173e` which caused regressions in the Babel extension (T199941). Change-Id: Ica89d9547c58967747ab0fa15d4e83be5378796d	2018-10-11 02:23:20 -04:00
C. Scott Ananian	21ead7a98d	Ensure LanguageCode::bcp47() returns a valid BCP 47 language code MediaWiki uses a number of nonstandard codes which do not validate according to the IANA language subtag registry. Some of them have the wrong semantics entirely: MediaWiki's `sr-ec` variant maps to BCP 47 `sr-EC` which is "Serbian as used in Ethiopia" (!). Extend LanguageCode::bcp47() to map our nonstandard codes to valid BCP 47 language codes. Export the mapping so that it can be used in JavaScript's corresponding mw.language.bcp47() implementation as well, and return the standard BCP 47 codes in the siteinfo API. Thanks to TheDJ (I10b4473c7e53f027812bbccf26bb47aec15fddfd) and Fomafix (I93efc190714ba76247d30ba49fc21ae872fc3555) for previous attempts at this! Also removed a fixme for the name of 'Twi', dating back to 2004 (`f59c3be23b`) -- checking tw.wikipedia.org it certainly appears that the autonym of 'Twi' is correctly 'Twi'. Tracking bugs for invalid language codes are T125073 and T145535. Discussion of zh-XX => zh-HanX-XX mapping is at T198419. This is a replay of an earlier merged patch, `8380f0173e`, which had to be reverted because it caused regressions in the Babel extension (T199941). Bug: T34483 Bug: T106367 Bug: T120847 Depends-On: I27a5b8e45b34c6b57c1b612b11548001c88cd483 Change-Id: Iebbc604af21d7f2af9c1f1ab2574cb5f309bf6ed	2018-10-11 01:53:54 -04:00
Kunal Mehta	a4e8bea57d	tests: Add helper function for ini_set with automatic cleanup Some tests need to change the value of an ini setting, and typically implement cleanup handling themselves, usually imperfectly. Provide a helper function, $this->setIniSetting(), which will take care of teardown in the same way that $this->setMwGlobals() does. Change-Id: I7be4198592f0aaf73a28d3c60acb307a918b1a1f	2018-10-10 22:31:37 -07:00
Fomafix	5632815976	Write Latin and other scripts with capital letter Change-Id: I16c660e54191b63cd6eb3407cb00504665930c4e	2018-10-05 18:49:08 +02:00
Fomafix	50944a1410	Deprecate Language::setCode as public method setCode changes the language code for the Language object but it also replaces the whole language codes for all Language objects. > $lang = Language::factory( 'fr' ) > $lang2 = Language::factory( 'fr' ) > $lang->setCode( 'it' ) > print $lang2->getCode() it > $lang3 = Language::factory( 'fr' ) > print $lang3->getCode() it Better assign a new Language object. Also add more tests for Language::equals. Depends-On: I61439bac82021344c3f9a6056cccd937b3450af2 Depends-On: I2d9e551d6eb33f28f42aeaf48160eba21b83881f Change-Id: I201b479f58e63c9c40fb8a3ec9575a551fb35235	2018-10-02 23:48:53 -07:00
Timo Tijhof	dbe89abb9e	languages: Add coverage for 'ar' and 'ml' normalize() * Exclude the data files from PHPUnit coverage. * Add tests covering the normalize() implementations. * Fix a small todo about using data providers. * Set explicit visibility. Change-Id: Ib104cc3215a36901cff853ad5969d92a6e0cf6a0	2018-08-14 23:19:35 +00:00
Aryeh Gregor	90d4f56fe4	Mass conversion of $wgContLang to service Brought to you by vim macros. Bug: T200246 Change-Id: I79e919f4553e3bd3eb714073fed7a43051b4fb2a	2018-08-11 22:44:29 -06:00
Aryeh Gregor	63d7f2ad13	Automatically reset namespace caches when needed This avoids error-prone code written separately in every test. In addition to no existing tests resetting the TitleFormatter (more services probably need to be reset as well), they mostly reset only the namespace cache on $wgContLang, which wouldn't help for any other language. The parser test runner still doesn't do this, but maybe it should. Change-Id: I44b7a1aec48f14b0950907fa14bd0df80f674296	2018-08-01 16:30:08 +03:00
Aryeh Gregor	355e21590a	Use setContentLang() instead of setMwGlobals() This changes behavior in some tests by making them set $wgLanguageCode as well as $wgContLang, but that seems like a good thing. Bug: T200246 Change-Id: I936888f46ff9fefe2707efba837e2ce3a7ca5e3f	2018-07-26 11:35:58 +00:00
Greg Grossmeier	b302b0cd1c	Revert "Ensure LanguageCode::bcp47() returns a valid BCP 47 language code" This reverts commit `8380f0173e`. Reason for revert: Caused T199941 Bug: T199941 Change-Id: I93af756a2d70d6bc91f828fe6ac19bf10ca8788f	2018-07-23 17:27:23 +00:00
Greg Grossmeier	dc282a46d7	Revert "Accept BCP 47 codes as aliases for nonstandard variants" This reverts commit `0818070c59`. Reason for revert: Caused T199941 Bug: T199941 Change-Id: I24c178eb33890477de79cbb3122861c140578011	2018-07-23 16:44:55 +00:00
C. Scott Ananian	0818070c59	Accept BCP 47 codes as aliases for nonstandard variants The browser Accept-Language header uses BCP 47 codes, which don't precisely match our internal mediawiki variant names in a number of places. Allow proper BCP 47 codes to alias our internal variants for: Accept-Language parsing, URL parsing, user preferences, and explicit enumeration of codes in LanguageConverter rules. Change-Id: I8468a56d5b88f5786abd0a17b67bda2f1687fd0c	2018-07-13 17:43:20 -04:00
C. Scott Ananian	8380f0173e	Ensure LanguageCode::bcp47() returns a valid BCP 47 language code MediaWiki uses a number of nonstandard codes which do not validate according to the IANA language subtag registry. Some of them have the wrong semantics entirely: MediaWiki's `sr-ec` variant maps to BCP 47 `sr-EC` which is "Serbian as used in Ethiopia" (!). Extend LanguageCode::bcp47() to map our nonstandard codes to valid BCP 47 language codes. Export the mapping so that it can be used in JavaScript's corresponding mw.language.bcp47() implementation as well. Thanks to TheDJ (I10b4473c7e53f027812bbccf26bb47aec15fddfd) and Fomafix (I93efc190714ba76247d30ba49fc21ae872fc3555) for previous attempts at this! Also removed a fixme for the name of 'Twi', dating back to 2004 (`f59c3be23b`) -- checking tw.wikipedia.org it certainly appears that the autonym of 'Twi' is correctly 'Twi'. Tracking bugs for invalid language codes are T125073 and T145535. Discussion of zh-XX => zh-HanX-XX mapping is at T198419. Bug: T34483 Bug: T106367 Bug: T120847 Change-Id: I807dd55d49e9bd19443329231326a5b0d3e6c453	2018-07-13 14:56:18 -04:00
jenkins-bot	8c96aec32c	Merge "Fix the bug for dates between 1912 and 1941 in Thai language"	2018-07-10 08:55:56 +00:00
Kunal Mehta	4acb7ed51c	Add @coversNothing to tests that don't cover specific PHP classes Change-Id: Idbd364561bc28547e9fac20d7a80b9a44edf14a9	2018-06-12 13:27:40 -07:00
jenkins-bot	e602b197ab	Merge "(y)etsin fixes, test refactoring, and misc fixes"	2018-06-08 20:46:12 +00:00
Bartosz Dziewoński	0313128b10	Use PHP 7 "\u{NNNN}" Unicode codepoint escapes in string literals In cases where we're operating on text data (and not binary data), use e.g. "\u{00A0}" to refer directly to the Unicode character 'NO-BREAK SPACE' instead of "\xc2\xa0" to specify the bytes C2h A0h (which correspond to the UTF-8 encoding of that character). This makes it easier to look up those mysterious sequences, as not all are as recognizable as the no-break space. This is not enforced by PHP, but I think we should write those in uppercase and zero-padded to at least four characters, like the Unicode standard does. Note that not all "\xNN" escapes can be automatically replaced: * We can't use Unicode escapes for binary data that is not UTF-8 (e.g. in code converting from legacy encodings or testing the handling of invalid UTF-8 byte sequences). * '\xNN' escapes in regular expressions in single-quoted strings are actually handled by PCRE and have to be dealt with carefully (those regexps should probably be changed to use the /u modifier). * "\xNN" referring to ASCII characters ("\x7F" and lower) should probably be left as-is. The replacements in this commit were done semi-manually by piping the existing "\xNN" escapes through the following terrible Ruby script I devised: chars = eval('"' + ARGV[0] + '"').force_encoding('utf-8') puts chars.split('').map{|char| '\\u{' + char.ord.to_s(16).upcase.rjust(4, '0') + '}' }.join('') Change-Id: Idc3dee3a7fb5ebfaef395754d8859b18f1f8769a	2018-06-04 16:20:13 +00:00
Bartosz Dziewoński	4fd27f006f	Use PHP 5.6 '**' operator instead of 'pow()' function Change-Id: Ieb22e1dbfcffaa4e7b3dcfabbcc999e5dd59a4bf	2018-05-30 18:05:19 -07:00
tjones	669d1ed192	(y)etsin fixes, test refactoring, and misc fixes * Fix etsin/етсин/этсин as noted in If933fc67845ac994d9ddfdf8349aff445ec9b13a ** only convert tsin to тсин and let the other rules sort out the e * Refactor most tests to be word-specific, which uncovered a couple of bugs in corner cases ** rol/üst prefix matches should match whole words (original [^ü] regex assumed word could not be end of string) * Fixed incidental bugs I noticed while looking into the items above куркчи => kürkçi was in the wrong section cönk => джонк was in the right section, but reversed * Added additional test cases for all of the above. Change-Id: Ia96be488a7b41c3ddba623b5c9262703b1c82687	2018-05-29 14:30:04 -04:00
tjones	cbb07cdc33	Crimean Tatar/crh transliteration odds and ends * refactor '\b' into WB const to make it easy to update in the future * add new ц-related exceptions Bug: T193764 Change-Id: Ib707136f8f2598d1f8ec995bf129b436dfb53cd9	2018-05-22 14:59:55 -04:00
C. Scott Ananian	685eba4360	Minor fixes to CRH language conversion. * Move a many-to-one mapping from the L2C to the C2L table where it belongs. * Fix some regular expression patterns which ended up with misnumbered replacement strings. * All regular expressions should have the `u` (unicode) flag set. * Typo/spelling fixes in comments Change-Id: If933fc67845ac994d9ddfdf8349aff445ec9b13a	2018-05-12 14:37:09 -04:00
superyetkin	3aaa2367b2	Fix the bug for dates between 1912 and 1941 in Thai language Added an if-else block to see if the parameters passed to the function designate a year between 1912 and 1941 or not. Resulting month values are also adjusted. Added a unit test for the related formatting. Bug: T68648 Change-Id: Ic676b5c140de8878971a786a1a1811770a848016	2018-05-12 15:10:13 +00:00
tjones	14f8dc35db	CRH Transliteration Pattern Matching Fixes Refactor to match exceptions as patterns, not words - break exception list to C2L and L2C pattern sets - change main loop to break only on Roman numerals and transliterate everything else, rather than tokenizing on single-script words (this fixes the km² problem, too) - update word anchors from ^ and $ to \b - only process Roman numerals for L2C translit - add exception for single "Roman" character followed by a period which looks like an initial - consolidate multi-step transliteration into regsConverter() - remove regex support from main exception list to support strtr() - re-organize some prefix/suffix/whole word patterns to the right place - add tests for recently fixed use cases - add support for many-to-one mappings in both directions - update character classes, exception lists, and regexes based on speaker feedback and example texts Misc other fixes: - fix some character classes errors - remove unneeded character classes - add tests for Roman numerals and quotes - add tests for affixes and regexes Bug: T188321 Bug: T189512 Change-Id: I056d36ff2b8f63b3998a5d3a442d8d539c15488d	2018-04-27 19:17:51 -04:00
jenkins-bot	a6abe2ad7a	Merge "Add Russian grammar forms to support Wikiversity"	2018-03-14 08:37:27 +00:00
jenkins-bot	3c198b9dc8	Merge "Fix table loading bug for CRH transliteration"	2018-02-28 21:09:01 +00:00
tjones	70dede013c	Fix table loading bug for CRH transliteration In production, the regex and exception tables were not being loaded, resulting in very poor transliteration. The loading has been moved to the constructor, similar to the implementation of the Kazakh transliteration. Also, a bug in the mappings for Ö/ö -> Ё/ё and Ü/ü -> Ю/ю has been fixed. Test cases for specific additional examples have been added. (Though it is worth noting that the regex and exception tables did load properly during unit testing, so the problem wasn't caught there.) Bug: T186727 Change-Id: I6bacee7d9de6f4a870a8a9ef1f04b819ad489c02	2018-02-26 13:22:04 -05:00
Amire80	398e2a7c9d	Add Russian grammar forms to support Wikiversity Change-Id: I70fcb03db62307116ec96d4c242e6796534b57a1	2018-02-26 14:18:01 +02:00
Fomafix	7855ec8385	SpecialPageAliasTest: Fix arguments of Language::fetchLanguageNames Language::fetchLanguageNames( 'mwfile' ) means all languages with the default filter 'mw' and names in the language 'mwfile'. Language::fetchLanguageNames( null, 'mwfile' ) means all languages with the filter 'mwfile' and names in the default language. This change removes the test for the language codes: * aa * als * bat-smg * be-x-old * cho * fiu-vro * ho * hz * kj * kr * mh * mus * ng * no * rn * roa-rup * shi-latn * shi-tfng * simple * tum * uz-cyrl * uz-latn * zh-classical * zh-min-nan * zh-yue Change-Id: I7266a67e37862daf863d1565d84cfeebaf5cb680	2018-02-25 13:31:43 +01:00
jenkins-bot	e46d0694ac	Merge "Truncate tag filter descriptions"	2018-02-21 12:52:23 +00:00
Umherirrender	63d96c15fd	build: Updating mediawiki/mediawiki-codesniffer to 16.0.0 Change-Id: I59b59f79bbf3ce4feff3b3a20c1c31bc16370531	2018-02-17 13:29:13 +01:00
petarpetkovic	2d2575852c	Truncate tag filter descriptions Introduce truncateInternal() method in Language class, based on existing truncate() method. New method abstracts string truncation, allowing users to specify callable functions for text length measurement and string truncation. New method, truncateInternal(), is used to provide two options for text truncation: * For DB usage: truncateForDatabase() method is truncating text by number of bytes. * For UI usage: truncateForVisual() method is truncating text by number of characters, using multibyte string PHP methods. Old truncate() method is deprecated and just returns the results of truncateForDatabase() method. Newly introduced truncateForVisual() method is used for truncation of long tag descriptions in RCFilters menu. Bug: T179626 Change-Id: Ib01a8c303304064dde3ce983b817d93a88a5affd	2018-02-09 22:45:20 +01:00
Timo Tijhof	bee9f4db96	Remove various redundant '@license' tags in file headers Redundant given this is the project-wide license already, especially in file headers that already include the GPL license header. This and other minor fixups based on feedback from Ie0cea0ef5027c7e5. * Add @file where missing. * Move @ingroup and @deprecated from file to class doc where needed. Change-Id: I7067abb7abee1f0c238cb2536e16192e946d8daa	2018-01-12 18:15:11 +00:00
Bartosz Dziewoński	eb6bb6b7b9	Generalize non-digit-grouping of four-digit numbers In some languages it's conventional not to insert a thousands separator in numbers that are four digits long (1000-9999). Rather than copy-paste the custom code to do this between 13 files, introduce another option and have the base Language class handle it. This also fixes an issue in several languages where this logic previously would not work for negative or fractional numbers. To implement this, a new option is added to MessagesXx.php files, `$minimumGroupingDigits = 2;`, with the meaning as defined in <http://unicode.org/reports/tr35/tr35-numbers.html>. It is a little roundabout, but it could allow us to migrate the number formatting (currently all custom code) to some generic library easily. Bug: T177846 Change-Id: Iedd8de5648cf2de1c94044918626de2f96365d48	2018-01-02 11:17:25 +01:00
Umherirrender	255d76f2a1	build: Updating mediawiki/mediawiki-codesniffer to 15.0.0 Clean up use of @codingStandardsIgnore - @codingStandardsIgnoreFile -> phpcs:ignoreFile - @codingStandardsIgnoreLine -> phpcs:ignore - @codingStandardsIgnoreStart -> phpcs:disable - @codingStandardsIgnoreEnd -> phpcs:enable For phpcs:disable always the necessary sniffs are provided. Some start/end pairs are changed to line ignore Change-Id: I92ef235849bcc349c69e53504e664a155dd162c8	2018-01-01 14:10:16 +01:00
Kunal Mehta	75160bdd3b	Use MediaWikiCoversValidator for tests that don't use MediaWikiTestCase Change-Id: I8c4de7e9c72c9969088666007b54c6fd23f6cc13	2018-01-01 08:28:02 +00:00
Kunal Mehta	fc23633035	Add @covers tags to languages tests I removed comments that merely repeated the location of the class being tested. There are other tests in this directory that don't have a corresponding class and need further investigation. Change-Id: Ic16f0887b5030ac53fab4382cfaedfb5426cdb08	2017-12-28 08:52:56 +00:00
Sam Wilson	313675320f	Always return a string from Language::formatNum() It says it returns a string, and so it should. Bug: T182277 Change-Id: Ic68c65c634c2557a1d07281623cd6c971b000323	2017-12-07 13:59:56 +08:00
tjones	a0b511319c	Crimean Tatar Transliteration This is a first pass at Latin/Cyrillic transliteration for Crimean Tatar (crh). Includes transliteration tables, prefix/suffix mappings, regex mappings, and exceptions lists for words and abbreviations. Regularize CRH language name in messages/* files. Fix "varient" typos in qqq.json. Add unit tests for CRH transliteration. Bug: T23582 Change-Id: I424703f99adf837f6217872b882d1ea26bfdd068	2017-11-20 16:56:38 -05:00
Reedy	f600b4ede9	Fix phpcs issues from LanguageConverter patches Change-Id: I34e57c90ffd40fbd9f8afe3c57dd73fa7f655841	2017-11-15 03:37:27 +00:00
Brian Wolff	fbe78cfa09	SECURITY: XSS in langconverter when regex hits pcre.backtrack_limit Adjust regexes for what not to convert to avoid backtracking by preferring possessive quantifiers. Add check that we really have matched to the end of the string, and log error if the regex hits some sort of error preventing the entire string from being matched. Should the regex not match to the end, then language conversion is disabled for the string. Bug: T124404 Change-Id: I4f0c171c7da804e9c1508ef1f59556665a318f6a	2017-11-15 03:33:03 +00:00
Thiemo Mättig	1f2ff32cca	Family name of Thiemo changed Change-Id: I5477d02111e53790e858624c4b7c4f09dbc418fa	2017-11-14 13:59:15 +01:00
zoranzoki21	f0828ff475	Removed Toki Pona localization files Bug: T132899 Bug: T178730 Change-Id: I4c61b3ef42cdc24fee74587965240ca08242867e	2017-10-24 21:27:47 +00:00
Bartosz Dziewoński	3f62813c51	Add test cases for digit grouping (commafy) in Polish According to the typographical convention, a thousands separator should not be inserted in numbers that are four digits long (between 1000 and 9999), unlike in English where it's usually acceptable. This logic is currently implemented in LanguagePl::commafy(). Bug: T177846 Change-Id: I6dbd8febcf59000067cdd7d3c11111f2f77f4e66	2017-10-10 22:52:11 +02:00
Fomafix	ea0bd74a94	Refactor global function wfBCP47 to static function LanguageCode::bcp47 Deprecate global function wfBCP47. Change-Id: Ie6bb061b5d6ca67289bb18bc468a87421f38fc94	2017-10-05 09:54:45 +02:00
Fomafix	55ecf3e215	Add new static function LanguageCode::replaceDeprecatedCodes Refactor the deprecatedLanguageCodeMapping to a private variable. Change-Id: I5f8e601e53de183e6268c9ef601eef8390b725cd	2017-08-10 15:21:59 -04:00
Liangent	d8375bee24	New language variant 'en-x-piglatin' for easier variant testing Guarded by the $wgUsePigLatinVariant variable, off by default. Pig Latin is a language game where words in English are altered according to the following rules: * Words starting with a vowel have a '-way' suffix appended. * Words starting with a consonant have the initial consonants (or 'qu' group) moved to the end and an '-ay' suffix appended. https://en.wikipedia.org/wiki/Pig_Latin * Added 'en-x-piglatin' as a language name. * Added 'en' to LanguageConverter::$languagesWithVariants. * Added LanguageEn class and its corresponding EnConverter which provides one-way translation from English to Pig Latin. * Some minor internal changes in code that assumed that English doesn't have a language class or converter. Bug: T45547 Depends-On: I1d9691c784032669979f8109c9a5f65cbf4122c9 Change-Id: I7fa2d85d6364958c5138366e8b4504a2697a8731	2017-06-12 16:59:57 -04:00
jenkins-bot	bdfa96eb72	Merge "Break up $wgDummyLanguageCodes"	2017-03-08 20:46:47 +00:00
This, that and the other	48ab87d0a3	Break up $wgDummyLanguageCodes $wgDummyLanguageCodes is a set and mapping of different language codes: * Renamed language codes: ['als' => 'gsw', 'bat-smg' => 'sgs', 'be-x-old' => 'be-tarask', 'fiu-vro' => 'vro', 'roa-rup' => 'rup', 'zh-classical' => 'lzh', 'zh-min-nan' => 'nan', 'zh-yue' => 'yue']. The old language codes are deprecated because they are invalid but should be supported for compatibility reasons for a while. * Language codes of macro languages, which get mapped to the main language: ['bh' => 'bho', 'no' => 'nb']. * Language variants which get mapped to main language: ['simple' => 'en']. * Internal language codes of the private-use-area which get mapped to itself: ['qqq' => 'qqq', 'qqx' => 'qqx'] This is a very strange conglomeration which should get differentiated, and was split up in the following ways: * Renamed language codes are available from LanguageCode::getDeprecatedCodeMapping(). * Language codes of macro languages and the variants that are mapped to the main language are available as $wgExtraLanguageCodes and are set in DefaultSettings.php. * Internal language codes are set in $wgDummyLanguageCodes in Setup.php. Change-Id: If73c74ee87d8235381449cab7dcd9f46b0f23590	2017-03-08 12:11:30 -08:00
James D. Forrester	1e9c361960	tests: Replace implicit Bugzilla bug numbers with Phab ones It's unreasonable to expect newbies to know that "bug 12345" means "Task T14345" except where it doesn't, so let's just standardise on the real numbers. Change-Id: I46261416f7603558dceb76ebe695a5cac274e417	2017-02-21 02:14:34 +00:00
Zhuyifei1999	0effd172ce	translateBlockExpiry: Duration is block expiry minus current time For relative timestamps in $str, strtotime( $str, $now ) returns an absolute Unix timestamp $str since $now, and this timestamp is given to $time. However, Language::formatDuration expects a time duration, not an absolute timestamp. We obtain this duration from the difference between $time, the absolute timestamp of block expiry, and $now, the absolute timestamp of the time in which the block action happened. Tests have been added to test both this patch and `01936fa`, the patch that caused this regression. Bug: T156453 Change-Id: I6fd8c02dc3c6456067fe25cb9f33f5b4c78332aa	2017-01-28 07:22:00 +00:00
Amir E. Aharoni	6b03e2e88e	Make the code for grammar data processing common This makes the code for processing JSON files with grammar transformations reusable by different languages and applies the same logic to Russian and Hebrew. It will be done to other languages in further patches. This patch is not supposed to change any functionality, and the tests are intact (except a comment in the test for Hebrew - the class doesn't exist any longer). PHP: * Move the JSON grammar transformation data processing logic from LanguageRu.php to convertGrammar() in Language.php. By default all these data files are supposed to be processed identically, so the code should be common. If there is no JSON data file, nothing new happens. * LanguageRu's own convertGrammar() method is removed. * The LanguageHe class is removed, now that all its functionality is handled by generic JSON data processing in the Language class. LanguageHe.php file is removed from the repo and from autoloading. JavaScript: * Move the JSON grammar transformation data processing logic from ru.js to mediawiki.language.js. * JavaScript grammar code files he.js and ru.js are removed from the repo and from Resources.php, because all the data is in JSON, and the default logic in mediawiki.language.js works for both languages. Bug: T115217 Change-Id: I5e75467121c3d791bb84f9e6fdfcf07c1840f81a	2016-12-16 15:52:14 +02:00
Fomafix	7de07e8991	Update weblinks in comments from HTTP to HTTPS Use HTTPS instead of HTTP where the HTTP link is a redirect to the HTTPS link. Change-Id: I06d9e043730accc4ae71b927e0f8229f0fc3b340	2016-10-11 17:25:10 +00:00
Marius Hoch	9ca0f6c620	Only attempt to calculate the TTL in Language::sprintfDate if needed Change-Id: Ifd24c9206be05bb4fd2277efc574c9d1018e1957	2016-06-23 12:36:25 +02:00
daniel	bbd518baff	add LanguageTest::testEquals for Id7ed6a21c Change-Id: I99ea4c51bfc5245eab0bcca73870c56a6fab2c43	2016-05-23 16:45:06 +02:00
Reedy	83fb19cb13	Swap the rest of array() -> [] Change-Id: I76a7259ed952a0673a1941f08b39b545211fba07	2016-03-30 22:04:58 +00:00
Reedy	b5656b6953	Many more function case mismatches Change-Id: I5d3a5eb8adea1ecbf136415bb9fd7a162633ccca	2016-03-19 00:20:58 +00:00
Timo Tijhof	46b04ec7ae	Use static::class instead of get_called_class() Available as of PHP 5.5 and more idiomatic. Foo::class (explicit), self::class (defined), and static::class (late bound). Change-Id: I66937f32095a4e4ecde94ca20a935a3c3efc9cee	2016-02-29 22:43:58 +00:00
Kunal Mehta	6e9b4f0e9c	Convert all array() syntax to [] Per wikitech-l consensus: https://lists.wikimedia.org/pipermail/wikitech-l/2016-February/084821.html Notes: * Disabled CallTimePassByReference due to false positives (T127163) Change-Id: I2c8ce713ce6600a0bb7bf67537c87044c7a45c4b	2016-02-17 01:33:00 -08:00
Tim Starling	f0ba7a69a1	Add tests for LanguageConverter classes that didn't have them Some of them don't have many test cases, or have test cases that don't represent the ideal transliteration and so are subject to change. But this is better than nothing. Change-Id: I4aae693bd77d9ff365f48113923ed7f9fed8d668	2016-02-08 09:19:25 +11:00
Timo Tijhof	3b35719e74	tests: Remove unused $wgMemc resets If we really need this we can do it in MediaWikiTestCase, next to the setting of wgMainCacheType. But from what I can see the code being tested here already doesn't use the old $wgMemc. Change-Id: I9e4b2109b2f3c18d8d5551bbadae5711c1d4c0a6	2015-12-06 18:06:08 +00:00
Roan Kattouw	e4d6238c00	Language::truncate(): don't chop up multibyte characters when input contains newlines To detect whether the truncation had chopped up a multibyte character after the first byte, a regex was used. But in this regex, the dot (.) didn't match newlines, so it failed to detect chopped multibyte characters (after the first byte) if there was a newline preceding the chopped character. Bug: T116693 Change-Id: I66e4fd451acac0a1019da7060d5a37d70963a15a	2015-10-26 20:17:37 -07:00
jenkins-bot	88081365b3	Merge "Add new grammar forms for language names in Russian"	2015-09-28 13:41:33 +00:00
Amir E. Aharoni	8b0c0b49ce	Add new grammar forms for language names in Russian CLDR provides translated language names. They are useful for showing names by themselves in menus and lists, but it's often problematic to add them to Russian sentences, because they need to be declined, so a message like "This page is not available in the $1 language" is hard to localize. This patch adds new cases for Russian - "languagegen", "languageprep" and "languageadverb". (The last one, as its name says, is not actually a grammatical case, but a transformation to an adverbial expression.) This covers most of the needs for language names that MediaWiki supports. Change-Id: Ib6a0afa5c3736f8b9b2e121cd752c53ee50fad75	2015-09-28 15:51:24 +03:00
Amir E. Aharoni	b175f585db	Update Ukrainian grammar rules and tests * Fix the '-ти' rule to match the name of Wikiquote. * Add tests for '-ти' and '-ник' rules. * Remove the '-ь' and '-ка' rules, which were copied from Russian and are not used in Ukrainian, and remove their tests as well. * Remove non-implemented ("stub") cases. * Cleanup the code of commafy(). Change-Id: I98647ceb8806d845f3c8150b92a5d9f7fe5866f2	2015-09-27 15:21:49 +03:00
Amir E. Aharoni	5ccbaf2c48	Update grammar rules and test for Ukrainian The grammar rules for Ukrainian have several mistakes. This is the first in a series of commits that fix this. * Add grammar tests for PHP. There weren't any tests at all, and now there are some. No tests are added for rules that are wrong and irrelevant and will be removed in subsequent commits. * Add tests for JavaScript, and update a grammar rule that was incorrectly copied from Russian. Change-Id: I6de4581e2908eba39b33a13b07d048a34a3bd803	2015-09-27 11:49:07 +03:00
Vivek Ghaisas	c54766586a	Fix issues identified by SpaceBeforeSingleLineComment sniff Change-Id: I048ccb1fa260e4b7152ca5f09b053defdd72d8f9	2015-09-26 23:06:52 +00:00
Niklas Laxström	4a3fd2e42a	Use wikimedia/cldr-plural-rule-parser Replaces the parser included in MediaWiki with same code in a library. Change-Id: I1d2675466a543269e17faf213aa68d2b7afaf78e	2015-09-24 21:41:50 +02:00
This, that and the other	31d0283957	Improve wording of "size-bytes" and "size-pixel" messages "B" and "P" are vanishingly rare abbreviations for "bytes" and "pixels" respectively. Let's use the full English terms for these, combined with a PLURAL magic word for good measure. Change-Id: Id59c4b9dea2c13940ae790b6a236ac08abe0a768	2015-08-30 15:23:13 +10:00
David Chan	f5c88ef8e5	Add {{bidi:}} syntax for directionality-safe arguments In parallel with jquery.i18n version: https://github.com/wikimedia/jquery.i18n/pull/76 Bug: T104472 Change-Id: I25afa50ab1e0521bd0b3779cbd16b6c190d72722	2015-07-01 11:06:45 -07:00
Vivek Ghaisas	9f5b6f5aeb	Fix whitespace issues around parentheses Fix issues found by MediaWiki.WhiteSpace.SpaceyParenthesis sniff. Bug: T102617 Change-Id: Iec7f71e64081659fba373ec20d9d2006306a98f4	2015-06-16 22:14:02 +03:00
Amir E. Aharoni	c9678525eb	Rewrite Language::hebrewNumeral() Use arrays instead of strings, to avoid using string functions with Unicode. Handle thousands according to how years like 1000, 2000, etc. are named in the Hebrew Wikipedia. Bug: T97444 Change-Id: I5334e86793d28dfcf8939a249b03a5ea85fa4e69	2015-06-02 15:10:52 +02:00
Amir E. Aharoni	399d00e7e7	Add tests for Language::hebrewNumeral() Some failing tests are commented out and will be properly fixed in subsequent commits. Bug: T97444 Change-Id: I19721b5dc3dc6bbe923d9bf401fcf5d765fb7a7c	2015-06-02 12:26:48 +00:00
Timo Tijhof	b4bac102b6	tests: Clean up file headers * Remove redundant @licence/@license from test suite files. They already have full licence headers. And @licence raises a warning in Doxygen. * Fix weird messes of comments inside comments and other things. Change-Id: I38da8ca76330f72b8dc22b0ecf1ea69d5ea55ede	2015-04-01 00:17:12 +01:00
umherirrender	0d39b3bb0d	Move Test files under same folder structure where class is (/languages/) Change-Id: I25c99272a1c2e318e6c61b4a497bf04886430e9b	2015-01-10 19:53:59 +00:00
Santhosh Thottingal	8dc3631093	Update plural data to CLDR 26 * The updates include incompatible changes for plural forms in Russian, Prussian, Tagalog, Manx and several languages that fall back to Russian. In addition there are minor changes for other languages. * Test cases were updated to reflect these changes. Bug: 62861 Change-Id: I7cce477925330fe5bbf51a8470060dc1223981d0	2014-10-27 08:30:34 +00:00
umherirrender	cd80906d4a	Change @return to start with type MediaWiki default is "@return type Description", so set a type after return and start the description with a capital letter. Also use the more common spelling of boolean. See http://phpdoc.org/docs/latest/references/phpdoc/tags/return.html for more about @return Change-Id: I4e5198822fe92836f9cef9918a9fc1a1a1e0a043	2014-08-20 20:35:41 +02:00
jsahleen	00a5c07b5c	Fixes Algerian messages file so it does not convert to Arabic digits. Bug: 69172 Change-Id: I8ba9e135daa2fc80907703b1023172c680fa571b	2014-08-07 09:47:39 +01:00
Chad Horohoe	fca0d37a2c	Language::isValidBuiltInCode() should not accept uppercase input The results of this function are used to decide whether a code is valid for loading an i18n file without any further normalization. Partially reverts `93348f3` which made the regular expression case- insensitive. Per IRC discussion, language codes should always be lowercase and it's up to callers to deal with that. Change-Id: I8975c3374a37935080d9f7eca6a602e32f67a87b	2014-07-16 18:20:22 +00:00
Amir E. Aharoni	fab8c6f541	Add grammar forms for Russian This adds support for the Russian name of Wikimedia Commons. Change-Id: If531e9ff8f46ac5294b117eec43172b4975e2ad6	2014-07-04 12:35:59 +00:00
Jackmcbarn	d998c8e96c	Return a TTL when formatting times Add an out parameter to Language::sprintfDate that returns the amount of time that its output is valid for (e.g., an output format of 'Y-m-d' at 11:50 PM would be valid for 600 seconds). Change-Id: I3f5a80aa4d303f92c97d24ab780af920894d24ef	2014-06-01 14:10:28 -04:00
umherirrender	e10ee4304e	Adjust indent of some comment blocks Change-Id: Ic25419490fa6a35c11ccc2b7810527e6661e027c	2014-05-01 18:46:34 +00:00
jenkins-bot	ab2d63b53e	Merge "Fix Language::parseFormattedNumber for lzh and zh-classical"	2014-04-25 07:13:29 +00:00
aude	60e1d9996c	Fix Language::parseFormattedNumber for lzh and zh-classical When parsing, filter any array values that are empty string before using strtr php function so that strtr can handle the array. Bug: 64347 Change-Id: I94761caa70d44febfa0999c91048a01044fc1fbe	2014-04-25 08:52:54 +02:00
Siebrand Mazeland	4ede8c2e9d	Pass phpcs-strict on some test files (11/11) Woo! Change-Id: I9fc116dfdf18c2772d047adb5bb14535d0bd39ed	2014-04-24 13:51:05 -07:00
Siebrand Mazeland	69ec133bc5	Pass phpcs-strict on some test files (10/11) Change-Id: I5624292143fcabe890779f5095eae735d7afb176	2014-04-24 13:50:56 -07:00
umherirrender	87fe91344e	Remove # from dataProvider Change-Id: Ie5414173b95e846d735827bffa34c73698e48c17	2014-04-18 19:10:38 +02:00
umherirrender	092cd8ee31	Fixed some @params documentation (tests) Swapped some "$var type" to "type $var" or added missing types before the $var. Changed some other types to match the more common spelling. Makes beginning of some text in capital. Also added some missing @param. Change-Id: Ic8aaf0a93796b97d0fa4617c1f86ff59f4b36131	2014-04-17 20:43:42 +02:00
Siebrand Mazeland	8e0c0a9fc9	Preparations for migrating core to use JSON based i18n LocalisationCache and Language have to take the JSON files into account in deciding if a language is present or not. Standardizing language validity checking with isSupportedLanguage and isValidBuiltInCode. Co-Authored-By: Niklas Laxström <niklas.laxstrom@gmail.com> Co-Authored-By: Siebrand Mazeland <siebrand@kitano.nl> Change-Id: I35bbb3a7a145fc48d14fff620407dff5ecfdd4fc	2014-04-01 14:22:03 -07:00
umherirrender	2000672ac3	Fixed spacing - Added spaces after if/foreach/catch - Added new line before end of file - Added or removed spaces before/after parenthesis, comma - Added spaces around string concat Change-Id: I0590070f1b3542108e242730e8d9a3ba9831e94f	2014-03-20 20:37:30 +00:00
Ladsgroup	16a5102765	Change URLs to mediawiki.org in comments to HTTPS These are only documentation fixes http://www.mediawiki.org --> https://www.mediawiki.org Change-Id: I62ad42be1a3aac410cc53e98ce79389ceddd8988	2014-03-20 16:59:46 +00:00
jenkins-bot	ea48400750	Merge "Add test to validate special page aliases"	2014-03-06 14:18:15 +00:00
Santhosh Thottingal	f50d3eb61e	Update Russian(ru) plural rules to CLDR 24 Russian (ru) plural rules have a major change. The 'few' form is merged with the 'other' form. The current forms are 'one', 'many', 'other'. In MW ru plural rules were overridden using convertPlural method in LanguagesRu.php with 3 forms. Effectively forms[1] and forms[2] are swapped. Followup: I9930b290d004667a3bb09e5c1663ec2c9c27d8a6 Bug: 56931 Change-Id: Ia5779e42315d3f41f52dce2bfffaee0a4297d23b	2014-01-03 13:46:19 +05:30
Santhosh Thottingal	1441f511a2	Update plural rules to CLDR 24 Updated plurals.xml with new data from CLDR 24. This data is according to UTS #35 Rev 33. Update the CLDRPluralRuleParser.js to version 1.1 from upstream https://github.com/santhoshtr/CLDRPluralRuleParser Changes to the plural rules: * Hebrew override removed since CLDR 24 matches with MW plural rules. * Updated the syntax of overridden rules to TR35 Rev 33 for Lower Sorbian (dsb), Upper Sorbian (hsb), Belarusian in Taraskievica orthography (be_tarask), Old Church Slavonic (cu), Bhojpuri (bho), Samogitian (sgs). * Removed Manx (gv) override. See I46ab3dadc7fe08c1e60bbd81a1ee841e166e9608. * Removed the overridden convertPlural method for Serbian from LanguageSr.php, since CLDR 24 matches with MW rules. Updated and added more tests. Tests updated for Serbocroatian (sh), too. Old CLDR versions had 4 plural rules and MW had only 3. In CLDR 24, the form 'many' was removed and it became identical to the MW. Same for Bosnian (bs) and Croatian (hr). Also for variants sr-ec and sr-el * Macedonian (mk) used to count 11 as 'other' form. CLDR 24 counts it as 'one'. Not overriding, using CLDR 24 here. Updated the tests. MW will not override this. * Armenian (hy) used to count 0 as 'other'. Now it is 'one' form. Updated the tests. MW will not override this. * Latvian (lv) used to count only 0 as 'zero' form, but in CLDR 24, any number satisfying the following formula is counted as zero: n % 10 = 0 or n % 100 = 11..19 or v = 2 and f % 100 = 11..19 Examples: 0, 10~20, 30, 40, 50, 60, 100. Updated the tests accordingly. Not overriding it in MW. Users will see different plural form for the above numbers. * Removed Ukrainian custom plural rule since it matches with MW * Russian (ru) plural rules have a major change. The 'few' form is merged with the 'other' form. The current forms are 'one', 'many', 'other'. In MW ru plural rules were overridden using convertPlural method in LanguagesRu.php with 3 forms. Effectively forms[1] and forms[2] are swapped. This will affect the messages, and such messages must be reviewed and updated. This change is not included in this patch and will be done separately. Russian is the only remaining language class with convertPlural method overridden. Notable impact on the existing messages: * For languages ru, uk, be_tarask, sr, For the special case of two plural forms and first mapped to 1 and rest to the other form, syntax like {{plural:$1\|1=one\|other}} should be used. For further information regarding each of the above language changes, see 1. http://unicode.org/cldr/trac/ticket/3727 2. http://goo.gl/H2HEz CLDR 24 can handle fractions. Ideally it should start working in MW without any code changes, but MW language test suite does not have enough tests to confirm. Followup: `e571717e06` Bug: 56931 Change-Id: I9930b290d004667a3bb09e5c1663ec2c9c27d8a6	2014-01-03 11:53:10 +05:30
jenkins-bot	c9eaaf7093	Merge "Plural rules: updates for UTS #35 Rev 33"	2013-12-23 14:00:34 +00:00
Tim Starling	e571717e06	Plural rules: updates for UTS #35 Rev 33 * New operands i, v, w, f, t * New operators =, !=, % * Ignore "samples", which are basically unit tests embedded in rule specifications * Ignore the new "other" rules, which have an empty condition. It doesn't really makes sense to parse them, since the empty condition means special handling should be done in the caller, it is not equivalent to an unconditional true or false. * Trailing zero support requires that the input number be a string. Documented this. * Fixed some comments * Added test cases for new features Bug: 56931 Change-Id: I96986c0c664f785e75b0a4ced2ec9e37b72681c1	2013-12-13 11:53:29 +11:00
Santhosh Thottingal	3e6cb8ac1a	Correct the plural forms for Manx (Gaelg) Backported the plural rules from CLDR 24 as an override to CLDR 23 rules existing in MediaWiki. The syntax for plural rules changed in CLDR 24, so modified the syntax to fit the CLDR 23 syntax Once we are ready with the updated parsers for CLDR 24(See bug 56931), we should remove the override. Since we remove the custom plural forms in MW in favor of CLDR, the following changes come into effect: 1. 'few' form used as 'zero' form in MW. Practically that makes 'one' form used as 'two' form and 'two' used as 'few' form. This breaks existing gv {{PLURAL}} usage as discussed in the bug report 2. CLDR defines 'few' form as n % 100 = 0,20,40,60 but MW adds 80 also to that list, ie n % 100 = 0,20,40,60, 80. So with this patch, 80 is no longer considered as 'few' plural form. Bug: 47099 Change-Id: I46ab3dadc7fe08c1e60bbd81a1ee841e166e9608	2013-12-11 12:05:43 +05:30
aude	49b987b7ca	Add test to validate special page aliases Bug 57410 Change-Id: I185f58a618a0f0632d464552a94d704afd000e94	2013-12-10 12:44:18 +00:00
Santhosh Thottingal	a35f710791	Make explicit plural forms work for Russian Russian has overridden convertPlural method, that was not taking care of explicit plural forms. Follow up: I2a9f93567087babb896999f1214d3c56afc67c96 Bug: 54514 Change-Id: Ia977fa544b1d0e40222c7296b7145dcd6f93ecc2	2013-12-04 06:46:57 +00:00
Pavel Selitskas	81fc875c0b	Handle explicit plural forms in custom convertPlural in language classes A new protected method looks for explicitly defined forms. Every overridden language class is required to use this method. Includes tests. Redoing old patch I6dc759e3dfb05d6673209ba00da6592a384d5300 Bug: 46422 Change-Id: I2a9f93567087babb896999f1214d3c56afc67c96	2013-12-03 12:50:02 +00:00
daniel	107bd92ec7	(bug #56685 ) make sure commafy can deal with strings. Localization of numeric values should operate on the values as strings, and should handle strings representing very large numbers gracefully. Change-Id: I95394b96f9b70deb06ab818b54e08ac4ccb38c6c	2013-11-26 20:40:53 +01:00
Dereckson	8c8ff51233	Improving CLDR Plural Rule Evaluator documentation. Change-Id: Ic6581de7cc69dea7af0bee5596497db509d6b9ba	2013-11-22 02:17:31 +01:00
umherirrender	5dbfd5bf80	Fixed spacing - Removed trailing spaces in comments - Removed multiple empty lines - Removed space after object operator Change-Id: I9fd3256ab490c7cd2034de3fd94e6be6e6d6d8f2	2013-11-21 18:52:25 +00:00
UltrasonicNXT	87fe16c445	Prevent space before ellipsis when truncating When truncating a string at a point where it contains a space (ie "hello world" to 9 chars), the resultant string will have a space before the ellipsis ("hello ..."). This is both grammatically incorrect and just looks wrong, and is fixed by trimming the string before appending the ellipsis. Change-Id: Iec86b17bfc8c50e4c1a96fd373861841fc57848d	2013-11-15 14:29:18 -04:00
addshore	caec5f920a	@covers tags for the rest of test files.. Change-Id: I0fafe80531325a412472ab7c9fc6d81c861b3751	2013-10-24 21:38:08 +01:00
addshore	46a17d0fc3	Cleanup /languages/* tests This change: - Adds method scope - adds @covers tags - adds various @todos - fixes some comments Before the changes tests ran with: 1383 tests, 1412 assertions 10 skips After changes the results remain the same Change-Id: Iee57447bdb47026952ef5dcce6fed5dad0f80e52	2013-10-22 12:32:29 +02:00
Chad Horohoe	c9b831a02a	Clean up language test cases objection construction I have no clue how this ever actually worked under Zend, but it definitely didn't with HHVM. Now it works on both. Change-Id: I521dfb74f30306736adda5662598fd036ad9849b	2013-09-26 16:09:13 -07:00
Liangent	d0e3dc94c3	Add converted namespace names as aliases to avoid confusion. Currently if the site language is zh and a user is using variant zh-tw, namespace names from zh-hant are displayed because of the language converter, but they're not accepted by MediaWiki as valid namespace names by default because zh falls back to zh-hans. For core namespaces, all converted namespace names are manually added as $namespaceAliases in MessagesZh.php but it's not always done in extensions. With this patch converted namespace names are automatically added as namespace aliases when namespace aliases are loaded. In some followup commit it makes sense to remove existing core namespace aliases which were created for this reason. Change-Id: I01873d9c64a9943afbb655d6203cec9ebd39fb72	2013-08-13 13:01:40 +00:00
jenkins-bot	b8f9b16b84	Merge "New function Language::getParentLanguage()."	2013-06-28 15:15:04 +00:00
Niklas Laxström	a7a693f4b0	Avoid exceptions by first checking language code validity Bug: 49423 Change-Id: I3fd98ba08393856311a48fa40769027460c72ef9	2013-06-13 09:35:32 +00:00
Liangent	396e18a8e5	New function Language::getParentLanguage(). Change-Id: Ib2109176b7dfc7ec2d0ee827c804cf93ea83b9e5	2013-06-10 18:07:49 +00:00
Kevin Israel	d510d0c0c7	Language::convertPlural: check if matching form exists It is possible that only explicit plural forms are specified, and therefore, it is possible that none match. However, handling of explicit forms came after the count( $forms ) check, so input such as {{PLURAL:\|1=}} would trigger a "PHP Notice: Undefined offset: -1". Change-Id: I8494de8ceb9e0cfff7203c69c21f02b3731275af Follows-Up: I50eb0c6d1c02ca936848d310de625ed1fe43d91a	2013-05-25 19:30:53 -04:00
Alexandre Emsenhuber	4742232b0d	Fix bootstrap in unit tests - Remove check for version, that version is already enforced in phpunit.php, so there is not point showing a warning for it is useless - Remove call to MessageCache::destroyInstance(), there is no need for it, since $wgMessageCacheType is set in phpunit.php before running Setup.php - Remove includes of bootstrap.php in LanguageSrTest.php and LanguageUzTest.php Change-Id: I4b2db6b3e6f001175e1a407c5add2972aade5e60	2013-05-03 21:45:06 +02:00
jenkins-bot	9cd8ce5034	Merge "Add input checks for Language::sprintfDate()"	2013-04-29 08:50:08 +00:00
Siebrand Mazeland	791d0b2a98	Update code formatting Change-Id: I16a9b42651f1cfb1a70dffbb67b7b83dfeb90d03	2013-04-26 14:21:20 +00:00
Siebrand Mazeland	35f0a66f32	Add input checks for Language::sprintfDate() Check if the timestamp has a length of 14 characters and if it is numeric. Throw an exception otherwise. Includes tests. Bug: 47629 Change-Id: I9a4fd0af88cf20c2a6bd72fd7048743466c1600f	2013-04-26 10:05:18 +02:00
Brad Jorsch	e9e1b0a777	(bug 33454) Add timezone support to Language::sprintfDate Add an optional timezone parameter to Language::sprintfDate, add format characters eIOPTZ, and correct crU. While we're at it, remove backwards-compatibility code for 'N' and then merge the existing switch cases for cr and wNzWtLoU that are basically identical, since all those cases need to be changed anyway. Bug: 33454 Change-Id: Iea1f78428bc0d32d6395818311dbe4b94d776c42	2013-04-09 22:49:04 +00:00
Siebrand Mazeland	6da93fc6f6	Update code formatting Also update some previous inconsistencies pointed out by Krinkle in change IDs: * Ide20743a2e84ff68549286120e6cff9d9f396f54 * I811ca957b6588085d67606ebc0cd4033a1e53839 Change-Id: Ife33b931870d0d7e04fcb40974997436d27f528f	2013-03-27 14:15:11 +01:00
Amir E. Aharoni	40c01f2492	Update plural rules for Hebrew A recent CLDR update changed the plural rules for Hebrew and added a "many" form. That rule has a mistake, however - it is not supposed to include the number 10. This is reported as http://unicode.org/cldr/trac/ticket/5828 This commit updates the plural overrides for Hebrew and makes them closer to the latest CLDR, but with a fix for that rule. It also updates the tests to include support for the new rule and to make sure that the right fallbacks are used when less than four rules are supplied, because usually that is the case. It is thus a partial revert of the changes introduced in I3d72e4105f6244b0695116940e62a2ddef66eb66 . Mingle-Task: 2715 Change-Id: I1315e6900ef7a89bf246e748b786f7fc31a629c6	2013-03-22 15:21:50 +02:00
Timo Tijhof	b36d883017	Tests: Make phpunit providers "public static". Follows-up I9d2b148e57 (including phpunit/languages this time). Bug: 46434 Change-Id: I30e5efcd88c516121c454676bd7a18f9b7c8fca6	2013-03-22 03:12:37 +01:00
Kaldari	06b0967caa	Allow the retrieval of the plural rule type for a given number For example, find out which rule type should be applied for 5 items in Arabic. The result would be 'few'. This implementation should be non-disruptive and completely backwards compatible (which is the main reason it isn't a lot simpler). Change-Id: I3d72e4105f6244b0695116940e62a2ddef66eb66	2013-03-20 14:34:12 +05:30
Santhosh Thottingal	133f5952fd	Remove custom plurals for Nso and Sl in favour of CLDR Nso - Northern Sotho Sl - Slovenian Plural rules were not changed. They are same in CLDR and MW Change-Id: I0e0c84352de2de8f58af5a9147ba18b0fe1fb39a	2013-03-20 08:35:59 +00:00
Santhosh Thottingal	43f5eb600b	Move plural rules of Samogitian(sgs) to plurals-mediawiki.xml * CLDR does not define plural rules for sgs. * Port the plural rules present in LanguageSgs.php to CLDR plural definition syntax * Remove LanguageSgs.php * Update the tests, reorder/rename the plural form names Change-Id: I44658402d69a6805cdfd189fe780eadee94056c7	2013-03-18 13:41:46 +05:30
Siebrand Mazeland	ee93ce6699	Add space between number and unit of measure Also update tests. Spotted by Opraco in https://translatewiki.net/wiki/Thread:Support/About_MediaWiki:Bitrate-kilobits/en Change-Id: I81ad428ebc8b2511b871bf98540bc74508f00939	2013-03-12 19:40:47 +01:00
Santhosh Thottingal	406d958795	Remove custom Latvian(lv) language plural rules CLDR is now in sync with MW plural rules. So no need of custom plural logic Change-Id: I399f99ddd40eea67e981d5710658ba635f115a31	2013-03-04 16:49:02 +05:30
Niklas Laxström	cdef54377e	Docs, typofix, additional testcase for I7be51e Change-Id: I0ab17cb749c23b666e0bb1ee61fe7d424e717fde	2013-02-27 08:56:06 +00:00
Santhosh Thottingal	d61373ec40	Remove MediaWiki overrides for plural rules for Scots Gaelic (gd) Also cleanup the tests. Change-Id: Ic29026a7a8128b890882b8869569309ab05e4226	2013-02-26 16:50:56 +01:00
jenkins-bot	78271824d2	Merge "(Bug 44987) Allow n=form in plural syntax"	2013-02-26 15:27:31 +00:00
Amir E. Aharoni	6848470450	Wrote proper skip reason Change-Id: I80627dab1db279ee0b9d27ec929671bc458558b7	2013-02-18 23:58:43 +05:30
Siebrand Mazeland	454d92fb7c	Update formatting 8 of n. Change-Id: I55551510e7afde5b6b981697d5c0efd7b9507585	2013-02-15 13:08:55 +00:00
Siebrand Mazeland	feb9419a51	Update formatting 7 of n. Change-Id: I07687a4381f29fd9fc73666e460f25769ed54092	2013-02-15 12:53:41 +00:00
Santhosh Thottingal	ff3df41363	(Bug 44987) Allow n=form in plural syntax phpunit testcases included Change-Id: I7be51e24a0b953dcd1f9cb21f54af9b4127a5cdb	2013-02-14 15:32:46 +05:30
Santhosh Thottingal	1f650cb9b7	Update plural rules from CLDR, and correct Armenian plural rules * Upgrade to revision 8007, Contains minor change - Armenian(hy) is added * Remove MW custom plural logic from LanguageHy.php * Add qunit test case * Correct phpunit testcase Change-Id: If78436fa1597e6f3b7f050c5eede4521018904c0	2013-02-14 11:37:24 +05:30
Amir E. Aharoni	2a481e74cf	Russian grammar updates * Replace == with === * Add support for the prepositional case * Add support for Wikidata * Add tests Change-Id: Ic02bfb9ce88e93775036f3d15921cedca602237c	2013-02-11 07:15:13 +01:00
jenkins-bot	75ef257c29	Merge "pass codesniffer on tests/"	2013-01-28 23:47:45 +00:00
Antoine Musso	0fd05285d7	pass codesniffer on tests/ Fix almost all occurrences of the following sniffs: Generic.CodeAnalysis.UselessOverridingMethod.Found Generic.Formatting.NoSpaceAfterCast.SpaceFound Generic.Functions.FunctionCallArgumentSpacing.SpaceBeforeComma Generic.Functions.OpeningFunctionBraceKernighanRitchie.BraceOnNewLine Generic.PHP.LowerCaseConstant.Found PSR2.Classes.PropertyDeclaration.ScopeMissing PSR2.Files.EndFileNewline.TooMany PSR2.Methods.MethodDeclaration.StaticBeforeVisibility Change-Id: I96aacef5bafe5a2bca659744fba1380999cfc37d	2013-01-28 12:14:26 +01:00
Amir E. Aharoni	fee2b0045e	(bug 41476) Implement Language::isKnownLanguageTag() Change-Id: I130d8e0b397323e21058cf46510440da066fa12b	2013-01-28 09:10:11 +00:00
Amir E. Aharoni	4a25561370	(bug 41478) Implement Language::isWellFormedLanguageTag() Change-Id: Ief5643e9a7d3883d6d131503087aca15207b0a44	2013-01-25 13:05:13 +02:00
Niklas Laxström	0e08362b40	(bug 41477) Add Language::isSupportedLanguage Change-Id: If48c23fd580133bf78c19d4a0e8e00e74a639fa1	2013-01-21 08:47:53 +00:00
jenkins-bot	8b1f803478	Merge "Eliminate dummy Language instances from being created"	2013-01-12 10:24:17 +00:00
Yuri Astrakhan	8551f29ae2	Language::listToText cleanup with unit test Change-Id: If88ab7da07e336fc5f6264c7d6b4f6ce542f99c9	2013-01-07 20:02:56 +00:00
Niklas Laxström	04bf35d331	Eliminate dummy Language instances from being created By checking the code against $wgDummyLanguageCodes we can get rid of checking it on a case-by-case basis. It's only a suggestion (I don't know if it can break anything), and Amir Aharoni said that big changes are coming (Bug 41103). In private case, this change fixes Bug 27571 and maybe some other language fallback related issues. Change-Id: I5212beabd5cc212b50ee98b5b53ec01b20ffd0c3	2013-01-02 17:42:22 +00:00
IAlex	83aafac2b0	Merge "Do correct average year length arithmetic."	2012-12-29 19:18:05 +00:00
Liangent	70e300e270	Do correct average year length arithmetic. We have ( 24 * 3 + 25 ) leap years in total every 400 years in the Gregorian calendar. Change-Id: I2ad9036473afa914ecf8ddcf99ce27e316178f76	2012-12-29 22:15:36 +08:00
Kaldari	5e13ecaa9b	Fixing some variable names and comment formatting - no functional changes Changing $_ to $number; changing $numberpart to $integerpart Also adding a unit test for the commafy function Change-Id: Iaf6dd027bd70722d316d1a9c10c9913fff8300ce	2012-12-24 07:47:21 +00:00
Liangent	c50cd60069	Always return something nice in Language::translateBlockExpiry() Change-Id: I30a1950df5ae018cb9124392dc8d6e99ca3b98b8	2012-11-21 00:01:35 +08:00
Amir E. Aharoni	b6664e6dc8	Add grammar tests for Hebrew Change-Id: I0114c87ec63cc224f42c81497af41fe6e72a59bc	2012-11-10 15:15:27 +05:30
Antoine Musso	a03bf9e27f	tests: rm duplicate code in language classes The language classes have been using the same setUp() tearDown() to craft a new language object. I have abstracted that code in LanguageClassesTestCase and made all the language test classes to extend it. The language is interpolated directly from the class name and an object for it can be retrieved with the getLang() method. Change-Id: Ib931336ce219edabe2c72b7e9f04c976a500723e	2012-10-29 09:40:30 +01:00

323 commits