Thijs/wiki.techinc.nl

Author	SHA1	Message	Date
Aaron Schulz	950cf6016c	Rename DB_SLAVE constant to DB_REPLICA This is more consistent with LoadBalancer, modern, and inclusive of master/master mysql, NDB cluster, and MariaDB galera cluster. The old constant is an alias now. Change-Id: I0b37299ecb439cc446ffbe8c341365d1eef45849	2016-09-05 22:55:53 -07:00
Aaron Schulz	d957cb7347	Cache revision lookups done by Parser Inverse flame graphs shows revision lookups as one of the big three queries (Revision, LinkCache, getTitleInfo of ResourceLoaderWikiModule). This works via a new Revision::newKnownCurrent() method needs both page/rev ID from the DB (to avoid invalidation) and fetches the user name and rev_deleted if needed (again to avoid invalidation). Parser does not care about fields anyway in the template path. Also improved cross-wiki support a bit, and fixed up some docs and IDEA errors. Change-Id: Icad602dba5de18c7758b77fd23b0a450ff21d09f	2016-09-05 02:22:51 +00:00
Aaron Schulz	97f004694c	Adapt the ParserOutput cache TTL when including special pages For simple pages that transclude special pages, like user pages including Special:PrefixIndex, the TTL is allowed to drop to 15 seconds if the page parses fast enough. Bug: T139893 Change-Id: If41885ded648d68352fe3d06336d98aa0ab53966	2016-08-31 17:17:38 +00:00
Amir Sarabadani	6b221fa96a	Clean up array() syntax in docs, part IV Change-Id: If626409a93d31bf90c054c9bf7ba44a78ea9a621	2016-08-26 16:06:58 +04:30
Kunal Mehta	85034abca5	content: Refactor normalization of line endings code The code that normalizes line endings ("\r\n" and "\r" to "\n") and trims trailing whitespace is buried in Parser::preSaveTransform(), and was duplicated to TextContent in `96b6afb31d`, as non-wikitext content models should still be normalizing line endings. This splits the duplicated code into TextContent::normalizeLineEndings(), and utilize it in the Parser. Additionally, expand the documentation of TextContent::preSaveTransform() to document that subclasses should make sure they normalize line endings during the PST stage. And remove a useless rtrim() call from WikitextContent that did nothing. Change-Id: I9094c671d4bbd23d75436f8f1d682d6dd6e6d2fc	2016-08-23 11:09:59 -07:00
Brian Wolff	e2a6fe5711	SECURITY: XSS in unclosed internal links rawurldecode was being run on unclosed internal links which could allow an attacker to insert arbitrary html into the page. See also related: r13302 Bug: T137264 Change-Id: I4e112a9e918df9fe78b62c311939239b483a21f5	2016-08-23 03:39:36 +00:00
jenkins-bot	ead55128f7	Merge "TextContent: Normalize newlines in preSaveTransform()"	2016-08-16 15:34:16 +00:00
Brad Jorsch	96b6afb31d	TextContent: Normalize newlines in preSaveTransform() This does the same normalization of newlines that Parser::preSaveTransform() does. This should be appropriate for any text content type, especially considering that EditPage uses WebRequest::getText() which does a less-strict version of this same transformation. This also cleans up the code for doing that newline replacement to be a bit less verbose. Bug: T142805 Change-Id: I462afcda502f031a8b0360d982ce2398a0383a96	2016-08-16 10:21:32 -04:00
Tim Starling	6a04d86149	Remove all assert() calls with string parameters These fail when HHVM is in RepoAuthoritative mode Change-Id: Ifb1628f8269b2b651154b740b95cc14163a1b186	2016-08-15 23:11:18 +00:00
Florian	794bb8bb25	Fix comment of get/setLinkRenderer in doxygen Doxygen requires the full qualified name of the class in a comment or in the @aram/@return annotation, otherwise the class isn't linked in the resulting output[1]. This commit changes the LinkRenderer annotations in SpecialPage and Parser to \MediaWiki\Linker\LinkRenderer. [1] https://doc.wikimedia.org/mediawiki-core/master/php/classSpecialPage.html#a3560214f63fc2f20c63b4025db5cd81d Change-Id: I74cedcd764a6053cc5a0c6d2eedbedb72651f57c	2016-08-09 17:23:00 +02:00
Reedy	c6fc119c0a	Add/update doc blocks for MWTidy Change-Id: I0b87e119048fd993f8bfda25a6c6b744d59804d1	2016-07-29 01:24:34 +01:00
Tim Starling	134f8c4513	Don't run the non-Tidy "bug 2702" hack unless Tidy is really missing We have two hacks which are used when Tidy is not available: one in Sanitizer::removeHTMLtags(), and the second here as a late Parser pass equivalent to Tidy itself. But the Sanitizer one was enabled only if MWTidy::isEnabled() returned false, whereas the Parser one was enabled also when tidy was disabled in ParserOptions. This patch makes them both consistent, it enables the bug 2702 hack only when MWTidy::isEnabled() returns false, and when Tidy is disabled in parser options, the output is simply passed through. This allows tidying to be done separately on the ParserOutput, as is required by the proposed ParserMigration extension (I24d0776a933fa3f). Eventually the bug 2702 hack will be removed in favour of a pure-PHP HTML 5 parser, but it looks like it is too early for that. Change-Id: I94be6c9dec531c23ef80cb36732243bd6858bf22	2016-07-27 14:47:36 +10:00
Aaron Schulz	b7c4c8717f	Move NewPP limit report HTML comments to JS variables * Instead of having messy code to create a hidden HTML comment of English strings at the bottom of the page, expose the structured data of the parse information to JS so tools can use it. * Make makeConfigSetScript() use pretty output so these variables are also easy to read in "view source". * Remove ParserLimitReportFormat hook, since the data is not formatted to HTML anymore. Bug: T110763 Change-Id: I2783c46c6d80f828f9ecf5e71fc8f35910454582	2016-07-26 11:31:20 -07:00
jenkins-bot	e657b4bac8	Merge "Remove modulemessages from ApiParse and Output (deprecated in 1.26)"	2016-07-26 12:15:22 +00:00
Tim Starling	7a5fbec82d	Add MWTidy::factory() A convenient factory function to eliminate code duplication in ParserMigration's MigrationEditPage::tidyParserOutput(). Change-Id: I058912885025e7a9402912236c65c44e32ef036e	2016-07-26 15:12:55 +10:00
Timo Tijhof	9093af0a28	Remove modulemessages from ApiParse and Output (deprecated in 1.26) No uses of 'modulemessages', getModuleMessages() or addModuleMessages() anywhere in Wikimedia Git. Change-Id: I59420880f3545d1aabf9bcbea1e34b1475697d26	2016-07-26 00:13:04 +01:00
Tim Starling	b2f7bb4d76	Preprocessor_Hash: use child arrays instead of linked lists The singly-linked list data structure of Preprocessor_Hash was causing stack exhaustion due to the need for a recursion depth proportional to the number of children of a given PPNode, in serialize() and on object destruction. So, switch to array-based storage. PPNode_* becomes a temporary proxy around the underlying storage, which avoids circular references and keeps the storage very compact. Preprocessor_DOM uses similar temporary PPNode objects, so the fact that $node->getFirstChild() !== $node->getFirstChild() should not cause any new problems. * Increment cache version * Use JSON serialization of the store array instead of serialize(), since JSON is more compact, even after gzipping. * For efficiency, make $accum a plain array, and use it as an array where possible, instead of using helper functions. Performance and memory usage for typical input are slightly improved: something like 4% faster for the whole parse, and 20% less memory for the tree. Bug: T73486 Change-Id: I0d6c162b790d6dc1ddb0352aba6e4753854f4c56	2016-07-22 05:25:11 +00:00
jenkins-bot	e305dd6ced	Merge "Move Linker::getLinkColour() into LinkRenderer"	2016-07-18 16:16:07 +00:00
Tim Starling	d3d682fb45	Hide marked empty elements by default (stage 1) We originally imagined rolling out the display of empty elements simultaneously with the Html5Depurate, but now we have added support for marking empty elements to Html5Depurate and plan on having some sort of longer migration period. So, move the relevant CSS to content.css, and remove the concept of CSS dependant on tidy driver. Add a body class which will allow the effect to be toggled in a gadget or extension. Actual toggling in the CSS will be in the stage 2 patch, to be deployed after the varnish cache and parser cache have expired. I originally imagined that there would be a gadget that overrides the rule with an !important selector, but that method does not allow you to recover the original display property, which is often overridden by the style attribute or site CSS to be "inline". Also, in RaggettWrapper, switch to the new class mw-empty-elt, following Html5Depurate, instead of mw-empty-li. The old class will be removed in the stage 2 patch. Change-Id: Ic0f432c43a006629ca5a1a7c2dda3552ceb4dc4f	2016-07-14 14:24:27 -07:00
Tim Starling	8a57d86ea7	Rewrite TidySupport and add option --use-tidy-config * Have TidySupport provide $wgTidyConfig instead of the legacy globals * Add --use-tidy-config option to parserTests.php. This tells TidySupport to use the tidy configuration from LocalSettings.php instead of the traditional safe defaults. * Add a way for TidySupport to disable tidy via $wgTidyConfig, using driver=>disabled Change-Id: Ie76e68e2d5238d0a1aef49a1a815c0d1cd8bfdae	2016-07-12 14:25:03 -07:00
C. Scott Ananian	ce081a3d7b	Hook up Balancer as a Tidy implementation. This is an HTML5-compliant parse/serialize tidy implementation, with well-delineated hacks to support the <p>-wrapping done by legacy tidy. Change-Id: I4fd433fd6f1847061b0bf4b3e249c918720d4fae	2016-07-12 14:18:04 +10:00
C. Scott Ananian	6cdae80513	Add tracking category when editors use the deprecated self-closed tag hack. Some pages use constructs like `<b/>` or `<span/>` to protect spaces or special characters at the beginning/end of templates. This syntax is incompatible with HTML5 parsing rules, which dictate that these should be treated as open tags, and instead rely on an unusual quirk of the `tidy` program that removes invalid constructs. This syntax is deprecated as part of the process of reconciling `tidy` with modern HTML5 parsing semantics. Authors can use ` ` or `<nowiki/>` as valid replacements. In order to provide time to transition existing content, pages using self-closing tags in violation of the HTML5 parsing specification will have their templates/pages added to a new tracking category. After these uses are fixed, we will change the sanitizer to treat these as normal open tags, to be consistent with the HTML5 parsing spec. Note that this construct is already disallowed if tidy is disabled; it is rendered as `<b/>`. We add a tracking category in the no-tidy case as well, in preparation for eventually making the no-tidy and with-tidy behaviors consistent. Bug: T134423 Change-Id: Ie1cf3aa40d5483bf395ece539f0240b694ff04ab	2016-07-12 14:18:04 +10:00
Aaron Schulz	005b4d6fff	Try to predict the rev_id when preparing edits During both the edit stash and first parse in on page save, guess what the rev_id will be and use that instead of null. Only reparse if it turns out to be wrong. This avoids extra parsing on wikis that have low-medium traffic, and does not cost much. The parsing that can be avoided is: a) in doEditContent() by using the stash b) in doEditUpdates() by using the doEditContent() result, whether that was able to use the stash or not itself Also improved the parse operation logging in save paths. Bug: T137900 Change-Id: Ic6faae70a78b4e223e4d3585cefd482c0fa00677	2016-06-29 05:39:33 -07:00
Kunal Mehta	fb04b0ce28	Parser: Use LinkRenderer for building ISBN magic links Instead of manually building the <a> tag, use LinkRenderer to create it. Change-Id: Iaefe85527307a8399e9f52dde58fb2c24c4753c2	2016-06-23 14:11:24 +00:00
Kunal Mehta	8a6326c211	Add SpecialPage::getLinkRenderer() And SpecialPage::setLinkRenderer(), so the Parser can pass on its LinkRenderer instance for when special pages are being included in a page. Change-Id: If9a9c648ab670b824ce534e7cf0d20d41e1bfd12	2016-06-22 23:32:00 +02:00
Jackmcbarn	d05dde4329	Render bad images in wikitext as links In galleries, bad images are rendered as links. This causes the same behavior to occur in wikitext, rather than the current behavior of not rendering anything. Change-Id: I1a074bff7cb661b5b4e6db9503eb6a5de702ee2f	2016-06-19 03:24:57 +00:00
Aaron Schulz	7d42e96748	Deprecate Parser::disableCache Few maintained extensions still rely on this and it is bad practice to use this for handling cache correctness. Change-Id: I2de481198bbff5c4f3dd81fc6d1b137e4c37b93f	2016-06-18 19:55:43 +00:00
Brian Wolff	7730dee63b	Make transcluded special pages not disable cache in miser mode. Previously {{Special:Foo}} would cause parser cache to be disabled, now have a method in SpecialPage to control this behaviour and set arbitrary caching times. Note: This does not affect caching of direct views to the special page The new default is now disabling cache if not in miser mode, otherwise setting to 1 hour, except for Special:Recentchanges and Special:Newpages which set to 5 minutes. These values are possibly really low, but for now I think best to be close to the old behaviour. We had 0 caching for these things for years, and afaik it hasn't caused any big issues. Part of me wonders if Special:Recentchanges should stay at 0, but that sounds crazy. This change also causes transcluded special pages to not be "per-user" if they are being cached (Specificly $wgUser et al become 127.0.0.1). Bug: 60561 Change-Id: Id9ce987adeaa69d886eb1c5cd74c01072583e84d	2016-06-14 20:46:32 -07:00
Aaron Schulz	879ebfb18a	Use a low TTL for parser output when special pages are included Previously, no TTL at all was used, which is quite harsh on performance and had downstream effects like disabling edit stashing for affected pages. Bug: T136678 Change-Id: I2462057aa189cfb05fe65d0b3c081a9fd10066a2	2016-06-14 17:48:04 -07:00
Aaron Schulz	147f79eedd	Improvements to {{REVISIONUSER}} handling * Do not change the result to a null editing user anymore. * Use a new vary-user flag instead of vary-revision. This will only cause a reparse on null edits. Normal edits can still use the prepared output now. * Edit stashing now applies for pages with this magic word. * Fixed bug where the second prepareContentForEdit() call (due to vary-X flags) would still check the edit stash. Bug: T135261 Bug: T136678 Change-Id: Id1733443ac3bf053ca61e5ae25db3fbf4499e9f9	2016-06-14 19:28:09 +00:00
Timo Tijhof	8eca3b5027	parser: Remove redundant comment about revisionsize cache vary Follows-up `457431b`. Change-Id: Iac3e4d6c11de3737155e7f7ff35ec7a6a3873865	2016-06-14 01:26:37 +02:00
Aaron Schulz	457431b57b	Avoid setting vary-revision for {{REVISIONSIZE}} Just always use the input size for new revisions. If they are saved, then that should be the revision size. If they are just null edits, then the size must have matched the current revision. This also enables edit stashing for this case. Change-Id: I428c0cc87750eeddd1d7dcebd1a2b03817cec441	2016-06-13 23:00:05 +00:00
jenkins-bot	089612544d	Merge "Show ParserOutput warning instead of on the actual page output for ignored display titles"	2016-06-02 21:23:04 +00:00
Kunal Mehta	d671429e41	Parser: Pass Title onto Linker::makeExternalLink() Otherwise $wgNoFollowNsExceptions functionality won't work. Change-Id: I2e1c5ad41f94568bff7f24a400d555b604cfe22e	2016-05-31 22:47:51 -07:00
Kunal Mehta	b07eb85267	Make $url parameter to Parser::getExternalLinkAttribs() required All callers in Gerrit pass $url in. Change-Id: I36246f6510db414dcc7023f8779796c060c3eba5	2016-05-31 21:25:18 -07:00
Kunal Mehta	5119236d4d	Move Linker::getLinkColour() into LinkRenderer * Rename to getLinkClasses() since it's not really returning colours, but CSS classes. * Dependency inject LinkCache into LinkRenderer * Update all callers of Linker::getLinkColour(), and mark it as deprecated (no other uses in Gerrit) * Update a bunch of tests for new dependency Change-Id: Id178e2dcc60b833ce2dbad4920896b93cabba1bf	2016-05-27 09:18:09 -07:00
Kunal Mehta	96e15c9bd2	Parser: Make makeKnownLinkHolder() protected, and remove $query handling Extensions shouldn't be calling this, just the Parser, so make it protected. And since the only caller passes an empty array for $query, we can just remove it entirely. Change-Id: I3adbcaabbb40870eb3df1495c3c2743ff21f0c64	2016-05-26 15:00:49 -07:00
Kunal Mehta	9d867e3c7a	Parser: Replace Linker::link() with LinkRenderer Replaces usage of Linker::link() in Parser and LinkHolderArray with the new LinkRenderer. Change-Id: Icb796ef08d70926728732ab5468940c09ba5eaf8	2016-05-26 14:05:47 -07:00
jenkins-bot	67a97fced6	Merge "Language: Introduce new method equals( Language $lang )"	2016-05-23 16:03:50 +00:00
jenkins-bot	408e9de28c	Merge "Add pages with ignored restricted {{DISPLAYTITLE}}s to a tracking category"	2016-05-22 21:17:10 +00:00
Bartosz Dziewoński	06b9d0af42	CoreParserFunctions: Return 0 from {{PAGESIZE:}} when length is unknown Revision::getSize() might return null when the revision.rev_len field is null. That should never happen normally (the field should get backfilled as part of the update process), but we've also had a bug where rev_len was not being recorded for empty pages (see T135414 for details). It's saner to return a number here rather than empty string, and 0 should actually be correct for all pages affected by that issue. Bug: T20998 Change-Id: Ie12f0be24f00aaf8b90b25c4921a97df3b789369	2016-05-22 18:39:11 +00:00
Glaisher	bacd87e494	Show ParserOutput warning instead of on the actual page output for ignored display titles Ignored restricted DISPLAYTITLE warning isn't really relevant for the casual reader so don't show it in the page output. Instead show it above the edit box. Bug: T135949 Change-Id: I009dd865bec7b6e3a7492c49db97074483f93ee4	2016-05-22 22:45:56 +05:00
Glaisher	8af59afa0d	Add pages with ignored restricted {{DISPLAYTITLE}}s to a tracking category Added to "Pages with ignored display titles" category (message key: "restricted-displaytitle-ignored") Follow up to I6ae6d5d0e567ba9c86e46c32240ee51a2ca5d8d1 Bug: T135949 Change-Id: I9e0f8b1e3d39a62c13191bea6734fb136e976e0c	2016-05-22 17:19:46 +00:00
umherirrender	72632115d6	Fix various phpcs error from last security patches Found by tests: https://integration.wikimedia.org/ci/job/mediawiki-core-phpcs-trusty/1069/console Breaking merges Change-Id: If01b94705cd7b939ac380053730b1b602c838a8e	2016-05-20 20:20:36 +02:00
Brian Wolff	13ece3550e	Add rel="noreferrer noopener" when target attribute would open window noreferrer is used as support for noopener is very limited. This is to prevent the attack detailed at https://mathiasbynens.github.io/rel-noopener/ where you can navigate the parent window, even if the new window is a cross-origin. Bug: T133507 Change-Id: I6e4ab938861e246ff44048077b94847e303f1859 Signed-off-by: Chad Horohoe <chadh@wikimedia.org>	2016-05-20 09:49:41 -07:00
Brian Wolff	7e4a134f49	SECURITY: Include quote characters in strip markers so esc in attr Strip markers get substituted for general html, which means the substitution text general does not escape quote characters. If someone can convince MW to put a strip marker in an attribute, you can get around escaping requirements that way. This patch adds the characters `"' to the strip marker text. At least one of these characters should be escaped inside attributes (regardless of what quote character you use for attributes), thus normal html escaping will deactivate the strip markers, preventing the vulnrability. This will break any extension that escapes input with htmlspecialchars, to add to html/half parsed html output, but assumes that strip markers are unmangled. I don't think its very common to do this. The primary example I found was some core usages of Xml::escapeTagsOnly(). (And even in that case, it only affected the corner case of being called via {{#tag:..}}) Based on MatmaRex's suggestion. Change-Id: If887065e12026530f36e5f35dd7ab0831d313561 Signed-off-by: Chad Horohoe <chadh@wikimedia.org>	2016-05-20 09:25:49 -07:00
Fomafix	796d62d034	Language: Introduce new method equals( Language $lang ) Use $lang->equals( $wgContLang ) instead of $lang->getCode() === $wgContLang->getCode() Change-Id: Id7ed6a21ce5e2ea2887ec98c7bd9d3eba83d733b	2016-05-16 22:33:33 +00:00
jenkins-bot	720c86a77b	Merge "Require strip marker names to not have & ' " < or > in them"	2016-05-13 21:49:26 +00:00
jenkins-bot	c641b2e80a	Merge "Add LinkCache::getSelectFields() and use it in a few places"	2016-05-13 19:48:31 +00:00
jenkins-bot	8088e40837	Merge "Warn when a restricted displaytitle is ignored"	2016-05-13 19:48:17 +00:00

1 2 3 4 5 ...

1457 commits