Quantum Flow Engineering Newsletter #11
The basic idea is that we want to process events in a different order depending on their type, without changing the relative ordering of events of the same type. This will allow Gecko to serve requests to paint pages and handle user input events first, then process other things like scripts, DOM events, timer callbacks, network events, etc., and finally, when there is no other immediate work of higher priority to be done, run low priority work such as garbage and cycle collection and idle callbacks. This is now mostly finished (except for high priority input events), and we have already started porting a few things to run as idle callbacks! This part of Quantum DOM will hopefully be part of Firefox 55.
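To make the ordering rule concrete, here is a minimal sketch of a queue that dispatches by priority while keeping events of the same priority in arrival order. It is not Gecko's implementation: the `Priority` levels, the `PrioritizedEventQueue` class and the event names are all hypothetical, and a real event loop would also need to guard against starving lower priorities.

```typescript
// Hypothetical priority levels; the names are illustrative, not Gecko's.
enum Priority {
  High = 0,   // e.g. painting and user input
  Normal = 1, // e.g. scripts, DOM events, timers, network events
  Idle = 2,   // e.g. garbage collection and idle callbacks
}

interface QueuedEvent {
  name: string;
  priority: Priority;
}

// Order in which the per-priority queues are consulted.
const DISPATCH_ORDER: Priority[] = [Priority.High, Priority.Normal, Priority.Idle];

class PrioritizedEventQueue {
  // One FIFO queue per priority, so events of the same priority never reorder.
  private readonly queues: Record<Priority, QueuedEvent[]> = {
    [Priority.High]: [],
    [Priority.Normal]: [],
    [Priority.Idle]: [],
  };

  enqueue(event: QueuedEvent): void {
    this.queues[event.priority].push(event);
  }

  // Returns the oldest event of the highest non-empty priority, if any.
  dequeue(): QueuedEvent | undefined {
    for (const priority of DISPATCH_ORDER) {
      const next = this.queues[priority].shift();
      if (next !== undefined) {
        return next;
      }
    }
    return undefined;
  }
}

// Drains the queue and records the order in which events were processed.
function drain(queue: PrioritizedEventQueue): string[] {
  const order: string[] = [];
  let event = queue.dequeue();
  while (event !== undefined) {
    order.push(event.name);
    event = queue.dequeue();
  }
  return order;
}

const queue = new PrioritizedEventQueue();
queue.enqueue({ name: "timer-1", priority: Priority.Normal });
queue.enqueue({ name: "gc", priority: Priority.Idle });
queue.enqueue({ name: "paint-1", priority: Priority.High });
queue.enqueue({ name: "timer-2", priority: Priority.Normal });
queue.enqueue({ name: "paint-2", priority: Priority.High });

const processed = drain(queue);
console.log(processed.join(", ")); // paint-1, paint-2, timer-1, timer-2, gc
```

Keeping a separate FIFO per priority is what preserves same-type ordering: paint-1 still runs before paint-2 and timer-1 before timer-2, even though the paint events jump ahead of the timers.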
Michael Layzell removed the PPrinting::ShowProgress synchronous IPC message. It’s great to see the whitelist of sync IPC messages shrinking! He also landed a telemetry probe for measuring how long we spend processing synchronous IPC messages. This will be helpful for those remaining sync IPC messages where we’re wondering whether the processing side of the IPC message is taking a long time to finish or whether the overhead of the message dispatch is the slowing factor.
Shih-Chiang Chien lazified the loading of UserAgentOverrides.jsm until the first network connection is made in order to improve startup speed.
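As a rough illustration of this kind of lazification, the sketch below defers an expensive load until the first time its result is needed. The `lazy` helper, the `onFirstConnection` function and the override table are hypothetical stand-ins, not the actual Gecko patch or the UserAgentOverrides.jsm API.

```typescript
// Wraps an expensive loader so it runs at most once, on first use.
function lazy<T>(load: () => T): () => T {
  let loaded = false;
  let value: T | undefined;
  return () => {
    if (!loaded) {
      value = load();
      loaded = true;
    }
    return value as T;
  };
}

let loadCount = 0;

// Hypothetical stand-in for an expensive module load.
const getOverrides = lazy(() => {
  loadCount += 1;
  return new Map<string, string>([["example.com", "HypotheticalAgent/1.0"]]);
});

// Hypothetical hook invoked when a network connection is about to be made.
function onFirstConnection(host: string): string | undefined {
  return getOverrides().get(host);
}

// Nothing has been loaded during "startup".
const loadsBeforeFirstConnection = loadCount;

const override = onFirstConnection("example.com");
onFirstConnection("example.org"); // reuses the already loaded table

console.log(loadsBeforeFirstConnection, loadCount, override);
```

The startup path pays nothing for the table here; the cost moves to the first connection, and later connections reuse the cached value.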
André Bargull improved the performance of initializing default JS Intl objects. He also made some improvements to the performance of RegExp.prototype[@@split] and RegExp.prototype[@@replace].
Boris Zbarsky made calling Element.scrollTop = 0; a lot cheaper by avoiding an unnecessary synchronous layout flush.
Dão Gottwald changed the tab strip scrolling code to flush once instead of once per ‘scroll’ event. He also removed a layout flush that used to occur when clicking on the “List all tabs” button in the tab bar.
Alessio Placitelli ensured Telemetry doesn’t initialize the search service before first paint.
Felipe Gomes made us skip nsURLClassifier initialization for about: loads, causing a start-up perf improvement.
Ting-Yu Chou made it far less expensive to clean up closed tabs and windows.
Will Wang reduced the cost of initializing SessionCookies for improved start-up time.
Mats Palmgren improved our reflow performance on pastebin.com. He also devirtualized various nsIFrame members (including nsIFrame::IsLeaf(), which took a lot of effort), which contributed some Speedometer improvements.
David Baron made the hidden window inactive by default, which should help improve start-up time.
Marco Bonardo made it so that the favicon database isn’t opened on startup to check for corruption, which should help improve start-up time for existing profiles.
Andreas Farre exposed requestIdleCallback to non-DOM JS execution contexts! This will help our front-end engineers schedule main thread jobs using a more intelligent mechanism. He also, with some help from Olli Pettay, ensured that idle callbacks do not run when we are about to fire a timer. This is important to ensure that those callbacks cannot interfere with higher priority timers that may be about to fire. He also enabled throttling of timeouts scheduled by tracking scripts by default. This should help reduce the overhead of pages left open in background tabs.
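The rule that idle callbacks should not run when a timer is about to fire can be sketched as a small scheduling decision. This is only an illustration under stated assumptions, not Gecko's actual logic: `decideIdle`, `MIN_IDLE_BUDGET_MS` and the 50 ms default cap on an idle period are hypothetical choices made for the example.

```typescript
// Hypothetical threshold: the shortest quiet period worth handing to idle work.
const MIN_IDLE_BUDGET_MS = 3;

interface IdleDecision {
  run: boolean;     // whether an idle callback should run now
  budgetMs: number; // how long it may run before it must yield
}

// Decides whether idle work may run, given the current time and the time at
// which the next timer is due (null when no timer is pending).
function decideIdle(
  nowMs: number,
  nextTimerMs: number | null,
  maxBudgetMs: number = 50, // assumed upper bound on a single idle period
): IdleDecision {
  const untilTimerMs = nextTimerMs === null ? Infinity : nextTimerMs - nowMs;
  // Never hand out a negative budget, and never more than the cap.
  const budgetMs = Math.max(0, Math.min(maxBudgetMs, untilTimerMs));
  return { run: budgetMs >= MIN_IDLE_BUDGET_MS, budgetMs };
}

// A timer due in 1 ms: skip idle work so the timer is not delayed.
const imminent = decideIdle(100, 101);
// A timer due in 10 ms: idle work may run, but only for 10 ms.
const soon = decideIdle(100, 110);
// No pending timer: idle work gets the full capped budget.
const quiet = decideIdle(100, null);

console.log(imminent, soon, quiet);
```

Capping the budget at the time remaining until the next timer is what keeps lower priority idle work from pushing back a timer that is about to fire.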
Morris Tseng made table frames get their own display items.
Mike Conley got rid of some unnecessary focus changes during tab switching.
Nihanth Subramanya moved the initialization of Captive Portal detection so that it no longer blocks first paint, which should help to improve start-up time.
David Keeler disabled OCSP verification for DV TLS certificates on Nightly. Currently Firefox is the only major browser that supports OCSP verification by contacting the CA’s OCSP server during the TLS handshake, and our telemetry data suggests that this is a major source of slowness during the initial TLS handshake.
Nicholas Nethercote reduced the locking overhead introduced by the Gecko Profiler macros used for profiler labels and such, which makes these macros cheaper even when the profiler isn’t running.
Jan de Mooij generalized baseline JIT type update stubs, allowing us to deal better with polymorphic code, and made type monitor stubs work with unknown objects/values. He also ensured most RegExp objects are allocated in the incremental GC nursery to avoid requiring a full GC to collect them, and made Array.prototype.splice() be O(1).
Felipe Gomes made Flash click to play by default! I have lost count of how many years this has been in the works!
Kris Maglione took his off-thread script decoding infrastructure designed to improve startup performance to a new level! You may remember this setup was introduced just recently, and now Kris taught it how to decode multiple scripts at a time, to save the cost of off-thread script decoding setup per script. It should be obvious in the graph below from ts_paint measurements when the patches landed!