A Comparative Evaluation of Animation & Small Multiples for Trend Visualization on Mobile Phones
Matthew Brehmer * · Bongshin Lee · Petra Isenberg · Eun Kyoung Choe
mb@mattbrehmer.ca | @mattbrehmer · @bongshin · @dr_pi · @slowalpaca
mattbrehmer.ca / talks /
mobiletrendvis (slides) | tinyurl.com / mobiletrendvis
(pre-print)
IEEE VIS 2019 · Oct 23 2019 * Work conducted while affiliated with Microsoft Research, Now with Tableau Research
Good morning. Thank you for the introduction.
Before getting started I'd like to point out that while I'm now with Tableau Research, this work was carried out while I was a postdoc with Microsoft Research.
I should also mention that this work is a continuation of a broader research agenda relating to visualization on mobile devices which follows a CHI workshop last year and a Dagstuhl seminar this summer, and it is an area that I personally continue to be interested in my new role at Tableau.
Animation vs. Small Multiples on Mobile Phones
Crowdsourced Experiment : Animation vs. Small Multiples , between subjects Task : identify 1-3 targets from a set of 16 items.
In a nutshell, with this experiment we people to identify 1-3 targets among a set of 16 items in either an animated scatterplot or a static small multiples scatterplot where each item includes its trajectory trail.
And we asked people to do this on their mobile phones.
The Rise of the Data GIF
Sources (L → R): The Washington Post | Cornell Lab of Ornithology | Pew Research Center.
Curated collections of Data GIFs:
lenagroeger.com / datagifs | Lena Groeger (ProPublica, NICAR Tutorial 2017)
pinterest.com / jsvine / datagifs | Jeremy Singer-Vine (Buzzfeed)
Why did we do this experiment?
A few years ago we noticed that animated graphics were becoming more prevalent in online news and especially in social media,
The ones that interest me the most are those that make use of position change over time, such as in these examples.
Animation on Mobile, Small Multiples on Desktop?
Data viz solutions: small multiples on desktop, GIFs on yer phone! Brian Boyer (NPR News Graphics, 2015).
Question : Is there evidence to justify this allocation of designs to different device profiles?
The NPR News Graphics Team had promoted this design strategy of displaying small multiples on desktop and looped animated GIFs on phones,
But we wanted to know if there was evidence behind this allocation of designs to different device profiles.
Prior Work: Animation vs. Trails vs. Small Multiples
Effectiveness of Animation in Trend Visualization . G. Robertson, R. Fernandez, D. Fisher, B. Lee, and J. Stasko. IEEE TVCG (Proc InfoVis 2008).
Inspired by Hans Rosling's 2006-07 TED Talks | Test-of-Time Award recipient at InfoVis 2018.
Some of you may be reminded of research published over a decade ago
where George Robertson and colleagues
Were also intrigued animation,
Particularly after seeing the late public health expert Hans Rosling's amazing TED talks,
In which he narrates a story about shifting global health and economic metrics set to animated scatterplots.
So George and the other researchers conducted an experiment comparing animated and static scatterplots,
And their paper about the experiment was quite influential, earning a Test-of-Time award at InfoVis last year.
Prior Work: Animation vs. Trails vs. Small Multiples
Effectiveness of Animation in Trend Visualization . IEEE TVCG (Proc InfoVis 2008).
Lab Experiment :
2 Contexts (Analysis vs. Presentation) | 3 Design conditions | 2 Dataset sizes (large, small) | 24 Tasks
Notable results : Analysis performance higher with multiples; participants preferred animation.
Question : Will we arrive at similar results with mobile phones?
Their experiment considered two settings, a data analysis setting and a presentation setting with a narrator like to emulate Rosling's TED talks,
They had 3 design conditions (animation, trajectory trails, and small multiples with individual trajectory trails), a large and a small dataset, and 24 trajectory comparison tasks.
I won't go over all of their results, but in the data analysis setting, the study participants had better overall performance with small multiples,
Though interestingly, they preferred animation, finding it to be fun and engaging.
This was all of course done using a large display, and with experimenters observing the participants in a lab setting.
A Crowdsourced Experiment on Mobile Phones
A crowdsourced visualization evaluation study performed exclusively on phones .
Crowdsourced Experiment :
1 Context | 2 conditions: vs. | 1 Dataset size (16 items) | 9 Test Tasks (+ Training & QA Tasks)
In contrast, we conducted a crowdsourced experiment on mobile phones that borrows some of the elements of Robertson's experiment,
We did not consider a narrated presentation setting, and we only considered a small dataset of 16 items from the 2008 study,
And we adapted a subset of the 2008 study's experimental tasks, the ones that could be answered with a small dataset.
I'll also emphasize here that our study should not be seen as a reproduction of the 2008 study but a follow-on or sequel, one that incorporates aspects of its stimuli and experimental design.
Tasks *: Targets & Distractors
Tasks adapted from Robertson et al (IEEE TVCG 2008).
* These are simplified representations of our stimuli.
One of the things we had observed about the tasks used in the 2008 study was that they were fairly different from one another,
In that they asked participants to compare and select one to three targets exhibiting different trajectories,
And with differing starting and ending distributions.
So in this new experiment we wanted to examine the experimental tasks individually.
And that different participants saw different shuffled orders of these tasks.
I should also mention that these are simplified illustrations of the stimuli, not the stimuli themselves.
5-Stage Task Format
Show instruction & axes → 5s delay → Start → Select 1 - 3 countries → Done
This is what the task stimuli actually looked like on a phone.
Any individual participant saw either animation or small multiples,
For each task, we first showed the instruction and the pair of axes.
After that, the animation begins to loop in the animation condition.
In both conditions they have to select responses from a multiple choice list corresponding to alphabetically labeled points in the plot.
And when they select the correct number of responses, they tap done to complete the task and proceed to the next one.
96 Participants
Small Multiples (N = 45), Animation (N = 51)
We recruited participants from Mechanical Turk and after excluding participants who responded incorrectly to an obvious quality control task,
Our final participant counts were 45 in the Small Multiples condition and 51 in the Animation condition.
Animation or Small Multiples: Overall Results
Detailed results in paper: tinyurl.com / mobiletrendvis
= Faster
= More accurate
= Multiples
= Animation
Animation participants were slightly more confident.
I'm going to focus on some of the important take-away results, and there's more details in the paper
This first thing I'll highlight is that we found that despite the smaller screen size, those using the small multiples scatterplots were still faster in 7 of 9 tasks, with no difference in the remaining 2 tasks.
With respect to accuracy, it was a draw in a sense, in that 5 other tasks, there was no evidence for a difference in accuracy,
And of the remaining 4 tasks, Small Multiples prevailed in 2 and Animation had higher accuracy in the other 2.
And finally, we noted that the animation group participants felt slightly more confident than their small multiples counterparts.
Trajectory Reversals: Animation & SMs at Parity
Small Multiples | Animation | (no performance differences in these two tasks).
These tasks involve targets having trajectories that reverse course,
unlike the targets in the other 7 tasks.
Beyond the overall results, what's interesting is when we consider individual tasks,
And particularly those where there was no noticeable difference in completion time or accurary between the two groups,
As indicated by the 3rd and 4th columns here.
If you take a look at the task instructions and illustrations, you'll notice that both tasks involve targets exhibiting a trajectory reversal.
And these were the only two tasks where the targets exhibit this behavior.
These reversals may be fairly apparent using animation,
While in a static small multiple design, the trajectory may be obscured since the trail might occlude itself.
Vertical or Long Trajectories: A Case for SMs
Small Multiples (more accurate in these two tasks) | Animation
These tasks involve comparing ∆Ys: a ∆Y with no ∆X (top) or the 2 largest ∆Y values (bottom).
Similarly if we consider the two tasks were the small multiples exhibited higher accuracy,
Both tasks involve comparisons over delta Y
Either a straight downward trajectory or an absence of movement along the X axis in the case of the top Task
Or the largest decreases over the Y axis in the case of bottom task.
In our small multiples design, this means that you have to look for a downward trajectory or a large vertical distance along a trajectory,
Visual comparisons can be done at a glance
While with animation, these angle and distance comparisons requires a reliance on memory.
Direction of Motion Outliers: A Case for Animation
Small Multiples | Animation (more accurate in these two tasks)
These tasks involve assessing both ∆Y and ∆X: target trajectories quite different from distractors.
Finally, there was two tasks where animation had higher accuracy and comparable completion times relative to small multiples,
Where it would appear as though the target items have outlier trajectories in terms of the direction of motion,
Where all the distractors are moving up in the first task, the targets are moving to the right.
And in the case of the second task shown here, the target is moving to the left while the distractors are moving to the right or almost vertically
Conclusions: Animation or Small Multiples ?
Surprisingly, Small Multiples are still viable on small displays.
Follow-up work should examine distance and angle comparisons.
Animation and Small Multiples are comparably accurate in > 50% of tasks.
Follow-up work should examine outlier trajectories and reversals .
aka.ms / multiples | (mobile only) experimental app.
github.com / Microsoft / MobileTrendVis | open source app and analysis.
tinyurl.com / mobiletrendvis (pre-print) | mattbrehmer.ca / talks /
mobiletrendvis (slides)
mb@mattbrehmer.ca | @mattbrehmer · @bongshin · @dr_pi · @slowalpaca
Ultimately, we were surprised that small multiples remains to be a viable design option for small mobile phone displays.
And that they may be well-suited if the viewer's task requires detecting or noticing trajectory lengths or angles, and researchers should examine this further
Meanwhile, animation also seems to have circumstances in which it is suited,
Such as when there are trajectory outliers or reversals.
But this too is deserving of further research with more tasks and trials,
Likely with synthetic data, as we were using a subset of the real socioeconomic metric data from the United Nations used in Robertson and colleagues' 2008 study.
CLICK! I now welcome your questions and comments and I'll point you to our study application, its source code, our study results and analyses, our pre-print, and these slides.
Thank you,