Correlation is a powerful measure of relationships assisting in estimating trends and making forecasts. Its use is widespread, being a critical data analysis component of fields including science, engineering, and business. Unfortunately, visualization methods used to identify and estimate correlation are designed to be general, supporting many visualization tasks. Due in large part to their generality, they do not provide the most efficient interface, in terms of speed and accuracy for correlation identifying. To address this shortcoming, we first propose a new correlation task-specific visual design called Correlation Coordinate Plots (CCPs). CCPs transform data into a powerful coordinate system for estimating the direction and strength of correlation. To extend the functionality of this approach to multiple attribute datasets, we propose two approaches. The first design is the Snowflake Visualization, a focus+context layout for exploring all pairwise correlations. The second design enhances the CCP by using principal component analysis to project multiple attributes. We validate CCP by applying it to real-world data sets and test its performance in correlation-specific tasks through an extensive user study that showed improvement in both accuracy and speed of correlation identification.
Correlation Coordinate Plots: Efficient Layouts for Correlation Tasks
H Nguyen, P Rosen
International Joint Conference on Computer Vision, Imaging and Computer Graphics