Exploratory Data Analysis
On the Stats tab, in the Exploratory Data Analysis group:
-
Click Quick Summary to generate a quick statistical summary and a quantile report for the data in a file.
-
Click Histogram to calculate and display distribution statistics for a numeric field in a file. You can display histograms, cumulative frequency curves, or probability plots, using actual values or Ln (Natural Log) transformed values.
-
Select Histogram (Multivariate) to calculate and display distribution statistics for the numeric fields in a file, and show multiple variables on separate graphs.
-
Click Box and Whisker to visually summarise and compare the distribution of values of multiple variables. Any number of variables may be used. They must all be fields in the same file, but there is no need to keep the integrity of individual rows. A Box and Whisker chart includes options for highlighting outliers, showing the drift between individual means, and displaying confidence bands. Confidence bands refer to the confidence of the mean, not the data set.
-
Click Scattergrams to calculate correlation statistics for any variables in a file. Both simple linear (when there is only one X-axis field) and multiple linear (when there is more than one X-axis field) regressions can be calculated and displayed.
Different (exponential, logarithmic, polynomial, power and moving average) regression types are supported whenever there is only one X-field. Optionally, the regression line and its confidence bands may be displayed and colour coded. Confidence bands refer to the confidence of the mean, not the data set.
-
Click Q-Q Plot to create a Quantile-Quantile (Q-Q) Chart or a Probability-Probability (P-P) Chart.
-
Click Show Data Ranges to check the minimum and maximum values for all numeric fields in a file.
-
Click Variance-Covariance to display variance-covariance and correlation matrices for multivariate data. To use this function, specify the name of the File and the Field(s) to calculate, specify allowable Minimum and Maximum values, and apply an on-the-fly upper Cut value.