Chart | Histogram

The Chart | Histogram window comprises two parts: the Graph and the Properties window. The Graph comprises three parts:

  1. The raw data, displayed using the input colour
  2. The standard normal curve of a theoretical sample with the same mean and standard deviation as the raw data, displayed using the distribution colour, and
  3. Vertical lines at the mean (M) and standard deviations (S, 2S) either side of the mean, displayed using the band colour.

When you generate a chart, many tools and options are common to most charts. See: Chart Tools

Visually, the Histogram graph allows you to determine much about your data:

  • Is it normally distributed? In other words, how well does it fit the bell-shaped standard normal curve?
  • If the fit is poor, can you describe the discrepancies in terms of Skewness and Kurtosis, multiple peaks or anomalous regions?

Tools on the Chart | Histogram tab, in the Histogram group, allow you to visualise and quantify the answers to many of these questions:

  • Click the 3 Parameter Log Normal button to estimate the additive constant for a 3-parameter log-normal distribution.
  • Select an option from the Decompose tool menu:
    • Select Decompose to decompose (divide) the data into a mixture of normal distributions. See: Decomposition
    • Select Load to load a saved decomposition.
    • Select Edit to edit the current decomposition.
    • Select Clear to clear the current decomposition from the chart.
  • Click the Percentile button to display a data value corresponding to a given percentile. Specify a colour for the line that will be drawn on the chart.
  • Toggle Multi-chart layout mode on and off. When this mode is active, each variable is shown on a separate graph.

Display mode

Choose a Display mode for the distribution:

  • HISTOGRAM  Displays a bar chart. The width of each bar is equal to the bin size you set. The height of each bar is proportional to the number of values in each bin.
  • CUM FREQUENCY  An alternative way to visually analyse the statistical distribution is to display a line chart that shows the amount of data less than or equal to the bin value.
  • PROBABILITY PLOT  From the X axis you can read the probability of observations less than the specific value defined on the Y axis.

The probability plot uses a non-uniform scale that starts with a probability of 0.01 and goes to a probability of 0.9999. On this graph, if the Scale is set to Linear, a normal distribution will appear as a straight line. If the Scale is set to Natural Log, the graph of a log-normal (Ln) distribution will be a straight line.

Load a Form Set

To display an existing form set in a new chart:

  1. Click the Forms button at right of the form and choose the form set.
  2. Click OK to apply the settings to the new chart.

    Errors at this point will typically refer to a new chart feature with no equivalent in the old graph. To assist you, Micromine will highlight the offending form response once you read and dismiss the error message.

To display multiple copies of the same chart:

  • On the Stats tab, in the Exploratory Data Analysis group, select Histogram (Multivariate) to calculate and display distribution statistics for the numeric fields in a file, and show multiple variables on separate graphs.

Change the Graph Minimum and Maximum

The Histogram function uses the true minimum and maximum for charts whenever the natural log transformation is used. Previous versions used padded values that often reached a minimum of zero, creating very long left-hand histogram tails. The new method avoids the need for manual adjustment.

You may need to recalculate the minimum and maximum graph values to produce a correct histogram from older (earlier version) form sets, but generally only for those incorporating the natural log transformation.

To change the graph minimum/maximum:

  1. Click the Form button at the far left of the Chart tab.

  2. Switch to the Graph Limits tab and then click the calculator icons at right of the Minimum and Maximum values (don’t click the big Calculate button yet!).

You should not have to change the graph minimum or maximum of existing forms incorporating normally-distributed data.

The histogram is unique because it includes options for setting the graph limits and axes ranges. You use Graph Limits to limit the data used in the chart, and Axis Range to configure the visible axis limits of the graph.

Calculate the Bin Size

We recommend using around 50 bins for any histogram that is intended for statistical decomposition. You can use the Calculate button on the Graph Limits tab to determine the bin size from the number of bins you specify and the graph minimum and maximum values.

To calculate the bin size:

  1. Ensure the Minimum and Maximum values are correct.
  2. Click the Calculate button and enter the Number of bins.
  3. Click OK to apply the calculation, and note the Bin size value.
  4. Optionally round the bin size to a simpler number.