Statdisk User Manual
Statdisk User Manual
Page 2
Basics of STATDISK
You can perform all STATDISK functions from the Sample Editor Screen using the following menus:
File, Edit, Analysis, Data, Datasets, Window, and Help.
Along with performing statistical calculations, STATDISK is also compatible with many popular
application software packages. You can import, copy, paste, save, print and transform data sets. You
can also copy, paste, save, or print any of the STATDISK numerical or graphical outputs and export
them into other programs such as Microsoft Word. Those options are available as clickable buttons at
the bottom of the Sample Editor screen.
Page 3
Once the install package is unzipped, you will need to find the application program Statdisk. Unlike
other programs that need to be installed on your computer, STATDISK is just a file.
You might want to create shortcut for this file and place it on your desktop by right clicking with the
mouse and selecting Create Shortcut
When you open the STATDISK program (by
a double-click with your mouse on the above
file) you will see the screen shown here.
Click on the OK button to close the
STATDISK information screen.
Page 4
You can preview the datasets before you open them by going to Datasets and then Dataset Browser.
You can also access datasets that STATDISK has available online by going to Datasets and then Online
Datasets.
Page 5
Sort Data
To sort data, open the data tools and select Sort data. Select Sort a single column: and then use the
drop-down arrow to select the column of data values that you want to sort Then click on Sort. The data
values in that column will be sorted from lowest value to highest value.
Page 6
Page 7
Histogram
A visual display of a single set of data values can be shown by opening the Data menu and then
selecting Histogram.
Select the column that the data values are in. If you would like the STATDISK program to
automatically select the class width and the class start, select Auto-fit. You can display the count or the
frequency for each class. Click on Plot to display the graph.
Page 8
Boxplots
If you would like to compare two or more sets of data values you can plot them on
one graph by using boxplots. Open the Data menu and select Boxplot. Then
select the columns containing the data values that you would like to compare. You
can then select Boxplot to show a standard view of the boxplots or Modified
Boxplot which will emphasize outliers (see figure 11).
Page 9
Normal Distribution
STATDISK uses standard z scores, so first convert scores by using
Here is the STATDISK procedure for finding areas or values from a normal distribution.
1.
2.
3.
4.
5.
Select Analysis from the main menu at the top of the screen.
Select Probability Distributions from the subdirectory.
Select Normal Distributions.
Either enter a standard z score or enter the known cumulative area to the left of a z score.
Click on Evaluate.
For example, if you enter a z score of 1.23 in Step 4 above, the STATDISK display will be as shown
below.
This display shows that the area to the left of z = 1.23 is 0.890651, and the area to the right of z = 1.23 is
0.109349. You may ignore the reference to Table A-2, because that reference applies to books in the
Triola Statistics Series.
Page 10
Page 11
The chart below shows the standard normal distribution with Z-values along the bottom axis and the
area under the curve between the given Z-values and can be used for full or half increments of the
standard deviation. STATDISK will find the given values and any other values that are not shown on
the table.
Page 12
To find values that are not shown on the table, use STATDISK as follws:Open the Analysis menu and
then select Probability Distributions and then Normal Distribution. Enter your z-score into the box
for Z Value and then click on Evaluate. In this example, the z-score is -1 and the probability is 0.2419.
The output gives the discrete probability of seeing a z-score of -1 and is equal to .2419707. It also gives
the cumulative area to the left of -1 or .158655. If you add the areas to the left of -1 shown in the
standard normal distribution: 0.1% + -.5% + 1.7% + 4.4 % + 9.2% = 15.7% (or 0.157) you can see that
you get the same result.
If you put in any value between 0 and 1 representing the area to the left of a Z score and then press
Evaluate you will get the associated Z value.
Page 13
Confidence Intervals
To find a confidence interval for a sample statistic you do not need to type in any data values or have a
dataset in the Sample Editor. For example, to find a confidence interval for one-sample mean open up
the Analysis menu then select Confidence Intervals and then Mean-One Sample. The image below
shows the STATDISK output screen for a 95% confidence interval with a sample mean of 26.7, a
sample standard deviation of 4.1, and a
sample size of 40. The confidence
interval of 25.29 to 28.01 is given. The
Margin of error is the distance from the
mean to the upper value and the
distance from the mean to the lower
value of the confidence interval.
If you are given a set of data values and not given any of the sample statistics such as the mean and
standard deviation you must first use Descriptive Statistics to find the values needed to enter into the
Con. Int.: Mean window that is shown in Figure 15.
Page 14
Page 15
Shown below is the scatter diagram obtained when you click on "Plot" button. Note that the scatter diagram
also includes the graph of the best-fit line. We get to see just how good the "best" fit actually is. In this case,
there is a good fit because the data points are reasonably close to the best-fit line.
Multiple Regression
To generate a multiple regression equation you
first need to type data into the Sample Editor
or select an existing dataset. Open Datasets
and select Elementary Stats 9th Edition.
Open the Homes dataset. Select Analysis and
then Multiple Regression. Select columns 1,
3, and 8 to be included in the regression
analysis. Select 1 for the Dependent variable
column. Click on Evaluate to generate the
multiple regression statistics
Page 16
Additional Techniques
Chi-Square Goodness-of-Fit
To generate a Goodness-of-Fit test to determine if you have equal expected frequencies, you must first
type data into the Sample Editor or select an existing dataset. Lets imagine that a company wants to
know if auto accidents occur equally throughout the days of the week. Use the Clear button at the
bottem of the Sample Editor screen to erase any existing data. The number of accidents in our sample
data that occur each day of the week are as
M
T
W
TR
F
follows:
Type the data into List 1, then use the Edit
column titles option under the Data Tools
button at the bottem of the Sample Editor
screen to name the variable: Accidents
45
36
17
29
52
Page 17
Now select Analysis and then Goodness-offit. Chose Equal Expected Frequencies
since the company is testing to see if
accidents occur equally. Set the
significance level to 0.05 and select 1 as the
column to be the Observed Frequencies.
Click on Evaluate to generate the
Goodness-of-Fit test. The results are shown
in the output window to the right.
Press Plot to view a visual representation of the Chi-Square Distribution of the data. The graph shows
the Critical Value, X2 : 9.488 and the Test Statistic, X2: 20.860.
Page 18
Chocolate
Strawberry
Other
42%
33%
14%
11%
The University of Florida surveyed a sample size of n=250 students questioning their preferred ice
cream flavor. The observed data collected is shown in the table below.
Vanilla
Chocolate
Strawberry
Other
114
68
47
21
Page 19
Blue
Green
White
Male
21
17
44
Female
28
24
14
18
Page 20
Page 21