0% found this document useful (0 votes)
29 views5 pages

Microunit 4

Uploaded by

Bhavya Sree
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
29 views5 pages

Microunit 4

Uploaded by

Bhavya Sree
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

DEPARTMENT OF COMPUTER SCIENCE

DATA ANALYSIS USING MS EXCEL & SPSS

1. Estimation of Mean, Median, and Mode

- Mean: The average value of a dataset (sum of all values divided by the number of values).

- Median: The middle value in a dataset when arranged in ascending order.

- Mode: The value that appears most frequently in a dataset.

In MS Excel

- Mean:

- Formula: `=AVERAGE(range)`

- Example: `=AVERAGE(A1:A10)` calculates the mean of values in cells A1 to A10.

- Median:

- Formula: `=MEDIAN(range)`

- Example: `=MEDIAN(A1:A10)` calculates the median of values in cells A1 to A10.

- Mode:

- Formula: `=[Link](range)` (for a single mode)

- Example: `=[Link](A1:A10)` returns the most frequent value in the range A1 to A10.

In SPSS

1. Go to Analyze → Descriptive Statistics → Frequencies.

2. Select the variables you want to analyze.

3. Click Statistics and check Mean, Median, and Mode.

4. Click OK to view the output.


2. Standard Deviation and Coefficient of Variation

- Standard Deviation (SD): A measure of how spread out the numbers in a dataset are from the mean.
A low SD means data points are close to the mean; a high SD means data is more spread out.

- Coefficient of Variation (CV): The ratio of the standard deviation to the mean, expressed as a
percentage. It shows the extent of variability relative to the mean.

In MS Excel

- Standard Deviation:

- Formula: `=STDEV.S(range)`

- Example: `=STDEV.S(A1:A10)` calculates the standard deviation for values in A1 to A10.

- Coefficient of Variation:

- Formula: `=STDEV.S(range)/AVERAGE(range)*100`

- Example: `=STDEV.S(A1:A10)/AVERAGE(A1:A10)*100` gives the coefficient of variation.

3. Descriptive Statistics

- Descriptive statistics summarize data in a meaningful way. Examples include mean, median, mode,
range, standard deviation, variance, etc.

In MS Excel

1. Go to Data → Data Analysis (If you don't see this, enable the Analysis Toolpak in Excel Options).

2. Select Descriptive Statistics and click OK.

3. Select the data range for Input Range.

4. Check the box for Summary Statistics.

5. Click OK to view mean, median, mode, SD, and other descriptive stats.

In SPSS

1. Go to Analyze → Descriptive Statistics → Descriptives.

2. Select variables.

3. Click Options to choose additional statistics like mean, standard deviation, and range.

4. Click OK to generate a descriptive statistics table.


4. Parametric Tests

Parametric tests assume the data follows a normal distribution.

- T-Test: Compares means between two groups.

- Paired T-Test: Used when the two groups are related (e.g., before and after a treatment on the
same group).

- Unpaired (Independent) T-Test: Used when comparing two unrelated groups.

- ANOVA (Analysis of Variance): Tests differences among group means in a sample.

- One-Way ANOVA: Compares means of three or more unrelated groups for one variable.

- Two-Way ANOVA: Compares means based on two different factors.

- Pearson Rank Correlation: Measures the linear relationship between two continuous variables.

- Linear Regression: A statistical method to model the relationship between two variables by fitting a
linear equation to the data.

T-Test (Paired or Unpaired)

- In MS Excel:

1. Go to Data → Data Analysis → t-Test (Choose either Paired or Two-Sample Assuming


Equal/Unequal Variances).

2. Select the data ranges for the two groups.

3. Set the significance level (usually 0.05).

4. Click OK to view results.

- In SPSS:

1. For paired t-test: Analyze → Compare Means → Paired-Samples T Test.

- Select the paired variables and click OK.

2. For unpaired t-test: Analyze → Compare Means → Independent-Samples T Test.

- Define groups and click OK.

ANOVA (One-Way, Two-Way)

- In MS Excel:

1. Go to Data → Data Analysis → ANOVA (One-way or Two-way).

2. Input ranges for data.

3. Click OK to view results


- In SPSS:

1. For One-Way ANOVA: Analyze → Compare Means → One-Way ANOVA.

- Define dependent variable and factor, click OK.

2. For Two-Way ANOVA: Analyze → General Linear Model → Univariate.

- Define dependent and independent variables, click OK.

Pearson Correlation

- In MS Excel:

1. Formula: `=CORREL(range1, range2)`.

2. Example: `=CORREL(A1:A10, B1:B10)` calculates Pearson correlation between the two data sets.

- In SPSS:

1. Go to Analyze → Correlate → Bivariate.

2. Select variables.

3. Choose Pearson and click OK to view correlation.

Linear Regression

- In MS Excel:

1. Go to Data → Data Analysis → Regression.

2. Select the dependent and independent variables.

3. Click OK to generate the regression output.

- In SPSS:

1. Go to Analyze → Regression → Linear.

2. Define the dependent and independent variables.

3. Click OK to generate the regression results.

5. Non-Parametric Tests

Non-parametric tests are used when data doesn't follow a normal distribution.

- Mann-Whitney U Test: Compares differences between two independent groups when the
assumption of normality is not met.
- Wilcoxon Signed Rank Test: Compares two related samples to assess whether their population mean
ranks differ.

- Kruskal-Wallis Test: A non-parametric alternative to one-way ANOVA; compares more than two
independent groups.

- Chi-Square Test: Tests the relationship between categorical variables to see if they are independent
of each other.

5. Non-Parametric Tests

Mann-Whitney U Test

- In SPSS:

1. Go to Analyze → Nonparametric Tests → Legacy Dialogs → 2 Independent Samples.

2. Select the variable and group, choose Mann-Whitney U test, click OK.

Wilcoxon Signed Rank Test

- In SPSS:

1. Go to Analyze → Nonparametric Tests → Legacy Dialogs → 2 Related Samples.

2. Select paired variables, choose Wilcoxon test, click OK.

Kruskal-Wallis Test

- In SPSS:

1. Go to Analyze → Nonparametric Tests → Legacy Dialogs → K Independent Samples.

2. Select the test variable and group, choose Kruskal-Wallis, click OK.

Chi-Square Test

- In MS Excel:

1. Use the formula `=[Link](actual_range, expected_range)`.

2. Example: `=[Link](A1:A10, B1:B10)`.

- In SPSS:

1. Go to Analyze → Descriptive Statistics → Crosstabs.

2. Select the row and column variables.

3. Click Statistics, choose Chi-Square, and click OK to view the result.

You might also like