DA Practical Lab 02 Statistical Functions

Uploaded by

himanshut.aids22

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

DA Practical Lab 02 Statistical Functions

Uploaded by

himanshut.aids22

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

S. B.

JAIN INSTITUTE OF TECHNOLOGY, MANAGEMENT

& RESEARCH, NAGPUR.

Practical No. 2

To Perform different statistical function used in Data Analytics.

Name of Student:
Roll No.:
Semester/Year:
Academic Session:
Date of Performance:
Date of Submission:
• AIM : To Perform different statistical function used in Data Analytics. •
Task:-
Apply different statistical function on given datasets used in Data Analytics.
 Create a dataset Employee with attribute
(‘name’,’age’,’salary’,’BMI’,’year of expirence’,weight’,’height’).
Perform mean, mode, median, Standard Deviation by taking axis =0
& axis=1.
 Upload any one of inbuild dataset from ‘Heart Disease, Cancer’,
‘Diabetes’,’Iris’.
Compute mean, mode, median, Standard deviation and Variance by
taking axis=0 & axis=1.

OBJECTIVES:

• To apply statistical analysis and technologies on data to find trends and solve
problems.

• To cover the variance in Python and how to calculate the variability for a set of
values..

THEORY:

• Statistic in general, is the method of collection of data, tabulation, and

interpretation of numerical data. It is an area of applied mathematics
concerned with data collection analysis, interpretation, and presentation. With
statistics, we can see how data can be used to solve complex problems.
Descriptive Statistics: -

Descriptive statistics generally means describing the data with the help of some
representative methods like charts, tables, Excel files, etc. The data is described in such a way
that it can express some meaningful information that can also be used to find some future
trends. Describing and summarizing a single variable is called univariate analysis. Describing
a statistical relationship between two variables is called bivariate analysis. Describing the
statistical relationship between multiple variables is called multivariate analysis. There are
two types of Descriptive Statistics:

• 1) The measure of central tendency

• 2) Measure of variability

Descriptive statistics summarizes the data and are broken down into measures of
central tendency (mean, median, and mode) and measures of variability
(Variance, standard deviation, range,).
The measure of central tendency
1) mean()
It is the sum of observation divided by the total number of observations. It is also defined
as average which is the sum divided by count..

>>> nums=[1,2,3,5,7,9]
>>> np..mean(nums)

2) mode()

It is the value that has the highest frequency in the given data set. The data set may have
no mode if the frequency of all data points is the same. Also, we can have more than one
mode if we encounter two or more data points having the same frequency. >>> from
scipy import stats
>>> nums=[1,2,3,5,7,9,7,2,7,6]
>>> stats.mode(nums)

3) median()

Median:
It is the middle value of the data set. It splits the data into two halves. If the number of
elements in the data set is odd then the Centre element is median and if it is even then the
median would be the average of two central elements.
median_low()

When the data is of an even length, this provides us the low median of the data. Otherwise,
it returns the middle value.

>>> st.median_low([1,2,4])
a) median_high()
Like median_low, this returns the high median when the data is of an even length.
Otherwise, it returns the middle value.

>>> st.median_high([1,2,4])

>>> st.median_high([1,2,3,4])
Measure of variability:
Measure of variability is known as the spread of data or how well is our data is distributed.
The most common variability measures are:
• Range

• Variance

• Standard deviation
2. The range describes the difference between the largest and smallest data point in our
data set. The bigger the range, the more is the spread of data and vice versa.
Variance ()
It is defined as an average squared deviation from the mean. It is being calculated by finding
the difference between every data point and the average which is also known as the mean,
squaring them, adding all of them and then dividing by the number of data points present in
our data set. In statistics, variance is a measure of how spread out a dataset is. It calculates the
average of the squared differences from the mean of the dataset. Note that the stats.variance()
function calculates the sample variance, which uses the denominator n - 1 instead of n to
adjust for bias in the estimate of the population variance.

Standard Deviation:
It is defined as the square root of the variance. It is being calculated by finding the Mean,
then subtract each number from the Mean which is also known as average and square the
result. Adding all the values and then divide by the no of terms followed the square root.
CONCLUSION:

DISCUSSION AND VIVA VOCE:

• What are the measures used for central tendency?

• What is standard deviation?

• What are the measure of variability?

REFERENCE:
1. https://www.javatpoint.com/
2. https://www.datacamp.com/tracks/data-analyst-with-python/
3. https://data-flair.training/blogs/python-descriptive-statistics/
4. “"Python for Data Analysis - Data Wrangling with Pandas, NumPy, and IPython'' by
Wes McKinney was published by O'Reilly Media, Inc. 2011,”

Measures of Central Tendency
90% (10)
Measures of Central Tendency
22 pages
Republic of The Philippines: Notre Dame of Dulawan, Inc
100% (5)
Republic of The Philippines: Notre Dame of Dulawan, Inc
1 page
ML Lab Final R22
No ratings yet
ML Lab Final R22
67 pages
DS Chapter - 2
No ratings yet
DS Chapter - 2
73 pages
program-1_
No ratings yet
program-1_
15 pages
Exp2 Me
No ratings yet
Exp2 Me
3 pages
Module3
No ratings yet
Module3
54 pages
Statistics
No ratings yet
Statistics
21 pages
Statistical Analysis_ Descriptive Stat (2)
No ratings yet
Statistical Analysis_ Descriptive Stat (2)
6 pages
Statistical Foundations for Psychology
From Everand
Statistical Foundations for Psychology
James C. Ware
No ratings yet
Data Mining Lab Maual Through Python 031023
No ratings yet
Data Mining Lab Maual Through Python 031023
22 pages
ASSIGNMEN4
100% (1)
ASSIGNMEN4
15 pages
Statistics and Its Types(v1.0)
No ratings yet
Statistics and Its Types(v1.0)
6 pages
Statistics, Statistical Modelling & Data Analytics
No ratings yet
Statistics, Statistical Modelling & Data Analytics
68 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
5 pages
DSBDL Asg 3 Write Up
No ratings yet
DSBDL Asg 3 Write Up
6 pages
Data Analysis and Visualization EDA
No ratings yet
Data Analysis and Visualization EDA
51 pages
Statistics_Compendium_DMS IIT DELHI_2025
No ratings yet
Statistics_Compendium_DMS IIT DELHI_2025
18 pages
Session 1 On Descriptive Statistics
No ratings yet
Session 1 On Descriptive Statistics
24 pages
Statistics
No ratings yet
Statistics
152 pages
chapter2-statistical analysis
No ratings yet
chapter2-statistical analysis
86 pages
Shubh Am
No ratings yet
Shubh Am
70 pages
Statistics Assignment Chinar Dawod Ozair
100% (1)
Statistics Assignment Chinar Dawod Ozair
12 pages
Module 3 - Branches of Statistics (1)
No ratings yet
Module 3 - Branches of Statistics (1)
50 pages
8409 Statistics
No ratings yet
8409 Statistics
17 pages
Unit 2 1
No ratings yet
Unit 2 1
54 pages
02 Exploratory Data Analytics
No ratings yet
02 Exploratory Data Analytics
41 pages
Descriptive Statistic
No ratings yet
Descriptive Statistic
3 pages
Iba Unit - Ii
No ratings yet
Iba Unit - Ii
31 pages
Assignment No 3
No ratings yet
Assignment No 3
16 pages
Probability and Statistics Notes
No ratings yet
Probability and Statistics Notes
38 pages
parc6
No ratings yet
parc6
3 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
13 pages
Statistics[1]
No ratings yet
Statistics[1]
152 pages
Descriptive
No ratings yet
Descriptive
13 pages
HNS 2321 BIOSTATISTICS LECTURE 3 AND 4 DESCRITIVE STATISTICS
No ratings yet
HNS 2321 BIOSTATISTICS LECTURE 3 AND 4 DESCRITIVE STATISTICS
36 pages
S2.Measures of Central Tendency and Variability, Data Visualization
No ratings yet
S2.Measures of Central Tendency and Variability, Data Visualization
17 pages
Tian Statistics Lesson 3 Descriptive Statistics
No ratings yet
Tian Statistics Lesson 3 Descriptive Statistics
64 pages
Data science-Unit-3-Complete
No ratings yet
Data science-Unit-3-Complete
33 pages
Full Statistics
No ratings yet
Full Statistics
108 pages
Data Mining and Predictive Modelling Assignment
No ratings yet
Data Mining and Predictive Modelling Assignment
34 pages
Lecture 3 & 4 Describing Data Numerical Measures
No ratings yet
Lecture 3 & 4 Describing Data Numerical Measures
24 pages
FDS CH 2
No ratings yet
FDS CH 2
2 pages
Business Statstics Complete
No ratings yet
Business Statstics Complete
13 pages
Maths
No ratings yet
Maths
30 pages
Quantitative Data Analysis Thru Descriptive Statistics
No ratings yet
Quantitative Data Analysis Thru Descriptive Statistics
6 pages
Session3
No ratings yet
Session3
11 pages
Decriptive Statistics in Data Science
No ratings yet
Decriptive Statistics in Data Science
9 pages
M6 - Basic Statistics
No ratings yet
M6 - Basic Statistics
66 pages
It B.tech II Year II Sem DV (R18a0555)
No ratings yet
It B.tech II Year II Sem DV (R18a0555)
73 pages
Angilan, Ef
No ratings yet
Angilan, Ef
5 pages
Parameter Statistic Parameter Population Characteristic Statistic Sample Characteristic
No ratings yet
Parameter Statistic Parameter Population Characteristic Statistic Sample Characteristic
9 pages
Emgt 512 SP 2024
No ratings yet
Emgt 512 SP 2024
156 pages
Introduction To Statistics
No ratings yet
Introduction To Statistics
35 pages
Exp-3
No ratings yet
Exp-3
16 pages
Measures of Variation, Quartiles and Percentiles, Skewness and Kurtosis
No ratings yet
Measures of Variation, Quartiles and Percentiles, Skewness and Kurtosis
16 pages
PR2 Lesson 6 Data Analysis Using
No ratings yet
PR2 Lesson 6 Data Analysis Using
30 pages
ge8 statistics
No ratings yet
ge8 statistics
2 pages
Presentation 4
No ratings yet
Presentation 4
29 pages
Introduction To Statistics
No ratings yet
Introduction To Statistics
24 pages
Document (8)
No ratings yet
Document (8)
5 pages
Business Statistics: Session 2
No ratings yet
Business Statistics: Session 2
60 pages
Lesson 1 Basic Concepts of Statistics
No ratings yet
Lesson 1 Basic Concepts of Statistics
9 pages
Cbsnews Post-speech 20250304
100% (3)
Cbsnews Post-speech 20250304
3 pages
3 Math 154-1 Module 1 Measures of Describing Data
No ratings yet
3 Math 154-1 Module 1 Measures of Describing Data
31 pages
Analisis Andra
No ratings yet
Analisis Andra
2 pages
testing of hypothesis
No ratings yet
testing of hypothesis
52 pages
Pearson Product Moment Correlation Coefficient
No ratings yet
Pearson Product Moment Correlation Coefficient
2 pages
Traffic Data Analysis
No ratings yet
Traffic Data Analysis
4 pages
Statistical Methods Course Syllabus
No ratings yet
Statistical Methods Course Syllabus
20 pages
Analysis of India's Plywood Market - Fall 2016 HAMK - Removed
No ratings yet
Analysis of India's Plywood Market - Fall 2016 HAMK - Removed
19 pages
Problem Set 2 Answer PDF
No ratings yet
Problem Set 2 Answer PDF
5 pages
ch14 Nonlinear Regression Models
100% (1)
ch14 Nonlinear Regression Models
18 pages
Getting Started With The GLMMTMB Package: 1 Introduction/Quick Start
No ratings yet
Getting Started With The GLMMTMB Package: 1 Introduction/Quick Start
9 pages
Group Data Mean
No ratings yet
Group Data Mean
9 pages
Tugas Statistik Hal 109 No 11 - 14
No ratings yet
Tugas Statistik Hal 109 No 11 - 14
26 pages
MS Excel
No ratings yet
MS Excel
6 pages
Jurnal Stem
No ratings yet
Jurnal Stem
6 pages
Statistical Tools For Data Analysis
No ratings yet
Statistical Tools For Data Analysis
4 pages
OPM Chapter 3 - Forecasting
No ratings yet
OPM Chapter 3 - Forecasting
5 pages
Interval Estimation Interval (CI) Estimation: Course No: MATH F113
No ratings yet
Interval Estimation Interval (CI) Estimation: Course No: MATH F113
46 pages
Multivariate Normal Distribution
No ratings yet
Multivariate Normal Distribution
51 pages
Lc07 SL Estimation - PPT - 0
No ratings yet
Lc07 SL Estimation - PPT - 0
48 pages
TAKE HOME QUIZ 1 Stat
No ratings yet
TAKE HOME QUIZ 1 Stat
2 pages
Ppt - 3 (Chi-square Test)
No ratings yet
Ppt - 3 (Chi-square Test)
12 pages
Ch14 ZKH3 Multiple Regression
No ratings yet
Ch14 ZKH3 Multiple Regression
45 pages
كتاب الاحصاء الحيوية
No ratings yet
كتاب الاحصاء الحيوية
67 pages
100 plus Statistics Interview Questions
0% (1)
100 plus Statistics Interview Questions
44 pages
Fixed Vs Random The Hausman Test Four Decades Later
No ratings yet
Fixed Vs Random The Hausman Test Four Decades Later
33 pages
Chapter 1
No ratings yet
Chapter 1
13 pages