0% found this document useful (0 votes)
20 views

Assignment 2 CS Sec#4

Assignment of Probability and Statistics CUST Uni

Uploaded by

shaheerahmed371
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
20 views

Assignment 2 CS Sec#4

Assignment of Probability and Statistics CUST Uni

Uploaded by

shaheerahmed371
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 5

Capital University of Science and Technology

Department of Computer Science


MTCS2063 – Probability and Statistics
ASSIGNMENT NO. 2
Semester: Fall 2024
Instructor: Ms. Mahnoor Ali
Assigned Date: 28/10/2024 Due Date: 08/11/2024
Name: Section: 4

Question: 1
The following data set represents the response time (in milliseconds) of a web application for 25
different user requests:
18, 25, 37, 22, 41, 29, 35, 20, 33, 46, 28, 39, 24, 31, 42, 26, 48, 30, 36, 23, 40, 27, 45, 32, 38.
Find Arithematic mean, Median, Mode and Midrange. Also conclude the averages.

Question: 2
A survey was conducted to analyze the number of applications developed by 30 software engineers
over the past year. The results were organized into the following frequency distribution:
Number of
Applications 10-17 18-25 26-33 34-41 42-49
Developed
frequency 5 10 8 4 3

Find A.M, Median and Mode.

Question: 3
In a recent performance review, 30 software engineers reported the number of project
deadlines they successfully met over the past year. Surprisingly, upon analyzing the data, it
was found that a small group of engineers consistently outperformed their peers, significantly
affecting the overall statistics. The recorded data is as follows:
10, 15, 12, 18, 22, 25, 30, 35, 20, 28, 14, 19, 17, 24, 33, 15, 16, 11, 27, 40, 13, 23, 21, 29,
34, 36, 50, 45, 32, 38.

Calculate the five-number summary for the data set. Check the possible outliers, if they
exist then how might influence the summary statistics.

Question 4:

A study was conducted to analyze the number of software features developed by a team of

100 software engineers in the last quarter. The results were organized as,

Number of
Features 150-158 159-167 168-176 177-185 186-194 195-203 204-212
Developed
Frequency 5 16 20 21 20 15 3

Calculate the first Quartile, second Quartile and third Quartile for the number of features
developed. Calculate Quartile deviation and coefficient of Quartile Deviation.
Also obtain 9th Decile and 6th Decile.

Question 5:

(i) An instructor grades exams, 20%; term paper, 30%; final exam, 50%. A student had
grades of 83, 72, and 90, respectively, for exams, term paper, and final exam. Find the
student’s final average. Use the weighted mean.

(ii) Another instructor gives four 1-hour exams and one final exam, which counts as two 1-
hour exams. Find a student’s grade if he/she received 62, 83, 97, and 90 on the 1-hour
exams and 82 on the final exam.

For these situations, state which measure of central tendency should be used:
a. The most typical case is desired.
b. The distribution is open-ended.
c. There is an extreme value in the data set.
d. The data are categorical.
e. Further statistical computations will be needed.
f. The values are to be divided into two approximately equal groups, one group containing the
larger values and one containing the smaller values.
Question 6:

(i) A tech company analyzed the number of bugs reported in its software over the course
of a year. Surprisingly, despite a few months with an unusually high number of bugs,
the average remained stable. This led to confusion among employees about the
reliability of the average as a measure of performance. The number of bugs reported
each month for 28 months is as follows:

5, 7, 6, 9, 8, 12, 15, 4, 6 10, 50, 3, 15, 7, 8, 9, 11, 14, 20, 18, 22, 30, 5, 6, 4, 8, 12, 16.

Calculate the Range, Variance, Standard deviation and Coefficient of variation of the number of
bugs reported.

(ii) The lengths (in feet) of the main span of the longest suspension bridges in the United
States and the rest of the world are shown below. Which set of data is more variable?
United States: 4205, 4200, 3800, 3500, 3478, 2800, 2800, 2310
World: 6570, 5538, 5328, 4888, 4626, 4544, 4518, 3970

Question 7:

A software company tested the lifetimes (in hours) of 80 randomly selected computer batteries to
evaluate their performance. The results were organized into the following frequency distribution:

Classes 5-11 12-18 19-25 26-32 33-39 40-46


frequency 5 14 18 25 12 6

Find the variance and standard deviation for the data. Can it be concluded that the lifetimes of
batteries are consistent?

Question 8:

A software development company conducted an analysis of the time taken to complete two
projects over the past year. The completion times (in weeks) were categorized into different
intervals to assess productivity and efficiency. The following frequency distributions were
obtained from both projects:
Dataset 1: Project Completion Times (Team A)

Class 1-9 10-18 19-27 28-36 37-45 45-54


Limits
frequency 5 7 10 3 3 2
Dataset 2: Project Completion Times (Team B)
Class 13-21 22-30 31-39 40-48 49-57 58-66
Limits
frequency 6 7 9 4 3 1

Which of the following data set is more consistent? Also interpret the results.

Question 9:

A random selection of state gasoline taxes per gallon is given below,

16 18 35.3 25 23.5 27.1 32.5 16 22 17.5 19 29.5 7.5 12, 9.5, 18, 22.5.
Find:
a. 30th percentile
b. 50th percentile
c. 75th percentile
d. 98th percentile
e. 50th percentile
f. 63th percentile
g. 4th decile
h. 6th decile
i. 9th decile

Question 10:

A data analysis team at an aviation company collected the airborne speeds (in miles per hour) of
21 aircraft during test flights. The team aims to determine the approximate values corresponding
to specific percentiles to assess performance across different aircraft models. The speeds recorded
are as follows:

Classes 366-386 387-407 408-428 429-449 450-470 471-491 492-512 513-533


frequency 4 2 3 2 1 2 3 4

Find:
a. 9th percentile b. 60th percentile c. 20th percentile
d. 72th percentile e. 88th percentile

Question 11:

(i) The number of teacher strikes over a 13-year period in Pennsylvania is shown.
Construct a boxplot for the data. Is the distribution symmetric?
20 18 7 13 7 14 5 9 9 9 10 17 15.
(ii) The number of sites at which the top nine movies (based on the daily gross earnings)
opened in a particular week is indicated below. 3017 3687 2525 2516 2820 2579 3211
3044 2330. Construct a boxplot for the data.

Question 12:

A computer science department conducted a study to analyze the study habits of its software
engineering students. The department collected data on the number of hours students spend
studying per week, which was organized into a frequency distribution. This analysis aims to
understand the distribution of study hours and identify potential areas for academic support.

Classes 10-12 13-15 16-18 19-21 22-24 25-27 28-30


frequency 6 4 14 15 8 2 1

Calculate the mean deviation from the mean, median and mode for the study hours. Also calculate
coefficient of M.D from mean median and mode.

Question 13:

A survey was conducted to analyze the number of programming languages known by 25


software engineers. The results were organized into the following grouped frequency
distribution:
Class 5-10 11-15 16-20 21-25 26-30
Interval
frequency 4 9 6 4 2

Measure the skewness of distribution by using:


(i) Karl pearson’s first coefficient of skewness.
(ii) Karl pearson’s second coefficient of skewness.
(iii) Bowley’s coefficient of skewness.

Good Luck

You might also like