Assignment 2 CS Sec#4
Question 1:
The following data set represents the response time (in milliseconds) of a web application for 25
different user requests:
18, 25, 37, 22, 41, 29, 35, 20, 33, 46, 28, 39, 24, 31, 42, 26, 48, 30, 36, 23, 40, 27, 45, 32, 38.
Find the arithmetic mean, median, mode, and midrange. Also draw a conclusion from these averages.
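The four measures asked for above can be checked against a hand computation with a short script. The following is a minimal Python sketch, not a prescribed method. The function name central_tendency is hypothetical, and the convention of reporting no mode when every distinct value occurs equally often is an assumption made here.

```python
from statistics import mean, median, multimode

# Response times (ms) copied from Question 1.
response_times = [18, 25, 37, 22, 41, 29, 35, 20, 33, 46, 28, 39, 24,
                  31, 42, 26, 48, 30, 36, 23, 40, 27, 45, 32, 38]


def central_tendency(values):
    """Return the mean, median, mode(s), and midrange of a list of numbers."""
    modes = multimode(values)
    # Assumed convention: if all distinct values tie for most frequent
    # (and there is more than one of them), report no mode.
    if len(modes) > 1 and len(modes) == len(set(values)):
        modes = []
    return {
        "mean": mean(values),
        "median": median(values),
        "modes": modes,
        "midrange": (min(values) + max(values)) / 2,
    }


print(central_tendency(response_times))
```

Comparing the mean, median, and midrange returned here is one way to support the conclusion the question asks for.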
Question 2:
A survey was conducted to analyze the number of applications developed by 30 software engineers
over the past year. The results were organized into the following frequency distribution:
Number of applications developed   10-17   18-25   26-33   34-41   42-49
Frequency                              5      10       8       4       3
Question 3:
In a recent performance review, 30 software engineers reported the number of project
deadlines they successfully met over the past year. Surprisingly, upon analyzing the data, it
was found that a small group of engineers consistently outperformed their peers, significantly
affecting the overall statistics. The recorded data is as follows:
10, 15, 12, 18, 22, 25, 30, 35, 20, 28, 14, 19, 17, 24, 33, 15, 16, 11, 27, 40, 13, 23, 21, 29,
34, 36, 50, 45, 32, 38.
Calculate the five-number summary for the data set. Check for possible outliers; if any
exist, explain how they might influence the summary statistics.
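A five-number summary and an outlier check can be sketched in Python as follows. Two assumptions are made here and should be matched to the course text: quartiles are taken as the medians of the lower and upper halves of the sorted data, and a value counts as a possible outlier if it lies more than 1.5 times the interquartile range beyond a quartile. The function names are hypothetical.

```python
from statistics import median

# Deadlines met, copied from Question 3.
deadlines = [10, 15, 12, 18, 22, 25, 30, 35, 20, 28, 14, 19, 17, 24, 33,
             15, 16, 11, 27, 40, 13, 23, 21, 29, 34, 36, 50, 45, 32, 38]


def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) using the median-of-halves rule."""
    s = sorted(values)
    n = len(s)
    half = n // 2
    lower = s[:half]
    upper = s[half + (n % 2):]  # skip the middle value when n is odd
    return s[0], median(lower), median(s), median(upper), s[-1]


def iqr_outliers(values):
    """Return the values outside the assumed 1.5 * IQR fences."""
    _, q1, _, q3, _ = five_number_summary(values)
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    return [v for v in values if v < low_fence or v > high_fence]


print(five_number_summary(deadlines))
print(iqr_outliers(deadlines))
```

Other quartile conventions (for example, interpolated positions) can give slightly different Q1 and Q3, so the fences may shift with the method used.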
Question 4:
A study was conducted to analyze the number of software features developed by a team of
100 software engineers in the last quarter. The results were organized as,
Number of features developed   150-158   159-167   168-176   177-185   186-194   195-203   204-212
Frequency                            5        16        20        21        20        15         3
Calculate the first, second, and third quartiles for the number of features
developed. Calculate the quartile deviation and the coefficient of quartile deviation.
Also obtain the 6th and 9th deciles.
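For grouped data, quartiles and deciles are commonly found by linear interpolation inside the class that contains the target position, L + (h / f) * (kN / parts - C). The Python sketch below applies that formula to the table in Question 4. It assumes class boundaries halfway between adjacent class limits (for example 167.5 for the class 168-176); the function name grouped_quantile and its parameters are hypothetical.

```python
# Class limits and frequencies copied from Question 4.
classes = [(150, 158), (159, 167), (168, 176), (177, 185),
           (186, 194), (195, 203), (204, 212)]
freqs = [5, 16, 20, 21, 20, 15, 3]


def grouped_quantile(classes, freqs, k, parts):
    """k-th quantile out of `parts` equal parts (4 = quartiles, 10 = deciles)."""
    total = sum(freqs)
    target = k * total / parts            # position kN / parts
    gap = classes[1][0] - classes[0][1]   # gap between adjacent class limits
    cumulative = 0                        # C: frequency below the current class
    for (lower, upper), f in zip(classes, freqs):
        if cumulative + f >= target:
            boundary = lower - gap / 2    # L: lower class boundary (assumed)
            width = (upper - lower) + gap  # h: class width
            return boundary + width * (target - cumulative) / f
        cumulative += f
    raise ValueError("target position lies beyond the last class")


q1 = grouped_quantile(classes, freqs, 1, 4)
q2 = grouped_quantile(classes, freqs, 2, 4)
q3 = grouped_quantile(classes, freqs, 3, 4)
quartile_deviation = (q3 - q1) / 2
coefficient_qd = (q3 - q1) / (q3 + q1)
d6 = grouped_quantile(classes, freqs, 6, 10)
d9 = grouped_quantile(classes, freqs, 9, 10)
print(q1, q2, q3, quartile_deviation, coefficient_qd, d6, d9)
```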
Question 5:
(i) An instructor grades exams, 20%; term paper, 30%; final exam, 50%. A student had
grades of 83, 72, and 90, respectively, for exams, term paper, and final exam. Find the
student’s final average. Use the weighted mean.
(ii) Another instructor gives four 1-hour exams and one final exam, which counts as two
1-hour exams. Find a student’s grade if he/she received 62, 83, 97, and 90 on the 1-hour
exams and 82 on the final exam.
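Both parts above are weighted means: each grade is multiplied by its weight, and the sum of the products is divided by the sum of the weights. A minimal Python sketch follows; the function name weighted_mean is hypothetical, and in part (ii) the final exam is given weight 2 because it counts as two 1-hour exams.

```python
def weighted_mean(values, weights):
    """Return sum(w * x) / sum(w) for paired values and weights."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


# Part (i): exams 20%, term paper 30%, final exam 50%.
part_i = weighted_mean([83, 72, 90], [20, 30, 50])

# Part (ii): four 1-hour exams (weight 1 each) and a final exam (weight 2).
part_ii = weighted_mean([62, 83, 97, 90, 82], [1, 1, 1, 1, 2])

print(part_i, part_ii)
```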
(iii) For these situations, state which measure of central tendency should be used:
a. The most typical case is desired.
b. The distribution is open-ended.
c. There is an extreme value in the data set.
d. The data are categorical.
e. Further statistical computations will be needed.
f. The values are to be divided into two approximately equal groups, one group containing the
larger values and one containing the smaller values.
Question 6:
(i) A tech company analyzed the number of bugs reported in its software over the course
of a year. Surprisingly, despite a few months with an unusually high number of bugs,
the average remained stable. This led to confusion among employees about the
reliability of the average as a measure of performance. The number of bugs reported
each month for 28 months is as follows:
5, 7, 6, 9, 8, 12, 15, 4, 6, 10, 50, 3, 15, 7, 8, 9, 11, 14, 20, 18, 22, 30, 5, 6, 4, 8, 12, 16.
Calculate the range, variance, standard deviation, and coefficient of variation of the number of
bugs reported.
(ii) The lengths (in feet) of the main span of the longest suspension bridges in the United
States and the rest of the world are shown below. Which set of data is more variable?
United States: 4205, 4200, 3800, 3500, 3478, 2800, 2800, 2310
World: 6570, 5538, 5328, 4888, 4626, 4544, 4518, 3970
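The dispersion measures in part (i) and the comparison in part (ii) can be checked with the Python sketch below. It makes two assumptions: the data are treated as samples, so the variance uses the n - 1 divisor, and the entry "6 10" in the bug data is read as the two values 6 and 10, which gives the stated 28 observations. The function name dispersion_summary is hypothetical. Because the two bridge data sets have different means, the coefficient of variation is used to compare their variability.

```python
from statistics import mean, stdev, variance

# Monthly bug counts from part (i); "6 10" is read as 6, 10 (assumption).
bugs = [5, 7, 6, 9, 8, 12, 15, 4, 6, 10, 50, 3, 15, 7, 8, 9, 11, 14,
        20, 18, 22, 30, 5, 6, 4, 8, 12, 16]

# Main-span lengths (feet) from part (ii).
us_spans = [4205, 4200, 3800, 3500, 3478, 2800, 2800, 2310]
world_spans = [6570, 5538, 5328, 4888, 4626, 4544, 4518, 3970]


def dispersion_summary(values):
    """Range, sample variance, sample standard deviation, and CV in percent."""
    sd = stdev(values)  # sample standard deviation (n - 1 divisor)
    return {
        "range": max(values) - min(values),
        "variance": variance(values),
        "std_dev": sd,
        "cv_percent": 100 * sd / mean(values),
    }


print(dispersion_summary(bugs))
print(dispersion_summary(us_spans)["cv_percent"])
print(dispersion_summary(world_spans)["cv_percent"])
```

If the course treats the data as a population instead, pvariance and pstdev from the same module use the divisor n.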
Question 7:
A software company tested the lifetimes (in hours) of 80 randomly selected computer batteries to
evaluate their performance. The results were organized into the following frequency distribution:
Find the variance and standard deviation for the data. Can it be concluded that the lifetimes of
batteries are consistent?
Question 8:
A software development company conducted an analysis of the time taken to complete two
projects over the past year. The completion times (in weeks) were categorized into different
intervals to assess productivity and efficiency. The following frequency distributions were
obtained from both projects:
Dataset 1: Project Completion Times (Team A)
Which of the two data sets is more consistent? Also interpret the results.
Question 9:
16, 18, 35.3, 25, 23.5, 27.1, 32.5, 16, 22, 17.5, 19, 29.5, 7.5, 12, 9.5, 18, 22.5.
Find:
a. 30th percentile
b. 50th percentile
c. 75th percentile
d. 98th percentile
e. 50th percentile
f. 63rd percentile
g. 4th decile
h. 6th decile
i. 9th decile
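Percentile values for ungrouped data depend on the positional rule used. The Python sketch below assumes one common textbook rule: compute c = n * p / 100 on the sorted data; if c is not a whole number, round it up and take the value in that position; if c is whole, average the values in positions c and c + 1. The function name percentile_value is hypothetical, p is expected to be a whole number, and deciles are treated as the 10th, 20th, and so on percentiles.

```python
# Data copied from Question 9.
data = [16, 18, 35.3, 25, 23.5, 27.1, 32.5, 16, 22, 17.5, 19, 29.5,
        7.5, 12, 9.5, 18, 22.5]


def percentile_value(values, p):
    """Value at the p-th percentile (0 < p < 100, p a whole number)."""
    if not 0 < p < 100:
        raise ValueError("p must be strictly between 0 and 100")
    s = sorted(values)
    product = len(s) * p
    if product % 100 == 0:
        c = product // 100            # c is whole: average positions c and c + 1
        return (s[c - 1] + s[c]) / 2
    c = product // 100 + 1            # c is not whole: round up
    return s[c - 1]


for p in (30, 50, 75, 98, 63):
    print(f"P{p} =", percentile_value(data, p))
for d in (4, 6, 9):
    print(f"D{d} =", percentile_value(data, 10 * d))
```

Interpolating rules, such as those offered by statistics.quantiles, can return values that fall between observations, so answers should follow whichever rule the course specifies.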
Question 10:
A data analysis team at an aviation company collected the airborne speeds (in miles per hour) of
21 aircraft during test flights. The team aims to determine the approximate values corresponding
to specific percentiles to assess performance across different aircraft models. The speeds recorded
are as follows:
Find:
a. 9th percentile b. 60th percentile c. 20th percentile
d. 72nd percentile e. 88th percentile
Question 11:
(i) The number of teacher strikes over a 13-year period in Pennsylvania is shown.
Construct a boxplot for the data. Is the distribution symmetric?
20 18 7 13 7 14 5 9 9 9 10 17 15.
(ii) The number of sites at which the top nine movies (based on the daily gross earnings)
opened in a particular week is indicated below. Construct a boxplot for the data.
3017 3687 2525 2516 2820 2579 3211 3044 2330.
Question 12:
A computer science department conducted a study to analyze the study habits of its software
engineering students. The department collected data on the number of hours students spend
studying per week, which was organized into a frequency distribution. This analysis aims to
understand the distribution of study hours and identify potential areas for academic support.
Calculate the mean deviation from the mean, from the median, and from the mode for the study
hours. Also calculate the coefficient of mean deviation from the mean, median, and mode.
Question 13:
Good Luck