0% found this document useful (0 votes)
72 views6 pages

Assignment Question

This document provides an overview of a coursework assignment to analyze a dataset on house rentals using R programming. Students must thoroughly explore the dataset, applying techniques like data exploration, manipulation, transformation and visualization. The deliverables are an R script file with the code and meaningful analysis of the data, as well as a report summarizing the data preparation, analyses conducted, findings, conclusions and recommendations. The report must also include a workload matrix and follow APA referencing style for sources. The assignment aims to help students meaningfully analyze a real-world dataset.

Uploaded by

Spam Use
Copyright
© © All Rights Reserved
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
Download as doc, pdf, or txt
0% found this document useful (0 votes)
72 views6 pages

Assignment Question

This document provides an overview of a coursework assignment to analyze a dataset on house rentals using R programming. Students must thoroughly explore the dataset, applying techniques like data exploration, manipulation, transformation and visualization. The deliverables are an R script file with the code and meaningful analysis of the data, as well as a report summarizing the data preparation, analyses conducted, findings, conclusions and recommendations. The report must also include a workload matrix and follow APA referencing style for sources. The assignment aims to help students meaningfully analyze a real-world dataset.

Uploaded by

Spam Use
Copyright
© © All Rights Reserved
Available Formats
Download as DOC, PDF, TXT or read online on Scribd
Download as doc, pdf, or txt
Download as doc, pdf, or txt
You are on page 1/ 6

Programming for Data Analysis

Page 1 of 6
1.0 COURSEWORK TITLE

House Rent Prediction Dataset

THE COURSEWORK OVERVIEW

For the assignment, you are asked to explore the application of data analytics techniques to
the dataset which is provided. You must study data problems related to the dataset, giving
special consideration to the unique properties of the problem domain, and testing one or
more techniques on it.

Your analysis needs to be thorough and comprehensive, and goes beyond the scope of what
has been covered in this course. You should incorporate data exploration, manipulation,
transformation, and visualization concepts with data analysis techniques in your solution. It
is crucial to provide explanations and justifications for the chosen techniques.

You also may need to pre-process your data to get it into an appropriate format. The
assignment should involve a number of techniques by categorize it into different criteria and
a detailed exploration with the commands using in each criteria. Outline the findings,
analyze them and justify correctly with an appropriate graph. Also, a supporting document
is needed to reflect the graph and code using R programming concepts.

2.0 OBJECTIVES OF THIS COURSEWORK

This assignment will help you to explore and analyse a set of data and reconstruct it into
meaningful representations for decision making.

3.0 TYPE
Group Assignment

4.0 COURSEWORK DESCRIPTION

This dataset contains the details of varied house rents that could determine how people have
an impact to choose the houses for rental based on their lifestyle. Human Rights
Measurement team assigned you to perform analysis with the given dataset to identify the
how people choose their rental house based on their family background, locality and
lifestyle and provide meaningful insight for decision making.

Level 2 Asia Pacific University of Technology and Innovation 2023


Programming for Data Analysis
Page 2 of 6
Techniques
The dataset provided for this assignment consists of information about
Houses/Apartments/Flats Available for Rent, including parameters such as BHK, Rent,
Size, No. of Floors, Area Type, Area Locality, City, Furnishing Status, Type of Tenant
Preferred, No. of Bathrooms, and Point of Contact. It contains over 4700 rows and 12
columns. In addition to the techniques (data exploration, manipulation, transformation, and
visualization techniques) covered in the course to analyze the dataset, you might consider to
explore and implement more advanced concepts to enhance the effectiveness of data
retrieval.

5.0 GENERAL REQUIREMENTS


 This is a group assignment, with a maximum of 4 students in a group.
 The R program should compile and be executed without errors.
 Validation should be done for each entry from the users to avoid logical errors.
 Do not use third-parties tools such as Excel, OpenRefine and etc to pre-process or clean
the data. Cleaning and pre-processing must be done in R using scripting.
 No duplication is allowed in dataset.
 You should;
o Include good programming practice such as comments, variable naming
conventions and indentation.
o Carried out additional research from Internet to comprehend the knowledge and
information on the given dataset when examine the data.
 The analysis should be meaningful and effective in providing the information for the
decision making.
 Any additional features implemented must improve the retrieval effects.

Level 2 Asia Pacific University of Technology and Innovation 2023


Programming for Data Analysis
Page 3 of 6
6.0 DELIVERABLES:
The complete RScript (source code) and report must be submitted to APU Learning
Management System (Moodle).
6.1 RScript (Program Code):
 Name the file under your group number.
 Start the first few lines in your program by typing all members names and TP
numbers. For example:
# Name1, TP000001
# Name2, TP000002
# Name3, TP000003
# Name4, TP000004
o For each question example, give an id and explain what you want to discover.
For example:
# Question 1: Why staff would leave the company.
# Analysis 1-1: Find the relationship between job position with attrition…
# Analysis 1-2: Find the relationship between job age and ….
# Analysis 1-3: Find the relationship between …
o For each extra feature example, give an id and provide the explanation.
# Extra feature 1
# comments about the extra feature

6.2 Documentation (report):


 A 5000 words (30-40 pages) report excluding appendix. The report should comprise of
the following content.
A) Cover Page:
All reports must be prepared with a front cover. A protective transparent plastic
sheet can be placed in front of the report to protect the front cover. The front cover
should be presented with the following details:
 Module
 Coursework Title
 Intake
 Students name and id
 Date Assigned (the date the report was handed out).
 Date Completed (the date the report is due to be handed in).

Level 2 Asia Pacific University of Technology and Innovation 2023


Programming for Data Analysis
Page 4 of 6

B) Contents:
o Introduction
 Data Description
 Assumptions (if any)
 Objectives
o Data Preparation
 Data import
 Cleaning / pre-processing (if necessary)
 Validation (if any)
o Data Analysis
 Each question must start in a separate page and contains:
 Analysis Techniques – e.g. descriptive using statistics
 Screenshot of source code with output / plot.
 Outline the findings based on the results obtained.
 The extra feature explanation must be in a separate page and contains:
 Screenshot of source code with output/plot.
 Explain how adding this extra feature can improve the results.
o Conclusion
 Interpret the result
 Recommendation
 Limitation and future direction

C) Workload Matrix

D) References
 You may source algorithms and information from the Internet or books.
Proper referencing of the resources should be evident in the document.
 All references must be made using the APA (American Psychological
Association) referencing style as shown below:

The theory was first propounded in 1970 (Larsen, A.E. 1971), but since then
has been refuted; M.K. Larsen (1983) is among those most energetic in their
opposition……….

/**
* Following source code obtained from (Danang, S.N. 2002)

Level 2 Asia Pacific University of Technology and Innovation 2023


Programming for Data Analysis
Page 5 of 6
*/
int noshape=2;
noshape=GetShape();

 List of references at the end of your document or source code must be


specified in the following format:

Larsen, A.E. 1971, A Guide to the Aquatic Science Literature, McGraw-Hill,


London.

Larsen, M.K. 1983, British Medical Journal [Online], Available from


http://libinfor.ume.maine.edu/acquatic.htm (Accessed 19 November 1995)
Danang, S.N., 2002, Finding Similar Images [Online], The Code Project,
*Available from http://www.codeproject.com/bitmap/cbir.asp, [Accessed
14th *September 2006]

 Further information on other types of citation is available in Petrie, A., 2003,


UWE Library Services Study Skills: How to reference [online], England,
University of Western England, Available from
http://www.uweac.uk/library/resources/general/info_study_skills/harvard2.h
tm, [Accessed 4th September 2003].

 The font size used in the report must be 12pt and the font is Times New Roman. Place
the full RScript in the appendix section.

7.0 ASSIGNMENT ASSESSMENT CRITERIA


The assignment assessment consists of 2 major components: Analysis (70%) and finding
and discussion (30%). Details of the division for each component are as follows:

Level 2 Asia Pacific University of Technology and Innovation 2023


Programming for Data Analysis
Page 6 of 6

Analysis (70%) Finding and Discussion (30%)

Criteria Marks Criteria Marks


Allocat Allocate
ed d
Analysis techniques 10% Finding and Discussion
 Approaches used to  Project Introduction
process, interpret, and (description, assumption,
extract insights from data. objectives)
Report content:  Conclusion (result finding, 30%
 Methodology description discussion, recommendation
Analysis Methods 60% and future direction)
 Specific process employed  Structure of the report and
to carry out the analysis references
(transform raw data into
meaningful insights)
[e.g. Exploratory data analysis,
Hypothesis, Descriptive statistics
etc]
Report content:
 RScript – code snippet and
visualization screenshots
with explanation.

8.0 DEVELOPMENT TOOLS


The program written for this assignment should be written in R Studio

9.0 ACADEMIC INTEGRITY


 You are expected to maintain the utmost level of academic integrity during the duration
of the course.
 Plagiarism is a serious offence and will be dealt with according to APU and De
Montfort University regulations on plagiarism.

Level 2 Asia Pacific University of Technology and Innovation 2023

You might also like