Essential Excel Skills For Data Preparation and Analysis Week 1
Essential Excel Skills For Data Preparation and Analysis Week 1
Hi and welcome to the Excel Essentials course. My name is Gabriel Soong, also known as Cricket.
I am a registered Electronics and Communications Engineer, and I work in social impact. This course is
for those who are new to the field of data science and analytics. Every journey has a beginning, and for
you, it starts with learning the fundamentals of Excel. This course aims to teach you how: To perform
basic spreadsheet tasks that may be used to have better efficiency and productivity in carrying out
business tasks. To be proficient in Excel calculations using various functions and formulas that will
strengthen business reports and analysis. To professionally use Excel in creating visualizations and
summary of data through tables, charts and graphs. As a general tip for this course, make sure to write
down the name of the tasks and functions on one column and the corresponding keystrokes on a second
column. These will be your notes for the course, and such will aid in memorizing the keystrokes
and functionalities. Your notes will definitely come in handy when you take the weekly assessment
quizzes.
The purpose of this course is to introduce and equip learners with Excel tasks and functions that
are essential to data science and analytics.
This course will take you through the basic functionalities of Excel for data analysis
and visualization. The key concepts to be learned include functions for data
organization, manipulation, visualization, and task automation.
Course Organization
This course is divided into five weeks. Each week contains:
Video Lessons
Activities
Assessments
Course Prerequisites:
For Windows: Core i3 or better, 4GB RAM or better, MS Excel 2007 or better
For MacBook: ideally MS Excel 2013 or newer should be installed (some functions require this
version on the Mac). If the version of MS Excel is 2011, download and install StatPlus.
Time Commitment
It will take approximately 1-3 hours to complete each week.
Technical Issues
If you have any technical issues (problems unrelated to the course content itself,
such as questions about uploading assignments or tracking your progress), you can
visit our Learner Support Page which is a public repository that is constantly
updated with solutions to the most common learner issues. If there are any issues
that cannot be addressed through the Learner Support Page, you can reach out to
Coursebank directly through learn@coursebank.ph.
Accomplish the task at the end of each week to build the components of your final project. This
will be peer-reviewed and will focus on how you use Excel as a tool - not (yet) the robustness of
your study.
How far is your residence from your school or place of work? How long do you travel to get to
work, or on your way home? Do you get to sleep on your commute?
For your capstone project in Week 5, you will be asked to pick any of the following
criteria and create a set of 5 to 10 survey questions based on your choice. You will
be inviting 35 to 50 individuals to answer your survey, and you will be encouraged
to interview more. You may add information or demographics like gender, age,
educational background, work industry, and other details you may wish to learn
about your respondents.
Method of travel
Distance traveled
Walking
Refer to the sample survey for ideas, and feel free to build on it. You are free to do face-to-face
interviews or use other online tools.
Overview
For Week 1, we will go over two topics: Basic Spreadsheet Tasks and Basic Excel Functions.
You will learn about cell functions, including editing, labeling and basic computations. You will also gain
knowledge about worksheet functions and formula creation. As with the whole course, these topics will
be taught through lectures, demonstrations, and a quiz at the end of the week. As a tip, remember to
take notes as you learn each task and function.
1.3.1 Part 1
Bookmark this page
Selecting a Range
In this lesson, we will learn how to select a range. A range is a collection of selected
cells. In this lesson, we will demonstrate how to make what we call a symmetrical
range of cells.
Keep pressing the different direction arrow buttons until you highlight the desired collection of
cells.
The GIF demo that follows demonstrates both modes of selecting a range.
Selecting Multiple Ranges
To add to that lesson, we will now learn how to select multiple ranges. This is usually a
precursor step to other tasks such as formatting cells that are not adjacent to one another.
To select multiple ranges, begin by first selecting an initial range, whether by click-drag method
or by SHIFT + ARROW KEYS.
PRESS CTRL + CLICK ON A TARGET CELL, THEN DRAG THE CURSOR TO CREATE A
NEW SELECTED RANGE
This creates a second highlighted range. Repeat this step to make even more selected ranges. If
the next range is a single cell, note that you can simply press CTRL and LEFT CLICK on that
range.
The GIF demonstrates the selection of the initial range, followed by several other ranges, including
single cell ranges.
In this lesson, we will learn about copying and pasting. These functions are familiar to most
people who use smartphones or computers, and operate is much the same way in Excel. The
purpose of copying and pasting is to replicate the content of a spreadsheet cell or range in
another location.
To do the copy and paste combination, begin by selecting a cell or range to copy. Then execute
the following commands:
1. Press CTRL + C
3. Press CTRL + V
You can also Right Click the selected range, then select Copy, Right Click another location,
then select Paste.
One last option is to select a range, click Home on the ribbon then select Copy, Right Click on
another location, then select Paste on the ribbon.
The GIF demonstrates all three methods of copying and pasting. It also demonstrates how to repeatedly
paste the same copied range in different locations.
Now that you’ve completed copying and pasting, we will briefly touch on the PASTE
SPECIAL function in Excel.
An Excel cell with content may look simple enough (i.e. a box with numbers and or letters), but
sometimes there are many things going on under the hood. Its values can either be manually
encoded or typed into a cell, could be a product of a mathematical formula, or it could be linked
to another cell, sheet or even document. Its appearance could also be the product of formatting
options.
When copying the contents of a cell, it may be important to know if you only want what appears
in the cell, or if you want the underlying functions that accompany it. To specify what exactly
you would like to copy and paste elsewhere, you can select PASTE SPECIAL, and choose what
exactly you the aspects to paste.
Without getting into detail, you should know that you have the option to paste aspects such as
content, formatting, formulas, values, comments, notes, data validation, column widths, number
formats, and values without formulas.
To do a PASTE SPECIAL, start by copying a target cell or range. Choose another location on
the spreadsheet, then execute a PASTE SPECIAL by:
2. Right Clicking and hovering over the many choices for PASTE SPECIAL or PASTE
OPTIONS
The GIF demonstrates the three methods of pasting, with a particular focus on pointing out the special
paste options.
To perform this function, begin by selecting a cell or range. To initiate the cutting,
1. Press CTRL + X.
Like in copying cells or ranges, the methods of using the Right Click drop-down menu or using
the cutting option on the ribbon are also available.
The GIF demonstrates how a cut cell or range is transferred to a new location.
1.3.2 Part 2
Bookmark this page
Sorting
There are two options in sorting data. They are as follows:
Option 1:
SELECT CELLS TO SORT, RIGHT CLICK + SELECT SORT CHOOSE DESIRED SORTING
OPTION
Option 2:
SELECT CELLS TO SORT, LOOK FOR SORT AND FILTER IN HOME RIBBON +
CHOOSE DESIRED SORTING OPTION
Cell Formatting
Cell Formatting specifies how Excel treats the data entered into each cell.
1. Cell formatting can be accessed directly under the Home Tab to modify font orientation and size.
2. Cell formatting can be accessed through the Formatting button under the Home Tab in the Cells
group. A Cell Format Dialog box will come out to prompt the user once accessed.
3. Cell formatting can also be found in the context menu (right-click) of the active cell.
To distinguish between numeric data and text strings, you can specify the data format under the
Home Tab, in the Number group. Both data types are handled differently by Excel. Numeric
data, can be defined in the first tab of the Format Cells dialog box. Each type has a short
description and a defined format.
Alignment, Font, Border, Fill and Protection are also accessible from here.
Filling a Series
This allows the user to fill up the rest of a series based on the content of the previous cells or the
parameters given by the user. A very helpful function when continuing a number or letter series.
Method 1:
Select VALUE or CELL to FILL. HOVER the mouse pointer over the lower right corner of the
active cell.
The mouse pointer will change into a "+" sign, then left click mouse and drag to the desired
direction. the AUTO FILL option will appear on the last cell with value. Select SERIES for
SERIES FILL or COPY only.
Method 2:
Select desired cell top copy. Start with cell to copy and select desired range to fill. On the home
ribbon, choose FILL. Select Desired option.
Method 3:
When there are values on the left or right column, select the first cell to fill. Out the mouse
pointer on the lower right corner of the selected cell (Pointer will change into a cross "+" sign).
Double left click the mouse. Then, the AUTO FILL option will appear on the last cell with value.
Select desired option. To guide the excel for a pattern, type the next value below the first cell and
select both values and put the mouse pointer on the right corner of the selected cell.
The mouse pointer will again turn into a "cross" sign. Left click the mouse and drag to the
desired direction.
Excel is a modern spreadsheet software capable of numerous functions. While it is not the first,
it’s use has been widespread in recent decades. During the early years of digital spreadsheets,
cell addressing or how we reference cells was different than what we commonly see now as fixed
- columns with letters and rows with numbers.
The importance of addressing may not seem evident this early in your training, but it has its
applications in certain formulas and functions. There are two types of cell references:
ABSOLUTE address and RELATIVE address. They differ mainly in how formulas respond
when they are copied into other cells.
Absolute Address
Absolute addressing refers to a fixed location on the spreadsheet. When this is used and encoded
in a formula, anywhere the formula is copied, it still references back to the same cell. This
assures the user that any change in the location of the formula will always reference the value in
the absolute address.
In absolute addresses, a dollar sign ($) precedes the column and row, eg. =$D$4.
It is also possible to keep only either the column [=$D4] or row [=D$4] absolute, but this is
seldom practiced.
HINT : A quick way to toggle absolute addressing is pressing F4 [Fn+F4 on some] while an
active cell is highlighted.
Relative Address
The other style is relative addressing. Results depend on where the formula is encoded because it
references data relative to the position of the formula like subtotals.
Relative addresses are useful to formulas that change depending on where they reference their
input data. These are particularly helpful when the formula has to be repeated across multiple
cells on a table or worksheet.
Transposing a Range
Now that we’ve done some formula creation and data handling in the previous topics, this is
another layout function that helps reorganize our data. Sometimes we may want to change the
orientation of our ranges, from columns to rows or vice versa. The function we use for this is
TRANSPOSE(). As you’re nearing the end of this topic, bear in mind that the functions you’ve
learned so far are important and will surely come in handy.
As with most commands in Excel, there are a couple of ways to execute TRANSPOSE().
Note: if data (range or array) is not a continuous block of cells, the function will only transpose
cells with data. Empty cells will not be copied.
Approach 2: TRANSPOSE()
Keyboard Shortcut:
2. Select the new cell destination you wish to place your transposed data on.
3. Key-in ALT + E + S
Now you can reorient your data to match your needs for a more effective presentation of
information.
1.5.1 Part 1
Bookmark this page
Documenting with Cell Comments and Text Box
Comments and text boxes are notes that can be inserted into any cell in Excel. Used for
reminders, notes for others, and for cross-referencing.
2. Right click and choose “new comment” a window will appear showing comment owner and
cell address of the comment.
3. Type in the comment in “start conversation box” after typing, the arrow below will turn green.
Click arrow to post comment. Click anywhere to hide the comment.
4. There will be a purple tab on the upper right corner of the cell with comments.
4. There will be a red indicator on the upper right portion of the cell.
5. The text box can be moved using the scale guides on the sides of the text box to hide note
select cell with note.
Method 1
Select cell with note. Show the note by hovering over the cell. Click on the text box & edit.
Click outside to save.
Method 2
Select cell with note, then right click. Choose edit note, edit, click outside to save.
Just a last note: For cell comment, these can be marked as resolved or can be deleted. Resolving
these will not result in the deletion of the comment. You can do this by right-clicking the cell
with the comment, click the three dots, then picking RESOLVE from the drop-down
menu. There is no option to resolve text boxes.For both cell comment and text, the author has the
option to delete the thread. This will permanently remove the comment/text box.
Refer to the graphical demonstration of creating cell comments, text boxes and resolving
comments.
Undoing Actions
This part is for undoing actions. As soon as you commit a mistake, whether it is a
small typing error or accidentally erasing the whole spreadsheet, you can easily
undo the error by executing the UNDO ACTION. Even after you have save an Excel
file, you can still undo your most recent changes
1. LEFT CLICK UNDO ICON ON THE TOP LEFT CORNER OF THE RIBBON; or
2. Press CTRL + Z
This function divides a single worksheet into two windows. It enables the user to view
data at different parts of the worksheet simultaneously.
3. Select “split”
In the GIF demonstration, you can see how screen splitting creates two windows with
identical spreadsheets, but each one viewed independent of the other.
In some cases, we need to compare data from more than one sheet or even file. If
you will need such a function, consider the following Additional Learning: for
Viewing date across multiple worksheets or files. Explore this on your own, this
feature can be found in the following tab.
VIEW >WINDOW
button on the top lefting, Deleting, Hiding, and Unhiding Rows and Columns
Method 1:
Method 2:
Method 3:
Method 1:
1. Highlight rows or columns that you wish to hide or choose the rows or columns
before and after the hidden rows or columns to unhide.
2. Select "Format" in the cells group from the home tab ribbon.
4. There must be selected rows or columns to be hidden. Otherwise, all rows and
columns will be hidden.
Method 2:
1. Highlight rows and columns that you wish to hide or choose the rows or columns
before and after the hidden rows or columns to unhide.
Method 3:
1. Highlight rows and columns that you wish to hide or choose the rows or columns
before and after the hidden rows or columns to unhide.
2. To hide a row, press ctrl + 9; to unhide a row, press (in order) ALT+O+R+U.
3. To hide a column, press ctrl + 0; to unhide a column, press (in order) ALT + O + C
+ U.
The GIF demonstrates Insert, Delete, Hide and Unhide using the ribbon, right click and
shortcut methods.
This lesson is loaded with several functions with multiple methods of execution, so
please do not forget to review the GIF, take written notes and repeat these on your
computer.
Manipulating Worksheets
These commands allow the user to CREATE, RENAME, change the ORDER, DELETE
or SELECT single or multiple worksheets.
Work space is a valuable asset, more so for your computer screen. Efficiency largely
depends on how quick you can access your data in your spreadsheet. Can you
imagine a sheet with 100 rows and 100 columns? How about 1000 rows and
columns? Excel can accommodate up to 1,048,576 rows and 16,384 columns. Try it
out.
1. At the bottom part of the Excel sheet to the right of the sheet names, click the "+"
(plus sign) to "add new sheet."
2. The default name should be "Sheet + Number". The new sheet will be located
next to the active sheet.
This topic will focus on notation or reference style. It determines how Excel “understands and
processes” cell addressing. It uses notation in the form R1C1 (a style of absolute and relative
addressing) where: R=row, C=column. This is a software-level setting , and it changes the layout
of ALL spreadsheets.
Notation
For a bit of history, older spreadsheet software handled cell addressing via R1C1 notation. This
simply meant that any cell address is written as a combination of R(rows) and C(columns) with a
corresponding number. A cell on the fourth row, under the fifth column will look like this in the
Name Box, R4C5.
On the other hand, the environment of Excel is built on the A1 reference style. It is the most
commonly used style because it is the default setting on Excel. This style determines a fixed set
of rows and columns with fixed addresses. Rows are assigned numbers on the leftmost side of
the worksheet; columns are listed with corresponding letters on the topmost of the worksheet,
hence cell addressing like A1 or D4.
2. Select "Formulas".
3. Under "Working with formulas", put a check on the "R1C1 Reference" style, then click "Ok".
It provides a visual representation with arrows of data flow and processes through tracing.
Auditors can follow formulas in a forward or backward manner to determine causes of errors.
This is vital in troubleshooting large and complicated spreadsheets.
Shows the cells related to the selected cell and formula (backward tracing)
1. Select cell to audit.
Trace Precedents(button)
- An arrow will identify sources (dot) of the data in the active cell (arrowhead).
- Each click of the "Trace Precedents" button will reveal the previous step in the
data.
Shows where the current cell and formula is used (forward tracing).
Trace Dependents(button)
Trace Dependents(button)
- An arrow will identify succeeding cells (arrowhead) of the data in the active cell
(dot).
- Each click of the "Trace Dependents" button will reveal the next step further in the
data.
Other Functions:
To remove arrows:
To show formula:
To check error/s:
To evaluate formula:
Excel automatically creates the formula, =sum() in the selected cells when the button is clicked.
This helps make data processing more efficient through mouse clicks or shortcut keys. It allows
a more continuous workflow without having to type in the formula yourself.
1. Select the cells with data for summation, plus a blank cell for the result
NOTE: When used on an array with blank cells on the rightmost and bottom cell, it will
perform across (rows) and downwards (columns).
HINT: Excel will display a green marker on the top left corner of the result if the computed
range has data adjacent to the selection.
This allows name assignments for cells or groups of cells for referencing.
By using a "Range Name", the data within the range is assigned to a variable within
Excel. The range name can be referenced in any other part of the spreadsheet for
data manipulation.
Naming Rules:
Letter
Underscore (_)
Backslash (/)
Number
Period
Underscore
3. Not Allowed
Excel shortcut
3. Type in desired name adhering to the naming rules (names are not case-
sensitive).
4. Press "Enter".
5. To rename or change, click the drop down arrow on the name box.
7. To edit range within "Name Manager", select range name then click the "edit"
button.
To delete:
NOTE: Deleting the range name does not delete the data in the cells.
1. Select cells; The first cell with header to be the name of the insert
2. Follow Method 2.
Most Excel users tend to use rather small worksheets, and therefore it is relatively easy to scroll
up to the top of the document just by using the mouse roller, pressing the UP button, or clicking
and dragging the scroll bar up to the top.
As an aspiring data professional, you will soon contend with extremely large worksheets. By
large, I mean it could be in the tens of thousands, or even hundreds of thousands of lines.
Using the roller, UP button or clicking and dragging in this instance will take too long. Instead of
these, you can simply use the Move to the Top of Sheet command by pressing CTRL + HOME.
The GIF below demonstrates how to move to the top of the sheet.
If the selected cell is blank, the cursor will go to the next non-blank cell;
or
If the selected cell has values, the function will go to the last non-blank cell.
If the selected cell is blank, the task will select from initially selected cell until the next non-
blank cell;
or
If selected cell has values the function will select from initially selected cell until the last non-
blank cell.
In some instances that we may be looking for a particular cell address, this function makes that
easy.
Alternately, the dialog box can be made to appear using the keyboard shortcut CTRL + G.
4. Type in cell reference (e.g "A1", then press "OK" to jump to the desired reference.)
A function is a predefined formula within Excel that can be repeatedly used to perform specific
actions
This helps find a specific function and guide users to the correct syntax. It provides predictive
entry for most functions. Hovering over the entry displays the syntax, a short description and
required input.
Syntax:
=FUNCTION(input)
2. From the insert function window, the user may choose to type or choose a category to search a
function.
3. Choose from the list-- a function guide and what it does will be shown below.
4. Click “ok”. Then, there will be an equals sign (=) on the selected cell and window.
Sum and Average functions are the basic operations found in Excel.
These functions allow direct entry of data or referenced cells as input. It is valuable for accurate
and fast computations. Both functions return a value -- the sum or the average.
Method 1: fx Button
Method 2:
3. Choose a cell or range of cells by left clicking mouse or dragging for multiple selection.
4. Type “)”
5. Press "enter".
NOTE: To select multiple cells or range, add comma (,) after cell or range, or simply press ctrl
& select by left clicking or dragging the mouse.
Syntax (Method 2)
=SUM(range)
Average
2. Type “=AVERAGE(”
3. Select a range of cells by left clicking to select individual cells or left click and dragging the
mouse.
*for multiple cells- press ctrl to select multiple range
5. Hit "enter".
NOTE: Fx button can also be used to select multiple cells or range, add comma (,) after cell or
range or simply press ctrl & select by left clicking or dragging the mouse.
Syntax:
=AVERAGE(range)
The GIF below will give a clear demonstration of the functions discussed.
1.9.2 COUNT, COUNTA, and COUNTBLANK Functions;
MAX and MIN Functions
Bookmark this page
Counting cells is a tedious process. Even counting ten cells takes time.
There are several ways of executing the COUNT, COUNTA, COUNTBLANK Functions:
Method 1: fx Button
Using the fx button, Excel will give suggestions on how the syntax is encoded. You can try it for
yourself, before we move on.
=COUNT(range)
=COUNTA(range)
=COUNTBLANK(range)
* Range can be selected via mouse (Click and drag over chosen cells) or coded in
start_cell:end_cell.
They identify the highest or lowest numerical values within the selected range.
These functions disregard text values, and revert it to zero if there are no numbers
in the range.
MIN function – shows the smallest number in a set of values. This ignores text and
logical values.
MAX function – shows the largest number in a set of values. This function ignores
text and logical values.
2. Type “=” followed by the function name, then open parenthesis “(“
NOTE: To select multiple cells or ranges, add comma (,) after the cell or range, or
simple press ctrl and select by left clicking or dragging the mouse.
4. Hit "enter".
Watch the GIF below and try to follow the given steps.
It checks whether a condition or criteria is met. If met, it yields "TRUE". Otherwise, it
yields "FALSE".
Syntax:
5. Pick a value: when criteria is not met; then, close parenthesis “)”
6. Press "enter".
= Equal to
˃ Greater than
˂˃ Not equal to
Additional Information:
It is possible to use this function with other functions AND() and OR() to define a
series of narrowing parameters. This is called a nested if statement. It will not be
covered in this course, but you are encouraged to read up on it and practice.
Refer to the sample sheet below to know how this function can be used.
LOOKUP functions are reference functions that use an IF argument as a logical test.
Vertical or VLOOKUP and horizontal or HLOOKUP use a reference range to find the
corresponding cell (v or h) and returns that value.
HLOOKUP – looks for value in the top row of a table and returns
the value in the same column from a specified row
Syntax:
lookup_value – the value that the function needs to look for in the given table
row_index – number of row starting from the top-most row in the table
3. Type the LOOKUP value or simply select the cell where it is located, then type comma “,”
5. Type the number of columns for VLOOKUP or row for HLOOKUP. This is the location
where the function will base the result.
6. Type comma “,”, then type “range_lookup”.
8. Hit "enter".
The GIF below will give a clear idea on how to use the functions.
1.9.4 Common Math Functions
Bookmark this page
These functions allow the user to perform common mathematical functions within cells. These
include mathematical operations between cells, simplified data presentation (round) or using
mathematical constants like pi.
=ROUND(number,num_digits)
5. Hit "enter".
B. RAND – used to return random number greater than or equal to (0) zero and less than (1)
one.
1. Type “ =RAND() “
2. Hit "enter".
C. MOD – used to find the remainder after dividing a number by another number
=MOD(number,divisor)
2. Type “ =MOD( “
5. Hit "enter".
=INT(number)
3. Hit "enter".
E. ABS – used to return the absolute value of a given number, without a sign
=ABS(number)
2. Type “=ABS(“ , then choose a number to convert followed by a close parenthesis “)”
3. Hit "enter".
=SQRT(number)
3. Hit "enter".
5. Hit "enter".
H. PI – return of value of pi
=PI()
2. Hit "enter".
=COMBIN(number,number_chosen)
Math Operations
Please refer to the graphical presentation to better understand how each function works.
This week, we focused on the familiarization of the Excel environment and its basic functions. It is
important to establish this early in your training, as future tasks get complicated. You are expected to
remember the operations to make data input and basic manipulation. As you progress through the
course, keep the habit of note-taking for your own quick reference. Next week, we will delve into data
types and encoding. It will be crucial for you to remember syntax, as succeeding functions become
complicated.