0% found this document useful (0 votes)
12 views2 pages

Topics For Lab

setryrdrhgjfgjfg

Uploaded by

rsevrse
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
12 views2 pages

Topics For Lab

setryrdrhgjfgjfg

Uploaded by

rsevrse
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd

2) Data Cleaning Features in Spreadsheet

Topics for Lab: Other Special Features of Sorting and Filtering:


1. Data Cleaning Tools and Techniques  Sorting displays data in a specific order,
2. Data Cleaning Features in Spreadsheet Microsoft Excel provides several features to help clean data, often to reveal patterns, trends, or
3. Sorting and Filtering such as: relationships.
4. Data Cleaning Verifying and Reporting Results  Filtering displays only the rows that meet
5. Capturing Cleaning Changes (your) specific criteria, effectively hiding the
1) Fill data automatically in worksheet cells: This
rows that do not match your conditions.
feature allows filling of data automatically in worksheet
1) Data Cleaning Tools and Techniques  Both Sorting and Filtering allow you to focus
cells based on patterns in the data.
on specific subsets of your data, making it
2) Create and format tables: Creating and formatting
easier to analyze and draw insights
All Data collected must be subjected to Data cleaning through tables in your spreadsheet, which can make it easier to
Data processing. work with large datasets.
4) Data Cleaning, Verifying and Reporting Results
3) Create a macro: Automating repetitive tasks in your
spreadsheet.
Data preprocessing involves identifying and correcting or
4) Check spelling and grammar: Checking the spelling Data cleaning is the process of identifying and
removing inaccurate, incomplete, or irrelevant data from a
and grammar of your data. resolving potential data inconsistencies or errors
dataset.
5) Filter for unique values or remove duplicate values: to improve the quality of your data.
Filtering for unique values or remove duplicate values
Techniques in Cleaning Data: in your data.
 It involves reviewing, analyzing,
6) Find and replace text: Finding and replacing text in
detecting, modifying, or removing ‘dirty’
your data.
1) Removing duplicates: This technique involves data to make your dataset ‘clean’ 1.
7) Change the case of text: Changing the case of text in
identifying and removing identical records from a your data.
dataset. 8) Remove spaces and nonprinting characters from Data validation at the time of data entry or
2) Removing irrelevant data: This involves identifying and text: Involves removal of spaces and nonprinting collection helps you minimize the amount of data
removing data that is not relevant to the analysis. characters from text in your data. cleaning you’ll need to do.
3) Standardizing capitalization: This is converting all text 9) Fix numbers and number signs: Fixing numbers and
to a consistent case format. number signs in your data.
4) Converting data types: converting data from one type 10) Fix dates and times: Fixing dates and times in your  After data collection, you can use data
to another, such as converting text to numbers. data. standardization and data transformation
5) Clearing formatting: removing any formatting from the to clean your data 1.
data, such as bold or italicized text.
6) Fixing errors: identifying and correcting errors in the Data verification is the process of ensuring that
data. 3) Sorting and Filtering the data is accurate, complete, and consistent. It
7) Language translation: translating data from one involves checking the data for errors,
language to another. inconsistencies, and missing values.
Sorting and filtering are powerful techniques to manage and
8) Handling missing values: identifying and handling  Data verification is an essential step in
analyze data in spreadsheets.
missing data in the dataset. ensuring that the data is reliable and
Most Popular Tools Available in Cleaning Data: can be used for analysis and decision-
(through open source and SaaS Tools)  Sorting allows arranging data in a specific order, making 2.
revealing patterns and trends.
 Filtering helps in focusing on specific subsets of data. Reporting results is the process of presenting the
1) OpenRefine: A free, open-source tool for working with
o Advanced filtering techniques provide even findings of your data analysis. It involves
messy data.
greater control over the data analysis. summarizing the data, identifying patterns and
2) RapidMiner: A data science platform that includes data
o Includes trends, and drawing conclusions.
cleaning and preparation tools.
3) Talend Data Preparation: A cloud-based data  custom number and text filters,
 wildcards,  The goal of reporting results is to
preparation tool that allows users to clean and prepare
 date filters, and communicate the insights gained from
data for analysis.
 filtering by color or icon the data analysis to stakeholders in a
4) Data Ladder Cleansing Tool: A data cleaning tool that
clear and concise manner
uses machine learning algorithms to identify and
 Combining sorting and filtering can help draw insights 5) Capturing Cleaning Changes
correct errors in data.
5) Rattle: A free, open-source data mining tool that and allows data-driven decisions quickly, making these
skills essential for anyone working with spreadsheets. The Benefits of Effective Data Cleansing
includes data cleaning and preparation tools.
https://www.techtarget.com/searchdatamanagement/definition/
data-scrubbing
In the long term, that saves time and money,
When data cleaning is done well, data cleansing provides because IT (Information Technology) and
benefits to data management, business or organization in data management teams do not have to
general. continue fixing the same errors in data
sets.
Benefits of Effective Data Cleansing

1) Improved decision-making.

With more accurate data, analytics


applications can produce better results.

That enables organizations to make more


informed decisions on business strategies
and operations, as well as things like
patient care and government programs.

2) More effective marketing and sales.

Customer data is often wrong, inconsistent


or out of date (many customers, by nature,
don’t mind about data integrity or quality,
and just provide whatever comes to mind.
Many customers hate being asked and
disturbed).
Cleaning up the data in customer
relationship management and sales
systems is very important because it helps
improve the effectiveness of marketing
campaigns and sales efforts.

3) Better operational performance.

Clean, high-quality data helps


organizations avoid inventory shortages,
delivery snafus and other business
problems that can result in higher costs,
lower revenues and even damaging
relationships with customers.

4) Increased use of data.

Data has become a key corporate asset, but


it can't generate business value if it isn't
used.

By making data more trustworthy, data


cleansing helps convince business
managers and workers to rely on it as part
of their jobs.

5) Reduced data costs.

Data cleansing stops data errors and issues


from further propagating in systems and
analytics applications.

You might also like