IR Chapt 5

Uploaded by

Magarsa Bedasa

Information Retrieval

Chapter 5:
Retrieval Evaluation
IR Evaluation

• Once an IR system has been designed, it is very important to measure or evaluate its performance and accuracy.

• According to Singhal (2001), there are two main things to measure in an IR system: its effectiveness and its efficiency.
Cont..
• Effectiveness: the quality of being able to bring about an intended effect.
• How capable is the system of retrieving relevant documents from the collection?
• It is about user satisfaction.

• Efficiency: the ratio of the output to the input of any system.
• Skillfulness in avoiding wasted time and effort.
• It is about time and space.

…cont

• To measure ad hoc information retrieval effectiveness in the standard way, we need a test collection consisting of three things:
1. A document collection
2. A test suite (set) of information needs, expressible as queries
3. A set of relevance judgments, standardly a binary assessment of either relevant or nonrelevant for each query-document pair
Document collection

• Specific questions that might be considered when gathering documents include:

1. How many items should be gathered?
2. What items should be sampled to create the document collection?
3. What about copyright constraints?
Example (N=128)
….cont

• The standard approach to information retrieval system evaluation revolves around the notion of relevant and nonrelevant documents.
• With respect to a user information need, each document in the test collection is given a binary classification as either relevant or nonrelevant.
• This decision is referred to as the gold standard or ground truth judgment of relevance.
Mind Break
A document is relevant if it addresses the stated
information need, not because it just happens to
contain all the words in the query.

How ?
Types of Evaluation Strategies

• System-centered studies
– Given documents, queries, and relevance judgments
• Try several variations of the system
• Measure which system returns the “best” hit list

• User-centered studies
– Given several users, and at least two retrieval systems
• Have each user try the same task on both systems
• Measure which system works “best” for the users' information needs
Performance measures (Recall, Precision, etc.)

• The two most frequent and basic measures of information retrieval effectiveness are:
1. Precision
2. Recall
Precision

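• Precision (P) is the fraction of retrieved documents that are relevant.
• It reflects the ability to retrieve top-ranked documents that are mostly relevant.
• Expressed as a percentage, precision is the share of retrieved documents that are relevant to the query (i.e. the number of retrieved documents that are relevant, divided by the total number of retrieved documents).
Precision Formula
Precision = retrieved relevant docs / retrieved docs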
Recall
• Recall (R) is the fraction of relevant documents that are retrieved.

– The ability of the search to find all of the relevant items in the corpus
– Expressed as a percentage, recall is the share of relevant documents in the database that are retrieved in response to a user's query.
Recall Formula
Recall = retrieved relevant docs / relevant docs
Question

• When do you think precision or recall takes the value 100%? Sometimes both precision and recall can equal 100% (that is, 1). How can we justify this value?
Example
Examples
An IR system returns 8 relevant documents and 10 nonrelevant documents. There are a total of 20 relevant documents in the collection.
a. What is the precision of the system on this search?
b. What is its recall?
c. What is the F-measure?
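The three questions above can be checked with a few lines of Python. This is only a sketch: the function name `precision_recall_f1` is made up for illustration, and the F-measure is assumed to be the balanced one (equal weight on precision and recall).

```python
def precision_recall_f1(relevant_retrieved, nonrelevant_retrieved, total_relevant):
    """Return (precision, recall, balanced F-measure) from raw counts."""
    retrieved = relevant_retrieved + nonrelevant_retrieved
    # Precision is undefined when nothing is retrieved; recall when nothing is relevant.
    precision = relevant_retrieved / retrieved if retrieved else float("nan")
    recall = relevant_retrieved / total_relevant if total_relevant else float("nan")
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Numbers from the example: 8 relevant and 10 nonrelevant documents returned,
# 20 relevant documents in the collection.
p, r, f = precision_recall_f1(8, 10, 20)
print(f"precision={p:.3f} recall={r:.3f} F={f:.3f}")
```

So precision is 8/18, recall is 8/20, and the balanced F-measure works out to 16/38.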
R-Precision

• Precision at the R-th position in the ranking of results for a query, where R is the total number of relevant documents.

• It requires having a set of known relevant documents, from which we calculate the precision of the top R documents returned
– Calculate precision after R documents are seen
– Can be averaged over all queries
Example
Example 2:
Exercise
• Given a query q, for which the relevant documents are d1, d6, d10, d15, d22, d26, an IR system retrieves the following ranking: d6, d2, d11, d3, d10, d1, d14, d15, d7, d23.
• Compute the precision and recall for this ranking at each retrieved document.
Cont..
Cont..
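• Relevant documents were found at positions 1, 5, 6, and 8. Dividing the sum of the precision values at those positions by the 6 relevant documents in the collection (the two relevant documents that were never retrieved contribute zero), the average precision is (1.0+0.40+0.50+0.50)/6=0.40. The R-precision is the precision at position 6, which is 3/6=0.50.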
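The exercise can be reproduced with a short Python sketch. The document IDs and the ranking come from the exercise; the function and variable names are illustrative only.

```python
relevant = {"d1", "d6", "d10", "d15", "d22", "d26"}
ranking = ["d6", "d2", "d11", "d3", "d10", "d1", "d14", "d15", "d7", "d23"]


def precision_recall_at_each_rank(ranking, relevant):
    """Return one (rank, doc, is_relevant, precision, recall) row per retrieved document."""
    rows, hits = [], 0
    for k, doc in enumerate(ranking, start=1):
        is_rel = doc in relevant
        if is_rel:
            hits += 1
        rows.append((k, doc, is_rel, hits / k, hits / len(relevant)))
    return rows


rows = precision_recall_at_each_rank(ranking, relevant)
for k, doc, is_rel, p, r in rows:
    print(f"{k:2d} {doc:4s} {'R' if is_rel else 'N'}  P={p:.2f}  R={r:.2f}")

# Average precision: sum the precision at each rank where a relevant document
# appears, then divide by the total number of relevant documents (6 here).
average_precision = sum(p for _, _, is_rel, p, _ in rows if is_rel) / len(relevant)

# R-precision: precision after R documents have been seen, with R = 6.
r_precision = rows[len(relevant) - 1][3]

print(f"AP={average_precision:.2f}  R-precision={r_precision:.2f}")
```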
Problems with both precision and recall
• The number of irrelevant documents in the collection is not taken into account.
• Recall is undefined when there is no relevant document in the collection.
• Precision is undefined when no document is retrieved.
Other measures
• Noise = retrieved irrelevant docs / retrieved docs
• Silence/Miss = non-retrieved relevant docs / relevant docs

Noise = 1 – Precision; Silence = 1 – Recall

F-measure

• A single measure that trades off precision versus recall is the F measure, which is the weighted harmonic mean of precision and recall:
F_β = (1 + β²) · P · R / (β² · P + R)
• With equal weights (β = 1), it is the plain harmonic mean of recall and precision, one measure of performance that takes both into account:
F = 2 · P · R / (P + R)
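As a rough illustration of the weighting, the sketch below uses the standard form F_β = (1 + β²)·P·R / (β²·P + R), where β > 1 favours recall and β < 1 favours precision. The function name `f_measure` and the sample values P = 0.5, R = 0.25 are chosen here only for illustration.

```python
def f_measure(precision, recall, beta=1.0):
    """Weighted harmonic mean of precision and recall (F-beta)."""
    b2 = beta * beta
    denom = b2 * precision + recall
    # Return 0 when the harmonic mean is undefined (zero denominator).
    return (1 + b2) * precision * recall / denom if denom else 0.0


balanced = f_measure(0.5, 0.25)               # beta = 1: plain harmonic mean
recall_heavy = f_measure(0.5, 0.25, beta=2)   # pulled toward recall (0.25)
precision_heavy = f_measure(0.5, 0.25, beta=0.5)  # pulled toward precision (0.5)
print(round(balanced, 3), round(recall_heavy, 3), round(precision_heavy, 3))
```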
Exercise
• The following list of Rs and Ns represents relevant (R) and nonrelevant (N) documents in a ranked list of 20 documents retrieved in response to a query from a collection of 10,000 documents. The top of the ranked list (the document the system thinks is most likely to be relevant) is on the left of the list. The list contains 6 relevant documents. Assume that there are 8 relevant documents in total in the collection.
RRNNNNNNRNRNNNRNNNNR
Questions
• Calculate the following:

a) What is the precision of the system on the top 20?
b) What is the recall?
c) What is P@10?
d) What is the F-measure on the top 20?
e) Assume that these 20 documents are the complete result set of the system. What is the MAP for the query?
f) What is the noise?
g) What is the silence?
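One way to check answers to this exercise is the sketch below. It assumes the balanced F-measure, and it takes MAP for a single query to be that query's average precision, with the two relevant documents that were never retrieved contributing zero. Variable names are illustrative.

```python
judgments = "RRNNNNNNRNRNNNRNNNNR"  # ranked list from the exercise, best rank first
total_relevant = 8                  # relevant documents in the whole collection


def precision_at(k):
    """Precision over the top k results."""
    return judgments[:k].count("R") / k


precision = precision_at(len(judgments))           # (a) precision on the top 20
recall = judgments.count("R") / total_relevant     # (b) recall
p_at_10 = precision_at(10)                         # (c) P@10
f1 = 2 * precision * recall / (precision + recall) # (d) balanced F-measure

# (e) Average precision: precision at each rank holding a relevant document,
# summed and divided by all 8 relevant documents in the collection.
average_precision = sum(
    precision_at(k) for k, j in enumerate(judgments, start=1) if j == "R"
) / total_relevant

noise = 1 - precision   # (f)
silence = 1 - recall    # (g)

print(f"P={precision:.2f} R={recall:.2f} P@10={p_at_10:.2f} F={f1:.3f}")
print(f"AP={average_precision:.3f} noise={noise:.2f} silence={silence:.2f}")
```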
Difficulties in Evaluating IR System

• IR systems essentially facilitate communication between a user and document collections
• Relevance is a measure of the effectiveness of communication
– Effectiveness is related to the relevancy of retrieved items.
– Relevance relates to a problem, an information need, a query, and a document or surrogate
……..cont

• Relevance judgments are made by

– The user who posed the retrieval problem

– An external judge

– Are the relevance judgments made by users and by an external person the same?

• Relevance judgment is usually:

……….cont

– Subjective: depends upon a specific user’s judgment.
– Situational: relates to the user’s current needs.
– Cognitive: depends on human perception and behavior.
– Dynamic: changes over time.
Information Retrieval

Chapter 6:
Query Languages and Operations
Introduction
• Information is the main value of the information society.

• Depending on the particular application scenario and on the type of information that has to be managed and searched, different techniques need to be devised.

• The dictionary definition of a query is a set of instructions passed to a database to retrieve particular data.
Cont….

• A query is the formulation of a user information need.

• A query is composed of keywords, and the documents containing such keywords are searched for. Keyword queries are popular because they are intuitive, easy to express, and allow fast ranking.
Cont…
• Query language (QL) refers to any computer programming language that requests and retrieves data from databases and information systems by sending queries.

• Query languages: a source language consisting of procedural operators that invoke functions to be executed.
Keyword-based queries

• Queries are combinations of words.
• The document collection is searched for documents that contain these words.
• Word queries are intuitive, easy to express, and provide fast ranking.
Popular keyword-based queries are:
1. Single-word queries:
• A query is a single word.
• Simplest form of query.
• All documents that include this word are retrieved.
• Documents may be ranked by the frequency of this word in the document.
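A single-word query with frequency ranking might look like the following sketch. The toy collection `docs`, the tokenizer, and the function names are all assumptions made for illustration; a real system would use an inverted index rather than scanning every document.

```python
import re
from collections import Counter

# Hypothetical toy collection.
docs = {
    "d1": "Retrieval systems rank documents; retrieval quality matters.",
    "d2": "Evaluation of retrieval systems",
    "d3": "Query languages and operations",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def single_word_query(word, docs):
    """Return (doc_id, frequency) pairs for documents containing the word,
    ranked by how often the word occurs (ties broken by doc_id)."""
    word = word.lower()
    hits = []
    for doc_id, text in docs.items():
        tf = Counter(tokenize(text))[word]
        if tf > 0:
            hits.append((doc_id, tf))
    return sorted(hits, key=lambda pair: (-pair[1], pair[0]))


result = single_word_query("retrieval", docs)
print(result)
```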
Cont…
2. Phrase queries:
• A query is a sequence of words treated as a single unit.
• Also called a “literal string” or “exact phrase” query.
• The phrase is usually surrounded by quotation marks.
• All documents that include this phrase are retrieved.
• Usually, separators (commas, colons, etc.) and “trivial words” (e.g., “a”, “the”, or “of”) in the phrase are ignored.
Cont…
• In effect, this query is for a set of words that must appear in sequence.
• It allows users to specify a context and thus gain precision.

• Example: “United States of America”.
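A phrase match along these lines can be sketched as follows. The stop-word set holds only the three “trivial words” named on the slide, and the function names are hypothetical; both the document and the phrase are reduced to their remaining words before the sequences are compared.

```python
import re

# The "trivial words" named on the slide; a real list would be longer.
STOP_WORDS = {"a", "the", "of"}


def content_words(text):
    """Lowercase, drop separators (punctuation) and trivial words."""
    return [w for w in re.findall(r"[a-z0-9]+", text.lower()) if w not in STOP_WORDS]


def contains_phrase(document, phrase):
    """True if the phrase's content words appear consecutively, in order."""
    doc, target = content_words(document), content_words(phrase)
    n = len(target)
    return n > 0 and any(doc[i:i + n] == target for i in range(len(doc) - n + 1))


phrase = "United States of America"
match = contains_phrase("He moved to the United States of America in 1990.", phrase)
wrong_order = contains_phrase("America and the United States", phrase)
print(match, wrong_order)
```

The second call fails because the words are present but not in sequence, which is exactly the extra context a phrase query adds over a plain set of words.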

Cont…
3. Multiple-word queries:

• A query is a set of words (or phrases).

• Two interpretations:
– A document is retrieved if it includes any of the query words.
– A document is retrieved if it includes each of the query words.
Cont..
• Documents may be ranked by the number of query words they contain: a document containing n query words is ranked higher than a document containing n-1 query words.
• Documents containing all the query words are ranked at the top.
• Documents containing only one query word are ranked at the bottom.
• Frequency counts may still be used to break ties among documents that contain the same query words.
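The two interpretations and the ranking rule can be sketched as below. The collection, the `require_all` flag, and the tie-breaking by total frequency of the matched words are illustrative assumptions, not a prescribed implementation.

```python
import re
from collections import Counter

# Hypothetical toy collection.
docs = {
    "d1": "information retrieval evaluation",
    "d2": "information theory and information systems",
    "d3": "retrieval of information about retrieval",
    "d4": "query languages",
}


def tokenize(text):
    return re.findall(r"[a-z0-9]+", text.lower())


def multi_word_query(query, docs, require_all=False):
    """Return (doc_id, matched_words, total_frequency) tuples.

    require_all=False: retrieve documents containing ANY query word.
    require_all=True:  retrieve documents containing EACH query word.
    Ranking: more distinct query words first, then higher total frequency.
    """
    terms = set(tokenize(query))
    results = []
    for doc_id, text in docs.items():
        counts = Counter(tokenize(text))
        matched = [t for t in terms if counts[t] > 0]
        if not matched or (require_all and len(matched) < len(terms)):
            continue
        results.append((doc_id, len(matched), sum(counts[t] for t in matched)))
    return sorted(results, key=lambda r: (-r[1], -r[2], r[0]))


any_hits = multi_word_query("information retrieval evaluation", docs)
all_hits = multi_word_query("information retrieval evaluation", docs, require_all=True)
print(any_hits)
print(all_hits)
```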
Cont…
4. Proximity queries:
• Restrict the distance within a document between two search terms.
• Important for large documents in which the two search words may appear in different contexts.
• Proximity specifications limit the acceptable occurrences and hence increase the precision of the search.
Cont….
• General format: Word1 within m units of Word2. The unit may be a character, word, paragraph, etc.
Example:
• nuclear within 0 paragraphs of cleanup
Finds documents that discuss “nuclear” and “cleanup” in the same paragraph.

• united within 5 words of american
Finds documents in which “united” and “american” appear at most 5 words apart.
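A minimal sketch of the "Word1 within m units of Word2" format follows. It assumes distance is the difference between unit indexes (so 0 paragraphs means the same paragraph) and that paragraphs are separated by blank lines; the function names and sample text are hypothetical.

```python
import re

TOKEN = re.compile(r"[a-z0-9]+")


def unit_positions(text, word, unit):
    """Indexes of the units (words or paragraphs) in which the word occurs."""
    word = word.lower()
    if unit == "word":
        return [i for i, t in enumerate(TOKEN.findall(text.lower())) if t == word]
    if unit == "paragraph":
        paragraphs = re.split(r"\n\s*\n", text)
        return [i for i, p in enumerate(paragraphs) if word in TOKEN.findall(p.lower())]
    raise ValueError(f"unsupported unit: {unit}")


def within(text, word1, word2, m, unit="word"):
    """True if some occurrence of word1 is at most m units from word2."""
    first = unit_positions(text, word1, unit)
    second = unit_positions(text, word2, unit)
    return any(abs(i - j) <= m for i in first for j in second)


text = "The united front of several american firms.\n\nNuclear cleanup began."
print(within(text, "united", "american", 5))                    # 4 words apart
print(within(text, "nuclear", "cleanup", 0, unit="paragraph"))  # same paragraph
```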

Structural queries

• So far, we assumed documents that are entirely free of structure.
• Structured documents would allow more powerful queries.
• Queries could combine text queries with structural queries: queries that relate to the structure of the document.
• Mixing contents and structure in queries:

Cont…
• Contents: words, phrases, or patterns, and
• Structural constraints: containment, proximity, or other restrictions on structural elements
Example
• Example: Retrieve documents that contain a page in which the phrase “terrorist attack” appears in the text and a photo whose caption contains the phrase “World Trade Center”.

• The corresponding query could be: same page (“terrorist attack”, photo (caption (“World Trade Center”))).
Types

• Three main structures:
– Fixed structure
– Hypertext structure
– Hierarchical structure
Fixed structure
• The document is divided into a fixed set of fields, much like a filled form.
• Fields may be associated with types, such as date.
• Each field has text, and fields cannot nest or overlap.
• Queries (multiple-word, Boolean, proximity, patterns, etc.) are targeted at particular fields.
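To make the fixed-structure idea concrete, the sketch below models each document as a flat record with the same set of fields, one of them typed as a date, and aims a word query at one named field. The records, field names, and the `published_after` filter are invented for illustration.

```python
from datetime import date

# Hypothetical records: every document has the same flat, non-nested fields.
records = [
    {"id": "d1", "title": "Retrieval evaluation", "body": "precision and recall",
     "published": date(2020, 1, 15)},
    {"id": "d2", "title": "Query languages", "body": "keyword and proximity queries",
     "published": date(2021, 6, 1)},
    {"id": "d3", "title": "Relevance feedback", "body": "evaluation of query reformulation",
     "published": date(2022, 3, 9)},
]


def field_has_word(record, field, word):
    """True if the word occurs in the given text field of the record."""
    return word.lower() in str(record[field]).lower().split()


def search(records, field, word, published_after=None):
    """Word query targeted at one field, optionally restricted by the date field."""
    hits = []
    for record in records:
        if not field_has_word(record, field, word):
            continue
        if published_after is not None and record["published"] <= published_after:
            continue
        hits.append(record["id"])
    return hits


print(search(records, "title", "evaluation"))  # matches the title field only
print(search(records, "body", "evaluation"))   # same word, different field
print(search(records, "body", "queries", published_after=date(2021, 1, 1)))
```

The same word gives different results depending on the targeted field, which is the point of a fixed-structure query.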
Hypertext structure
Hierarchical structure

• Intermediate model between fixed structure and hypertext.
• The “anarchic” hypertext network is restricted to a hierarchical structure.
• The model allows recursive decomposition of documents.
• Queries may combine regular text queries, which are targeted at particular areas (the target area is defined by a “path expression”), and queries on the structure itself; for example, “retrieve documents with at least 5 sections”.
Cont…..
Relevance feedback

• After initial retrieval results are presented, allow the user to provide feedback on the relevance of one or more of the retrieved documents.
• The system uses this feedback information to reformulate the query and produces new results based on the reformulated query.
• This allows a more interactive, multi-pass process.
RF
• The idea of relevance feedback (RF) is to involve the user in the retrieval process so as to improve the final result set.

• In particular, the user gives feedback on the relevance of documents in an initial set of results.
The basic procedure is:
• The user issues a (short, simple) query.
• The system returns an initial set of retrieval results.
• The user marks some returned documents as relevant or nonrelevant.
• The system computes a better representation of the information need based on the user feedback.
• The system displays a revised set of retrieval results.
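The slides do not say how the "better representation" is computed. One widely used option, shown here purely as an illustration, is the Rocchio update on term-weight vectors: the new query is α times the original query, plus β times the mean of the relevant documents, minus γ times the mean of the nonrelevant documents, with negative weights dropped. The weights α = 1.0, β = 0.75, γ = 0.15 and the toy vectors are assumptions.

```python
from collections import Counter


def rocchio(query, relevant_docs, nonrelevant_docs, alpha=1.0, beta=0.75, gamma=0.15):
    """Reformulate a query vector from user feedback (Rocchio update).

    All vectors are dicts mapping term -> weight.
    """
    new_query = Counter()
    for term, weight in query.items():
        new_query[term] += alpha * weight
    # Move toward the centroid of relevant documents, away from nonrelevant ones.
    for docs, coeff in ((relevant_docs, beta), (nonrelevant_docs, -gamma)):
        for doc in docs:
            for term, weight in doc.items():
                new_query[term] += coeff * weight / len(docs)
    # Terms pushed to zero or below are dropped from the reformulated query.
    return {term: w for term, w in new_query.items() if w > 0}


query = {"jaguar": 1.0}
marked_relevant = [{"jaguar": 1.0, "cat": 2.0}]
marked_nonrelevant = [{"jaguar": 1.0, "car": 3.0}]

new_query = rocchio(query, marked_relevant, marked_nonrelevant)
print(new_query)
```

In this toy case the reformulated query gains the term "cat" from the document marked relevant, while "car" ends up with a negative weight and is dropped.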

Architecture
THE END OF:

Chapter 5 and 6
