100% found this document useful (1 vote)

658 views4 pages

Bioinformatics Course SBT 410 Outline

This document outlines the course details for Bioinformatics (SBT 410) including the aim, course description, objectives, contents, methodology, assessment, attendance policy, and references. The course focuses on biological databases, sequence analysis, genome analysis, and gene mapping using computational tools. Students will learn fundamental concepts in bioinformatics and how to analyze biological data. The course will be examined through assignments, practicals, tests, and a final exam weighing 60% of the overall grade. Attendance of at least 80% of lectures is recommended.

Uploaded by

Cecilia Mukototsi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

658 views4 pages

Bioinformatics Course SBT 410 Outline

Uploaded by

Cecilia Mukototsi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Course Outline
Course Contents
Methodologies and Approaches
References

School of Industrial Sciences & Technology

Department: Biotechnology
SBT 410 : Bioinformatics (SBT 410)
Lecturer : Mr. C. Mawere/ Mr N. Ncube
Email ID : cmawere@[Link]/nqncube@[Link]

COURSE OUTLINE

Aim

The course focuses on information search, data retrieval, genome analysis and gene mapping. Students are
introduced to Biological Data bases and their management. These include SQL (Sequence Query Language),
Searching of databases similar sequence; The NCBI; Publicly available tools; Resources at EBI; Resources
on the web; Database mining tools. Pair wise and multiple sequence alignment, scoring matrices, secondary
structure predictions are subjects include. Finally genome analysis and gene mapping using analysis. Tools
for Sequence Data Bank, sequence homology searching using BLAST and FASTA, FASTA and BLAST
Algorithms comparison. Much of the work will require use of internet to deduce gene sequences and
structures of proteins under study.

Course Description

In this course, students learn fundamental concepts and methods in bioinformatics, a field at the intersection
of biology and computing. It surveys a wide range of topics including biological database searching,
computational sequence analysis, sequence homology searching and motif finding, gene finding and genome
annotation, protein structure analysis and modeling, and biological knowledge discovery.

Course objectives

By the end of the course, students should be able to;

 Familiarize with some of the basic computational problems in bioinformatics.

 Familiarize with basic methods and tools for solving computational problems in bioinformatics.
 Analyze biological data set with available computational tools and methods.
 Understand and explain the basic biochemical pathways of importance to biotechnologists,
 Demonstrate an understanding of how to perform, interpret and report on in silico analysis.
Course Contents

Week Content
Definition and History of Bioinformatics, Internet and Bioinformatics, Data
Introduction 1 Management Analysis, Introduction to Data Mining: string mining, Text
to mining, KDD for bioinformatics.
Bioinformatic
2 Applications of Data Mining to Bioinformatics Problems, Applications of
s Bioinformatics.

Major Bioinformatics resources: NCBI, EBI, ExPASY, UNIPROT. The

Biological 3 knowledge of various databases and bioinformatics tools available at these
Data resources, organization of databases: data contents and formats, purpose and
Resources utilities in bioinformatics.
Access to Molecular Biology Databases through: Entrez, Sequence Retrieval
4 System (SRS), Macro Molecular Structural Databases: PDB, NDB, MMDB.
Protein structural classification systems CATH, SCOP, Introduction to pathway
databases: KEGG, BRENDA.
Concept of homology and sequence evolution (substitutions, conservation and
Sequence 5 INDELS), Concept of sequence alignment; different measures of sequence
analysis similarity (%identity, % similarity), pair-wise sequence comparisons, Dynamic
programming as applicable to global (Needleman-Wunch) and local (Smith-
Waterman) sequence alignments.
6 Pair-wise substitution scoring matrices (PAM and BLOSSUM); gap penalties,
Heuristic methods for homology detection FASTA and, BLAST and their
variants (Blast n, Blast p, x-Blast etc.,) PSSM and PSI-BLAST.

Multiple sequence alignments; multi-dimensional dynamic programming for

Multiple 7 multiple sequence alignment (MSA), Heuristic approaches for MSA;
Sequence progressive sequence, iterative alignment method; Clustal W/X, concept of
analysis dendrogram and its interpretation.
Detection of motifs; construction of sequence profile; Block Maker, MEME,
8
MACAW LOGOS and MAST, Introduction to homology modeling and protein
model analysis-PROCHECK, RAMPAGE, ERRAT,ProSA, VERIFY3D.

Concept of biological clock, Concept of Phylogenetic Trees, Comparison of

Molecular 9
Phylogenetic Trees and MSA.
Phylogenetics
Methods of Evaluation for Phylogenies; character based methods, distance
10
based methods for Phylogenetic, bootstrap method, Packages for Phylogenetic
studies like PHYLIP, PAUP, TREE VIEW etc.

Genomics 11 Introduction to genome, large scale genome sequencing strategies, Genome

assembly and annotation, Gene Identification: Introduction, methods of gene
predictions. Gene prediction tools; GRAIL, GENSCAN, FGENES, GenLang,
Gene Parser, Procrustes,
DNA/RNA structure and Function analysis: Poly A site Prediction, TATA
12 signaling, Promoter & Transfactor Bind site prediction, ORF prediction, Splice
Site prediction, Repetitive DNA & CpG island analysis, tRNA Gene prediction.

Methodologies/Approaches

 Lectures,
 Tutorials,
 Lab sessions,
 Group work

Course Assessment

The course will be taken over one semester and will be examined by a written examination and
assessment of assignments, practical and tests as follows;

Final Examination Theory Paper 60%

Continuous Assessment Theory (Assignments and Tests) 15%
Continuous Assessment Practical 25%
a) Tests – 3.
b) Written assignments
c) Term exam paper
d) Practical Lab assignments - based on each chapter.
e) Group presentations

Attendance
It is recommended that you attend all lectures. Students may not be allowed to sit for the exam if they
fail to attend at least 80% of the lecture sessions. Students are responsible for all material presented in
class or during practical sessions including course procedures. The course syllabus is defined by the
lecture content. However this can only lay out the essentials of the subject. You are therefore
encouraged to explore topics further by reading a number of reference texts including those listed in this
outline.

References

Baxevanis, A. D. and Oullette, B. F. 2003. Bioinformatics; A Practical Guide to the Analysis of

Genes and Proteins, 3 ed. John Wiley & Sons, Inc, New Delhi.

Leach, A. R. 1996. Molecular Modeling, Principles & Applications. Addison Wesley Longman,
Singapore.

Lesk, A. M. 2002. Introduction to Bioinformatics. Oxford University Press.

Mount, D. W. 2004. Bioinformatics: Sequence and Genome Analysis. Cold Spring Harbor
Laboratory Press.

Primrose, S.B. and Twyman, R.M. 2007. Principles of Genome Analysis and Genomics. Blackwell
Publishing Company, Oxford, UK.

Rastogi, S.C., Mendiratta, N. and Rastogi, P. 2004. Bioinformatics: Concepts, Skills &
Applications. CBS Publishers & Distributors, New Delhi.
Xiong, J. 2006. Essential Bioinformatics. Cambridge University Press, Cambridge, UK.

Common questions

The bootstrap method evaluates phylogenies by resampling data to create multiple datasets, constructing trees for each, and assessing tree stability by calculating the percentage of times specific groupings appear across all trees. Factors considered include the number of replicates and the underlying model assumptions. Packages like PHYLIP and PAUP automate this process, providing robust statistical tools to handle data, execute resampling, and visualize the consensus trees derived from bootstrap analyses, ensuring reliable phylogenetic inferences .

Large-scale genome sequencing involves challenges such as handling vast amounts of data, ensuring sequence accuracy, and assembling sequences into complete genomes. Methodologies include shotgun sequencing and newer technologies like next-generation sequencing. Post-sequencing, assembly involves piecing together fragments, while annotation involves identifying gene regions and functional elements. Tools like GENSCAN and GRAIL assist in gene prediction by using statistical models to identify coding regions within the sequence based on known gene structures, significantly reducing manual annotation effort .

Multi-dimensional dynamic programming improves MSA by optimally aligning multiple sequences simultaneously, maintaining consistent alignment across all sequences. However, it is computationally intensive and impractical for large datasets. Heuristic approaches like Clustal W/X offer computational efficiency by using progressive alignment methods, but they can miss optimal solutions due to their reliance on initial pair-wise alignments and guide trees, which may not accurately represent evolutionary relationships in all datasets .

FASTA and BLAST are both used for sequence homology searching, but they differ in their approach and efficiency. FASTA, an older tool, aligns sequences using a simplified version of the Smith-Waterman algorithm and is generally considered more rigorous but slower. BLAST, on the other hand, employs heuristic methods to quickly find local alignments, making it much faster. Their variants, such as Blastn, Blastp, and PSI-BLAST, enhance these methods by tailoring the search to specific types of sequences (nucleotide, protein) and improving detection of distant homologs through profile alignments .

Data mining in bioinformatics involves extracting useful patterns and knowledge from large biological datasets, which goes beyond simple data retrieval that focuses on accessing and organizing specific data. It can address problems such as identifying gene variants, predicting protein functions, and discovering potential drug targets. Data mining techniques like string mining and knowledge discovery in databases (KDD) are used to analyze complex biological relationships and structures .

The concept of the biological clock refers to the constant rate at which specific genes or proteins evolve over time, allowing the estimation of time divergence between species. In molecular phylogenetics, this concept helps calibrate evolutionary trees, where the rate of molecular changes is treated as proportional to time, aiding in reconstructing the evolutionary relationships and lineage diversifications among species using phylogenetic trees .

Pair-wise substitution scoring matrices like PAM and BLOSSUM are critical for sequence alignment as they provide the scores for evaluating the likelihood of character substitutions in an alignment. PAM matrices are derived from closely related proteins and predict short-term evolutionary changes, while BLOSSUM matrices are based on observed substitutions in more divergent sequences, thus better for general use with diverse datasets. The choice of matrix affects alignment outcomes; PAM matrices are generally used for sequences with high similarity, while BLOSSUM matrices are more suitable for distantly related sequences .

Homology modeling is based on predicting a protein's structure using the known structure of a homologous protein as a template. The accuracy of the modeled structure largely depends on the sequence identity between the target and template proteins. Validation tools such as PROCHECK, RAMPAGE, and VERIFY3D play a crucial role by assessing the quality of protein models. PROCHECK evaluates stereochemical properties, RAMPAGE assesses Ramachandran plots, and VERIFY3D checks the compatibility of the 3D structure with its sequence, thereby ensuring reliable models for further functional analysis .

Sequence Retrieval Systems such as Entrez and SRS enhance database accessibility by providing user-friendly interfaces for querying and retrieving relevant biological data across multiple databases. Entrez integrates diverse datasets, offering powerful search capabilities and cross-linking between different types of biological information, while SRS allows customized queries and data retrieval from various molecular biology repositories. These systems improve the usability of databases, facilitating efficient data management and analysis for researchers .

Pathways databases like KEGG and BRENDA provide comprehensive data on various biochemical pathways, allowing researchers to understand interactions and functions within a biological system. KEGG integrates genomic, chemical, and systemic functional data to map pathways, while BRENDA offers enzyme-specific information. Researchers can access these databases through various interfaces and tools that enable them to trace metabolic pathways, simulate biochemical reactions, and explore enzymatic functions and regulations within cellular processes .

Bioinformatics Historical Timeline
No ratings yet
Bioinformatics Historical Timeline
10 pages
NGS Workflow Overview and Steps
No ratings yet
NGS Workflow Overview and Steps
22 pages
Overview of GenBank in Bioinformatics
No ratings yet
Overview of GenBank in Bioinformatics
31 pages
Types of Biological Databases Explained
100% (1)
Types of Biological Databases Explained
39 pages
NGS and Bioinformatics Overview
No ratings yet
NGS and Bioinformatics Overview
5 pages
Designing Effective PCR Primers
No ratings yet
Designing Effective PCR Primers
14 pages
Omics Technology: October 2010
No ratings yet
Omics Technology: October 2010
28 pages
Single-Nucleotide Polymorphism
No ratings yet
Single-Nucleotide Polymorphism
21 pages
M.Sc. Bioinformatics Syllabus Overview
No ratings yet
M.Sc. Bioinformatics Syllabus Overview
42 pages
Bioinformatics II Course Overview
100% (1)
Bioinformatics II Course Overview
91 pages
qPCR Data Analysis and Interpretation Guide
No ratings yet
qPCR Data Analysis and Interpretation Guide
5 pages
Primer Design in Bioinformatics Guide
No ratings yet
Primer Design in Bioinformatics Guide
54 pages
Overview of Next Generation Sequencing Technologies
No ratings yet
Overview of Next Generation Sequencing Technologies
12 pages
Bioinformatics
No ratings yet
Bioinformatics
55 pages
Lab 1A - Exploring Ncbi: Bioinformatic Methods I Lab 1
No ratings yet
Lab 1A - Exploring Ncbi: Bioinformatic Methods I Lab 1
22 pages
Sequence Alignment Methods Overview
100% (1)
Sequence Alignment Methods Overview
34 pages
PCR Troubleshooting and Optimization Guide
No ratings yet
PCR Troubleshooting and Optimization Guide
2 pages
Bioinformatics Lab Manual Overview
No ratings yet
Bioinformatics Lab Manual Overview
14 pages
Bioinformatics/Computationa L Tools For NGS Data Analysis: An Overview
No ratings yet
Bioinformatics/Computationa L Tools For NGS Data Analysis: An Overview
81 pages
PCR Inhibitors
No ratings yet
PCR Inhibitors
13 pages
Bioinformatics Practical Exercises Report
No ratings yet
Bioinformatics Practical Exercises Report
63 pages
3 PCR Troubleshooting
100% (1)
3 PCR Troubleshooting
6 pages
Nucleic Acid Extraction Techniques Guide
No ratings yet
Nucleic Acid Extraction Techniques Guide
27 pages
Introduction to Omics in Healthcare
No ratings yet
Introduction to Omics in Healthcare
46 pages
Bioinformatics Lab Course Overview
No ratings yet
Bioinformatics Lab Course Overview
3 pages
Homology Modeling in Protein Prediction
No ratings yet
Homology Modeling in Protein Prediction
17 pages
Bioinformatics and the Human Genome Project
No ratings yet
Bioinformatics and the Human Genome Project
44 pages
Bioinformatics Exercises on TIGR and BLAST
100% (1)
Bioinformatics Exercises on TIGR and BLAST
6 pages
FASTA Tools for Protein and DNA Similarity
No ratings yet
FASTA Tools for Protein and DNA Similarity
33 pages
Primer Design Guidelines for PCR
No ratings yet
Primer Design Guidelines for PCR
37 pages
PCR Primer Design Guide
100% (1)
PCR Primer Design Guide
5 pages
Overview of Biological Databases
No ratings yet
Overview of Biological Databases
7 pages
PCR Primer Design Guidelines
No ratings yet
PCR Primer Design Guidelines
33 pages
Next-Gen Sequencing Sample Prep Guide
100% (2)
Next-Gen Sequencing Sample Prep Guide
25 pages
PCR Troubleshooting Guide
No ratings yet
PCR Troubleshooting Guide
47 pages
Genomics and Bioinformatics Project
No ratings yet
Genomics and Bioinformatics Project
17 pages
Bioinformatics: Using NCBI BLAST Tool
100% (1)
Bioinformatics: Using NCBI BLAST Tool
21 pages
Sanger vs. NGS: Sequencing Techniques Explained
0% (1)
Sanger vs. NGS: Sequencing Techniques Explained
31 pages
Bioinformatics: An Overview of Techniques
100% (1)
Bioinformatics: An Overview of Techniques
41 pages
Genome Sequencing and Assembly Challenges
No ratings yet
Genome Sequencing and Assembly Challenges
225 pages
Comprehensive Protein Purification Guide
No ratings yet
Comprehensive Protein Purification Guide
6 pages
Mutiplexpcr Primer Design
100% (1)
Mutiplexpcr Primer Design
11 pages
Overview of BLAST in Bioinformatics
100% (1)
Overview of BLAST in Bioinformatics
21 pages
Instruction Manual, Iscript Select cDNA Synthesis Kit, Rev B
0% (1)
Instruction Manual, Iscript Select cDNA Synthesis Kit, Rev B
2 pages
Gene Annotation and Prediction Overview
No ratings yet
Gene Annotation and Prediction Overview
12 pages
Bioinformatic Tools For Next Generation DNA Sequencing - PHD Thesis
100% (1)
Bioinformatic Tools For Next Generation DNA Sequencing - PHD Thesis
237 pages
Gene Prediction Tools Overview
No ratings yet
Gene Prediction Tools Overview
49 pages
Bioinformatics: Basics and Applications
No ratings yet
Bioinformatics: Basics and Applications
232 pages
Understanding Nucleic Acids: DNA & RNA
100% (1)
Understanding Nucleic Acids: DNA & RNA
18 pages
Bioinformatics and Genomics Course Overview
No ratings yet
Bioinformatics and Genomics Course Overview
12 pages
Introduction to Bioinformatics Course
No ratings yet
Introduction to Bioinformatics Course
3 pages
Bioinformatics Course Overview and Details
No ratings yet
Bioinformatics Course Overview and Details
3 pages
M.Sc. Bioinformatics Curriculum Overview
No ratings yet
M.Sc. Bioinformatics Curriculum Overview
33 pages
Bioinformatics Course Overview and Objectives
No ratings yet
Bioinformatics Course Overview and Objectives
3 pages
Bioinformatics Course Syllabus BIO310
No ratings yet
Bioinformatics Course Syllabus BIO310
6 pages
Genomics and Bioinformatics Course Overview
No ratings yet
Genomics and Bioinformatics Course Overview
5 pages
Bio310: Bioinformatics Course Outline
No ratings yet
Bio310: Bioinformatics Course Outline
3 pages
BINC SYLLABUS For Paper III 2018
No ratings yet
BINC SYLLABUS For Paper III 2018
9 pages
Bioinformatics Course Overview BTT302
No ratings yet
Bioinformatics Course Overview BTT302
6 pages
MMB 2314 Course Outline
No ratings yet
MMB 2314 Course Outline
2 pages
Form 3 Biology Exam Instructions
No ratings yet
Form 3 Biology Exam Instructions
1 page
Understanding the COSC-COSD Formula
No ratings yet
Understanding the COSC-COSD Formula
4 pages
Food Biotechnology Practical Manual
100% (2)
Food Biotechnology Practical Manual
17 pages
The Nucleus
No ratings yet
The Nucleus
36 pages
Overview of Gene Transfer Techniques
No ratings yet
Overview of Gene Transfer Techniques
35 pages
Learn Microbiology Online Medical Microbiology Guide: News Ticker
No ratings yet
Learn Microbiology Online Medical Microbiology Guide: News Ticker
19 pages
Butyric Acid Glycerides in Broiler Diets
No ratings yet
Butyric Acid Glycerides in Broiler Diets
8 pages
mRNA and Fat Synthesis Processes
No ratings yet
mRNA and Fat Synthesis Processes
4 pages
CRISPR's Role in Future Healthcare
No ratings yet
CRISPR's Role in Future Healthcare
13 pages
Cardiomyopathy Gene Test Results
No ratings yet
Cardiomyopathy Gene Test Results
14 pages
Understanding the Carbon Cycle
No ratings yet
Understanding the Carbon Cycle
3 pages
Antioxidant and Antimicrobial Study of Sesquiterpenes
No ratings yet
Antioxidant and Antimicrobial Study of Sesquiterpenes
5 pages
Catalog of Biological Sciences 2024
No ratings yet
Catalog of Biological Sciences 2024
846 pages
Grade 6 Science Syllabus 2021-2022
No ratings yet
Grade 6 Science Syllabus 2021-2022
4 pages
Photosystems and Photosynthetic Pathways
No ratings yet
Photosystems and Photosynthetic Pathways
63 pages
Roche Taq DNA Polymerase Guide
No ratings yet
Roche Taq DNA Polymerase Guide
4 pages
Ethylene's Role in Cereal Root Angles
No ratings yet
Ethylene's Role in Cereal Root Angles
12 pages
Zacharias Janssen: Microscope Pioneer
No ratings yet
Zacharias Janssen: Microscope Pioneer
8 pages
Microorganisms in Food Safety and Spoilage
No ratings yet
Microorganisms in Food Safety and Spoilage
45 pages
Nutrient Cycling Test Questions
No ratings yet
Nutrient Cycling Test Questions
5 pages
GFC vs DFC in Aquatic Ecosystems
No ratings yet
GFC vs DFC in Aquatic Ecosystems
3 pages
Cows' Natural Behavior at Milking Time
No ratings yet
Cows' Natural Behavior at Milking Time
62 pages
JHS Science First Term Exam Guide
No ratings yet
JHS Science First Term Exam Guide
6 pages
Insect Internal Anatomy Lab Guide
No ratings yet
Insect Internal Anatomy Lab Guide
19 pages
Streptomyces Growth and Preservation Techniques
No ratings yet
Streptomyces Growth and Preservation Techniques
20 pages
Osmosis in Red Onion Cells Experiment
No ratings yet
Osmosis in Red Onion Cells Experiment
3 pages
RNA Structure and Function Overview
No ratings yet
RNA Structure and Function Overview
45 pages
Exploring the Rann of Kutch Ecosystem
No ratings yet
Exploring the Rann of Kutch Ecosystem
1 page
New Alleles and Antibiotic Resistance
No ratings yet
New Alleles and Antibiotic Resistance
12 pages
Plasmid DNA & Protein Electrophoresis
No ratings yet
Plasmid DNA & Protein Electrophoresis
19 pages
MIC and MBC Determination Methods
No ratings yet
MIC and MBC Determination Methods
5 pages
Class IX Science MCQ Practice Paper
No ratings yet
Class IX Science MCQ Practice Paper
2 pages
Lab Guide: Observing Plant & Animal Cells
No ratings yet
Lab Guide: Observing Plant & Animal Cells
7 pages
South Pacific Form 7 Biology Exam 2021
No ratings yet
South Pacific Form 7 Biology Exam 2021
18 pages
IB Biology Project: Ecosystem Stability & Change
No ratings yet
IB Biology Project: Ecosystem Stability & Change
14 pages
Anatomy vs. Physiology Explained
No ratings yet
Anatomy vs. Physiology Explained
12 pages

Bioinformatics Course SBT 410 Outline

Uploaded by

Bioinformatics Course SBT 410 Outline

Uploaded by

School of Industrial Sciences & Technology

By the end of the course, students should be able to;

 Familiarize with some of the basic computational problems in bioinformatics.

Major Bioinformatics resources: NCBI, EBI, ExPASY, UNIPROT. The

Multiple sequence alignments; multi-dimensional dynamic programming for

Concept of biological clock, Concept of Phylogenetic Trees, Comparison of

Genomics 11 Introduction to genome, large scale genome sequencing strategies, Genome

Final Examination Theory Paper 60%

Baxevanis, A. D. and Oullette, B. F. 2003. Bioinformatics; A Practical Guide to the Analysis of

Lesk, A. M. 2002. Introduction to Bioinformatics. Oxford University Press.

Common questions

What are the factors considered in the bootstrap method for evaluating phylogenies, and how do packages like PHYLIP and PAUP facilitate this process?

What are the challenges and methodologies involved in large-scale genome sequencing, assembly, and annotation, and how do tools like GENSCAN and GRAIL aid in gene prediction?

How does multi-dimensional dynamic programming improve multiple sequence alignment (MSA), and what are the limitations of heuristic approaches such as Clustal W/X?

What are the key differences between the FASTA and BLAST algorithms in the context of sequence homology searching, and how do their variants enhance these methods?

How does the application of data mining differ from general data retrieval in bioinformatics, and what specific problems in bioinformatics can data mining address?

Explain the concept of the biological clock and how it influences molecular phylogenetic analyses to construct evolutionary trees.

In what ways do pair-wise substitution scoring matrices like PAM and BLOSSUM contribute to sequence alignment, and what are the implications of using different matrices?

Discuss the principles of homology modeling and the role of validation tools like PROCHECK, RAMPAGE, and VERIFY3D in protein model analysis.

How do sequence retrieval systems like Entrez and SRS enhance the accessibility and usability of molecular biology databases for researchers?

How do pathways databases like KEGG and BRENDA enhance our understanding of biochemical pathways, and what tools are available for researchers to utilize these databases?

You might also like