Bioinformatics Tools Project
Bioinformatics Tools Project
RESEARCH ARTICLE
Design and implementation of semester long project and
problem based bioinformatics course [version 1; peer review: 3
approved with reservations]
Geetha Saarunya , Bert Ely
Biological Sciences, University of South Carolina, Columbia, South Carolina, 29208, USA
First published: 25 Sep 2018, 7(ISCB Comm J):1547 ( Open Peer Review
v1 [Link]
Latest published: 25 Sep 2018, 7(ISCB Comm J):1547 (
[Link]
Reviewer Status
project that allowed them to pursue their individual interests. These Foundation, Alexandria, USA
completed projects were presented as talks and posters at
ISCB-RSG-SEUSA symposium held at University of South Carolina. 3 Allegra Via , Sapienza University of Rome,
Conclusions: An important outcome of this course design was that the Rome, Italy
students acquired the basic skills to critically evaluate the reporting and
Any reports and responses or comments on the
interpretation of data of a problem or a project during the symposium.
article can be found at the end of the article.
Keywords
bioinformatics education, problem-based learning, project-based learning,
hands-on course
Page 1 of 17
F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
Page 2 of 17
F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
Page 3 of 17
F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
Results
Based on the responses of the students, we assigned poten- Dataset 2. Post-class survey
tial user groups as explained in Table 1 at the start of the class [Link]
with their expected competency levels at the end of the class.
Seven students replied and two students did not reply to the
pre-course survey. We were able to obtain permission from six of Discussion
the seven students who replied to the survey to have their answers This course covered a lot of topics in 13 weeks and some
published online anonymously. Any identifying information degree of mastery was required for each topic. In addition,
in terms of names or project details have been edited from the half of the students had no familiarity with programming.
responses (Table 4). As a result, many of the students were stretched beyond their
comfort zone. However, since this was a small class, we were able
Successful completion of the project assigned to every student to work with the students individually to help them be
by the end of a course module determined their competency successful, and also tailor projects to the students’ backgrounds
of the course. In lieu of a final exam, each student designed and expectations. An important outcome of this course design
a research project, conducted appropriate analyses, and sum- was that the students acquired the basic skills to critically
marized their results in the form of a poster or a talk at the end evaluate the reporting and interpretation of data of a problem or
of the semester as part of the ISCB-RSG-SE USA (Interna- a project during the symposium.
tional society of Computational biology-Regional student
group-Southeast USA) conference held on campus on Dec 8/9 of Our leading goal was to develop a course that was respon-
2017. They also had the opportunity to listen to talks from pro- sive to the needs and background abilities of the participating
fessors working on bioinformatics projects and interacted with students. It is important to recognize that every course will
their peers from University of South Florida and University of have students at different levels of learning with different goals.
Alabama. In addition, two graduate students wrote papers on Hence when designing a course that caters to the needs of the
their projects with input from their respective research advisors. students, it may be a good idea to have a small class.
Page 4 of 17
Table 3. Summaries of course modules *.
Page 5 of 17
F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
Supplementary material
Supplementary File 1: Course syllabus and teaching materials
Click here to access the data
Page 6 of 17
F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
References
1. Searls DB: The roots of bioinformatics. PLoS Comput Biol. 2010; 6(6): e1000809. toward a definition of core competencies. PLoS Comput Biol. 2014; 10(3):
PubMed Abstract | Publisher Full Text | Free Full Text e1003496.
2. Barone L, Williams J, Micklos D: Unmet needs for analyzing biological big data: PubMed Abstract | Publisher Full Text | Free Full Text
A survey of 704 NSF principal Investigators. bioRxiv. 2017; 108555. 15. Carver T, Harris SR, Berriman M, et al.: Artemis: an integrated platform for
Publisher Full Text visualization and analysis of high-throughput sequence-based experimental
3. Madlung A: Assessing an effective undergraduate module teaching applied data. Bioinformatics. 2012; 28(4): 464–9.
bioinformatics to biology students. PLoS Comput Biol. 2018; 14(1): e1005872. PubMed Abstract | Publisher Full Text | Free Full Text
PubMed Abstract | Publisher Full Text | Free Full Text 16. Darling AE, Tritt A, Eisen JA, et al.: Mauve assembly metrics. Bioinformatics.
4. Dinsdale E, Elgin SC, Grandgenett N, et al.: NIBLSE: A Network for Integrating 2011; 27(19): 2756–7.
Bioinformatics into Life Sciences Education. CBE Life Sci Educ. 2015; 14(4): Ie3. PubMed Abstract | Publisher Full Text | Free Full Text
PubMed Abstract | Publisher Full Text | Free Full Text 17. Meyer F, Paarmann D, D'Souza M, et al.: The metagenomics RAST server - a
public resource for the automatic phylogenetic and functional analysis of
5. Via A, Blicher T, Bongcam-Rudloff E, et al.: Best practices in bioinformatics
metagenomes. BMC Bioinformatics. 2008; 9(1): 386.
training for life scientists. Brief Bioinform. 2013; 14(5): 528–37.
PubMed Abstract | Publisher Full Text | Free Full Text
PubMed Abstract | Publisher Full Text | Free Full Text
18. Parks DH, Beiko RG: Identifying biologically relevant differences between
6. Cresiski RH: Undergraduate bioinformatics workshops provide perceived
metagenomic communities. Bioinformatics. 2010; 26(6): 715–721.
skills. J Microbiol Biol Educ. 2014; 15(2): 292–4.
PubMed Abstract | Publisher Full Text
PubMed Abstract | Publisher Full Text | Free Full Text
19. Brooks AN, Yang L, Duff MO, et al.: Conservation of an RNA regulatory map
7. Banta LM, Crespi EJ, Nehm RH, et al.: Integrating genomics research
between Drosophila and mammals. Genome Res. 2011; 21(2): 193–202.
throughout the undergraduate curriculum: a collection of inquiry-based
PubMed Abstract | Publisher Full Text | Free Full Text
genomics lab modules. CBE Life Sci Educ. 2012; 11(3): 203–8.
PubMed Abstract | Publisher Full Text | Free Full Text 20. R Core Team: R: A language and environment for statistical computing. R
Foundation for Statistical Computing. Vienna, Austria. 2014.
8. Attwood TK, Blackford S, Brazas MD, et al.: A global perspective on evolving
Reference Source
bioinformatics and data science training needs. Brief Bioinform. 2017; bbx100.
PubMed Abstract | Publisher Full Text 21. Goldman M, Craft B, Swatloski T, et al.: The UCSC Cancer Genomics Browser:
update 2015. Nucleic Acids Res. 2015; 43(Database issue): D812–817.
9. Emery LR, Morgan SL: The application of project-based learning in
PubMed Abstract | Publisher Full Text | Free Full Text
bioinformatics training. PLoS Comput Biol. 2017; 13(8): e1005620.
PubMed Abstract | Publisher Full Text | Free Full Text 22. Cancer Genome Atlas Research Network, Weinstein JN, Collisson EA, et al.: The
Cancer Genome Atlas Pan-Cancer analysis project. Nat Genet. 2013; 45(10):
10. Luo J: Teaching the ABCs of bioinformatics: a brief introduction to the Applied
1113–20.
Bioinformatics Course. Brief Bioinform. 2014; 15(6): 1004–13.
PubMed Abstract | Publisher Full Text | Free Full Text
PubMed Abstract | Publisher Full Text | Free Full Text
23. Subramanian A, Tamayo P, Mootha VK, et al.: Gene set enrichment analysis: a
11. Altmäe S, Esteban FJ, Stavreus-Evers A, et al.: Guidelines for the design, knowledge-based approach for interpreting genome-wide expression profiles.
analysis and interpretation of ‘omics’ data: focus on human endometrium. Proc Natl Acad Sci U S A. 2005; 102(43): 15545–50.
Hum Reprod Update. 2014; 20(1): 12–28. PubMed Abstract | Publisher Full Text | Free Full Text
PubMed Abstract | Publisher Full Text | Free Full Text
24. Banister CE, Liu C, Pirisi L, et al.: Identification and characterization of HPV-
12. Boekel J, Chilton JM, Cooke IR, et al.: Multi-omic data analysis using Galaxy. Nat independent cervical cancers. Oncotarget. 2017; 8(8): 13375–86.
Biotechnol. 2015; 33(2): 137–9. PubMed Abstract | Publisher Full Text | Free Full Text
PubMed Abstract | Publisher Full Text
25. Saarunya G, Ely B: Dataset 1 in: Design and implementation of semester long
13. Mulder N, Schwartz R, Brazas MD, et al.: The development and application of project and problem based bioinformatics course. F1000Research. 2018.
bioinformatics core competencies to improve bioinformatics training and [Link]
education. PLoS Comput Biol. 2018; 14(2): e1005772.
PubMed Abstract | Publisher Full Text | Free Full Text 26. Saarunya G, Ely B: Dataset 2 in: Design and implementation of semester long
project and problem based bioinformatics course. F1000Research. 2018.
14. Welch L, Lewitter F, Schwartz R, et al.: Bioinformatics curriculum guidelines: [Link]
Page 7 of 17
F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
Version 1
[Link]
© 2018 Via A. This is an open access peer review report distributed under the terms of the Creative Commons Attribution
License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is
properly cited.
Allegra Via
National Research Council of Italy (CNR), Institute of Molecular Biology and Pathology (IBPM), c/o
Department of Biochemical Sciences "A. Rossi Fanelli", Sapienza University of Rome, Rome, Italy
The paper describes a semester long bioinformatics course targeting graduate seniors and graduate
students who were bench scientists in need for learning how to analyse data generated across different
‘omic technologies’.
I find it weird that “The authors haven’t come across a course that incorporates multi-omics data analyses
in a single semester.” If not in a single course, some curricula offer multi-omics data tools and analyses
spread in more than one course. A comparison of the presented course with such curricula would be of
great interest, as well as a discussion on the convenience of integrating such large amount of
bioinformatics materials in a Biological Sciences curriculum.
There is much discussion in the field on what is the best strategy to incorporate Bioinformatics in Life
Sciences curricula and I wonder whether an overload of different topics, techniques, approaches,
methods would be successful in contexts where instructor could not work individually with students.
Table 3 displays a number of features of the course’s modules. However, a well structured program of
each module is missing. As for reproducibility, a lesson plan describing how much time was allocated to
each classroom activity (lectures, work in group, hands-on, work on individual projects, types and
frequency of formative assessments, etc.) would help.
Teaching materials provided in the Supplementary materials are not structured at all. Teaching materials
are organised in modules, but navigating modules it is very difficult to understand how to use the various
files. There is no homogeneity in file names and a “readme” file describing the content of each folder (and
how to use it in reproducing the course) is missing. Slides are not annotated. In summary, materials are
not reusable in the current form and the course would not be reproducible based on them and on the
information provided in the article.
The teaching techniques/strategies used in the classroom were not described/discussed, apart from
mentioning the importance of the individual work with students. I think the article would benefit from more
details on the course design and from the description of the pedagogical approaches the instructors
adopted to teach programming and computational skills to bench scientists.
I understand that a key point was the small number of students. Nevertheless, most courses with a small
Page 8 of 17
F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
I understand that a key point was the small number of students. Nevertheless, most courses with a small
number of students and motivated instructors usually produce successful results. One big challenge is
when the number is high. It would be interesting if the authors could reason on how their course could be
translated into one for a bigger group of students. What should be definitely changed? Which other
strategies could be adopted (peer instruction? Helpers?)?
Finally, the authors use a lot of the term “competency/competencies”. There is currently quite a lot of
debate around the convenience of using competencies to describe the outcomes of courses. Indeed,
competencies can hardly be assessed and mapped on a learning trajectory. By completing a single
course, students may develop knowledge, skills and abilities (KSAs), which are measurable and
accessible objects and the development of which can be followed along a learning trajectory, rather than
competencies. Could the authors comment on this?
1. p.3 – Re the following sentence: “Practioners of bioinformatics now add techniques from statistics,
information science and engineering to develop algorithms and build predictive models to
understand the dynamics within a biological system.” In my experience, practitioners of
bioinformatics have always added techniques from statistics, information theory and engineering to
develop algorithms to predict the functioning of biological systems. The paradigm shift caused by
the rapid advances in sequencing technologies is of different kind in my opinion: in the first place,
bioinformatics has become the only approach to make sense of the deluge of biological data the
authors refer to. Moreover, the storage, management, sharing, annotation, “fairfication” of the
enormous amount of data produced, poses important technological challenges and emphasizes
the need for new professions.
2. p. 3 – In the sentence: “Practioners of bioinformatics…”, “Practioners” should be changed to
“Practitioners”. Please, check the whole manuscript for typos/misspellings.
3. p. 3 – The authors put the sentence: “However, one of the biggest challenges is the heterogeneity
of the backgrounds of the course participants” in opposition to the previous one on ISCB
competencies (“However,…”). In contrast, I believe that Bioinformatics core competencies listed in
Mulder et al. indirectly express the high degree of heterogeneity of backgrounds in bioinformatics.
4. p.3 – Re the sentence: “In fact, there are three different types of user groups that employ
bioinformatics in their research”, I would not define Bioinformatics Engineers as
bioinformatics users, but rather developers and managers/maintainers of computational tools.
5. p.4, Table 1 – There is another relevant group of bioinformatics practitioners: those who take care
of and manage data, bioinformatics resources and their interoperability and develop standards,
data quality metrics, ontologies, annotation, etc. The “big data issue” is especially relevant in the
“omics” field and, in my opinion, it would be good if the authors could mention this fourth group,
even though none of their students did belong to it.
6. p.3, In the sentence: “We sent a three-question survey (Table 2) to all the participants to
understand their reasoning for registering in the course.” I suggest that the authors replace
“reasoning” with “motivations” or “reasons”.
7. p.3, in the sentence “We also identified the following three course outcomes for the students.” The
authors say “course outcomes”. What is a course outcome? I suspect they mean “learning
outcomes”. There is quite a lot of confusion in the field around the definition and usage of “learning
objectives”, “learning outcomes” and “teaching objectives”. I suggest that the authors replace
“course outcomes” with “learning outcomes”.
8. p.3, Re Learning outcomes. The literature provides quite precise rules to write learning outcomes.
You can use the sentence “by the end of the course, students will ( NOT should) be able to”
followed by an “actionable verb”, namely a verb expressing an action or a behaviour that can be (at
Page 9 of 17
8. F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
followed by an “actionable verb”, namely a verb expressing an action or a behaviour that can be (at
least in principle) assessed. The verbs used in learning outcomes I (“identify” and “implement”) are
of this type, whereas some verbs used in II and III are not (“be comfortable”, “elicit”). Moreover, it is
a good practice to write learning outcomes that are as much specific as possible in terms of both
the cognitive complexity level they express and their content. For example, in learning outcome I,
“identify” and “implement” express two different levels of cognitive complexity and learning
outcome II includes a large variety of contents.
9. p.3, Learning outcome II. What do the authors mean by “command line programming”? Do they
mean “Linux shell scripting” or “navigating files and directories using the command line shell”? To
be able to use R statistical packages implies to be able to do (at least some) R programming. I
suggest that the authors specify this.
10. p.4, the footnote of Table 2 is misleading. What does it mean that the authors did not have the
information about programming experience in the pre-class survey answers? Did they asked
question 1 in the pre-class survey (as stated in the manuscript) or in class (as stated in the
footnote)? Were the 7 responses about programming experience? If so, this means that the
authors got 2 answers in class and 7 answers in the pre-class survey. Is this correct? Or the
pre-labsurvey is another thing? Very confusing.
11. Table 2. Survey questions sent out to the students - As question 1 is about “programming
experience”, please notice that “using bioinformatics software” is not “programming”.
12. For consistency with answers to questions 1 and 2, please specify the distribution of answers to
question 3.
13. p.4, Re the sentence: “Based on the responses of the students, we assigned potential user groups
as explained in Table 1 at the start of the class with their expected competency levels at the end of
the class.”, I have three main concerns: 1) I don’t see where competency levels at the end of the
class are listed (unless the authors are now calling “competency levels” what they called
“characteristics” in Table 1. Should this be the case, in no way can students acquire the
characteristics listed in Table 1 by completing the course described in this paper; 2) Competencies
are yes/no objects, which means either an individual has a competency or they don’t have it.
Therefore, it may be problematic to talk about “competency levels”; it may be perhaps more
appropriate to talk about knowledge, skills or abilities (KSAs) levels; 3) If by “class” you mean a
series of lectures on a subject, could you specify at the end of which class (a module? The entire
course?) you defined “expected competency levels”? As a side note, a single class can possibly
increase the level of a KSA, surely not allow students to acquire a competency.
14. p. 4: in this sentence: “Successful completion of the project assigned to every student by the end of
a course module determined their competency of the course.” It is not clear what do the authors
mean by “competency of the course”. Do they mean that the competency acquired in a module
determined students’ competency in the whole course?
15. p. 6: In the sentence: “We determined the competency of a student per module by their successful
completion of the problem set and or the project.” what do the authors mean by “successful
completion of the problem set and or the project”? There were students who did not successfully
complete the project? How did instructors grade them?
Is the work clearly and accurately presented and does it cite the current literature?
Partly
Are sufficient details of methods and analysis provided to allow replication by others?
No
Page 10 of 17
F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
No
Are all the source data underlying the results available to ensure full reproducibility?
Partly
Reviewer Expertise: Protein structural bioinformatics, protein structure and function prediction and
analysis, and protein interactions. Programming and software development. Science of
learning, educational psychology, cognitive sciences, and (bioinformatics) curriculum development.
I confirm that I have read this submission and believe that I have an appropriate level of
expertise to confirm that it is of an acceptable scientific standard, however I have significant
reservations, as outlined above.
[Link]
© 2018 Pauley M. This is an open access peer review report distributed under the terms of the Creative Commons
Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original
work is properly cited.
Mark A. Pauley
National Science Foundation, Alexandria, VA, USA
“Design and Implementation of Semester Long Project and Problem based Bioinformatics Course”
describes a “multi-omics” bioinformatics course at the University of South Carolina intended for advanced
undergraduates and graduate students. The course was implemented in Fall 2017; nine students took it.
Per the authors, the primary learning objective of the class was to introduce students “to the breadth and
depth of the field of Bioinformatics for ‘omics’ data analyses.” The course was divided into seven modules
(e.g., “Genome Assembly and Annotation,” “Comparative Genomics”). Each module had an associated
graded problem set, and students completed a research project at the end of the course. A
three-question, pre-course survey was used to place students into user groups—bioinformatics tool
users, bioinformatics data scientists, and bioinformatics engineers.
The article has many strengths. The authors make a compelling case for the need for courses like it to
prepare students for graduate school and to address the need for specialists in the field, and they do a
good job of putting their course in the context of other bioinformatics education efforts. The contents of
the course are clearly laid out (Table 3), and the authors provide a large amount of material (syllabus,
slide decks, problem sets) developed for the class as a supplementary file—both will be invaluable for
others wishing to implement the entire course or parts of it. As how a course could be improved is often
more instructive than what went well, their discussion of potential changes in subsequent iterations of the
Page 11 of 17
F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
more instructive than what went well, their discussion of potential changes in subsequent iterations of the
class is very helpful. Finally, the article is clearly written and easy to read.
That said, the manuscript has several issues that should be addressed. First, a number of references are
potentially mis-cited. For example, References 6 and 7 cite a Global Organization for Bioinformatics
Learning (GOBLET) study that showed that basic data stewardship skills are only taught in 25% of
education programs. However, neither of these papers mention the GOBLET survey or the 25% statistic.
In addition, References 11 and 12 do not deal with bioinformatics courses and Reference 15 does not
discuss the competencies of different bioinformatics users as their use would imply. Similarly, I am
concerned about the bioinformatics user groups given in Table 1. Specifically, the descriptions of the
three groups are very similar to the three personas described in Reference 14, and the name of one
(bioinformatics engineer) is the same (the names of the other two are almost the same). In short, it’s not
clear if the authors are restating the results of Reference 14 or are proposing a slightly different grouping.
Although the posted resources are clearly an important contribution, I found them to be incomplete in one
important aspect. In particular, the authors state that every module had a problem set/project associated
with it, but this was missing from three of the seven modules. Furthermore, a brief description of the final
research projects the students worked on would be helpful as it would indicate what the students were
able to do at the end of the semester.
In addition to the above, very little is provided in terms of results. One of the results seems to be the
placement of students into the three user groups. However, how the results of the pre-course survey were
used to place the students into these groups and if and how they impacted the way in which the course
was taught is not clear. Similarly, Table 4 and the corresponding description of it in the narrative,
particularly the use of the word “expected,” is confusing. Does Column 3 of the table refer to the group a
given student was in at the end of the semester or where they were expected to be at some other point in
the semester? In any event, how was this determined? Although the course evaluation is helpful in
understanding how students felt the course went, I would have liked to have seen more assessment
results, particularly if the learning objectives of the course had been met. In general, the paper would be
strengthened by the results of another iteration of the course, one in which the proposed changes had
been made and the learning gains of the students were assessed.
As previously mentioned, the article is well-written. However, I did notice two small errors. The first
sentence of “Course design” should probably be “We had nine students register for the course.” Also,
“bioinformatics” is incorrectly capitalized in “This course was designed to provide a structured
Bioinformatics course. . .”.
Is the work clearly and accurately presented and does it cite the current literature?
Partly
Are sufficient details of methods and analysis provided to allow replication by others?
Partly
Are all the source data underlying the results available to ensure full reproducibility?
Partly
Page 12 of 17
F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
Partly
I confirm that I have read this submission and believe that I have an appropriate level of
expertise to confirm that it is of an acceptable scientific standard, however I have significant
reservations, as outlined above.
[Link]
© 2018 Schwartz R. This is an open access peer review report distributed under the terms of the Creative Commons
Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original
work is properly cited.
Russell Schwartz
Department of Biological Sciences and Computational Biology Department, Carnegie Mellon University ,
Pittsburgh, PA, USA
Saarunya and Ely describe a problem-based bioinformatics course designed to meet a need for “next
generation data scientists” in the life sciences, a need identified by many current efforts in life sciences
education. Case studies of course development efforts like this can be valuable to those seeking to
develop similar courses or incorporate those courses into their curricula and looking for ideas or for pitfalls
to avoid. The authors do a good service for the field in putting out their efforts and lessons learned in a
form from which other educators can benefit. The specific effort here is a nice example of a small
project-focused course serving a cohort with some diversity of backgrounds and immediate training
needs. While it presents just one small example, that description might reasonably apply to courses many
training programs are developing or would like to develop. In addition to the article itself, the
supplementary material includes a full syllabus, lecture slides, assignments, and some supplementary
materials, increasing its value to others looking to develop course materials in this space.
The authors make a good case for the need for new courses along these lines. They back that need up
well with appropriate citations to the relevant literature on life sciences and bioinformatics education. The
manuscript provides a good background on prior efforts to characterize the need for bioinformatics
training, identify the specific skills required by future life scientists, and how those skills are or are not
being provided in practice. The authors further give reasonable consideration to challenges to the design
of bioinformatics curricula that they expected to confront in this effort. On the latter point, they might also
refer to Williams et al. (20171), which identified a number of other recurring challenges to bioinformatics
education in the life sciences. Others in the field might appreciate the perspective of these authors on
whether any of the challenges Williams et al. identified were encountered in their effort and, if so, how
they were overcome.
The course itself covers a nice range of topics in applied bioinformatics, which might be expected to meet
Page 13 of 17
F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
The course itself covers a nice range of topics in applied bioinformatics, which might be expected to meet
the needs of a diverse set of likely users. The course materials provided in the supplement might
therefore find a good audience. One general concern, though, is that the supplementary materials contain
some third-party resources, for which it might be more appropriate to include a reference or link rather
than the material itself. The teaching approach is fairly applied, with a lot of focus on specific data
resources and software, although with some attention to principles behind these resources. While some
user communities might favor an approach more grounded in the principles and theory, the focus here
seems typical of many bioinformatics courses aimed primarily at biology students. The authors might do a
bit more to justify the balance of focus on practice versus theory, with reference to efforts at identifying
specific bioinformatics competencies needed by their likely user community, several of which the paper
cites.
The Results present some interesting material in the form of a pre-class survey and post-class course
evaluation material. While the cohort here is a single small sample, some useful lessons can be drawn
about the diversity of backgrounds and needs of even a small group like this. The paper would be
considerably stronger with some more serious assessment of whether the learning objectives of the
course were met. That is a non-trivial undertaking and cannot be done retroactively, but might be worth
considering for a future iteration of the class if it is being continued. The materials do include results of a
university-run course evaluation, which provide some indication of how students felt about the course,
although that is different from showing how successfully they learned the material. This post-class
evaluation makes for some interesting reading, although if it is being included with the paper, it might bear
some comment in the Results and Discussion.
It would be useful also to see some comparison to other similar course material available in publicly
accessible forms. While that is a difficult moving target, comparing to a few alternatives from prominent
course repositories or MOOCs, particularly to highlight the unusual or especially innovative features of
this course, would be valuable.
The paper does a nice job of presenting some lessons learned in the Discussion. It is commendable that
the authors spend some time on what did not work so well in this class and consider how it might be done
differently in the future. One would ideally like to see this taken further via a more comprehensive
formative assessment process – with problems identified via a formal assessment, solutions proposed,
and those solutions demonstrated to be effective in a re-assessment. It is understandable that that may
be beyond the scope of a one-off paper like this, though, and it is nonetheless easy to see how others
developing a class in this domain might benefit from the advice given here to avoid some of the same
pitfalls.
Beyond these more specific technical points, the document is clear and generally well-written. I noted just
a couple of minor errors:
p. 4: ``International society of Computational biology’’ should be ``International Society for
Computational Biology’’.
p. 4: ``Regional student group – Southeast USA’’ should be ``Regional Student Group – Southeast
USA’’.
References
1. Williams J, Drew J, Galindo-Gonzalez S, Robic S, et al.: Barriers to Integration of Bioinformatics into
Undergraduate Life Sciences Education. bioRxiv. 2017. Publisher Full Text
Is the work clearly and accurately presented and does it cite the current literature?
Page 14 of 17
F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
Is the work clearly and accurately presented and does it cite the current literature?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Partly
Are all the source data underlying the results available to ensure full reproducibility?
Yes
I confirm that I have read this submission and believe that I have an appropriate level of
expertise to confirm that it is of an acceptable scientific standard, however I have significant
reservations, as outlined above.
The authors would like to thank Dr. Schwartz for his in-depth and insightful feedback on the paper.
Following are the comments from the authors, which will be incorporated into the final version of
the paper after the second and third referees' feedback:
The authors recognize the contributions made by 'Williams et al.*' in identifying the challenges of
introducing bioinformatics to life-science students. These issues are already addressed in the
paper in the following ways:
(i) Faculty issues (training): The authors’ training and background gave them an opportunity to
design a multi-project/problem based course. But the post-module projects/problem sets were
based on the background of the students. And this was possible because of the small class size.
(ii) Faculty issue (time): This course was designed with inputs from the students based on their
needs and training. Hence a lot of time was spent on the course design followed by making
changes/adjustments to the course during the implementation.
(iii) Student issue (Background skills): The authors addressed the gaps in student's computational
and statistical training by offering additional learning modules. The authors have also addressed
the problems faced by the students and ways to tackle them in the future under ‘Discussion’
section.
(iv) Student issue (Interest): As an applied Bioinformatics course, the students had an opportunity
Page 15 of 17
F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
(iv) Student issue (Interest): As an applied Bioinformatics course, the students had an opportunity
to apply their learning to solve problems and projects in their area of interest/background. Active
engagement and participation of the students was encouraged throughout the course by timely
submission of projects and problem sets.
2. The authors recognize the need to have a better competency assessment of the students’ pre-
and post-course. In future, this can be accomplished in the form of pre-course problem solving and
post-course problem solving to ensure that the students meet the set learning objectives. The
course in the current format had the student’s research, design, address and present their learning
(with emphasis on critical evaluation and problem solving) in the form of a project presented as a
talk/poster in the research symposium held at the end of the semester. To protect the student’s
data/projects, the final posters and presentations are not included in this paper.
3. As most of the participants were classified as 'Bioinformatics tool users' the authors chose to
focus on applied bioinformatics as opposed to Bioinformatics theory. In order to have a
bioinformatics focused theory class designed to address every 'omic' problem, the authors believe
that it would be prudent to have just one or two modules together and introduce theory and
problem/projects pertaining to the same.
4. The authors have cited the third-party resources in the main paper with reference numbers in the
supplementary materials. The authors will add the supplementary references in supplementary
section and main references in the main paper.
5. The course design and challenges addressed in this paper are pertaining to the small class size
and may not accurately reflect the challenges faced at the level of MOOC learning. But the authors
can add references to MOOC courses that offer similar style of training in the background section.
*Reference:
* Williams J, Drew J, Galindo-Gonzalez S, Robic S, Dinsdale E, Morgan W, Triplett E, Burnette J,
Donovan S, Elgin S, Fowlks E, Goodman A, Grandgenett N, Goller C, Hauser C, Jungck J,
Newman J, Pearson W, Ryder E, Wilson Sayres M, Sierk M, Smith T, Tosado-Acevedo R,
Tapprich W, Tobin T, Toro-Martínez A, Welch L, Wright R, Ebenbach D, McWilliams M, Rosenwald
A, Pauley M: Barriers to Integration of Bioinformatics into Undergraduate Life Sciences Education.
bioRxiv. 2017
Page 16 of 17
F1000Research 2018, 7(ISCB Comm J):1547 Last updated: 09 APR 2020
You can publish traditional articles, null/negative results, case reports, data notes and more
Page 17 of 17