Biomanager: The Use of A Bioinformatics Web Application As A Teaching Tool in Undergraduate Bioinformatics Training
Biomanager: The Use of A Bioinformatics Web Application As A Teaching Tool in Undergraduate Bioinformatics Training
1093/bib/bbm039
Advance Access publication August 22, 2007
Corresponding author. Sonia Cattley, Sydney Bioinformatics, Medical Foundation Building (K25), The University of Sydney, Sydney
NSW 2006, Australia. Tel: þ61 2 9036 3306; Fax: þ61 2 9036 3234; E-mail: [email protected]
Sonia Cattley is the Education Officer at Sydney Bioinformatics and is charged with training scientists in workshops around Australia
with the application of bioinformatic tools in relation to their particular area of study.
Jonathan Arthur is the Director of Sydney Bioinformatics. He is also a Senior Lecturer in Bioinformatics within the Faculty of
Medicine at the University of Sydney.
ß The Author 2007. Published by Oxford University Press. For Permissions, please email: [email protected]
458 Cattley and Arthur
applications are run. Furthermore, many of these examples by combining two or more applications
students are interested in making effective and in the solution of a particular problem, as output
appropriate use of bioinformatics applications for from one application needs to be cut and pasted to
furthering their biological research, rather than be used as input for another, often with reformatting
seeking to understand the information technology of the data required in between the cut and the paste.
and algorithms in, and of, themselves. This The educator must also constantly revise the training
latter stream is generally included as part of a material to reflect changes in the Uniform Resource
biotechnology or general biological science degree. Locator (URL) of the resource. Finally, many online
BioManager has been utilized as a training environ- resources are not hosted on dedicated servers and
ment within this latter stream in Australia, New thus may periodically become unavailable or may
Zealand and Malaysia. not return results in a reasonable space of time due to
Students graduating in molecular biology, veter- fluctuating load. In a study of the stability and
government, non-profit and industry researchers subject to fluctuations due to the use of other non-
since 1991. Whilst its main purpose is to provide related applications on the same server. Finally, the
for the needs of Australian researchers, it is available integrated nature of the BioManager environment
throughout the world and already used in New provides the student with the same ‘look and feel’
Zealand and Malaysia. The current web interface, to the different applications, aiding in concentration
BioManager, was installed in 2001 and has over 2800 and reducing frustration and inconvenience, and
active research-based users. This system is a web enables examples involving multiple applications to
application and is thus potentially accessible any- be constructed by automatically moving output from
where in the world via the Internet. The application one application into a second application via the
currently runs on a group of servers including a Sun correct data transformations.
E450 (2 440 MHz CPUs, 2 GB RAM), a Sun
E3500 (6 400 MHz CPUs, 2 GB RAM, 2 TB HD)
Select a file
Figure 1: A screenshot of the BioManager workbench. Selecting the GO option will extract all suitable programs
to analyse the selected file.
use as a research tool, means it is used in numerous to their account at a later time to view the results
environments, including remote areas of regional (Figure 1). As the computationally intensive analyses
Australia with less reliable internet connection and are run via a queuing mechanism the user interface is
reduced access to bandwidth, and sites outside of relatively ‘light’ and can be accessed worldwide over
Australia. There have been few, if any, reported a regular internet connection.
difficulties arising from the specific environment One of the main features of BioManager is the
where BioManager has been used. automatic formatting of an output file from one
The interface, and all generated data, is centrally program in such a way that it can be used as the input
housed on a server at the University of Sydney. for a related program even where the two programs
Access is via the web using a login and password are not from the same package. For instance, a
assigned by the tutor. Each student is given a unique multiple sequence alignment generated through
account and all files created during the class are ClustalW [9] can be used as input to a phylogenetic
stored in this account. The account can be accessed program from the Phylip [10] package. All format-
at any time and at any location. The tutor can share a ting requirements between suites of programs are
series of files with all students and students can share dealt with automatically by the BioManager applica-
files between each other (to facilitate working on tion. This prevents the need to download and install
group assignments). Bioinformatics analyses are run applications to handle the format changes, where
on the central server. All analyses are completed via a such applications exist, or perform the changes
queuing mechanism, being directed to the most manually, where they do not. It also allows the
appropriate available processor. Students do not need student to concentrate on the biological analysis
to wait until the job has finished but can come back without regard to data format transformation.
BioManager: a teaching tool 461
Students wishing to understand more about the data to the biological conclusions generated by their
underlying file formats can still access the raw file analysis.
formats as required. In feedback from the universities concerned, all
BioManager also allows both program directed (‘I classes have some formal class time dedicated to the
want to create a phylogenetic tree—what program interface, where students submit the same analyses at
can I use to do that and which of my data files can be roughly the same time from multiple computers as
used as input?’) and data directed (‘‘I have a sequence they work through set exercises. At other times,
I want to know more about—what program or students either work on the system in their own time
analysis can I use on this file?’’) approaches using or continue working at pre-determined class times
either the Program Index or the Workbench, but on their own projects. Here, job submissions
respectively. This helps guide the student to the are more sporadic and there is not the en-masse
required data or alternatively make the student aware submission as seen in the initial class or classes. In
of different ways of analysing their data they may not more than half of current classes, students are given
have been aware of previously. assignments based on the use of BioManager.
BioManager saves the history of all analyses The University of Sydney offers a number
completed. This history can be displayed by viewing of Units of Study that include teaching in the
an output and selecting the History view option. In field of bioinformatics. Some of these units make
addition to the ancestry of the result (sequences and use of the BioManager interface (e.g. Mole-
other output files generated along the path to the cular Biotechnology—MOBT3101, Molecular
final result), the parameters used in the generation of Genetics and Inheritance—BIOL5001, Pharmaco-
the select result are recorded (Figure 2). This allows kinetics and Pharmacogenetics—PHAR3630, Cell
students to review their work and more fully Biology 1B—VETS1018 and Genetics and
understand the process of moving from their original Biometry—VETS2009) while others do not
462 Cattley and Arthur
(e.g. Bioinformatics—INIM5006), mainly according complete, even when utilizing powerful hardware.
to the content of the unit of study. For example, This is due to the inherent, computationally intense
BioManager currently focuses on genomic, nature of the algorithms themselves. In a research
comparative genomics and sequence-based bioinfor- situation, this length of calculation is understood and
matics. Thus, teaching in other areas of bioinfor- factored into the individual research plan. In a class
matics, such as microarray analysis, currently makes situation, even an analysis taking up to 20 min may
use of other software packages. Our experience has cause difficulties in regard to teaching. Given the
shown a number of advantages for both teachers and output of these analyses are often needed as input for
students in making use of the BioManager environ- the next part of the student exercise, slow analyses
ment. These have been outlined earlier. Indeed, in at may limit the amount of material covered in the class
least one unit of study relying on bioinformatics and break student concentration on the task.
software outside the BioManager environment, The creation of a ‘course database’, a subset of
between universities, the use of the BioManager various bioinformatics applications. This is essential
interface is a common need of all courses. to ensure researchers can use the applications in the
To address this need we have created an most effective way. However, the range of options
introductory tutorial, freely available for download can, at times, be confusing to a student who is
(http://www.usyd.edu.au/sydneybioinformatics/ learning to use the application for the first time.
pdfs_docs/bioman_intro.pdf). This tutorial can be This can be addressed by preparing the classes in
used as part of the first formal class session. It is such as way as to concentrate on the main parameters
planned for additional tutorials to be added at a later and options. In addition, we are currently exploring
stage. the possibility of modifying the parameter selection
pages to reduce the number of options available
Rogue analyses when the tool is accessed by a class account to allow
Care must be taken to ensure students do not send in students to focus on the key parameters.
This can be addressed by limiting the number advantageous by time-poor students. While the use
of analyses students can simultaneously submit. of BioManager, created as a research tool, as a
Additional analyses then remain queued until the teaching aid raises a number of additional issues,
earlier analyses are complete. various strategies are relatively easily put in place to
address these. There have been some marked
Cost improvements since the above changes have been
The operation of BioManager is funded through implemented. The ability to expose students to a
the support provided by the University of Sydney, wide range of bioinformatics applications, through
occasional research grants and small, annual sub- the one consistent interface, without either the
scription fees charged to users. The service is not student or educator having to manage the installation
operated for profit. and maintenance of all these programs makes
In order to facilitate the use of BioManager BioManager a convenient resource for undergradu-
Acknowledgements
CONCLUSIONS The authors would like to specifically thank Dr Catherine
The BioManager interface has been well received by Abbott of Flinders University, Dr Alex Andrianopoulos of the
University of Melbourne and Dr Neville Firth of the University
the students and educators currently using it in 28
of Sydney for providing specific feedback on their experiences of
courses across 17 universities around the world. BioManager in a teaching environment. We also acknowledge
In particular, the ability to start an analysis and return other BioManager users who have provided feedback on the use
to review the results at a later time has been seen as of BioManager as both a research and teaching tool.
BioManager: a teaching tool 465
Information about BioManager access can be found at 6. Altschul SF, Madden TL, Schaffer AA, etal. Gapped BLAST
http://www.usyd.edu.au/sydneybioinformatics/angis/biomana- and PSI-BLAST: a new generation of protein
database searching programs. Nucleic Acids Res 1997;25:
ger.shtml. Sydney Bioinformatics is a Centre of the University of
3389–402.
Sydney. A free trial period is offered for international (non-
Australian) departments. For further information contact Sonia 7. Wren J. 404 not found: the stability and persistence
of URLs published in MEDLINE. Bioinformatics 2004;20:
Cattley ([email protected]).
668–72.
8. Honts J. Evolving strategies for the incorporation of
References bioinformatics within the undergraduate cell biology
curriculum. Cell Biol Educ 2003;2:233–47.
1. Counsell D. A review of bioinformatics education in the
9. Higgins D, Thompson J, Gibson T, et al. CLUSTAL W:
UK. Brief Bioinform 2003;4:7–21.
improving the sensitivity of progressive multiple sequence
2. Zatz M. Bioinformatics training in the USA. Brief Bioinform alignment through sequence weighting, position-specific
2002;3:353–60. gap penalties and weight matrix choice. Nucleic Acids Res
3. Cattley S. A review of bioinformatic degrees in Australia. 1994;22:4673–80.