Seminar: Applied Machine Learning: Annalisa Marsico
Annalisa Marsico
OWL RNA Bioinformatics group
Max Planck Institute for Molecular Genetics
Free University of Berlin
SoSe 2015
Goals of this course - I
• Soft skills
– Learn how to evaluate a research paper
– Learn what makes a paper good
– Learn how to get your paper published
– Learn how to give a scientific talk
– Learn to be critical / evaluate
Goals of this course - II
• Hard skills
– Get an overview of methods in the Machine
Learning field and their applications to
Biology/Bioinformatics
– Learn how basic concepts, algorithms, and
statistical methods are applied and extended
in this field
– Learn how to ask the right biological question
and choose the right Machine Learning
method "to solve it"
Course design
• Today -> overview of the topics, assignment of
papers
• Student presentations
– Each student will choose a paper and will give a
presentation
– Two presentations per term (30-40 minutes + 15
minutes questions)
– Discussion: questions, critical assessment
Presentation guidelines
Compression with minimal loss of information
• Do not try to understand every detail, but the general idea has to be
clear
http://www.molgen.mpg.de/3415218/Seminar-Applied-Machine-Learning
Topics
General machine learning papers
1. Assessing the accuracy of prediction algorithms for classification: an overview
2. An introduction to ROC analysis
3. A Study of Cross-Validation and Bootstrap for Accuracy Estimation
and Model Selection
Feature selection
1. A review of feature selection techniques in bioinformatics
2. Novel unsupervised feature filtering of biological data
Topics
Unsupervised Learning (applications to Bioinformatics)
1. Cluster analysis of gene expression data: A Survey
2. Biclustering algorithms for Biological data analysis: A Survey