Aalto University
Computational Methods in Genomics
Summer 2010: August 2 to August 13
Helpful links
Information about the Instructor
Name: Sami Khuri, visiting professor from San José
State University
Office: B153
Phone: TBD
Office Hours: By appointment.
Email: khuri@cs.sjsu.edu
Registration
Send email to Kalle Karhu at: kalle.karhu@tkk.fi
Lectures and Hands-on Exercises
Lectures: Everyday from Monday through Firday from 1000 to 1200 in lecture hall T3.
Hands-on exercises: Tuesday and Thursdays from 1400 to 1600 in the computer
lab C106.
Course code and number
T-106.6200 - Special Course in Software Techniques
Course Description
The course is on algorithms in Bioinformatics.
he course investigates the main algorithms used in Bioninformatics. It
starts with a brief overview of traditional sequence analysis methods and
then investigates sophisticated and cutting-edge computational methods and
algorithms for the analysis of complex biological sequence data. Lecture
topics include Hidden Markov Models for pattern recognition, conducting
profile-based searches and transmembrane protein structure prediction;
microarray analysis; the use of SNPs and haplotypes in genomic variation,
in pharmacogenomics, in genome-wide association studies and in
personalized medicine. The course is self contained and does not assume
any background knowledge in biology, although an interest is molecular
biology is important.
The course will also be complemented by a few hands-on lab sessions
(practicals) that will allow the participants to practice with some of the
major tools and databases. Students will be given projects that will have
to be completed and submitted within two months of the end of the course.
Remarks
- The contents of this course are different from the topics of the
Summer 2008 course I taught.
-
No prior knowledge of Biology is required.
In other words, biology is not a prerequisite for this course.
But an interest in Biology is.
I will spend the first lecture of the course
going over some notions in molecular biology
(mainly the following Biology
Terms) that will be needed
for the course.
-
We have two rooms at our disposal for the course.
The lectures will be given in a lecture hall (TBD) and the hands-on
exercises (praticals in British - travaux pratiques in French)
will be conducted in the computer lab to be determined.
The purpose of the hands-on exercises is to develop your
understanding of the material. Please keep your laptops closed
during the lectures.
Lecture Material and
Schedule
Recommended Textbooks (Not required)
-
An Introduction to Bioinformatics Algorithms;
by Neil Jones and Pavel Pevzner,
MIT Press, 2000.
- Introduction to Computational Molecular Biology;
by J. C. Setubal and J. Meidanis.
PWS Publishing Company, Boston, 1997.
- Understanding Bioinformatics; by Marketa Zvelebil and Jeremy Baum,
Garland Science, 2008.
- Bioinformatics and Functional Genomics;
by Jonathan Pevsner, John Wiley, 2003.
- Bioinformatics: Sequence and Genome Analysis; by David Mount,
Cold Spring Harbor Laboratory Press, Second Edition 2004.
Lecture Notes:
A copy of my lecture notes (Introduction followed by four parts):
Topics to be covered in the course: (not finalized)
- Micro RNA (miRNA)
- Seminal Paper:
David Bartel, 2004.
This is an excellent review that highlights what has been learned
about miRNAs in the decade since the report of the lin-4 RNA
and its regulation of lin-14. The major topics discussed
are miRNA genomics, miRNA biogenesis, miRNA regulatory mechanisms,
and the roles of miRNAs in gene regulatory pathways.
Although outdated (2004), the article is an
excellent introduction to miRNAs -
Carlo Croce, 2008.
The main purpose of this article is to show how
"The discovery of the involvement
of microRNAs in the initiation and
progression of human cancer may provide additional
targets for anticancer treatments."
-
Fu et al., 2009.
The authors use a Markov model to examine base-pairing binding patterns of
known microRNA targets.
-
Ken Garber, 2006.
A short but very good review on the role of micro RNAs in cancer.
- Microarrays
- Seminal Paper:
DeRisi et al., 1997.
By using yeast, the authors show how "DNA microarrays provide
a simple and economical way to explore gene expression
patterns on a genomic scale."
-
Gasch et al., 2000.
As a follow-up to DeRisi's work, the authors explore genomic
expression patterns in the yeast, by measuring
changes in transcript levels over time for almost every yeast gene,
of the yeast genome responding to
diverse environmental transitions.
-
Alizadeh et al., 2000.
The authors use DNA microarrays to conduct a systematic
characterization of gene expression in B-cell malignancies
and show that there is diversity in gene expression among the tumours of
the diffuse large B-cell lymphoma which is the most common subtype of
non-Hodgkin's lymphoma.
- Mark Reimers, 2010.
The article describes the typical stages
in the analysis of microarray data for nonspecialist
researchers in systems biology
and medicine.
- Genome-Wide Asociation Studies
- Seminal Paper:
Moore et al., 2010.
A very recent review on the bioinformatics challenges associated
with genome-wide association studies (GWAS).
-
Pearson and Manolio, 2008.
A thorough review on genome-wide association studies (GWAS), including
a section on terms used in GWAS and a second section on ten basic
questions on how to interpret GWAS results.
-
Manolio et al., 2009.
The authors examine potential sources of missing heritability. They
suggest research strategies, including and extending beyond current
genome-wide association approaches, to illuminate the genetics
of complex diseases in the hope of enabling effective disease prevention
or treatment.
-
Chung et al., 2010.
A very recent review on genome-wide association studies (GWAS) in
cancer, including curret and future directions of GWAS. The article gives
a description of the design, interpretation, application, and
also limitations of GWAS.
-
Altshuler et al., 2008.
This excellent review discusses "the
intellectual foundations of genetic mapping of Mendelian and complex
traits in humans, examine
lessons emerging from linkage analysis of Mendelian diseases and
genome-wide association
studies of common diseases, and discuss questions and challenges that
lie ahead."
- Next-Generation Sequencing
- Seminal Paper:
Horner et al., 2009.
A very recent review on the bioinformatics approaches that have
been introduced for several genomics and functional genomics
applications of next-generation sequencing.
-
Rusk and Kiermer, 2007.
A one page primer on different sequencing technologies
of the next generation; namely: 454, Solexa, and SOLiD technologies.
-
Voelkerding et al., 2009.
A thorough review on different sequencing technologies
of the next generation and their impact on research.
Course Requirements
-
Problem Set:
One homework assignment.
No late homework will be accepted.
The homework
is due in the beginning of the
lecture on Wednesday, August 11, 2010.
The cover sheet
for the homework.
-
Term Project:
There will be a group project. Each group consists of two students.
The group chooses a topic and writes a term-project.
A student may work alone if she/he desires not to team up
with someone else.
The term-project is due by 1700 on September 24, 2010.
Please submit your project (hard copy or pdf) to Kalle Karhu.
Some guidelines can be found
here.
-
Hands-On Exercises:
(the hands-on exercises will be distributed in class and will not be
posted)
-
Exam:
In-class, closed notes, and comprehensive on Friday, August 13, 2010,
11:00-12:00.
Grading Policy
The final grade will be computed as shown below:
Homework Assignment: 20%
Hands-On Exercises: 20%
Term Project: 30%
Final Exam: 30%
More Links