September 28, 2005

9:00/13:15 - Room 9 (2nd Floor)
T1: Mining Structured Data in Bioinformatics
Prof. Stefan Kramer

The topic of the tutorial is mining in structured data. This is particularly relevant for data mining applications in bioinformatics, since the majority of biological data is not kept in databases consisting of a single, flat table. Instead, we are frequently dealing with databases of structured and linked objects. In other words, the “objects” in bioinformatics databases often have a rich internal structure and are connected by some relation. (Consider, for instance, databases of proteins, small molecules, metabolic and regulatory networks, text databases, etc.) The tutorial will give an overview of data mining techniques for sequences, trees, graphs and relational databases. We will present techniques for both descriptive and predictive data mining in this context. In descriptive mining, we are looking for local patterns to characterize the data. In predictive mining, we are looking for models that can be used to make predictions for new, unseen cases. Along the two dimensions (types of data and predictive/descriptive), the tutorial is organized as follows: the first two parts of the tutorial are devoted to descriptive mining in databases of itemsets, strings and sequences, trees, graphs and relational databases. The third part of the tutorial deals with predictive mining based on propositionalization (i.e., feature construction using patterns), instance-based learning and kernel methods for graph and relational databases.

More Information


9:00/13:15 - Room 9 Bis (1st Floor)
T2: GEPAS: New challenges in microarray data analysis
Dr. Joaquin Dopazo

The need to associate both biochemical function and cellular roles to protein domains, rather than the individual proteins, has been recognized for a number of years. However, the ability to identify functional and/or evolutionary domains in proteins from the many full genomic sequence projects is still somewhat problematic. The problem is compounded by the large number of proteins and domains with no obvious currently recognized homologues. In research on these and related problems, our group has developed a number of tools and discovered some interesting examples of domain functions and deep evolutionary histories. Two cases will be discussed demonstrating both the utility of various bioinformatics approaches and the importance of very large-scale data integration.

More Information


14:15/18:00 - Room 9 (2nd Floor)
T3: InterPro, exploring a powerful protein diagnostic tool
Dr. Jennifer McDowall

InterPro is an integrated protein resource that provides protein annotation and classification at family and domain levels. InterPro combines the major signature databases, PROSITE, PRINTS, PFAM, PRODOM, SMART, TIGRFAM, PIR Superfamily, GENE3D, SUPERFAMILY, and PANTHER, as well as structural information from PDB, MSD, CATH, SCOP and SWISS-MODEL, into a unified database. This tutorial is designed to allow users to get the most value out of the database, and will focus on the type and organisation of annotated data, the different query methods possible, understanding the different visualisations of the data, as well as exploring the multiple external links and cross-references available.

More Information


14:15/18:00 - Room 9 Bis (1st Floor)
T4: Computational proteomics
Prof. Colinge Jacques

Proteomics has become an important approach to analyze biological samples and it extensively uses mass spectrometry to identify and characterize proteins. This tutorial will introduce the audience to the central problem of searching mass spectrometry data against a database of proteins. This presentation should stimulate the interest of bioinformatics researchers in other fields and provide a concise though accessible introduction to life scientists. The last part of the tutorial will rapidly cover other important problems in mass spectrometry data analysis such as peptide de novo sequencing, eukaryote genome search and protein quantification and characterization.

More Information




Developed by SoftActiva