Towards a semi-automatic functional annotation tool based on decision tree techniques
J. Azé, L. Gentils, C. Toffano-Nioche, V. Loux, J-F. Gibrat, P. Bessières, C. Rouveirol, A. Poupon, and C. Froidevaux
In Proceedings of the International Conference on Machine Learning in Systems Biology (MLSB 2007), in press
Due to the continuous improvements of high throughput technologies and experimental procedures, the number of sequenced genomes is increasing exponentially.
Biologist experts play a central role in the analysis of this massive amount of raw data. To annotate a new genome they need to integrate many pieces of information coming from various sources: results of bioinformatics analysis programs, data stored in specialized databases, results of high-throughput experiments such as transcriptomics, proteomics, etc., information stored in the literature, general knowledge about the domain of interest (biological properties of the studied organism, its ecology, etc.). To face the deluge of new genomic data, there is a crying need to automate, as far as possible, the annotation process itself.