Searching for Virus Phylotypes

Abstract:

Motivation: Large phylogenies are being built today to study virus evolution, trace the origin of epidemics, establish the mode of transmission and survey the appearance of drug resistance. However, no tool is available to quickly inspect these phylogenies and combine them with extrinsic traits (e.g. geographic location, risk group, presence of a given resistance mutation), seeking to extract strain groups of specific interest or requiring surveillance.

Results: We propose a new method for obtaining such groups, which we call phylotypes, from a phylogeny having taxa (strains) annotated with extrinsic traits. Phylotypes are subsets of taxa with close phylogenetic relationships and common trait values. The method combines ancestral trait reconstruction using parsimony, with combinatorial and numerical criteria measuring tree shape characteristics and the diversity and separation of the potential phylotypes. A shuffling procedure is used to assess the statistical significance of phylotypes. All algorithms have linear time complexity. This results in low computing times, typically a few minutes for the larger data sets with a number of shuffling steps. Two HIV-1 data sets are analyzed, one of which is large, containing >3000 strains of HIV-1 subtype C collected worldwide, where the method shows its ability to recover known clusters and transmission routes, and to detect new ones.

Availability: This method and companion tools are implemented in an interactive Web interface (www.phylotype.org), which provides a wide choice of graphical views and output formats, and allows for exploratory analyses of large data sets.
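The abstract names two ingredients, parsimony-based ancestral trait reconstruction and a shuffling procedure for significance, and both can be illustrated with a small sketch. The code below is not the PhyloType implementation: it assumes a rooted binary tree written as nested tuples, uses Fitch parsimony as one common parsimony variant, and tests only whether a trait is more clustered on the tree than expected by chance (it does not extract phylotypes or apply the paper's shape, diversity and separation criteria). The names `fitch_score` and `shuffle_p_value`, the toy tree and the trait values are all hypothetical.

```python
import random


def fitch_score(tree, traits):
    """Bottom-up Fitch pass on a rooted binary tree of nested tuples.

    Leaves are taxon names (str); `traits` maps each name to its trait value.
    Returns (candidate ancestral states at this node, number of trait changes).
    """
    if isinstance(tree, str):  # leaf: the observed trait value
        return {traits[tree]}, 0
    left, right = tree
    left_states, left_cost = fitch_score(left, traits)
    right_states, right_cost = fitch_score(right, traits)
    common = left_states & right_states
    if common:  # children agree on at least one state: no extra change
        return common, left_cost + right_cost
    # children disagree: keep the union and count one change
    return left_states | right_states, left_cost + right_cost + 1


def shuffle_p_value(tree, traits, n_shuffles=1000, seed=0):
    """Permute trait values across tips and compare parsimony scores.

    A low score means the trait is clustered on the tree. The p-value is the
    share of shuffled data sets that are at least as clustered as the real one.
    """
    rng = random.Random(seed)
    observed = fitch_score(tree, traits)[1]
    names = list(traits)
    values = [traits[name] for name in names]
    hits = 0
    for _ in range(n_shuffles):
        rng.shuffle(values)  # random reassignment of trait values to tips
        shuffled = dict(zip(names, values))
        if fitch_score(tree, shuffled)[1] <= observed:
            hits += 1
    # add-one correction so the estimate is never exactly zero
    return observed, (hits + 1) / (n_shuffles + 1)


# Toy example (made-up data): two clades, each with a single country label.
tree = ((("a", "b"), ("c", "d")), (("e", "f"), ("g", "h")))
traits = {"a": "ZA", "b": "ZA", "c": "ZA", "d": "ZA",
          "e": "BR", "f": "BR", "g": "BR", "h": "BR"}

observed, p_value = shuffle_p_value(tree, traits)
print("observed changes:", observed)
print("shuffling p-value:", p_value)
```

Each Fitch pass visits every node once, so one shuffle costs time linear in the number of taxa, in line with the linear complexity stated in the abstract. The recursion is kept for brevity; a tree with thousands of taxa would call for an iterative traversal.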
Document type:
Journal article
Bioinformatics, Oxford University Press (OUP), 2013, 29 (5), pp.561-570. 〈10.1093/bioinformatics/btt010〉


http://hal.ird.fr/ird-00831601
Contributor: François Chevenet
Submitted on: Friday, 7 June 2013 - 12:44:01
Last modified on: Thursday, 11 January 2018 - 06:26:12
Document(s) archived on: Sunday, 8 September 2013 - 04:18:45

File

Bioinformatics-2013-Chevenet-5...
Publisher files allowed on an open archive

Citation

François Chevenet, Matthieu Jung, Martine Peeters, Tulio De Oliveira, Olivier Gascuel. Searching for Virus Phylotypes. Bioinformatics, Oxford University Press (OUP), 2013, 29 (5), pp.561-570. 〈10.1093/bioinformatics/btt010〉. 〈ird-00831601〉
