|Home | People | Research | Publications|
Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis
Version 0.91b, January 2002
Copyright © Jose Castresana
Gblocks is a computer program written in ANSI C language that eliminates poorly aligned positions and divergent regions of an alignment of DNA or protein sequences. These positions may not be homologous or may have been saturated by multiple substitutions and it is convenient to eliminate them prior to phylogenetic analysis. Gblocks selects blocks in a similar way as it is usually done by hand but following a reproducible set of conditions. The selected blocks must fulfill certain requirements with respect to the lack of large segments of contiguous nonconserved positions, lack of gap positions and high conservation of flanking positions, making the final alignment more suitable for phylogenetic analysis. Gblocks outputs several files to visualize the selected blocks. The use of a program such as Gblocks reduces the necessity of manually editing multiple alignments, makes the automation of phylogenetic analysis of large data sets feasible and, finally, facilitates the reproduction of the alignments and subsequent phylogenetic analysis by other researchers. Gblocks is very fast in processing alignments and it is therefore highly suitable for large-scale phylogenetic analyses.
Several parameters can be modified to make the selection of blocks more or less stringent. In general, a relaxed selection of blocks is better for short alignments, whereas a stringent selection is more adequate for longer ones. Be aware that the default options of Gblocks are stringent.
Gblocks is available for the following operating systems:
The software and its accompanying documentation are provided as is, without guarantee of support or maintenance.
Institut de Biologia Evolutiva (CSIC-UPF)
Passeig Marítim de la Barceloneta 37, 08003 Barcelona, Spain