TY - JOUR
T1 - SeqFIRE
T2 - A web application for automated extraction of indel regions and conserved blocks from protein multiple sequence alignments
AU - Ajawatanawong, Pravech
AU - Atkinson, Gemma C.
AU - Watson-Haigh, Nathan S.
AU - MacKenzie, Bryony
AU - Baldauf, Sandra L.
N1 - Funding Information:
Royal Thai Government Scholarship and a graduate student fellowship from Uppsala University (to P.A.); Estonian Science Foundation Mobilitas [MJD99 and GLOTI9020 to G.C.A.]; the Center of Excellence in Chemical Biology, University of Tartu, Estonia (to G.C.A.) and the Swedish Research Council [2010-2771] (to S.L.B.). Funding for open access charge: Swedish Research Council [2010-2771] to senior author (S.L.B.).
Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
PY - 2012/7
Y1 - 2012/7
N2 - Analyses of multiple sequence alignments generally focus on well-defined conserved sequence blocks, while the rest of the alignment is largely ignored or discarded. This is especially true in phylogenomics, where large multigene datasets are produced through automated pipelines. However, some of the most powerful phylogenetic markers have been found in the variable length regions of multiple alignments, particularly insertions/deletions (indels) in protein sequences. We have developed Sequence Feature and Indel Region Extractor (SeqFIRE) to enable the automated identification and extraction of indels from protein sequence alignments. The program can also extract conserved blocks and identify fast evolving sites using a combination of conservation and entropy. All major variables can be adjusted by the user, allowing them to identify the sets of variables most suited to a particular analysis or dataset. Thus, all major tasks in preparing an alignment for further analysis are combined in a single flexible and user-friendly program. The output includes a numbered list of indels, alignments in NEXUS format with indels annotated or removed and indel-only matrices. SeqFIRE is a user-friendly web application, freely available online at www.seqfire.org/.
AB - Analyses of multiple sequence alignments generally focus on well-defined conserved sequence blocks, while the rest of the alignment is largely ignored or discarded. This is especially true in phylogenomics, where large multigene datasets are produced through automated pipelines. However, some of the most powerful phylogenetic markers have been found in the variable length regions of multiple alignments, particularly insertions/deletions (indels) in protein sequences. We have developed Sequence Feature and Indel Region Extractor (SeqFIRE) to enable the automated identification and extraction of indels from protein sequence alignments. The program can also extract conserved blocks and identify fast evolving sites using a combination of conservation and entropy. All major variables can be adjusted by the user, allowing them to identify the sets of variables most suited to a particular analysis or dataset. Thus, all major tasks in preparing an alignment for further analysis are combined in a single flexible and user-friendly program. The output includes a numbered list of indels, alignments in NEXUS format with indels annotated or removed and indel-only matrices. SeqFIRE is a user-friendly web application, freely available online at www.seqfire.org/.
UR - https://www.scopus.com/pages/publications/84864446807
U2 - 10.1093/nar/gks561
DO - 10.1093/nar/gks561
M3 - Article
C2 - 22693213
AN - SCOPUS:84864446807
SN - 0305-1048
VL - 40
SP - W340-W347
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - W1
ER -