This tool allows you to obtain all sequences (or a subset of sequences) which contain a selected region. Entries can be downloaded as Genbank records, or as alignments in FASTA or IG/Stanford format.
Note 1: Selecting the 5' LTR region will result in selecting sequences in both the 3' LTR and 5' LTR region. This problem will be corrected in the near future.
Note 2: Please check these alignments carefully before using them! Although they appear to be multiple alignments, they were not created by a program that considers all of the sequences while generating the alignment. Instead, alignments were created by a two-step process. First, the single sequence was aligned against a HMMER model. Second, gaps were inserted as needed to create the final product, an alignment of all sequences. While such alignments serve as a good starting point, they may not be optimal.
1. Search for sequences matching a name or accession number. Leaving both fields empty returns all sequences that match the other search criteria. Note: These are wild-card searches (see examples)
3. Download only complete sequences, or include fragments of the specified region. If you choose to include partial sequences, you can set a minimum sequence length.
4. You can either view the sequences in the web browser or download them.
5. Choose the format for the sequences. If you choose IG or FASTA, the sequences will be aligned and you may clip the sequences to the region of interest (see #6 below). If you choose GenBank format, sequences will not be aligned, but you will get the background information concerning the sequence.
6. If you are downloading an ALIGNED set of sequences in FASTA or IG format, you can have the sequences clipped down to the region of interest.
7. Choose the identification information for the sequences: