HIV Databases HIV Databases home HIV Databases home
HIV sequence database



ElimDupes

Duplicate Sequence Removal

Purpose: compare the sequences in an alignment and identify or eliminate duplicates or very similar sequences.

For details, see ElimDupes Explanation.

You have javascript turned off
Please note that some tool features, form validation in particular, may not work properly.
Input
Paste your sequences here icon
[Sample Input]
or upload your file
Yes, sequences are aligned icon UNCHECK box if your sequences aren't aligned (tool will be much slower)
Analyze input by groups icon enter number of leading digits
Elimination options
Eliminate sequences 100% identical
more similar than % icon
Remove extraneous characters icon (If seqs are unaligned, this setting may be changed by the tool; check your results.)
Make all letters uppercase icon
Consider subsequences as duplicates icon
Output options
Restore original sequences in output icon Yes No
Create file of unique sequences with
_count added to sequence names icon
Yes No
 Include rank in sequence names icon
 Sequence names end in '_nn' where nn is the occurrance count icon

last modified: Wed Oct 31 09:57 2018


Questions or comments? Contact us at seq-info@lanl.gov.

 
Operated by Triad National Security, LLC for the U.S. Department of Energy's National Nuclear Security Administration
© Copyright Triad National Security, LLC. All Rights Reserved | Disclaimer/Privacy

Dept of Health & Human Services Los Alamos National Institutes of Health