Bioinformatics Assignment
AAAGAATGGTCACGGTCGAGGAAGTTCGTAAGGCTCAACGCGCCGAAGGGCCGGCC
ACCGTGTTGGCGAT
CGGCACGTCGACTCCACCAAACTGTGTTGATCAAAGCACATACCCGGACTACTATTT
CCGCATCACAAGT
AGCGAGCACAAGACGGAGTTGAAAGAGAAATTCAAGCGCATGTGTGAAAAATCCAT
GATCAAGAAGCGTT
ACATGTACCTAACGGAGGAGATTTTGAAAGAGAACCCCAACGTTTGTGAGTACATG
GCACCATCGCTTGA
CGCAAGGCAAGACATGGTGGTGGTTGAGGTGCCAAAGCTAGGCAAAGAAGCGGCC
ACCAAGGCAATTAAG
GAATGGGGCCAGCCGAAGTCCAAGATCACCCACCTAGTCTTCTGCACCACCAGTGGT
GTCGACATGCCCG
GGGCCGACTACCAGCTCACCAAGCTCTTGGGTCTCCGTCCGTCCGTTAAGCGTCTCA
TGATGTACCAACA
GGGTTGTTTCGCGGGCGGTACTGTTCTTCGTGTGGCCAAGGATTTGGCCGAGAACAA
CAAGGGTGCTCGT
GTTCTTGTTGTTTGCTCGGAAATCACCGCGGTTACTTTCCGTGGACCGAGTGATACTC
ACTTGGATAGTC
TTGTGGGACAAGCATTGTTTGGTGATGGTGCTGCTGCTGTTATAATTGGTGCTGATCC
AATACCGGAGAT
CGAAAAACCTATGTTCGAACTTGTATCGGCGGCACAAACGATATTGCCGGATAGCG
ACGGTGCTATCGAC
GGTCACCTTCGTGAAGTCGGGCTTACATTTCACCTCCTCAAGGATGTTCCGGGACTA
ATTTCAAAGAACA
TTGAGAAGAGCCTAGTTGAAGCATTTCAGCCTTTGGGCATATCCGATTGGAACTCTC
TCTTTTGGATAGC
GCACCCTGGTGGTCCAGCCATATTGGACCAAGTCGAAGCGAAACTAGCACTCAAGC
CCGAAAAGCTTCGA
GCCACTCGACATGTTCTTTCGGAATATGGGAACATGTCGAGTGCTTGTGTGTTGTTTA
TATTAGACGAGA
TGAGGAAGAGTTCGAAAGAAAATGGACTAGGCACCACGGGTGAAGGGCTCGAGTG
GGGTGTGCTGTTCGG
GTTCGGACCGGGGCTCACCGTCGAGACGGTGGTGCTCCATAGCGTCACTGCATAA
MVTVEEVRKAQRAEGPATVLAIGTSTPPNCVDQSTYPDYYFRITSSEHKTELKEKFKRM
CEKSMIKKRYMYLTEEILKENPNVCEYMAPSLDARQDMVVVEVPKLGKEAATAIKEW
GQKSKITHLVFCTTSGVDMPGADYQLTKLLGLRPSVKRLMMYQQGCFAGGTVLRVAK
DLAENNKGARVLVVCSEITAVTFRGPSDTHLDSLVGQALFGDGAAAVIIGADPIPEIEKP
MFELVSAAQTILPDSDGAIDDEMRKSSKENGLGTTGEGLEWGVLFGFGPGLTVETVVLH
SVTA
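The protein above can be checked against the nucleotide sequence with a short script. The following is a minimal sketch, assuming the translation starts at the first ATG of the forward strand and uses the standard genetic code; the function name `translate_from_first_atg` is hypothetical and is not part of the assignment. Under that assumption, the first line of the sequence above yields the first 17 residues of the listed protein. Entries that contain introns or use a different reading frame will not be reproduced by this simple approach.

```python
from itertools import product

# Standard genetic code in the conventional TCAG codon order
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the sequence.

    Assumption for illustration: forward strand only, no introns.
    Whitespace and line breaks in the input are ignored.
    """
    seq = "".join(dna.split()).upper()
    start = seq.find("ATG")
    if start == -1:
        return ""  # no start codon found
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the first nucleotide sequence in this document.
first_line = "AAAGAATGGTCACGGTCGAGGAAGTTCGTAAGGCTCAACGCGCCGAAGGGCCGGCC"
print(translate_from_first_atg(first_line))  # MVTVEEVRKAQRAEGPA
```

Passing the full sequence (all lines joined) instead of `first_line` extends the translation up to the first in-frame stop codon.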
Secondary Structure
2. UNVERIFIED: Oryza sativa isolate Qitougu cultivar Qitougu long and barbed awn 1 (LABA1) gene, partial sequence
>KR456128.1 UNVERIFIED: Oryza sativa isolate Qitougu cultivar Qitougu long and barbed awn 1 (LABA1) gene, partial sequence
TCTGAACAGTTAGCCCTTCAATTCTACTCCCAATATTTACTGATGGGATGGTCCAGA
AAATAAACTCCCG
CACTGCGCGCAATAACTTTTCAATGGAGTTGCAAAAGTCTGCTGCAACTTAATTTTG
CAACAGTAGTTGC
AATTTACAGCTAGCTCGTTGCGATGACAGTGCTTAGACTTGAAGCATAAATGTCCAA
AAGAATTTCAATC
GAGATGATGGATCATTGGACCCCCTATATATTACACTGGACCTGTAGATACATAATG
AGCATATAGGGAA
GTAAGTAAAGTGGCCGCCCCCCCTATATGCGTTTTCGCTCTGCCTAAATGCATCAGA
CTACATTCAGCTA
GTCAGGATTCCCTGAACGGGTTGCATGTGCATGTTTCTTCTCTTCTTCCTTTGCTCGG
AAGTCTGTCTAT
GTCATCTTGATGAACCCTAGCTAGTGTATAACAAGCAAGCAGGGTTGTGTCGCCGAC
ATGCATAAAGCTT
TTTTGGACATGCGGGCCTGCAGGCCGCAGTAAGAACGAGCCGGCATGGTCTCATAT
AAAATCTGAACTAA
CCTTTTGAGATAAGAAATCTGAAGTAGCTAGCACGAACGGTCTTCACTTCTGCTAAA
CAAACACCGGAAG
AGATCAACTAACAGAGACGGCATTATTAGTTTAAGTGATTCCAAGTGTTCTCCGGAT
CAATCATATTAAT
TTAGCCCTTCTTGATGCATGTCCTGCGTAATTAGCACATCACATTCAGCTTTCTGCTT
TCTATACACCCA
GTTAGGTCGAATCACACAACATGCAGCTACCAGGATATATATTATACCAACACAAA
ATAATCAGATGATC
AGATGGGGCCTACAAGATACCTATATATAACAGCAAAGCGGTAGTGCACTCAAGTG
TGAGCAAAGCCATG
CTGTAGTGGGATCTCTCTCCATCCATTCCTGCAGTGCCGCCACTGCTATAGCTGCGA
GAGCTATAGCTAG
AAAAATATCAACGTAGGAAGTGGGAGGAGAAGTAGATGATGGATACAGATCACACT
GAGATAATTAAGGA
GGGAGAGGCAGTAGTGGAAGCCATGGCTCTACTCAGTCTCGGTTCAGGAGGATATG
CGTCTTCTGCGGGA
GCAGCCAGGGCAAGAAGAAGAGCTACCAGGACGCAGCCGTTGAGCTTGGCAAGGA
GCTGGTACGTTCTAC
TATATTTCTCAATTCTCATATATGTAATCGTCTTGACAAAGCAGCTGCTTCATTTTTC
AACCTTGATCAT
GACATCCTTTCTGAGATCTTGCTTCTTTAATTTTCTCTCGTTAATCAAAAACACCATG
GCCATGGAGCTT
TTTACTGATATTCAGCACAACGCGCCTGGTGATATGTAGGTATATAGAGAGTACCTG
ACAAGGTTTTGGG
TTTCTTTTCAACGTCGTAATCTTGGAGTAATATAATTATCAAGCTGTGTAGACGTACA
CTTGCTGAACAA
GGGCTAAAATTTATCTCTCTATATATTTAAGTGACAATATATGCGACATTACTGCTA
GCTGTAGACGTAC
TCTTGCAGTGCATGCAGCTTGTTCTTAGTAGAAGATGAGAGCATATGCATGAGCTGA
GCACAGCATGTGT
AGGTCTGAATTGAATTTCTGTATCATGCAGTAAAACCCACTTGTATTACAAAGGAGA
ATGTCATGAGGGC
TTGCACATGAGCTTATTAATTACTTCACATGAAAGACAATATGTCTGTCGGTGAACA
TGGAATTTTATAT
TTATGCAGCTTGCATAGTTCCATAGAAAATGACTTACTCCCCAGGAACCTAGTACTG
GATTAGGTTATGA
TCTTGTATAAGGCTGCTATATTTTGAAACGGAGGGAGTACTAGCCAGCAAGTTACTA
CGCGCATTTGCAT
TTGTATGGTGACTTTTCGCCGGGATTTTCGGTGCTCAGCGCATTTTTCTCTATCCATTT
CGAGGCACGGT
ATAAATGCAAAAATGAAATAGACAAGTCGCATGGATCAGTCTTAAAGGAAACTATA
TATAGTGTGTACTG
TATGTAGTAGCTAGACTACATATTCTAGATCACTTGATGATATATATTATAGAAGAC
TAGATTCTAGAAT
GTGGAAATGCATCCCCACTCTCCATTATTATGGTTTCACCACAGGTAGCAAGGAACA
TTGATCTAGTGTA
TGGTGGAGGAAGTGTGGGGCTCATGGGCCTGGTCTCTCAAGCTGTCTACAATGGAG
GGAGGCATGTTATT
GGGTATGTAAAAACGTAATAATTGTTGATCATTCTTCAGCACTGATACATGGAAAGA
ATAACTCCGTATA
TAGTACACTTTGTAAGATGTCATCTTGTCATGACAATAGACTTGGATTATGGTTCCTT
TCCTCAGGTGCT
ACATGCTAGTACTAATATATAATTTGCCATTTTGTTGTCCCTATATACACGCAGGGTG
ATTCCCAAGACT
CTTATGCCTAGAGAGGTAAGCACGCTCATCTCCCTTCCAACAAGTCCTGACATAGTT
TATTCACATGCAA
CGATTCCCATGAGCTTGACATCATTGTTGCTGATCACCGGTTAAAATCTCATCATTTT
CATTGGAAATGT
CACTTGAAAAAAAATCTAGTTAATACTTATTGATCAAGCATGTTAAGTATTTTGTAA
GCATGATAGTATC
ACTAGATGAGATTGCTTTTTAGAAAAAAGTCAGTAGCACTGATCTAAAATTATATCA
CTTATAGGTATTA
GAAAAACATATATATATGGTATGTGGATACATACTACTACTGGAAAATTATGCTTGT
TCATTATCTTTCT
TGAATCTTGATAGTGGAAAGAAGAATGCGATCTTATCTGCTTACAAAAGATCGCATG
GAACGTTCAAAGA
AAATCTTGTTTTTCCTTTCCCTCGTAGATGAAATCCTGCTGATCTTGACTGCCGCTTC
TAGTGGCTGCCC
CTTTGTTCTTAAGGCCTTGATCCTTTAAGCAGATGCTGACTTTATTAATCAGAAAGAA
ATGTCCCTCACA
TTTTCTTTTTGTCCCCCAACAACAGCAACATGATTGCCTCTCCTGAACCTTTGCTGCG
TTGCTCTGATTA
AACTATACATGCTACCACTCCTGTTAAACAGTGCTCATCCCCTTGCTCGTTGGCTAGT
TTTTGCATGCTC
TCTTACTTTTTGTGGAGCAGAAAGCTGCAGGGTGCTCCTGTGTCTGTGCGAGAGGCT
TTGTCACATACGC
ATGCGTGTTTGCTTGTGTGTGCAGATTACGGGTGAGACAGTAGGGGAGGTGAAAGC
AGTGGCAGATATGC
ATCAGAGGAAGGCTGAGATGGCCAGGCAATCTGATGCGTTCATAGCACTGCCTGGT
TAGTCTCTCTCACC
ACCTGGATATATTTTTTTTAAGATAATGAAACCACCTGAATATATAGCTTCATCAACT
TGTGTTGAGTCG
ACATGTACTAATTTGCTTTACTTTTGTTGTCAATTGCCTTGTCATATTTTTTCCTTTAT
CATGTTATGAT
ACGTGGCTAGTGGAAGTGTGAGTGGAAAGTTGCGTAAAAGCAAGCACAAGCAGGGT
GCATGAGTTTGACT
TTGACCTGCTAAATTGGCTTGCATGCATGCTGCTGCCGCCCAGTCGGTGCAGAATCC
TTCTTTATACGCA
TGAGCTGGTCGATCAGATTGTCAGATACTCGGATAAGAGAGGACCTGTTGAGCCACT
TCGTGCTGATATT
TTCAGTATCGTTCGTAGCTTATCTTTTTCAGTTATTTGATTCTTTTTCCTAATCATCAG
CCAACTGAATT
TGTGATTCGCCCATCAGTCTTAAAAAAAAATTGTGATACGCCTTGTTAGCAAGGACC
AAATTTGGCCCAT
CCCCGAGGATGGGCTGGGAATTTTAACTAGATTCAGGACCATCCTAGTGAAATGGG
CTAGTATTTTTTCT
GCCTTCTCAAATCAAACCTGACACAACTTCATACTACTATGGATTTATGAGGTTCAA
ACTCAGCCTAGAA
ATATTCTGAAGATAATGCAAACATCCGACAATTATACCCAGGCAGATGCTGTATCCG
CAGATTCCCTCAA
TCAATACTCGTATTTCTGGAGGAGAGAGAGATCCAGAGCGAGCACTGTTCATTACTT
AAATTTGTTTACT
GCAGGTGGGTATGGAACACTTGAAGAGCTCCTGGAAGTAATTGCCTGGGCTCAGCT
CGGCATTCACGACA
AGCCGGTACATACTGAAATAGTTCATGATCAGCTTTTGCACATGCAACATATGTACA
CGCACTGATGAAC
AATGCACGTATATACACGGAGAGCCAAAACTTTTTTTTACCTTGGCTTAATGTGGAC
GATCGATGCTACG
TACAGGTTGGCCTGCTAAATGTGGACGGCTACTACAACTCTCTGCTGTCGTTCATCG
ATAAAGCTGTGGA
GGAAGAGTTCATCAGCCCCTCTGCGCGCCATATCATCGTGTTAGCTCCAACACCAAA
AGAACTTCTCGAG
AAGCTAGAGGTGTATATACTTATATATATAATCTATCGATTTTTCTGATGCATCATGG
CACACTGCAAAG
GACAGAAGAAAACGCCGGTTTCATCGGTGAATGAGAGATCGAGCTGAACTTTCCTC
TTGCTCGCATGCAG
GCGTACTCCCCTCGGCATGACAAGGTCGTGCCGAAGATGCAGTGGGAGATGGAGAA
GATGAGCTACTGCA
AGAGCTGCGAGATCCCTGGCCTGAAAGAAGGCAACAAGGCGACCATCCAAGCACAG
CGAGGAAGCATGCT
CTGAAATTTACTGTAGCACTAGCTAGCTATAGCTTAGCCAATGCGTGCAGCAAGAAG
ATTCAAAACTTCT
TGGTGCACATGAACTGCAACTTTTAATTCATGTAACTCGGTTCATTAGAAGGCAATT
GATCACTGATCAT
GCAATTAGTCGTGTACGTAGCCCTAAGAATGAAATCATCAGTAAGATTTGTAAATCA
CAGAGCTCCAGAG
CAGCAGACTGACATAAAACGATATGTGGGCTAGATCGATCTCCTCCGTGGGCTGGG
CCGGA
SRFRRICVFCGSSQGKKKSYQDAAVELGKELITGETVGEVKAVADMHQRKAEMARQSD
AFIALPGGYGTLEELLEVIAWAQLGIHDKPAYSPRHDKVVPKMQWEMEKMSYCKSCEI
PGLKEGNKATIQAQRGSML
Secondary Structure
GAATTCTTTCTCCCACACACCCTTTTTGCCCTCTTTCGCCGAGGAGGAAAGAATAATC
TTCCAAGCGGAC
AGAGACCTAAATTTCCATTAGATTCATTCCTAAGCTTGCTTTGTTGCAGCAAGATGA
TCAGTCCGAGAGT
GCTGGAGAGAAGAGAAAGCGGTAAAAACCTCTCTTATTCGGTCACCGAGAAGTCGG
ACGACTCTTCAGTA
ACCCAGGGTGATCCGACCCCTTCGACGCTTTTTTCGCTGTATACCCCCTCCATCCTTC
GGAGGTGGAAGA
AAGGGTACTCACATTTTAATACATAGTAGGGCCCCAGAACGCTAAAAGGTGGGGGA
ACAAGAGTTGTCAC
GATAGAAAAGAGAAAAAAAGAAATGACTATAAGGAACCAACGGCTCTCTCTTCTTA
AACAACCTATATCC
TCCACACTTAATCAGCATTTGATAGATTATCCAACCCCGAGCAATCTTAGTTATTGGT
GGGGGTTCGGTT
CGTTAGCGGGTATTTGTTTAGTCATTCAGATAGTGACTGGCGTTTTTTTAGCTATGCA
TTACACACCTCA
TGTGGATCTAGCTTTCAACAGCGTAGAACACATTATGAGAGATGTTGAAGGGGGCT
GGTTGCTCCGTTAT
ATGCATGCTAATGGGGCAAGTATGTTTTTCATTGTGGTTCACCTGCATATTTTTCGTG
GTCTATATCATG
CCAGTTATAGCAGTCCTAGGGAATTTGTTCGGTGTCTCGGAGTTGTAATCTTCCTATT
AATGATTGTGAC
AGCTTTTATAGGATATGTCCTACCTTGGGGTCAGATGAGCTTTTGGGGAGCTACAGT
AATCACAAGCTTA
GCTAGCGCCATACCTGTAGTAGGAGAATTAGCCGCCTTATTTATGGATTTTCCTCTG
GTATTGGGGAATG
GGGGGCACCCCTACAAGACTTTAATCTAAACATGCTGCCTGATTTGAATCTGCCCGC
GGAACCCGAGCCC
TATCAACCACTATTTCCCCAGGTTCCTCTTGGGACTCTGTCCTTAGAGGAGCAAAGG
GCAGTAGACGAAT
TAGTCCGCCTTGAAGGGCGGCTGATCCGGACCGCAAGATGTGTCTTGGAGTCGCTGG
GCTATTCGCCCCA
GCCGGGCGACATAAAAACTTTCGTACAGATCTTCATAGTGGACATCGACTCATCCGA
CTATGATGATCTT
GTCTTGGCTCTTATTGATGAAAAGTCGCCCATTTTTCTTCAATTTTTAGAAGAATGGG
AAATTTTTCTCG
CAGACAACAGCGCTCTGGTCGAAAATTTAGGGATCAATAAAATAAAATAACAACAC
CCACCTAACCTAAT
TAATGTGGTGTGGAAGGGCAAGCGATAGAAAAGTCCCTCGCGCGAACTACTACTAC
TAGAAAATGGTGAG
ACCATTAGTGAAAGAAGGACCAAGGAGGATCCTATCCTAAAGGAGAAGGAGGAGG
AGTAGGAGCTAGCTT
TAGGGGCGGAATCGAATGATTACGAGATAAACAATGAGACAAAGGAGAGCACTTA
GACGAGTCAGCCAAA
AAGAAAGACCACCAAAAGTAACGACCACCAAGATAGGCATAGTAATTCGATCTTTT
GATCACCCATTTTT
GGAAAACCATTTTTGGGGGCTTCCGCCTTACACACGGAAGATTGGATTGCCTGAATC
ACGAGTCTTATAT
ACTGTGTTACGATCACCTCATATTGATAAAAAGTCCAGAGAACAATTTTTTATGAAA
ATAAAGAAAGAAT
TTCTGGTCATAAAAACAGAAAGGCATGAATTGCGCAAGAAGTTCTTTCGGTTAAAAC
GCCGTGCGACTCG
GAGGACATAAGACTTCTTGGTCAAGCCAAAAAGATTGCCGACTGGATGCTCCTACCC
CACCATGCCTGGC
CCTTTATCTTACCTTAAGAAGAAAGAGGAGGTATGAAGCGTGGGACAGAATGCAAG
AAGTGTATGATACA
ACAGATAACCTTAAGGAGTGGCGGCAACCCTCTTGATTGATCAACGCGAGTGAACT
GTGCTTAGACGCTT
CGTAAAACCGCACCGATCTACGAGAGGAGCTGATAGATGAGTAGGCTTCCCCTTTCG
ATTCACGGAATGT
GATCTGGGACACGATGGGAGTTTGCGTGTCTCGGTAGGAAAGATATCACCGGAGTA
TAGCACAAGATCGC
CTTTTCTTGCTGCCATGGGGTCGACCTGTGAACAAGGTAAACCCAATGGGAACCAAA
AAACGGGGGGTAC
CGCATTGGGCAGAAGACGATCCAAAAAGCGAAGGCTCACCCAGCTGAAGGAAGGG
GGTGAGATAGGCAAG
AGGTAGCTTGCTTCCAAGCCGGCCCGGCCGCGAGGAATCAAGAGATCTTTGCCGGT
GCTGACTTGGATCT
CGGGTGACGGAATAAGGCGGCCAGAAGCGACGAGCAGTCGCGGTCGAAGCTTGGCT
CTAGCCGATAGGCT
AGCTGTAAGCTTGGCTACAGCGAACGCTTACCAAGCGCGAAGGAAAGGGCCTTCGC
CACTTGAGCCGTAT
GCGGGGGAACTCGCACGTGCGGTTCTTAGGGGGGGAGAGCTAGTAGGAGCCATCCC
ATCCCAATAGCGTA
TATTTGGAGCTCAATATGAAATCCTATTTTCTTGCAAGACCCGTTCGGATAAGGGAA
AACTCCAGAGATT
GCTTCGAAATAAGATCCTTGCGTTGACCCTTTCCTGAACCAGAACCGGGGGAGGATG
AGAGGAAAGGGGG
ATCAAAATGTCCCATCAATAAAGTTCGGGCTTTCCATTCTTCCTCCCCCCCCCTCACT
CCCCCTTTTTC
MSVLGPYYVLKCEYPFFHLRRMEGVYSEKSVEGVGSPWVTEESSDFSVTE
Secondary Structure
AGACATAAAAATCATCGCCAGTTGCTTAAAGAAGCAGAAACAGACCAGATAAATCG
TCGCTGAAAAACGC
ACTTCAAACTGGCTGGTAATATATTTAAAGCAGCCCACCAGCAGGAACGGTACTTCA
AACATATGCAGCG
TTTTCAGAATAACCACTTCCAGTGCTGAGGTAGCGAACGATGAGCCAATAATACGTA
CTGACATAATAGT
GCCAGCCAGCAGCAGGGCGTTTTTCCCACCGATGCGATTAATGATCAGTGGCGCAA
AAAACATAATCGAG
GCGTTAAGTAATTCGCCCATTGTCGTTACGTAGCCAAATACCCGCGTACCCTGTTCA
CCGGTAGCAAAGA
ACGAAGTAAAGAAATTAGCAAACTGTTGGTCAAAAACATCGTAGGTGCAGGAAACG
CCAATAACATACAG
TGACAAAAACCACAGTTTTGGCTGTCTGAACAGTTCCAGCGCCAGCTTAAGGCTAAA
TGCCGAATGGTTG
GCACCTACCGCATTGGCAACCGTGGCAGAAGAGGGCGCATCCGTTTTGGCGAAAAA
GAGTAAAACGGCGA
GGATGAATGCACAGCCAGAACCCAGCCAGAAAACAAACTGATTATTGATGGTGAAC
ATGATGCCGACAAT
CGAGGCACACAGCGCCCAGCCAACACAGCCAAACATCCGCGCGCGACCAAATTCGA
AATTACTGCGACGG
CTGACTTTCTCAATAAATGCCTCTACTGCTGGCGCACCGGCGTTAAAACAAAAGCCT
AGATAAATACCAC
CAACAATCGATCCTACTAAAATGTTGTATTGTAACAGTGGCCCGAAGATAAAAATA
AAGAACGGCGCAAA
CATCACTAACATGCCGGTAATAATCCACAGCAGGTATTTGCGCAGCCCGAGTTTGTC
AGAAAGCAGACCA
AACAGCGGTTGGAATAATAGCGAGAACAGAGAAATAGCGGCAAAAATAATACCCG
TATCACTTTTGCTGA
TATGGTTGATGTCATGTAGCCAAATCGGGAAAAACGGGAAGTAGGCTCCCATGATA
AAAAAGTAAAAGAA
AAAGAATAAACCGAACATCCAAAAGTTTGTGTTTTTTAAATAGTACATAATGGATTT
CCTTACGCGAAAT
ACGGGCAGACATGGCCTGCCCGGTTATTATTATTTTTGACACCAGACCAACTGGTAA
TGGTAGCGACCGG
CGCTCAGCTGGAATTCCGCCGACACTGACGGGCTCCAGGAGTCGTCGCCACCAATCC
CCATATGGAAACC
GTCGATATTCAGCCATGTGCCTTCTTCCGCGTGCAGCAGATGGCGATGGCTGGTTTC
CATCAGTTGCTGT
TGACTGTAGCGGCTGATGTTGAACTGGAAGTCGCCGCGCCACTGGTGTGGGCCATAA
TTCAATTCGCGCG
TCCCGCAGCGCAGACCGTTTTCGCTCGGGAAGACGTACGGGGTATACATGTCTGACA
ATGGCAGATCCCA
GCGGTCAAAACAGGCGGCAGTAAGGCGGTCGGGATAGTTTTCTTGCGGCCCTAATC
CGAGCCAGTTTACC
CGCTCTGCTACCTGCGCCAGCTGGCAGTTCAGGCCAATCCGCGCCGGATGCGGTGTA
TCGCTCGCCACTT
CAACATCAACGGTAATCGCCATTTGACCACTACCATCAATCCGGTAGGTTTTCCGGC
TGATAAATAAGGT
TTTCCCCTGATGCTGCCACGCGTGAGCGGTCGTAATCAGCACCGCATCAGCAAGTGT
ATCTGCCGTGCAC
TGCAACAACGCTGCTTCGGCCTGGTAATGGCCCGCCGCCTTCCAGCGTTCGACCCAG
GCGTTAGGGTCAA
TGCGGGTCGCTTCACTTACGCCAATGTCGTTATCCAGCGGTGCACGGGTGAACTGAT
CGCGCAGCGGCGT
CAGCAGTTGTTTTTTATCGCCAATCCACATCTGTGAAAGAAAGCCTGACTGGCGGTT
AAATTGCCAACGC
TTATTACCCAGCTCGATGCAAAAATCCATTTCGCTGGTGGTCAGATGCGGGATGGCG
TGGGACGCGGCGG
GGAGTGTCACGCTGAGGTTTTCAGCCAGACGCCACTGCTGCCAGGCGCTGATGTGTC
CGGCTTCTGACCA
TGCGGTCGCGTTCGGTTGCACTACGCGTACTGTGAGCCAGAGTTGCCCGGCGCTCTC
CGGCTGCGGTAGT
TCAGGCAGTTCAATCAACTGTTTACCTTGTGGAGCGACATCCAGAGGCACTTCACCG
CTTGCCAGCGGCT
TGCCATCCAGCGCCACCATCCAGTGCAGGAGCTCGTTATCGCTATGACGGAACAGGT
MVALDGKPLASGEVPLDVAPQGKQLIELPELPQPESAGQLWLTVRVVQPNATAWSEAG
HISAWQQWRLAENLSVTLPAASHAIPHLTTSEMDFCIELGNKRWQFNRQSGFLSQMWIG
DKKQLLTPLRDQFTRAPLDNDIGVSEATRIDPNAWVERWKAAGHYQAEAALLQCTADT
LADAVLITTAHAWQHQGKTLFISRKTYRIDGSGQMAITVDVEVASDTPHPARIGLNCQL
AQVAERVNWLGLGPQENYPDRLTAACFDRWDLPLSDMYTPYVFPSENGLRCGTRELN
YGPHQWRGDFQFNISRYSQQQLMETSHRHLLHAEEGTWLNIDGFHMGIGGDDSWSPSV
SAEFQLSAGRYHYQL
Secondary Structure
>LC645191.1 Klebsiella pneumoniae IRQBAS103 gene for 16S rRNA, partial sequence
GCTACACATGCAGTCGAGCGGTAGCACAGAGAGCTTGCTCTCGGGTGACGAGCGGC
GGACGGGTGAGTAA
TGTCTGGGAAACTGCCTGATGGAGGGGGATAACTACTGGAAACGGTAGCTAATACC
GCATAACGTCGCAA
GACCAAAGTGGGGGACCTTCGGGCCTCATGCCATCAGATGTGCCCAGATGGGATTA
GCTAGTAGGTGGGG
TAACGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACAC
TGGAACTGAGACAC
GGTCCAGACTCCTACGGGAGGCAGCAGTGGGGAATATTGCACAATGGGCGCAAGCC
TGATGCAGCCATGC
CGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTCAGCGGGGAGGAAGGCG
ATAAGGTTAATAAC
CTTGTCGATTGACGTTACCCGCAGAAGAAGCACCGGCTAACTCCGTGCCAGCAGCCG
CGGTAATACGGAG
GGTGCAAGCGTTAATCGGAATTACTGGGCGTAAAGCGCACGCAGGCGGTCTGTCAA
GTCGGATGTGAAAT
CCCCGGGCTCAACCTGGGAACTGCATTCGAAACTGGCAGGCTAGAGTCTTGTAGAG
GGGGGTAGAATTCC
AGGTGTAGCGGTGAAATGCGTAGAGATCTGGAGGAATACCGGTGGCGAAGGCGGCC
CCCTGGACAAAGAC
TGACGCTCAGGTGCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCC
ACGCCGTAAACGAT
GTCGATTTGGAGGTTGTGCCCTTGAGGCGTGGCTTCCGGAGCTAACGCGTTAAATCG
ACCGCCTGGGGAG
TACGGCCGCAAGGTTAAAACTCAAATGAATTGACGGGGGCCCGCACAAGCGGTGGA
GCATGTGGTTTAAT
TCGATGCAACGCGAAGAACCTTACCTGGTCTTGACATCCACAGAACTTTCCAGAGAT
GGATTGGTGCCTT
CGGGAACTGTGAGACAGGTGCTGCATGGCTGTCGTCAGCTCGTGTTGTGAAATGTTG
GGTTAAGTCCCGC
AACGAGCGCAACCCTTATCCTTTGTTGCCAGCGGTCCGGCCGGGAACTCAAAGGAG
ACTGCCAGTGATAA
ACTGGAGGAAGGTGGGGATGACGTCAAGTCATCATGGCCCTTACGACCAGGGCTAC
ACACGTGCTACAAT
GGCATATACAAAGAGAAGCGACCTCGCGAGAGCAAGCGGACCTCATAAAGTATGTC
GTAGTCCGGATTGG
AGTCTGCAACTCGACTCCATGAAGTCGGAATCGCTAGTAATCGTAGATCAGAATGCT
ACGGTGAATACGT
TCCCGGGCCTTGTACACACCGCCCGTCACACCATGGGAGTGGGTTGCAAAAGAAGT
AGGTAGCTTAACCT
TCGGGAGGGCGCTACCACTTTGATCTTGTTCAG
GRFNALAPEATPQGHNLQIDIVYGVDYQDLYAFHRYTWNSTPLYKTLACQFRMQFPVII
NACTLRITAAAGTELAGASSAGNVNRQGVWTVSQFQCGWSSSQTS
Secondary Structure
>LC752325.1 Staphylococcus aureus S33 R gene for 16S rRNA, partial sequence
TTTATGGAGAGTTTGATCCTGGCTCAGGATGAACGCTGGCGGCGTGCCTAATACATG
CAAGTCGAGCGAA
CGGACGAGAAGCTTGCTTCTCTGATGTTAGCGGCGGACGGGTGAGTAACACGTGGA
TAACCTACCTATAA
GACTGGGATAACTTCGGGAAACCGGAGCTAATACCGGATAATATTTTGAACCGCAT
GGTTCAAAAGTGAA
AGACGGTCTTGCTGTCACTTATAGATGGATCCGCGCTGCATTAGCTAGTTGGTAAGG
TAACGGCTTACCA
AGGCAACGATGCATAGCCGACCTGAGAGGGTGATCGGCCACACTGGAACTGAGACA
CGGTCCAGACTCCT
ACGGGAGGCAGCAGTAGGGAATCTTCCGCAATGGGCGAAAGCCTGACGGAGCAAC
GCCGCGTGAGTGATG
AAGGTCTTCGGATCGTAAAACTCTGTTATTAGGGAAGAACATATGTGTAAGTAACTG
TGCACATCTTGAC
GGTACCTAATCAGAAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTA
GGTGGCAAGCGTTA
TCCGGAATTATTGGGCGTAAAGCGCGCGTAGGCGGTTTTTTAAGTCTGATGTGAAAG
CCCACGGCTCAAC
CGTGGAGGGTCATTGGAAACTGGAAAACTTGAGTGCAGAAGAGGAAAGTGGAATTC
CATGTGTAGCGGTG
AAATGCGCAGAGATATGGAGGAACACCAGTGGCGAAGGCGACTTTCTGGTCTGTAA
CTGACGCTGATGTG
CGAAAGCGTGGGGATCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGA
TGAGTGCTAAGTGT
TAGGGGGTTTCCGCCCCTTAGTGCTGCAGCTAACGCATTAAGCACTCCGCCTGGGGA
GTACGACCGCAAG
GTTGAAACTCAAAGGAATTGACGGGGACCCGCACAAGCGGTGGAGCATGTGGTTTA
ATTCGAAGCAACGC
GAAGAACCTTACCAAATCTTGACATCCTTTGACAACTCTAGAGATAGAGCCTTCCCC
TTCGGGGGACAAA
GTGACAGGTGGTGCATGGTTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCC
CGCAACGAGCGCA
ACCCTTAAGCTTAGTTGCCATCATTAAGTTGGGCACTCTAAGTTGACTGCCGGTGAC
AAACCGGAGGAAG
GTGGGGATGACATCAAATCATCATGCCCCTTATGATTTGGGCTACACACGTGCTACA
ATGGACAATACAA
AGGGCAGCGAAACCGCGAGGTCAAGCAAATCCCATAAAGTTGTTCTCAGTTCGGAT
TGTAGTCTGCAACT
CGACTACATGAAGCTGGAATCGCTAGTAATCGTAGATCAGCATGCTACGGTGAATA
CGTTCCCGGGTCTT
GTACACACCGCCCGTCACACCACGAGAGTTTGTAACACCCGAAGCCGGTGGAGTAA
CCTTTTAGGAGCCA
GCCGTCGAAGGTGGGACAAATGATTGGGGTGAAGTCGTAACAAGGTAGCCGTATCG
GAAGGTGCGGCTGG
ATCACCTCCTTT
MHHLSLCPPKGKALSLELSKDVKIWLSPIAEDSLLLPPVGVWTVSQFQCGRSPSQVGYA
SLPX
Secondary Structure
CTTATTTGGCAGCTGCTGCCTCACCCACAGCTTTTGATATCTAGGAGGACTCTTCTCT
CCCAAACTACCT
GTCACCATGGCCCACCGATTTCCAGCCCTCACCCAGGAGCAGAAGAAGGAGCTCTC
AGAAATTGCCCAGA
GCATTGTTGCCAATGGAAAGGGGATCCTGGCTGCAGATGAATCTGTAGGTACCATG
GGGAACCGCCTGCA
GAGGATCAAGGTGGAAAACACTGAAGAGAACCGCCGGCAGTTCCGAGAAATCCTCT
TCTCTGTGGACAGT
TCCATCAACCAGAGCATCGGGGGTGTGATCCTTTTCCACGAGACCCTCTACCAGAAG
GACAGCCAGGGAA
AGCTGTTCAGAAACATCCTCAAGGAAAAGGGGATCGTGGTGGGAATCAAGTTAGAC
CAAGGAGGTGCTCC
TCTTGCAGGAACAAACAAAGAAACCACCATTCAAGGGCTTGATGGCCTCTCAGAGC
GCTGTGCTCAGTAC
AAGAAAGATGGTGTTGACTTTGGGAAGTGGCGTGCTGTGCTGAGGATTGCCGACCA
GTGTCCATCCAGCC
TCGCTATCCAGGAAAACGCCAACGCCCTGGCTCGCTACGCCAGCATCTGTCAGCAGA
ATGGACTGGTACC
TATTGTTGAACCAGAGGTAATTCCTGATGGAGACCATGACCTGGAACACTGCCAGTA
TGTTACTGAGAAG
GTCCTGGCTGCTGTCTACAAGGCCCTGAATGACCATCATGTTTACCTGGAGGGCACC
CTGCTAAAGCCCA
ACATGGTGACTGCTGGACATGCCTGCACCAAGAAGTATACTCCAGAACAAGTAGCT
ATGGCCACCGTAAC
AGCTCTCCACCGTACTGTTCCTGCAGCTGTTCCTGGCATCTGCTTTTTGTCTGGTGGC
ATGAGTGAAGAG
GATGCCACTCTCAACCTCAATGCTATCAACCTTTGCCCTCTACCAAAGCCCTGGAAA
CTAAGTTTCTCTT
ATGGACGGGCCCTGCAGGCCAGTGCACTGGCTGCCTGGGGTGGCAAGGCTGCAAAC
AAGGAGGCAACCCA
GGAGGCTTTTATGAAGCGGGCCATGGCTAACTGCCAGGCGGCCAAAGGACAGTATG
TTCACACGGGTTCT
TCTGGGGCTGCTTCCACCCAGTCGCTCTTCACAGCCTGCTATACCTACTAGGGTCCA
ATGCCCGCCAGCC
TAGCTCCAGTGCTTCTAGTAGGAGGGCTGAAAGGGAGCAACTTTTCCTCCAATCCTG
GAAATTCGACACA
ATTAGATTTGAACTGCTGGAAATACAACACATGTTAAATCTTAAGTACAAGGGGGA
AAAAATAAATCAGT
TATTGAAACATAAAAATGAATACCAAGGACCTGATCAAATTTCACACAGCAGTTTCC
TTGCAACACTTTC
AGCTCCCCATGCTCCAGAATACCCACCCAAGAAAATAATAGGCTTTAAAACAATATC
GGCTCCTCATCCA
AAGAACAACTGCTGATTGAAACACCTCATTAGCTGAGTGTAGAGAAGTGCATCTTAT
GAAACAGTCTTAG
CAGTGGTAGGTTGGGAAGGAGATAGCTGCAACCAAAAAAGAAATAAATATTCTATA
AACCTTCAGCTGCT
ATCGGGTTTCACTTTTCTGCTCTTGCTGTCCAAAGACTCAGTGTATTTCATTACTTTTG
ACTCTACTAGA
CATGACTGGGTTTCAACAGTAAAGGTCTTCAACTCTTGCTAGTCATTGGAATCAAGC
CGCAAAATTTTAA
AAACTGAGATGCTCAGGCCACACCCCAGCTCAATTAAATCAGAAACCCTAGACTTG
GGATCCTCTAACTA
TTAGATTTCTTAAAGCTCCCTCAGTAATTCCAATGTACAGTCAAGTTTGAGAACTAC
CAATCTAAATTTC
AAGTTTGAGGGTATTTGAAAATTAAAGCCATTCACAATACGAAGCCAGCTAAAAAT
GTAGAATGATTTTG
AGCAACTTGTGGAGTATAATAAGAGAATTAATGTGACTTCAATGCTTGGAGCATTCT
TGTTCAAGTGGCC
CAGGTTTGGTGAAACAGGACTACCTTGTCATCTGCACGTCCAGGCATATTTCGTAGT
TTTGCAGTAAATA
ATATTCACATAATGATACTGTATTGACTTTCAATTTTCAGAATTAACCTATAGTTACA
GCACTTAAGACA
ACCAGAGTTATAAAAGAGAATTTAAATATTATAACTTTGGACAATATAAAAGTGAT
GATTTAACTGACAG
AAGCTAGGAAATATAAGGGGGAGGAGAAGTGGAAGAAAGCAAAGAGAGTCAGGAA
TACTACTTAAAACTG
ATGGGTTAAGAAATAGTGCTTTAATTCTATTTAAGTAATAAAAGAAATGGATGTAAA
TCATAAAAATATA
TATCTAAAATTAAAATATTGATGGTAGTATGCTAAATTTC
Secondary Structure
8. Macaca mulatta gene for MHC class I antigen (Mamu-G gene), isolate K721, allele Mamu-G*04_nov
>LR990779.1 Macaca mulatta gene for MHC class I antigen (Mamu-G gene), isolate K721, allele Mamu-G*04_nov
ATGGTGGTCATGGCGCCCCGAACCCTCCTGCTGCTGCTCTCGGGGGCTCTGGCCCTG
ACAGAGACCTGGG
CGAGTGAGTGCGGGATCGGGAGATGGCCTCTGGGGGGAGGGCCAGGGGCCCGCCCG
ACGGTGGTGCAGGA
CCCGGGGAGCCGCGCGGGGAGGAGGGTCGGGCGGGTCTCAGCCCCTCCTCGCCCCC
AGGCTCCCACTCCA
TGAGGTACTTCAGCGCTGCCGTGTCCCGGTCAGGCCGCGGGAAGCCCCGCTTCATCG
CGGTGGGCCACGT
GGACGACACGCAGTTCGTGCTGCTCGATAGCGACGCTGCCAGTCCGAGGATGGAGC
CGCGGGCGCCGTGG
GTGGAGCAGGATGGGCCGGAGTATTGGGAAGAGGAGACACGGATCGCCAAGGCCC
ACGCACACACTGACA
GAGTGAACCTGCGGACCCTGCGCAGCTACTACAACCAGAGCCAGGCCGGTGAGTGA
CCCCGGCCTGGGGC
GCAGATCACGACCCCCCACCTCCATGCCCTGCGGACGACCGGGGTACCCCCGAGTCT
CCAGGTCTGAGAT
CCACCCCGAGGCCGCGGGACTCGCCCAGACCCTCTACCTGGGAGAAGCCCACGCGC
CTTTACCAAAATCC
CTGCGGGTTGGTCCGGGAGGGGGCGAGGTTCGGTGGGCGGGGCTGACCGAGGGGGC
GGGGCCAGGGTCTC
ACACCCTCCAGTGGATGATTGGTTGCGACCTGGGGCCCGACGGGCGACTCCTCTTCC
GGTGTGAACAGTT
CGCCTACGATGGCAAGGATTACCTCGCCCTGAACGAGGACCTGTGCTCCTGAACCGC
AGCGGACCCTGTG
GCTCAGATCTCCAAGCGCAAGTGTGAGGCGGCCAAAGCGACTGAACGAAGGAGAGC
CTACCTGGAGGGCA
CGTGAGTGGAGTGGCTCCACAGATACCTGGAGAATGGGAAGGAGATGCTGCATCGT
GCGGGTACCAGGGG
CCATGGGGCACCTCCCCGATCTCCTGTAGACCTCCCAGGCTGGCCTAGCACAAGGAG
AGGAGGAAAATGG
GACCAACACTAGAATACGCCCTCCCTCTGGTCCTGAGGGAGAGGAATCCTCCTGGGT
TTCCAGATCCTGT
ACCAGAGAGTGACTCTGAGGGCCCGCCCTGCTCTCTGGGACAATTAAGGGATGAAG
TCTCTGAGGGAGTG
GAGGGGAAGACAATCCCTGGAATACTGATCAGGAGTTCCCTTTGACCCCACAGCAG
TCTTAGGCACCAGG
ACTTTTCCCCTCAGGCCTTGCTCTCTGCCTCACACTAAATGTGTGTGGGAGTCTGACT
CCAGCTCCTCTG
AGCCCTTTGGCCTTCACTCAGGTCAGAATCGGAAGTCCCTGCTCCCCCGCTCTGAGA
TTAGAACTTTCCA
AGTATTAGGAGATTATCCCAGGTGCCCGTGTCCAGGCTGGTGTCTGAGTTACGTGCT
CCCTCCCCCCACC
CCATCCCGCCAGGTATCTGGTTCATTCTTAGGACGGTCACATACTGGTGCTGCTGGA
GTGTCCCACGAGA
GATGCAAAGTGCCTGAGTTTTTGGACTTTTCCTTTCAGAACCCCCCAATTCACACGTG
ACCCACCAACCT
GTCTTTGACTATGATGCCACCCTGAGGTGCTGAGCCCTGGGCTTCTACCCTGTGGAG
ATCAGACTGACCT
GGCAGTGGGAGAGTGAGGACCAGACCCACGACGTGGAGCTCGTGGGGACCAGGCCT
GCAGGGGATGGAAC
CTTCCAGAAGTGGGCAGCTGTGGTGGTGCCTTCTGGAGAGGAGCAGAGATTCACGT
GCTATGTGCAGCAC
CAGGGGCTGCCTGAGCCCCTCATGCTGAGATGGAGTAAGGAGGGAGATGGAGGTGT
CATGTCTCTTAGGG
AAAGCAGGAGCCCCTCTGGAGACCTTTAACAGGGTCGGTGGTGGGGCCTGGGGTCA
GAGACCCTCACCTT
CCCCTCCTTTCCCAGAGTAGTCTTTCCAGCCCACTATCCTCATCATGGGCATCATTGC
TGGACTGGTTGT
CTTTACAGCTGTTGTCACTGAAGCTGTGGTCACTTTTGTGCTGTGGAGGAAGAAGAG
CTCTGAGTTTTTT
TGTCCCACTGAGGGTTCCAAGCCCCAGGTAGAAATGCCCTGCCTGGTTACTGGGAAG
CACCATCTACACT
CATGGGCCGACCCAGCCTACGCCCTGTGTGCCAGCACTTACTCTTTTGTAAAGCACA
TGTGACAAAGAAG
CACAGATTTATCACCTTGATGATTGTAGTGATGGGGACCTGATCCCAGTAATCACAG
GTCAGGGGAAGGT
CCCTGGCTAAGGACAGACCTTAGGAAGGCAGTTAGTCCAGGACCCACATCTGCTTTC
CTTGTTTTCCCTT
ATCCTGCCCTGGGTCTGCAGTCACACATTTCTGGAAACTTCTCTGGGGTCCAAGACT
AGGAGTTTCCTCT
AGGGTCTCATGGTGCTGCCACCTTTCTGACCTCTCAAAGGACATTTTCTTCTCACAGA
TAGAAAAGGAGG
GAGCTACTCTCAGGCTGCAAGTAAGTATGAAGGAGGCTGATCCCTGAGATCCTTGG
GATCTTGTGGTTGG
GAGCCCATGGGGGAGCTCACTCCGGCAATATTTCCTCCTCTGGCCATATTTCCTGTG
GGCTCTGACCAGG
TCCTGTTTTTGTTGTACCCCAGGCAGTGACAGCATCCAGGGCTCTGATGTCTCTCATG
GCTTGTAAATGT
GAGACCCTGGGGGCCTGATGTGTGTGGGTTTTTGGGGGGAACAGTGGACACAGCTG
TGCCATGAGGTTTC
TTTGACTTGGATGTATTGAGCATGTGCTGGGCTGTTTGAAGTGTCACCCCTCACTGTA
ACAGATATGAAT
TTGTTCATGAATGTTTTTCTGCAGTCTGA
MAPRTLLLLLSGALALTETWASSHSMRYFSAAVSRSGRGKPRFIAVGHVDDTQFVLLDS
DAASPRMEPRAPWVEQDGPEYWEEETRIAKAHAHTDRVNLRTLRSYYNQSQAADPVA
QISKRKCEAAKATERRRAYLEGTLTWQWESEDQTHDVELVGTRPAGDGTFQKWAAVV
VPSGEEQRFTCYVQHQGLPEPLMLRWTVVTEAVVTFVLWRKKSSEFFCPTEGSKPQIEK
EGATLRLQ
Secondary Structure
GTTAATAGGGAAATCTGTTTATGAAAAAGTTAGCAGTTATTATTTCGAGTATGCTTC
TGTCAACTGCCGC
GACTGCTGCGGATAGTTACCAATCAATTAGCCATTTAGGATACAAGGACACCGATG
GCAACGACACTGTA
AGTGTTGATTCAACTTATTACTTCGCACCAAAGAAAACGATGGGTCCATACGACCAG
TTTGAATACATCA
ACAGAACAACTAACGTGTTTGGTTCGTACGCAGATGATGACTTTGGTGATGTGACTA
ACATCGGCGGTGA
GTACTTCGTACAAGATTTTGTAATCGGTGCGGGTTACAGCAATTACGATTACGGTTC
AGACACTGATTTA
TTCAATGTGTCTGCGGGTTACTTCTTTAACCCTAACCTTCTTCTAAAAGCGACTTTCA
CTGACGTTGAAG
ATGGTGATAACTATGTAATGTTCGACCTTAAGTACAATCATCAAATTAACAGCACTG
ATTACTTAGGTTT
CACGTTCACTGCAGACGACGAGTTTGATTATCGTGCGGTTAGCGCTAAATACTTTAT
GGATCTTCAACAA
GGTAATTACCTAACGATTGAAGGTACAATTGCTGATACTGATGATAGTGGTAGCTCT
TGGGAACTAGGTT
CTAACTACTACTTCTCTAAAGCAACATCAGTATTTGTAACGTTTAATAAAGAAGATG
ACTACAGCTTTGG
TGCTCAGCACTTCTTTAATAAAAACGTTGGCTTAAAAGCTGGTTATGCAAACAACTG
GGATGACTCAGAC
TACGATGCATACTTTGCAAACCTAAGCTTACAGTTTTAATCATTAATTTGATGATATA
AAAAAGCCCGCA
TTATGCGGGCTTTTTCGTTTTTATCATTACCAACCGCACTGGCGGCCTTCATTTTGCT
CATCAAGGTATT
TTTGTAACGGTGCAAAGTAATCTAAGATTGCTGTTGCATCCATTTCTTCTTTACCAGT
GATTGTCGCAAG
TGCTTCTTGCCATGGGCGGCTTGAGCCCATTTCTAACATAGCATTTAGTTTTTCACCA
GCCTCAGCAGAG
TTGTATACAGAACAACGGTGAATCGCTTCTTCGTTACCGGCAATTTCACATAAACTT
CTGTGGAAGTCGA
ATTGTAAAATGTGCGCTAAGAAATAACGTGTGTAAGGCGTGTTGCCTGGTACGTGAT
ACTTAGCACCTGG
ATCGAAGTCTGCTTCGCTACGTGCGATAGGCGCTTGTACACCTTGGTATTTTTCACGT
AATTCCCACCAC
GCTTTGTTGTAGTTCTCTGGCGTTACTTCACCTGAGAAAACTTTCCAGCGCCATTGGT
CTACTAATAAAC
CAAATGGGATAAATGCTACTTTGTCTAATGCCATTTTCATTAACAGGCCGATATCTTT
AGATTCATCAGG
CACGTCGTCTAATAAACCAATTTCTTTTAAATAACCTGGCGTTACAGAAAGTGCAAT
GGTGTCACCAATC
GCTTCGTGGAAACCATCGTTCGCACTTTCTTGATAATAAATAGGCTGTGTGTTGTAG
GCGCGTTGGTAGA
AGTTATGCCCTAGTTCATGGTGAATAACAGAGAATTCTTCACCTGTGCGTTGAATAC
ACATCTTAATACG
TAGGTCGTCTTTGCTATCAATATTCCATGCAGATGCATGACATTGTACGTCACGGTCT
TGCGGTTTAGTG
AATAGCGAGCGCTCATAGAAGGTATCAGGCAGTGGTGCAAAGCCCATTGAGGTGAA
GAACTTCTCAGCAC
CACGTACCATTTTAAGTTCGTCGTAATCGTGTTCAGCTAATAGTTCTGTTACATCATA
ACCAGGATCGGC
ATTTTCTGGTGCAACAACGTCGTAAATGTTACCCCATGTTTGGGCCCACATATTACCT
AGAAGGTGAGCT
GGGATAGGTTGATCTTGCGGGACTTTATCTTCGCCATATTTTTCACCTAGCTTGGCAC
GAACATGACAAT
GTAATGAATCATAAAGCGGTTTAACTTGGCCCCAAATGCGGTCGAGTTCTTTTGCAA
AATCATCGGCTGG
CATGTCGTATTTACTACGCCACATAGCACCTGTATCTGCATAACCTAGCTCTTTTGCA
CCTTCATTGGTG
AGTGCAACTTGTTGCTCATAAAGCGGGCGCATAGGTTTTGAAACTTGTCGCCAGCCT
TGCCATAAATCAA
GCAATTCATTGTAATCACGGCTCGTTGCCATTTTAGCAGTCATCTCGCCAAGGCTTA
AACAGCTACCATC
TTCTTTACAGTATTTACCTTTACCATAAATACCCCCTAGCTCAGCCACTAATTGTGAG
AGCTTAGCTGTT
TTTTCAGCATCTTGTGGAGCAGGTAAGGTAAGTGCGAGTTTAAGTTTATCGAGTTTA
CGACGTGCATCGT
AATCAAGCTCTAAACTATCGAACTTTGCGGCTTCATTAGCAAGACGAACCACAGCTT
CTGTCATTTTGCG
ATTTACTTCAGCAGATAGTTCCGCAGTATCATGGGTGATGAAGTTGGCATAAATCCA
CTCAGCACGACTT
GCTTCTAAATAAAGCGCGCTTAATTCTTTTTCAGTATCGGCGATAAATTTAGCGGCA
TCTTGTGCGGTTA
CTTGGCTTGTTTTGACTTCAGCAGTCGTTGTTTTAGTGCTCGCAGTGTCATCATTACA
GCCTGTAAGTGC
TAGTGCACTGGCGACCATAAGCGCGCTGATGCTGAGCTTAAATGGAGTCTTTTTCAT
AACGTTCCCTAGA
AGTTTATTGTTATTTATTGCCCATGCATAATAGCAGTGTCTTAGTAAACAACAAACT
GCCTAGTTGAAAA
CAAGCTAAGTAAGCGCTGCTT
MLLSTAATAADSYQSISHLGYKDTDGNDTVSVDSTYYFAPKKTMGPYDQFEYINRTTN
VFGSYADDDFGDVTNIGGEYFVQDFVIGAGYSNYDYGSDTDLFNVSAGYFFNPNLLLK
ATFTDVEDGDNYVMFDLKYNHQINSTDYLGFTFTADDEFDYRAVSAKYFMDLQQGNY
LTIEGTIADTDDSGSSWELGSNYYFSKATSVFVTFNKEDDYSFGAQHFFNKNVGLKAGY
ANNWDDSDYDAYFANLSLQF
Secondary Structure
10. Obainia sp. SVM-2017 18S ribosomal RNA gene, partial sequence
>KU561101.1 Obainia sp. SVM-2017 18S ribosomal RNA gene, partial sequence
CTATAATTTACTTGATCTTGATATCCTACGTGGAATAACTGTGGTAATTCTAGAGCTA
ATACATGCACCA
AAGCTCCGACTTTCGAAAGAGCGCATCTATTAGATTAAAACCAATCAGGTTCCGGCC
TGTAAATTGGTGA
CTCTGAATAGCTAAGCTAATCGTATGGTCTTGCACCGACGATGTATCTATCAAGTAT
CTGCCTTATCAAC
TTTCGATGGTAGTTTATGTGCCTACCATGGTTGTAACGGGTAACGGAGAATAAGGGT
TCGACTCCGGAGA
GGGAGCCTGAGAAACGGCTACCACATCCAAGGAAGGCAGCAGGCGCGCAAATTACC
CACTCTCAGCAAGA
GGAGGTAGTGACGAAAAATAACGAGACCGTTCTCTTTGAGGCCGGTTATCGGAATG
GGTACAATTTAAAT
CCGTTAACGAGGATCTATGAGAGGGCAAGTCTGGTGCCAGCAGCCGCGGTAATTCC
AGCTCTCAAAGTGT
ATATCGTCACTGCTGCGGTTAAAAAGCTCGTAGTTGGATCTGCGCATCAGGACCCGG
TCCGCCCACTGGG
TGTGAACTGGGTTCCTGAGCTTGTACTGCTGGTTTTCCCTACGTTGCCTTCATCGGTC
GCGTAGGGTGGC
TAGCGAGTTTACTTTGAAAAAATTAGAGTGCTTCACGCGGGCTATTGTCTGAATACT
CGTGCATGGAATA
ATAGAATAGGACCTCGGTTCTATTTTGTTGGTTTTCTGATCTGAGGTAATGGTTAAGA
GGGACGGACGGG
GGCATTCGTATCGCTGCGTGAGAGGTGAAATTCTTGGACCGTAGCGAGACGTCCGAC
TGCGAAAGCATTT
GCCAAGAATGTCTTCATTAATCAAGAACGAAAGTCAGAGGTTCGAAGGCGATCAGA
TACCGCCCTAGTTC
TGACCGTAAACGATACCAACTAGCGTTCCGTCGGCGGTAAATACGCCTTGGCGGGC
AGCTTCCCGGAAAC
GAAAGTTTTTCGGTTCCGGGGGAAGTATGGTTGCAAAGCTGAAACTTAAAGAAATT
GACGGAAGGGCACC
ACCAGGAGTGGAGCCTGCGGCTTAATTTGACTCAACACGGGAAAACTCACCTGGCC
CGGACACCGTGAGG
ATTGACAGATTGAGAGCTCTTTCTTGATTCGGTGGTTGGTGGTGCATGGCCGTTCTTA
GTTGGTGGAGTG
ATTTGTCTGGTTTATTCCGATAACGAGCGAGACTCTAGCCTACTAAATAGTCACTGG
ATAAAAAAGTCCA
GACGACTTCTTAGAGGGACAAGCGGTGTTCAGCCGCACGAAGTTGAGCAATAACAG
GTCTGTGATGCCCT
TAGATGTCCAGGGCTGCACGCGCGCTACACTGGAGGAATCAGCGTGCTGTAACCATT
GCCGAAAGGCATT
GGTAACCCCTTGAAAATCCTCCGTGATCGGGATCGGGAATTGCAATTATTTCCCTTG
AACGAGGAATTCC
TAGTAAGTGTGAGTCATCAGCTCACGTTGATTACGTCCCTGCCCTTTGTACACACCG
CCCGTCGCTGCCC
GGGACTGAGCCGTTTCGAGAAAAGCGGGGACTGCTGTTTCGATACCTTTCGGGGTGG
AGATTCTTTGGTG
GAAACCGCCTTAATCGCAGTGGCTTGAACCGGGCAAAAGTCGTAACAAGGTTTCC
MHHQPPNQERALNLSILTVSGPGSLITVMNH
Secondary Structure
PHYLOGENETIC ANALYSIS
[Figure: phylogenetic tree; image not recovered. Tip labels as they appear in the tree:]
Trichonephila clavipes isolate Nep-004 scaffold 48759 whole genome shotgun sequence
UNVERIFIED: Oryza sativa isolate Qitougu cultivar Qitougu long and barbed awn 1 (LABA1) gene partial sequence
Escherichia coli strain 126 NODE 128 151 length 2297 cov 48.9748/1-2297 whole genome shotgun sequence
Macaca mulatta gene for MHC class I antigen (Mamu-G gene) isolate K721 allele Mamu-G*04 nov
Klebsiella pneumoniae IRQBAS103 gene for 16S rRNA partial sequence
Node values shown in the tree: 38, 22. Scale bar: 0.50.