0% found this document useful (0 votes)
10 views

Bioinformatics Assignment

This document contains two sections. The first section provides the mRNA sequence and amino acid sequence for Abelmoschus esculentus chalcone synthase (CHS). The second section provides a partial gene sequence for Oryza sativa isolate Qitougu cultivar Qitougu long and barbed awn 1 (LABA1) gene.
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
10 views

Bioinformatics Assignment

This document contains two sections. The first section provides the mRNA sequence and amino acid sequence for Abelmoschus esculentus chalcone synthase (CHS). The second section provides a partial gene sequence for Oryza sativa isolate Qitougu cultivar Qitougu long and barbed awn 1 (LABA1) gene.
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 29

1.

Abelmoschus esculentus chalcone synthase (CHS) mRNA, complete cds

>KF246682.1 Abelmoschus esculentus chalcone synthase (CHS) mRNA, complete cds

AAAGAATGGTCACGGTCGAGGAAGTTCGTAAGGCTCAACGCGCCGAAGGGCCGGCC
ACCGTGTTGGCGAT

CGGCACGTCGACTCCACCAAACTGTGTTGATCAAAGCACATACCCGGACTACTATTT
CCGCATCACAAGT

AGCGAGCACAAGACGGAGTTGAAAGAGAAATTCAAGCGCATGTGTGAAAAATCCAT
GATCAAGAAGCGTT

ACATGTACCTAACGGAGGAGATTTTGAAAGAGAACCCCAACGTTTGTGAGTACATG
GCACCATCGCTTGA

CGCAAGGCAAGACATGGTGGTGGTTGAGGTGCCAAAGCTAGGCAAAGAAGCGGCC
ACCAAGGCAATTAAG

GAATGGGGCCAGCCGAAGTCCAAGATCACCCACCTAGTCTTCTGCACCACCAGTGGT
GTCGACATGCCCG

GGGCCGACTACCAGCTCACCAAGCTCTTGGGTCTCCGTCCGTCCGTTAAGCGTCTCA
TGATGTACCAACA

GGGTTGTTTCGCGGGCGGTACTGTTCTTCGTGTGGCCAAGGATTTGGCCGAGAACAA
CAAGGGTGCTCGT

GTTCTTGTTGTTTGCTCGGAAATCACCGCGGTTACTTTCCGTGGACCGAGTGATACTC
ACTTGGATAGTC

TTGTGGGACAAGCATTGTTTGGTGATGGTGCTGCTGCTGTTATAATTGGTGCTGATCC
AATACCGGAGAT

CGAAAAACCTATGTTCGAACTTGTATCGGCGGCACAAACGATATTGCCGGATAGCG
ACGGTGCTATCGAC

GGTCACCTTCGTGAAGTCGGGCTTACATTTCACCTCCTCAAGGATGTTCCGGGACTA
ATTTCAAAGAACA

TTGAGAAGAGCCTAGTTGAAGCATTTCAGCCTTTGGGCATATCCGATTGGAACTCTC
TCTTTTGGATAGC

GCACCCTGGTGGTCCAGCCATATTGGACCAAGTCGAAGCGAAACTAGCACTCAAGC
CCGAAAAGCTTCGA
GCCACTCGACATGTTCTTTCGGAATATGGGAACATGTCGAGTGCTTGTGTGTTGTTTA
TATTAGACGAGA

TGAGGAAGAGTTCGAAAGAAAATGGACTAGGCACCACGGGTGAAGGGCTCGAGTG
GGGTGTGCTGTTCGG

GTTCGGACCGGGGCTCACCGTCGAGACGGTGGTGCTCCATAGCGTCACTGCATAA

1232 bp : 51.47% C+G

Amino acid Sequence:

MVTVEEVRKAQRAEGPATVLAIGTSTPPNCVDQSTYPDYYFRITSSEHKTELKEKFKRM
CEKSMIKKRYMYLTEEILKENPNVCEYMAPSLDARQDMVVVEVPKLGKEAATAIKEW
GQKSKITHLVFCTTSGVDMPGADYQLTKLLGLRPSVKRLMMYQQGCFAGGTVLRVAK
DLAENNKGARVLVVCSEITAVTFRGPSDTHLDSLVGQALFGDGAAAVIIGADPIPEIEKP
MFELVSAAQTILPDSDGAIDDEMRKSSKENGLGTTGEGLEWGVLFGFGPGLTVETVVLH
SVTA

Secondary Structure:

Alpha helix (Hh) : 113 is 37.92%

Extended strand (Ee) : 53 is 17.79%

Random coil (Cc) : 132 is 44.30%

2. UNVERIFIED: Oryza sativa isolate Qitougu cultivar Qitougu long and barbed awn 1
(LABA1) gene, partial sequence

>KR456128.1 UNVERIFIED: Oryza sativa isolate Qitougu cultivar Qitougu long and barbed
awn 1 (LABA1) gene, partial sequence

TCTGAACAGTTAGCCCTTCAATTCTACTCCCAATATTTACTGATGGGATGGTCCAGA
AAATAAACTCCCG

CACTGCGCGCAATAACTTTTCAATGGAGTTGCAAAAGTCTGCTGCAACTTAATTTTG
CAACAGTAGTTGC

AATTTACAGCTAGCTCGTTGCGATGACAGTGCTTAGACTTGAAGCATAAATGTCCAA
AAGAATTTCAATC

GAGATGATGGATCATTGGACCCCCTATATATTACACTGGACCTGTAGATACATAATG
AGCATATAGGGAA
GTAAGTAAAGTGGCCGCCCCCCCTATATGCGTTTTCGCTCTGCCTAAATGCATCAGA
CTACATTCAGCTA

GTCAGGATTCCCTGAACGGGTTGCATGTGCATGTTTCTTCTCTTCTTCCTTTGCTCGG
AAGTCTGTCTAT

GTCATCTTGATGAACCCTAGCTAGTGTATAACAAGCAAGCAGGGTTGTGTCGCCGAC
ATGCATAAAGCTT

TTTTGGACATGCGGGCCTGCAGGCCGCAGTAAGAACGAGCCGGCATGGTCTCATAT
AAAATCTGAACTAA

CCTTTTGAGATAAGAAATCTGAAGTAGCTAGCACGAACGGTCTTCACTTCTGCTAAA
CAAACACCGGAAG

AGATCAACTAACAGAGACGGCATTATTAGTTTAAGTGATTCCAAGTGTTCTCCGGAT
CAATCATATTAAT

TTAGCCCTTCTTGATGCATGTCCTGCGTAATTAGCACATCACATTCAGCTTTCTGCTT
TCTATACACCCA

GTTAGGTCGAATCACACAACATGCAGCTACCAGGATATATATTATACCAACACAAA
ATAATCAGATGATC

AGATGGGGCCTACAAGATACCTATATATAACAGCAAAGCGGTAGTGCACTCAAGTG
TGAGCAAAGCCATG

CTGTAGTGGGATCTCTCTCCATCCATTCCTGCAGTGCCGCCACTGCTATAGCTGCGA
GAGCTATAGCTAG

AAAAATATCAACGTAGGAAGTGGGAGGAGAAGTAGATGATGGATACAGATCACACT
GAGATAATTAAGGA

GGGAGAGGCAGTAGTGGAAGCCATGGCTCTACTCAGTCTCGGTTCAGGAGGATATG
CGTCTTCTGCGGGA

GCAGCCAGGGCAAGAAGAAGAGCTACCAGGACGCAGCCGTTGAGCTTGGCAAGGA
GCTGGTACGTTCTAC

TATATTTCTCAATTCTCATATATGTAATCGTCTTGACAAAGCAGCTGCTTCATTTTTC
AACCTTGATCAT

GACATCCTTTCTGAGATCTTGCTTCTTTAATTTTCTCTCGTTAATCAAAAACACCATG
GCCATGGAGCTT
TTTACTGATATTCAGCACAACGCGCCTGGTGATATGTAGGTATATAGAGAGTACCTG
ACAAGGTTTTGGG

TTTCTTTTCAACGTCGTAATCTTGGAGTAATATAATTATCAAGCTGTGTAGACGTACA
CTTGCTGAACAA

GGGCTAAAATTTATCTCTCTATATATTTAAGTGACAATATATGCGACATTACTGCTA
GCTGTAGACGTAC

TCTTGCAGTGCATGCAGCTTGTTCTTAGTAGAAGATGAGAGCATATGCATGAGCTGA
GCACAGCATGTGT

AGGTCTGAATTGAATTTCTGTATCATGCAGTAAAACCCACTTGTATTACAAAGGAGA
ATGTCATGAGGGC

TTGCACATGAGCTTATTAATTACTTCACATGAAAGACAATATGTCTGTCGGTGAACA
TGGAATTTTATAT

TTATGCAGCTTGCATAGTTCCATAGAAAATGACTTACTCCCCAGGAACCTAGTACTG
GATTAGGTTATGA

TCTTGTATAAGGCTGCTATATTTTGAAACGGAGGGAGTACTAGCCAGCAAGTTACTA
CGCGCATTTGCAT

TTGTATGGTGACTTTTCGCCGGGATTTTCGGTGCTCAGCGCATTTTTCTCTATCCATTT
CGAGGCACGGT

ATAAATGCAAAAATGAAATAGACAAGTCGCATGGATCAGTCTTAAAGGAAACTATA
TATAGTGTGTACTG

TATGTAGTAGCTAGACTACATATTCTAGATCACTTGATGATATATATTATAGAAGAC
TAGATTCTAGAAT

GTGGAAATGCATCCCCACTCTCCATTATTATGGTTTCACCACAGGTAGCAAGGAACA
TTGATCTAGTGTA

TGGTGGAGGAAGTGTGGGGCTCATGGGCCTGGTCTCTCAAGCTGTCTACAATGGAG
GGAGGCATGTTATT

GGGTATGTAAAAACGTAATAATTGTTGATCATTCTTCAGCACTGATACATGGAAAGA
ATAACTCCGTATA

TAGTACACTTTGTAAGATGTCATCTTGTCATGACAATAGACTTGGATTATGGTTCCTT
TCCTCAGGTGCT
ACATGCTAGTACTAATATATAATTTGCCATTTTGTTGTCCCTATATACACGCAGGGTG
ATTCCCAAGACT

CTTATGCCTAGAGAGGTAAGCACGCTCATCTCCCTTCCAACAAGTCCTGACATAGTT
TATTCACATGCAA

CGATTCCCATGAGCTTGACATCATTGTTGCTGATCACCGGTTAAAATCTCATCATTTT
CATTGGAAATGT

CACTTGAAAAAAAATCTAGTTAATACTTATTGATCAAGCATGTTAAGTATTTTGTAA
GCATGATAGTATC

ACTAGATGAGATTGCTTTTTAGAAAAAAGTCAGTAGCACTGATCTAAAATTATATCA
CTTATAGGTATTA

GAAAAACATATATATATGGTATGTGGATACATACTACTACTGGAAAATTATGCTTGT
TCATTATCTTTCT

TGAATCTTGATAGTGGAAAGAAGAATGCGATCTTATCTGCTTACAAAAGATCGCATG
GAACGTTCAAAGA

AAATCTTGTTTTTCCTTTCCCTCGTAGATGAAATCCTGCTGATCTTGACTGCCGCTTC
TAGTGGCTGCCC

CTTTGTTCTTAAGGCCTTGATCCTTTAAGCAGATGCTGACTTTATTAATCAGAAAGAA
ATGTCCCTCACA

TTTTCTTTTTGTCCCCCAACAACAGCAACATGATTGCCTCTCCTGAACCTTTGCTGCG
TTGCTCTGATTA

AACTATACATGCTACCACTCCTGTTAAACAGTGCTCATCCCCTTGCTCGTTGGCTAGT
TTTTGCATGCTC

TCTTACTTTTTGTGGAGCAGAAAGCTGCAGGGTGCTCCTGTGTCTGTGCGAGAGGCT
TTGTCACATACGC

ATGCGTGTTTGCTTGTGTGTGCAGATTACGGGTGAGACAGTAGGGGAGGTGAAAGC
AGTGGCAGATATGC

ATCAGAGGAAGGCTGAGATGGCCAGGCAATCTGATGCGTTCATAGCACTGCCTGGT
TAGTCTCTCTCACC

ACCTGGATATATTTTTTTTAAGATAATGAAACCACCTGAATATATAGCTTCATCAACT
TGTGTTGAGTCG
ACATGTACTAATTTGCTTTACTTTTGTTGTCAATTGCCTTGTCATATTTTTTCCTTTAT
CATGTTATGAT

ACGTGGCTAGTGGAAGTGTGAGTGGAAAGTTGCGTAAAAGCAAGCACAAGCAGGGT
GCATGAGTTTGACT

TTGACCTGCTAAATTGGCTTGCATGCATGCTGCTGCCGCCCAGTCGGTGCAGAATCC
TTCTTTATACGCA

TGAGCTGGTCGATCAGATTGTCAGATACTCGGATAAGAGAGGACCTGTTGAGCCACT
TCGTGCTGATATT

TTCAGTATCGTTCGTAGCTTATCTTTTTCAGTTATTTGATTCTTTTTCCTAATCATCAG
CCAACTGAATT

TGTGATTCGCCCATCAGTCTTAAAAAAAAATTGTGATACGCCTTGTTAGCAAGGACC
AAATTTGGCCCAT

CCCCGAGGATGGGCTGGGAATTTTAACTAGATTCAGGACCATCCTAGTGAAATGGG
CTAGTATTTTTTCT

GCCTTCTCAAATCAAACCTGACACAACTTCATACTACTATGGATTTATGAGGTTCAA
ACTCAGCCTAGAA

ATATTCTGAAGATAATGCAAACATCCGACAATTATACCCAGGCAGATGCTGTATCCG
CAGATTCCCTCAA

TCAATACTCGTATTTCTGGAGGAGAGAGAGATCCAGAGCGAGCACTGTTCATTACTT
AAATTTGTTTACT

GCAGGTGGGTATGGAACACTTGAAGAGCTCCTGGAAGTAATTGCCTGGGCTCAGCT
CGGCATTCACGACA

AGCCGGTACATACTGAAATAGTTCATGATCAGCTTTTGCACATGCAACATATGTACA
CGCACTGATGAAC

AATGCACGTATATACACGGAGAGCCAAAACTTTTTTTTACCTTGGCTTAATGTGGAC
GATCGATGCTACG

TACAGGTTGGCCTGCTAAATGTGGACGGCTACTACAACTCTCTGCTGTCGTTCATCG
ATAAAGCTGTGGA

GGAAGAGTTCATCAGCCCCTCTGCGCGCCATATCATCGTGTTAGCTCCAACACCAAA
AGAACTTCTCGAG
AAGCTAGAGGTGTATATACTTATATATATAATCTATCGATTTTTCTGATGCATCATGG
CACACTGCAAAG

GACAGAAGAAAACGCCGGTTTCATCGGTGAATGAGAGATCGAGCTGAACTTTCCTC
TTGCTCGCATGCAG

GCGTACTCCCCTCGGCATGACAAGGTCGTGCCGAAGATGCAGTGGGAGATGGAGAA
GATGAGCTACTGCA

AGAGCTGCGAGATCCCTGGCCTGAAAGAAGGCAACAAGGCGACCATCCAAGCACAG
CGAGGAAGCATGCT

CTGAAATTTACTGTAGCACTAGCTAGCTATAGCTTAGCCAATGCGTGCAGCAAGAAG
ATTCAAAACTTCT

TGGTGCACATGAACTGCAACTTTTAATTCATGTAACTCGGTTCATTAGAAGGCAATT
GATCACTGATCAT

GCAATTAGTCGTGTACGTAGCCCTAAGAATGAAATCATCAGTAAGATTTGTAAATCA
CAGAGCTCCAGAG

CAGCAGACTGACATAAAACGATATGTGGGCTAGATCGATCTCCTCCGTGGGCTGGG
CCGGA

5031 bp : 42.06% C+G

Amino Acid Sequence

SRFRRICVFCGSSQGKKKSYQDAAVELGKELITGETVGEVKAVADMHQRKAEMARQSD
AFIALPGGYGTLEELLEVIAWAQLGIHDKPAYSPRHDKVVPKMQWEMEKMSYCKSCEI
PGLKEGNKATIQAQRGSML

Secondary Structure

Alpha helix (Hh) : 59 is 43.70%

Extended strand (Ee) : 23 is 17.04%

Random coil (Cc) : 53 is 39.26%


3. S.tuberosum mitochondrial DNA for ribosomal protein S10

>X74826.1 S.tuberosum mitochondrial DNA for ribosomal protein S10

GAATTCTTTCTCCCACACACCCTTTTTGCCCTCTTTCGCCGAGGAGGAAAGAATAATC
TTCCAAGCGGAC

AGAGACCTAAATTTCCATTAGATTCATTCCTAAGCTTGCTTTGTTGCAGCAAGATGA
TCAGTCCGAGAGT

GCTGGAGAGAAGAGAAAGCGGTAAAAACCTCTCTTATTCGGTCACCGAGAAGTCGG
ACGACTCTTCAGTA

ACCCAGGGTGATCCGACCCCTTCGACGCTTTTTTCGCTGTATACCCCCTCCATCCTTC
GGAGGTGGAAGA

AAGGGTACTCACATTTTAATACATAGTAGGGCCCCAGAACGCTAAAAGGTGGGGGA
ACAAGAGTTGTCAC

GATAGAAAAGAGAAAAAAAGAAATGACTATAAGGAACCAACGGCTCTCTCTTCTTA
AACAACCTATATCC

TCCACACTTAATCAGCATTTGATAGATTATCCAACCCCGAGCAATCTTAGTTATTGGT
GGGGGTTCGGTT

CGTTAGCGGGTATTTGTTTAGTCATTCAGATAGTGACTGGCGTTTTTTTAGCTATGCA
TTACACACCTCA

TGTGGATCTAGCTTTCAACAGCGTAGAACACATTATGAGAGATGTTGAAGGGGGCT
GGTTGCTCCGTTAT

ATGCATGCTAATGGGGCAAGTATGTTTTTCATTGTGGTTCACCTGCATATTTTTCGTG
GTCTATATCATG

CCAGTTATAGCAGTCCTAGGGAATTTGTTCGGTGTCTCGGAGTTGTAATCTTCCTATT
AATGATTGTGAC

AGCTTTTATAGGATATGTCCTACCTTGGGGTCAGATGAGCTTTTGGGGAGCTACAGT
AATCACAAGCTTA

GCTAGCGCCATACCTGTAGTAGGAGAATTAGCCGCCTTATTTATGGATTTTCCTCTG
GTATTGGGGAATG

GGGGGCACCCCTACAAGACTTTAATCTAAACATGCTGCCTGATTTGAATCTGCCCGC
GGAACCCGAGCCC
TATCAACCACTATTTCCCCAGGTTCCTCTTGGGACTCTGTCCTTAGAGGAGCAAAGG
GCAGTAGACGAAT

TAGTCCGCCTTGAAGGGCGGCTGATCCGGACCGCAAGATGTGTCTTGGAGTCGCTGG
GCTATTCGCCCCA

GCCGGGCGACATAAAAACTTTCGTACAGATCTTCATAGTGGACATCGACTCATCCGA
CTATGATGATCTT

GTCTTGGCTCTTATTGATGAAAAGTCGCCCATTTTTCTTCAATTTTTAGAAGAATGGG
AAATTTTTCTCG

CAGACAACAGCGCTCTGGTCGAAAATTTAGGGATCAATAAAATAAAATAACAACAC
CCACCTAACCTAAT

TAATGTGGTGTGGAAGGGCAAGCGATAGAAAAGTCCCTCGCGCGAACTACTACTAC
TAGAAAATGGTGAG

ACCATTAGTGAAAGAAGGACCAAGGAGGATCCTATCCTAAAGGAGAAGGAGGAGG
AGTAGGAGCTAGCTT

TAGGGGCGGAATCGAATGATTACGAGATAAACAATGAGACAAAGGAGAGCACTTA
GACGAGTCAGCCAAA

AAGAAAGACCACCAAAAGTAACGACCACCAAGATAGGCATAGTAATTCGATCTTTT
GATCACCCATTTTT

GGAAAACCATTTTTGGGGGCTTCCGCCTTACACACGGAAGATTGGATTGCCTGAATC
ACGAGTCTTATAT

ACTGTGTTACGATCACCTCATATTGATAAAAAGTCCAGAGAACAATTTTTTATGAAA
ATAAAGAAAGAAT

TTCTGGTCATAAAAACAGAAAGGCATGAATTGCGCAAGAAGTTCTTTCGGTTAAAAC
GCCGTGCGACTCG

GAGGACATAAGACTTCTTGGTCAAGCCAAAAAGATTGCCGACTGGATGCTCCTACCC
CACCATGCCTGGC

CCTTTATCTTACCTTAAGAAGAAAGAGGAGGTATGAAGCGTGGGACAGAATGCAAG
AAGTGTATGATACA

ACAGATAACCTTAAGGAGTGGCGGCAACCCTCTTGATTGATCAACGCGAGTGAACT
GTGCTTAGACGCTT
CGTAAAACCGCACCGATCTACGAGAGGAGCTGATAGATGAGTAGGCTTCCCCTTTCG
ATTCACGGAATGT

GATCTGGGACACGATGGGAGTTTGCGTGTCTCGGTAGGAAAGATATCACCGGAGTA
TAGCACAAGATCGC

CTTTTCTTGCTGCCATGGGGTCGACCTGTGAACAAGGTAAACCCAATGGGAACCAAA
AAACGGGGGGTAC

CGCATTGGGCAGAAGACGATCCAAAAAGCGAAGGCTCACCCAGCTGAAGGAAGGG
GGTGAGATAGGCAAG

AGGTAGCTTGCTTCCAAGCCGGCCCGGCCGCGAGGAATCAAGAGATCTTTGCCGGT
GCTGACTTGGATCT

CGGGTGACGGAATAAGGCGGCCAGAAGCGACGAGCAGTCGCGGTCGAAGCTTGGCT
CTAGCCGATAGGCT

AGCTGTAAGCTTGGCTACAGCGAACGCTTACCAAGCGCGAAGGAAAGGGCCTTCGC
CACTTGAGCCGTAT

GCGGGGGAACTCGCACGTGCGGTTCTTAGGGGGGGAGAGCTAGTAGGAGCCATCCC
ATCCCAATAGCGTA

TATTTGGAGCTCAATATGAAATCCTATTTTCTTGCAAGACCCGTTCGGATAAGGGAA
AACTCCAGAGATT

GCTTCGAAATAAGATCCTTGCGTTGACCCTTTCCTGAACCAGAACCGGGGGAGGATG
AGAGGAAAGGGGG

ATCAAAATGTCCCATCAATAAAGTTCGGGCTTTCCATTCTTCCTCCCCCCCCCTCACT
CCCCCTTTTTC

2799 bp : 46.91% C+G

Amino Acid Sequence

MSVLGPYYVLKCEYPFFHLRRMEGVYSEKSVEGVGSPWVTEESSDFSVTE

Secondary Structure

Alpha helix (Hh) : 3 is 6.00%

Extended strand (Ee) : 21 is 42.00%

Random coil (Cc) : 26 is 52.00%


4. Escherichia coli strain 126 NODE_128_151_length_2297_cov_48.9748/1-2297, whole
genome shotgun sequence

>NZ_JAEUYS010000057.1 Escherichia coli strain 126


NODE_128_151_length_2297_cov_48.9748/1-2297, whole genome shotgun sequence

AGACATAAAAATCATCGCCAGTTGCTTAAAGAAGCAGAAACAGACCAGATAAATCG
TCGCTGAAAAACGC

ACTTCAAACTGGCTGGTAATATATTTAAAGCAGCCCACCAGCAGGAACGGTACTTCA
AACATATGCAGCG

TTTTCAGAATAACCACTTCCAGTGCTGAGGTAGCGAACGATGAGCCAATAATACGTA
CTGACATAATAGT

GCCAGCCAGCAGCAGGGCGTTTTTCCCACCGATGCGATTAATGATCAGTGGCGCAA
AAAACATAATCGAG

GCGTTAAGTAATTCGCCCATTGTCGTTACGTAGCCAAATACCCGCGTACCCTGTTCA
CCGGTAGCAAAGA

ACGAAGTAAAGAAATTAGCAAACTGTTGGTCAAAAACATCGTAGGTGCAGGAAACG
CCAATAACATACAG

TGACAAAAACCACAGTTTTGGCTGTCTGAACAGTTCCAGCGCCAGCTTAAGGCTAAA
TGCCGAATGGTTG

GCACCTACCGCATTGGCAACCGTGGCAGAAGAGGGCGCATCCGTTTTGGCGAAAAA
GAGTAAAACGGCGA

GGATGAATGCACAGCCAGAACCCAGCCAGAAAACAAACTGATTATTGATGGTGAAC
ATGATGCCGACAAT

CGAGGCACACAGCGCCCAGCCAACACAGCCAAACATCCGCGCGCGACCAAATTCGA
AATTACTGCGACGG

CTGACTTTCTCAATAAATGCCTCTACTGCTGGCGCACCGGCGTTAAAACAAAAGCCT
AGATAAATACCAC

CAACAATCGATCCTACTAAAATGTTGTATTGTAACAGTGGCCCGAAGATAAAAATA
AAGAACGGCGCAAA

CATCACTAACATGCCGGTAATAATCCACAGCAGGTATTTGCGCAGCCCGAGTTTGTC
AGAAAGCAGACCA
AACAGCGGTTGGAATAATAGCGAGAACAGAGAAATAGCGGCAAAAATAATACCCG
TATCACTTTTGCTGA

TATGGTTGATGTCATGTAGCCAAATCGGGAAAAACGGGAAGTAGGCTCCCATGATA
AAAAAGTAAAAGAA

AAAGAATAAACCGAACATCCAAAAGTTTGTGTTTTTTAAATAGTACATAATGGATTT
CCTTACGCGAAAT

ACGGGCAGACATGGCCTGCCCGGTTATTATTATTTTTGACACCAGACCAACTGGTAA
TGGTAGCGACCGG

CGCTCAGCTGGAATTCCGCCGACACTGACGGGCTCCAGGAGTCGTCGCCACCAATCC
CCATATGGAAACC

GTCGATATTCAGCCATGTGCCTTCTTCCGCGTGCAGCAGATGGCGATGGCTGGTTTC
CATCAGTTGCTGT

TGACTGTAGCGGCTGATGTTGAACTGGAAGTCGCCGCGCCACTGGTGTGGGCCATAA
TTCAATTCGCGCG

TCCCGCAGCGCAGACCGTTTTCGCTCGGGAAGACGTACGGGGTATACATGTCTGACA
ATGGCAGATCCCA

GCGGTCAAAACAGGCGGCAGTAAGGCGGTCGGGATAGTTTTCTTGCGGCCCTAATC
CGAGCCAGTTTACC

CGCTCTGCTACCTGCGCCAGCTGGCAGTTCAGGCCAATCCGCGCCGGATGCGGTGTA
TCGCTCGCCACTT

CAACATCAACGGTAATCGCCATTTGACCACTACCATCAATCCGGTAGGTTTTCCGGC
TGATAAATAAGGT

TTTCCCCTGATGCTGCCACGCGTGAGCGGTCGTAATCAGCACCGCATCAGCAAGTGT
ATCTGCCGTGCAC

TGCAACAACGCTGCTTCGGCCTGGTAATGGCCCGCCGCCTTCCAGCGTTCGACCCAG
GCGTTAGGGTCAA

TGCGGGTCGCTTCACTTACGCCAATGTCGTTATCCAGCGGTGCACGGGTGAACTGAT
CGCGCAGCGGCGT

CAGCAGTTGTTTTTTATCGCCAATCCACATCTGTGAAAGAAAGCCTGACTGGCGGTT
AAATTGCCAACGC
TTATTACCCAGCTCGATGCAAAAATCCATTTCGCTGGTGGTCAGATGCGGGATGGCG
TGGGACGCGGCGG

GGAGTGTCACGCTGAGGTTTTCAGCCAGACGCCACTGCTGCCAGGCGCTGATGTGTC
CGGCTTCTGACCA

TGCGGTCGCGTTCGGTTGCACTACGCGTACTGTGAGCCAGAGTTGCCCGGCGCTCTC
CGGCTGCGGTAGT

TCAGGCAGTTCAATCAACTGTTTACCTTGTGGAGCGACATCCAGAGGCACTTCACCG
CTTGCCAGCGGCT

TGCCATCCAGCGCCACCATCCAGTGCAGGAGCTCGTTATCGCTATGACGGAACAGGT

2297 bp : 50.94% C+G

Amino Acid Sequence

MVALDGKPLASGEVPLDVAPQGKQLIELPELPQPESAGQLWLTVRVVQPNATAWSEAG
HISAWQQWRLAENLSVTLPAASHAIPHLTTSEMDFCIELGNKRWQFNRQSGFLSQMWIG
DKKQLLTPLRDQFTRAPLDNDIGVSEATRIDPNAWVERWKAAGHYQAEAALLQCTADT
LADAVLITTAHAWQHQGKTLFISRKTYRIDGSGQMAITVDVEVASDTPHPARIGLNCQL
AQVAERVNWLGLGPQENYPDRLTAACFDRWDLPLSDMYTPYVFPSENGLRCGTRELN
YGPHQWRGDFQFNISRYSQQQLMETSHRHLLHAEEGTWLNIDGFHMGIGGDDSWSPSV
SAEFQLSAGRYHYQL

Secondary Structure

Alpha helix (Hh) : 108 is 29.67%

Extended strand (Ee) : 75 is 20.60%

Random coil (Cc) : 181 is 49.73%

5. Klebsiella pneumoniae IRQBAS103 gene for 16S rRNA, partial sequence

>LC645191.1 Klebsiella pneumoniae IRQBAS103 gene for 16S rRNA, partial sequence

GCTACACATGCAGTCGAGCGGTAGCACAGAGAGCTTGCTCTCGGGTGACGAGCGGC
GGACGGGTGAGTAA

TGTCTGGGAAACTGCCTGATGGAGGGGGATAACTACTGGAAACGGTAGCTAATACC
GCATAACGTCGCAA
GACCAAAGTGGGGGACCTTCGGGCCTCATGCCATCAGATGTGCCCAGATGGGATTA
GCTAGTAGGTGGGG

TAACGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACAC
TGGAACTGAGACAC

GGTCCAGACTCCTACGGGAGGCAGCAGTGGGGAATATTGCACAATGGGCGCAAGCC
TGATGCAGCCATGC

CGCGTGTGTGAAGAAGGCCTTCGGGTTGTAAAGCACTTTCAGCGGGGAGGAAGGCG
ATAAGGTTAATAAC

CTTGTCGATTGACGTTACCCGCAGAAGAAGCACCGGCTAACTCCGTGCCAGCAGCCG
CGGTAATACGGAG

GGTGCAAGCGTTAATCGGAATTACTGGGCGTAAAGCGCACGCAGGCGGTCTGTCAA
GTCGGATGTGAAAT

CCCCGGGCTCAACCTGGGAACTGCATTCGAAACTGGCAGGCTAGAGTCTTGTAGAG
GGGGGTAGAATTCC

AGGTGTAGCGGTGAAATGCGTAGAGATCTGGAGGAATACCGGTGGCGAAGGCGGCC
CCCTGGACAAAGAC

TGACGCTCAGGTGCGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCC
ACGCCGTAAACGAT

GTCGATTTGGAGGTTGTGCCCTTGAGGCGTGGCTTCCGGAGCTAACGCGTTAAATCG
ACCGCCTGGGGAG

TACGGCCGCAAGGTTAAAACTCAAATGAATTGACGGGGGCCCGCACAAGCGGTGGA
GCATGTGGTTTAAT

TCGATGCAACGCGAAGAACCTTACCTGGTCTTGACATCCACAGAACTTTCCAGAGAT
GGATTGGTGCCTT

CGGGAACTGTGAGACAGGTGCTGCATGGCTGTCGTCAGCTCGTGTTGTGAAATGTTG
GGTTAAGTCCCGC

AACGAGCGCAACCCTTATCCTTTGTTGCCAGCGGTCCGGCCGGGAACTCAAAGGAG
ACTGCCAGTGATAA

ACTGGAGGAAGGTGGGGATGACGTCAAGTCATCATGGCCCTTACGACCAGGGCTAC
ACACGTGCTACAAT
GGCATATACAAAGAGAAGCGACCTCGCGAGAGCAAGCGGACCTCATAAAGTATGTC
GTAGTCCGGATTGG

AGTCTGCAACTCGACTCCATGAAGTCGGAATCGCTAGTAATCGTAGATCAGAATGCT
ACGGTGAATACGT

TCCCGGGCCTTGTACACACCGCCCGTCACACCATGGGAGTGGGTTGCAAAAGAAGT
AGGTAGCTTAACCT

TCGGGAGGGCGCTACCACTTTGATCTTGTTCAG

1433 bp : 54.92% C+G

Amino Acid Sequence

GRFNALAPEATPQGHNLQIDIVYGVDYQDLYAFHRYTWNSTPLYKTLACQFRMQFPVII
NACTLRITAAAGTELAGASSAGNVNRQGVWTVSQFQCGWSSSQTS

Secondary Structure

Alpha helix (Hh) : 16 is 15.24%

Extended strand (Ee) : 28 is 26.67%

Random coil (Cc) : 61 is 58.10%

6. Staphylococcus aureus S33 R gene for 16S rRNA, partial sequence

>LC752325.1 Staphylococcus aureus S33 R gene for 16S rRNA, partial sequence

TTTATGGAGAGTTTGATCCTGGCTCAGGATGAACGCTGGCGGCGTGCCTAATACATG
CAAGTCGAGCGAA

CGGACGAGAAGCTTGCTTCTCTGATGTTAGCGGCGGACGGGTGAGTAACACGTGGA
TAACCTACCTATAA

GACTGGGATAACTTCGGGAAACCGGAGCTAATACCGGATAATATTTTGAACCGCAT
GGTTCAAAAGTGAA

AGACGGTCTTGCTGTCACTTATAGATGGATCCGCGCTGCATTAGCTAGTTGGTAAGG
TAACGGCTTACCA

AGGCAACGATGCATAGCCGACCTGAGAGGGTGATCGGCCACACTGGAACTGAGACA
CGGTCCAGACTCCT
ACGGGAGGCAGCAGTAGGGAATCTTCCGCAATGGGCGAAAGCCTGACGGAGCAAC
GCCGCGTGAGTGATG

AAGGTCTTCGGATCGTAAAACTCTGTTATTAGGGAAGAACATATGTGTAAGTAACTG
TGCACATCTTGAC

GGTACCTAATCAGAAAGCCACGGCTAACTACGTGCCAGCAGCCGCGGTAATACGTA
GGTGGCAAGCGTTA

TCCGGAATTATTGGGCGTAAAGCGCGCGTAGGCGGTTTTTTAAGTCTGATGTGAAAG
CCCACGGCTCAAC

CGTGGAGGGTCATTGGAAACTGGAAAACTTGAGTGCAGAAGAGGAAAGTGGAATTC
CATGTGTAGCGGTG

AAATGCGCAGAGATATGGAGGAACACCAGTGGCGAAGGCGACTTTCTGGTCTGTAA
CTGACGCTGATGTG

CGAAAGCGTGGGGATCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGA
TGAGTGCTAAGTGT

TAGGGGGTTTCCGCCCCTTAGTGCTGCAGCTAACGCATTAAGCACTCCGCCTGGGGA
GTACGACCGCAAG

GTTGAAACTCAAAGGAATTGACGGGGACCCGCACAAGCGGTGGAGCATGTGGTTTA
ATTCGAAGCAACGC

GAAGAACCTTACCAAATCTTGACATCCTTTGACAACTCTAGAGATAGAGCCTTCCCC
TTCGGGGGACAAA

GTGACAGGTGGTGCATGGTTGTCGTCAGCTCGTGTCGTGAGATGTTGGGTTAAGTCC
CGCAACGAGCGCA

ACCCTTAAGCTTAGTTGCCATCATTAAGTTGGGCACTCTAAGTTGACTGCCGGTGAC
AAACCGGAGGAAG

GTGGGGATGACATCAAATCATCATGCCCCTTATGATTTGGGCTACACACGTGCTACA
ATGGACAATACAA

AGGGCAGCGAAACCGCGAGGTCAAGCAAATCCCATAAAGTTGTTCTCAGTTCGGAT
TGTAGTCTGCAACT

CGACTACATGAAGCTGGAATCGCTAGTAATCGTAGATCAGCATGCTACGGTGAATA
CGTTCCCGGGTCTT
GTACACACCGCCCGTCACACCACGAGAGTTTGTAACACCCGAAGCCGGTGGAGTAA
CCTTTTAGGAGCCA

GCCGTCGAAGGTGGGACAAATGATTGGGGTGAAGTCGTAACAAGGTAGCCGTATCG
GAAGGTGCGGCTGG

ATCACCTCCTTT

1552 bp : 51.10% C+G

Amino Acid Sequence

MHHLSLCPPKGKALSLELSKDVKIWLSPIAEDSLLLPPVGVWTVSQFQCGRSPSQVGYA
SLPX

Secondary Structure

Alpha helix (Hh) : 8 is 12.70%

Extended strand (Ee) : 16 is 25.40%

Random coil (Cc) : 39 is 61.90%

7. Homo sapiens aldolase, fructose-bisphosphate B (ALDOB), mRNA

>NM_000035.4 Homo sapiens aldolase, fructose-bisphosphate B (ALDOB), mRNA

CTTATTTGGCAGCTGCTGCCTCACCCACAGCTTTTGATATCTAGGAGGACTCTTCTCT
CCCAAACTACCT

GTCACCATGGCCCACCGATTTCCAGCCCTCACCCAGGAGCAGAAGAAGGAGCTCTC
AGAAATTGCCCAGA

GCATTGTTGCCAATGGAAAGGGGATCCTGGCTGCAGATGAATCTGTAGGTACCATG
GGGAACCGCCTGCA

GAGGATCAAGGTGGAAAACACTGAAGAGAACCGCCGGCAGTTCCGAGAAATCCTCT
TCTCTGTGGACAGT

TCCATCAACCAGAGCATCGGGGGTGTGATCCTTTTCCACGAGACCCTCTACCAGAAG
GACAGCCAGGGAA

AGCTGTTCAGAAACATCCTCAAGGAAAAGGGGATCGTGGTGGGAATCAAGTTAGAC
CAAGGAGGTGCTCC
TCTTGCAGGAACAAACAAAGAAACCACCATTCAAGGGCTTGATGGCCTCTCAGAGC
GCTGTGCTCAGTAC

AAGAAAGATGGTGTTGACTTTGGGAAGTGGCGTGCTGTGCTGAGGATTGCCGACCA
GTGTCCATCCAGCC

TCGCTATCCAGGAAAACGCCAACGCCCTGGCTCGCTACGCCAGCATCTGTCAGCAGA
ATGGACTGGTACC

TATTGTTGAACCAGAGGTAATTCCTGATGGAGACCATGACCTGGAACACTGCCAGTA
TGTTACTGAGAAG

GTCCTGGCTGCTGTCTACAAGGCCCTGAATGACCATCATGTTTACCTGGAGGGCACC
CTGCTAAAGCCCA

ACATGGTGACTGCTGGACATGCCTGCACCAAGAAGTATACTCCAGAACAAGTAGCT
ATGGCCACCGTAAC

AGCTCTCCACCGTACTGTTCCTGCAGCTGTTCCTGGCATCTGCTTTTTGTCTGGTGGC
ATGAGTGAAGAG

GATGCCACTCTCAACCTCAATGCTATCAACCTTTGCCCTCTACCAAAGCCCTGGAAA
CTAAGTTTCTCTT

ATGGACGGGCCCTGCAGGCCAGTGCACTGGCTGCCTGGGGTGGCAAGGCTGCAAAC
AAGGAGGCAACCCA

GGAGGCTTTTATGAAGCGGGCCATGGCTAACTGCCAGGCGGCCAAAGGACAGTATG
TTCACACGGGTTCT

TCTGGGGCTGCTTCCACCCAGTCGCTCTTCACAGCCTGCTATACCTACTAGGGTCCA
ATGCCCGCCAGCC

TAGCTCCAGTGCTTCTAGTAGGAGGGCTGAAAGGGAGCAACTTTTCCTCCAATCCTG
GAAATTCGACACA

ATTAGATTTGAACTGCTGGAAATACAACACATGTTAAATCTTAAGTACAAGGGGGA
AAAAATAAATCAGT

TATTGAAACATAAAAATGAATACCAAGGACCTGATCAAATTTCACACAGCAGTTTCC
TTGCAACACTTTC

AGCTCCCCATGCTCCAGAATACCCACCCAAGAAAATAATAGGCTTTAAAACAATATC
GGCTCCTCATCCA
AAGAACAACTGCTGATTGAAACACCTCATTAGCTGAGTGTAGAGAAGTGCATCTTAT
GAAACAGTCTTAG

CAGTGGTAGGTTGGGAAGGAGATAGCTGCAACCAAAAAAGAAATAAATATTCTATA
AACCTTCAGCTGCT

ATCGGGTTTCACTTTTCTGCTCTTGCTGTCCAAAGACTCAGTGTATTTCATTACTTTTG
ACTCTACTAGA

CATGACTGGGTTTCAACAGTAAAGGTCTTCAACTCTTGCTAGTCATTGGAATCAAGC
CGCAAAATTTTAA

AAACTGAGATGCTCAGGCCACACCCCAGCTCAATTAAATCAGAAACCCTAGACTTG
GGATCCTCTAACTA

TTAGATTTCTTAAAGCTCCCTCAGTAATTCCAATGTACAGTCAAGTTTGAGAACTAC
CAATCTAAATTTC

AAGTTTGAGGGTATTTGAAAATTAAAGCCATTCACAATACGAAGCCAGCTAAAAAT
GTAGAATGATTTTG

AGCAACTTGTGGAGTATAATAAGAGAATTAATGTGACTTCAATGCTTGGAGCATTCT
TGTTCAAGTGGCC

CAGGTTTGGTGAAACAGGACTACCTTGTCATCTGCACGTCCAGGCATATTTCGTAGT
TTTGCAGTAAATA

ATATTCACATAATGATACTGTATTGACTTTCAATTTTCAGAATTAACCTATAGTTACA
GCACTTAAGACA

ACCAGAGTTATAAAAGAGAATTTAAATATTATAACTTTGGACAATATAAAAGTGAT
GATTTAACTGACAG

AAGCTAGGAAATATAAGGGGGAGGAGAAGTGGAAGAAAGCAAAGAGAGTCAGGAA
TACTACTTAAAACTG

ATGGGTTAAGAAATAGTGCTTTAATTCTATTTAAGTAATAAAAGAAATGGATGTAAA
TCATAAAAATATA

TATCTAAAATTAAAATATTGATGGTAGTATGCTAAATTTC

2420 bp : 44.46% C+G

Amino Acid Sequence


MAHRFPALTQEQKKELSEIAQSIVANGKGILAADESVGTMGNRLQRIKVENTEENRRQF
REILFSVDSSINQSIGGVILFHETLYQKDSQGKLFRNILKEKGIVVGIKLDQGGAPLAGTK
ETTIQGLDGLSERCAQYKKDGVDFGKWRAVLRIADQCPSSLAIQENANALARYASICQQ
NGLVPIVEPEVIPDGDHDLEHCQYVTEKVLAAVYKALNDHHVYLEGTLLKPNMVTAGH
ACTKKYTPEQVAMATVTALHRTVPAAVPGICFLSGGMSEEDATLNLNAINLCPLPKPW
KLSFSYGRALQASALAAWGGKAANKEATQEAFMKRAMANCQAAKGQYVHTGSSGAA
STQSLFTACYTY

Secondary Structure

Alpha helix (Hh) : 153 is 42.03%

Extended strand (Ee) : 52 is 14.29%

Random coil (Cc) : 159 is 43.68%

8. Macaca mulatta gene for MHC class I antigen (Mamu-G gene), isolate K721, allele Mamu-
G*04_nov

>LR990779.1 Macaca mulatta gene for MHC class I antigen (Mamu-G gene), isolate K721,
allele Mamu-G*04_nov

ATGGTGGTCATGGCGCCCCGAACCCTCCTGCTGCTGCTCTCGGGGGCTCTGGCCCTG
ACAGAGACCTGGG

CGAGTGAGTGCGGGATCGGGAGATGGCCTCTGGGGGGAGGGCCAGGGGCCCGCCCG
ACGGTGGTGCAGGA

CCCGGGGAGCCGCGCGGGGAGGAGGGTCGGGCGGGTCTCAGCCCCTCCTCGCCCCC
AGGCTCCCACTCCA

TGAGGTACTTCAGCGCTGCCGTGTCCCGGTCAGGCCGCGGGAAGCCCCGCTTCATCG
CGGTGGGCCACGT
GGACGACACGCAGTTCGTGCTGCTCGATAGCGACGCTGCCAGTCCGAGGATGGAGC
CGCGGGCGCCGTGG

GTGGAGCAGGATGGGCCGGAGTATTGGGAAGAGGAGACACGGATCGCCAAGGCCC
ACGCACACACTGACA

GAGTGAACCTGCGGACCCTGCGCAGCTACTACAACCAGAGCCAGGCCGGTGAGTGA
CCCCGGCCTGGGGC

GCAGATCACGACCCCCCACCTCCATGCCCTGCGGACGACCGGGGTACCCCCGAGTCT
CCAGGTCTGAGAT

CCACCCCGAGGCCGCGGGACTCGCCCAGACCCTCTACCTGGGAGAAGCCCACGCGC
CTTTACCAAAATCC

CTGCGGGTTGGTCCGGGAGGGGGCGAGGTTCGGTGGGCGGGGCTGACCGAGGGGGC
GGGGCCAGGGTCTC

ACACCCTCCAGTGGATGATTGGTTGCGACCTGGGGCCCGACGGGCGACTCCTCTTCC
GGTGTGAACAGTT

CGCCTACGATGGCAAGGATTACCTCGCCCTGAACGAGGACCTGTGCTCCTGAACCGC
AGCGGACCCTGTG

GCTCAGATCTCCAAGCGCAAGTGTGAGGCGGCCAAAGCGACTGAACGAAGGAGAGC
CTACCTGGAGGGCA

CGTGAGTGGAGTGGCTCCACAGATACCTGGAGAATGGGAAGGAGATGCTGCATCGT
GCGGGTACCAGGGG

CCATGGGGCACCTCCCCGATCTCCTGTAGACCTCCCAGGCTGGCCTAGCACAAGGAG
AGGAGGAAAATGG

GACCAACACTAGAATACGCCCTCCCTCTGGTCCTGAGGGAGAGGAATCCTCCTGGGT
TTCCAGATCCTGT

ACCAGAGAGTGACTCTGAGGGCCCGCCCTGCTCTCTGGGACAATTAAGGGATGAAG
TCTCTGAGGGAGTG

GAGGGGAAGACAATCCCTGGAATACTGATCAGGAGTTCCCTTTGACCCCACAGCAG
TCTTAGGCACCAGG

ACTTTTCCCCTCAGGCCTTGCTCTCTGCCTCACACTAAATGTGTGTGGGAGTCTGACT
CCAGCTCCTCTG
AGCCCTTTGGCCTTCACTCAGGTCAGAATCGGAAGTCCCTGCTCCCCCGCTCTGAGA
TTAGAACTTTCCA

AGTATTAGGAGATTATCCCAGGTGCCCGTGTCCAGGCTGGTGTCTGAGTTACGTGCT
CCCTCCCCCCACC

CCATCCCGCCAGGTATCTGGTTCATTCTTAGGACGGTCACATACTGGTGCTGCTGGA
GTGTCCCACGAGA

GATGCAAAGTGCCTGAGTTTTTGGACTTTTCCTTTCAGAACCCCCCAATTCACACGTG
ACCCACCAACCT

GTCTTTGACTATGATGCCACCCTGAGGTGCTGAGCCCTGGGCTTCTACCCTGTGGAG
ATCAGACTGACCT

GGCAGTGGGAGAGTGAGGACCAGACCCACGACGTGGAGCTCGTGGGGACCAGGCCT
GCAGGGGATGGAAC

CTTCCAGAAGTGGGCAGCTGTGGTGGTGCCTTCTGGAGAGGAGCAGAGATTCACGT
GCTATGTGCAGCAC

CAGGGGCTGCCTGAGCCCCTCATGCTGAGATGGAGTAAGGAGGGAGATGGAGGTGT
CATGTCTCTTAGGG

AAAGCAGGAGCCCCTCTGGAGACCTTTAACAGGGTCGGTGGTGGGGCCTGGGGTCA
GAGACCCTCACCTT

CCCCTCCTTTCCCAGAGTAGTCTTTCCAGCCCACTATCCTCATCATGGGCATCATTGC
TGGACTGGTTGT

CTTTACAGCTGTTGTCACTGAAGCTGTGGTCACTTTTGTGCTGTGGAGGAAGAAGAG
CTCTGAGTTTTTT

TGTCCCACTGAGGGTTCCAAGCCCCAGGTAGAAATGCCCTGCCTGGTTACTGGGAAG
CACCATCTACACT

CATGGGCCGACCCAGCCTACGCCCTGTGTGCCAGCACTTACTCTTTTGTAAAGCACA
TGTGACAAAGAAG

CACAGATTTATCACCTTGATGATTGTAGTGATGGGGACCTGATCCCAGTAATCACAG
GTCAGGGGAAGGT

CCCTGGCTAAGGACAGACCTTAGGAAGGCAGTTAGTCCAGGACCCACATCTGCTTTC
CTTGTTTTCCCTT
ATCCTGCCCTGGGTCTGCAGTCACACATTTCTGGAAACTTCTCTGGGGTCCAAGACT
AGGAGTTTCCTCT

AGGGTCTCATGGTGCTGCCACCTTTCTGACCTCTCAAAGGACATTTTCTTCTCACAGA
TAGAAAAGGAGG

GAGCTACTCTCAGGCTGCAAGTAAGTATGAAGGAGGCTGATCCCTGAGATCCTTGG
GATCTTGTGGTTGG

GAGCCCATGGGGGAGCTCACTCCGGCAATATTTCCTCCTCTGGCCATATTTCCTGTG
GGCTCTGACCAGG

TCCTGTTTTTGTTGTACCCCAGGCAGTGACAGCATCCAGGGCTCTGATGTCTCTCATG
GCTTGTAAATGT

GAGACCCTGGGGGCCTGATGTGTGTGGGTTTTTGGGGGGAACAGTGGACACAGCTG
TGCCATGAGGTTTC

TTTGACTTGGATGTATTGAGCATGTGCTGGGCTGTTTGAAGTGTCACCCCTCACTGTA
ACAGATATGAAT

TTGTTCATGAATGTTTTTCTGCAGTCTGA

2899 bp : 58.61% C+G

Amino Acid Sequence

MAPRTLLLLLSGALALTETWASSHSMRYFSAAVSRSGRGKPRFIAVGHVDDTQFVLLDS
DAASPRMEPRAPWVEQDGPEYWEEETRIAKAHAHTDRVNLRTLRSYYNQSQAADPVA
QISKRKCEAAKATERRRAYLEGTLTWQWESEDQTHDVELVGTRPAGDGTFQKWAAVV
VPSGEEQRFTCYVQHQGLPEPLMLRWTVVTEAVVTFVLWRKKSSEFFCPTEGSKPQIEK
EGATLRLQ

Secondary Sequence

Alpha helix (Hh) : 93 is 38.75%

Extended strand (Ee) : 51 is 21.25%

Random coil (Cc) : 96 is 40.00%

9. Trichonephila clavipes isolate Nep-004 scaffold_48759, whole genome shotgun sequence


>MWRG01095280.1 UNVERIFIED_CONTAM: Trichonephila clavipes isolate Nep-004
scaffold_48759, whole genome shotgun sequence

GTTAATAGGGAAATCTGTTTATGAAAAAGTTAGCAGTTATTATTTCGAGTATGCTTC
TGTCAACTGCCGC

GACTGCTGCGGATAGTTACCAATCAATTAGCCATTTAGGATACAAGGACACCGATG
GCAACGACACTGTA

AGTGTTGATTCAACTTATTACTTCGCACCAAAGAAAACGATGGGTCCATACGACCAG
TTTGAATACATCA

ACAGAACAACTAACGTGTTTGGTTCGTACGCAGATGATGACTTTGGTGATGTGACTA
ACATCGGCGGTGA

GTACTTCGTACAAGATTTTGTAATCGGTGCGGGTTACAGCAATTACGATTACGGTTC
AGACACTGATTTA

TTCAATGTGTCTGCGGGTTACTTCTTTAACCCTAACCTTCTTCTAAAAGCGACTTTCA
CTGACGTTGAAG

ATGGTGATAACTATGTAATGTTCGACCTTAAGTACAATCATCAAATTAACAGCACTG
ATTACTTAGGTTT

CACGTTCACTGCAGACGACGAGTTTGATTATCGTGCGGTTAGCGCTAAATACTTTAT
GGATCTTCAACAA

GGTAATTACCTAACGATTGAAGGTACAATTGCTGATACTGATGATAGTGGTAGCTCT
TGGGAACTAGGTT

CTAACTACTACTTCTCTAAAGCAACATCAGTATTTGTAACGTTTAATAAAGAAGATG
ACTACAGCTTTGG

TGCTCAGCACTTCTTTAATAAAAACGTTGGCTTAAAAGCTGGTTATGCAAACAACTG
GGATGACTCAGAC

TACGATGCATACTTTGCAAACCTAAGCTTACAGTTTTAATCATTAATTTGATGATATA
AAAAAGCCCGCA

TTATGCGGGCTTTTTCGTTTTTATCATTACCAACCGCACTGGCGGCCTTCATTTTGCT
CATCAAGGTATT

TTTGTAACGGTGCAAAGTAATCTAAGATTGCTGTTGCATCCATTTCTTCTTTACCAGT
GATTGTCGCAAG
TGCTTCTTGCCATGGGCGGCTTGAGCCCATTTCTAACATAGCATTTAGTTTTTCACCA
GCCTCAGCAGAG

TTGTATACAGAACAACGGTGAATCGCTTCTTCGTTACCGGCAATTTCACATAAACTT
CTGTGGAAGTCGA

ATTGTAAAATGTGCGCTAAGAAATAACGTGTGTAAGGCGTGTTGCCTGGTACGTGAT
ACTTAGCACCTGG

ATCGAAGTCTGCTTCGCTACGTGCGATAGGCGCTTGTACACCTTGGTATTTTTCACGT
AATTCCCACCAC

GCTTTGTTGTAGTTCTCTGGCGTTACTTCACCTGAGAAAACTTTCCAGCGCCATTGGT
CTACTAATAAAC

CAAATGGGATAAATGCTACTTTGTCTAATGCCATTTTCATTAACAGGCCGATATCTTT
AGATTCATCAGG

CACGTCGTCTAATAAACCAATTTCTTTTAAATAACCTGGCGTTACAGAAAGTGCAAT
GGTGTCACCAATC

GCTTCGTGGAAACCATCGTTCGCACTTTCTTGATAATAAATAGGCTGTGTGTTGTAG
GCGCGTTGGTAGA

AGTTATGCCCTAGTTCATGGTGAATAACAGAGAATTCTTCACCTGTGCGTTGAATAC
ACATCTTAATACG

TAGGTCGTCTTTGCTATCAATATTCCATGCAGATGCATGACATTGTACGTCACGGTCT
TGCGGTTTAGTG

AATAGCGAGCGCTCATAGAAGGTATCAGGCAGTGGTGCAAAGCCCATTGAGGTGAA
GAACTTCTCAGCAC

CACGTACCATTTTAAGTTCGTCGTAATCGTGTTCAGCTAATAGTTCTGTTACATCATA
ACCAGGATCGGC

ATTTTCTGGTGCAACAACGTCGTAAATGTTACCCCATGTTTGGGCCCACATATTACCT
AGAAGGTGAGCT

GGGATAGGTTGATCTTGCGGGACTTTATCTTCGCCATATTTTTCACCTAGCTTGGCAC
GAACATGACAAT

GTAATGAATCATAAAGCGGTTTAACTTGGCCCCAAATGCGGTCGAGTTCTTTTGCAA
AATCATCGGCTGG
CATGTCGTATTTACTACGCCACATAGCACCTGTATCTGCATAACCTAGCTCTTTTGCA
CCTTCATTGGTG

AGTGCAACTTGTTGCTCATAAAGCGGGCGCATAGGTTTTGAAACTTGTCGCCAGCCT
TGCCATAAATCAA

GCAATTCATTGTAATCACGGCTCGTTGCCATTTTAGCAGTCATCTCGCCAAGGCTTA
AACAGCTACCATC

TTCTTTACAGTATTTACCTTTACCATAAATACCCCCTAGCTCAGCCACTAATTGTGAG
AGCTTAGCTGTT

TTTTCAGCATCTTGTGGAGCAGGTAAGGTAAGTGCGAGTTTAAGTTTATCGAGTTTA
CGACGTGCATCGT

AATCAAGCTCTAAACTATCGAACTTTGCGGCTTCATTAGCAAGACGAACCACAGCTT
CTGTCATTTTGCG

ATTTACTTCAGCAGATAGTTCCGCAGTATCATGGGTGATGAAGTTGGCATAAATCCA
CTCAGCACGACTT

GCTTCTAAATAAAGCGCGCTTAATTCTTTTTCAGTATCGGCGATAAATTTAGCGGCA
TCTTGTGCGGTTA

CTTGGCTTGTTTTGACTTCAGCAGTCGTTGTTTTAGTGCTCGCAGTGTCATCATTACA
GCCTGTAAGTGC

TAGTGCACTGGCGACCATAAGCGCGCTGATGCTGAGCTTAAATGGAGTCTTTTTCAT
AACGTTCCCTAGA

AGTTTATTGTTATTTATTGCCCATGCATAATAGCAGTGTCTTAGTAAACAACAAACT
GCCTAGTTGAAAA

CAAGCTAAGTAAGCGCTGCTT

2821 bp : 41.72% C+G

Amino Acid Sequence

MLLSTAATAADSYQSISHLGYKDTDGNDTVSVDSTYYFAPKKTMGPYDQFEYINRTTN
VFGSYADDDFGDVTNIGGEYFVQDFVIGAGYSNYDYGSDTDLFNVSAGYFFNPNLLLK
ATFTDVEDGDNYVMFDLKYNHQINSTDYLGFTFTADDEFDYRAVSAKYFMDLQQGNY
LTIEGTIADTDDSGSSWELGSNYYFSKATSVFVTFNKEDDYSFGAQHFFNKNVGLKAGY
ANNWDDSD YDAYFANLSLQF
Secondary Structure

Alpha helix (Hh) : 33 is 13.10%

Extended strand (Ee) : 72 is 28.57%

Random coil (Cc) : 147 is 58.33%

10. Obainia sp. SVM-2017 18S ribosomal RNA gene, partial sequence

>KU561101.1 Obainia sp. SVM-2017 18S ribosomal RNA gene, partial sequence

CTATAATTTACTTGATCTTGATATCCTACGTGGAATAACTGTGGTAATTCTAGAGCTA
ATACATGCACCA

AAGCTCCGACTTTCGAAAGAGCGCATCTATTAGATTAAAACCAATCAGGTTCCGGCC
TGTAAATTGGTGA

CTCTGAATAGCTAAGCTAATCGTATGGTCTTGCACCGACGATGTATCTATCAAGTAT
CTGCCTTATCAAC

TTTCGATGGTAGTTTATGTGCCTACCATGGTTGTAACGGGTAACGGAGAATAAGGGT
TCGACTCCGGAGA

GGGAGCCTGAGAAACGGCTACCACATCCAAGGAAGGCAGCAGGCGCGCAAATTACC
CACTCTCAGCAAGA

GGAGGTAGTGACGAAAAATAACGAGACCGTTCTCTTTGAGGCCGGTTATCGGAATG
GGTACAATTTAAAT

CCGTTAACGAGGATCTATGAGAGGGCAAGTCTGGTGCCAGCAGCCGCGGTAATTCC
AGCTCTCAAAGTGT

ATATCGTCACTGCTGCGGTTAAAAAGCTCGTAGTTGGATCTGCGCATCAGGACCCGG
TCCGCCCACTGGG

TGTGAACTGGGTTCCTGAGCTTGTACTGCTGGTTTTCCCTACGTTGCCTTCATCGGTC
GCGTAGGGTGGC

TAGCGAGTTTACTTTGAAAAAATTAGAGTGCTTCACGCGGGCTATTGTCTGAATACT
CGTGCATGGAATA
ATAGAATAGGACCTCGGTTCTATTTTGTTGGTTTTCTGATCTGAGGTAATGGTTAAGA
GGGACGGACGGG

GGCATTCGTATCGCTGCGTGAGAGGTGAAATTCTTGGACCGTAGCGAGACGTCCGAC
TGCGAAAGCATTT

GCCAAGAATGTCTTCATTAATCAAGAACGAAAGTCAGAGGTTCGAAGGCGATCAGA
TACCGCCCTAGTTC

TGACCGTAAACGATACCAACTAGCGTTCCGTCGGCGGTAAATACGCCTTGGCGGGC
AGCTTCCCGGAAAC

GAAAGTTTTTCGGTTCCGGGGGAAGTATGGTTGCAAAGCTGAAACTTAAAGAAATT
GACGGAAGGGCACC

ACCAGGAGTGGAGCCTGCGGCTTAATTTGACTCAACACGGGAAAACTCACCTGGCC
CGGACACCGTGAGG

ATTGACAGATTGAGAGCTCTTTCTTGATTCGGTGGTTGGTGGTGCATGGCCGTTCTTA
GTTGGTGGAGTG

ATTTGTCTGGTTTATTCCGATAACGAGCGAGACTCTAGCCTACTAAATAGTCACTGG
ATAAAAAAGTCCA

GACGACTTCTTAGAGGGACAAGCGGTGTTCAGCCGCACGAAGTTGAGCAATAACAG
GTCTGTGATGCCCT

TAGATGTCCAGGGCTGCACGCGCGCTACACTGGAGGAATCAGCGTGCTGTAACCATT
GCCGAAAGGCATT

GGTAACCCCTTGAAAATCCTCCGTGATCGGGATCGGGAATTGCAATTATTTCCCTTG
AACGAGGAATTCC

TAGTAAGTGTGAGTCATCAGCTCACGTTGATTACGTCCCTGCCCTTTGTACACACCG
CCCGTCGCTGCCC

GGGACTGAGCCGTTTCGAGAAAAGCGGGGACTGCTGTTTCGATACCTTTCGGGGTGG
AGATTCTTTGGTG

GAAACCGCCTTAATCGCAGTGGCTTGAACCGGGCAAAAGTCGTAACAAGGTTTCC

1665 bp : 49.19% C+G

Amino Acid Sequence

MHHQPPNQERALNLSILTVSGPGSLITVMNH
Secondary Structure

Alpha helix (Hh) : 6 is 19.35%

Extended strand (Ee) : 9 is 29.03%

Random coil (Cc) : 16 is 51.61%

PHYLOGENETIC ANALYSIS

S.tuberosum mitochondrial DNA for ribosomal protein S10


53

Trichonephila clavipes isolate Nep-004 scaffold 48759 whole genome shotgun sequence

UNVERIFIED: Oryza sativa isolate Qitougu cultivar Qitougu long and barbed awn 1 (LABA1) gene partial sequence

Staphylococcus aureus S33 R gene for 16S rRNA partial sequence

Obainia sp. SVM-2017 18S ribosomal RNA gene partial sequence

Homo sapiens aldolase fructose-bisphosphate B (ALDOB) mRNA

Escherichia coli strain 126 NODE 128 151 length 2297 cov 48.9748/1-2297 whole genome shotgun sequence

38
Macaca mulatta gene for MHC class I antigen (Mamu-G gene) isolate K721 allele Mamu-G*04 nov

Abelmoschus esculentus chalcone synthase (CHS) mRNA complete cds

22
Klebsiella pneumoniae IRQBAS103 gene for 16S rRNA partial sequence

0.50

You might also like