| United States Patent Application | 20090133160 |
| Kind Code | A1 |
| Scott; Richard William ;   et al. | May 21, 2009 |
The present invention relates to constructs including one or more nucleic acids encoding two or more oleosin repeat units, and methods of use thereof. The present invention also relates to recombinant polypeptides including two or more oleosin repeat units, and methods of use thereof.
| Inventors: | Scott; Richard William; (Palmerston North, NZ) ; Arcus; Vickery Laurence; (Hamilton, NZ) ; Roberts; Nicholas John; (Feilding, NZ) |
| Correspondence Address: |
Larson & Anderson, LLC
P.O. BOX 4928
DILLON
CO
80435
US
|
| Assignee: |
AGRICULTURE VICTORIA SERVICES PTY LTD Attwood, Victoria AU AGRESEARCH LIMITED Hamilton NZ |
| Serial No.: | 090758 |
| Series Code: | 12 |
| Filed: | October 16, 2006 |
| PCT Filed: | October 16, 2006 |
| PCT NO: | PCT/AU2006/001528 |
| 371 Date: | April 18, 2008 |
| Current U.S. Class: | 800/281; 435/243; 435/410; 435/69.1; 530/370; 536/23.6 |
| Class at Publication: | 800/281; 536/23.6; 435/410; 435/243; 435/69.1; 530/370 |
| International Class: | A01H 1/00 20060101 A01H001/00; C07H 21/00 20060101 C07H021/00; C12N 5/10 20060101 C12N005/10; C12P 21/00 20060101 C12P021/00; C07K 14/415 20060101 C07K014/415 |
| Date | Code | Application Number |
|---|---|---|
| Oct 19, 2005 | AU | 2005905787 |
| Nov 16, 2005 | AU | 2005906364 |
Sequence CWU
1
8314DNAArtificial SequencePrimer sequence 1cacc
4215DNAArtificial SequencePrimer
sequence 2tcactcgagg agctc
15312DNAArtificial SequencePrimer sequence 3tctagaggta ct
12412DNAArtificial
SequencePrimer sequence 4actagtagta cc
12512DNAArtificial SequencePrimer sequence
5cccgggggta ct
12612DNAArtificial SequencePrimer sequence 6ctgcagagta cc
12712DNAArtificial SequencePrimer
sequence 7aagcttggta ct
12812DNAArtificial SequencePrimer sequence 8atcgatagta cc
12915DNAArtificial
SequencePrimer sequence 9gtcgacggta cttct
15106DNAArtificial SequencePrimer sequence 10ctcgag
61122DNAArtificial SequencePrimer sequence 11tcagatctcg gtgacgggca gg
221222DNAArtificial
SequencePrimer sequence 12atgagccaag aacgacgccc gg
221322DNAArtificial SequencePrimer sequence
13caccatggca caacctcaag tt
221411DNAArtificial SequencePrimer sequence 14gctccaccgc g
111514DNAArtificial
SequencePrimer sequence 15tccactagtt ctag
141615DNAArtificial SequencePrimer sequence
16ctagaactag tggat
151713DNAArtificial SequencePrimer sequence 17tcctgcagag tac
131813DNAArtificial
SequencePrimer sequence 18gtactctgca gga
131912DNAArtificial SequencePrimer sequence
19atacgatagt ac
122015DNAArtificial SequencePrimer sequence 20ctatcgatac cgtcg
152124DNAArtificial
SequencePrimer sequence 21ctcgagtgtt gatctcttag cttc
242210PRTArtificial SequenceLinker sequence 22Gly
Gly Gly Gly Ser Gly Gly Gly Gly Ser1 5
102324DNAArtificial SequencePrimer sequence 23gacacgctcg aggaattcgg tacc
242424DNAArtificial
SequencePrimer sequence 24gatggtgatg actgcagatc agaa
242530DNAArtificial SequencePrimer sequence
25cgattaatct gagtttttct gatctgcagt
302623DNAArtificial SequencePrimer sequence 26cgatcaccgt tccggccaat gtc
232723DNAArtificial
SequencePrimer sequence 27gacacgtgac ttctcgtctc ctt
232824DNAArtificial SequencePrimer sequence
28gatggtgatg actgcagatc agaa
2429628DNATrifolium repens 29caccatggca caacctcaag ttcaagtcca ctcaacaaca
acacaccgtc aagaaactgc 60tacctaccca tcaacccaaa acattcgtaa agatgtttac
gaaaatgtta actatcccgg 120ccaacgcggt cgttataacg accgctataa tgatagtggt
cgttatgatg gtggtattgc 180ctcctttttg tcagagagaa gtcctccagc ctctcaaatc
ctcgctaccg ttggaggatt 240tttcataggt ggtactctat ttttattagc tagcatttca
tttatcgcca gtcttattgg 300attggcgata atgacaccac tttttatcct ttttagcccg
gttttagtcc ctgctgccct 360cactataggg ctagcagtgg ctggaatatt gacagcagat
gcttgcgggt tgacggggct 420tatgtcgttg tcgtggaccg tgaaatatgt tagggattta
caagcagtag tgcccgaaca 480aatggattcg atgaagggac gtgtcgcgga tgtcgcgagt
tatgttggac aaaagactaa 540ggatgttgga caaaaaacta aagaggttgg acaagacata
caaacaaaag cacatgaagc 600taagagatca acagtcgacc tcgagtga
62830633DNATrifolium repens 30tctagaggta
ctatggcaca acctcaagtt caagtccact caacaacaac acaccgtcaa 60gaaactgcta
cctacccatc aacccaaaac attcgtaaag atgtttacga aaatgttaac 120tatcccggcc
aacgcggtcg ttataacgac cgctataatg atagtggtcg ttatgatggt 180ggtattgcct
cctttttgtc agagagaagt cctccagcct ctcaaatcct cgctaccgtt 240ggaggatttt
tcataggtgg tactctattt ttattagcta gcatttcatt tatcgccagt 300cttattggat
tggcgataat gacaccactt tttatccttt ttagcccggt tttagtccct 360gctgccctca
ctatagggct agcagtggct ggaatattga cagcagatgc ttgcgggttg 420acggggctta
tgtcgttgtc gtggaccgtg aaatatgtta gggatttaca agcagtagtg 480cccgaacaaa
tggattcgat gaagggacgt gtcgcggatg tcgcgagtta tgttggacaa 540aagactaagg
atgttggaca aaaaactaaa gaggttggac aagacataca aacaaaagca 600catgaagcta
agagatcaac aggtactact agt
63331613DNATrifolium repens 31cccggggtac tatggcacaa cctcaagttc aagtccactc
aacaacaaca caccgtcaag 60aaactgctac ctacccatca acccaaaaca ttcgtaaaga
tgtttacgaa aatgttaact 120atcccggcca acgcggtcgt tataacgacc gctataatga
tagtggtcgt tatgatggtg 180gtattgcctc ctttttgtca gagagaagtc ctccagcctc
tcaaatcctc gctaccgttg 240gaggattttt cataggtggt actctatttt tattagctag
catttcattt atcgccagtc 300ttattggatt ggcgataatg acaccacttt ttatcctttt
tagcccggtt ttagtccctg 360ctgccctcac tatagggcta gcagtggctg gaatattgac
agcagatgct tgcgggttga 420cggggcttat gtcgttgtcg tggaccgtga aatatgttag
ggatttacaa gcagtagtgc 480ccgaacaaat ggattcgatg aagggacgtg tcgcggatgt
cgcgagttat gttggacaaa 540agactaagga tgttggacaa aaaactaaag aggttggaca
agacatacaa acaaaagcac 600aggtactctg cag
61332633DNATrifolium repens 32aagcttggta
ctatggcaca acctcaagtt caagtccact caacaacaac acaccgtcaa 60gaaactgcta
cctacccatc aacccaaaac attcgtaaag atgtttacga aaatgttaac 120tatcccggcc
aacgcggtcg ttataacgac cgctataatg atagtggtcg ttatgatggt 180ggtattgcct
cctttttgtc agagagaagt cctccagcct ctcaaatcct cgctaccgtt 240ggaggatttt
tcataggtgg tactctattt ttattagcta gcatttcatt tatcgccagt 300cttattggat
tggcgataat gacaccactt tttatccttt ttagcccggt tttagtccct 360gctgccctca
ctatagggct agcagtggct ggaatattga cagcagatgc ttgcgggttg 420acggggctta
tgtcgttgtc gtggaccgtg aaatatgtta gggatttaca agcagtagtg 480cccgaacaaa
tggattcgat gaagggacgt gtcgcggatg tcgcgagtta tgttggacaa 540aagactaagg
atgttggaca aaaaactaaa gaggttggac aagacataca aacaaaagca 600catgaagcta
agagatcaac aggtactatc gat
63333630DNATrifolium repens 33gtcgacggta cttctatggc acaacctcaa gttcaagtcc
actcaacaac aacacaccgt 60caagaaactg ctacctaccc atcaacccaa aacattcgta
aagatgttta cgaaaatgtt 120aactatcccg gccaacgcgg tcgttataac gaccgctata
atgatagtgg tcgttatgat 180ggtggtattg cctccttttt gtcagagaga agtcctccag
cctctcaaat cctcgctacc 240gttggaggat ttttcatagg tggtactcta tttttattag
ctagcatttc atttatcgcc 300agtcttattg gattggcgat aatgacacca ctttttatcc
tttttagccc ggttttagtc 360cctgctgccc tcactatagg gctagcagtg gctggaatat
tgacagcaga tgcttgcggg 420ttgacggggc ttatgtcgtt gtcgtggacc gtgaaatatg
ttagggattt acaagcagta 480gtgcccgaac aaatggattc gatgaaggga cgtgtcgcgg
atgtcgcgag ttatgttgga 540caaaagacta aggatgttgg acaaaaaact aaagaggttg
gacaagacat acaaacaaaa 600gcacatgaag ctaagagatc aacactcgag
63034709DNATrifolium repens 34caccatggca
caacctcaag ttcaagtcca ctcaacaaca acacaccgtc aagaaactgc 60tacctaccca
tcaacccaaa acattcgtaa agatgtttac gaaaatgtta actatcccgg 120ccaacgcggt
cgttataacg accgctataa tgatagtggt cgttatgatg gtggtattgc 180ctcctttttg
tcagagagaa gtcctccagc ctctcaaatc ctcgctaccg ttggaggatt 240tttcataggt
ggtactctat ttttattagc tagcatttca tttatcgcca gtcttattgg 300attggcgata
atgacaccac tttttatcct ttttagcccg gttttagtcc ctgctgccct 360cactataggg
ctagcagtgg ctggaatatt gacagcagat gcttgcgggt tgacggggct 420tatgtcgttg
tcgtggaccg tgaaatatgt tagggattta caagcagtag tgcccgaaca 480aatggattcg
atgaagggac gtgtcgcgga tgtcgcgagt tatgttggac aaaagactaa 540ggatgttgga
caaaaaacta aagaggttgg acaagacata caaacaaaag cacatgaagc 600taagagatca
acagagctcc accgcggtgg cggccgctct agaactagtg gatcccccgg 660gctgcaggaa
ttcgatatca agcttatcga taccgtcgac ctcgagtga
709351336DNATrifolium repens 35caccatggca caacctcaag ttcaagtcca
ctcaacaaca acacaccgtc aagaaactgc 60tacctaccca tcaacccaaa acattcgtaa
agatgtttac gaaaatgtta actatcccgg 120ccaacgcggt cgttataacg accgctataa
tgatagtggt cgttatgatg gtggtattgc 180ctcctttttg tcagagagaa gtcctccagc
ctctcaaatc ctcgctaccg ttggaggatt 240tttcataggt ggtactctat ttttattagc
tagcatttca tttatcgcca gtcttattgg 300attggcgata atgacaccac tttttatcct
ttttagcccg gttttagtcc ctgctgccct 360cactataggg ctagcagtgg ctggaatatt
gacagcagat gcttgcgggt tgacggggct 420tatgtcgttg tcgtggaccg tgaaatatgt
tagggattta caagcagtag tgcccgaaca 480aatggattcg atgaagggac gtgtcgcgga
tgtcgcgagt tatgttggac aaaagactaa 540ggatgttgga caaaaaacta aagaggttgg
acaagacata caaacaaaag cacatgaagc 600taagagatca acagagctcc accgcggtgg
cggccgctct agaggtacta tggcacaacc 660tcaagttcaa gtccactcaa caacaacaca
ccgtcaagaa actgctacct acccatcaac 720ccaaaacatt cgtaaagatg tttacgaaaa
tgttaactat cccggccaac gcggtcgtta 780taacgaccgc tataatgata gtggtcgtta
tgatggtggt attgcctcct ttttgtcaga 840gagaagtcct ccagcctctc aaatcctcgc
taccgttgga ggatttttca taggtggtac 900tctattttta ttagctagca tttcatttat
cgccagtctt attggattgg cgataatgac 960accacttttt atccttttta gcccggtttt
agtccctgct gccctcacta tagggctagc 1020agtggctgga atattgacag cagatgcttg
cgggttgacg gggcttatgt cgttgtcgtg 1080gaccgtgaaa tatgttaggg atttacaagc
agtagtgccc gaacaaatgg attcgatgaa 1140gggacgtgtc gcggatgtcg cgagttatgt
tggacaaaag actaaggatg ttggacaaaa 1200aactaaagag gttggacaag acatacaaac
aaaagcacat gaagctaaga gatcaacagg 1260tactactaga actagtggat cccccgggct
gcaggaattc gatatcaagc ttatcgatac 1320cgtcgacctc gagtga
1336361958DNATrifolium repens
36caccatggca caacctcaag ttcaagtcca ctcaacaaca acacaccgtc aagaaactgc
60tacctaccca tcaacccaaa acattcgtaa agatgtttac ggaaaatgtt aactatcccg
120gccaacgcgg tcgttataac gaccgctata atgatagtgg tcgttatgat ggtggtattg
180cctccttttt gtcagagaga agtcctccag cctctcaaat cctcgctacc gttggaggat
240ttttcatagg tggtactcta tttttattag ctagcatttc atttatcgcc agtcttattg
300gattggcgat aatgacacca ctttttatcc tttttagccc ggttttagtc cctgctgccc
360tcactatagg gctagcagtg gctggaatat tgacagcaga tgcttgcggg ttgacggggc
420ttatgtcgtt gtcgtggacc gtgaaatatg ttagggattt acaagcagta gtgcccgaac
480aaatggattc gatgaaggga cgtgtcgcgg atgtcgcgag ttatgttgga caaaagacta
540aggatgttgg acaaaaaact aaagaggttg gacaagacat acaaacaaaa gcacatgaag
600ctaagagatc aacagagctc caccgcggtg gcggccgctc tagaggtact atggcacaac
660ctcaagttca agtccactca acaacaacac accgtcaaga aactgctacc tacccatcaa
720cccaaaacat tcgtaaagat gtttacgaaa atgttaacta tcccggccaa cgcggtcgtt
780ataacgaccg ctataatgat agtggtcgtt atgatggtgg tattgcctcc tttttgtcag
840agagaagtcc tccagcctct caaatcctcg ctaccgttgg aggatttttc ataggtggta
900ctctattttt attagctagc atttcattta tcgccagtct tattggattg gcgataatga
960caccactttt tatccttttt agcccggttt tagtccctgc tgccctcact atagggctag
1020cagtggctgg aatattgaca gcagatgctt gcgggttgac ggggcttatg tcgttgtcgt
1080ggaccgtgaa atatgttagg gatttacaag cagtagtgcc cgaacaaatg gattcgatga
1140agggacgtgt cgcggatgtc gcgagttatg ttggacaaaa gactaaggat gttggacaaa
1200aaactaaaga ggttggacaa gacatacaaa caaaagcaca tgaagctaag agatcaacag
1260gtactactag aactagtgga tcccccgggg gtactatggc acaacctcaa gttcaagtcc
1320actcaacaac aacacaccgt caagaaactg ctacctaccc atcaacccaa aacattcgta
1380aagatgttta cgaaaatgtt aactatcccg gccaacgcgg tcgttataac gaccgctata
1440atgatagtgg tcgttatgat ggtggtattg cctccttttt gtcagagaga agtcctccag
1500cctctcaaat cctcgctacc gttggaggat ttttcatagg tggtactcta tttttattag
1560ctagcatttc atttatcgcc agtcttattg gattggcgat aatgacacca ctttttatcc
1620tttttagccc ggttttagtc cctgctgccc tcactatagg gctagcagtg gctggaatat
1680tgacagcaga tgcttgcggg ttgacggggc ttatgtcgtt gtcgtggacc gtgaaatatg
1740ttagggattt acaagcagta gtgcccgaac aaatggattc gatgaaggga cgtgtcgcgg
1800atgtcgcgag ttatgttgga caaaagacta aggatgttgg acaaaaaact aaagaggttg
1860gacaagacat acaaacaaaa gcacatgaag ctaagagatc aacaggtact ctgcaggaat
1920tcgatatcaa gcttatcgat accgtcgacc tcgagtga
195837500DNATrifolium repens 37caacaaayaa cwcaccgtca agaaactgct
acctacccat caacccawwa cattcgtaaa 60gatgtttacg aaaatgttaa ctatcccggc
caacgcggtc gttataacga ccgctataat 120gatagtggtc gttatgatgg tggtattgcc
tcctttttgt cagagagaag tcctccagcc 180accgttggag gatttttcat aggtggtact
ctatttttat tagctagcat ttcatttatc 240gccagtctta ttggattggc gataatgaca
ccacttttta tctcaaatcc tcgcttcctt 300tttagcccgg ttttagtccc tgctgccctc
actatagggc tagcagtggc tggaatattg 360acagcagatg cttgcgggtt gacggggctt
atgtcgttgt cgtggaccgt gaaatatgtt 420agggatttac aagcagtagt gcccgaacaa
atggattcga tgaagggacg tgtcgcggat 480gtcgcgagtt atgttggaca
50038662DNATrifolium repens
38gatatcaagc ttggtactaw tggcacaacc tcaagttcaa gtccactcaa caacaacaca
60ccgtcaagaa aactgctacc tacccatcaa cccaaaacat tcgtaaagat gtttacgaaa
120aatgtttaac tatcccggcc aacgcggtcg ttataacgac cgctataatg atagtggtcg
180tttatgatgg tggtattgcc tcctttttgt cagagagaag tcctccagcc tctcaaatcc
240tcgctaccgt tggaggattt ttcataggtg gtactctatt tttattagct agcatttcat
300ttatcgccag tcttattgga ttggcgataa tgacaccact ttttatcctt tttagcccgg
360twttagtccc tgctgccctc actatagggc tagcagtggc tggaatattg acagcagatg
420cttgcgggtt gacggggctt atgtcgttgt cgtggaccgt gaaatatgtt agggatttac
480aagcagtagt gcccgaacaa atggattcga tgaagggacg tgtcgcggat gtcgcgagtt
540atgttggaca aaagactaag gatgttggac aaaaaactaa agaggttgga caagacatac
600aaacaaaagc acatgaagct aagagatcaa caggtactat cgataccgtc gacctcgagt
660ga
66239794DNATrifolium repens 39atacaaacaa aagcacatga agctaagaga tcaacaggta
ctctgcagga attcgatatc 60aagcttggta ctatggcaca acctcaagtt caagtccact
caacaacaac acaccgtcaa 120gaaactgcta cctacccatc aacccaaaac attcgtaaag
atgtttacga aaatgttaac 180tatcccggcc aacgcggtcg ttataacgac cgctataatg
atagtggtcg ttatgatggt 240ggtattgcct cctttttgtc agagagaagt cctccagcct
ctcaaatcct cgctaccgtt 300ggaggatttt tcataggtgg tactctattt ttattagcta
gcatttcatt tatcgccagt 360cttattggat tggcgataat gacaccactt tttatccttt
ttagcccggt tttagtccct 420gctgccctca ctatagggct agcagtggct ggaatattga
cagcagatgc ttgcgggttg 480acggggctta tgtcgttgtc gtggaccgtg aaatatgtta
gggatttaca agcagtagtg 540cccgaacaaa tggattcgat gaagggacgt gtcgcggatg
tcgcgagtta tgttggacaa 600aagactaagg atgttggaca aaaaactaaa gaggttggac
aagacataca aacaaaagca 660catgaagcta agagatcaac aggtactatc gataccgtcg
acctcgagtg aaagggtggg 720cgcgccgacc cagctttctt gtacaaagtt ggcattataa
gaaagcattg cttatcaatt 780tgttgcaacg aaca
79440609DNATrifolium repens 40atggcacaac
ctcaagttca agtccactca acaacaacac accgtcaaga aactgctacc 60tacccatcaa
cccaaaacat tcgtaaagat gtttacgaaa atgttaacta tcccggccaa 120cgcggtcgtt
ataacgaccg ctataatgat agtggtcgtt atgatggtgg tattgcctcc 180tttttgtcag
agagaagtcc tccagcctct caaatcctcg ctaccgttgg aggatttttc 240ataggtggta
ctctattttt attagctagc atttcattta tcgccagtct tattggattg 300gcgataatga
caccactttt tatccttttt agcccggttt tagtccctgc tgccctcact 360atagggctag
cagtggctgg aatattgaca gcagatgctt gcgggttgac ggggcttatg 420tcgttgtcgt
ggaccgtgaa atatgttagg gatttacaag cagtagtgcc cgaacaaatg 480gattcgatga
agggacgtgt cgcggatgtc gcgagttatg ttggacaaaa gactaaggat 540gttggacaaa
aaactaaaga ggttggacaa gacatacaaa caaaagcaca tgaagctaag 600agatcaaca
60941609DNATrifolium repens 41atggcacaac ctcaagttca agtccactca acaacaacac
accgtcaaga aactgctacc 60tacccatcaa cccaaaacat tcgtaaagat gtttacgaaa
atgttaacta tcccggccaa 120cgcggtcgtt ataacgaccg ctataatgat agtggtcgtt
atgatggtgg tattgcctcc 180tttttgtcag agagaagtcc tccagcctct caaatcctcg
ctaccgttgg aggatttttc 240ataggtggta ctctattttt attagctagc atttcattta
tcgccagtct tattggattg 300gcgataatga caccactttt tatccttttt agcccggttt
tagtccctgc tgccctcact 360atagggctag cagtggctgg aatattgaca gcagatgctt
gcgggttgac ggggcttatg 420tcgttgtcgt ggaccgtgaa atatgttagg gatttacaag
cagtagtgcc cgaacaaatg 480gattcgatga agggacgtgt cgcggatgtc gcgagttatg
ttggacaaaa gactaaggat 540gttggacaaa aaactaaaga ggttggacaa gacatacaaa
caaaagcaca tgaagctaag 600agatcaaca
60942869DNATrifolium repens 42agatcaacag
gtactatcga taccgtcgac ctcgacggta cttctatggc acaacctcaa 60gttcaagtcc
actcaacaac aacacaccgt caagaaactg ctacctaccc atcaacccaa 120aacattcgta
aagatgttta cgaaaatgtt aactatcccg gccaacgcgg tcgttataac 180gaccgctata
atgatagtgg tcgttatgat ggtggtattg cctccttttt gtcagagaga 240agtcctccag
cctctcaaat cctcgctacc gttggaggat ttttcatagg tggtactcta 300tttttattag
ctagcatttc atttatcgcc agtcttattg gattggcgat aatgacacca 360ctttttatcc
tttttagccc ggttttagtc cctgctgccc tcactatagg gctagcagtg 420gctggaatat
tgacagcaga tgcttgcggg ttgacggggc ttatgtcgtt gtcgtggacc 480gtgaaatatg
ttagggattt acaagcagta gtgcccgaac aaatggattc gatgaaggga 540cgtgtcgcgg
atgtcgcgag ttatgttgga caaaagacta aggatgttgg acaaaaaact 600aaagaggttg
gacaagacat acaaacaaaa gcacatgaag ctaagagatc aacactcgag 660tgaaagggtg
ggcgcgccga cccagctttc ttgtacaaag ttggcattat aagaaagcat 720tgcttatcaa
tttgttgcaa cgaacaggtc actatcagtc aaaataaaat cattatttgc 780catccagctg
atatccccta tagtgagtcg tattacatgg tcatagctgt ttcctggcag 840ctctggcccg
tgtctcaaaa tctctgatg
86943818DNATrifolium repens 43gacatacaaa caaaagcaca tgaagctaag agatcaacag
gtactatcga taccgtcgac 60ctcgacggta cttctatggc acaacctcaa gttcaagtcc
actcaacaac aacacaccgt 120caagaaactg ctacctaccc atcaacccaa aacattcgta
aagatgttta cgaaaatgtt 180aactatcccg gccaacgcgg tcgttataac gaccgctata
atgatagtgg tcgttatgat 240ggtggtattg cctccttttt gtcagagaga agtcctccag
cctctcaaat cctcgctacc 300gttggaggat ttttcatagg tggtactcta tttttattag
ctagcatttc atttatcgcc 360agtcttattg gattggcgat aatgacacca ctttttatcc
tttttagccc ggttttagtc 420cctgctgccc tcactatagg gctagcagtg gctggaatat
tgacagcaga tgcttgcggg 480ttgacggggc ttatgtcgtt gtcgtggacc gtgaaatatg
ttagggattt acaagcagta 540gtgcccgaac aaatggattc gatgaaggga cgtgtcgcgg
atgtcgcgag ttatgttgga 600caaaagacta aggatgttgg acaaaaaact aaagaggttg
gacaagacat acaaacaaaa 660gcacatgaag ctaagagatc aacactcgag tgaaagggtg
ggcgcgccga cccagctttc 720ttgtacaaag ttgscattat aagaaagcat tgcttatcaa
tttgtkgcaa cgaacaggtc 780actatcagtc aaaataaaat ckattatttg ctcgattg
81844304DNAArabidopsis thaliana 44gtaaatttct
gtgttcctta ttctctcaaa atcttcgatt ttgttttcgt tcgatcccaa 60tttcgtatat
gttctttggt ttagattctg ttaatcttag atcgaagacg attttctggg 120tttgatcgtt
agatatcatc ttaattctcg attagggttt catagatatc atccgatttg 180ttcaaataat
ttgagttttg tcgaataatt actcttcgat ttgtgatttc tatctagatc 240tggtgttagt
ttctagtttg tgcgatcgaa tttgtcgatt aatctgagtt tttctgatta 300acag
30445304DNAArabidopsis thaliana 45gtaaatttct gtgttcctta ttctctcaaa
atcttcgatt ttgttttcgt tcgatcccaa 60tttcgtatat gttctttggt ttagattctg
ttaatcttag atcgaagacg attttctggg 120tttgatcgtt agatatcatc ttaattctcg
attagggttt catagatatc atccgatttg 180ttcaaataat ttgagttttg tcgaataatt
actcttcgat ttgtgatttc tatctagatc 240tggtgttagt ttctagtttg tgcgatcgaa
tttgtcgatt aatctgagtt tttctgatct 300gcag
30446301DNAArabidopsis thaliana
46gtaaatttct gtgttcctta ttctctcaaa atcttcgatt ttgttttcgt tcgatcccaa
60tttcgtatat gttctttggt ttagattctg ttaatcttag atcgaagacg attttctggg
120tttgatcgtt agatatcatc ttaattctcg attagggttt catagatatc atccgatttg
180ttcaaataat ttgagttttg tcgaataatt actcttcgat ttgtgatttc tatctagatc
240tggtgttagt ttctagtttg tgcgatcgaa tttgtcgatt aatctgagtt tttctgatca
300g
30147870PRTSesamum indicum 47Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr
Arg Ala Pro His Leu1 5 10
15Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr Ala Val
20 25 30Thr Ala Gly Gly Ser Leu Leu
Val Leu Ser Gly Leu Thr Leu Ala Gly 35 40
45Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu Val Ile Phe
Ser 50 55 60Pro Val Leu Val Pro Ala
Val Ile Thr Ile Phe Leu Leu Gly Ala Gly65 70
75 80Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala
Leu Ser Val Leu Ser 85 90
95Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly Ala Asp Gln
100 105 110Leu Glu Ser Ala Lys Thr
Lys Leu Ala Ser Lys Ala Arg Glu Met Lys 115 120
125Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val Ala Gly Ser
Gln Thr 130 135 140Ser Met Ala Glu His
Tyr Gly Gln Gln Gln Gln Thr Arg Ala Pro His145 150
155 160Leu Gln Leu Gln Pro Arg Ala Gln Arg Val
Val Lys Ala Ala Thr Ala 165 170
175Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala
180 185 190Gly Thr Val Ile Ala
Leu Thr Ile Ala Thr Pro Leu Leu Val Ile Phe 195
200 205Ser Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe
Leu Leu Gly Ala 210 215 220Gly Phe Leu
Ala Ser Gly Gly Phe Gly Val Ala Ala Leu Ser Val Leu225
230 235 240Ser Trp Ile Tyr Arg Tyr Leu
Thr Gly Lys His Pro Pro Gly Ala Asp 245
250 255Gln Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys
Ala Arg Glu Met 260 265 270Lys
Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val Ala Gly Ser Gln 275
280 285Thr Ser Met Ala Glu His Tyr Gly Gln
Gln Gln Gln Thr Arg Ala Pro 290 295
300His Leu Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr305
310 315 320Ala Val Thr Ala
Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu 325
330 335Ala Gly Thr Val Ile Ala Leu Thr Ile Ala
Thr Pro Leu Leu Val Ile 340 345
350Phe Ser Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly
355 360 365Ala Gly Phe Leu Ala Ser Gly
Gly Phe Gly Val Ala Ala Leu Ser Val 370 375
380Leu Ser Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly
Ala385 390 395 400Asp Gln
Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu
405 410 415Met Lys Asp Arg Ala Glu Gln
Phe Ser Gln Gln Pro Val Ala Gly Ser 420 425
430Gln Thr Ser Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr
Arg Ala 435 440 445Pro His Leu Gln
Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala 450
455 460Thr Ala Val Thr Ala Gly Gly Ser Leu Leu Val Leu
Ser Gly Leu Thr465 470 475
480Leu Ala Gly Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu Val
485 490 495Ile Phe Ser Pro Val
Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu 500
505 510Gly Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly Val
Ala Ala Leu Ser 515 520 525Val Leu
Ser Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly 530
535 540Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys Leu
Ala Ser Lys Ala Arg545 550 555
560Glu Met Lys Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val Ala Gly
565 570 575Ser Gln Thr Ser
Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr Arg 580
585 590Ala Pro His Leu Gln Leu Gln Pro Arg Ala Gln
Arg Val Val Lys Ala 595 600 605Ala
Thr Ala Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu 610
615 620Thr Leu Ala Gly Thr Val Ile Ala Leu Thr
Ile Ala Thr Pro Leu Leu625 630 635
640Val Ile Phe Ser Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe
Leu 645 650 655Leu Gly Ala
Gly Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala Leu 660
665 670Ser Val Leu Ser Trp Ile Tyr Arg Tyr Leu
Thr Gly Lys His Pro Pro 675 680
685Gly Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala 690
695 700Arg Glu Met Lys Asp Arg Ala Glu
Gln Phe Ser Gln Gln Pro Val Ala705 710
715 720Gly Ser Gln Thr Ser Met Ala Glu His Tyr Gly Gln
Gln Gln Gln Thr 725 730
735Arg Ala Pro His Leu Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys
740 745 750Ala Ala Thr Ala Val Thr
Ala Gly Gly Ser Leu Leu Val Leu Ser Gly 755 760
765Leu Thr Leu Ala Gly Thr Val Ile Ala Leu Thr Ile Ala Thr
Pro Leu 770 775 780Leu Val Ile Phe Ser
Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe785 790
795 800Leu Leu Gly Ala Gly Phe Leu Ala Ser Gly
Gly Phe Gly Val Ala Ala 805 810
815Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro
820 825 830Pro Gly Ala Asp Gln
Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys 835
840 845Ala Arg Glu Met Lys Asp Arg Ala Glu Gln Phe Ser
Gln Gln Pro Val 850 855 860Ala Gly Ser
Gln Thr Ser865 87048839PRTSesamum indicum 48Met Ala Glu
His Tyr Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu1 5
10 15Gln Leu Gln Pro Arg Ala Gln Arg Val
Val Lys Ala Ala Thr Ala Val 20 25
30Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly
35 40 45Thr Val Ile Ala Leu Thr Ile
Ala Thr Pro Leu Leu Val Ile Phe Ser 50 55
60Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly65
70 75 80Phe Leu Ala Ser
Gly Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser 85
90 95Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His
Pro Pro Gly Ala Asp Gln 100 105
110Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys
115 120 125Asp Arg Ala Glu Gln Phe Ser
Gln Gln Pro Val Ala Gly Ser Gln Thr 130 135
140Ser Ile Asp Pro Ser Ser Trp Leu Glu Met Ala Glu His Tyr Gly
Gln145 150 155 160Gln Gln
Gln Thr Arg Ala Pro His Leu Gln Leu Gln Pro Arg Ala Gln
165 170 175Arg Val Val Lys Ala Ala Thr
Ala Val Thr Ala Gly Gly Ser Leu Leu 180 185
190Val Leu Ser Gly Leu Thr Leu Ala Gly Thr Val Ile Ala Leu
Thr Ile 195 200 205Ala Thr Pro Leu
Leu Val Ile Phe Ser Pro Val Leu Val Pro Ala Val 210
215 220Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala
Ser Gly Gly Phe225 230 235
240Gly Val Ala Ala Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr Leu Thr
245 250 255Gly Lys His Pro Pro
Gly Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys 260
265 270Leu Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala
Glu Gln Phe Ser 275 280 285Gln Gln
Pro Val Ala Gly Ser Gln Thr Ser His Met Phe Lys Trp Pro 290
295 300Ser Ala Met Ala Glu His Tyr Gly Gln Gln Gln
Gln Thr Arg Ala Pro305 310 315
320His Leu Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr
325 330 335Ala Val Thr Ala
Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu 340
345 350Ala Gly Thr Val Ile Ala Leu Thr Ile Ala Thr
Pro Leu Leu Val Ile 355 360 365Phe
Ser Pro Val Leu Val Pro Ser Ser Ser Glu Leu Pro Trp Val Asp 370
375 380Met Ala Glu His Tyr Gly Gln Gln Gln Gln
Thr Arg Ala Pro His Leu385 390 395
400Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr Ala
Val 405 410 415Thr Ala Gly
Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly 420
425 430Thr Val Ile Ala Leu Thr Ile Ala Thr Pro
Leu Leu Val Ile Phe Ser 435 440
445Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly 450
455 460Phe Leu Ala Ser Gly Gly Phe Gly
Val Ala Ala Leu Ser Val Leu Ser465 470
475 480Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro
Gly Ala Asp Gln 485 490
495Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys
500 505 510Asp Arg Ala Glu Gln Phe
Ser Gln Gln Pro Val Ala Gly Ser Gln Thr 515 520
525Ser His Met Glu Phe Lys Leu Ser Thr Met Ala Glu His Tyr
Gly Gln 530 535 540Gln Gln Gln Thr Arg
Ala Pro His Leu Gln Leu Gln Pro Arg Ala Gln545 550
555 560Arg Val Val Lys Ala Ala Thr Ala Val Thr
Ala Gly Gly Ser Leu Leu 565 570
575Val Leu Ser Gly Leu Thr Leu Ala Gly Thr Val Ile Ala Leu Thr Ile
580 585 590Ala Thr Pro Leu Leu
Val Ile Phe Ser Pro Val Leu Val Pro Ala Val 595
600 605Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala
Ser Gly Gly Phe 610 615 620Gly Val Ala
Ala Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr Leu Thr625
630 635 640Gly Lys His Pro Pro Gly Ala
Asp Gln Leu Glu Ser Ala Lys Thr Lys 645
650 655Leu Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala
Glu Gln Phe Ser 660 665 670Gln
Gln Pro Val Ala Gly Ser Gln Thr Ser Ile Asp Gln Gln Val Asn 675
680 685Val His Met Ala Glu His Tyr Gly Gln
Gln Gln Gln Thr Arg Ala Pro 690 695
700His Leu Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr705
710 715 720Ala Val Thr Ala
Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu 725
730 735Ala Gly Thr Val Ile Ala Leu Thr Ile Ala
Thr Pro Leu Leu Val Ile 740 745
750Phe Ser Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly
755 760 765Ala Gly Phe Leu Ala Ser Gly
Gly Phe Gly Val Ala Ala Leu Ser Val 770 775
780Leu Ser Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly
Ala785 790 795 800Asp Gln
Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu
805 810 815Met Lys Asp Arg Ala Glu Gln
Phe Ser Gln Gln Pro Val Ala Gly Ser 820 825
830Gln Thr Ser Pro Trp Leu Glu 83549869PRTSesamum
indicum 49Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr Arg Ala Pro His
Leu1 5 10 15Gln Leu Gln
Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr Ala Val 20
25 30Thr Ala Gly Gly Ser Leu Leu Val Leu Ser
Gly Leu Thr Leu Ala Gly 35 40
45Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu Val Ile Phe Ser 50
55 60Pro Val Leu Val Pro Ala Val Ile Thr
Ile Phe Leu Leu Gly Ala Gly65 70 75
80Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala Leu Ser Val
Leu Ser 85 90 95Trp Ile
Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly Ala Asp Gln 100
105 110Leu Glu Ser Ala Lys Thr Lys Leu Ala
Ser Lys Ala Arg Glu Met Lys 115 120
125Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val Ala Gly Ser Gln Thr
130 135 140Ser Met Ala Glu His Tyr Gly
Gln Gln Gln Gln Thr Arg Ala Pro His145 150
155 160Leu Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys
Ala Ala Thr Ala 165 170
175Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala
180 185 190Gly Thr Val Ile Ala Leu
Thr Ile Ala Thr Pro Leu Leu Val Ile Phe 195 200
205Ser Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu
Gly Ala 210 215 220Gly Phe Leu Ala Ser
Gly Gly Phe Gly Val Ala Ala Leu Ser Val Leu225 230
235 240Ser Trp Ile Tyr Arg Tyr Leu Thr Gly Lys
His Pro Pro Gly Ala Asp 245 250
255Gln Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met
260 265 270Lys Asp Arg Ala Glu
Gln Phe Ser Gln Gln Pro Val Ala Gly Ser Gln 275
280 285Thr Ser Met Ala Glu His Tyr Gly Gln Gln Gln Gln
Thr Arg Ala Pro 290 295 300His Leu Gln
Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr305
310 315 320Ala Val Thr Ala Gly Gly Ser
Leu Leu Val Leu Ser Gly Leu Thr Leu 325
330 335Ala Gly Thr Val Ile Ala Leu Thr Ile Ala Thr Pro
Leu Leu Val Ile 340 345 350Phe
Ser Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly 355
360 365Ala Gly Phe Leu Ala Ser Gly Gly Phe
Gly Val Ala Ala Leu Ser Val 370 375
380Leu Ser Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly Ala385
390 395 400Asp Gln Leu Glu
Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu 405
410 415Met Lys Asp Arg Ala Glu Gln Phe Ser Gln
Gln Pro Val Ala Gly Ser 420 425
430Gln Thr Ser Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr Arg Ala
435 440 445Pro His Leu Gln Leu Gln Pro
Arg Ala Gln Arg Val Val Lys Ala Ala 450 455
460Thr Ala Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu
Thr465 470 475 480Leu Ala
Gly Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu Val
485 490 495Ile Phe Ser Pro Val Leu Val
Pro Ala Val Ile Thr Ile Phe Leu Leu 500 505
510Gly Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala
Leu Ser 515 520 525Val Leu Ser Trp
Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly 530
535 540Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys Leu Ala
Ser Lys Ala Arg545 550 555
560Glu Met Lys Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val Ala Gly
565 570 575Ser Gln Thr Ser Met
Ala Glu His Tyr Gly Gln Gln Gln Gln Thr Arg 580
585 590Ala Pro His Leu Gln Leu Gln Pro Arg Ala Gln Arg
Val Val Lys Ala 595 600 605Ala Thr
Ala Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu 610
615 620Thr Leu Ala Gly Thr Val Ile Ala Leu Thr Ile
Ala Thr Pro Leu Leu625 630 635
640Val Ile Phe Ser Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu
645 650 655Leu Gly Ala Gly
Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala Leu 660
665 670Ser Val Leu Ser Trp Ile Tyr Arg Tyr Leu Thr
Gly Lys His Pro Pro 675 680 685Gly
Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala 690
695 700Arg Glu Met Lys Asp Arg Ala Glu Gln Phe
Ser Gln Gln Pro Val Ala705 710 715
720Gly Ser Gln Thr Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr
Arg 725 730 735Ala Pro His
Leu Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala 740
745 750Ala Thr Ala Val Thr Ala Gly Gly Ser Leu
Leu Val Leu Ser Gly Leu 755 760
765Thr Leu Ala Gly Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu 770
775 780Val Ile Phe Ser Pro Val Leu Val
Pro Ala Val Ile Thr Ile Phe Leu785 790
795 800Leu Gly Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly
Val Ala Ala Leu 805 810
815Ser Val Leu Ser Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro
820 825 830Gly Ala Asp Gln Leu Glu
Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala 835 840
845Arg Glu Met Lys Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro
Val Ala 850 855 860Gly Ser Gln Thr
Ser865503289DNAArtificial SequenceSynthetic sequence 50aataatgatt
ttattttgac tgatagtgac ctgttcgttg caacaaattg atgagcaatg 60cttttttata
atgccaactt tgtacaaaaa agcaggctcc gcggccgctt gctccgttaa 120aaaaaaccat
ggcagagcac tacgggcagc agcaacaaac acgtgccccc cacttgcaac 180tgcaaccgcg
tgctcagcgt gttgtaaagg cagcgaccgc ggttactgcc ggaggtagcc 240tgttggtgtt
atccgggttg accctggctg gaacggtcat tgcgctgaca atcgccacac 300ctctcctggt
gatcttctcg cctgtactgg tccctgcagg taaatttctg tgttccttat 360tctctcaaaa
tcttcgattt tgttttcgtt cgatcccaat ttcgtatatg ttctttggtt 420tagattctgt
taatcttaga tcgaagacga ttttctgggt ttgatcgtta gatatcatct 480taattctcga
ttagggtttc atagatatca tccgatttgt tcaaataatt tgagttttgt 540cgaataatta
ctcttcgatt tgtgatttct atctagatct ggtgttagtt tctagtttgt 600gcgatcgaat
ttgtcgatta atctgagttt ttctgatctg cagtcatcac catcttcctg 660ttgggagcgg
gttttctggc aagcggggga tttggtgttg ccgctctgtc tgtgctgtcc 720tggatctacc
gttacctgac ggggaaacat ccacccggag cggatcagtt ggagtcggcc 780aagacaaagt
tggcgagcaa ggcccgtgaa atgaaggacc gtgccgagca gttcagtcag 840caaccggtag
cagggtctca gaccagcatc gatccatcct cctggctcga gatggcggaa 900cactacgggc
aacaacagca gactcgtgct ccccacctgc aattacaacc ccgtgcccaa 960cgtgttgtga
aagcggcaac agcagtaacg gcagggggaa gtttgctggt cttatcgggg 1020ttgaccttag
cgggaaccgt gattgccctg acaattgcga ctccgctgct ggttatcttc 1080agccccgtat
tggttccggc cgtgatcacg atttttttgc tgggggcagg atttttagcc 1140agcggaggat
ttggggtcgc agcgttgtct gtgctgagtt ggatctatcg ttatttgacc 1200gggaagcacc
cacctggagc agaccagctg gagagcgcga aaacgaagct ggcatcgaag 1260gcgcgtgaaa
tgaaggatcg tgctgaacaa ttctcccagc agcctgttgc cggttctcag 1320accagccata
tgtttaaatg gccaagcgct atggccgagc attatgggca gcaacagcaa 1380acccgtgccc
cgcatctgca attgcaacct cgtgcccagc gtgtcgttaa ggcggctact 1440gcggtaacag
cgggagggag cttactggta ttaagcgggc tgacattggc cggaacggtg 1500atcgccttaa
caatcgcgac acccttgctg gtcatcttca gtccggttct ggtgcccgcg 1560gtgattacga
ttttcctgct gggagccggt ttcttagcat cggggggttt tggggtagca 1620gccttgagtg
tcctgtcgtg gatctatcgt tacttaactg gaaaacaccc gccaggagct 1680gaccagttgg
agtctgcaaa aactaagctg gcgtccaaag cccgtgaaat gaaggatcgt 1740gctgagcagt
ttagccagca gccagttgcg ggaagtcaga cctcttcatc tgagctccca 1800tgggtcgaca
tggcggagca ttacggtcaa cagcaacaga cccgtgctcc gcacttacaa 1860ttgcaaccac
gtgctcaacg tgtcgtaaaa gccgccacgg cagttactgc ggggggatca 1920ttgctggtgt
taagtgggtt gacactggcg gggacagtta ttgcactgac gatcgcgacc 1980cccttgttag
tgatcttctc ccccgttctg gttccggcgg tcattacaat ctttctgttg 2040ggtgccggat
ttttagcctc tgggggattt ggagtagctg ccctgtcagt gttgagctgg 2100atctaccgtt
acttaacagg gaagcaccct cccggggcag atcagttgga aagcgccaag 2160accaagctgg
caagtaaagc gcgtgaaatg aaggaccgtg ccgaacaatt ttcgcagcaa 2220ccggttgcgg
gatcacagac ctctagtact ccatcctcct ggcatatgat ggccgagcac 2280tatggacaac
agcagcagac gcgtgcccct catctgcaac tgcaaccccg tgctcaacgt 2340gtcgttaagg
ctgcgacagc agtaaccgct gggggttctc tgttagtgtt gtcagggctg 2400actttggcgg
ggacggtaat tgcgttgacc attgccaccc cgctgttagt gattttcagc 2460ccggtactgg
tgccagcagt tatcacgatc ttcttgctgg gtgccggatt cttggcaagt 2520ggaggttttg
gagttgcggc gctgtcagtt ttatcctgga tctatcgtta tctgacagga 2580aaacatcccc
caggtgccga tcagctggag agtgccaaga caaaactggc gtctaaggca 2640cgtgaaatga
aggatcgtgc cgaacagttt tctcaacagc ccgtagcggg gtcacagacc 2700tcgatcgatc
agcaggttaa cgtgcacatg gccgaacatt acggacagca acaacagacg 2760cgtgctccac
acctgcaatt gcaaccgcgt gctcaacgtg ttgtcaaagc ggcgaccgcc 2820gtaacagcag
gaggatcact gttagtgctg tcgggtttaa ccttggccgg gaccgtcatt 2880gcattgacta
ttgcgacgcc cttactggtg atcttttctc cggtgctggt tcccgccgtt 2940attaccatct
tcttgttagg ggcaggattc ctggcatcag ggggattcgg agttgcggcg 3000ttgagtgtct
taagttggat ctaccgttat ctgactggaa agcacccgcc tggggccgat 3060caactggagt
cagccaaaac gaaattggcg tcaaaagcgc gtgaaatgaa ggaccgtgct 3120gagcagtttt
ctcagcagcc tgtggcagga tcccagacat caccatggct cgagtaatga 3180agcggccgca
cccagctttc ttgtacaaag ttggcattat aagaaagcat tgcttatcaa 3240tttgttgcaa
cgaacaggtc actatcagtc aaaataaaat cattatttg
328951914PRTArtificial SequenceSynthetic sequence 51Met Ala Glu His Tyr
Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu1 5
10 15Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys
Ala Ala Thr Ala Val 20 25
30Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly
35 40 45Thr Val Ile Ala Leu Thr Ile Ala
Thr Pro Leu Leu Val Ile Phe Ser 50 55
60Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly65
70 75 80Phe Leu Ala Ser Gly
Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser 85
90 95Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro
Pro Gly Ala Asp Gln 100 105
110Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys
115 120 125Asp Arg Ala Glu Gln Phe Ser
Gln Gln Pro Val Ala Gly Ser Gln Thr 130 135
140Ser Ile Asp Pro Ser Ser Trp Leu Glu Met Ala Glu His Tyr Gly
Gln145 150 155 160Gln Gln
Gln Thr Arg Ala Pro His Leu Gln Leu Gln Pro Arg Ala Gln
165 170 175Arg Val Val Lys Ala Ala Thr
Ala Val Thr Ala Gly Gly Ser Leu Leu 180 185
190Val Leu Ser Gly Leu Thr Leu Ala Gly Thr Val Ile Ala Leu
Thr Ile 195 200 205Ala Thr Pro Leu
Leu Val Ile Phe Ser Pro Val Leu Val Pro Ala Val 210
215 220Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala
Ser Gly Gly Phe225 230 235
240Gly Val Ala Ala Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr Leu Thr
245 250 255Gly Lys His Pro Pro
Gly Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys 260
265 270Leu Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala
Glu Gln Phe Ser 275 280 285Gln Gln
Pro Val Ala Gly Ser Gln Thr Ser His Met Phe Lys Trp Pro 290
295 300Ser Ala Met Ala Glu His Tyr Gly Gln Gln Gln
Gln Thr Arg Ala Pro305 310 315
320His Leu Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr
325 330 335Ala Val Thr Ala
Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu 340
345 350Ala Gly Thr Val Ile Ala Leu Thr Ile Ala Thr
Pro Leu Leu Val Ile 355 360 365Phe
Ser Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly 370
375 380Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly
Val Ala Ala Leu Ser Val385 390 395
400Leu Ser Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly
Ala 405 410 415Asp Gln Leu
Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu 420
425 430Met Lys Asp Arg Ala Glu Gln Phe Ser Gln
Gln Pro Val Ala Gly Ser 435 440
445Gln Thr Ser Ser Ser Glu Leu Pro Trp Val Asp Met Ala Glu His Tyr 450
455 460Gly Gln Gln Gln Gln Thr Arg Ala
Pro His Leu Gln Leu Gln Pro Arg465 470
475 480Ala Gln Arg Val Val Lys Ala Ala Thr Ala Val Thr
Ala Gly Gly Ser 485 490
495Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly Thr Val Ile Ala Leu
500 505 510Thr Ile Ala Thr Pro Leu
Leu Val Ile Phe Ser Pro Val Leu Val Pro 515 520
525Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala
Ser Gly 530 535 540Gly Phe Gly Val Ala
Ala Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr545 550
555 560Leu Thr Gly Lys His Pro Pro Gly Ala Asp
Gln Leu Glu Ser Ala Lys 565 570
575Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala Glu Gln
580 585 590Phe Ser Gln Gln Pro
Val Ala Gly Ser Gln Thr Ser Ser Thr Pro Ser 595
600 605Ser Trp His Met Met Ala Glu His Tyr Gly Gln Gln
Gln Gln Thr Arg 610 615 620Ala Pro His
Leu Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala625
630 635 640Ala Thr Ala Val Thr Ala Gly
Gly Ser Leu Leu Val Leu Ser Gly Leu 645
650 655Thr Leu Ala Gly Thr Val Ile Ala Leu Thr Ile Ala
Thr Pro Leu Leu 660 665 670Val
Ile Phe Ser Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu 675
680 685Leu Gly Ala Gly Phe Leu Ala Ser Gly
Gly Phe Gly Val Ala Ala Leu 690 695
700Ser Val Leu Ser Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro705
710 715 720Gly Ala Asp Gln
Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala 725
730 735Arg Glu Met Lys Asp Arg Ala Glu Gln Phe
Ser Gln Gln Pro Val Ala 740 745
750Gly Ser Gln Thr Ser Ile Asp Gln Gln Val Asn Val His Met Ala Glu
755 760 765His Tyr Gly Gln Gln Gln Gln
Thr Arg Ala Pro His Leu Gln Leu Gln 770 775
780Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr Ala Val Thr Ala
Gly785 790 795 800Gly Ser
Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly Thr Val Ile
805 810 815Ala Leu Thr Ile Ala Thr Pro
Leu Leu Val Ile Phe Ser Pro Val Leu 820 825
830Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe
Leu Ala 835 840 845Ser Gly Gly Phe
Gly Val Ala Ala Leu Ser Val Leu Ser Trp Ile Tyr 850
855 860Arg Tyr Leu Thr Gly Lys His Pro Pro Gly Ala Asp
Gln Leu Glu Ser865 870 875
880Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala
885 890 895Glu Gln Phe Ser Gln
Gln Pro Val Ala Gly Ser Gln Thr Ser Pro Trp 900
905 910Leu Glu 523291DNAArtificial SequenceSynthetic
sequence 52caaataatga ttttattttg actgatagtg acctgttcgt tgcaacaaat
tgatgagcaa 60tgctttttta taatgccaac tttgtacaaa aaagcaggct ccgcggccgc
ttgctccgtt 120aaaaaaaacc atggcagagc actacgggca gcagcaacaa acacgtgccc
cccacttgca 180actgcaaccg cgtgctcagc gtgttgtaaa ggcagcgacc gcggttactg
ccggaggtag 240cctgttggtg ttatccgggt tgaccctggc tggaacggtc attgcgctga
caatcgccac 300acctctcctg gtgatcttct cgcctgtact ggtccctgca ggtaaatttc
tgtgttcctt 360attctctcaa aatcttcgat tttgttttcg ttcgatccca atttcgtata
tgttctttgg 420tttagattct gttaatctta gatcgaagac gattttctgg gtttgatcgt
tagatatcat 480cttaattctc gattagggtt tcatagatat catccgattt gttcaaataa
tttgagtttt 540gtcgaataat tactcttcga tttgtgattt ctatctagat ctggtgttag
tttctagttt 600gtgcgatcga atttgtcgat taatctgagt ttttctgatc tgcagtcatc
accatcttcc 660tgttgggagc gggttttctg gcaagcgggg gatttggtgt tgccgctctg
tctgtgctgt 720cctggatcta ccgttacctg acggggaaac atccacccgg agcggatcag
ttggagtcgg 780ccaagacaaa gttggcgagc aaggcccgtg aaatgaagga ccgtgccgag
cagttcagtc 840agcaaccggt agcagggtct cagaccagca tcgatccatc ctcctggctc
gagatggcgg 900aacactacgg gcaacaacag cagactcgtg ctccccacct gcaattacaa
ccccgtgccc 960aacgtgttgt gaaagcggca acagcagtaa cggcaggggg aagtttgctg
gtcttatcgg 1020ggttgacctt agcgggaacc gtgattgccc tgacaattgc gactccgctg
ctggttatct 1080tcagccccgt attggttccg gccgtgatca cgattttttt gctgggggca
ggatttttag 1140ccagcggagg atttggggtc gcagcgttgt ctgtgctgag ttggatctat
cgttatttga 1200ccgggaagca cccacctgga gcagaccagc tggagagcgc gaaaacgaag
ctggcatcga 1260aggcgcgtga aatgaaggat cgtgctgaac aattctccca gcagcctgtt
gccggttctc 1320agaccagcca tatgtttaaa tggccaagcg ctatggccga gcattatggg
cagcaacagc 1380aaacccgtgc cccgcatctg caattgcaac ctcgtgccca gcgtgtcgtt
aaggcggcta 1440ctgcggtaac agcgggaggg agcttactgg tattaagcgg gctgacattg
gccggaacgg 1500tgatcgcctt aacaatcgcg acacccttgc tggtcatctt cagtccggtt
ctggtgcccg 1560cggtgattac gattttcctg ctgggagccg gtttcttagc atcggggggt
tttggggtag 1620cagccttgag tgtcctgtcg tggatctatc gttacttaac tggaaaacac
ccgccaggag 1680ctgaccagtt ggagtctgca aaaactaagc tggcgtccaa agcccgtgaa
atgaaggatc 1740gtgctgagca gtttagccag cagccagttg cgggaagtca gacctcttca
tctgagctcc 1800catgggtcga catggcggag cattacggtc aacagcaaca gacccgtgct
ccgcacttac 1860aattgcaacc acgtgctcaa cgtgtcgtaa aagccgccac ggcagttact
gcggggggat 1920cattgctggt gttaagtggg ttgacactgg cggggacagt tattgcactg
acgatcgcga 1980cccccttgtt agtgatcttc tcccccgttc tggttccggc ggtcattaca
atctttctgt 2040tgggtgccgg atttttagcc tctgggggat ttggagtagc tgccctgtca
gtgttgagct 2100ggatctaccg ttacttaaca gggaagcacc ctcccggggc agatcagttg
gaaagcgcca 2160agaccaagct ggcaagtaaa gcgcgtgaaa tgaaggaccg tgccgaacaa
ttttcgcagc 2220aaccggttgc gggatcacag acctctagta ctccatcctc ctggcatatg
atggccgagc 2280actatggaca acagcagcag acgcgtgccc ctcatctgca actgcaaccc
cgtgctcaac 2340gtgtcgttaa ggctgcgaca gcagtaaccg ctgggggttc tctgttagtg
ttgtcagggc 2400tgactttggc ggggacggta attgcgttga ccattgccac cccgctgtta
gtgattttca 2460gcccggtact ggtgccagca gttatcacga tcttcttgct gggtgccgga
ttcttggcaa 2520gtggaggttt tggagttgcg gcgctgtcag ttttatcctg gatctatcgt
tatctgacag 2580gaaaacatcc cccaggtgcc gatcagctgg agagtgccaa gacaaaactg
gcgtctaagg 2640cacgtgaaat gaaggatcgt gccgaacagt tttctcaaca gcccgtagcg
gggtcacaga 2700cctcgatcga tcagcaggtt aacgtgcaca tggccgaaca ttacggacag
caacaacaga 2760cgcgtgctcc acacctgcaa ttgcaaccgc gtgctcaacg tgttgtcaaa
gcggcgaccg 2820ccgtaacagc aggaggatca ctgttagtgc tgtcgggttt aaccttggcc
gggaccgtca 2880ttgcattgac tattgcgacg cccttactgg tgatcttttc tccggtgctg
gttcccgccg 2940ttattaccat cttcttgtta ggggcaggat tcctggcatc agggggattc
ggagttgcgg 3000cgttgagtgt cttaagttgg atctaccgtt atctgactgg aaagcacccg
cctggggccg 3060atcaactgga gtcagccaaa acgaaattgg cgtcaaaagc gcgtgaaatg
aaggaccgtg 3120ctgagcagtt ttctcagcag cctgtggcag gatcccagac atcaccatgg
ctcgagtaat 3180gaagcggccg cacccagctt tcttgtacaa agttggcatt ataagaaagc
attgcttatc 3240aatttgttgc aacgaacagg tcactatcag tcaaaataaa atcattattt g
3291531350DNAArtificial SequenceSynthetic sequence
53ttgctccgtt aaaaaaaacc atggctgagc attatggtca acaacagcag accagggcgc
60ctcacctgca gctgcagccg cgcgcccagc gggtagtgaa ggcggccacc gccgtgacag
120ccggcggctc gcttctcgtc ctctctggcc tcactttagc cggaactgtt attgcgctca
180ccatcgccac tccgctgctt gtgatcttta gccccgttct ggtgccggcg gtcataacca
240ttttcttgct gggtgcgggt tttctggcat ccggaggctt cggcgtggcg gcgctgagtg
300tgctgtcgtg gatttacaga tatctgacag ggaaacaccc gccgggggcg gatcagctgg
360aatcggcaaa gacgaagctg gcgagcaagg cgcgagagat gaaggatagg gcagagcagt
420tctcgcagca gcctgttgga ggcggtggat ccggaggcgg tggtagtatg gctgagcatt
480atggtcaaca acagcagacc agggcgcctc acctgcagct gcagccgcgc gcccagcggg
540tagtgaaggc ggccaccgcc gtgacagccg gcggctcgct tctcgtcctc tctggcctca
600ctttagccgg aactgttatt gcgctcacca tcgccactcc gctgcttgtg atctttagcc
660ccgttctggt gccggcggtc ataaccattt tcttgctggg tgcgggtttt ctggcatccg
720gaggcttcgg cgtggcggcg ctgagtgtgc tgtcgtggat ttacagatat ctgacaggga
780aacacccgcc gggggcggat cagctggaat cggcaaagac gaagctggcg agcaaggcgc
840gagagatgaa ggatagggca gagcagttct cgcagcagcc tgttgggggc ggtggatccg
900gtggaggggg atccatggct gagcattatg gtcaacaaca gcagaccagg gcgcctcacc
960tgcagctgca gccgcgcgcc cagcgggtag tgaaggcggc caccgccgtg acagccggcg
1020gctcgcttct cgtcctctct ggcctcactt tagccggaac tgttattgcg ctcaccatcg
1080ccactccgct gcttgtgatc tttagccccg ttctggtgcc ggcggtcata accattttct
1140tgctgggtgc gggttttctg gcatccggag gcttcggcgt ggcggcgctg agtgtgctgt
1200cgtggattta cagatatctg acagggaaac acccgccggg ggcggatcag ctggaatcgg
1260caaagacgaa gctggcgagc aaggcgcgag agatgaagga tagggcagag cagttctcgc
1320agcagcctgt tccatggctc gagtaatgaa
1350542878DNAArtificial SequenceSynthetic sequence 54gaaggagata
tacatatgaa agaaaccgct gctgctaaat tcgaacgcca gcacatggac 60agcccagatc
tgggtaccct ggtgccacgc ggttccatgg ctgagcatta tggtcaacaa 120cagcagacca
gggcgcctca cctgcagctg cagccgcgcg cccagcgggt agtgaaggcg 180gccaccgccg
tgacagccgg cggctcgctt ctcgtcctct ctggcctcac tttagccgga 240actgttattg
cgctcaccat cgccactccg ctgcttgtga tctttagccc cgttctggtg 300ccggcggtca
taaccatttt cttgctgggt gcgggttttc tggcatccgg aggcttcggc 360gtggcggcgc
tgagtgtgct gtcgtggatt tacagatatc tgacagggaa acacccgccg 420ggggcggatc
agctggaatc ggcaaagacg aagctggcga gcaaggcgcg agagatgaag 480gatagggcag
agcagttctc gcagcagcct gttccatggc tgatatcgga tccgaattcg 540agctccgtcg
acaagcttgc ggccgcactc gagatggcgg aacactacgg gcaacaacag 600cagactcgtg
ctccccacct gcaattacaa ccccgtgccc aacgtgttgt gaaagcggca 660acagcagtaa
cggcaggggg aagtttgctg gtcttatcgg ggttgacctt agcgggaacc 720gtgattgccc
tgacaattgc gactccgctg ctggttatct tcagccccgt attggttccg 780gccgtgatca
cgattttttt gctgggggca ggatttttag ccagcggagg atttggggtc 840gcagcgttgt
ctgtgctgag ttggatctat cgttatttga ccgggaagca cccacctgga 900gcagaccagc
tggagagcgc gaaaacgaag ctggcatcga aggcgcgtga aatgaaggat 960cgtgctgaac
aattctccca gcagcctgtt gccggttctc agaccagcca tatgtttaaa 1020tggccaagcg
ctatggccga gcattatggg cagcaacagc aaacccgtgc cccgcatctg 1080caattgcaac
ctcgtgccca gcgtgtcgtt aaggcggcta ctgcggtaac agcgggaggg 1140agcttactgg
tattaagcgg gctgacattg gccggaacgg tgatcgcctt aacaatcgcg 1200acacccttgc
tggtcatctt cagtccggtt ctggtgcccg cggtgattac gattttcctg 1260ctgggagccg
gtttcttagc atcggggggt tttggggtag cagccttgag tgtcctgtcg 1320tggatctatc
gttacttaac tggaaaacac ccgccaggag ctgaccagtt ggagtctgca 1380aaaactaagc
tggcgtccaa agcccgtgaa atgaaggatc gtgctgagca gtttagccag 1440cagccagttg
cgggaagtca gacctcttca tctgagctcc catgggtcga catggcggag 1500cattacggtc
aacagcaaca gacccgtgct ccgcacttac aattgcaacc acgtgctcaa 1560cgtgtcgtaa
aagccgccac ggcagttact gcggggggat cattgctggt gttaagtggg 1620ttgacactgg
cggggacagt tattgcactg acgatcgcga cccccttgtt agtgatcttc 1680tcccccgttc
tggttccggc ggtcattaca atctttctgt tgggtgccgg atttttagcc 1740tctgggggat
ttggagtagc tgccctgtca gtgttgagct ggatctaccg ttacttaaca 1800gggaagcacc
ctcccggggc agatcagttg gaaagcgcca agaccaagct ggcaagtaaa 1860gcgcgtgaaa
tgaaggaccg tgccgaacaa ttttcgcagc aaccggttgc gggatcacag 1920acctctagta
ctccatcctc ctggcatatg atggccgagc actatggaca acagcagcag 1980acgcgtgccc
ctcatctgca actgcaaccc cgtgctcaac gtgtcgttaa ggctgcgaca 2040gcagtaaccg
ctgggggttc tctgttagtg ttgtcagggc tgactttggc ggggacggta 2100attgcgttga
ccattgccac cccgctgtta gtgattttca gcccggtact ggtgccagca 2160gttatcacga
tcttcttgct gggtgccgga ttcttggcaa gtggaggttt tggagttgcg 2220gcgctgtcag
ttttatcctg gatctatcgt tatctgacag gaaaacatcc cccaggtgcc 2280gatcagctgg
agagtgccaa gacaaaactg gcgtctaagg cacgtgaaat gaaggatcgt 2340gccgaacagt
tttctcaaca gcccgtagcg gggtcacaga cctcgatcga tcagcaggtt 2400aacgtgcaca
tggccgaaca ttacggacag caacaacaga cgcgtgctcc acacctgcaa 2460ttgcaaccgc
gtgctcaacg tgttgtcaaa gcggcgaccg ccgtaacagc aggaggatca 2520ctgttagtgc
tgtcgggttt aaccttggcc gggaccgtca ttgcattgac tattgcgacg 2580cccttactgg
tgatcttttc tccggtgctg gttcccgccg ttattaccat cttcttgtta 2640ggggcaggat
tcctggcatc agggggattc ggagttgcgg cgttgagtgt cttaagttgg 2700atctaccgtt
atctgactgg aaagcacccg cctggggccg atcaactgga gtcagccaaa 2760acgaaattgg
cgtcaaaagc gcgtgaaatg aaggaccgtg ctgagcagtt ttctcagcag 2820cctgtggcag
gatcccagac atcaccatgg ctcgagcacc accaccacca ccactgag
2878551068DNAArtificial SequenceSynthetic sequence 55gaaggagata
tacatatgaa agaaaccgct gctgctaaat tcgaacgcca gcacatggac 60agcccagatc
tgggtaccct ggtgccacgc ggttccatgg ctgagcatta tggtcaacaa 120cagcagacca
gggcgcctca cctgcagctg cagccgcgcg cccagcgggt agtgaaggcg 180gccaccgccg
tgacagccgg cggctcgctt ctcgtcctct ctggcctcac tttagccgga 240actgttattg
cgctcaccat cgccactccg ctgcttgtga tctttagccc cgttctggtg 300ccggcggtca
taaccatttt cttgctgggt gcgggttttc tggcatccgg aggcttcggc 360gtggcggcgc
tgagtgtgct gtcgtggatt tacagatatc tgacagggaa acacccgccg 420ggggcggatc
agctggaatc ggcaaagacg aagctggcga gcaaggcgcg agagatgaag 480gatagggcag
agcagttctc gcagcagcct gttggaggcg gtggatccgg aggcggtggt 540agtatggctg
agcattatgg tcaacaacag cagaccaggg cgcctcacct gcagctgcag 600ccgcgcgccc
agcgggtagt gaaggcggcc accgccgtga cagccggcgg ctcgcttctc 660gtcctctctg
gcctcacttt agccggaact gttattgcgc tcaccatcgc cactccgctg 720cttgtgatct
ttagccccgt tctggtgccg gcggtcataa ccattttctt gctgggtgcg 780ggttttctgg
catccggagg cttcggcgtg gcggcgctga gtgtgctgtc gtggatttac 840agatatctga
cagggaaaca cccgccgggg gcggatcagc tggaatcggc aaagacgaag 900ctggcgagca
aggcgcgaga gatgaaggat agggcagagc agttctcgca gcagcctgtt 960gggggcggtg
gatccggtgg agggggatcc atggcgatat cggatccgaa ttcgagctcc 1020gtcgacaagc
ttgcggccgc actcgagcac caccaccacc accactga
1068561929DNAArtificial SequenceSynthetic sequence 56gaaggagata
tacatatgaa agaaaccgct gctgctaaat tcgaacgcca gcacatggac 60agcccagatc
tgggtaccct ggtgccacgc ggttccatgg ctgagcatta tggtcaacaa 120cagcagacca
gggcgcctca cctgcagctg cagccgcgcg cccagcgggt agtgaaggcg 180gccaccgccg
tgacagccgg cggctcgctt ctcgtcctct ctggcctcac tttagccgga 240actgttattg
cgctcaccat cgccactccg ctgcttgtga tctttagccc cgttctggtg 300ccggcggtca
taaccatttt cttgctgggt gcgggttttc tggcatccgg aggcttcggc 360gtggcggcgc
tgagtgtgct gtcgtggatt tacagatatc tgacagggaa acacccgccg 420ggggcggatc
agctggaatc ggcaaagacg aagctggcga gcaaggcgcg agagatgaag 480gatagggcag
agcagttctc gcagcagcct gttccatggc gatatcggat ccgaattcga 540gctccgtcga
caagcttgcg gccgcttgct ccgttaaaaa aaaccatggc tgagcattat 600ggtcaacaac
agcagaccag ggcgcctcac ctgcagctgc agccgcgcgc ccagcgggta 660gtgaaggcgg
ccaccgccgt gacagccggc ggctcgcttc tcgtcctctc tggcctcact 720ttagccggaa
ctgttattgc gctcaccatc gccactccgc tgcttgtgat ctttagcccc 780gttctggtgc
cggcggtcat aaccattttc ttgctgggtg cgggttttct ggcatccgga 840ggcttcggcg
tggcggcgct gagtgtgctg tcgtggattt acagatatct gacagggaaa 900cacccgccgg
gggcggatca gctggaatcg gcaaagacga agctggcgag caaggcgcga 960gagatgaagg
atagggcaga gcagttctcg cagcagcctg ttggaggcgg tggatccgga 1020ggcggtggta
gtatggctga gcattatggt caacaacagc agaccagggc gcctcacctg 1080cagctgcagc
cgcgcgccca gcgggtagtg aaggcggcca ccgccgtgac agccggcggc 1140tcgcttctcg
tcctctctgg cctcacttta gccggaactg ttattgcgct caccatcgcc 1200actccgctgc
ttgtgatctt tagccccgtt ctggtgccgg cggtcataac cattttcttg 1260ctgggtgcgg
gttttctggc atccggaggc ttcggcgtgg cggcgctgag tgtgctgtcg 1320tggatttaca
gatatctgac agggaaacac ccgccggggg cggatcagct ggaatcggca 1380aagacgaagc
tggcgagcaa ggcgcgagag atgaaggata gggcagagca gttctcgcag 1440cagcctgttg
ggggcggtgg atccggtgga gggggatcca tggctgagca ttatggtcaa 1500caacagcaga
ccagggcgcc tcacctgcag ctgcagccgc gcgcccagcg ggtagtgaag 1560gcggccaccg
ccgtgacagc cggcggctcg cttctcgtcc tctctggcct cactttagcc 1620ggaactgtta
ttgcgctcac catcgccact ccgctgcttg tgatctttag ccccgttctg 1680gtgccggcgg
tcataaccat tttcttgctg ggtgcgggtt ttctggcatc cggaggcttc 1740ggcgtggcgg
cgctgagtgt gctgtcgtgg atttacagat atctgacagg gaaacacccg 1800ccgggggcgg
atcagctgga atcggcaaag acgaagctgg cgagcaaggc gcgagagatg 1860aaggataggg
cagagcagtt ctcgcagcag cctgttccat ggctcgagca ccaccaccac 1920caccactga
1929572403DNAArtificial SequenceSynthetic sequence 57gaaggagata
tacatatgaa agaaaccgct gctgctaaat tcgaacgcca gcacatggac 60agcccagatc
tgggtaccct ggtgccacgc ggttccatgg ctgagcatta tggtcaacaa 120cagcagacca
gggcgcctca cctgcagctg cagccgcgcg cccagcgggt agtgaaggcg 180gccaccgccg
tgacagccgg cggctcgctt ctcgtcctct ctggcctcac tttagccgga 240actgttattg
cgctcaccat cgccactccg ctgcttgtga tctttagccc cgttctggtg 300ccggcggtca
taaccatttt cttgctgggt gcgggttttc tggcatccgg aggcttcggc 360gtggcggcgc
tgagtgtgct gtcgtggatt tacagatatc tgacagggaa acacccgccg 420ggggcggatc
agctggaatc ggcaaagacg aagctggcga gcaaggcgcg agagatgaag 480gatagggcag
agcagttctc gcagcagcct gttggaggcg gtggatccgg aggcggtggt 540agtatggctg
agcattatgg tcaacaacag cagaccaggg cgcctcacct gcagctgcag 600ccgcgcgccc
agcgggtagt gaaggcggcc accgccgtga cagccggcgg ctcgcttctc 660gtcctctctg
gcctcacttt agccggaact gttattgcgc tcaccatcgc cactccgctg 720cttgtgatct
ttagccccgt tctggtgccg gcggtcataa ccattttctt gctgggtgcg 780ggttttctgg
catccggagg cttcggcgtg gcggcgctga gtgtgctgtc gtggatttac 840agatatctga
cagggaaaca cccgccgggg gcggatcagc tggaatcggc aaagacgaag 900ctggcgagca
aggcgcgaga gatgaaggat agggcagagc agttctcgca gcagcctgtt 960gggggcggtg
gatccggtgg agggggatcc atgggatatc ggatccgaat tcgagctccg 1020tcgacaagct
tgcggccgct tgctccgtta aaaaaaacca tggctgagca ttatggtcaa 1080caacagcaga
ccagggcgcc tcacctgcag ctgcagccgc gcgcccagcg ggtagtgaag 1140gcggccaccg
ccgtgacagc cggcggctcg cttctcgtcc tctctggcct cactttagcc 1200ggaactgtta
ttgcgctcac catcgccact ccgctgcttg tgatctttag ccccgttctg 1260gtgccggcgg
tcataaccat tttcttgctg ggtgcgggtt ttctggcatc cggaggcttc 1320ggcgtggcgg
cgctgagtgt gctgtcgtgg atttacagat atctgacagg gaaacacccg 1380ccgggggcgg
atcagctgga atcggcaaag acgaagctgg cgagcaaggc gcgagagatg 1440aaggataggg
cagagcagtt ctcgcagcag cctgttggag gcggtggatc cggaggcggt 1500ggtagtatgg
ctgagcatta tggtcaacaa cagcagacca gggcgcctca cctgcagctg 1560cagccgcgcg
cccagcgggt agtgaaggcg gccaccgccg tgacagccgg cggctcgctt 1620ctcgtcctct
ctggcctcac tttagccgga actgttattg cgctcaccat cgccactccg 1680ctgcttgtga
tctttagccc cgttctggtg ccggcggtca taaccatttt cttgctgggt 1740gcgggttttc
tggcatccgg aggcttcggc gtggcggcgc tgagtgtgct gtcgtggatt 1800tacagatatc
tgacagggaa acacccgccg ggggcggatc agctggaatc ggcaaagacg 1860aagctggcga
gcaaggcgcg agagatgaag gatagggcag agcagttctc gcagcagcct 1920gttgggggcg
gtggatccgg tggaggggga tccatggctg agcattatgg tcaacaacag 1980cagaccaggg
cgcctcacct gcagctgcag ccgcgcgccc agcgggtagt gaaggcggcc 2040accgccgtga
cagccggcgg ctcgcttctc gtcctctctg gcctcacttt agccggaact 2100gttattgcgc
tcaccatcgc cactccgctg cttgtgatct ttagccccgt tctggtgccg 2160gcggtcataa
ccattttctt gctgggtgcg ggttttctgg catccggagg cttcggcgtg 2220gcggcgctga
gtgtgctgtc gtggatttac agatatctga cagggaaaca cccgccgggg 2280gcggatcagc
tggaatcggc aaagacgaag ctggcgagca aggcgcgaga gatgaaggat 2340agggcagagc
agttctcgca gcagcctgtt ccatggctcg agcaccacca ccaccaccac 2400tga
240358953PRTArtificial SequenceSynthetic sequence 58Met Lys Glu Thr Ala
Ala Ala Lys Phe Glu Arg Gln His Met Asp Ser1 5
10 15Pro Asp Leu Gly Thr Leu Val Pro Arg Gly Ser
Met Ala Glu His Tyr 20 25
30Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu Gln Leu Gln Pro Arg
35 40 45Ala Gln Arg Val Val Lys Ala Ala
Thr Ala Val Thr Ala Gly Gly Ser 50 55
60Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly Thr Val Ile Ala Leu65
70 75 80Thr Ile Ala Thr Pro
Leu Leu Val Ile Phe Ser Pro Val Leu Val Pro 85
90 95Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly
Phe Leu Ala Ser Gly 100 105
110Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr
115 120 125Leu Thr Gly Lys His Pro Pro
Gly Ala Asp Gln Leu Glu Ser Ala Lys 130 135
140Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala Glu
Gln145 150 155 160Phe Ser
Gln Gln Pro Val Pro Trp Leu Ile Ser Asp Pro Asn Ser Ser
165 170 175Ser Val Asp Lys Leu Ala Ala
Ala Leu Glu Met Ala Glu His Tyr Gly 180 185
190Gln Gln Gln Gln Thr Arg Ala Pro His Leu Gln Leu Gln Pro
Arg Ala 195 200 205Gln Arg Val Val
Lys Ala Ala Thr Ala Val Thr Ala Gly Gly Ser Leu 210
215 220Leu Val Leu Ser Gly Leu Thr Leu Ala Gly Thr Val
Ile Ala Leu Thr225 230 235
240Ile Ala Thr Pro Leu Leu Val Ile Phe Ser Pro Val Leu Val Pro Ala
245 250 255Val Ile Thr Ile Phe
Leu Leu Gly Ala Gly Phe Leu Ala Ser Gly Gly 260
265 270Phe Gly Val Ala Ala Leu Ser Val Leu Ser Trp Ile
Tyr Arg Tyr Leu 275 280 285Thr Gly
Lys His Pro Pro Gly Ala Asp Gln Leu Glu Ser Ala Lys Thr 290
295 300Lys Leu Ala Ser Lys Ala Arg Glu Met Lys Asp
Arg Ala Glu Gln Phe305 310 315
320Ser Gln Gln Pro Val Ala Gly Ser Gln Thr Ser His Met Phe Lys Trp
325 330 335Pro Ser Ala Met
Ala Glu His Tyr Gly Gln Gln Gln Gln Thr Arg Ala 340
345 350Pro His Leu Gln Leu Gln Pro Arg Ala Gln Arg
Val Val Lys Ala Ala 355 360 365Thr
Ala Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr 370
375 380Leu Ala Gly Thr Val Ile Ala Leu Thr Ile
Ala Thr Pro Leu Leu Val385 390 395
400Ile Phe Ser Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu
Leu 405 410 415Gly Ala Gly
Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala Leu Ser 420
425 430Val Leu Ser Trp Ile Tyr Arg Tyr Leu Thr
Gly Lys His Pro Pro Gly 435 440
445Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg 450
455 460Glu Met Lys Asp Arg Ala Glu Gln
Phe Ser Gln Gln Pro Val Ala Gly465 470
475 480Ser Gln Thr Ser Ser Ser Glu Leu Pro Trp Val Asp
Met Ala Glu His 485 490
495Tyr Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu Gln Leu Gln Pro
500 505 510Arg Ala Gln Arg Val Val
Lys Ala Ala Thr Ala Val Thr Ala Gly Gly 515 520
525Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly Thr Val
Ile Ala 530 535 540Leu Thr Ile Ala Thr
Pro Leu Leu Val Ile Phe Ser Pro Val Leu Val545 550
555 560Pro Ala Val Ile Thr Ile Phe Leu Leu Gly
Ala Gly Phe Leu Ala Ser 565 570
575Gly Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser Trp Ile Tyr Arg
580 585 590Tyr Leu Thr Gly Lys
His Pro Pro Gly Ala Asp Gln Leu Glu Ser Ala 595
600 605Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys
Asp Arg Ala Glu 610 615 620Gln Phe Ser
Gln Gln Pro Val Ala Gly Ser Gln Thr Ser Ser Thr Pro625
630 635 640Ser Ser Trp His Met Met Ala
Glu His Tyr Gly Gln Gln Gln Gln Thr 645
650 655Arg Ala Pro His Leu Gln Leu Gln Pro Arg Ala Gln
Arg Val Val Lys 660 665 670Ala
Ala Thr Ala Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly 675
680 685Leu Thr Leu Ala Gly Thr Val Ile Ala
Leu Thr Ile Ala Thr Pro Leu 690 695
700Leu Val Ile Phe Ser Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe705
710 715 720Leu Leu Gly Ala
Gly Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala 725
730 735Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr
Leu Thr Gly Lys His Pro 740 745
750Pro Gly Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys
755 760 765Ala Arg Glu Met Lys Asp Arg
Ala Glu Gln Phe Ser Gln Gln Pro Val 770 775
780Ala Gly Ser Gln Thr Ser Ile Asp Gln Gln Val Asn Val His Met
Ala785 790 795 800Glu His
Tyr Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu Gln Leu
805 810 815Gln Pro Arg Ala Gln Arg Val
Val Lys Ala Ala Thr Ala Val Thr Ala 820 825
830Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly
Thr Val 835 840 845Ile Ala Leu Thr
Ile Ala Thr Pro Leu Leu Val Ile Phe Ser Pro Val 850
855 860Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly
Ala Gly Phe Leu865 870 875
880Ala Ser Gly Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser Trp Ile
885 890 895Tyr Arg Tyr Leu Thr
Gly Lys His Pro Pro Gly Ala Asp Gln Leu Glu 900
905 910Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu
Met Lys Asp Arg 915 920 925Ala Glu
Gln Phe Ser Gln Gln Pro Val Ala Gly Ser Gln Thr Ser Pro 930
935 940Trp Leu Glu His His His His His His945
95059350PRTArtificial SequenceSynthetic sequence 59Met Lys Glu
Thr Ala Ala Ala Lys Phe Glu Arg Gln His Met Asp Ser1 5
10 15Pro Asp Leu Gly Thr Leu Val Pro Arg
Gly Ser Met Ala Glu His Tyr 20 25
30Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu Gln Leu Gln Pro Arg
35 40 45Ala Gln Arg Val Val Lys Ala
Ala Thr Ala Val Thr Ala Gly Gly Ser 50 55
60Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly Thr Val Ile Ala Leu65
70 75 80Thr Ile Ala Thr
Pro Leu Leu Val Ile Phe Ser Pro Val Leu Val Pro 85
90 95Ala Val Ile Thr Ile Phe Leu Leu Gly Ala
Gly Phe Leu Ala Ser Gly 100 105
110Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr
115 120 125Leu Thr Gly Lys His Pro Pro
Gly Ala Asp Gln Leu Glu Ser Ala Lys 130 135
140Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala Glu
Gln145 150 155 160Phe Ser
Gln Gln Pro Val Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
165 170 175Met Ala Glu His Tyr Gly Gln
Gln Gln Gln Thr Arg Ala Pro His Leu 180 185
190Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr
Ala Val 195 200 205Thr Ala Gly Gly
Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly 210
215 220Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu
Val Ile Phe Ser225 230 235
240Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly
245 250 255Phe Leu Ala Ser Gly
Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser 260
265 270Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro
Gly Ala Asp Gln 275 280 285Leu Glu
Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys 290
295 300Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val
Gly Gly Gly Gly Ser305 310 315
320Gly Gly Gly Gly Ser Met Ala Ile Ser Asp Pro Asn Ser Ser Ser Val
325 330 335Asp Lys Leu Ala
Ala Ala Leu Glu His His His His His His 340
345 35060637PRTArtificial SequenceSynthetic sequence
60Met Lys Glu Thr Ala Ala Ala Lys Phe Glu Arg Gln His Met Asp Ser1
5 10 15Pro Asp Leu Gly Thr Leu
Val Pro Arg Gly Ser Met Ala Glu His Tyr 20 25
30Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu Gln Leu
Gln Pro Arg 35 40 45Ala Gln Arg
Val Val Lys Ala Ala Thr Ala Val Thr Ala Gly Gly Ser 50
55 60Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly Thr
Val Ile Ala Leu65 70 75
80Thr Ile Ala Thr Pro Leu Leu Val Ile Phe Ser Pro Val Leu Val Pro
85 90 95Ala Val Ile Thr Ile Phe
Leu Leu Gly Ala Gly Phe Leu Ala Ser Gly 100
105 110Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser Trp
Ile Tyr Arg Tyr 115 120 125Leu Thr
Gly Lys His Pro Pro Gly Ala Asp Gln Leu Glu Ser Ala Lys 130
135 140Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys
Asp Arg Ala Glu Gln145 150 155
160Phe Ser Gln Gln Pro Val Pro Trp Arg Tyr Arg Ile Arg Ile Arg Ala
165 170 175Pro Ser Thr Ser
Leu Arg Pro Leu Ala Pro Leu Lys Lys Thr Met Ala 180
185 190Glu His Tyr Gly Gln Gln Gln Gln Thr Arg Ala
Pro His Leu Gln Leu 195 200 205Gln
Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr Ala Val Thr Ala 210
215 220Gly Gly Ser Leu Leu Val Leu Ser Gly Leu
Thr Leu Ala Gly Thr Val225 230 235
240Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu Val Ile Phe Ser Pro
Val 245 250 255Leu Val Pro
Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu 260
265 270Ala Ser Gly Gly Phe Gly Val Ala Ala Leu
Ser Val Leu Ser Trp Ile 275 280
285Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly Ala Asp Gln Leu Glu 290
295 300Ser Ala Lys Thr Lys Leu Ala Ser
Lys Ala Arg Glu Met Lys Asp Arg305 310
315 320Ala Glu Gln Phe Ser Gln Gln Pro Val Gly Gly Gly
Gly Ser Gly Gly 325 330
335Gly Gly Ser Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr Arg Ala
340 345 350Pro His Leu Gln Leu Gln
Pro Arg Ala Gln Arg Val Val Lys Ala Ala 355 360
365Thr Ala Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly
Leu Thr 370 375 380Leu Ala Gly Thr Val
Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu Val385 390
395 400Ile Phe Ser Pro Val Leu Val Pro Ala Val
Ile Thr Ile Phe Leu Leu 405 410
415Gly Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala Leu Ser
420 425 430Val Leu Ser Trp Ile
Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly 435
440 445Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys Leu Ala
Ser Lys Ala Arg 450 455 460Glu Met Lys
Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val Gly Gly465
470 475 480Gly Gly Ser Gly Gly Gly Gly
Ser Met Ala Glu His Tyr Gly Gln Gln 485
490 495Gln Gln Thr Arg Ala Pro His Leu Gln Leu Gln Pro
Arg Ala Gln Arg 500 505 510Val
Val Lys Ala Ala Thr Ala Val Thr Ala Gly Gly Ser Leu Leu Val 515
520 525Leu Ser Gly Leu Thr Leu Ala Gly Thr
Val Ile Ala Leu Thr Ile Ala 530 535
540Thr Pro Leu Leu Val Ile Phe Ser Pro Val Leu Val Pro Ala Val Ile545
550 555 560Thr Ile Phe Leu
Leu Gly Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly 565
570 575Val Ala Ala Leu Ser Val Leu Ser Trp Ile
Tyr Arg Tyr Leu Thr Gly 580 585
590Lys His Pro Pro Gly Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys Leu
595 600 605Ala Ser Lys Ala Arg Glu Met
Lys Asp Arg Ala Glu Gln Phe Ser Gln 610 615
620Gln Pro Val Pro Trp Leu Glu His His His His His His625
630 63561795PRTArtificial SequenceSynthetic sequence
61Met Lys Glu Thr Ala Ala Ala Lys Phe Glu Arg Gln His Met Asp Ser1
5 10 15Pro Asp Leu Gly Thr Leu
Val Pro Arg Gly Ser Met Ala Glu His Tyr 20 25
30Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu Gln Leu
Gln Pro Arg 35 40 45Ala Gln Arg
Val Val Lys Ala Ala Thr Ala Val Thr Ala Gly Gly Ser 50
55 60Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly Thr
Val Ile Ala Leu65 70 75
80Thr Ile Ala Thr Pro Leu Leu Val Ile Phe Ser Pro Val Leu Val Pro
85 90 95Ala Val Ile Thr Ile Phe
Leu Leu Gly Ala Gly Phe Leu Ala Ser Gly 100
105 110Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser Trp
Ile Tyr Arg Tyr 115 120 125Leu Thr
Gly Lys His Pro Pro Gly Ala Asp Gln Leu Glu Ser Ala Lys 130
135 140Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys
Asp Arg Ala Glu Gln145 150 155
160Phe Ser Gln Gln Pro Val Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
165 170 175Met Ala Glu His
Tyr Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu 180
185 190Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys
Ala Ala Thr Ala Val 195 200 205Thr
Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly 210
215 220Thr Val Ile Ala Leu Thr Ile Ala Thr Pro
Leu Leu Val Ile Phe Ser225 230 235
240Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala
Gly 245 250 255Phe Leu Ala
Ser Gly Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser 260
265 270Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His
Pro Pro Gly Ala Asp Gln 275 280
285Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys 290
295 300Asp Arg Ala Glu Gln Phe Ser Gln
Gln Pro Val Gly Gly Gly Gly Ser305 310
315 320Gly Gly Gly Gly Ser Met Gly Tyr Arg Ile Arg Ile
Arg Ala Pro Ser 325 330
335Thr Ser Leu Arg Pro Leu Ala Pro Leu Lys Lys Thr Met Ala Glu His
340 345 350Tyr Gly Gln Gln Gln Gln
Thr Arg Ala Pro His Leu Gln Leu Gln Pro 355 360
365Arg Ala Gln Arg Val Val Lys Ala Ala Thr Ala Val Thr Ala
Gly Gly 370 375 380Ser Leu Leu Val Leu
Ser Gly Leu Thr Leu Ala Gly Thr Val Ile Ala385 390
395 400Leu Thr Ile Ala Thr Pro Leu Leu Val Ile
Phe Ser Pro Val Leu Val 405 410
415Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala Ser
420 425 430Gly Gly Phe Gly Val
Ala Ala Leu Ser Val Leu Ser Trp Ile Tyr Arg 435
440 445Tyr Leu Thr Gly Lys His Pro Pro Gly Ala Asp Gln
Leu Glu Ser Ala 450 455 460Lys Thr Lys
Leu Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala Glu465
470 475 480Gln Phe Ser Gln Gln Pro Val
Gly Gly Gly Gly Ser Gly Gly Gly Gly 485
490 495Ser Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr
Arg Ala Pro His 500 505 510Leu
Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr Ala 515
520 525Val Thr Ala Gly Gly Ser Leu Leu Val
Leu Ser Gly Leu Thr Leu Ala 530 535
540Gly Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu Val Ile Phe545
550 555 560Ser Pro Val Leu
Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala 565
570 575Gly Phe Leu Ala Ser Gly Gly Phe Gly Val
Ala Ala Leu Ser Val Leu 580 585
590Ser Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly Ala Asp
595 600 605Gln Leu Glu Ser Ala Lys Thr
Lys Leu Ala Ser Lys Ala Arg Glu Met 610 615
620Lys Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val Gly Gly Gly
Gly625 630 635 640Ser Gly
Gly Gly Gly Ser Met Ala Glu His Tyr Gly Gln Gln Gln Gln
645 650 655Thr Arg Ala Pro His Leu Gln
Leu Gln Pro Arg Ala Gln Arg Val Val 660 665
670Lys Ala Ala Thr Ala Val Thr Ala Gly Gly Ser Leu Leu Val
Leu Ser 675 680 685Gly Leu Thr Leu
Ala Gly Thr Val Ile Ala Leu Thr Ile Ala Thr Pro 690
695 700Leu Leu Val Ile Phe Ser Pro Val Leu Val Pro Ala
Val Ile Thr Ile705 710 715
720Phe Leu Leu Gly Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly Val Ala
725 730 735Ala Leu Ser Val Leu
Ser Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His 740
745 750Pro Pro Gly Ala Asp Gln Leu Glu Ser Ala Lys Thr
Lys Leu Ala Ser 755 760 765Lys Ala
Arg Glu Met Lys Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro 770
775 780Val Pro Trp Leu Glu His His His His His
His785 790 79562441PRTArtificial
SequenceSynthetic sequence 62Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr
Arg Ala Pro His Leu1 5 10
15Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr Ala Val
20 25 30Thr Ala Gly Gly Ser Leu Leu
Val Leu Ser Gly Leu Thr Leu Ala Gly 35 40
45Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu Val Ile Phe
Ser 50 55 60Pro Val Leu Val Pro Ala
Val Ile Thr Ile Phe Leu Leu Gly Ala Gly65 70
75 80Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala
Leu Ser Val Leu Ser 85 90
95Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly Ala Asp Gln
100 105 110Leu Glu Ser Ala Lys Thr
Lys Leu Ala Ser Lys Ala Arg Glu Met Lys 115 120
125Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val Gly Gly Gly
Gly Ser 130 135 140Gly Gly Gly Gly Ser
Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr145 150
155 160Arg Ala Pro His Leu Gln Leu Gln Pro Arg
Ala Gln Arg Val Val Lys 165 170
175Ala Ala Thr Ala Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly
180 185 190Leu Thr Leu Ala Gly
Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu 195
200 205Leu Val Ile Phe Ser Pro Val Leu Val Pro Ala Val
Ile Thr Ile Phe 210 215 220Leu Leu Gly
Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala225
230 235 240Leu Ser Val Leu Ser Trp Ile
Tyr Arg Tyr Leu Thr Gly Lys His Pro 245
250 255Pro Gly Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys
Leu Ala Ser Lys 260 265 270Ala
Arg Glu Met Lys Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val 275
280 285Gly Gly Gly Gly Ser Gly Gly Gly Gly
Ser Met Ala Glu His Tyr Gly 290 295
300Gln Gln Gln Gln Thr Arg Ala Pro His Leu Gln Leu Gln Pro Arg Ala305
310 315 320Gln Arg Val Val
Lys Ala Ala Thr Ala Val Thr Ala Gly Gly Ser Leu 325
330 335Leu Val Leu Ser Gly Leu Thr Leu Ala Gly
Thr Val Ile Ala Leu Thr 340 345
350Ile Ala Thr Pro Leu Leu Val Ile Phe Ser Pro Val Leu Val Pro Ala
355 360 365Val Ile Thr Ile Phe Leu Leu
Gly Ala Gly Phe Leu Ala Ser Gly Gly 370 375
380Phe Gly Val Ala Ala Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr
Leu385 390 395 400Thr Gly
Lys His Pro Pro Gly Ala Asp Gln Leu Glu Ser Ala Lys Thr
405 410 415Lys Leu Ala Ser Lys Ala Arg
Glu Met Lys Asp Arg Ala Glu Gln Phe 420 425
430Ser Gln Gln Pro Val Pro Trp Leu Glu 435
44063149PRTArtificial SequenceSynthetic sequence 63Met Ala Glu His
Tyr Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu1 5
10 15Gln Leu Gln Pro Arg Ala Gln Arg Val Val
Lys Ala Ala Thr Ala Val 20 25
30Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly
35 40 45Thr Val Ile Ala Leu Thr Ile Ala
Thr Pro Leu Leu Val Ile Phe Ser 50 55
60Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly65
70 75 80Phe Leu Ala Ser Gly
Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser 85
90 95Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro
Pro Gly Ala Asp Gln 100 105
110Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys
115 120 125Asp Arg Ala Glu Gln Phe Ser
Gln Gln Pro Val Pro Trp Leu Glu His 130 135
140His His His His His145642227DNAArtificial SequenceSynthetic
sequence 64tcgacgaatt aattccaatc ccacaaaaat ctgagcttaa cagcacagtt
gctcctctca 60gagcagaatc gggtattcaa caccctcata tcaactacta cgttgtgtat
aacggtccac 120atgccggtat atacgatgac tggggttgta caaaggcggc aacaaacggc
gttcccggag 180ttgcacacaa gaaatttgcc actattacag aggcaagagc agcagctgac
gcgtacacaa 240caagtcagca aacagacagg ttgaacttca tccccaaagg agaagctcaa
ctcaagccca 300agagctttgc taaggcccta acaagcccac caaagcaaaa agcccactgg
ctcacgctag 360gaaccaaaag gcccagcagt gatccagccc caaaagagat ctcctttgcc
ccggagatta 420caatggacga tttcctctat ctttacgatc taggaaggaa gttcgaaggt
gaaggtgacg 480acactatgtt caccactgat aatgagaagg ttagcctctt caatttcaga
aagaatgctg 540acccacagat ggttagagag gcctacgcag caggtctcat caagacgatc
tacccgagta 600acaatctcca ggagatcaaa taccttccca agaaggttaa agatgcagtc
aaaagattca 660ggactaattg catcaagaac acagagaaag acatatttct caagatcaga
agtactattc 720cagtatggac gattcaaggc ttgcttcata aaccaaggca agtaatagag
attggagtct 780ctaaaaaggt agttcctact gaatctaagg ccatgcatgg agtctaagat
tcaaatcgag 840gatctaacag aactcgccgt gaagactggc gaacagttca tacagagtct
tttacgactc 900aatgacaaga agaaaatctt cgtcaacatg gtggagcacg acactctggt
ctactccaaa 960aatgtcaaag atacagtctc agaagaccaa agggctattg agacttttca
acaaaggata 1020atttcgggaa acctcctcgg attccattgc ccagctatct gtcacttcat
cgaaaggaca 1080gtagaaaagg aaggtggctc ctacaaatgc catcattgcg ataaaggaaa
ggctatcatt 1140caagatctct ctgccgacag tggtcccaaa gatggacccc cacccacgag
gagcatcgtg 1200gaaaaagaag acgttccaac cacgtcttca aagcaagtgg attgatgtga
catctccact 1260gacgtaaggg atgacgcaca atcccactat ccttcgcaag acccttcctc
tatataagga 1320agttcatttc atttggagag gacacgctcg aggaattcgg taccccatca
caagtttgta 1380caaaaaagca ggctccgcgg ccgcttgctc cgttaaaaaa aaccatggca
gagcactacg 1440ggcagcagca acaaacacgt gccccccact tgcaactgca accgcgtgct
cagcgtgttg 1500taaaggcagc gaccgcggtt actgccggag gtagcctgtt ggtgttatcc
gggttgaccc 1560tggctggaac ggtcattgcg ctgacaatcg ccacacctct cctggtgatc
ttctcgcctg 1620tactggtccc tgcaggtaaa tttctgtgtt ccttattctc tcaaaatctt
cgattttgtt 1680ttcgttcgat cccaatttcg tatatgttct ttggtttaga ttctgttaat
cttagatcga 1740agacgatttt ctgggtttga tcgttagata tcatcttaat tctcgattag
ggtttcatag 1800atatcatccg atttgttcaa ataatttgag ttttgtcgaa taattactct
tcgatttgtg 1860atttctatct agatctggtg ttagtttcta gtttgtgcga tcgaatttgt
cgattaatct 1920gagtttttct gatctgcagt catcaccatc ttcctgttgg gagcgggttt
tctggcaagc 1980gggggatttg gtgttgccgc tctgtctgtg ctgtcctgga tctaccgtta
cctgacgggg 2040aaacatccac ccggagcgga tcagttggag tcggccaaga caaagttggc
gagcaaggcc 2100cgtgaaatga aggaccgtgc cgagcagttc agtcagcaac cggtagcagg
gtctcagacc 2160agcatcgatc catcctcctg gctcgagtaa tgaagcggcc gcacccagct
ttcttgtaca 2220aagtggt
2227653133DNAArtificial SequenceSynthetic sequence
65tcgacgaatt aattccaatc ccacaaaaat ctgagcttaa cagcacagtt gctcctctca
60gagcagaatc gggtattcaa caccctcata tcaactacta cgttgtgtat aacggtccac
120atgccggtat atacgatgac tggggttgta caaaggcggc aacaaacggc gttcccggag
180ttgcacacaa gaaatttgcc actattacag aggcaagagc agcagctgac gcgtacacaa
240caagtcagca aacagacagg ttgaacttca tccccaaagg agaagctcaa ctcaagccca
300agagctttgc taaggcccta acaagcccac caaagcaaaa agcccactgg ctcacgctag
360gaaccaaaag gcccagcagt gatccagccc caaaagagat ctcctttgcc ccggagatta
420caatggacga tttcctctat ctttacgatc taggaaggaa gttcgaaggt gaaggtgacg
480acactatgtt caccactgat aatgagaagg ttagcctctt caatttcaga aagaatgctg
540acccacagat ggttagagag gcctacgcag caggtctcat caagacgatc tacccgagta
600acaatctcca ggagatcaaa taccttccca agaaggttaa agatgcagtc aaaagattca
660ggactaattg catcaagaac acagagaaag acatatttct caagatcaga agtactattc
720cagtatggac gattcaaggc ttgcttcata aaccaaggca agtaatagag attggagtct
780ctaaaaaggt agttcctact gaatctaagg ccatgcatgg agtctaagat tcaaatcgag
840gatctaacag aactcgccgt gaagactggc gaacagttca tacagagtct tttacgactc
900aatgacaaga agaaaatctt cgtcaacatg gtggagcacg acactctggt ctactccaaa
960aatgtcaaag atacagtctc agaagaccaa agggctattg agacttttca acaaaggata
1020atttcgggaa acctcctcgg attccattgc ccagctatct gtcacttcat cgaaaggaca
1080gtagaaaagg aaggtggctc ctacaaatgc catcattgcg ataaaggaaa ggctatcatt
1140caagatctct ctgccgacag tggtcccaaa gatggacccc cacccacgag gagcatcgtg
1200gaaaaagaag acgttccaac cacgtcttca aagcaagtgg attgatgtga catctccact
1260gacgtaaggg atgacgcaca atcccactat ccttcgcaag acccttcctc tatataagga
1320agttcatttc atttggagag gacacgctcg aggaattcgg taccccatca caagtttgta
1380caaaaaagca ggctccgcgg ccgcttgctc cgttaaaaaa aaccatggca gagcactacg
1440ggcagcagca acaaacacgt gccccccact tgcaactgca accgcgtgct cagcgtgttg
1500taaaggcagc gaccgcggtt actgccggag gtagcctgtt ggtgttatcc gggttgaccc
1560tggctggaac ggtcattgcg ctgacaatcg ccacacctct cctggtgatc ttctcgcctg
1620tactggtccc tgcaggtaaa tttctgtgtt ccttattctc tcaaaatctt cgattttgtt
1680ttcgttcgat cccaatttcg tatatgttct ttggtttaga ttctgttaat cttagatcga
1740agacgatttt ctgggtttga tcgttagata tcatcttaat tctcgattag ggtttcatag
1800atatcatccg atttgttcaa ataatttgag ttttgtcgaa taattactct tcgatttgtg
1860atttctatct agatctggtg ttagtttcta gtttgtgcga tcgaatttgt cgattaatct
1920gagtttttct gatctgcagt catcaccatc ttcctgttgg gagcgggttt tctggcaagc
1980gggggatttg gtgttgccgc tctgtctgtg ctgtcctgga tctaccgtta cctgacgggg
2040aaacatccac ccggagcgga tcagttggag tcggccaaga caaagttggc gagcaaggcc
2100cgtgaaatga aggaccgtgc cgagcagttc agtcagcaac cggtagcagg gtctcagacc
2160agcatcgatc catcctcctg gcatatgatg gccgagcact atggacaaca gcagcagacg
2220cgtgcccctc atctgcaact gcaaccccgt gctcaacgtg tcgttaaggc tgcgacagca
2280gtaaccgctg ggggttctct gttagtgttg tcagggctga ctttggcggg gacggtaatt
2340gcgttgacca ttgccacccc gctgttagtg attttcagcc cggtactggt gccagcagtt
2400atcacgatct tcttgctggg tgccggattc ttggcaagtg gaggttttgg agttgcggcg
2460ctgtcagttt tatcctggat ctatcgttat ctgacaggaa aacatccccc aggtgccgat
2520cagctggaga gtgccaagac aaaactggcg tctaaggcac gtgaaatgaa ggatcgtgcc
2580gaacagtttt ctcaacagcc cgtagcgggg tcacagacct cgatcgatca gcaggttaac
2640gtgcacatgg ccgaacatta cggacagcaa caacagacgc gtgctccaca cctgcaattg
2700caaccgcgtg ctcaacgtgt tgtcaaagcg gcgaccgccg taacagcagg aggatcactg
2760ttagtgctgt cgggtttaac cttggccggg accgtcattg cattgactat tgcgacgccc
2820ttactggtga tcttttctcc ggtgctggtt cccgccgtta ttaccatctt cttgttaggg
2880gcaggattcc tggcatcagg gggattcgga gttgcggcgt tgagtgtctt aagttggatc
2940taccgttatc tgactggaaa gcacccgcct ggggccgatc aactggagtc agccaaaacg
3000aaattggcgt caaaagcgcg tgaaatgaag gaccgtgctg agcagttttc tcagcagcct
3060gtggcaggat cccagacatc accatggctc gagtaatgaa gcggccgcac ccagctttct
3120tgtacaaagt ggt
3133663610DNAArtificial SequenceSynthetic sequence 66tcgacgaatt
aattccaatc ccacaaaaat ctgagcttaa cagcacagtt gctcctctca 60gagcagaatc
gggtattcaa caccctcata tcaactacta cgttgtgtat aacggtccac 120atgccggtat
atacgatgac tggggttgta caaaggcggc aacaaacggc gttcccggag 180ttgcacacaa
gaaatttgcc actattacag aggcaagagc agcagctgac gcgtacacaa 240caagtcagca
aacagacagg ttgaacttca tccccaaagg agaagctcaa ctcaagccca 300agagctttgc
taaggcccta acaagcccac caaagcaaaa agcccactgg ctcacgctag 360gaaccaaaag
gcccagcagt gatccagccc caaaagagat ctcctttgcc ccggagatta 420caatggacga
tttcctctat ctttacgatc taggaaggaa gttcgaaggt gaaggtgacg 480acactatgtt
caccactgat aatgagaagg ttagcctctt caatttcaga aagaatgctg 540acccacagat
ggttagagag gcctacgcag caggtctcat caagacgatc tacccgagta 600acaatctcca
ggagatcaaa taccttccca agaaggttaa agatgcagtc aaaagattca 660ggactaattg
catcaagaac acagagaaag acatatttct caagatcaga agtactattc 720cagtatggac
gattcaaggc ttgcttcata aaccaaggca agtaatagag attggagtct 780ctaaaaaggt
agttcctact gaatctaagg ccatgcatgg agtctaagat tcaaatcgag 840gatctaacag
aactcgccgt gaagactggc gaacagttca tacagagtct tttacgactc 900aatgacaaga
agaaaatctt cgtcaacatg gtggagcacg acactctggt ctactccaaa 960aatgtcaaag
atacagtctc agaagaccaa agggctattg agacttttca acaaaggata 1020atttcgggaa
acctcctcgg attccattgc ccagctatct gtcacttcat cgaaaggaca 1080gtagaaaagg
aaggtggctc ctacaaatgc catcattgcg ataaaggaaa ggctatcatt 1140caagatctct
ctgccgacag tggtcccaaa gatggacccc cacccacgag gagcatcgtg 1200gaaaaagaag
acgttccaac cacgtcttca aagcaagtgg attgatgtga catctccact 1260gacgtaaggg
atgacgcaca atcccactat ccttcgcaag acccttcctc tatataagga 1320agttcatttc
atttggagag gacacgctcg aggaattcgg taccccatca caagtttgta 1380caaaaaagca
ggctccgcgg ccgcttgctc cgttaaaaaa aaccatggca gagcactacg 1440ggcagcagca
acaaacacgt gccccccact tgcaactgca accgcgtgct cagcgtgttg 1500taaaggcagc
gaccgcggtt actgccggag gtagcctgtt ggtgttatcc gggttgaccc 1560tggctggaac
ggtcattgcg ctgacaatcg ccacacctct cctggtgatc ttctcgcctg 1620tactggtccc
tgcaggtaaa tttctgtgtt ccttattctc tcaaaatctt cgattttgtt 1680ttcgttcgat
cccaatttcg tatatgttct ttggtttaga ttctgttaat cttagatcga 1740agacgatttt
ctgggtttga tcgttagata tcatcttaat tctcgattag ggtttcatag 1800atatcatccg
atttgttcaa ataatttgag ttttgtcgaa taattactct tcgatttgtg 1860atttctatct
agatctggtg ttagtttcta gtttgtgcga tcgaatttgt cgattaatct 1920gagtttttct
gatctgcagt catcaccatc ttcctgttgg gagcgggttt tctggcaagc 1980gggggatttg
gtgttgccgc tctgtctgtg ctgtcctgga tctaccgtta cctgacgggg 2040aaacatccac
ccggagcgga tcagttggag tcggccaaga caaagttggc gagcaaggcc 2100cgtgaaatga
aggaccgtgc cgagcagttc agtcagcaac cggtagcagg gtctcagacc 2160agcatcgatc
catcctcctg gctcgagatg gcggaacact acgggcaaca acagcagact 2220cgtgctcccc
acctgcaatt acaaccccgt gcccaacgtg ttgtgaaagc ggcaacagca 2280gtaacggcag
ggggaagttt gctggtctta tcggggttga ccttagcggg aaccgtgatt 2340gccctgacaa
ttgcgactcc gctgctggtt atcttcagcc ccgtattggt tccggccgtg 2400atcacgattt
ttttgctggg ggcaggattt ttagccagcg gaggatttgg ggtcgcagcg 2460ttgtctgtgc
tgagttggat ctatcgttat ttgaccggga agcacccacc tggagcagac 2520cagctggaga
gcgcgaaaac gaagctggca tcgaaggcgc gtgaaatgaa ggatcgtgct 2580gaacaattct
cccagcagcc tgttgccggt tctcagacca gccatatgat ggccgagcac 2640tatggacaac
agcagcagac gcgtgcccct catctgcaac tgcaaccccg tgctcaacgt 2700gtcgttaagg
ctgcgacagc agtaaccgct gggggttctc tgttagtgtt gtcagggctg 2760actttggcgg
ggacggtaat tgcgttgacc attgccaccc cgctgttagt gattttcagc 2820ccggtactgg
tgccagcagt tatcacgatc ttcttgctgg gtgccggatt cttggcaagt 2880ggaggttttg
gagttgcggc gctgtcagtt ttatcctgga tctatcgtta tctgacagga 2940aaacatcccc
caggtgccga tcagctggag agtgccaaga caaaactggc gtctaaggca 3000cgtgaaatga
aggatcgtgc cgaacagttt tctcaacagc ccgtagcggg gtcacagacc 3060tcgatcgatc
agcaggttaa cgtgcacatg gccgaacatt acggacagca acaacagacg 3120cgtgctccac
acctgcaatt gcaaccgcgt gctcaacgtg ttgtcaaagc ggcgaccgcc 3180gtaacagcag
gaggatcact gttagtgctg tcgggtttaa ccttggccgg gaccgtcatt 3240gcattgacta
ttgcgacgcc cttactggtg atcttttctc cggtgctggt tcccgccgtt 3300attaccatct
tcttgttagg ggcaggattc ctggcatcag ggggattcgg agttgcggcg 3360ttgagtgtct
taagttggat ctaccgttat ctgactggaa agcacccgcc tggggccgat 3420caactggagt
cagccaaaac gaaattggcg tcaaaagcgc gtgaaatgaa ggaccgtgct 3480gagcagtttt
ctcagcagcc tgtggcagga tcccagacat caccatggct cgagtaatga 3540agcggccgca
cccagctttc ttgtacaaag tggtgatggg ttcgaaatcg ataagcttgg 3600atcctctaga
3610674510DNAArtificial SequenceSynthetic sequence 67tcgacgaatt
aattccaatc ccacaaaaat ctgagcttaa cagcacagtt gctcctctca 60gagcagaatc
gggtattcaa caccctcata tcaactacta cgttgtgtat aacggtccac 120atgccggtat
atacgatgac tggggttgta caaaggcggc aacaaacggc gttcccggag 180ttgcacacaa
gaaatttgcc actattacag aggcaagagc agcagctgac gcgtacacaa 240caagtcagca
aacagacagg ttgaacttca tccccaaagg agaagctcaa ctcaagccca 300agagctttgc
taaggcccta acaagcccac caaagcaaaa agcccactgg ctcacgctag 360gaaccaaaag
gcccagcagt gatccagccc caaaagagat ctcctttgcc ccggagatta 420caatggacga
tttcctctat ctttacgatc taggaaggaa gttcgaaggt gaaggtgacg 480acactatgtt
caccactgat aatgagaagg ttagcctctt caatttcaga aagaatgctg 540acccacagat
ggttagagag gcctacgcag caggtctcat caagacgatc tacccgagta 600acaatctcca
ggagatcaaa taccttccca agaaggttaa agatgcagtc aaaagattca 660ggactaattg
catcaagaac acagagaaag acatatttct caagatcaga agtactattc 720cagtatggac
gattcaaggc ttgcttcata aaccaaggca agtaatagag attggagtct 780ctaaaaaggt
agttcctact gaatctaagg ccatgcatgg agtctaagat tcaaatcgag 840gatctaacag
aactcgccgt gaagactggc gaacagttca tacagagtct tttacgactc 900aatgacaaga
agaaaatctt cgtcaacatg gtggagcacg acactctggt ctactccaaa 960aatgtcaaag
atacagtctc agaagaccaa agggctattg agacttttca acaaaggata 1020atttcgggaa
acctcctcgg attccattgc ccagctatct gtcacttcat cgaaaggaca 1080gtagaaaagg
aaggtggctc ctacaaatgc catcattgcg ataaaggaaa ggctatcatt 1140caagatctct
ctgccgacag tggtcccaaa gatggacccc cacccacgag gagcatcgtg 1200gaaaaagaag
acgttccaac cacgtcttca aagcaagtgg attgatgtga catctccact 1260gacgtaaggg
atgacgcaca atcccactat ccttcgcaag acccttcctc tatataagga 1320agttcatttc
atttggagag gacacgctcg aggaattcgg taccccatca caagtttgta 1380caaaaaagca
ggctccgcgg ccgcttgctc cgttaaaaaa aaccatggca gagcactacg 1440ggcagcagca
acaaacacgt gccccccact tgcaactgca accgcgtgct cagcgtgttg 1500taaaggcagc
gaccgcggtt actgccggag gtagcctgtt ggtgttatcc gggttgaccc 1560tggctggaac
ggtcattgcg ctgacaatcg ccacacctct cctggtgatc ttctcgcctg 1620tactggtccc
tgcaggtaaa tttctgtgtt ccttattctc tcaaaatctt cgattttgtt 1680ttcgttcgat
cccaatttcg tatatgttct ttggtttaga ttctgttaat cttagatcga 1740agacgatttt
ctgggtttga tcgttagata tcatcttaat tctcgattag ggtttcatag 1800atatcatccg
atttgttcaa ataatttgag ttttgtcgaa taattactct tcgatttgtg 1860atttctatct
agatctggtg ttagtttcta gtttgtgcga tcgaatttgt cgattaatct 1920gagtttttct
gatctgcagt catcaccatc ttcctgttgg gagcgggttt tctggcaagc 1980gggggatttg
gtgttgccgc tctgtctgtg ctgtcctgga tctaccgtta cctgacgggg 2040aaacatccac
ccggagcgga tcagttggag tcggccaaga caaagttggc gagcaaggcc 2100cgtgaaatga
aggaccgtgc cgagcagttc agtcagcaac cggtagcagg gtctcagacc 2160agcatcgatc
catcctcctg gctcgagatg gcggaacact acgggcaaca acagcagact 2220cgtgctcccc
acctgcaatt acaaccccgt gcccaacgtg ttgtgaaagc ggcaacagca 2280gtaacggcag
ggggaagttt gctggtctta tcggggttga ccttagcggg aaccgtgatt 2340gccctgacaa
ttgcgactcc gctgctggtt atcttcagcc ccgtattggt tccggccgtg 2400atcacgattt
ttttgctggg ggcaggattt ttagccagcg gaggatttgg ggtcgcagcg 2460ttgtctgtgc
tgagttggat ctatcgttat ttgaccggga agcacccacc tggagcagac 2520cagctggaga
gcgcgaaaac gaagctggca tcgaaggcgc gtgaaatgaa ggatcgtgct 2580gaacaattct
cccagcagcc tgttgccggt tctcagacca gccatatgtt taaatggcca 2640agcgctatgg
ccgagcatta tgggcagcaa cagcaaaccc gtgccccgca tctgcaattg 2700caacctcgtg
cccagcgtgt cgttaaggcg gctactgcgg taacagcggg agggagctta 2760ctggtattaa
gcgggctgac attggccgga acggtgatcg ccttaacaat cgcgacaccc 2820ttgctggtca
tcttcagtcc ggttctggtg cccgcggtga ttacgatttt cctgctggga 2880gccggtttct
tagcatcggg gggttttggg gtagcagcct tgagtgtcct gtcgtggatc 2940tatcgttact
taactggaaa acacccgcca ggagctgacc agttggagtc tgcaaaaact 3000aagctggcgt
ccaaagcccg tgaaatgaag gatcgtgctg agcagtttag ccagcagcca 3060gttgcgggaa
gtcagacctc ttcatctgag ctcccatggg tcgacatggc ggagcattac 3120ggtcaacagc
aacagacccg tgctccgcac ttacaattgc aaccacgtgc tcaacgtgtc 3180gtaaaagccg
ccacggcagt tactgcgggg ggatcattgc tggtgttaag tgggttgaca 3240ctggcgggga
cagttattgc actgacgatc gcgaccccct tgttagtgat cttctccccc 3300gttctggttc
cggcggtcat tacaatcttt ctgttgggtg ccggattttt agcctctggg 3360ggatttggag
tagctgccct gtcagtgttg agctggatct accgttactt aacagggaag 3420caccctcccg
gggcagatca gttggaaagc gccaagacca agctggcaag taaagcgcgt 3480gaaatgaagg
accgtgccga acaattttcg cagcaaccgg ttgcgggatc acagacctct 3540agtactccat
cctcctggca tatgatggcc gagcactatg gacaacagca gcagacgcgt 3600gcccctcatc
tgcaactgca accccgtgct caacgtgtcg ttaaggctgc gacagcagta 3660accgctgggg
gttctctgtt agtgttgtca gggctgactt tggcggggac ggtaattgcg 3720ttgaccattg
ccaccccgct gttagtgatt ttcagcccgg tactggtgcc agcagttatc 3780acgatcttct
tgctgggtgc cggattcttg gcaagtggag gttttggagt tgcggcgctg 3840tcagttttat
cctggatcta tcgttatctg acaggaaaac atcccccagg tgccgatcag 3900ctggagagtg
ccaagacaaa actggcgtct aaggcacgtg aaatgaagga tcgtgccgaa 3960cagttttctc
aacagcccgt agcggggtca cagacctcga tcgatcagca ggttaacgtg 4020cacatggccg
aacattacgg acagcaacaa cagacgcgtg ctccacacct gcaattgcaa 4080ccgcgtgctc
aacgtgttgt caaagcggcg accgccgtaa cagcaggagg atcactgtta 4140gtgctgtcgg
gtttaacctt ggccgggacc gtcattgcat tgactattgc gacgccctta 4200ctggtgatct
tttctccggt gctggttccc gccgttatta ccatcttctt gttaggggca 4260ggattcctgg
catcaggggg attcggagtt gcggcgttga gtgtcttaag ttggatctac 4320cgttatctga
ctggaaagca cccgcctggg gccgatcaac tggagtcagc caaaacgaaa 4380ttggcgtcaa
aagcgcgtga aatgaaggac cgtgctgagc agttttctca gcagcctgtg 4440gcaggatccc
agacatcacc atggctcgag taatgaagcg gccgcaccca gctttcttgt 4500acaaagtggt
4510681706DNAArtificial SequenceSynthetic sequence 68gcaggaactc
tctggtaagc tagctccact ccccagaaac aaccggcgcc aaattgcgcg 60aattgctgac
ctgaagacgg aacatcatcg tcgggtcctt gggcgattgc ggcggaagat 120gggtcagctt
gggcttgagg acgagacccg aatccgagtc tgttgaaaag gttgttcatt 180ggggatttgt
atacggagat tggtcgtcga gaggtttgag ggaaaggaca aatgggtttg 240gctctggaga
aagagagtgc ggctttagag agagaattga gaggtttaga gagagatgcg 300gcggcgatga
gcggaggaga gacgacgagg acctgcatta tcaaagcagt gacgtggtga 360aatttggaac
ttttaagagg cagatagatt tattatttgt atccattttc ttcattgttc 420tagaatgtcg
cggaacaaat tttaaaacta aatcctaaat ttttctaatt ttgttgccaa 480tagtggatat
gtgggccgta tagaaggaat ctattgaagg cccaaaccca tactgacgag 540cccaaaggtt
cgttttgcgt tttatgtttc ggttcgatgc caacgccaca ttctgagcta 600ggcaaaaaac
aaacgtgtct ttgaatagac tcctctcgtt aacacatgca gcggctgcat 660ggtgacgcca
ttaacacgtg gcctacaatt gcatgatgtc tccattgaca cgtgacttct 720cgtctccttt
cttaatatat ctaacaaaca ctcctacctc ttccaaaata tatacacatc 780tttttgatca
atctctcatt caaaatctca ttctctctag taaacaagaa caaaaaaggt 840accccatcac
aagtttgtac aaaaaagcag gctccgcggc cgcttgctcc gttaaaaaaa 900accatggcag
agcactacgg gcagcagcaa caaacacgtg ccccccactt gcaactgcaa 960ccgcgtgctc
agcgtgttgt aaaggcagcg accgcggtta ctgccggagg tagcctgttg 1020gtgttatccg
ggttgaccct ggctggaacg gtcattgcgc tgacaatcgc cacacctctc 1080ctggtgatct
tctcgcctgt actggtccct gcaggtaaat ttctgtgttc cttattctct 1140caaaatcttc
gattttgttt tcgttcgatc ccaatttcgt atatgttctt tggtttagat 1200tctgttaatc
ttagatcgaa gacgattttc tgggtttgat cgttagatat catcttaatt 1260ctcgattagg
gtttcataga tatcatccga tttgttcaaa taatttgagt tttgtcgaat 1320aattactctt
cgatttgtga tttctatcta gatctggtgt tagtttctag tttgtgcgat 1380cgaatttgtc
gattaatctg agtttttctg atctgcagtc atcaccatct tcctgttggg 1440agcgggtttt
ctggcaagcg ggggatttgg tgttgccgct ctgtctgtgc tgtcctggat 1500ctaccgttac
ctgacgggga aacatccacc cggagcggat cagttggagt cggccaagac 1560aaagttggcg
agcaaggccc gtgaaatgaa ggaccgtgcc gagcagttca gtcagcaacc 1620ggtagcaggg
tctcagacca gcatcgatcc atcctcctgg ctcgagtaat gaagcggccg 1680cacccagctt
tcttgtacaa agtggt
1706692612DNAArtificial SequenceSynthetic sequence 69gcaggaactc
tctggtaagc tagctccact ccccagaaac aaccggcgcc aaattgcgcg 60aattgctgac
ctgaagacgg aacatcatcg tcgggtcctt gggcgattgc ggcggaagat 120gggtcagctt
gggcttgagg acgagacccg aatccgagtc tgttgaaaag gttgttcatt 180ggggatttgt
atacggagat tggtcgtcga gaggtttgag ggaaaggaca aatgggtttg 240gctctggaga
aagagagtgc ggctttagag agagaattga gaggtttaga gagagatgcg 300gcggcgatga
gcggaggaga gacgacgagg acctgcatta tcaaagcagt gacgtggtga 360aatttggaac
ttttaagagg cagatagatt tattatttgt atccattttc ttcattgttc 420tagaatgtcg
cggaacaaat tttaaaacta aatcctaaat ttttctaatt ttgttgccaa 480tagtggatat
gtgggccgta tagaaggaat ctattgaagg cccaaaccca tactgacgag 540cccaaaggtt
cgttttgcgt tttatgtttc ggttcgatgc caacgccaca ttctgagcta 600ggcaaaaaac
aaacgtgtct ttgaatagac tcctctcgtt aacacatgca gcggctgcat 660ggtgacgcca
ttaacacgtg gcctacaatt gcatgatgtc tccattgaca cgtgacttct 720cgtctccttt
cttaatatat ctaacaaaca ctcctacctc ttccaaaata tatacacatc 780tttttgatca
atctctcatt caaaatctca ttctctctag taaacaagaa caaaaaaggt 840accccatcac
aagtttgtac aaaaaagcag gctccgcggc cgcttgctcc gttaaaaaaa 900accatggcag
agcactacgg gcagcagcaa caaacacgtg ccccccactt gcaactgcaa 960ccgcgtgctc
agcgtgttgt aaaggcagcg accgcggtta ctgccggagg tagcctgttg 1020gtgttatccg
ggttgaccct ggctggaacg gtcattgcgc tgacaatcgc cacacctctc 1080ctggtgatct
tctcgcctgt actggtccct gcaggtaaat ttctgtgttc cttattctct 1140caaaatcttc
gattttgttt tcgttcgatc ccaatttcgt atatgttctt tggtttagat 1200tctgttaatc
ttagatcgaa gacgattttc tgggtttgat cgttagatat catcttaatt 1260ctcgattagg
gtttcataga tatcatccga tttgttcaaa taatttgagt tttgtcgaat 1320aattactctt
cgatttgtga tttctatcta gatctggtgt tagtttctag tttgtgcgat 1380cgaatttgtc
gattaatctg agtttttctg atctgcagtc atcaccatct tcctgttggg 1440agcgggtttt
ctggcaagcg ggggatttgg tgttgccgct ctgtctgtgc tgtcctggat 1500ctaccgttac
ctgacgggga aacatccacc cggagcggat cagttggagt cggccaagac 1560aaagttggcg
agcaaggccc gtgaaatgaa ggaccgtgcc gagcagttca gtcagcaacc 1620ggtagcaggg
tctcagacca gcatcgatcc atcctcctgg catatgatgg ccgagcacta 1680tggacaacag
cagcagacgc gtgcccctca tctgcaactg caaccccgtg ctcaacgtgt 1740cgttaaggct
gcgacagcag taaccgctgg gggttctctg ttagtgttgt cagggctgac 1800tttggcgggg
acggtaattg cgttgaccat tgccaccccg ctgttagtga ttttcagccc 1860ggtactggtg
ccagcagtta tcacgatctt cttgctgggt gccggattct tggcaagtgg 1920aggttttgga
gttgcggcgc tgtcagtttt atcctggatc tatcgttatc tgacaggaaa 1980acatccccca
ggtgccgatc agctggagag tgccaagaca aaactggcgt ctaaggcacg 2040tgaaatgaag
gatcgtgccg aacagttttc tcaacagccc gtagcggggt cacagacctc 2100gatcgatcag
caggttaacg tgcacatggc cgaacattac ggacagcaac aacagacgcg 2160tgctccacac
ctgcaattgc aaccgcgtgc tcaacgtgtt gtcaaagcgg cgaccgccgt 2220aacagcagga
ggatcactgt tagtgctgtc gggtttaacc ttggccggga ccgtcattgc 2280attgactatt
gcgacgccct tactggtgat cttttctccg gtgctggttc ccgccgttat 2340taccatcttc
ttgttagggg caggattcct ggcatcaggg ggattcggag ttgcggcgtt 2400gagtgtctta
agttggatct accgttatct gactggaaag cacccgcctg gggccgatca 2460actggagtca
gccaaaacga aattggcgtc aaaagcgcgt gaaatgaagg accgtgctga 2520gcagttttct
cagcagcctg tggcaggatc ccagacatca ccatggctcg agtaatgaag 2580cggccgcacc
cagctttctt gtacaaagtg gt
2612703089DNAArtificial SequenceSynthetic sequence 70gcaggaactc
tctggtaagc tagctccact ccccagaaac aaccggcgcc aaattgcgcg 60aattgctgac
ctgaagacgg aacatcatcg tcgggtcctt gggcgattgc ggcggaagat 120gggtcagctt
gggcttgagg acgagacccg aatccgagtc tgttgaaaag gttgttcatt 180ggggatttgt
atacggagat tggtcgtcga gaggtttgag ggaaaggaca aatgggtttg 240gctctggaga
aagagagtgc ggctttagag agagaattga gaggtttaga gagagatgcg 300gcggcgatga
gcggaggaga gacgacgagg acctgcatta tcaaagcagt gacgtggtga 360aatttggaac
ttttaagagg cagatagatt tattatttgt atccattttc ttcattgttc 420tagaatgtcg
cggaacaaat tttaaaacta aatcctaaat ttttctaatt ttgttgccaa 480tagtggatat
gtgggccgta tagaaggaat ctattgaagg cccaaaccca tactgacgag 540cccaaaggtt
cgttttgcgt tttatgtttc ggttcgatgc caacgccaca ttctgagcta 600ggcaaaaaac
aaacgtgtct ttgaatagac tcctctcgtt aacacatgca gcggctgcat 660ggtgacgcca
ttaacacgtg gcctacaatt gcatgatgtc tccattgaca cgtgacttct 720cgtctccttt
cttaatatat ctaacaaaca ctcctacctc ttccaaaata tatacacatc 780tttttgatca
atctctcatt caaaatctca ttctctctag taaacaagaa caaaaaaggt 840accccatcac
aagtttgtac aaaaaagcag gctccgcggc cgcttgctcc gttaaaaaaa 900accatggcag
agcactacgg gcagcagcaa caaacacgtg ccccccactt gcaactgcaa 960ccgcgtgctc
agcgtgttgt aaaggcagcg accgcggtta ctgccggagg tagcctgttg 1020gtgttatccg
ggttgaccct ggctggaacg gtcattgcgc tgacaatcgc cacacctctc 1080ctggtgatct
tctcgcctgt actggtccct gcaggtaaat ttctgtgttc cttattctct 1140caaaatcttc
gattttgttt tcgttcgatc ccaatttcgt atatgttctt tggtttagat 1200tctgttaatc
ttagatcgaa gacgattttc tgggtttgat cgttagatat catcttaatt 1260ctcgattagg
gtttcataga tatcatccga tttgttcaaa taatttgagt tttgtcgaat 1320aattactctt
cgatttgtga tttctatcta gatctggtgt tagtttctag tttgtgcgat 1380cgaatttgtc
gattaatctg agtttttctg atctgcagtc atcaccatct tcctgttggg 1440agcgggtttt
ctggcaagcg ggggatttgg tgttgccgct ctgtctgtgc tgtcctggat 1500ctaccgttac
ctgacgggga aacatccacc cggagcggat cagttggagt cggccaagac 1560aaagttggcg
agcaaggccc gtgaaatgaa ggaccgtgcc gagcagttca gtcagcaacc 1620ggtagcaggg
tctcagacca gcatcgatcc atcctcctgg ctcgagatgg cggaacacta 1680cgggcaacaa
cagcagactc gtgctcccca cctgcaatta caaccccgtg cccaacgtgt 1740tgtgaaagcg
gcaacagcag taacggcagg gggaagtttg ctggtcttat cggggttgac 1800cttagcggga
accgtgattg ccctgacaat tgcgactccg ctgctggtta tcttcagccc 1860cgtattggtt
ccggccgtga tcacgatttt tttgctgggg gcaggatttt tagccagcgg 1920aggatttggg
gtcgcagcgt tgtctgtgct gagttggatc tatcgttatt tgaccgggaa 1980gcacccacct
ggagcagacc agctggagag cgcgaaaacg aagctggcat cgaaggcgcg 2040tgaaatgaag
gatcgtgctg aacaattctc ccagcagcct gttgccggtt ctcagaccag 2100ccatatgatg
gccgagcact atggacaaca gcagcagacg cgtgcccctc atctgcaact 2160gcaaccccgt
gctcaacgtg tcgttaaggc tgcgacagca gtaaccgctg ggggttctct 2220gttagtgttg
tcagggctga ctttggcggg gacggtaatt gcgttgacca ttgccacccc 2280gctgttagtg
attttcagcc cggtactggt gccagcagtt atcacgatct tcttgctggg 2340tgccggattc
ttggcaagtg gaggttttgg agttgcggcg ctgtcagttt tatcctggat 2400ctatcgttat
ctgacaggaa aacatccccc aggtgccgat cagctggaga gtgccaagac 2460aaaactggcg
tctaaggcac gtgaaatgaa ggatcgtgcc gaacagtttt ctcaacagcc 2520cgtagcgggg
tcacagacct cgatcgatca gcaggttaac gtgcacatgg ccgaacatta 2580cggacagcaa
caacagacgc gtgctccaca cctgcaattg caaccgcgtg ctcaacgtgt 2640tgtcaaagcg
gcgaccgccg taacagcagg aggatcactg ttagtgctgt cgggtttaac 2700cttggccggg
accgtcattg cattgactat tgcgacgccc ttactggtga tcttttctcc 2760ggtgctggtt
cccgccgtta ttaccatctt cttgttaggg gcaggattcc tggcatcagg 2820gggattcgga
gttgcggcgt tgagtgtctt aagttggatc taccgttatc tgactggaaa 2880gcacccgcct
ggggccgatc aactggagtc agccaaaacg aaattggcgt caaaagcgcg 2940tgaaatgaag
gaccgtgctg agcagttttc tcagcagcct gtggcaggat cccagacatc 3000accatggctc
gagtaatgaa gcggccgcac ccagctttct tgtacaaagt ggtgatgggt 3060tcgaaatcga
taagcttgga tcctctaga
3089713989DNAArtificial SequenceSynthetic sequence 71gcaggaactc
tctggtaagc tagctccact ccccagaaac aaccggcgcc aaattgcgcg 60aattgctgac
ctgaagacgg aacatcatcg tcgggtcctt gggcgattgc ggcggaagat 120gggtcagctt
gggcttgagg acgagacccg aatccgagtc tgttgaaaag gttgttcatt 180ggggatttgt
atacggagat tggtcgtcga gaggtttgag ggaaaggaca aatgggtttg 240gctctggaga
aagagagtgc ggctttagag agagaattga gaggtttaga gagagatgcg 300gcggcgatga
gcggaggaga gacgacgagg acctgcatta tcaaagcagt gacgtggtga 360aatttggaac
ttttaagagg cagatagatt tattatttgt atccattttc ttcattgttc 420tagaatgtcg
cggaacaaat tttaaaacta aatcctaaat ttttctaatt ttgttgccaa 480tagtggatat
gtgggccgta tagaaggaat ctattgaagg cccaaaccca tactgacgag 540cccaaaggtt
cgttttgcgt tttatgtttc ggttcgatgc caacgccaca ttctgagcta 600ggcaaaaaac
aaacgtgtct ttgaatagac tcctctcgtt aacacatgca gcggctgcat 660ggtgacgcca
ttaacacgtg gcctacaatt gcatgatgtc tccattgaca cgtgacttct 720cgtctccttt
cttaatatat ctaacaaaca ctcctacctc ttccaaaata tatacacatc 780tttttgatca
atctctcatt caaaatctca ttctctctag taaacaagaa caaaaaaggt 840accccatcac
aagtttgtac aaaaaagcag gctccgcggc cgcttgctcc gttaaaaaaa 900accatggcag
agcactacgg gcagcagcaa caaacacgtg ccccccactt gcaactgcaa 960ccgcgtgctc
agcgtgttgt aaaggcagcg accgcggtta ctgccggagg tagcctgttg 1020gtgttatccg
ggttgaccct ggctggaacg gtcattgcgc tgacaatcgc cacacctctc 1080ctggtgatct
tctcgcctgt actggtccct gcaggtaaat ttctgtgttc cttattctct 1140caaaatcttc
gattttgttt tcgttcgatc ccaatttcgt atatgttctt tggtttagat 1200tctgttaatc
ttagatcgaa gacgattttc tgggtttgat cgttagatat catcttaatt 1260ctcgattagg
gtttcataga tatcatccga tttgttcaaa taatttgagt tttgtcgaat 1320aattactctt
cgatttgtga tttctatcta gatctggtgt tagtttctag tttgtgcgat 1380cgaatttgtc
gattaatctg agtttttctg atctgcagtc atcaccatct tcctgttggg 1440agcgggtttt
ctggcaagcg ggggatttgg tgttgccgct ctgtctgtgc tgtcctggat 1500ctaccgttac
ctgacgggga aacatccacc cggagcggat cagttggagt cggccaagac 1560aaagttggcg
agcaaggccc gtgaaatgaa ggaccgtgcc gagcagttca gtcagcaacc 1620ggtagcaggg
tctcagacca gcatcgatcc atcctcctgg ctcgagatgg cggaacacta 1680cgggcaacaa
cagcagactc gtgctcccca cctgcaatta caaccccgtg cccaacgtgt 1740tgtgaaagcg
gcaacagcag taacggcagg gggaagtttg ctggtcttat cggggttgac 1800cttagcggga
accgtgattg ccctgacaat tgcgactccg ctgctggtta tcttcagccc 1860cgtattggtt
ccggccgtga tcacgatttt tttgctgggg gcaggatttt tagccagcgg 1920aggatttggg
gtcgcagcgt tgtctgtgct gagttggatc tatcgttatt tgaccgggaa 1980gcacccacct
ggagcagacc agctggagag cgcgaaaacg aagctggcat cgaaggcgcg 2040tgaaatgaag
gatcgtgctg aacaattctc ccagcagcct gttgccggtt ctcagaccag 2100ccatatgttt
aaatggccaa gcgctatggc cgagcattat gggcagcaac agcaaacccg 2160tgccccgcat
ctgcaattgc aacctcgtgc ccagcgtgtc gttaaggcgg ctactgcggt 2220aacagcggga
gggagcttac tggtattaag cgggctgaca ttggccggaa cggtgatcgc 2280cttaacaatc
gcgacaccct tgctggtcat cttcagtccg gttctggtgc ccgcggtgat 2340tacgattttc
ctgctgggag ccggtttctt agcatcgggg ggttttgggg tagcagcctt 2400gagtgtcctg
tcgtggatct atcgttactt aactggaaaa cacccgccag gagctgacca 2460gttggagtct
gcaaaaacta agctggcgtc caaagcccgt gaaatgaagg atcgtgctga 2520gcagtttagc
cagcagccag ttgcgggaag tcagacctct tcatctgagc tcccatgggt 2580cgacatggcg
gagcattacg gtcaacagca acagacccgt gctccgcact tacaattgca 2640accacgtgct
caacgtgtcg taaaagccgc cacggcagtt actgcggggg gatcattgct 2700ggtgttaagt
gggttgacac tggcggggac agttattgca ctgacgatcg cgaccccctt 2760gttagtgatc
ttctcccccg ttctggttcc ggcggtcatt acaatctttc tgttgggtgc 2820cggattttta
gcctctgggg gatttggagt agctgccctg tcagtgttga gctggatcta 2880ccgttactta
acagggaagc accctcccgg ggcagatcag ttggaaagcg ccaagaccaa 2940gctggcaagt
aaagcgcgtg aaatgaagga ccgtgccgaa caattttcgc agcaaccggt 3000tgcgggatca
cagacctcta gtactccatc ctcctggcat atgatggccg agcactatgg 3060acaacagcag
cagacgcgtg cccctcatct gcaactgcaa ccccgtgctc aacgtgtcgt 3120taaggctgcg
acagcagtaa ccgctggggg ttctctgtta gtgttgtcag ggctgacttt 3180ggcggggacg
gtaattgcgt tgaccattgc caccccgctg ttagtgattt tcagcccggt 3240actggtgcca
gcagttatca cgatcttctt gctgggtgcc ggattcttgg caagtggagg 3300ttttggagtt
gcggcgctgt cagttttatc ctggatctat cgttatctga caggaaaaca 3360tcccccaggt
gccgatcagc tggagagtgc caagacaaaa ctggcgtcta aggcacgtga 3420aatgaaggat
cgtgccgaac agttttctca acagcccgta gcggggtcac agacctcgat 3480cgatcagcag
gttaacgtgc acatggccga acattacgga cagcaacaac agacgcgtgc 3540tccacacctg
caattgcaac cgcgtgctca acgtgttgtc aaagcggcga ccgccgtaac 3600agcaggagga
tcactgttag tgctgtcggg tttaaccttg gccgggaccg tcattgcatt 3660gactattgcg
acgcccttac tggtgatctt ttctccggtg ctggttcccg ccgttattac 3720catcttcttg
ttaggggcag gattcctggc atcaggggga ttcggagttg cggcgttgag 3780tgtcttaagt
tggatctacc gttatctgac tggaaagcac ccgcctgggg ccgatcaact 3840ggagtcagcc
aaaacgaaat tggcgtcaaa agcgcgtgaa atgaaggacc gtgctgagca 3900gttttctcag
cagcctgtgg caggatccca gacatcacca tggctcgagt aatgaagcgg 3960ccgcacccag
ctttcttgta caaagtggt
3989722787DNAArtificial SequenceSynthetic sequence 72tcgacgaatt
aattccaatc ccacaaaaat ctgagcttaa cagcacagtt gctcctctca 60gagcagaatc
gggtattcaa caccctcata tcaactacta cgttgtgtat aacggtccac 120atgccggtat
atacgatgac tggggttgta caaaggcggc aacaaacggc gttcccggag 180ttgcacacaa
gaaatttgcc actattacag aggcaagagc agcagctgac gcgtacacaa 240caagtcagca
aacagacagg ttgaacttca tccccaaagg agaagctcaa ctcaagccca 300agagctttgc
taaggcccta acaagcccac caaagcaaaa agcccactgg ctcacgctag 360gaaccaaaag
gcccagcagt gatccagccc caaaagagat ctcctttgcc ccggagatta 420caatggacga
tttcctctat ctttacgatc taggaaggaa gttcgaaggt gaaggtgacg 480acactatgtt
caccactgat aatgagaagg ttagcctctt caatttcaga aagaatgctg 540acccacagat
ggttagagag gcctacgcag caggtctcat caagacgatc tacccgagta 600acaatctcca
ggagatcaaa taccttccca agaaggttaa agatgcagtc aaaagattca 660ggactaattg
catcaagaac acagagaaag acatatttct caagatcaga agtactattc 720cagtatggac
gattcaaggc ttgcttcata aaccaaggca agtaatagag attggagtct 780ctaaaaaggt
agttcctact gaatctaagg ccatgcatgg agtctaagat tcaaatcgag 840gatctaacag
aactcgccgt gaagactggc gaacagttca tacagagtct tttacgactc 900aatgacaaga
agaaaatctt cgtcaacatg gtggagcacg acactctggt ctactccaaa 960aatgtcaaag
atacagtctc agaagaccaa agggctattg agacttttca acaaaggata 1020atttcgggaa
acctcctcgg attccattgc ccagctatct gtcacttcat cgaaaggaca 1080gtagaaaagg
aaggtggctc ctacaaatgc catcattgcg ataaaggaaa ggctatcatt 1140caagatctct
ctgccgacag tggtcccaaa gatggacccc cacccacgag gagcatcgtg 1200gaaaaagaag
acgttccaac cacgtcttca aagcaagtgg attgatgtga catctccact 1260gacgtaaggg
atgacgcaca atcccactat ccttcgcaag acccttcctc tatataagga 1320agttcatttc
atttggagag gacacgctcg aggaattcgg taccccatca caagtttgta 1380caaaaaagca
ggctccgcgg ccgcttgctc cgttaaaaaa aaccatggct gagcattatg 1440gtcaacaaca
gcagaccagg gcgcctcacc tgcagctgca gccgcgcgcc cagcgggtag 1500tgaaggcggc
caccgccgtg acagccggcg gctcgcttct cgtcctctct ggcctcactt 1560tagccggaac
tgttattgcg ctcaccatcg ccactccgct gcttgtgatc tttagccccg 1620ttctggtgcc
ggcggtcata accattttct tgctgggtgc gggttttctg gcatccggag 1680gcttcggcgt
ggcggcgctg agtgtgctgt cgtggattta cagatatctg acagggaaac 1740acccgccggg
ggcggatcag ctggaatcgg caaagacgaa gctggcgagc aaggcgcgag 1800agatgaagga
tagggcagag cagttctcgc agcagcctgt tggaggcggt ggatccggag 1860gcggtggtag
tatggctgag cattatggtc aacaacagca gaccagggcg cctcacctgc 1920agctgcagcc
gcgcgcccag cgggtagtga aggcggccac cgccgtgaca gccggcggct 1980cgcttctcgt
cctctctggc ctcactttag ccggaactgt tattgcgctc accatcgcca 2040ctccgctgct
tgtgatcttt agccccgttc tggtgccggc ggtcataacc attttcttgc 2100tgggtgcggg
ttttctggca tccggaggct tcggcgtggc ggcgctgagt gtgctgtcgt 2160ggatttacag
atatctgaca gggaaacacc cgccgggggc ggatcagctg gaatcggcaa 2220agacgaagct
ggcgagcaag gcgcgagaga tgaaggatag ggcagagcag ttctcgcagc 2280agcctgttgg
gggcggtgga tccggtggag ggggatccat ggctgagcat tatggtcaac 2340aacagcagac
cagggcgcct cacctgcagc tgcagccgcg cgcccagcgg gtagtgaagg 2400cggccaccgc
cgtgacagcc ggcggctcgc ttctcgtcct ctctggcctc actttagccg 2460gaactgttat
tgcgctcacc atcgccactc cgctgcttgt gatctttagc cccgttctgg 2520tgccggcggt
cataaccatt ttcttgctgg gtgcgggttt tctggcatcc ggaggcttcg 2580gcgtggcggc
gctgagtgtg ctgtcgtgga tttacagata tctgacaggg aaacacccgc 2640cgggggcgga
tcagctggaa tcggcaaaga cgaagctggc gagcaaggcg cgagagatga 2700aggatagggc
agagcagttc tcgcagcagc ctgttccatg gctcgagtaa tgaagcggcc 2760gcacccagct
ttcttgtaca aagtggt
2787732263DNAArtificial SequenceSynthetic sequence 73gcaggaactc
tctggtaagc tagctccact ccccagaaac aaccggcgcc aaattgcgcg 60aattgctgac
ctgaagacgg aacatcatcg tcgggtcctt gggcgattgc ggcggaagat 120gggtcagctt
gggcttgagg acgagacccg aatccgagtc tgttgaaaag gttgttcatt 180ggggatttgt
atacggagat tggtcgtcga gaggtttgag ggaaaggaca aatgggtttg 240gctctggaga
aagagagtgc ggctttagag agagaattga gaggtttaga gagagatgcg 300gcggcgatga
gcggaggaga gacgacgagg acctgcatta tcaaagcagt gacgtggtga 360aatttggaac
ttttaagagg cagatagatt tattatttgt atccattttc ttcattgttc 420tagaatgtcg
cggaacaaat tttaaaacta aatcctaaat ttttctaatt ttgttgccaa 480tagtggatat
gtgggccgta tagaaggaat ctattgaagg cccaaaccca tactgacgag 540cccaaaggtt
cgttttgcgt tttatgtttc ggttcgatgc caacgccaca ttctgagcta 600ggcaaaaaac
aaacgtgtct ttgaatagac tcctctcgtt aacacatgca gcggctgcat 660ggtgacgcca
ttaacacgtg gcctacaatt gcatgatgtc tccattgaca cgtgacttct 720cgtctccttt
cttaatatat ctaacaaaca ctcctacctc ttccaaaata tatacacatc 780tttttgatca
atctctcatt caaaatctca ttctctctag taaacaagaa caaaaaaggt 840accccatcac
aagtttgtac aaaaaagcag gctccgcggc cgcttgctcc gttaaaaaaa 900accatggctg
agcattatgg tcaacaacag cagaccaggg cgcctcacct gcagctgcag 960ccgcgcgccc
agcgggtagt gaaggcggcc accgccgtga cagccggcgg ctcgcttctc 1020gtcctctctg
gcctcacttt agccggaact gttattgcgc tcaccatcgc cactccgctg 1080cttgtgatct
ttagccccgt tctggtgccg gcggtcataa ccattttctt gctgggtgcg 1140ggttttctgg
catccggagg cttcggcgtg gcggcgctga gtgtgctgtc gtggatttac 1200agatatctga
cagggaaaca cccgccgggg gcggatcagc tggaatcggc aaagacgaag 1260ctggcgagca
aggcgcgaga gatgaaggat agggcagagc agttctcgca gcagcctgtt 1320ggaggcggtg
gatccggagg cggtggtagt atggctgagc attatggtca acaacagcag 1380accagggcgc
ctcacctgca gctgcagccg cgcgcccagc gggtagtgaa ggcggccacc 1440gccgtgacag
ccggcggctc gcttctcgtc ctctctggcc tcactttagc cggaactgtt 1500attgcgctca
ccatcgccac tccgctgctt gtgatcttta gccccgttct ggtgccggcg 1560gtcataacca
ttttcttgct gggtgcgggt tttctggcat ccggaggctt cggcgtggcg 1620gcgctgagtg
tgctgtcgtg gatttacaga tatctgacag ggaaacaccc gccgggggcg 1680gatcagctgg
aatcggcaaa gacgaagctg gcgagcaagg cgcgagagat gaaggatagg 1740gcagagcagt
tctcgcagca gcctgttggg ggcggtggat ccggtggagg gggatccatg 1800gctgagcatt
atggtcaaca acagcagacc agggcgcctc acctgcagct gcagccgcgc 1860gcccagcggg
tagtgaaggc ggccaccgcc gtgacagccg gcggctcgct tctcgtcctc 1920tctggcctca
ctttagccgg aactgttatt gcgctcacca tcgccactcc gctgcttgtg 1980atctttagcc
ccgttctggt gccggcggtc ataaccattt tcttgctggg tgcgggtttt 2040ctggcatccg
gaggcttcgg cgtggcggcg ctgagtgtgc tgtcgtggat ttacagatat 2100ctgacaggga
aacacccgcc gggggcggat cagctggaat cggcaaagac gaagctggcg 2160agcaaggcgc
gagagatgaa ggatagggca gagcagttct cgcagcagcc tgttccatgg 2220ctcgagtaat
gaagcggccg cacccagctt tcttgtacaa agt
226374153PRTArtificial SequenceSynthetic sequence 74Met Ala Glu His Tyr
Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu1 5
10 15Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys
Ala Ala Thr Ala Val 20 25
30Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly
35 40 45Thr Val Ile Ala Leu Thr Ile Ala
Thr Pro Leu Leu Val Ile Phe Ser 50 55
60Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly65
70 75 80Phe Leu Ala Ser Gly
Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser 85
90 95Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro
Pro Gly Ala Asp Gln 100 105
110Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys
115 120 125Asp Arg Ala Glu Gln Phe Ser
Gln Gln Pro Val Ala Gly Ser Gln Thr 130 135
140Ser Ile Asp Pro Ser Ser Trp Leu Glu145
15075455PRTArtificial SequenceSynthetic sequence 75Met Ala Glu His Tyr
Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu1 5
10 15Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys
Ala Ala Thr Ala Val 20 25
30Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly
35 40 45Thr Val Ile Ala Leu Thr Ile Ala
Thr Pro Leu Leu Val Ile Phe Ser 50 55
60Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly65
70 75 80Phe Leu Ala Ser Gly
Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser 85
90 95Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro
Pro Gly Ala Asp Gln 100 105
110Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys
115 120 125Asp Arg Ala Glu Gln Phe Ser
Gln Gln Pro Val Ala Gly Ser Gln Thr 130 135
140Ser Ile Asp Pro Ser Ser Trp His Met Met Ala Glu His Tyr Gly
Gln145 150 155 160Gln Gln
Gln Thr Arg Ala Pro His Leu Gln Leu Gln Pro Arg Ala Gln
165 170 175Arg Val Val Lys Ala Ala Thr
Ala Val Thr Ala Gly Gly Ser Leu Leu 180 185
190Val Leu Ser Gly Leu Thr Leu Ala Gly Thr Val Ile Ala Leu
Thr Ile 195 200 205Ala Thr Pro Leu
Leu Val Ile Phe Ser Pro Val Leu Val Pro Ala Val 210
215 220Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala
Ser Gly Gly Phe225 230 235
240Gly Val Ala Ala Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr Leu Thr
245 250 255Gly Lys His Pro Pro
Gly Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys 260
265 270Leu Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala
Glu Gln Phe Ser 275 280 285Gln Gln
Pro Val Ala Gly Ser Gln Thr Ser Ile Asp Gln Gln Val Asn 290
295 300Val His Met Ala Glu His Tyr Gly Gln Gln Gln
Gln Thr Arg Ala Pro305 310 315
320His Leu Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr
325 330 335Ala Val Thr Ala
Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu 340
345 350Ala Gly Thr Val Ile Ala Leu Thr Ile Ala Thr
Pro Leu Leu Val Ile 355 360 365Phe
Ser Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly 370
375 380Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly
Val Ala Ala Leu Ser Val385 390 395
400Leu Ser Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly
Ala 405 410 415Asp Gln Leu
Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu 420
425 430Met Lys Asp Arg Ala Glu Gln Phe Ser Gln
Gln Pro Val Ala Gly Ser 435 440
445Gln Thr Ser Pro Trp Leu Glu 450
45576602PRTArtificial SequenceSynthetic sequence 76Met Ala Glu His Tyr
Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu1 5
10 15Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys
Ala Ala Thr Ala Val 20 25
30Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly
35 40 45Thr Val Ile Ala Leu Thr Ile Ala
Thr Pro Leu Leu Val Ile Phe Ser 50 55
60Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly65
70 75 80Phe Leu Ala Ser Gly
Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser 85
90 95Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro
Pro Gly Ala Asp Gln 100 105
110Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys
115 120 125Asp Arg Ala Glu Gln Phe Ser
Gln Gln Pro Val Ala Gly Ser Gln Thr 130 135
140Ser Ile Asp Pro Ser Ser Trp Leu Glu Met Ala Glu His Tyr Gly
Gln145 150 155 160Gln Gln
Gln Thr Arg Ala Pro His Leu Gln Leu Gln Pro Arg Ala Gln
165 170 175Arg Val Val Lys Ala Ala Thr
Ala Val Thr Ala Gly Gly Ser Leu Leu 180 185
190Val Leu Ser Gly Leu Thr Leu Ala Gly Thr Val Ile Ala Leu
Thr Ile 195 200 205Ala Thr Pro Leu
Leu Val Ile Phe Ser Pro Val Leu Val Pro Ala Val 210
215 220Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala
Ser Gly Gly Phe225 230 235
240Gly Val Ala Ala Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr Leu Thr
245 250 255Gly Lys His Pro Pro
Gly Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys 260
265 270Leu Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala
Glu Gln Phe Ser 275 280 285Gln Gln
Pro Val Ala Gly Ser Gln Thr Ser His Met Met Ala Glu His 290
295 300Tyr Gly Gln Gln Gln Gln Thr Arg Ala Pro His
Leu Gln Leu Gln Pro305 310 315
320Arg Ala Gln Arg Val Val Lys Ala Ala Thr Ala Val Thr Ala Gly Gly
325 330 335Ser Leu Leu Val
Leu Ser Gly Leu Thr Leu Ala Gly Thr Val Ile Ala 340
345 350Leu Thr Ile Ala Thr Pro Leu Leu Val Ile Phe
Ser Pro Val Leu Val 355 360 365Pro
Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala Ser 370
375 380Gly Gly Phe Gly Val Ala Ala Leu Ser Val
Leu Ser Trp Ile Tyr Arg385 390 395
400Tyr Leu Thr Gly Lys His Pro Pro Gly Ala Asp Gln Leu Glu Ser
Ala 405 410 415Lys Thr Lys
Leu Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala Glu 420
425 430Gln Phe Ser Gln Gln Pro Val Ala Gly Ser
Gln Thr Ser Ile Asp Gln 435 440
445Gln Val Asn Val His Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr 450
455 460Arg Ala Pro His Leu Gln Leu Gln
Pro Arg Ala Gln Arg Val Val Lys465 470
475 480Ala Ala Thr Ala Val Thr Ala Gly Gly Ser Leu Leu
Val Leu Ser Gly 485 490
495Leu Thr Leu Ala Gly Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu
500 505 510Leu Val Ile Phe Ser Pro
Val Leu Val Pro Ala Val Ile Thr Ile Phe 515 520
525Leu Leu Gly Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly Val
Ala Ala 530 535 540Leu Ser Val Leu Ser
Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro545 550
555 560Pro Gly Ala Asp Gln Leu Glu Ser Ala Lys
Thr Lys Leu Ala Ser Lys 565 570
575Ala Arg Glu Met Lys Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val
580 585 590Ala Gly Ser Gln Thr
Ser Pro Trp Leu Glu 595 60077914PRTArtificial
SequenceSynthetic sequence 77Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr
Arg Ala Pro His Leu1 5 10
15Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr Ala Val
20 25 30Thr Ala Gly Gly Ser Leu Leu
Val Leu Ser Gly Leu Thr Leu Ala Gly 35 40
45Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu Val Ile Phe
Ser 50 55 60Pro Val Leu Val Pro Ala
Val Ile Thr Ile Phe Leu Leu Gly Ala Gly65 70
75 80Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala
Leu Ser Val Leu Ser 85 90
95Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly Ala Asp Gln
100 105 110Leu Glu Ser Ala Lys Thr
Lys Leu Ala Ser Lys Ala Arg Glu Met Lys 115 120
125Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val Ala Gly Ser
Gln Thr 130 135 140Ser Ile Asp Pro Ser
Ser Trp Leu Glu Met Ala Glu His Tyr Gly Gln145 150
155 160Gln Gln Gln Thr Arg Ala Pro His Leu Gln
Leu Gln Pro Arg Ala Gln 165 170
175Arg Val Val Lys Ala Ala Thr Ala Val Thr Ala Gly Gly Ser Leu Leu
180 185 190Val Leu Ser Gly Leu
Thr Leu Ala Gly Thr Val Ile Ala Leu Thr Ile 195
200 205Ala Thr Pro Leu Leu Val Ile Phe Ser Pro Val Leu
Val Pro Ala Val 210 215 220Ile Thr Ile
Phe Leu Leu Gly Ala Gly Phe Leu Ala Ser Gly Gly Phe225
230 235 240Gly Val Ala Ala Leu Ser Val
Leu Ser Trp Ile Tyr Arg Tyr Leu Thr 245
250 255Gly Lys His Pro Pro Gly Ala Asp Gln Leu Glu Ser
Ala Lys Thr Lys 260 265 270Leu
Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala Glu Gln Phe Ser 275
280 285Gln Gln Pro Val Ala Gly Ser Gln Thr
Ser His Met Phe Lys Trp Pro 290 295
300Ser Ala Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr Arg Ala Pro305
310 315 320His Leu Gln Leu
Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr 325
330 335Ala Val Thr Ala Gly Gly Ser Leu Leu Val
Leu Ser Gly Leu Thr Leu 340 345
350Ala Gly Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu Val Ile
355 360 365Phe Ser Pro Val Leu Val Pro
Ala Val Ile Thr Ile Phe Leu Leu Gly 370 375
380Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala Leu Ser
Val385 390 395 400Leu Ser
Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly Ala
405 410 415Asp Gln Leu Glu Ser Ala Lys
Thr Lys Leu Ala Ser Lys Ala Arg Glu 420 425
430Met Lys Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val Ala
Gly Ser 435 440 445Gln Thr Ser Ser
Ser Glu Leu Pro Trp Val Asp Met Ala Glu His Tyr 450
455 460Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu Gln
Leu Gln Pro Arg465 470 475
480Ala Gln Arg Val Val Lys Ala Ala Thr Ala Val Thr Ala Gly Gly Ser
485 490 495Leu Leu Val Leu Ser
Gly Leu Thr Leu Ala Gly Thr Val Ile Ala Leu 500
505 510Thr Ile Ala Thr Pro Leu Leu Val Ile Phe Ser Pro
Val Leu Val Pro 515 520 525Ala Val
Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala Ser Gly 530
535 540Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser
Trp Ile Tyr Arg Tyr545 550 555
560Leu Thr Gly Lys His Pro Pro Gly Ala Asp Gln Leu Glu Ser Ala Lys
565 570 575Thr Lys Leu Ala
Ser Lys Ala Arg Glu Met Lys Asp Arg Ala Glu Gln 580
585 590Phe Ser Gln Gln Pro Val Ala Gly Ser Gln Thr
Ser Ser Thr Pro Ser 595 600 605Ser
Trp His Met Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr Arg 610
615 620Ala Pro His Leu Gln Leu Gln Pro Arg Ala
Gln Arg Val Val Lys Ala625 630 635
640Ala Thr Ala Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly
Leu 645 650 655Thr Leu Ala
Gly Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu 660
665 670Val Ile Phe Ser Pro Val Leu Val Pro Ala
Val Ile Thr Ile Phe Leu 675 680
685Leu Gly Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala Leu 690
695 700Ser Val Leu Ser Trp Ile Tyr Arg
Tyr Leu Thr Gly Lys His Pro Pro705 710
715 720Gly Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys Leu
Ala Ser Lys Ala 725 730
735Arg Glu Met Lys Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val Ala
740 745 750Gly Ser Gln Thr Ser Ile
Asp Gln Gln Val Asn Val His Met Ala Glu 755 760
765His Tyr Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu Gln
Leu Gln 770 775 780Pro Arg Ala Gln Arg
Val Val Lys Ala Ala Thr Ala Val Thr Ala Gly785 790
795 800Gly Ser Leu Leu Val Leu Ser Gly Leu Thr
Leu Ala Gly Thr Val Ile 805 810
815Ala Leu Thr Ile Ala Thr Pro Leu Leu Val Ile Phe Ser Pro Val Leu
820 825 830Val Pro Ala Val Ile
Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala 835
840 845Ser Gly Gly Phe Gly Val Ala Ala Leu Ser Val Leu
Ser Trp Ile Tyr 850 855 860Arg Tyr Leu
Thr Gly Lys His Pro Pro Gly Ala Asp Gln Leu Glu Ser865
870 875 880Ala Lys Thr Lys Leu Ala Ser
Lys Ala Arg Glu Met Lys Asp Arg Ala 885
890 895Glu Gln Phe Ser Gln Gln Pro Val Ala Gly Ser Gln
Thr Ser Pro Trp 900 905 910Leu
Glu78153PRTArtificial SequenceSynthetic sequence 78Met Ala Glu His Tyr
Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu1 5
10 15Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys
Ala Ala Thr Ala Val 20 25
30Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly
35 40 45Thr Val Ile Ala Leu Thr Ile Ala
Thr Pro Leu Leu Val Ile Phe Ser 50 55
60Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly65
70 75 80Phe Leu Ala Ser Gly
Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser 85
90 95Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro
Pro Gly Ala Asp Gln 100 105
110Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys
115 120 125Asp Arg Ala Glu Gln Phe Ser
Gln Gln Pro Val Ala Gly Ser Gln Thr 130 135
140Ser Ile Asp Pro Ser Ser Trp Leu Glu145
15079455PRTArtificial SequenceSynthetic sequence 79Met Ala Glu His Tyr
Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu1 5
10 15Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys
Ala Ala Thr Ala Val 20 25
30Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly
35 40 45Thr Val Ile Ala Leu Thr Ile Ala
Thr Pro Leu Leu Val Ile Phe Ser 50 55
60Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly65
70 75 80Phe Leu Ala Ser Gly
Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser 85
90 95Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro
Pro Gly Ala Asp Gln 100 105
110Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys
115 120 125Asp Arg Ala Glu Gln Phe Ser
Gln Gln Pro Val Ala Gly Ser Gln Thr 130 135
140Ser Ile Asp Pro Ser Ser Trp His Met Met Ala Glu His Tyr Gly
Gln145 150 155 160Gln Gln
Gln Thr Arg Ala Pro His Leu Gln Leu Gln Pro Arg Ala Gln
165 170 175Arg Val Val Lys Ala Ala Thr
Ala Val Thr Ala Gly Gly Ser Leu Leu 180 185
190Val Leu Ser Gly Leu Thr Leu Ala Gly Thr Val Ile Ala Leu
Thr Ile 195 200 205Ala Thr Pro Leu
Leu Val Ile Phe Ser Pro Val Leu Val Pro Ala Val 210
215 220Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala
Ser Gly Gly Phe225 230 235
240Gly Val Ala Ala Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr Leu Thr
245 250 255Gly Lys His Pro Pro
Gly Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys 260
265 270Leu Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala
Glu Gln Phe Ser 275 280 285Gln Gln
Pro Val Ala Gly Ser Gln Thr Ser Ile Asp Gln Gln Val Asn 290
295 300Val His Met Ala Glu His Tyr Gly Gln Gln Gln
Gln Thr Arg Ala Pro305 310 315
320His Leu Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr
325 330 335Ala Val Thr Ala
Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu 340
345 350Ala Gly Thr Val Ile Ala Leu Thr Ile Ala Thr
Pro Leu Leu Val Ile 355 360 365Phe
Ser Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly 370
375 380Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly
Val Ala Ala Leu Ser Val385 390 395
400Leu Ser Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly
Ala 405 410 415Asp Gln Leu
Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu 420
425 430Met Lys Asp Arg Ala Glu Gln Phe Ser Gln
Gln Pro Val Ala Gly Ser 435 440
445Gln Thr Ser Pro Trp Leu Glu 450
45580602PRTArtificial SequenceSynthetic sequence 80Met Ala Glu His Tyr
Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu1 5
10 15Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys
Ala Ala Thr Ala Val 20 25
30Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly
35 40 45Thr Val Ile Ala Leu Thr Ile Ala
Thr Pro Leu Leu Val Ile Phe Ser 50 55
60Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly65
70 75 80Phe Leu Ala Ser Gly
Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser 85
90 95Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro
Pro Gly Ala Asp Gln 100 105
110Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys
115 120 125Asp Arg Ala Glu Gln Phe Ser
Gln Gln Pro Val Ala Gly Ser Gln Thr 130 135
140Ser Ile Asp Pro Ser Ser Trp Leu Glu Met Ala Glu His Tyr Gly
Gln145 150 155 160Gln Gln
Gln Thr Arg Ala Pro His Leu Gln Leu Gln Pro Arg Ala Gln
165 170 175Arg Val Val Lys Ala Ala Thr
Ala Val Thr Ala Gly Gly Ser Leu Leu 180 185
190Val Leu Ser Gly Leu Thr Leu Ala Gly Thr Val Ile Ala Leu
Thr Ile 195 200 205Ala Thr Pro Leu
Leu Val Ile Phe Ser Pro Val Leu Val Pro Ala Val 210
215 220Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala
Ser Gly Gly Phe225 230 235
240Gly Val Ala Ala Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr Leu Thr
245 250 255Gly Lys His Pro Pro
Gly Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys 260
265 270Leu Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala
Glu Gln Phe Ser 275 280 285Gln Gln
Pro Val Ala Gly Ser Gln Thr Ser His Met Met Ala Glu His 290
295 300Tyr Gly Gln Gln Gln Gln Thr Arg Ala Pro His
Leu Gln Leu Gln Pro305 310 315
320Arg Ala Gln Arg Val Val Lys Ala Ala Thr Ala Val Thr Ala Gly Gly
325 330 335Ser Leu Leu Val
Leu Ser Gly Leu Thr Leu Ala Gly Thr Val Ile Ala 340
345 350Leu Thr Ile Ala Thr Pro Leu Leu Val Ile Phe
Ser Pro Val Leu Val 355 360 365Pro
Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala Ser 370
375 380Gly Gly Phe Gly Val Ala Ala Leu Ser Val
Leu Ser Trp Ile Tyr Arg385 390 395
400Tyr Leu Thr Gly Lys His Pro Pro Gly Ala Asp Gln Leu Glu Ser
Ala 405 410 415Lys Thr Lys
Leu Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala Glu 420
425 430Gln Phe Ser Gln Gln Pro Val Ala Gly Ser
Gln Thr Ser Ile Asp Gln 435 440
445Gln Val Asn Val His Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr 450
455 460Arg Ala Pro His Leu Gln Leu Gln
Pro Arg Ala Gln Arg Val Val Lys465 470
475 480Ala Ala Thr Ala Val Thr Ala Gly Gly Ser Leu Leu
Val Leu Ser Gly 485 490
495Leu Thr Leu Ala Gly Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu
500 505 510Leu Val Ile Phe Ser Pro
Val Leu Val Pro Ala Val Ile Thr Ile Phe 515 520
525Leu Leu Gly Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly Val
Ala Ala 530 535 540Leu Ser Val Leu Ser
Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro545 550
555 560Pro Gly Ala Asp Gln Leu Glu Ser Ala Lys
Thr Lys Leu Ala Ser Lys 565 570
575Ala Arg Glu Met Lys Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val
580 585 590Ala Gly Ser Gln Thr
Ser Pro Trp Leu Glu 595 60081914PRTArtificial
SequenceSynthetic sequence 81Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr
Arg Ala Pro His Leu1 5 10
15Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr Ala Val
20 25 30Thr Ala Gly Gly Ser Leu Leu
Val Leu Ser Gly Leu Thr Leu Ala Gly 35 40
45Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu Val Ile Phe
Ser 50 55 60Pro Val Leu Val Pro Ala
Val Ile Thr Ile Phe Leu Leu Gly Ala Gly65 70
75 80Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala
Leu Ser Val Leu Ser 85 90
95Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly Ala Asp Gln
100 105 110Leu Glu Ser Ala Lys Thr
Lys Leu Ala Ser Lys Ala Arg Glu Met Lys 115 120
125Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val Ala Gly Ser
Gln Thr 130 135 140Ser Ile Asp Pro Ser
Ser Trp Leu Glu Met Ala Glu His Tyr Gly Gln145 150
155 160Gln Gln Gln Thr Arg Ala Pro His Leu Gln
Leu Gln Pro Arg Ala Gln 165 170
175Arg Val Val Lys Ala Ala Thr Ala Val Thr Ala Gly Gly Ser Leu Leu
180 185 190Val Leu Ser Gly Leu
Thr Leu Ala Gly Thr Val Ile Ala Leu Thr Ile 195
200 205Ala Thr Pro Leu Leu Val Ile Phe Ser Pro Val Leu
Val Pro Ala Val 210 215 220Ile Thr Ile
Phe Leu Leu Gly Ala Gly Phe Leu Ala Ser Gly Gly Phe225
230 235 240Gly Val Ala Ala Leu Ser Val
Leu Ser Trp Ile Tyr Arg Tyr Leu Thr 245
250 255Gly Lys His Pro Pro Gly Ala Asp Gln Leu Glu Ser
Ala Lys Thr Lys 260 265 270Leu
Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala Glu Gln Phe Ser 275
280 285Gln Gln Pro Val Ala Gly Ser Gln Thr
Ser His Met Phe Lys Trp Pro 290 295
300Ser Ala Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr Arg Ala Pro305
310 315 320His Leu Gln Leu
Gln Pro Arg Ala Gln Arg Val Val Lys Ala Ala Thr 325
330 335Ala Val Thr Ala Gly Gly Ser Leu Leu Val
Leu Ser Gly Leu Thr Leu 340 345
350Ala Gly Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu Val Ile
355 360 365Phe Ser Pro Val Leu Val Pro
Ala Val Ile Thr Ile Phe Leu Leu Gly 370 375
380Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala Leu Ser
Val385 390 395 400Leu Ser
Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro Pro Gly Ala
405 410 415Asp Gln Leu Glu Ser Ala Lys
Thr Lys Leu Ala Ser Lys Ala Arg Glu 420 425
430Met Lys Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val Ala
Gly Ser 435 440 445Gln Thr Ser Ser
Ser Glu Leu Pro Trp Val Asp Met Ala Glu His Tyr 450
455 460Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu Gln
Leu Gln Pro Arg465 470 475
480Ala Gln Arg Val Val Lys Ala Ala Thr Ala Val Thr Ala Gly Gly Ser
485 490 495Leu Leu Val Leu Ser
Gly Leu Thr Leu Ala Gly Thr Val Ile Ala Leu 500
505 510Thr Ile Ala Thr Pro Leu Leu Val Ile Phe Ser Pro
Val Leu Val Pro 515 520 525Ala Val
Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala Ser Gly 530
535 540Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser
Trp Ile Tyr Arg Tyr545 550 555
560Leu Thr Gly Lys His Pro Pro Gly Ala Asp Gln Leu Glu Ser Ala Lys
565 570 575Thr Lys Leu Ala
Ser Lys Ala Arg Glu Met Lys Asp Arg Ala Glu Gln 580
585 590Phe Ser Gln Gln Pro Val Ala Gly Ser Gln Thr
Ser Ser Thr Pro Ser 595 600 605Ser
Trp His Met Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr Arg 610
615 620Ala Pro His Leu Gln Leu Gln Pro Arg Ala
Gln Arg Val Val Lys Ala625 630 635
640Ala Thr Ala Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly
Leu 645 650 655Thr Leu Ala
Gly Thr Val Ile Ala Leu Thr Ile Ala Thr Pro Leu Leu 660
665 670Val Ile Phe Ser Pro Val Leu Val Pro Ala
Val Ile Thr Ile Phe Leu 675 680
685Leu Gly Ala Gly Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala Leu 690
695 700Ser Val Leu Ser Trp Ile Tyr Arg
Tyr Leu Thr Gly Lys His Pro Pro705 710
715 720Gly Ala Asp Gln Leu Glu Ser Ala Lys Thr Lys Leu
Ala Ser Lys Ala 725 730
735Arg Glu Met Lys Asp Arg Ala Glu Gln Phe Ser Gln Gln Pro Val Ala
740 745 750Gly Ser Gln Thr Ser Ile
Asp Gln Gln Val Asn Val His Met Ala Glu 755 760
765His Tyr Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu Gln
Leu Gln 770 775 780Pro Arg Ala Gln Arg
Val Val Lys Ala Ala Thr Ala Val Thr Ala Gly785 790
795 800Gly Ser Leu Leu Val Leu Ser Gly Leu Thr
Leu Ala Gly Thr Val Ile 805 810
815Ala Leu Thr Ile Ala Thr Pro Leu Leu Val Ile Phe Ser Pro Val Leu
820 825 830Val Pro Ala Val Ile
Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala 835
840 845Ser Gly Gly Phe Gly Val Ala Ala Leu Ser Val Leu
Ser Trp Ile Tyr 850 855 860Arg Tyr Leu
Thr Gly Lys His Pro Pro Gly Ala Asp Gln Leu Glu Ser865
870 875 880Ala Lys Thr Lys Leu Ala Ser
Lys Ala Arg Glu Met Lys Asp Arg Ala 885
890 895Glu Gln Phe Ser Gln Gln Pro Val Ala Gly Ser Gln
Thr Ser Pro Trp 900 905 910Leu
Glu82441PRTArtificial SequenceSynthetic sequence 82Met Ala Glu His Tyr
Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu1 5
10 15Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys
Ala Ala Thr Ala Val 20 25
30Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr Leu Ala Gly
35 40 45Thr Val Ile Ala Leu Thr Ile Ala
Thr Pro Leu Leu Val Ile Phe Ser 50 55
60Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu Leu Gly Ala Gly65
70 75 80Phe Leu Ala Ser Gly
Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser 85
90 95Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro
Pro Gly Ala Asp Gln 100 105
110Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala Arg Glu Met Lys
115 120 125Asp Arg Ala Glu Gln Phe Ser
Gln Gln Pro Val Gly Gly Gly Gly Ser 130 135
140Gly Gly Gly Gly Ser Met Ala Glu His Tyr Gly Gln Gln Gln Gln
Thr145 150 155 160Arg Ala
Pro His Leu Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys
165 170 175Ala Ala Thr Ala Val Thr Ala
Gly Gly Ser Leu Leu Val Leu Ser Gly 180 185
190Leu Thr Leu Ala Gly Thr Val Ile Ala Leu Thr Ile Ala Thr
Pro Leu 195 200 205Leu Val Ile Phe
Ser Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe 210
215 220Leu Leu Gly Ala Gly Phe Leu Ala Ser Gly Gly Phe
Gly Val Ala Ala225 230 235
240Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His Pro
245 250 255Pro Gly Ala Asp Gln
Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys 260
265 270Ala Arg Glu Met Lys Asp Arg Ala Glu Gln Phe Ser
Gln Gln Pro Val 275 280 285Gly Gly
Gly Gly Ser Gly Gly Gly Gly Ser Met Ala Glu His Tyr Gly 290
295 300Gln Gln Gln Gln Thr Arg Ala Pro His Leu Gln
Leu Gln Pro Arg Ala305 310 315
320Gln Arg Val Val Lys Ala Ala Thr Ala Val Thr Ala Gly Gly Ser Leu
325 330 335Leu Val Leu Ser
Gly Leu Thr Leu Ala Gly Thr Val Ile Ala Leu Thr 340
345 350Ile Ala Thr Pro Leu Leu Val Ile Phe Ser Pro
Val Leu Val Pro Ala 355 360 365Val
Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala Ser Gly Gly 370
375 380Phe Gly Val Ala Ala Leu Ser Val Leu Ser
Trp Ile Tyr Arg Tyr Leu385 390 395
400Thr Gly Lys His Pro Pro Gly Ala Asp Gln Leu Glu Ser Ala Lys
Thr 405 410 415Lys Leu Ala
Ser Lys Ala Arg Glu Met Lys Asp Arg Ala Glu Gln Phe 420
425 430Ser Gln Gln Pro Val Pro Trp Leu Glu
435 44083441PRTArtificial SequenceSynthetic sequence
83Met Ala Glu His Tyr Gly Gln Gln Gln Gln Thr Arg Ala Pro His Leu1
5 10 15Gln Leu Gln Pro Arg Ala
Gln Arg Val Val Lys Ala Ala Thr Ala Val 20 25
30Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly Leu Thr
Leu Ala Gly 35 40 45Thr Val Ile
Ala Leu Thr Ile Ala Thr Pro Leu Leu Val Ile Phe Ser 50
55 60Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe Leu
Leu Gly Ala Gly65 70 75
80Phe Leu Ala Ser Gly Gly Phe Gly Val Ala Ala Leu Ser Val Leu Ser
85 90 95Trp Ile Tyr Arg Tyr Leu
Thr Gly Lys His Pro Pro Gly Ala Asp Gln 100
105 110Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys Ala
Arg Glu Met Lys 115 120 125Asp Arg
Ala Glu Gln Phe Ser Gln Gln Pro Val Gly Gly Gly Gly Ser 130
135 140Gly Gly Gly Gly Ser Met Ala Glu His Tyr Gly
Gln Gln Gln Gln Thr145 150 155
160Arg Ala Pro His Leu Gln Leu Gln Pro Arg Ala Gln Arg Val Val Lys
165 170 175Ala Ala Thr Ala
Val Thr Ala Gly Gly Ser Leu Leu Val Leu Ser Gly 180
185 190Leu Thr Leu Ala Gly Thr Val Ile Ala Leu Thr
Ile Ala Thr Pro Leu 195 200 205Leu
Val Ile Phe Ser Pro Val Leu Val Pro Ala Val Ile Thr Ile Phe 210
215 220Leu Leu Gly Ala Gly Phe Leu Ala Ser Gly
Gly Phe Gly Val Ala Ala225 230 235
240Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr Leu Thr Gly Lys His
Pro 245 250 255Pro Gly Ala
Asp Gln Leu Glu Ser Ala Lys Thr Lys Leu Ala Ser Lys 260
265 270Ala Arg Glu Met Lys Asp Arg Ala Glu Gln
Phe Ser Gln Gln Pro Val 275 280
285Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Met Ala Glu His Tyr Gly 290
295 300Gln Gln Gln Gln Thr Arg Ala Pro
His Leu Gln Leu Gln Pro Arg Ala305 310
315 320Gln Arg Val Val Lys Ala Ala Thr Ala Val Thr Ala
Gly Gly Ser Leu 325 330
335Leu Val Leu Ser Gly Leu Thr Leu Ala Gly Thr Val Ile Ala Leu Thr
340 345 350Ile Ala Thr Pro Leu Leu
Val Ile Phe Ser Pro Val Leu Val Pro Ala 355 360
365Val Ile Thr Ile Phe Leu Leu Gly Ala Gly Phe Leu Ala Ser
Gly Gly 370 375 380Phe Gly Val Ala Ala
Leu Ser Val Leu Ser Trp Ile Tyr Arg Tyr Leu385 390
395 400Thr Gly Lys His Pro Pro Gly Ala Asp Gln
Leu Glu Ser Ala Lys Thr 405 410
415Lys Leu Ala Ser Lys Ala Arg Glu Met Lys Asp Arg Ala Glu Gln Phe
420 425 430Ser Gln Gln Pro Val
Pro Trp Leu Glu 435 440