Homo sapiens mRNA for RPGR protein, retina-specific isoform




Homo sapiens RPGR protein.

GenBank: AJ238395, U57629, AF286471S1, AF286471S2

According to: [1], [2]
Alternative splicing is an important feature of RPGR. The currently known transcripts are given according to Kirschner et al. [3] and Vervoort et al. [4].

Nucleotide numbering is given in blue for the retina-specific transcript and continues in cyan for the complete gene. Numbering starts at the first codon, since RPGR exon 1 is a coding exon.
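
As a small illustration of this numbering scheme, the sketch below maps a coding nucleotide number to its codon (amino acid) number and to the position within that codon. The function name `codon_of` is hypothetical. The sketch assumes 1-based numbering that starts at the A of the ATG start codon and counts coding bases only (introns excluded).

```python
def codon_of(nt: int) -> tuple[int, int]:
    """Map a 1-based coding nucleotide number to (codon number, position in codon).

    Assumption: numbering starts at the first base of the start codon and
    counts exonic coding bases only.
    """
    if nt < 1:
        raise ValueError("coding nucleotide numbers start at 1")
    return (nt - 1) // 3 + 1, (nt - 1) % 3 + 1


# Nucleotide 28 is the last base of exon 1 in the listing; it opens codon 10 (Asp).
print(codon_of(28))    # (10, 1)
# Nucleotide 154 is the last base of exon 2; it opens codon 52 (Gly).
print(codon_of(154))   # (52, 1)
# Nucleotide 3468 closes codon 1156 (Lys), the last residue before the stop codon.
print(codon_of(3468))  # (1156, 3)
```

Exon boundaries that fall at position 1 of a codon, as in the first two examples, explain why the residue at the end of one exon line in the listing is repeated at the start of the next.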

Amino acid numbering is given in green.
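
The relation between the nucleotide lines and the amino acid lines of the listing can be sketched as a plain translation with the standard genetic code. The helper `translate` and the way the codon table is built are illustrative and not part of the original annotation. The input is copied from the listing: the 28 coding bases of exon 1 followed by the first coding bases of exon 2.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence into one-letter amino acid codes.

    Whitespace is ignored and case is normalised; a trailing partial codon
    is dropped. Stop codons are rendered as '*'.
    """
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


exon1 = "ATG AGG GAG CCG GAA GAG CTG ATG CCC G"   # coding nucleotides 1-28
exon2_start = "AT TCG GGT GCT"                    # coding nucleotides 29-39
print(translate(exon1 + exon2_start))             # MREPEELMPDSGA
```

The tenth residue (D, Asp) is encoded by a codon that spans the exon 1/exon 2 junction, which is why it appears on both exon lines in the listing.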

Mutations are indicated above the sequence: polymorphisms in black, RP3 mutations in red, and COD1 mutations in violet.
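
The substitution labels used above the sequence, such as `A=Arg (13,16)` or `T=ter (10)`, follow a regular pattern. The sketch below splits such a label into its parts. It assumes the reading "substituted base = resulting residue (reference numbers)". That interpretation, the name `parse_substitution`, and the regular expression are assumptions made for illustration. Labels written in the reverse order (for example `Val=T`) and `del`/`ins` entries are deliberately not handled and return `None`.

```python
import re
from typing import NamedTuple, Optional


class Substitution(NamedTuple):
    base: str      # substituted nucleotide, e.g. "A"
    residue: str   # three-letter residue or "ter", as printed in the listing
    refs: list     # reference numbers from the parenthesised list


# Assumed label shape: one base, '=', three letters, then "(n,n,...)".
_PATTERN = re.compile(r"^\s*([ACGT])=([A-Za-z]{3})\s*\(([\d,\s]+)\)\s*$")


def parse_substitution(label: str) -> Optional[Substitution]:
    """Parse a label like 'A=Arg (13,16)'; return None if it does not match."""
    m = _PATTERN.match(label)
    if m is None:
        return None
    refs = [int(r) for r in m.group(3).split(",") if r.strip()]
    return Substitution(m.group(1), m.group(2), refs)


print(parse_substitution("A=Arg (13,16)"))
print(parse_substitution("T=ter (10)"))
print(parse_substitution("del (13)"))
```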

The major splice forms in the retina are terminated by either exon 15a or ORF15. A complete splice form with all 19 exons is present in the retina but is considered to play no role in the pathogenesis of RP. Minor splice forms in the retina include exons 15b1, 15b2 or ORF14; these minor splice forms are not given here.

A testis-specific splice form lacks exons 14 and 15.

Primers are indicated by arrows |--> and the sources are given according to the list of references at the end of the sequence.
N: Nucleotide binding motifs

I: Isoprenylation motif
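
In the listing that follows, each line of the sense strand is printed with its complementary strand directly beneath it, base for base (complemented, not reversed). A minimal sketch of that relationship, with `complement` as a hypothetical helper that preserves case and spacing:

```python
# Translation table for Watson-Crick pairing; case is preserved so that
# lowercase (non-coding) and uppercase (coding) bases stay distinguishable.
_COMP = str.maketrans("ACGTacgt", "TGCAtgca")


def complement(line: str) -> str:
    """Return the base-for-base complement of one sequence line."""
    return line.translate(_COMP)


# End of the 5' flanking sequence and the first three codons, as printed.
top = "cgtac tgccc gtggc ATG AGG GAG"
print(complement(top))  # gcatg acggg caccg TAC TCC CTC
```

Reading the lower strand in its own 5' to 3' direction additionally requires reversing the line, for example `complement(top)[::-1]`.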


-140 -130 -120 -110 -100 -90 -80 -70

gatct ccgga gtttc gccat gcgga acttg ggggc tttcg cggcc cgcgt cggtg cggag tagct gcttt agccc

ctaga ggcct caaag cggta cgcct tgaac ccccg aaagc gccgg gcgca gccac gcctc atcga cgaaa tcggg

-60 -50 -40 -30 -20 -10 1

|--------------------> 1f (1)

cgacc aacca aaccg tcctc tacag cctcc tggcc ccggc gcagg ctgcc cgtac tgccc gtggc ATG AGG GAG

gctgg ttggt ttggc aggag atgtc ggagg accgg ggccg cgtcc gacgg gcatg acggg caccg TAC TCC CTC

Met Arg Glu

M R E

1

a (13,16)



| a (9)

15 28 | |

CCG GAA GAG CTG ATG CCC G gtgag ttgcg cccgc gccgg tggcg gaagg ccggg aggga gggtc cgcgg

GGC CTT CTC GAC TAC GGG C cactc aacgc gggcg cggcc accgc cttcc ggccc tccct cccag gcgcc

Pro Glu Glu Leu Met Pro Asp

P E E L M P D

5 10

gcggc cgggc cccgc ctccg gatcg gccct gtagg gggac aggcc ccagg cttgg cagcc ggact ggggc

cgccg gcccg gggcg gaggc ctagc cggga catcc ccctg tccgg ggtcc gaacc gtcgg cctga ccccg

<-------------------| 1r (1)
IVS1
g (13,10)

|----------------------------> 2f (1) |

tggat aaatt actca aaagt tattt taata acagg caaca tagta aaagt tataa aataa agacc atctt atatt

accta tttaa tgagt tttca ataaa attat tgtcc gttgt atcat tttca atatt ttatt tctgg tagaa tataa

29 45 60 75 90

tgcag AT TCG GGT GCT GTG TTT ACA TTT GGG AAA AGT AAA TTT GCT GAA AAT AAT CCC GGT AAA TTC

acgtc TA AGC CCA CGA CAC AAA TGT AAA CCC TTT TCA TTT AAA CGA CTT TTA TTA GGG CCA TTT AAG

Asp Ser Gly Ala Val Phe Thr Phe Gly Lys Ser Lys Phe Ala Glu Asn Asn Pro Gly Lys Phe

D S G A V F T F G K S K F A E N N P G K F

NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNN

10 15 20 25 30

A=Arg (13,16) G=Ser (22)

Del(13)T=Phe (21) |A=Glu (13,16) | -MspI(1) ter=T

| 105 | 120 || 135 | 150 154|

TGG TTT AAA AAT GAT GTC CCT GTA CAT CTT TCA TGT GGA GAT GAA CAT TCT GCT GTT GTT ACC G

ACC AAA TTT TTA CTA CAG GGA CAT GTA GAA AGT ACA CCT CTA CTT GTA AGA CGA CAA CAA TGG C

Trp Phe Lys Asn Asp Val Pro Val His Leu Ser Cys Gly Asp Glu His Ser Ala Val Val Thr Gly

W F K N D V P V H L S C G D E H S A V V T G

NNNNNNNNNNNNNNNNNNN

35 40 45 50 52

gttag tatta caatc ctgaa taagg acagt aatac ttaga aaatt ctaaa tattt tgtta tagat atgct gctgt

caatc ataat gttag gactt attcc tgtca ttatg aatct tttaa gattt ataaa acaat atcta tacga cgaca



<-------------------------| 2r (1)

gtttt aaaca tgttg

caaaa tttgt acaac
IVS2
|------------------------> 3f (1)

atttc aacac agaaa agcag tgaca tttta aaagt atagc ttttt attgc tttgt ggtga cctca tctta catta

taaag ttgtg tcttt tcgtc actgt aaaat tttca tatcg aaaaa taacg aaaca ccact ggagt agaat gtaat

g (18,22) (10,16,13) Val=T

|155 165 |

tgtga aaaat gcttc aattg attat ttctt tttcc ctcct accag GA AAT AAT AAA CTT TAC ATG TTT GGC

acact tttta cgaag ttaac taata aagaa aaagg gagga tggtc CT TTA TTA TTT GAA ATG TAC AAA CCG

Gly Asn Asn Lys Leu Tyr Met Phe Gly

G N N K L Y M F G

52 55 60


del (11,12)

195 210 225 |-| 247

AGT AAC AAC TGG GGT CAG TTA GGA TTA GGA TCA AAG TCA GCC ATC AGC AAG CCA ACA TGT GTC AAA G

TCA TTG TTG ACC CCA GTC AAT CCT AAT CCT AGT TTC AGT CGG TAG TCG TTC GGT TGT ACA CAG TTT C

Ser Asn Asn Trp Gly Gln Leu Gly Leu Gly Ser Lys Ser Ala Ile Ser Lys Pro Thr Cys Val Lys Ala

S N N W G Q L G L G S K S A I S K P T C V K A

65 70 75 80 83

gtttg ggatc ttttt atctt ttcac aaact gtata aagta ttttt atgtt gactg tgtag ttctt taatg tcata

caaac cctag aaaaa tagaa aagtg tttga catat ttcat aaaaa tacaa ctgac acatc aagaa attac agtat

3r (1) <------------------------|

aataa taaa

ttatt attt


IVS3
4f (1) |------------------------>

acttt ggaat cagac aagcc cgcta ttaaa tctcg gttct atcac ttagt tacca tttgt cctgg actac tgttc

tgaaa cctta gtctg ttcgg gcgat aattt agagc caaga tagtg aatca atggt aaaca ggacc tgatg acaag

a (13)


| 248 255 270

atttt cattg ctata aaata attgt cccat tttaa atatt tttag CT CTA AAA CCT GAA AAA GTG AAA TTA

taaaa gtaac gatat tttat taaca gggta aaatt tataa aaatc GA GAT TTT GGA CTT TTT CAC TTT AAT

Ala Leu Lys Pro Glu Lys Val Lys Leu


A L K P E K V K L

83 85 90

A=Gln (1)

| insCA (8) c (13)

285 | | 300 310 |

GCT GCC TGT GGA AGG AAC CAC ACC CTG GTG TCA ACA G gtata gtgct caacc tgtat gattt cctgc

CGA CGG ACA CCT TCC TTG GTG TGG GAC CAC AGT TGT C catat cacga gttgg acata ctaaa ggacg

Ala Ala Cys Gly Arg Asn His Thr Leu Val Ser Thr Glu

A A C G R N H T L V S T E

95 100 104

ctgac tttga gaact gaggt ctcat tccag taacg tggct ttgta aaatt aatag agatt atact ttggc aacag

gactg aaact cttga ctcca gagta aggtc attgc accga aacat tttaa ttatc tctaa tatga aaccg ttgtc

<-----------------------| 4r (1)
IVS4
|------------------------> 5f (1) 311 315

gacct gtctc ataaa aaggg ggact ctata gtcta ttgac gtttt ctttt ccatg tgcag AA GGA GGC AAT

ctgga cagag tattt ttccc cctga gatat cagat aactg caaaa gaaaa ggtac acgtc TT CCT CCG TTA

Glu Gly Gly Asn
E G G N

104 105

del (13,16) del (10) G=Gly (13,16)

330 345 | 360 | 375 |

GTA TAT GCA ACT GGT GGA AAT AAT GAA GGA CAG TTG GGG CTT GGT GAC ACC GAA GAA AGA AAC ACT

CAT ATA CGT TGA CCA CCT TTA TTA CTT CCT GTC AAC CCC GAA CCA CTG TGG CTT CTT TCT TTG TGA

Val Tyr Ala Thr Gly Gly Asn Asn Glu Gly Gln Leu Gly Leu Gly Asp Thr Glu Glu Arg Asn Thr

V Y A T G G N N E G Q L G L G D T E E R N T

110 115 120 125

G=Cys (2) T=ter (5)

| 405 | 420 435 450

TTT CAT GTA ATT AGC TTT TTT ACA TCC GAG CAT AAG ATT AAG CAG CTG TCT GCT GGA TCT AAT ACT

AAA GTA CAT TAA TCG AAA AAA TGT AGG CTC GTA TTC TAA TTC GTC GAC AGA CGA CCT AGA TTA TGA

Phe His Val Ile Ser Phe Phe Thr Ser Glu His Lys Ile Lys Gln Leu Ser Ala Gly Ser Asn Thr

F H V I S F F T S E H K I K Q L S A G S N T

130 135 140 145 150

t (11)


465 469 |

TCA GCT GCC CTA ACT G gtgag acttg tttct ttttc agtct gggac acatt ccttt cctgt agatc aatat

AGT CGA CGG GAT TGA C cactc tgaac aaaga aaaag tcaga ccctg tgtaa ggaaa ggaca tctag ttata

Ser Ala Ala Leu Thr Glu <-------------------| 5r (1)


S A A L T E

155 157

atgcc aattc agtaa accga aggga

tacgg ttaag tcatt tggct tccct

IVS5
|-------------------> 6f (1)

atggt ctttt ggaag tataa tagct tcaga gcctg gctac ctttt aaaac ctttt ctccc ccttt tattt ttctt

tacca gaaaa ccttc atatt atcga agtct cggac cgatg gaaaa ttttg gaaaa gaggg ggaaa ataaa aagaa

del (12,15,16)

|del (13) A=ter (10) del (5,15)

470 480 || | 495 510 | 525

tatag AG GAT GGA AGA CTT TTT ATG TGG GGT GAC AAT TCC GAA GGG CAA ATT GGT TTA AAA AAT GTA

atatc TC CTA CCT TCT GAA AAA TAC ACC CCA CTG TTA AGG CTT CCC GTT TAA CCA AAT TTT TTA CAT

Glu Asp Gly Arg Leu Phe Met Trp Gly Asp Asn Ser Glu Gly Gln Ile Gly Leu Lys Asn Val

E D G R L F M W G D N S E G Q I G L K N V

157 160 165 170 175

A=ter (1) +BglII

540 555 570 | 585

AGT AAT GTC TGT GTC CCT CAG CAA GTG ACC ATT GGG AAA CCT GTC TCC TGG ATC TCT TGT GGA TAT

TCA TTA CAG ACA CAG GGA GTC GTT CAC TGG TAA CCC TTT GGA CAG AGG ACC TAG AGA ACA CCT ATA

Ser Asn Val Cys Val Pro Gln Gln Val Thr Ile Gly Lys Pro Val Ser Trp Ile Ser Cys Gly Tyr

S N V C V P Q Q V T I G K P V S W I S C G Y

180 185 190 195

A=ter (13,16) a (9)

600 | 615 619 |

TAC CAT TCA GCT TTT GTA ACA A gtaag acaca ttttg aagtc atcat gttat ctccc acttc tatgt tgtta

ATG GTA AGT CGA AAA CAT TGT T cattc tgtgt aaaac ttcag tagta caata gaggg tgaag ataca acaat

Tyr His Ser Ala Phe Val Thr Thr 6r (1) <-------------------------|

Y H S A F V T T

200 205 207

ttaaa ttaata

aattt aattat


IVS6
C (13)

|-------------------------> 7f (1) |

gaaag gtcaa atgta ttaag tgatc cttgt caaaa ctttt ttttg acggt aagac cagct ttttg ttcat atgta

ctttc cagtt tacat aattc actag gaaca gtttt gaaaa aaaac tgcca ttctg gtcga aaaac aagta tacat

T=Val (1)

620 630 |

tgtca tcact aatca ctata ctttt ccttc aatag CA GAT GGT GAG CTA TAT GTG TTT GGA GAA CCT GAG

acagt agtga ttagt gatat gaaaa ggaag ttatc GT CTA CCA CTC GAT ATA CAC AAA CCT CTT GGA CTC

Thr Asp Gly Glu Leu Tyr Val Phe Gly Glu Pro Glu

T D G E L Y V F G E P E

207 210 215

T=Ser (2)

| T=ter (10)

660 675 690 | | 720

AAT GGG AAG TTA GGT CTT CCC AAT CAG CTC CTG GGC AAT CAC AGA ACA CCC CAG CTG GTG TCT GAA

TTA CCC TTC AAT CCA GAA GGG TTA GTC GAG GAC CCG TTA GTG TCT TGT GGG GTC GAC CAC AGA CTT

Asn Gly Lys Leu Gly Leu Pro Asn Gln Leu Leu Gly Asn His Arg Thr Pro Gln Leu Val Ser Glu

N G K L G L P N Q L L G N H R T P Q L V S E

220 225 230 235 240

del (16)


T=ter (3) | C=Arg (1) +DsaI

| A=Lys (13,16) | | del (14) a (10)

| | 735 | | |-| 765 778 |

ATT CCG GAG AAG GTG ATC CAA GTA GCC TGT GGT GGA GAG CAT ACT GTG GTT CTC ACG G gtatg tgtgg

TAA GGC CTC TTC CAC TAG GTT CAT CGG ACA CCA CCT CTC GTA TGA CAC CAA GAG TGC C catac acacc

Ile Pro Glu Lys Val Ile Gln Val Ala Cys Gly Gly Glu His Thr Val Val Leu Thr Glu

I P E K V I Q V A C G G E H T V V L T E

245 250 255 260

agtcc actgt tctgt tccct gcgtt ctgtg gtggc taatg aaatt ttttt aatgt tcatt ttttt tatgt tcatt

tcagg tgaca agaca aggga cgcaa gacac caccg attac tttaa aaaaa ttaca agtaa aaaaa ataca agtaa



<---------------------| 7r (1)
IVS7
|-----------------------> 8f (1)

gttga acgtg acagt ttttc cagat atgat ggtaa cagtg tctgc ttcca taatc cttgt cttgt atttg cccct

caact tgcac tgtca aaaag gtcta tacta ccatt gtcac agacg aaggt attag gaaca gaaca taaac gggga

(13) del


A=Ser (2) |

A (16) | (16) del |-

| 795 810 | (16) del |-

ttcct ttcag AG AAT GCT GTG TAT ACC TTT GGG CTG GGA CAA TTT GGT CAG CTG GGT CTT GGC ACT

aagga aagtc TC TTA CGA CAC ATA TGG AAA CCC GAC CCT GTT AAA CCA GTC GAC CCA GAA CCG TGA

Glu Asn Ala Val Tyr Thr Phe Gly Leu Gly Gln Phe Gly Gln Leu Gly Leu Gly Thr

E N A V Y T F G L G Q F G Q L G L G T

260 265 270 275


|--del (5)----------------------------------------------------------------------------

| G=Val (22) |---del (1)-------|

-| | del (10,13) | del (9) |

|| 840 855 | | 885 | |-| |

TTT CTT TTT GAA ACT TCA GAA CCC AAA GTC ATT GAG AAT ATT AGG GAT GAA ACA ATA AGT TAT ATT

AAA GAA AAA CTT TGA AGT CTT GGG TTT CAG TAA CTC TTA TAA TCC CTA CTT TGT TAT TCA ATA TAA

Phe Leu Phe Glu Thr Ser Glu Pro Lys Val Ile Glu Asn Ile Arg Asp Glu Thr Ile Ser Tyr Ile

F L F E T S E P K V I E N I R D E T I S Y I

280 285 290 295 300

---------------------------------------------------------------------------------------

A=Asn, T=Tyr (16)

T=Arg (9) | a (22)

|A=Tyr (16,15) | c (2,3,5,20)

|| 915 930 | |

TCT TGT GGA GAA AAT CAC ACA GCT TTG ATA ACA G gtatg gatta gaatt tctgt tatat attta tagaa

AGA ACA CCT CTT TTA GTG TGT CGA AAC TAT TGT C catac ctaat cttaa agaca atata taaat atctt

Ser Cys Gly Glu Asn His Thr Ala Leu Ile Thr Asp

S C G E N H T A L I T D

305 310 312

-----------------------------------------------------------------------------------------

ctggg tatat tttat tacaa aataa attat atgtg aaatg aatgt atcta actta agtaa aggaa ttaat ggctg

gaccc atata aaata atgtt ttatt taata tacac tttac ttaca tagat tgaat tcatt tcctt aatta ccgac



<---------------------
---->

agaaa


tcttt

--| 8r (1)


IVS8
|---------------------------> 9f (1)

gtatg agtta acatg tttat ctgtt tacaa ataga tccat acaag taaca ctttt ataat tctct tcatt atttt

catac tcaat tgtac aaata gacaa atgtt tatct aggta tgttc attgt gaaaa tatta agaga agtaa taaaa

A=Arg (16) G=ter (16)

935 945 | 975 | 990

tgcat tttag AT ATC GGC CTT ATG TAT ACT TTT GGA GAT GGT CGC CAC GGA AAA TTA GGA CTT GGA

acgta aaatc TA TAG CCG GAA TAC ATA TGA AAA CCT CTA CCA GCG GTG CCT TTT AAT CCT GAA CCT

Asp Ile Gly Leu Met Tyr Thr Phe Gly Asp Gly Arg His Gly Lys Leu Gly Leu Gly


D I G L M Y T F G D G R H G K L G L G

312 315 320 325 330

T=ter (6) G=Asp (13)

| 1005 1020 | 1050

CTG GAG AAT TTT ACC AAT CAC TTC ATT CCT ACT TTG TGC TCT AAT TTT TTG AGG TTT ATA GTT AAA

GAC CTC TTA AAA TGG TTA GTG AAG TAA GGA TGA AAC ACG AGA TTA AAA AAC TCC AAA TAT CAA TTT

Leu Glu Asn Phe Thr Asn His Phe Ile Pro Thr Leu Cys Ser Asn Phe Leu Arg Phe Ile Val Lys

L E N F T N H F I P T L C S N F L R F I V K

335 340 345 350

1059

TTG gtaag ggcat cacat ttccc tgttt atatt ttcat attac tgtat ttgag tcttt cagta tttca tggaa



AAC cattc ccgta gtgta aaggg acaaa tataa aagta taatg acata aactc agaaa gtcat aaagt acctt

Leu <-----------------------| 9r (1)

L

353
aataa gtaat tttat



ttatt catta aaata
IVS9
10f (1) |---------------------->

accac taaat aaatg tctta cagct gataa ctgtt aaaaa ctcat ttgtg tttga agcca atgtt gatga gtttt

tggtg attta tttac agaat gtcga ctatt gacaa ttttt gagta aacac aaact tcggt tacaa ctact caaaa
ins T (16,2,3,13) del (13,16)

1060 1065 1080 | 1095 |

tctgt ggatt tatgc tgcag GTT GCT TGT GGT GGA TGT CAC ATG GTA GTT TTT GCT GCT CCT CAT CGT

agaca cctaa atacg acgtc CAA CGA ACA CCA CCT ACA GTG TAC CAT CAA AAA CGA CGA GGA GTA GCA

Val Ala Cys Gly Gly Cys His Met Val Val Phe Ala Ala Pro His Arg

V A C G G C H M V V F A A P H R

353 355 360 365

T=ter (10) A=Ala (13)

1110 | 1125 1140 1155 | 1170

GGT GTG GCA AAA GAA ATT GAA TTC GAT GAA ATA AAT GAT ACT TGC TTA TCT GTG GCG ACT TTT CTG

CCA CAC CGT TTT CTT TAA CTT AAG CTA CTT TAT TTA CTA TGA ACG AAT AGA CAC CGC TGA AAA GAC

Gly Val Ala Lys Glu Ile Glu Phe Asp Glu Ile Asn Asp Thr Cys Leu Ser Val Ala Thr Phe Leu

G V A K E I E F D E I N D T C L S V A T F L

370 375 380 385 390

1185 1200 1215 1230

CCG TAT AGC AGT TTA ACC TCA GGA AAT GTA CTG CAG AGG ACT CTA TCA GCA CGT ATG CGG CGA AGA

GGC ATA TCG TCA AAT TGG AGT CCT TTA CAT GAC GTC TCC TGA GAT AGT CGT GCA TAC GCC GCT TCT

Pro Tyr Ser Ser Leu Thr Ser Gly Asn Val Leu Gln Arg Thr Leu Ser Ala Arg Met Arg Arg Arg

P Y S S L T S G N V L Q R T L S A R M R R R

395 400 405 410
ins AG (14)

| g (20) g (13)

1245 | |

GAG AGG gtaca attgt ggcct attct gtata atagt attat ctagc actgc aaaga aaact agaag aaaaa

CTC TCC catgt taaca ccgga taaga catat tatca taata gatcg tgacg tttct tttga tcttc ttttt

Glu Arg 10r (1) <-----------------------|


E R

415

aatct gtttt

ttaga caaaa
IVS10
|------------------------> 11f (1)

tgtt ggcat acttt gaact ttctc ttaaa tattt tagca acttt tagga aagtt tatat caata ttaaa tatat

acaa ccgta tgaaa cttga aagag aattt ataaa atcgt tgaaa atcct ttcaa atata gttat aattt atata

|------del (10)----

del | (10,13) Lys=A

| 1246 1260 | |

aacat catga aatag tttgt ttaac ctgct tattt taaag GAG AGG TCT CCA GAT TCT TTT TCA ATG AGG

ttgta gtact ttatc aaaca aattg gacga ataaa atttc CTC TCC AGA GGT CTA AGA AAA AGT TAC TCC

Glu Arg Ser Pro Asp Ser Phe Ser Met Arg

E R S P D S F S M R

416 420 425
----| G=Val (13,10) A=Asp (2,3,16)

| 1290 | 1305 | 1320 1335

AGA ACA CTA CCT CCA ATA GAA GGG ACT CTT GGC CTT TCT GCT TGT TTT CTC CCC AAT TCA GTC TTT

TCT TGT GAT GGA GGT TAT CTT CCC TGA GAA CCG GAA AGA CGA ACA AAA GAG GGG TTA AGT CAG AAA

Arg Thr Leu Pro Pro Ile Glu Gly Thr Leu Gly Leu Ser Ala Cys Phe Leu Pro Asn Ser Val Phe

R T L P P I E G T L G L S A C F L P N S V F

430 435 440 445

del (13,14,16) del (2)

1350 1365 ||1380 1395 |---|

CCA CGA TGT TCT GAG AGA AAC CTC CAA GAG AGT GTC TTA TCT GAA CAG GAC CTC ATG CAG CCA GAG

GGT GCT ACA AGA CTC TCT TTG GAG GTT CTC TCA CAG AAT AGA CTT GTC CTG GAG TAC GTC GGT CTC

Pro Arg Cys Ser Glu Arg Asn Leu Gln Glu Ser Val Leu Ser Glu Gln Asp Leu Met Gln Pro Glu

P R C S E R N L Q E S V L S E Q D L M Q P E

450 455 460 465

1410 1414

GAA CCA G gtaac atttg ctact ctgtc tgctc tcagg ataaa tgtgc ctttt attcc tgtag ttttt atgtt

CTT GGT C cattg taaac gatga gacag acgag agtcc tattt acacg gaaaa taagg acatc aaaaa tacaa

Glu Pro Asp <-----------------------| 11r (1)

E P D

470 472


tatct taaga tgttt tctcc agtta tttaa attgt tctct ccctg gttag agcct aacat cactc atttg aactc

ataga attct acaaa agagg tcaat aaatt taaca agaga gggac caatc tcgga ttgta gtgag taaac ttgag

aaatc cggga atatc cttac aagta gaaag cacca tcagt gctcc taaga ctgcc atgta accac agcag catat

tttag gccct tatag gaatg ttcat ctttc gtggt agtca cgagg attct gacgg tacat tggtg tcgtc gtata

cttta tagag cagtg gataa cagtt atagt acatt cctta acttt atccc cttta aaccc tgaaa ccact tgatg

gaaat atctc gtcac ctatt gtcaa tatca tgtaa ggaat tgaaa taggg gaaat ttggg acttt ggtga actac

aactt aggag ggagt tatag tcttc atttt agagat

ttgaa tcctc cctca atatc agaag taaaa tctcta


IVS11
|----------------------

atgag gttaa aggac tttta gtaaa attat accta cactt tatat tatag aataa tttct gtttt ctgtc cagtt

tactc caatt tcctg aaaat cattt taata tggat gtgaa atata atatc ttatt aaaga caaaa gacag gtcaa

-> 12f (1) 1415 1425

gcctt tcact tttct gttga aatct ggatt tcctt tttcc cattt aatca ttcag AT TAT TTG CTA GAT GAA

cggaa agtga aaaga caact ttaga cctaa aggaa aaagg gtaaa ttagt aagtc TA ATA AAC GAT CTA CTT

Asp Tyr Leu Leu Asp Glu

D Y L L D E

472 475

del (6)


1440 1455 1470 | 1485

ATG ACC AAA GAA GCA GAG ATA GAT AAT TCT TCA ACT GTA GAA AGC CTT GGA GAA ACT ACT GAT ATC

TAC TGG TTT CTT CGT CTC TAT CTA TTA AGA AGT TGA CAT CTT TCG GAA CCT CTT TGA TGA CTA TAG

Met Thr Lys Glu Ala Glu Ile Asp Asn Ser Ser Thr Val Glu Ser Leu Gly Glu Thr Thr Asp Ile


M T K E A E I D N S S T V E S L G E T T D I

480 485 490 495

1500 1506

TTA AAC ATG gtaat gattt cctat ctatt tatct aaata tgagt ttgaa tttta gttca ttgtg tatgt gaatg

AAT TTG TAC catta ctaaa ggata gataa ataga tttat actca aactt aaaat caagt aacac ataca cttac

Leu Asn Met <----------------------

L N M

500 502

tgtgt atact

acaca tatga

--| 12r (1)


IVS12
|del| (13)

| a (13)


| |(13,c

|-------------------------> 13f (1) | | 10)|

aattg agagc ggatt ctatt actat ctgaa ctgtt ttccc ttcat tttat atata tatat atata aaaat atata

ttaac tctcg cctaa gataa tgata gactt gacaa aaggg aagta aaata tatat atata tatat tttta tatat

c (10)

|

tataa ttgaa aatat actat atatt tgtac tatat acaaa tatat agtat atata ctgta ttctt tttat aatat



atatt aactt ttata tgata tataa acatg atata tgttt atata tcata tatat gacat aagaa aaata ttata

del (7,10)

1507|| 1515 1530 1545

atatt tttta tttat ttcag ACA CAC ATC ATG AGC CTG AAT TCC AAT GAA AAG TCA TTA AAA TTA TCA

tataa aaaat aaata aagtc TGT GTG TAG TAC TCG GAC TTA AGG TTA CTT TTC AGT AAT TTT AAT AGT

Thr His Ile Met Ser Leu Asn Ser Asn Glu Lys Ser Leu Lys Leu Ser


T H I M S L N S N E K S L K L S

503 505 510 515

g (13)


1560 1572 |

CCA GTT CAG AAA CAA AAG gtata atatg aagat ttatt ttcgt tattg ttggt gagag cagtt taagt taatt

GGT CAA GTC TTT GTT TTC catat tatac ttcta aataa aagca ataac aacca ctctc gtcaa attca attaa

Pro Val Gln Lys Gln Lys 13r (1)<-------------------------|


P V Q K Q K

520 524

ttttg


aaaac
IVS13
|---

cttg


gaac
----------------------> 14f (1)

ctatt ttgat tctcc taaac ttata aaaat gtaaa atctt ggcag tattt aaagt agata agttg tcctt gtcaa

gataa aacta agagg atttg aatat tttta cattt tagaa ccgtc ataaa tttca tctat tcaac aggaa cagtt

g (22)


|a (13)

g (20) || |---| del (13,10) (16) Met=T

| || | | 1590 |

ctaat agtag tatga catat gataa ttact tcaaa aaatt tacag AAA CAA CAA ACA ATT GGG GAA CTG ACG

gatta tcatc atact gtata ctatt aatga agttt tttaa atgtc TTT GTT GTT TGT TAA CCC CTT GAC TGC

Lys Gln Gln Thr Ile Gly Glu Leu Thr

K Q Q T I G E L T

525 530


1605 1620 1635 1650 1665

CAG GAT ACA GCT CTT ACT GAA AAC GAT GAT AGT GAT GAA TAT GAA GAA ATG TCA GAA ATG AAA GAA

GTC CTA TGT CGA GAA TGA CTT TTG CTA CTA TCA CTA CTT ATA CTT CTT TAC AGT CTT TAC TTT CTT

Gln Asp Thr Ala Leu Thr Glu Asn Asp Asp Ser Asp Glu Tyr Glu Glu Met Ser Glu Met Lys Glu


Q D T A L T E N D D S D E Y E E M S E M K E

535 540 545 550 555

A=Glu (10,13,15,16)

1680 1695 | 1710 1725

GGG AAA GCA TGT AAA CAA CAT GTG TCA CAA GGG ATT TTC ATG ACG CAG CCA GCT ACG ACT ATC GAA

CCC TTT CGT ACA TTT GTT GTA CAC AGT GTT CCC TAA AAG TAC TGC GTC GGT CGA TGC TGA TAG CTT

Gly Lys Ala Cys Lys Gln His Val Ser Gln Gly Ile Phe Met Thr Gln Pro Ala Thr Thr Ile Glu

G K A C K Q H V S Q G I F M T Q P A T T I E

560 565 570 575

1740 1744

GCA TTT TCA GAT G gtaat gacac aggcc aggtg ggacc tcagg ctgac actga tggag agggt ttaca aaaag

CGT AAA AGT CTA C catta ctgtg tccgg tccac cctgg agtcc gactg tgact acctc tccca aatgt ttttc

Ala Phe Ser Asp Gly <---------------------| 14r (1)


A F S D G

580 582

aggta tatag acatg aaaat aataa tggtg ttgat caact tgatg ctaag gaaat agaaa aggaa agtga tggag

tccat atatc tgtac tttta ttatt accac aacta gttga actac gattc cttta tcttt tcctt tcact acctc
gacac agtca gaagg aatca gaagc agaag aaata gacag tgaga aggaa actaa actgg cagaa atagc aggta

ctgtg tcagt cttcc ttagt cttcg tcttc tttat ctgtc actct tcctt tgatt tgacc gtctt tatcg tccat


tgaag gattt aagag aaagg gaaaa gagta caaaa aagat gagtc ctttc tttgg caact tacca gatag aggta

acttc ctaaa ttctc tttcc ctttt ctcat gtttt ttcta ctcag gaaag aaacc gttga atggt ctatc tccat


tgaat actga gagtg aagaa aataa agatt ttgtt aagaa aaggg aaagt tgcaa gcaag atgtg atctt tgaca

actta tgact ctcac ttctt ttatt tctaa aacaa ttctt ttccc tttca acgtt cgttc tacac tagaa actgt


gtgaa agaga atcag tagaa aagcc agaca gttac atgga aggtg caagt gagag tcaac agggt atagc tgatg

cactt tctct tagtc atctt ttcgg tctgt caatg tacct tccac gttca ctctc agttg tccca tatcg actac


gattc cagca gcctg aggca ataga attta gtagt ggaga gaaag aggat gatga agtgg aaact gacca aaaca

ctaag gtcgt cggac tccgt tatct taaat catca cctct ctttc tccta ctact tcacc tttga ctggt tttgt

|------------------------> 15f (1)

tacgg tatgg cagga aattg attga acaag gaaat gaaaa agaga ctaaa cccat aatat ccaaa tccat ggcaa

atgcc atacc gtcct ttaac taact tgttc cttta ctttt tctct gattt gggta ttata ggttt aggta ccgtt
agtat gattt taaat gtgat cgctt gtcag

tcata ctaaa attta cacta gcgaa cagtc

1745 1755 1770 1785 1800

|------------------------> ORF15-1f (23)

AG GAA GTA GAG ATC CCA GAG GAG AAG GAA GGA GCA GAG GAT TCA AAA GGA AAT GGA ATA GAG GAG

TC CTT CAT CTC TAG GGT CTC CTC TTC CTT CCT CGT CTC CTA AGT TTT CCT TTA CCT TAT CTC CTC

Glu Glu Val Glu Ile Pro Glu Glu Lys Glu Gly Ala Glu Asp Ser Lys Gly Asn Gly Ile Glu Glu

E E V E I P E E K E G A E D S K G N G I E E

582 585 590 595 600

ins A (13)

1815 1830 | 1845 1860 1875

CAA GAG GTA GAA GCA AAT GAG GAA AAT GTG AAG GTG CAT GGA GGA AGA AAG GAG AAA ACA GAG ATC

GTT CTC CAT CTT CGT TTA CTC CTT TTA CAC TTC CAC GTA CCT CCT TCT TTC CTC TTT TGT CTC TAG

Gln Glu Val Glu Ala Asn Glu Glu Asn Val Lys Val His Gly Gly Arg Lys Glu Lys Thr Glu Ile

Q E V E A N E E N V K V H G G R K E K T E I

605 610 615 620 625

1890 1905

CTA TCA GAT GAC CTT ACA GAC AAA GCA GAG

GAT AGT CTA CTG GAA TGT CTG TTT CGT CTC

Leu Ser Asp Asp Leu Thr Asp Lys Ala Glu


L S D D L T D K A E

630 635


ORF15


1920 1935 1950 1965

GTG AGT GAA GGC AAG GCA AAA TCA GTG GGA GAA GCA GAG GAT GGG CCT GAA GGT AGA GGG GAT GGA

CAC TCA CTT CCG TTC CGT TTT AGT CAC CCT CTT CGT CTC CTA CCC GGA CTT CCA TCT CCC CTA CCT

<-----------------------| 15r (1)

Val Ser Glu Gly Lys Ala Lys Ser Val Gly Glu Ala Glu Asp Gly Pro Glu Gly Arg Gly Asp Gly

V S E G K A K S V G E A E D G P E G R G D G

640 645 650 655

T=ter (19)

1980 1995 | 2025

|----------

ACC TGT GAG GAA GGT AGT TCA GGA GCA GAA CAC TGG CAA GAT GAG GAG AGG GAG AAG GGG GAG AAA

TGG ACA CTC CTT CCA TCA AGT CCT CGT CTT GTG ACC GTT CTA CTC CTC TCC CTC TTC CCC CTC TTT

Thr Cys Glu Glu Gly Ser Ser Gly Ala Glu His Trp Gln Asp Glu Glu Arg Glu Lys Gly Glu Lys

T C E E G S S G A E H W Q D E E R E K G E K

660 665 670 675


G=Glu (16) T=ter (16)

2040 | 2070 | 2085 2100

GAC AAG GGT AGA GGA GAA ATG GAG AGG CCA GGA GAG GGA GAG AAG GAA CTA GCA GAG AAG GAA GAA

CTG TTC CCA TCT CCT CTT TAC CTC TCC GGT CCT CTC CCT CTC TTC CTT GAT CGT CTC TTC CTT CTT

-------------> ORF15-2f (23) <--------

Asp Lys Gly Arg Gly Glu Met Glu Arg Pro Gly Glu Gly Glu Lys Glu Leu Ala Glu Lys Glu Glu

D K G R G E M E R P G E G E K E L A E K E E

680 685 690 695 700

del (16)

| | ins 4 bp (16) |----------------------dup (5)----

|---| | 2130 2145 2160

TGG AAG AAG AGG GAT GGG GAA GAG CAG GAG CAA AAG GAG AGG GAG CAG GGC CAT CAG AAG GAA AGA

ACC TTC TTC TCC CTA CCC CTT CTC GTC CTC GTT TTC CTC TCC CTC GTC CCG GTA GTC TTC CTT TCT

----------------| ORF15-1r (23)

Trp Lys Lys Arg Asp Gly Glu Glu Gln Glu Gln Lys Glu Arg Glu Gln Gly His Gln Lys Glu Arg

W K K R D G E E Q E Q K E R E Q G H Q K E R

705 710 715 720

T=ter (16)

| A=Glu (16)

-------------------------------------------------------------| | | A=Glu (19) del

2175 2190 2205 | | | | |-

AAC CAA GAG ATG GAG GAG GGA GGG GAG GAG GAG CAT GGA GAA GGA GAA GAA GAG GAG GGA GAC AGA

TTG GTT CTC TAC CTC CTC CCT CCC CTC CTC CTC GTA CCT CTT CCT CTT CTT CTC CTC CCT CTG TCT

Asn Gln Glu Met Glu Glu Gly Gly Glu Glu Glu His Gly Glu Gly Glu Glu Glu Glu Gly Asp Arg

N Q E M E E G G E E E H G E G E E E E G D R

725 730 735 740 745

T=ter (5) T=Gly (19)

(16)del (5) | del (16) | T=ter (16) del (16)

-| || | || | | 2265 || 2280 2295

GAA GAG GAA GAA GAG AAG GAG GGA GAA GGG AAA GAG GAA GGA GAA GGG GAA GAA GTG GAG GGA GAA

CTT CTC CTT CTT CTC TTC CTC CCT CTT CCC TTT CTC CTT CCT CTT CCC CTT CTT CAC CTC CCT CTT

Glu Glu Glu Glu Glu Lys Glu Gly Glu Gly Lys Glu Glu Gly Glu Gly Glu Glu Val Glu Gly Glu

E E E E E K E G E G K E E G E G E E V E G E

750 755 760 765

G=Arg (5)

del (16) del (16)

del (19) del(19) | A=Thr (16,19) ||

2310 | | 2325 | | 2355 ||-------------

|----

CGT GAA AAG GAG GAA GGA GAG AGG AAA AAG GAG GAA AGA GCG GGG AAG GAG GAG AAA GGA GAG GAA



GCA CTT TTC CTC CTT CCT CTC TCC TTT TTC CTC CTT TCT CGC CCC TTC CTC CTC TTT CCT CTC CTT

Arg Glu Lys Glu Glu Gly Glu Arg Lys Lys Glu Glu Arg Ala Gly Lys Glu Glu Lys Gly Glu Glu

R E K E E G E R K K E E R A G K E E K G E E

770 775 780 785

del(5) del (5,19) del (5)

del (19) | | del (16) del (16) ||

---------| 2385 |-| |--|| |-| ||2430

--------------------> ORF15-3 (23)

GAA GGA GAC CAA GGA GAG GGG GAA GAG GAG GAA ACA GAG GGG AGA GGG GAG GAA AAA GAG GAG GGA

CTT CCT CTG GTT CCT CTC CCC CTT CTC CTC CTT TGT CTC CCC TCT CCC CTC CTT TTT CTC CTC CCT

Glu Gly Asp Gln Gly Glu Gly Glu Glu Glu Glu Thr Glu Gly Arg Gly Glu Glu Lys Glu Glu Gly

E G D Q G E G E E E E T E G R G E E K E E G

790 795 800 805 810

del (5)|-----del (5,24)---| (16) ins G

|---| | 2460 | 2475 2490 |

GGG GAA GTA GAG GGA GGG GAA GTA GAG GAG GGG AAA GGA GAG AGG GAA GAG GAA GAG GAG GAG GGT

CCC CTT CAT CTC CCT CCC CTT CAT CTC CTC CCC TTT CCT CTC TCC CTT CTC CTT CTC CTC CTC CCA

<------------------------| ORF15-2r (23)

Gly Glu Val Glu Gly Gly Glu Val Glu Glu Gly Lys Gly Glu Arg Glu Glu Glu Glu Glu Glu Gly

G E V E G G E V E E G K G E R E E E E E E G

815 820 825 830

del (5) T=ter (5)

|--------del (5)------|----|

2505 2520 | 2535 | 2550 | | 2565

GAG GGG GAA GAG GAG GAA GGG GAG GGG GAA GAG GAG GAA GGG GAG GGG GAA GAG GAG GAA GGA GAA

CTC CCC CTT CTC CTC CTT CCC CTC CCC CTT CTC CTC CTT CCC CTC CCC CTT CTC CTC CTT CCT CTT

Glu Gly Glu Glu Glu Glu Gly Glu Gly Glu Glu Glu Glu Gly Glu Gly Glu Glu Glu Glu Gly Glu


E G E E E E G E G E E E E G E G E E E E G E


835 840 845 850 855

del (17) |----del (5)------|

| ins GG (17) ins A (5)

2580 || 2595 | 2610 | | 2625|

GGG AAA GGG GAG GAA GAA GGG GAA GAA GGA GAA GGG GAG GAA GAA GGG GAG GAA GGA GAA GGG GAG

CCC TTT CCC CTC CTT CTT CCC CTT CTT CCT CTT CCC CTC CTT CTT CCC CTC CTT CCT CTT CCC CTC

Gly Lys Gly Glu Glu Glu Gly Glu Glu Gly Glu Gly Glu Glu Glu Gly Glu Glu Gly Glu Gly Glu

G K G E E E G E E G E G E E E G E E G E G E

860 865 870 875

T=ter (5) del (5)

2640 | 2655 |--| 2685

GGG GAA GAG GAG GAA GGA GAA GGG GAG GGA GAA GAG GAA GGA GAA GGG GAG GGA GAA GAG GAG GAA

CCC CTT CTC CTC CTT CCT CTT CCC CTC CCT CTT CTC CTT CCT CTT CCC CTC CCT CTT CTC CTC CTT

Gly Glu Glu Glu Glu Gly Glu Gly Glu Gly Glu Glu Glu Gly Glu Gly Glu Gly Glu Glu Glu Glu

G E E E E G E G E G E E E G E G E G E E E E

880 885 890 895

del (5) (5) del

2700 2715 || 2745 2760 ||

GGA GAA GGG GAG GGA GAA GAG GAA GGA GAA GGG GAG GGA GAA GAG GAG GAA GGA GAA GGG AAA GGG

CCT CTT CCC CTC CCT CTT CTC CTT CCT CTT CCC CTC CCT CTT CTC CTC CTT CCT CTT CCC TTT CCC

Gly Glu Gly Glu Gly Glu Glu Glu Gly Glu Gly Glu Gly Glu Glu Glu Glu Gly Glu Gly Lys Gly

G E G E G E E E G E G E G E E E E G E G K G

900 905 910 915 920

2775 2790 2805 2820

GAG GAG GAA GGA GAG GAA GGA GAA GGG GAG GGG GAA GAG GAG GAA GGA GAA GGG GAA GGG GAG GAT

CTC CTC CTT CCT CTC CTT CCT CTT CCC CTC CCC CTT CTC CTC CTT CCT CTT CCC CTT CCC CTC CTA

Glu Glu Glu Gly Glu Glu Gly Glu Gly Glu Gly Glu Glu Glu Glu Gly Glu Gly Glu Gly Glu Asp

E E E G E E G E G E G E E E E G E G E G E D

925 930 935 940

(5) del


2835 2850 2865 2880 |

GGA GAA GGG GAG GGG GAA GAG GAG GAA GGA GAA TGG GAG GGG GAA GAG GAG GAA GGA GAA GGG GAG

CCT CTT CCC CTC CCC CTT CTC CTC CTT CCT CTT ACC CTC CCC CTT CTC CTC CTT CCT CTT CCC CTC

Gly Glu Gly Glu Gly Glu Glu Glu Glu Gly Glu Trp Glu Gly Glu Glu Glu Glu Gly Glu Gly Glu

G E G E G E E E E G E W E G E E E E G E G E

945 950 955 960 965

|----------------dup (5)-----------|

2910 2925 2940 2955 2970

GGG GAA GAG GAA GGA GAA GGG GAA GGG GAG GAA GGA GAA GGG GAG GGG GAA GAG GAG GAA GGA GAA

CCC CTT CTC CTT CCT CTT CCC CTT CCC CTC CTT CCT CTT CCC CTC CCC CTT CTC CTC CTT CCT CTT

Gly Glu Glu Glu Gly Glu Gly Glu Gly Glu Glu Gly Glu Gly Glu Gly Glu Glu Glu Glu Gly Glu

G E E E G E G E G E E G E G E G E E E E G E

970 975 980 985 990

del (5)


2985 | | 3000 3015 3030

GGG GAG GGG GAA GAG GAG GAA GGG GAA GAA GAA GGG GAG GAA GAA GGA GAG GGA GAG GAA GAA GGG

CCC CTC CCC CTT CTC CTC CTT CCC CTT CTT CTT CCC CTC CTT CTT CCT CTC CCT CTC CTT CTT CCC

Gly Glu Gly Glu Glu Glu Glu Gly Glu Glu Glu Gly Glu Glu Glu Gly Glu Gly Glu Glu Glu Gly

G E G E E E E G E E E G E E E G E G E E E G

995 1000 1005 1010


(16,19, 16,5) del

del (5) |------del 5) | |------del (5)-|(23)del |

3045 3060 || |3075 | | 3090 | || |-

GAG GGA GAA GGG GAG GAA GAA GAG GAA GGG GAA GTG GAA GGG GAG GTG GAA GGG GAG GAA GGA GAG

CTC CCT CTT CCC CTC CTT CTT CTC CTT CCC CTT CAC CTT CCC CTC CAC CTT CCC CTC CTT CCT CTC

Glu Gly Glu Gly Glu Glu Glu Glu Glu Gly Glu Val Glu Gly Glu Val Glu Gly Glu Glu Gly Glu

E G E G E E E E E G E V E G E V E G E E G E

1015 1020 1025 1030 1035


del (23,24)

| 3120 3135 3150 3165

GGG GAA GGA GAG GAA GAG GAA GGA GAG GAG GAA GGA GAA GAA AGG GAA AAG GAG GGG GAA GGA GAA

CCC CTT CCT CTC CTT CTC CTT CCT CTC CTC CTT CCT CTT CTT TCC CTT TTC CTC CCC CTT CCT CTT

Gly Glu Gly Glu Glu Glu Glu Gly Glu Glu Glu Gly Glu Glu Arg Glu Lys Glu Gly Glu Gly Glu

G E G E E E E G E E E G E E R E K E G E G E

1040 1045 1050 1055

T=Gly (5)

3180 3195 3210 3225 |

GAA AAC AGG AGG AAC AGA GAA GAG GAG GAG GAA GAA GAG GGG AAG TAT CAG GAG ACA GGC GAA GAA

CTT TTG TCC TCC TTG TCT CTT CTC CTC CTC CTT CTT CTC CCC TTC ATA GTC CTC TGT CCG CTT CTT

Glu Asn Arg Arg Asn Arg Glu Glu Glu Glu Glu Glu Glu Gly Lys Tyr Gln Glu Thr Gly Glu Glu

E N R R N R E E E E E E E G K Y Q E T G E E

1060 1065 1070 1075

A=Lys (16) A=Val (19)

3240 | 3255 3270 | 3285 3300

GAG AAT GAA AGG CAG GAT GGA GAG GAG TAC AAA AAA GTG AGC AAA ATA AAA GGA TCT GTG AAA TAT

CTC TTA CTT TCC GTC CTA CCT CTC CTC ATG TTT TTT CAC TCG TTT TAT TTT CCT AGA CAC TTT ATA

|-------------------------> ORF15-4f(23)

Glu Asn Glu Arg Gln Asp Gly Glu Glu Tyr Lys Lys Val Ser Lys Ile Lys Gly Ser Val Lys Tyr

E N E R Q D G E E Y K K V S K I K G S V K Y

1080 1085 1090 1095 1100

insA (23)

3315 | 3330 3345 3360

GGC AAA CAT AAA ACA TAT CAA AAA AAG TCA GTT ACT AAC ACA CAG GGA AAT GGG AAA GAG CAG AGG

CCG TTT GTA TTT TGT ATA GTT TTT TTC AGT CAA TGA TTG TGT GTC CCT TTA CCC TTT CTC GTC TCC



<------------------------| ORF15-3r (23)

Gly Lys His Lys Thr Tyr Gln Lys Lys Ser Val Thr Asn Thr Gln Gly Asn Gly Lys Glu Gln Arg

G K H K T Y Q K K S V T N T Q G N G K E Q R

1105 1110 1115 1120

C=Asn (16)

3375 3390 3405 | 3420 3435

TCC AAA ATG CCA GTC CAG TCA AAA CGA CTT TTA AAA AAT GGG CCA TCA GGT TCC AAA AAG TTC TGG

AGG TTT TAC GGT CAG GTC AGT TTT GCT GAA AAT TTT TTA CCC GGT AGT CCA AGG TTT TTC AAG ACC

Ser Lys Met Pro Val Gln Ser Lys Arg Leu Leu Lys Asn Gly Pro Ser Gly Ser Lys Lys Phe Trp

S K M P V Q S K R L L K N G P S G S K K F W

1125 1130 1135 1140 1145

3450 3468 3481 3491 3501

AAT AAT ATA TTA CCA CAT TAC TTG GAA TTG AAG TAA caaac cttaa atgtg acccg attat ggcca

TTA TTA TAT AAT GGT GTA ATG AAC CTT AAC TTC ATT gtttg gaatt tacac tgggc taata ccggt

Asn Asn Ile Leu Pro His Tyr Leu Glu Leu Lys ter

N N I L P H Y L E L K X

1150 1156
3511 3521 3531 3541 3551 3561 3571

gtcag acaat ttaaa tgcct tgcat ataac gggca ctcat tacgt gttat taaat tgatt ttatg tcaat tattt

cagtc tgtta aattt acgga acgta tattg cccgt gagta atgca caata attta actaa aatac agtta ataaa
3581 3591 3601 3611 3621 3631 3641 3651

tatgt gtagt aaaaa aaaag caact gatgc agctg tgtta aggag ccaaa gacaa tagga ggcac tggta aattt

ataca catca ttttt ttttc gttga ctacg tcgac acaat tcctc ggttt ctgtt atcct ccgtg accat ttaaa

<----------
3661 3671 3681 3691 3701 3711 3721

tggcc tctct caaac taaaa ttttc gtgta tttcc ccccc aaatt ataaa aacat aacta gaaaa tatta aaagg

accgg agaga gtttg atttt aaaag cacat aaagg ggggg tttaa tattt ttgta ttgat ctttt ataat tttcc

------------| ORF15-4r (23)


3731 3741 3751 3761 3771 3781 3791 3801

tcata tcaga ttatt aacat tatat attca ttaaa ggcag cttta ggaaa cagga atata ctaca agagt gtttt

agtat agtct aataa ttgta atata taagt aattt ccgtc gaaat ccttt gtcct tatat gatgt tctca caaaa
3811 3821 3831 3841 3851 3861 3871

gtttg tgtat acaaa tcatt ccatt tttaa atggc acaga tgctt aaggg ctata aaaac ttcta atttc ttata

caaac acata tgttt agtaa ggtaa aaatt taccg tgtct acgaa ttccc gatat ttttg aagat taaag aatat
3881 3891 3901 3911 3921 3931 3941 3951

aatat gttag cactt ttttt aagtt agtga ttaca gttta cctac tgtat agaat aattt tctaa taatg gatgg

ttata caatc gtgaa aaaaa ttcaa tcact aatgt caaat ggatg acata tctta ttaaa agatt attac ctacc
….
IVS15
Exon15a
aacaa agggt ccttt tgtga gaggt caggc ccatt cactt cttta ctgtg atcct aacgc aagac acaga catat

ttgtt tccca ggaaa acact ctcca gtccg ggtaa gtgaa gaaat gacac tagga ttgcg ttctg tgtct gtata

actga ccatt tcaga ctacc tgttt acatc ttctt agatg gttct agtgc caaga atgta cttaa acttt actct

tgact ggtaa agtct gatgg acaaa tgtag aagaa tctac caaga tcacg gttct tacat gaatt tgaaa tgaga

1906 1920 1938 1951

ttctt tttat aaaat gacag TAT TCT GCC AGT CAC TCC CAA ATT GTT TCA GTT TAA aaggt aagtg ctttc

aagaa aaata tttta ctgtc ATA AGA CGG TCA GTG AGG GTT TAA CAA AGT CAA ATT ttcca ttcac gaaag

Tyr Ser Ala Ser His Ser Gln Ile Val Ser Val ter


Y S A S H S Q I V S V X

636 640 646
1961 1971 1981 1991 2001 2011 2021 2031

tgtga cttca agtag tcgtc cagat gtgtt tagaa taatt ttaaa tgtgc ttagt ctgtg agtgc tatga tgtga

acact gaagt tcatc agcag gtcta cacaa atctt attaa aattt acacg aatca gacac tcacg atact acact
2041 2051 2061 2071 2081

tttct agtga aaatg tcatt ttcta atagc tttct caaat ttaga aaatg atata gtt

aaaga tcact tttac agtaa aagat tatcg aaaga gttta aatct tttac tatat caa

IVS15a
16f (1) |------------------------->

acaaa ttatt tatga cagca atatc aaatc cctcg ttttt taaca agaaa atctt actct tctct gatgg tcaat

tgttt aataa atact gtcgt tatag tttag ggagc aaaaa attgt tcttt tagaa tgaga agaga ctacc agtta

1906 1920 1935 1950

tttgt cacag GAT CAT GAA TTT TCT AAA ACT GAG GAA CTA AAA CTA GAA GAT GTG GAT GAG GAA ATT

aaaca gtgtc CTA GTA CTT AAA AGA TTT TGA CTC CTT GAT TTT GAT CTT CTA CAC CTA CTC CTT TAA

Asp His Glu Phe Ser Lys Thr Glu Glu Leu Lys Leu Glu Asp Val Asp Glu Glu Ile

D H E F S K T E E L K L E D V D E E I

636 640 645 650

1965 1980 1995 2010 2025

AAT GCT GAA AAT GTG GAA AGC AAG AAG AAA ACT GTG GGA GAT GAT GAA AGT GTT CCT ACA GGT TAT

TTA CGA CTT TTA CAC CTT TCG TTC TTC TTT TGA CAC CCT CTA CTA CTT TCA CAA GGA TGT CCA ATA

Asn Ala Glu Asn Val Glu Ser Lys Lys Lys Thr Val Gly Asp Asp Glu Ser Val Pro Thr Gly Tyr

N A E N V E S K K K T V G D D E S V P T G Y

655 660 665 670 675

2040 2055 2070 2085 2091

CAC AGT AAA ACA GAA GGA GCA GAA AGA ACC AAT GAT GAT AGC TCA GCT GAA ACT ATT GAA AAG gtatg

GTG TCA TTT TGT CTT CCT CGT CTT TCT TGG TTA CTA CTA TCG AGT CGA CTT TGA TAA CTT TTC catac

His Ser Lys Thr Glu Gly Ala Glu Arg Thr Asn Asp Asp Ser Ser Ala Glu Thr Ile Glu Lys

H S K T E G A E R T N D D S S A E T I E K

680 685 690 695 697

atgat agatg cttaa atcct gagat cctgg gagaa aacat atggg ttctc tagtt tctgc agagg acaga ttcca

tacta tctac gaatt tagga ctcta ggacc ctctt ttgta taccc aagag atcaa agacg tctcc tgtct aaggt



<-----------------------| 16r (1)

gttgc


caacg
IVS16
|------------------------> 17f (1)

gactc tagaa cctgc ttaaa gattc agact taagt tttct agaac agcac aaaaa tattt tttgt tatat tgctt

ctgag atctt ggacg aattt ctaag tctga attca aaaga tcttg tcgtg ttttt ataaa aaaca atata acgaa

2092 2100 2115 2130 2145

tgtta ttgtt tatag AAA GAA AAA GCC AAC CTA GAG GAA CGG GCC ATT TGT GAG TAC AAT GAA AAC CCA

acaat aacaa atatc TTT CTT TTT CGG TTG GAT CTC CTT GCC CGG TAA ACA CTC ATG TTA CTT TTG GGT

Lys Glu Lys Ala Asn Leu Glu Glu Arg Ala Ile Cys Glu Tyr Asn Glu Asn Pro

K E K A N L E E R A I C E Y N E N P

698 700 705 710 715

t (13)


2151 |

AAA GGA gtatg agcac atttg ataac ataca tatat acata catac ataca cacat atata tgtgt atgta

TTT CCT catac tcgtg taaac tattg tatgt atata tgtat gtatg tatgt gtgta tatat acaca tacat

Lys Gly <----------

K G

717


tgcca taatt gggta attat aagct

acggt attaa cccat taata ttcga

--------------| 17r (1)
IVS17
|-----------------------> 18f (1)

catag agttt cactg agagc atcag gtctt gtctt tttcc tatta gttaa aacta gtatt tctga atgat ccaat

gtatc tcaaa gtgac tctcg tagtc cagaa cagaa aaagg ataat caatt ttgat cataa agact tacta ggtta
2152 2160 2175 2190 2205

tgtag TAC ATG CTT GAT GAT GCA GAT AGC AGT TCA TTA GAA ATC CTA GAA AAC AGT GAA ACA ACA CCA

acatc ATG TAC GAA CTA CTA CGT CTA TCG TCA AGT AAT CTT TAG GAT CTT TTG TCA CTT TGT TGT GGT

Tyr Met Leu Asp Asp Ala Asp Ser Ser Ser Leu Glu Ile Leu Glu Asn Ser Glu Thr Thr Pro

Y M L D D A D S S S L E I L E N S E T T P

718 720 725 730 735

c (13)

2220 2235 2241 |



AGC AAA GAC ATG AAA AAA ACA AAG AAG gtaag tgaaa tgaat tgtct ttttt gatcc ttttg aattc aattg

TCG TTT CTG TAC TTT TTT TGT TTC TTC cattc acttt actta acaga aaaaa ctagg aaaac ttaag ttaac

Ser Lys Asp Met Lys Lys Thr Lys Lys

S K D M K K T K K

740 745 747

tttgt tttac ttttc atatg gcaat ccaca taatt taact ctctt aatat gagtt ttcat ggccc caaaa cctga

aaaca aaatg aaaag tatac cgtta ggtgt attaa attga gagaa ttata ctcaa aagta ccggg gtttt ggact

<-------------------------| 18r (1)
agaat ttc

tctta aag


IVS18
|---------------------------> 19f (1) 2242

cccct aaata ctgtt aagtg aatgt ttcac ttatg gaaat gaata tttgt gtgtg tggtt ttttt cctag ATT

gggga tttat gacaa ttcac ttaca aagtg aatac cttta cttat aaaca cacac accaa aaaaa ggatc TAA

Ile


I

748


2250 2265 2280 2295 2310

TTT CTG TTC AAA AGA GTC CCC TCA ATA AAT CAA AAG ATT GTC AAG AAT AAC AAT GAG CCG CTC CCA

AAA GAC AAG TTT TCT CAG GGG AGT TAT TTA GTT TTC TAA CAG TTC TTA TTG TTA CTC GGC GAG GGT

Phe Leu Phe Lys Arg Val Pro Ser Ile Asn Gln Lys Ile Val Lys Asn Asn Asn Glu Pro Leu Pro

F L F K R V P S I N Q K I V K N N N E P L P

750 755 760 765 771

2325 2340 2355 2370

GAG ATA AAA TCC ATA GGA GAC CAG ATC ATT TTA AAA AGT GAT AAT AAA GAT GCC GAC CAG AAC CAC

CTC TAT TTT AGG TAT CCT CTG GTC TAG TAA AAT TTT TCA CTA TTA TTT CTA CGG CTG GTC TTG GTG

Glu Ile Lys Ser Ile Gly Asp Gln Ile Ile Leu Lys Ser Asp Asn Lys Asp Ala Asp Gln Asn His

E I K S I G D Q I I L K S D N K D A D Q N H

775 780 785 790

2385 2400 2415 2430

ATG AGT CAG AAT CAT CAG AAT ATC CCA CCA ACA AAT ACA GAG AGA AGA TCA AAA TCC TGT ACA ATA

TAC TCA GTC TTA GTA GTC TTA TAG GGT GGT TGT TTA TGT CTC TCT TCT AGT TTT AGG ACA TGT TAT

Met Ser Gln Asn His Gln Asn Ile Pro Pro Thr Asn Thr Glu Arg Arg Ser Lys Ser Cys Thr Ile

M S Q N H Q N I P P T N T E R R S K S C T I

IIIIIIIIIII

795 800 805 810

2445 2453 2463 2473 2483 2493 2503 2513

CTA TAA atata tattt atgtt ttcac agtca ccaag tgtat tgtaa tgtat acttg aaaaa tgtta taact

GAT ATT tatat ataaa tacaa aagtg tcagt ggttc acata acatt acata tgaac ttttt acaat attga

Leu ter <-------------------------| 19r (1)

L X


III

815
2523 2533 2543 2553 2563 2573 2583

tatga agtaa agttt ctgat agtag tcttt aaaag atata agact taata tgttt tattc agctt ctata agtgt

atact tcatt tcaaa gacta tcatc agaaa ttttc tatat tctga attat acaaa ataag tcgaa gatat tcaca


2593 2603 2613 2623 2633 2643 2653 2663

gacca gtttt gatat ttatt tatgc taata ttttt aacaa gtcat ttcaa aatat gtgta tctca aattc tccct

ctggt caaaa ctata aataa atacg attat aaaaa ttgtt cagta aagtt ttata cacat agagt ttaag aggga
2673 2683 2693 2703 2713 2723 2733

aaagt gttgt ggcct taact gttca gtatt gcaat aaaaa atata ttttt atatg tggaa aaaaa aaaaa aaaaa

tttca caaca ccgga attga caagt cataa cgtta ttttt tatat aaaaa tatac acctt ttttt ttttt ttttt
2743

aaaaa


ttttt
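
The row layout used throughout the listing (coding strand, complementary strand written in the same left-to-right order, then three-letter and one-letter translations, with X marking a stop codon) can be reproduced programmatically. The following is a minimal Python sketch, assuming the standard genetic code and taking the last coding row of ORF15 as input. The names `complement_row` and `translate_row` are hypothetical helpers introduced for illustration; they are not part of the original annotation.

```python
from itertools import product

# Standard genetic code in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Base pairing for the complementary strand (letter case is preserved).
PAIRING = str.maketrans("ACGTacgt", "TGCAtgca")


def complement_row(row: str) -> str:
    """Return the complementary strand in the same left-to-right order."""
    return row.translate(PAIRING)


def translate_row(row: str) -> str:
    """Translate uppercase codon triplets to one-letter codes (X = stop).

    Lowercase five-letter groups (intron/UTR sequence) are skipped.
    """
    codons = [c for c in row.split() if len(c) == 3 and c.isupper()]
    return " ".join(CODON_TABLE[c].replace("*", "X") for c in codons)


# Last coding row of ORF15, copied from the listing above.
coding = "AAT AAT ATA TTA CCA CAT TAC TTG GAA TTG AAG TAA"
print(complement_row(coding))
print(translate_row(coding))
```

Applied to that row, the two calls reproduce the complementary-strand row and the one-letter translation row as they appear in the listing. Rows containing a codon split across an exon boundary would need extra handling that this sketch does not attempt.
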
(1)=[1]

(2)=[2]

(3)=[5]

(4)=[3]

(5)=[4]

(6)=[6]

(7)=[7]

(8)=[8]

(9)=[9]

(10)=[10]

(11)=[11]

(12)=[12]

(13)=[13]

(14)=[14]

(15)=[15]

(16)=[16]

(17)=[17]

(18)=[18]

(19)=[19]

(20)=[20]

(21)=[21]

(22)=[22]

(23)=[23]

(24)=[24]


1. Meindl,A., Dry,K., Herrmann,K., Manson,F., Ciccodicola,A., Edgar,A., Carvalho,M.R., Achatz,H., Hellebrand,H., Lennon,A., Migliaccio,C., Porter,K., Zrenner,E., Bird,A., Jay,M., Lorenz,B., Wittwer,B., D'Urso,M., Meitinger,T., Wright,A. (1996) A gene (RPGR) with homology to the RCC1 guanine nucleotide exchange factor is mutated in X-linked retinitis pigmentosa (RP3). Nat.Genet. 13 (1): 35-42.

2. Roepman,R., van Duijnhoven,G., Rosenberg,T., Pinckers,A.J., Bleeker Wagemakers,L.M., Bergen,A.A.B., Post,J., Beck,A., Reinhardt,R., Ropers,H.H., Cremers,F.P., Berger,W. (1996) Positional cloning of the gene for X-linked retinitis pigmentosa 3: homology with the guanine-nucleotide-exchange factor RCC1. Hum.Mol.Genet. 5 (7): 1035-1041.

3. Kirschner,R., Rosenberg,T., Schultz-Heienbrok,R., Lenzner,S., Feil,S., Roepman,R., Cremers,F.P., Ropers,H.H., Berger,W. (1999) RPGR transcription studies in mouse and human tissues reveal a retina-specific isoform that is disrupted in a patient with X-linked retinitis pigmentosa. Hum.Mol.Genet. 8 (8): 1571-1578.

4. Vervoort,R., Lennon,A., Bird,A.C., Tulloch,B., Axton,R., Miano,M.G., Meindl,A., Meitinger,T., Ciccodicola,A., Wright,A.F. (2000) Mutational hot spot within a new RPGR exon in X-linked retinitis pigmentosa. Nat.Genet. 25 (4): 462-466.

5. Guevara-Fujita,M., Fahrner,S., Buraczynska,K., Cook,J., Wheaton,D., Cortes,F., Vicencio,C., Pena,M., Fishman,G., Mintz-Hittner,H., Birch,D., Hoffman,D., Mears,A., Fujita,R., Swaroop,A. (2001) Five novel RPGR mutations in families with X-linked retinitis pigmentosa. Hum.Mutat. 17 (2): 151.

6. Liu,L., Jin,L., Liu,M., Wei,Y., Wu,Y., Liu,Y., Wang,H., Chu,R., Chai,J. (2000) Identification of two novel mutations (E332X and c1536delC) in the RPGR gene in two Chinese families with X-linked retinitis pigmentosa. Hum.Mutat.Online.

7. Fishman,G.A., Grover,S., Buraczynska,M., Wu,W., Swaroop,A. (1998) A new 2-base pair deletion in the RPGR gene in a black family with X-linked retinitis pigmentosa. Arch.Ophthalmol. 116 (2): 213-218.

8. Weleber,R.G., Butler,N.S., Murphey,W.H., Sheffield,V.C., Stone,E.M. (1997) X-linked retinitis pigmentosa associated with a 2-base pair insertion in codon 99 of the RP3 gene RPGR. Arch.Ophthalmol. 115 (11): 1429-1435.

9. Zito,I., Thiselton,D.L., Gorin,M.B., Stout,J.T., Plant,C., Bird,A.C., Bhattacharya,S.S., Hardcastle,A.J. (1999) Identification of novel RPGR (retinitis pigmentosa GTPase regulator) mutations in a subset of X-linked retinitis pigmentosa families segregating with the RP3 locus. Hum.Genet. 105 (1-2): 57-62.

10. Buraczynska,M., Wu,W., Fujita,R., Buraczynska,K., Phelps,E., Andreasson,S., Bennett,J., Birch,D.G., Fishman,G.A., Hoffman,D.R., Inana,G., Jacobson,S.G., Musarella,M.A., Sieving,P.A., Swaroop,A. (1997) Spectrum of mutations in the RPGR gene that are identified in 20% of families with X-linked retinitis pigmentosa. Am.J.Hum.Genet. 61 (6): 1287-1292.

11. Dry,K.L., Manson,F.D., Lennon,A., Bergen,A.A., van Dorp,D.B., Wright,A.F. (1999) Identification of a 5' splice site mutation in the RPGR gene in a family with X-linked retinitis pigmentosa (RP3). Hum.Mutat. 13 (2): 141-145.

12. Miano,M.G., Valverde,D., Solans,T., Grammatico,B., Migliaccio,C., Cirigliano,V., DeBernardo,C., Ventruto,V., Meitinger,T., Wright,A., Del Porto,G., Baiget,M., D'Urso,M., Ciccodicola,A. (1998) Two novel mutations in the retinitis pigmentosa GTPase regulator (RPGR) gene in X-linked retinitis pigmentosa (RP3). Mutations in brief no. 172. Online. Hum.Mutat. 12 (3): 212-213.

13. Zito,I., Gorin,M.B., Plant,C., Bird,A.C., Bhattacharya,S.S., Hardcastle,A.J. (2000) Novel mutations of the RPGR gene in RP3 families. Hum.Mutat.Online. 15 (4): 386.

14. Bauer,S., Fujita,R., Buraczynska,M., Abrahamson,M., Ehinger,B., Wu,W., Falls,T.J., Andreasson,S., Swaroop,A. (1998) Phenotype of an X-linked retinitis pigmentosa family with a novel splice defect in the RPGR gene. Invest.Ophthalmol.Vis.Sci. 39 (12): 2470-2474.

15. Sharon,D., Bruns,G.A., McGee,T.L., Sandberg,M.A., Berson,E.L., Dryja,T.P. (2000) X-linked retinitis pigmentosa: mutation spectrum of the RPGR and RP2 genes and correlation with visual function. Invest.Ophthalmol.Vis.Sci. 41 (9): 2712-2721.

16. Rabe,V.W., Sharon,D., Berson,E.L., Dryja,T.P. (2001) The mutation spectrum of RPGR-ORF15 in North American patients with X-linked retinitis pigmentosa (XLRP). Invest.Ophthalmol.Vis.Sci. 42 (4): S642.

17. Nao-i,N., Yokoyama,A., Maruiwa,F., Hayakawa,M., Kanai,A., Vervoort,R., Wright,A.F., Yamada,K., Niikawa,N. (2001) Novel mutations of RPGR gene exon ORF 15 in Japanese families with X-linked retinitis pigmentosa. Invest.Ophthalmol.Vis.Sci. 42 (4): S642.

18. Blasi,M.A., Grammatico,P., Rinaldi,R., Grammatico,B., Sasso,P., D'Urso,M., Miano,M.G., Balestrazzi,E. (2001) Genotype-phenotype correlation in X-linked retinitis pigmentosa family with a novel mutation in RPGR gene. Invest.Ophthalmol.Vis.Sci. 42 (4): S641.

19. Miano,M.G., Vervoort,R., Conte,I., Zullo,A., Lanzara,C., Circolo,D., D'Urso,M., Ayuso,C., Wright,A., Ciccodicola,A. (2001) Mutational analysis of the RPGR exon ORF 15 in South European patients with X-linked retinitis pigmentosa. Invest.Ophthalmol.Vis.Sci. 42 (4): S641.

20. Fujita,R., Buraczynska,M., Gieser,L., Wu,W., Forsythe,P., Abrahamson,M., Jacobson,S.G., Sieving,P.A., Andreasson,S., Swaroop,A. (1997) Analysis of the RPGR gene in 11 pedigrees with the retinitis pigmentosa type 3 genotype: paucity of mutations in the coding region but splice defects in two families. Am.J.Hum.Genet. 61 (3): 571-580.

21. Linari,M., Ueffing,M., Manson,F., Wright,A., Meitinger,T., Becker,J. (1999) The retinitis pigmentosa GTPase regulator, RPGR, interacts with the delta subunit of rod cyclic GMP phosphodiesterase. Proc.Natl.Acad.Sci.U.S.A. 96 (4): 1315-1320.

22. D'Urso,M., Miano,M.G., Testa,F., Simonelli,F., Baiget,M., Ciccodicola,A. (1999) Mutation analysis in patients with X-linked retinitis pigmentosa. Invest.Ophthalmol.Vis.Sci. 40 (Suppl.): S469.

23. Demirci,F.Y., Rigatti,B.W., Wen,G., Radak,A.L., Mah,T.S., Baic,C.L., Traboulsi,E.I., Alitalo,T., Ramser,J., Gorin,M.B. (2002) X-linked cone-rod dystrophy (locus COD1): identification of mutations in RPGR exon ORF15. Am.J.Hum.Genet. 70 (4): 1049-1053.

24. Yang,Z., Peachey,N.S., Moshfeghi,D.M., Thirumalaichary,S., Chorich,L., Shugart,Y.Y., Fan,K., Zhang,K. (2002) Mutations in the RPGR gene cause X-linked cone dystrophy. Hum.Mol.Genet. 11 (5): 605-611.