Addgene pHAGE-EGFR-A839V Sequencing Result - Sequence Analyzer Skip to main content
Addgene

Sequence Analyzer: pHAGE-EGFR-A839V Sequencing Result


Map View

NruI * (11,093) EBV-rev (10,062 .. 10,081) M13 rev M13 Reverse (9996 .. 10,012) lac operator M13/pUC Reverse (9977 .. 9999) CAP binding site SfiI (9775) L4440 (9641 .. 9658) pBR322ori-F (9388 .. 9407) FspI (8432) PvuI (8286) pBRforEco (7677 .. 7695) PacI (7640) WPRE-R (6715 .. 6735) BsaBI * (6660) Puro-F (6545 .. 6565) RsrII (6165) BsiWI (6105) Puro-R (6049 .. 6068) KflI (84) SpeI (362) EF1a-F (1499 .. 1519) NsiI (4560) T3 promoter T7 promoter T7 (5307 .. 5326) NotI (5372) PspXI (5378) XbaI (5384) BstBI (5401) MluI (5445) IRES reverse (5629 .. 5646) IRES-F (5856 .. 5875) Acc65I (5903) KpnI (5907) pHAGE-EGFR-A839V 12,110 bp

Sequence View

5ʹ 3ʹ 60 (in frame with gp41 peptide) VFAVLSIVNRVRQGYSPLSF FAVLSIVNRVRQGYSPLSF t t t t t g c t g t a c t t t c t a t a g t g a a t a g a g t t a g g c a g g g a t a t t c a c c a t t a t c g t t t c a a a a a c g a c a t g a a a g a t a t c a c t t a t c t c a a t c c g t c c c t a t a a g t g g t a a t a g c a a a g

KflI 120 (in frame with gp41 peptide) QTHLPTPRGPDRPEGIEEEG QTHLPTPRGPDRPEGIEEEG a g a c c c a c c t c c c a a c c c c g a g g g g a c c c g a c a g g c c c g a a g g a a t a g a a g a a g a a g g t g t c t g g g t g g a g g g t t g g g g c t c c c c t g g g c t g t c c g g g c t t c c t t a t c t t c t t c t t c c a c

180 (in frame with gp41 peptide) GERDRDRSIRLVNGSRRYRR GERDRDRSIRLVNGSRRYRR g a g a g a g a g a c a g a g a c a g a t c c a t t c g a t t a g t g a a c g g a t c t c g a c g g t a t c g c c g a a c t c t c t c t c t g t c t c t g t c t a g g t a a g c t a a t c a c t t g c c t a g a g c t g c c a t a g c g g c t t

240 (in frame with gp41 peptide) IHKWQYSSTILKEKGGLGGT cPPT/CTS IHKWQYSSTILKEKGGLGGT t t c a c a a a t g g c a g t a t t c a t c c a c a a t t t t a a a a g a a a a g g g g g g a t t g g g g g g t a c a g a a g t g t t t a c c g t c a t a a g t a g g t g t t a a a a t t t t c t t t t c c c c c c t a a c c c c c c a t g t c

300 (in frame with gp41 peptide) VQGKE* cPPT/CTS VQGKE* t g c a g g g g a a a g a a t a g t a g a c a t a a t a g c a a c a g a c a t a c a a a c t a a a g a a t t a c a a a a a c g t c c c c t t t c t t a t c a t c t g t a t t a t c g t t g t c t g t a t g t t t g a t t t c t t a a t g t t t t

360 cPPT/CTS a c a a a t t a c a a a a a t t c a a a a t t t t c g g g t t t a t t a c a g g g a c a g c a g a g a t c c a g t t t g t g t t t a a t g t t t t t a a g t t t t a a a a g c c c a a a t a a t g t c c c t g t c g t c t c t a g g t c a a a c

SpeI 420 EF-1 α promoter g a c t a g t c g t g a g g c t c c g g t g c c c g t c a g t g g g c a g a g c g c a c a t c g c c c a c a g t c c c c c t g a t c a g c a c t c c g a g g c c a c g g g c a g t c a c c c g t c t c g c g t g t a g c g g g t g t c a g g g g

480 EF-1 α promoter g a g a a g t t g g g g g g a g g g g t c g g c a a t t g a a c c g g t g c c t a g a g a a g g t g g c g c g g g g t a c t c t t c a a c c c c c c t c c c c a g c c g t t a a c t t g g c c a c g g a t c t c t t c c a c c g c g c c c c a t

540 EF-1 α promoter a a c t g g g a a a g t g a t g t c g t g t a c t g g c t c c g c c t t t t t c c c g a g g g t g g g g g a g a a c c g t t g a c c c t t t c a c t a c a g c a c a t g a c c g a g g c g g a a a a a g g g c t c c c a c c c c c t c t t g g c

600 EF-1 α promoter t a t a t a a g t g c a g t a g t c g c c g t g a a c g t t c t t t t t c g c a a c g g g t t t g c c g c c a g a a c a a t a t a t t c a c g t c a t c a g c g g c a c t t g c a a g a a a a a g c g t t g c c c a a a c g g c g g t c t t g t

660 EF-1 α promoter EF-1 α intron A c a g g t a a g t g c c g t g t g t g g t t c c c g c g g g c c t g g c c t c t t t a c g g g t t a t g g c c c t t g c g t c c a t t c a c g g c a c a c a c c a a g g g c g c c c g g a c c g g a g a a a t g c c c a a t a c c g g g a a c g

720 EF-1 α promoter EF-1 α intron A *KWRAATRSEQDRAEPQ g t g c c t t g a a t t a c t t c c a c c t g g c t g c a g t a c g t g a t t c t t g a t c c c g a g c t t c g g g t t c a c g g a a c t t a a t g a a g g t g g a c c g a c g t c a t g c a c t a a g a a c t a g g g c t c g a a g c c c a a

780 EF-1 α promoter EF-1 α intron A FHTPSNSAKRKLLGKAEHKL g g a a g t g g g t g g g a g a g t t c g a g g c c t t g c g c t t a a g g a g c c c c t t c g c c t c g t g c t t g a c c t t c a c c c a c c c t c t c a a g c t c c g g a a c g c g a a t t c c t c g g g g a a g c g g a g c a c g a a c t

840 EF-1 α promoter EF-1 α intron A QPRAQASPGGRAFRTAGERR g t t g a g g c c t g g c c t g g g c g c t g g g g c c g c c g c g t g c g a a t c t g g t g g c a c c t t c g c g c c c a a c t c c g g a c c g g a c c c g c g a c c c c g g c g g c g c a c g c t t a g a c c a c c g t g g a a g c g c g g

900 EF-1 α promoter EF-1 α intron A DRQKRYTELWKFNKIVQQSA t g t c t c g c t g c t t t c g a t a a g t c t c t a g c c a t t t a a a a t t t t t g a t g a c c t g c t g c g a c g a c a g a g c g a c g a a a g c t a t t c a g a g a t c g g t a a a t t t t a a a a a c t a c t g g a c g a c g c t g c

960 EF-1 α promoter EF-1 α intron A KKRALYDQLHPGLDACQYKP c t t t t t t t c t g g c a a g a t a g t c t t g t a a a t g c g g g c c a a g a t c t g c a c a c t g g t a t t t c g g a a a a a a a g a c c g t t c t a t c a g a a c a t t t a c g c c c g g t t c t a g a c g t g t g a c c a t a a a g c

1020 EF-1 α promoter EF-1 α intron A KQPRPRRRPGHTGACMNPSA g t t t t t g g g g c c g c g g g c g g c g a c g g g g c c c g t g c g t c c c a g c g c a c a t g t t c g g c g a g g c a a a a a c c c c g g c g c c c g c c g c t g c c c c g g g c a c g c a g g g t c g c g t g t a c a a g c c g c t c c

1080 EF-1 α promoter EF-1 α intron A PGALAAVSFRVPTTELQGAQ c g g g g c c t g c g a g c g c g g c c a c c g a g a a t c g g a c g g g g g t a g t c t c a a g c t g g c c g g c c t g c c c c g g a c g c t c g c g c c g g t g g c t c t t a g c c t g c c c c c a t c a g a g t t c g a c c g g c c g g a

1140 EF-1 α promoter EF-1 α intron A EPAQGRAATYRGARPPLAPG g c t c t g g t g c c t g g c c t c g c g c c g c c g t g t a t c g c c c c g c c c t g g g c g g c a a g g c t g g c c c g a g a c c a c g g a c c g g a g c g c g g c g g c a c a t a g c g g g g c g g g a c c c g c c g t t c c g a c c g g

1200 EF-1 α promoter EF-1 α intron A TPVLQTLPFIAAERGQQLSS c g g t c g g c a c c a g t t g c g t g a g c g g a a a g a t g g c c g c t t c c c g g c c c t g c t g c a g g g a g c g c c a g c c g t g g t c a a c g c a c t c g c c t t t c t a c c g g c g a a g g g c c g g g a c g a c g t c c c t c g

1260 EF-1 α promoter EF-1 α intron A LISSAASPLAPPHTVWVFSF t c a a a a t g g a g g a c g c g g c g c t c g g g a g a g c g g g c g g g t g a g t c a c c c a c a c a a a g g a a a a g t t t t a c c t c c t g c g c c g c g a g c c c t c t c g c c c g c c c a c t c a g t g g g t g t g t t t c c t t t

1320 EF-1 α promoter EF-1 α intron A PRETRLRRKM a g g g c c t t t c c g t c c t c a g c c g t c g c t t c a t g t g a c t c c a c g g a g t a c c g g g c g c c g t c c t c c c g g a a a g g c a g g a g t c g g c a g c g a a g t a c a c t g a g g t g c c t c a t g g c c c g c g g c a g g

1380 EF-1 α promoter EF-1 α intron A a g g c a c c t c g a t t a g t t c t c g a g c t t t t g g a g t a c g t c g t c t t t a g g t t g g g g g g a g g g g t c c g t g g a g c t a a t c a a g a g c t c g a a a a c c t c a t g c a g c a g a a a t c c a a c c c c c c t c c c c

1440 EF-1 α promoter EF-1 α intron A t t t t a t g c g a t g g a g t t t c c c c a c a c t g a g t g g g t g g a g a c t g a a g t t a g g c c a g c t t g g a a a a t a c g c t a c c t c a a a g g g g t g t g a c t c a c c c a c c t c t g a c t t c a a t c c g g t c g a a c c

1500 EF-1 α promoter EF-1 α intron A T C EF1a-F c a c t t g a t g t a a t t c t c c t t g g a a t t t g c c c t t t t t g a g t t t g g a t c t t g g t t c a t t c t c g t g a a c t a c a t t a a g a g g a a c c t t a a a c g g g a a a a a c t c a a a c c t a g a a c c a a g t a a g a g

1560 EF-1 α promoter EF-1 α intron A A A G C C T C A G A C A G T G G T T C EF1a-F a a g c c t c a g a c a g t g g t t c a a a g t t t t t t t c t t c c a t t t c a g g t g t c g t g a a g c g g c c c t t t c g g a g t c t g t c a c c a a g t t t c a a a a a a a g a a g g t a a a g t c c a c a g c a c t t c g c c g g g a

1620 1 5 signal sequence MRPSGTAG attB1 EGFR MRPSGTAG g c a g a t a t c a a c a a g t t t g t a c a a a a a a g c a g g c a c c a t g c g a c c c t c c g g g a c g g c c g g c g t c t a t a g t t g t t c a a a c a t g t t t t t t c g t c c g t g g t a c g c t g g g a g g c c c t g c c g g c c

1680 10 15 20 signal sequence AALLALLAALCPASRA 25 extracellular domain LEEK EGFR AALLALLAALCPASRALEEK g g c a g c g c t c c t g g c g c t g c t g g c t g c g c t c t g c c c g g c g a g t c g g g c t c t g g a g g a a a a c c g t c g c g a g g a c c g c g a c g a c c g a c g c g a g a c g g g c c g c t c a g c c c g a g a c c t c c t t t t

1740 30 35 40 45 extracellular domain KVCQGTSNKLTQLGTFEDHF EGFR KVCQGTSNKLTQLGTFEDHF g a a a g t t t g c c a a g g c a c g a g t a a c a a g c t c a c g c a g t t g g g c a c t t t t g a a g a t c a t t t c t t t c a a a c g g t t c c g t g c t c a t t g t t c g a g t g c g t c a a c c c g t g a a a a c t t c t a g t a a a

1800 50 55 60 65 extracellular domain LSLQRMFNNCEVVLGNLEIT EGFR LSLQRMFNNCEVVLGNLEIT t c t c a g c c t c c a g a g g a t g t t c a a t a a c t g t g a g g t g g t c c t t g g g a a t t t g g a a a t t a c a g a g t c g g a g g t c t c c t a c a a g t t a t t g a c a c t c c a c c a g g a a c c c t t a a a c c t t t a a t g

1860 70 75 80 85 extracellular domain YVQRNYDLSFLKTIQEVAGY EGFR YVQRNYDLSFLKTIQEVAGY c t a t g t g c a g a g g a a t t a t g a t c t t t c c t t c t t a a a g a c c a t c c a g g a g g t g g c t g g t t a g a t a c a c g t c t c c t t a a t a c t a g a a a g g a a g a a t t t c t g g t a g g t c c t c c a c c g a c c a a t

1920 90 95 100 105 extracellular domain VLIALNTVERIPLENLQIIR EGFR VLIALNTVERIPLENLQIIR t g t c c t c a t t g c c c t c a a c a c a g t g g a g c g a a t t c c t t t g g a a a a c c t g c a g a t c a t c a g a c a g g a g t a a c g g g a g t t g t g t c a c c t c g c t t a a g g a a a c c t t t t g g a c g t c t a g t a g t c

1980 110 115 120 125 extracellular domain GNMYYENSYALAVLSNYDAN EGFR GNMYYENSYALAVLSNYDAN a g g a a a t a t g t a c t a c g a a a a t t c c t a t g c c t t a g c a g t c t t a t c t a a c t a t g a t g c a a a t c c t t t a t a c a t g a t g c t t t t a a g g a t a c g g a a t c g t c a g a a t a g a t t g a t a c t a c g t t t

2040 130 135 140 145 extracellular domain KTGLKELPMRNLQEILHGAV EGFR KTGLKELPMRNLQEILHGAV t a a a a c c g g a c t g a a g g a g c t g c c c a t g a g a a a t t t a c a g g a a a t c c t g c a t g g c g c c g t a t t t t g g c c t g a c t t c c t c g a c g g g t a c t c t t t a a a t g t c c t t t a g g a c g t a c c g c g g c a

2100 150 155 160 165 extracellular domain RFSNNPALCNVESIQWRDIV EGFR RFSNNPALCNVESIQWRDIV g c g g t t c a g c a a c a a c c c t g c c c t g t g c a a c g t g g a g a g c a t c c a g t g g c g g g a c a t a g t c g c c a a g t c g t t g t t g g g a c g g g a c a c g t t g c a c c t c t c g t a g g t c a c c g c c c t g t a t c a

2160 170 175 180 185 extracellular domain SSDFLSNMSMDFQNHLGSCQ EGFR SSDFLSNMSMDFQNHLGSCQ c a g c a g t g a c t t t c t c a g c a a c a t g t c g a t g g a c t t c c a g a a c c a c c t g g g c a g c t g c c a g t c g t c a c t g a a a g a g t c g t t g t a c a g c t a c c t g a a g g t c t t g g t g g a c c c g t c g a c g g t

2220 190 195 200 205 extracellular domain KCDPSCPNGSCWGAGEENCQ EGFR KCDPSCPNGSCWGAGEENCQ a a a g t g t g a t c c a a g c t g t c c c a a t g g g a g c t g c t g g g g t g c a g g a g a g g a g a a c t g c c a t t t c a c a c t a g g t t c g a c a g g g t t a c c c t c g a c g a c c c c a c g t c c t c t c c t c t t g a c g g t

2280 210 215 220 225 extracellular domain KLTKIICAQQCSGRCRGKSP EGFR KLTKIICAQQCSGRCRGKSP g a a a c t g a c c a a a a t c a t c t g t g c c c a g c a g t g c t c c g g g c g c t g c c g t g g c a a g t c c c c c t t t g a c t g g t t t t a g t a g a c a c g g g t c g t c a c g a g g c c c g c g a c g g c a c c g t t c a g g g g

2340 230 235 240 245 extracellular domain SDCCHNQCAAGCTGPRESDC EGFR SDCCHNQCAAGCTGPRESDC c a g t g a c t g c t g c c a c a a c c a g t g t g c t g c a g g c t g c a c a g g c c c c c g g g a g a g c g a c t g g t c a c t g a c g a c g g t g t t g g t c a c a c g a c g t c c g a c g t g t c c g g g g g c c c t c t c g c t g a c

2400 250 255 260 265 extracellular domain LVCRKFRDEATCKDTCPPLM EGFR LVCRKFRDEATCKDTCPPLM c c t g g t c t g c c g c a a a t t c c g a g a c g a a g c c a c g t g c a a g g a c a c c t g c c c c c c a c t c a t g g a c c a g a c g g c g t t t a a g g c t c t g c t t c g g t g c a c g t t c c t g t g g a c g g g g g g t g a g t a

2460 270 275 280 285 extracellular domain LYNPTTYQMDVNPEGKYSFG EGFR LYNPTTYQMDVNPEGKYSFG g c t c t a c a a c c c c a c c a c g t a c c a g a t g g a t g t g a a c c c c g a g g g c a a a t a c a g c t t t g g c g a g a t g t t g g g g t g g t g c a t g g t c t a c c t a c a c t t g g g g c t c c c g t t t a t g t c g a a a c c

2520 290 295 300 305 extracellular domain ATCVKKCPRNYVVTDHGSCV EGFR ATCVKKCPRNYVVTDHGSCV t g c c a c c t g c g t g a a g a a g t g t c c c c g t a a t t a t g t g g t g a c a g a t c a c g g c t c g t g c g t a c g g t g g a c g c a c t t c t t c a c a g g g g c a t t a a t a c a c c a c t g t c t a g t g c c g a g c a c g c a

2580 310 315 320 325 extracellular domain RACGADSYEMEEDGVRKCKK EGFR RACGADSYEMEEDGVRKCKK c c g a g c c t g t g g g g c c g a c a g c t a t g a g a t g g a g g a a g a c g g c g t c c g c a a g t g t a a g a a g g c t c g g a c a c c c c g g c t g t c g a t a c t c t a c c t c c t t c t g c c g c a g g c g t t c a c a t t c t t

2640 330 335 340 345 extracellular domain CEGPCRKVCNGIGIGEFKDS EGFR CEGPCRKVCNGIGIGEFKDS g t g c g a a g g g c c t t g c c g c a a a g t g t g t a a c g g a a t a g g t a t t g g t g a a t t t a a a g a c t c c a c g c t t c c c g g a a c g g c g t t t c a c a c a t t g c c t t a t c c a t a a c c a c t t a a a t t t c t g a g

2700 350 355 360 365 extracellular domain LSINATNIKHFKNCTSISGD EGFR LSINATNIKHFKNCTSISGD a c t c t c c a t a a a t g c t a c g a a t a t t a a a c a c t t c a a a a a c t g c a c c t c c a t c a g t g g c g a t g a g a g g t a t t t a c g a t g c t t a t a a t t t g t g a a g t t t t t g a c g t g g a g g t a g t c a c c g c t

2760 370 375 380 385 extracellular domain LHILPVAFRGDSFTHTPPLD EGFR LHILPVAFRGDSFTHTPPLD t c t c c a c a t c c t g c c g g t g g c a t t t a g g g g t g a c t c c t t c a c a c a t a c t c c t c c t c t g g a a g a g g t g t a g g a c g g c c a c c g t a a a t c c c c a c t g a g g a a g t g t g t a t g a g g a g g a g a c c t

2820 390 395 400 405 extracellular domain PQELDILKTVKEITGFLLIQ EGFR PQELDILKTVKEITGFLLIQ t c c a c a g g a a c t g g a t a t t c t g a a a a c c g t a a a g g a a a t c a c a g g g t t t t t g c t g a t t c a a g g t g t c c t t g a c c t a t a a g a c t t t t g g c a t t t c c t t t a g t g t c c c a a a a a c g a c t a a g t

2880 410 415 420 425 extracellular domain AWPENRTDLHAFENLEIIRG EGFR AWPENRTDLHAFENLEIIRG g g c t t g g c c t g a a a a c a g g a c g g a c c t c c a t g c c t t t g a g a a c c t a g a a a t c a t a c g c g g c c g a a c c g g a c t t t t g t c c t g c c t g g a g g t a c g g a a a c t c t t g g a t c t t t a g t a t g c g c c

2940 430 435 440 445 extracellular domain RTKQHGQFSLAVVSLNITSL EGFR RTKQHGQFSLAVVSLNITSL c a g g a c c a a g c a a c a t g g t c a g t t t t c t c t t g c a g t c g t c a g c c t g a a c a t a a c a t c c t t g t c c t g g t t c g t t g t a c c a g t c a a a a g a g a a c g t c a g c a g t c g g a c t t g t a t t g t a g g a a

3000 450 455 460 465 extracellular domain GLRSLKEISDGDVIISGNKN EGFR GLRSLKEISDGDVIISGNKN *SVFI g g g a t t a c g c t c c c t c a a g g a g a t a a g t g a t g g a g a t g t g a t a a t t t c a g g a a a c a a a a a c c c t a a t g c g a g g g a g t t c c t c t a t t c a c t a c c t c t a c a c t a t t a a a g t c c t t t g t t t t t

3060 470 475 480 485 extracellular domain LCYANTINWKKLFGTSGQKT EGFR LCYANTINWKKLFGTSGQKT QAICICYVPFFQKPGGTLFG t t t g t g c t a t g c a a a t a c a a t a a a c t g g a a a a a a c t g t t t g g g a c c t c c g g t c a g a a a a c a a a c a c g a t a c g t t t a t g t t a t t t g a c c t t t t t t g a c a a a c c c t g g a g g c c a g t c t t t t g

3120 490 495 500 505 extracellular domain KIISNRGENSCKATGQVCHA EGFR KIISNRGENSCKATGQVCHA FNYAVSTFVAALGCALDAMG c a a a a t t a t a a g c a a c a g a g g t g a a a a c a g c t g c a a g g c c a c a g g c c a g g t c t g c c a t g c g t t t t a a t a t t c g t t g t c t c c a c t t t t g t c g a c g t t c c g g t g t c c g g t c c a g a c g g t a c g

3180 510 515 520 525 extracellular domain LCSPEGCWGPEPRDCVSCRN EGFR LCSPEGCWGPEPRDCVSCRN QAGGLAAPARLGPVADRAPI c t t g t g c t c c c c c g a g g g c t g c t g g g g c c c g g a g c c c a g g g a c t g c g t c t c t t g c c g g a a g a a c a c g a g g g g g c t c c c g a c g a c c c c g g g c c t c g g g t c c c t g a c g c a g a g a a c g g c c t t

3240 530 535 540 545 extracellular domain VSRGRECVDKCNLLEGEPRE EGFR VSRGRECVDKCNLLEGEPRE DASAPFAHVLAVKQLTLWPL t g t c a g c c g a g g c a g g g a a t g c g t g g a c a a g t g c a a c c t t c t g g a g g g t g a g c c a a g g g a a c a g t c g g c t c c g t c c c t t a c g c a c c t g t t c a c g t t g g a a g a c c t c c c a c t c g g t t c c c t

3300 550 555 560 565 extracellular domain FVENSECIQCHPECLPQAMN EGFR FVENSECIQCHPECLPQAMN KHLVRLAYLAVWLAQRLGHV g t t t g t g g a g a a c t c t g a g t g c a t a c a g t g c c a c c c a g a g t g c c t g c c t c a g g c c a t g a a c a a a c a c c t c t t g a g a c t c a c g t a t g t c a c g g t g g g t c t c a c g g a c g g a g t c c g g t a c t t

3360 570 575 580 585 extracellular domain ITCTGRGPDNCIQCAHYIDG EGFR ITCTGRGPDNCIQCAHYIDG DGACSPSWVVTDLTGVVNVA c a t c a c c t g c a c a g g a c g g g g a c c a g a c a a c t g t a t c c a g t g t g c c c a c t a c a t t g a c g g g t a g t g g a c g t g t c c t g c c c c t g g t c t g t t g a c a t a g g t c a c a c g g g t g a t g t a a c t g c c

3420 590 595 600 605 extracellular domain PHCVKTCPAGVMGENNTLVW EGFR PHCVKTCPAGVMGENNTLVW GVADLGARCSDHSFVVGQDP c c c c c a c t g c g t c a a g a c c t g c c c g g c a g g a g t c a t g g g a g a a a a c a a c a c c c t g g t c t g g g g g g t g a c g c a g t t c t g g a c g g g c c g t c c t c a g t a c c c t c t t t t g t t g t g g g a c c a g a c

3480 610 615 620 625 extracellular domain KYADAGHVCHLCHPNCTYGC EGFR KYADAGHVCHLCHPNCTYGC LVCVGAMHAVQAM g a a g t a c g c a g a c g c c g g c c a t g t g t g c c a c c t g t g c c a t c c a a a c t g c a c c t a c g g a t g c t t c a t g c g t c t g c g g c c g g t a c a c a c g g t g g a c a c g g t a g g t t t g a c g t g g a t g c c t a c

3540 630 635 640 645 extracellular domain TGPGLEGCPTNGPKIPS transmembra... IAT EGFR TGPGLEGCPTNGPKIPSIAT c a c t g g g c c a g g t c t t g a a g g c t g t c c a a c g a a t g g g c c t a a g a t c c c g t c c a t c g c c a c g t g a c c c g g t c c a g a a c t t c c g a c a g g t t g c t t a c c c g g a t t c t a g g g c a g g t a g c g g t g

3600 650 655 660 665 transmembrane region GMVGALLLLLVVALGIGLFM EGFR GMVGALLLLLVVALGIGLFM t g g g a t g g t g g g g g c c c t c c t c t t g c t g c t g g t g g t g g c c c t g g g g a t c g g c c t c t t c a t a c c c t a c c a c c c c c g g g a g g a g a a c g a c g a c c a c c a c c g g g a c c c c t a g c c g g a g a a g t a

3660 670 675 680 685 cytoplasmic domain RRRHIVRKRTLRRLLQEREL EGFR RRRHIVRKRTLRRLLQEREL g c g a a g g c g c c a c a t c g t t c g g a a g c g c a c g c t g c g g a g g c t g c t g c a g g a g a g g g a g c t c g c t t c c g c g g t g t a g c a a g c c t t c g c g t g c g a c g c c t c c g a c g a c g t c c t c t c c c t c g a

3720 690 695 700 705 cytoplasmic domain VEPLTPSGEAPNQALLRILK EGFR VEPLTPSGEAPNQALLRILK t g t g g a g c c t c t t a c a c c c a g t g g a g a a g c t c c c a a c c a a g c t c t c t t g a g g a t c t t g a a a c a c c t c g g a g a a t g t g g g t c a c c t c t t c g a g g g t t g g t t c g a g a g a a c t c c t a g a a c t t

3780 710 715 720 725 cytoplasmic domain ETEFKKIKVLGSGAFGTVYK EGFR ETEFKKIKVLGSGAFGTVYK g g a a a c t g a a t t c a a a a a g a t c a a a g t g c t g g g c t c c g g t g c g t t c g g c a c g g t g t a t a a c c t t t g a c t t a a g t t t t t c t a g t t t c a c g a c c c g a g g c c a c g c a a g c c g t g c c a c a t a t t

3840 730 735 740 745 cytoplasmic domain GLWIPEGEKVKIPVAIKELR EGFR GLWIPEGEKVKIPVAIKELR g g g a c t c t g g a t c c c a g a a g g t g a g a a a g t t a a a a t t c c c g t c g c t a t c a a g g a a t t a a g c c c t g a g a c c t a g g g t c t t c c a c t c t t t c a a t t t t a a g g g c a g c g a t a g t t c c t t a a t t c

3900 750 755 760 765 cytoplasmic domain EATSPKANKEILDEAYVMAS EGFR EATSPKANKEILDEAYVMAS a g a a g c a a c a t c t c c g a a a g c c a a c a a g g a a a t c c t c g a t g a a g c c t a c g t g a t g g c c a g t c t t c g t t g t a g a g g c t t t c g g t t g t t c c t t t a g g a g c t a c t t c g g a t g c a c t a c c g g t c

3960 770 775 780 785 cytoplasmic domain VDNPHVCRLLGICLTSTVQL EGFR VDNPHVCRLLGICLTSTVQL c g t g g a c a a c c c c c a c g t g t g c c g c c t g c t g g g c a t c t g c c t c a c c t c c a c c g t g c a g c t g c a c c t g t t g g g g g t g c a c a c g g c g g a c g a c c c g t a g a c g g a g t g g a g g t g g c a c g t c g a

4020 790 795 800 805 cytoplasmic domain ITQLMPFGCLLDYVREHKDN EGFR ITQLMPFGCLLDYVREHKDN c a t c a c g c a g c t c a t g c c c t t c g g c t g c c t c c t g g a c t a t g t c c g g g a a c a c a a a g a c a a g t a g t g c g t c g a g t a c g g g a a g c c g a c g g a g g a c c t g a t a c a g g c c c t t g t g t t t c t g t t

4080 810 815 820 825 cytoplasmic domain IGSQYLLNWCVQIAKGMNYL EGFR IGSQYLLNWCVQIAKGMNYL t a t t g g c t c c c a g t a c c t g c t c a a c t g g t g t g t g c a g a t c g c a a a g g g c a t g a a c t a c t t a t a a c c g a g g g t c a t g g a c g a g t t g a c c a c a c a c g t c t a g c g t t t c c c g t a c t t g a t g a a

4140 830 835 840 845 cytoplasmic domain EDRRLVHRDLVARNVLVKTP EGFR EDRRLVHRDLVARNVLVKTP g g a g g a c c g t c g c t t g g t g c a c c g c g a c c t g g t a g c c a g g a a c g t a c t g g t g a a a a c a c c c c t c c t g g c a g c g a a c c a c g t g g c g c t g g a c c a t c g g t c c t t g c a t g a c c a c t t t t g t g g

4200 850 855 860 865 cytoplasmic domain QHVKITDFGLAKLLGAEEKE EGFR QHVKITDFGLAKLLGAEEKE g c a g c a t g t c a a g a t c a c a g a t t t t g g g c t g g c c a a a c t g c t g g g t g c g g a a g a g a a a g a c g t c g t a c a g t t c t a g t g t c t a a a a c c c g a c c g g t t t g a c g a c c c a c g c c t t c t c t t t c t

4260 870 875 880 885 cytoplasmic domain YHAEGGKVPIKWMALESILH EGFR YHAEGGKVPIKWMALESILH a t a c c a t g c a g a a g g a g g c a a a g t g c c t a t c a a g t g g a t g g c a t t g g a a t c a a t t t t a c a t a t g g t a c g t c t t c c t c c g t t t c a c g g a t a g t t c a c c t a c c g t a a c c t t a g t t a a a a t g t

4320 890 895 900 905 cytoplasmic domain RIYTHQSDVWSYGVTVWELM EGFR RIYTHQSDVWSYGVTVWELM c a g a a t c t a t a c c c a c c a g a g t g a t g t c t g g a g c t a c g g g g t g a c c g t t t g g g a g t t g a t g t c t t a g a t a t g g g t g g t c t c a c t a c a g a c c t c g a t g c c c c a c t g g c a a a c c c t c a a c t a

4380 910 915 920 925 cytoplasmic domain TFGSKPYDGIPASEISSILE EGFR TFGSKPYDGIPASEISSILE g a c c t t t g g a t c c a a g c c a t a t g a c g g a a t c c c t g c c a g c g a g a t c t c c t c c a t c c t g g a c t g g a a a c c t a g g t t c g g t a t a c t g c c t t a g g g a c g g t c g c t c t a g a g g a g g t a g g a c c t

4440 930 935 940 945 cytoplasmic domain KGERLPQPPICTIDVYMIMV EGFR KGERLPQPPICTIDVYMIMV g a a a g g a g a a c g c c t c c c t c a g c c a c c c a t a t g t a c c a t c g a t g t c t a c a t g a t c a t g g t c t t t c c t c t t g c g g a g g g a g t c g g t g g g t a t a c a t g g t a g c t a c a g a t g t a c t a g t a c c a

4500 950 955 960 965 cytoplasmic domain KCWMIDADSRPKFRELIIEF EGFR KCWMIDADSRPKFRELIIEF c a a g t g c t g g a t g a t a g a c g c a g a t a g t c g c c c a a a g t t c c g t g a g t t g a t c a t c g a a t t g t t c a c g a c c t a c t a t c t g c g t c t a t c a g c g g g t t t c a a g g c a c t c a a c t a g t a g c t t a a

NsiI 4560 970 975 980 985 cytoplasmic domain SKMARDPQRYLVIQGDERMH EGFR SKMARDPQRYLVIQGDERMH c t c c a a a a t g g c c c g a g a c c c c c a g c g c t a c c t t g t c a t t c a g g g g g a t g a a a g a a t g c a g a g g t t t t a c c g g g c t c t g g g g g t c g c g a t g g a a c a g t a a g t c c c c c t a c t t t c t t a c g t

4620 990 995 1000 1005 cytoplasmic domain LPSPTDSNFYRALMDEEDMD EGFR LPSPTDSNFYRALMDEEDMD t t t g c c a a g t c c t a c a g a c t c c a a c t t c t a c c g t g c c c t g a t g g a t g a a g a a g a c a t g g a a a a c g g t t c a g g a t g t c t g a g g t t g a a g a t g g c a c g g g a c t a c c t a c t t c t t c t g t a c c t

4680 1010 1015 1020 1025 cytoplasmic domain DVVDADEYLIPQQGFFSSPS EGFR DVVDADEYLIPQQGFFSSPS c g a c g t g g t g g a t g c c g a c g a g t a c c t c a t c c c a c a g c a g g g c t t c t t c a g c a g c c c c t c g c t g c a c c a c c t a c g g c t g c t c a t g g a g t a g g g t g t c g t c c c g a a g a a g t c g t c g g g g a g

4740 1030 1035 1040 1045 cytoplasmic domain TSRTPLLSSLSATSNNSTVA EGFR TSRTPLLSSLSATSNNSTVA c a c g t c a c g g a c t c c c c t c c t g a g c t c t c t g a g t g c a a c c a g c a a c a a t t c c a c c g t g g c g t g c a g t g c c t g a g g g g a g g a c t c g a g a g a c t c a c g t t g g t c g t t g t t a a g g t g g c a c c g

4800 1050 1055 1060 1065 cytoplasmic domain CIDRNGLQSCPIKEDSFLQR EGFR CIDRNGLQSCPIKEDSFLQR t t g c a t t g a t a g a a a t g g g c t g c a a a g c t g t c c c a t c a a g g a a g a c a g c t t c t t g c a g c g a a c g t a a c t a t c t t t a c c c g a c g t t t c g a c a g g g t a g t t c c t t c t g t c g a a g a a c g t c g c

4860 1070 1075 1080 1085 cytoplasmic domain YSSDPTGALTEDSIDDTFLP EGFR YSSDPTGALTEDSIDDTFLP a t a c a g c t c a g a c c c c a c a g g c g c c t t g a c t g a g g a c a g c a t a g a c g a c a c c t t c c t c c c t a t g t c g a g t c t g g g g t g t c c g c g g a a c t g a c t c c t g t c g t a t c t g c t g t g g a a g g a g g g

4920 1090 1095 1100 1105 cytoplasmic domain VPEYINQSVPKRPAGSVQNP EGFR VPEYINQSVPKRPAGSVQNP a g t g c c t g a a t a c a t a a a c c a g t c c g t t c c c a a a a g g c c c g c t g g c t c t g t g c a g a a t c c t c a c g g a c t t a t g t a t t t g g t c a g g c a a g g g t t t t c c g g g c g a c c g a g a c a c g t c t t a g g

4980 1110 1115 1120 1125 cytoplasmic domain VYHNQPLNPAPSRDPHYQDP EGFR VYHNQPLNPAPSRDPHYQDP t g t c t a t c a c a a t c a g c c t c t g a a c c c c g c g c c c a g c a g a g a c c c a c a c t a c c a g g a c c c a c a g a t a g t g t t a g t c g g a g a c t t g g g g c g c g g g t c g t c t c t g g g t g t g a t g g t c c t g g g

5040 1130 1135 1140 1145 cytoplasmic domain HSTAVGNPEYLNTVQPTCVN EGFR HSTAVGNPEYLNTVQPTCVN c c a c a g c a c t g c a g t g g g c a a c c c c g a g t a t c t c a a c a c t g t c c a g c c c a c c t g t g t c a a g g t g t c g t g a c g t c a c c c g t t g g g g c t c a t a g a g t t g t g a c a g g t c g g g t g g a c a c a g t t

5100 1150 1155 1160 1165 cytoplasmic domain STFDSPAHWAQKGSHQISLD EGFR STFDSPAHWAQKGSHQISLD c a g c a c a t t c g a c a g c c c t g c c c a c t g g g c c c a g a a a g g c a g c c a c c a a a t t a g c c t g g a g t c g t g t a a g c t g t c g g g a c g g g t g a c c c g g g t c t t t c c g t c g g t g g t t t a a t c g g a c c t

5160 1170 1175 1180 1185 cytoplasmic domain NPDYQQDFFPKEAKPNGIFK EGFR NPDYQQDFFPKEAKPNGIFK c a a c c c t g a c t a c c a g c a g g a c t t c t t t c c c a a g g a a g c c a a g c c a a a t g g c a t c t t t a a g t t g g g a c t g a t g g t c g t c c t g a a g a a a g g g t t c c t t c g g t t c g g t t t a c c g t a g a a a t t

5220 1190 1195 1200 1205 cytoplasmic domain GSTAENAEYLRVAPQSSEFI EGFR GSTAENAEYLRVAPQSSEFI g g g c t c c a c a g c t g a a a a t g c a g a a t a c c t a a g g g t c g c g c c a c a a a g c a g t g a a t t t a t c c c g a g g t g t c g a c t t t t a c g t c t t a t g g a t t c c c a g c g c g g t g t t t c g t c a c t t a a a t a

5280 1210 cytoplasmic do... GA* EGFR attB2 T3 promoter GA* t g g a g c a t a g a a c c c a g c t t t c t t g t a c a a a g t g g t g a t a t c c a a t t a a c c c t c a c t a a a a c c t c g t a t c t t g g g t c g a a a g a a c a t g t t t c a c c a c t a t a g g t t a a t t g g g a g t g a t t t

5340 T3 promoter T7 promoter attB4 G G G A T A T C A C T C A G C A T A A T T7 g g a t t t a t g t a g t t g a g a g t g a t a a a c c c t a t a g t g a g t c g t a t t a c c a c c c a a c t t t t c c c t a a a t a c a t c a a c t c t c a c t a t t t g g g a t a t c a c t c a g c a t a a t g g t g g g t t g a a a a g

XbaIPspXINotI 5400 attB4 t a t a c a a a g t g g t t g a t a t c c a g c a c a g t g g c g g c c g c t c g a g t c t a g a g g g c c c g c g g t a t a t g t t t c a c c a a c t a t a g g t c g t g t c a c c g c c g g c g a g c t c a g a t c t c c c g g g c g c c a

MluIBstBI 5460 1 5 10 GKPIPNPLLGLDST RTG* V5 tag t c g a a g g t a a g c c t a t c c c t a a c c c t c t c c t c g g t c t c g a t t c t a c g c g t a c c g g t t a g t a g c t t c c a t t c g g a t a g g g a t t g g g a g a g g a g c c a g a g c t a a g a t g c g c a t g g c c a a t c a

5520 IRES IRES a a t g a g a t c c c t c c c c c c c c c c t a a c g t t a c t g g c c g a a g c c g c t t g g a a t a a g g c c g g t t t a c t c t a g g g a g g g g g g g g g g a t t g c a a t g a c c g g c t t c g g c g a a c c t t a t t c c g g c c a

5580 IRES IRES g t g c g t t t g t c t a t a t g t t a t t t t c c a c c a t a t t g c c g t c t t t t g g c a a t g t g a g g g c c c c a c g c a a a c a g a t a t a c a a t a a a a g g t g g t a t a a c g g c a g a a a a c c g t t a c a c t c c c g g g

5640 IRES IRES G A G A G C G G T T T C IRES reverse g g a a a c c t g g c c c t g t c t t c t t g a c g a g c a t t c c t a g g g g t c t t t c c c c t c t c g c c a a a g c c t t t g g a c c g g g a c a g a a g a a c t g c t c g t a a g g a t c c c c a g a a a g g g g a g a g c g g t t t c

5700 IRES IRES C T T A C G IRES reverse g a a t g c a a g g t c t g t t g a a t g t c g t g a a g g a a g c a g t t c c t c t g g a a g c t t c t t g a a g a c c t t a c g t t c c a g a c a a c t t a c a g c a c t t c c t t c g t c a a g g a g a c c t t c g a a g a a c t t c t g

5760 IRES IRES a a a c a a c g t c t g t a g c g a c c c t t t g c a g g c a g c g g a a c c c c c c a c c t g g c g a c a g g t g c c t t t g t t g c a g a c a t c g c t g g g a a a c g t c c g t c g c c t t g g g g g g t g g a c c g c t g t c c a c g g

5820 IRES IRES t c t g c g g c c a a a a g c c a c g t g t a t a a g a t a c a c c t g c a a a g g c g g c a c a a c c c c a g t g c c a g a c g c c g g t t t t c g g t g c a c a t a t t c t a t g t g g a c g t t t c c g c c g t g t t g g g g t c a c g g

5880 IRES IRES T G G C T C T C C T C A A G C G T A T T IRES-F a c g t t g t g a g t t g g a t a g t t g t g g a a a g a g t c a a a t g g c t c t c c t c a a g c g t a t t c a a c a t g c a a c a c t c a a c c t a t c a a c a c c t t t c t c a g t t t a c c g a g a g g a g t t c g c a t a a g t t g t

Acc65IKpnI 5940 IRES IRES a g g g g c t g a a g g a t g c c c a g a a g g t a c c c c a t t g t a t g g g a t c t g a t c t g g g g c c t c g g t t c c c c g a c t t c c t a c g g g t c t t c c a t g g g g t a a c a t a c c c t a g a c t a g a c c c c g g a g c c a

6000 IRES IRES g c a c a t g c t t t a c a t g t g t t t a g t c g a g g t t a a a a a a a c g t c t a g g c c c c c c g a a c c a c g c g t g t a c g a a a t g t a c a c a a a t c a g c t c c a a t t t t t t t g c a g a t c c g g g g g g c t t g g t g c

6060 1 MTEY IRES PuroR IRES T A C T G G C T C A T G Puro-R MATHMTEY g g g a c g t g g t t t t c c t t t g a a a a a c a c g a t g a t a a t a t g g c c a c a c a t a t g a c c g a g t a c c c c t g c a c c a a a a g g a a a c t t t t t g t g c t a c t a t t a t a c c g g t g t g t a t a c t g g c t c a t g

BsiWI 6120 5 10 15 20 KPTVRLATRDDVPRAVRTLA PuroR T T C G G G T G Puro-R KPTVRLATRDDVPRAVRTLA a a g c c c a c g g t g c g c c t c g c c a c c c g c g a c g a c g t c c c c a g g g c c g t a c g c a c c c t c g c c t t c g g g t g c c a c g c g g a g c g g t g g g c g c t g c t g c a g g g g t c c c g g c a t g c g t g g g a g c g g

RsrII 6180 25 30 35 40 AAFADYPATRHTVDPDRHIE PuroR AAFADYPATRHTVDPDRHIE g c c g c g t t c g c c g a c t a c c c c g c c a c g c g c c a c a c c g t c g a t c c g g a c c g c c a c a t c g a g c g g c g c a a g c g g c t g a t g g g g c g g t g c g c g g t g t g g c a g c t a g g c c t g g c g g t g t a g c t c

6240 45 50 55 60 RVTELQELFLTRVGLDIGKV PuroR RVTELQELFLTRVGLDIGKV c g g g t c a c c g a g c t g c a a g a a c t c t t c c t c a c g c g c g t c g g g c t c g a c a t c g g c a a g g t g g c c c a g t g g c t c g a c g t t c t t g a g a a g g a g t g c g c g c a g c c c g a g c t g t a g c c g t t c c a c

6300 65 70 75 80 WVADDGAAVAVWTTPESVEA PuroR WVADDGAAVAVWTTPESVEA t g g g t c g c g g a c g a c g g c g c c g c g g t g g c g g t c t g g a c c a c g c c g g a g a g c g t c g a a g c g a c c c a g c g c c t g c t g c c g c g g c g c c a c c g c c a g a c c t g g t g c g g c c t c t c g c a g c t t c g c

6360 85 90 95 100 GAVFAEIGPRMAELSGSRLA PuroR GAVFAEIGPRMAELSGSRLA g g g g c g g t g t t c g c c g a g a t c g g c c c g c g c a t g g c c g a g t t g a g c g g t t c c c g g c t g g c c c c c c g c c a c a a g c g g c t c t a g c c g g g c g c g t a c c g g c t c a a c t c g c c a a g g g c c g a c c g g

6420 105 110 115 120 AQQQMEGLLAPHRPKEPAWF PuroR AQQQMEGLLAPHRPKEPAWF g c g c a g c a a c a g a t g g a a g g c c t c c t g g c g c c g c a c c g g c c c a a g g a g c c c g c g t g g t t c c g c g t c g t t g t c t a c c t t c c g g a g g a c c g c g g c g t g g c c g g g t t c c t c g g g c g c a c c a a g

6480 125 130 135 140 LATVGVSPDHQGKGLGSAVV PuroR LATVGVSPDHQGKGLGSAVV c t g g c c a c c g t c g g c g t c t c g c c c g a c c a c c a g g g c a a g g g t c t g g g c a g c g c c g t c g t g g a c c g g t g g c a g c c g c a g a g c g g g c t g g t g g t c c c g t t c c c a g a c c c g t c g c g g c a g c a c

6540 145 150 155 160 LPGVEAAERAGVPAFLETSA PuroR LPGVEAAERAGVPAFLETSA c t c c c c g g a g t g g a g g c g g c c g a g c g c g c c g g g g t g c c c g c c t t c c t g g a g a c c t c c g c g g a g g g g c c t c a c c t c c g c c g g c t c g c g c g g c c c c a c g g g c g g a a g g a c c t c t g g a g g c g c

6600 165 170 175 180 PRNLPFYERLGFTVTADVEV PuroR G C A A C C T C C C C T T C T A C G A G C Puro-F PRNLPFYERLGFTVTADVEV c c c c g c a a c c t c c c c t t c t a c g a g c g g c t c g g c t t c a c c g t c a c c g c c g a c g t c g a g g t g g g g g c g t t g g a g g g g a a g a t g c t c g c c g a g c c g a a g t g g c a g t g g c g g c t g c a g c t c c a c

BsaBI* 6660 185 190 195 200 PEGPRTWCMTRKPGA* PuroR PEGPRTWCMTRKPGA* c c c g a a g g a c c g c g c a c c t g g t g c a t g a c c c g c a a g c c c g g t g c c t a a a t c g a t a g a t c c g g g c t t c c t g g c g c g t g g a c c a c g t a c t g g g c g t t c g g g c c a c g g a t t t a g c t a t c t a g g

6720 WPRE A C A A C G WPRE-R t a a t c a a c c t c t g g a t t a c a a a a t t t g t g a a a g a t t g a c t g g t a t t c t t a a c t a t g t t g c a t t a g t t g g a g a c c t a a t g t t t t a a a c a c t t t c t a a c t g a c c a t a a g a a t t g a t a c a a c g

6780 WPRE A G G A A A A T G C G A T A C WPRE-R MPLYHAIASR MLLLP t c c t t t t a c g c t a t g t g g a t a c g c t g c t t t a a t g c c t t t g t a t c a t g c t a t t g c t t c c c g a g g a a a a t g c g a t a c a c c t a t g c g a c g a a a t t a c g g a a a c a t a g t a c g a t a a c g a a g g g c

6840 WPRE MAFIFSSLYKSWLLSLYEEL VWLSFSPPCINPGCCLFMRS t a t g g c t t t c a t t t t c t c c t c c t t g t a t a a a t c c t g g t t g c t g t c t c t t t a t g a g g a g t t a t a c c g a a a g t a a a a g a g g a g g a a c a t a t t t a g g a c c a a c g a c a g a g a a a t a c t c c t c a a

6900 WPRE WPVVRQRGVVCTVFADATPT CGPLSGNVAWCALCLLTQPP g t g g c c c g t t g t c a g g c a a c g t g g c g t g g t g t g c a c t g t g t t t g c t g a c g c a a c c c c c a c c a c c g g g c a a c a g t c c g t t g c a c c g c a c c a c a c g t g a c a c a a a c g a c t g c g t t g g g g g t g

6960 WPRE GWGIATTCQLLSGTFAFPLP LVGALPPPVSSFPGLSLSPS t g g t t g g g g c a t t g c c a c c a c c t g t c a g c t c c t t t c c g g g a c t t t c g c t t t c c c c c t c c c a c c a a c c c c g t a a c g g t g g t g g a c a g t c g a g g a a a g g c c c t g a a a g c g a a a g g g g g a g g g

7020 WPRE IATAELIAACLARCWTGARL LLPRRNSSPPALPAAGQGLG t a t t g c c a c g g c g g a a c t c a t c g c c g c c t g c c t t g c c c g c t g c t g g a c a g g g g c t c g g c t a t a a c g g t g c c g c c t t g a g t a g c g g c g g a c g g a a c g g g c g a c g a c c t g t c c c c g a g c c g a

7080 WPRE (in frame with Factor Xa site) *RGKRPQE LGTDNSVVLSGKSSSFPWLL CWALTIPWCCRGNHRPFLGC g t t g g g c a c t g a c a a t t c c g t g g t g t t g t c g g g g a a a t c a t c g t c c t t t c c t t g g c t g c t c a a c c c g t g a c t g t t a a g g c a c c a c a a c a g c c c c t t t a g t a g c a g g a a a g g a a c c g a c g a

7140 WPRE (in frame with Factor Xa site) GTNGGPNQAPRGEAVDR RGE Factor Xa site ACVATWILRGTSFCYVPSAL SPVLPPGFCAGRPSATSLRP c g c c t g t g t t g c c a c c t g g a t t c t g c g c g g g a c g t c c t t c t g c t a c g t c c c t t c g g c c c t g c g g a c a c a a c g g t g g a c c t a a g a c g c g c c c t g c a g g a a g a c g a t g c a g g g a a g c c g g g a

7200 WPRE 1 I Factor Xa site NPADLPSRGLLPALRPLPRL SIQRTFLPAACCRLCGLFRV c a a t c c a g c g g a c c t t c c t t c c c g c g g c c t g c t g c c g g c t c t g c g g c c t c t t c c g c g t c t g t t a g g t c g c c t g g a a g g a a g g g c g c c g g a c g a c g g c c g a g a c g c c g g a g a a g g c g c a g a

7260 WPRE RLRPQTSRISLWAASPPEIL FAFALRRVGSPFGPPPRLRS t c g c c t t c g c c c t c a g a c g a g t c g g a t c t c c c t t t g g g c c g c c t c c c c g c c t g a g a t c c t a g c g g a a g c g g g a g t c t g c t c a g c c t a g a g g g a a a c c c g g c g g a g g g g c g g a c t c t a g g a

7320 * FKTNDLQGSCRS* t t a a g a c c a a t g a c t t a c a a g g c a g c t g t a g a t c t t a g c c a c t t t t t a a a a g a a a a g g g g a a t t c t g g t t a c t g a a t g t t c c g t c g a c a t c t a g a a t c g g t g a a a a a t t t t c t t t t c c c c

7380 3' LTR ( Δ U3) g g a c t g g a a g g g c t a a t t c a c t c c c a a c g a a g a c a a g a t c t g c t t t t t g c t t g t a c t g g g c c t g a c c t t c c c g a t t a a g t g a g g g t t g c t t c t g t t c t a g a c g a a a a a c g a a c a t g a c c c

7440 3' LTR ( Δ U3) t c t c t c t g g t t a g a c c a g a t c t g a g c c t g g g a g c t c t c t g g c t a a c t a g g g a a c c c a c t g a g a g a g a c c a a t c t g g t c t a g a c t c g g a c c c t c g a g a g a c c g a t t g a t c c c t t g g g t g a c

7500 3' LTR ( Δ U3) c t t a a g c c t c a a t a a a g c t t g c c t t g a g t g c t t c a a g t a g t g t g t g c c c g t c t g t t g t g t g a a t t c g g a g t t a t t t c g a a c g g a a c t c a c g a a g t t c a t c a c a c a c g g g c a g a c a a c a c a

7560 3' LTR ( Δ U3) g a c t c t g g t a a c t a g a g a t c c c t c a g a c c c t t t t a g t c a g t g t g g a a a a t c t c t a g c a g t c t g a g a c c a t t g a t c t c t a g g g a g t c t g g g a a a a t c a g t c a c a c c t t t t a g a g a t c g t c a

7620 a g t a g t t c a t g t c a t c t t a t t a t t c a g t a t t t a t a a c t t g c a a a g a a a t g a a t a t c a g a g t c a t c a a g t a c a g t a g a a t a a t a a g t c a t a a a t a t t g a a c g t t t c t t t a c t t a t a g t c t c

PacI 7680 C G G A pBRforEco a g t g a g a g g c c c g g g t t a a t t a a g g a a a g g g c t a g a t c a t t c t t g a a g a c g a a a g g g c c t t c a c t c t c c g g g c c c a a t t a a t t c c t t t c c c g a t c t a g t a a g a a c t t c t g c t t t c c c g g a

7740 G C A C T A T G C G G A T A A pBRforEco c g t g a t a c g c c t a t t t t t a t a g g t t a a t g t c a t g a t a a t a a t g g t t t c t t a g a c g t c a g g g c a c t a t g c g g a t a a a a a t a t c c a a t t a c a g t a c t a t t a t t a c c a a a g a a t c t g c a g t c c

7800 AmpR promoter t g g c a c t t t t c g g g g a a a t g t g c g c g g a a c c c c t a t t t g t t t a t t t t t c t a a a t a c a t t c a c c g t g a a a a g c c c c t t t a c a c g c g c c t t g g g g a t a a a c a a a t a a a a a g a t t t a t g t a a g

7860 AmpR promoter a a a t a t g t a t c c g c t c a t g a g a c a a t a a c c c t g a t a a a t g c t t c a a t a a t a t t g a a a a a g t t t a t a c a t a g g c g a g t a c t c t g t t a t t g g g a c t a t t t a c g a a g t t a t t a t a a c t t t t t c

7920 1 5 10 15 signal sequence MSIQHFRVALIPFFAAFC AmpR promoter AmpR MSIQHFRVALIPFFAAFC g a a g a g t a t g a g t a t t c a a c a t t t c c g t g t c g c c c t t a t t c c c t t t t t t g c g g c a t t t t g c t t c t c a t a c t c a t a a g t t g t a a a g g c a c a g c g g g a a t a a g g g a a a a a a c g c c g t a a a a c

7980 20 signal sequence LPVFA 25 30 35 HPETLVKVKDAEDQL AmpR LPVFAHPETLVKVKDAEDQL c c t t c c t g t t t t t g c t c a c c c a g a a a c g c t g g t g a a a g t a a a a g a t g c t g a a g a t c a g t t g g a a g g a c a a a a a c g a g t g g g t c t t t g c g a c c a c t t t c a t t t t c t a c g a c t t c t a g t c a a

8040 40 45 50 55 GARVGYIELDLNSGKILESF AmpR GARVGYIELDLNSGKILESF g g g t g c a c g a g t g g g t t a c a t c g a a c t g g a t c t c a a c a g c g g t a a g a t c c t t g a g a g t t t c c c a c g t g c t c a c c c a a t g t a g c t t g a c c t a g a g t t g t c g c c a t t c t a g g a a c t c t c a a a

8100 60 65 70 75 RPEERFPMMSTFKVLLCGAV AmpR RPEERFPMMSTFKVLLCGAV t c g c c c c g a a g a a c g t t t t c c a a t g a t g a g c a c t t t t a a a g t t c t g c t a t g t g g c g c g g t a g c g g g g c t t c t t g c a a a a g g t t a c t a c t c g t g a a a a t t t c a a g a c g a t a c a c c g c g c c a

8160 80 85 90 95 LSRVDAGQEQLGRRIHYSQN AmpR LSRVDAGQEQLGRRIHYSQN a t t a t c c c g t g t t g a c g c c g g g c a a g a g c a a c t c g g t c g c c g c a t a c a c t a t t c t c a g a a t a a t a g g g c a c a a c t g c g g c c c g t t c t c g t t g a g c c a g c g g c g t a t g t g a t a a g a g t c t t

8220 100 105 110 115 DLVEYSPVTEKHLTDGMTVR AmpR DLVEYSPVTEKHLTDGMTVR t g a c t t g g t t g a g t a c t c a c c a g t c a c a g a a a a g c a t c t t a c g g a t g g c a t g a c a g t a a g a c t g a a c c a a c t c a t g a g t g g t c a g t g t c t t t t c g t a g a a t g c c t a c c g t a c t g t c a t t c

8280 120 125 130 135 ELCSAAITMSDNTAANLLLT AmpR ELCSAAITMSDNTAANLLLT a g a a t t a t g c a g t g c t g c c a t a a c c a t g a g t g a t a a c a c t g c g g c c a a c t t a c t t c t g a c t c t t a a t a c g t c a c g a c g g t a t t g g t a c t c a c t a t t g t g a c g c c g g t t g a a t g a a g a c t g

PvuI 8340 140 145 150 155 TIGGPKELTAFLHNMGDHVT AmpR TIGGPKELTAFLHNMGDHVT a a c g a t c g g a g g a c c g a a g g a g c t a a c c g c t t t t t t g c a c a a c a t g g g g g a t c a t g t a a c t t g c t a g c c t c c t g g c t t c c t c g a t t g g c g a a a a a a c g t g t t g t a c c c c c t a g t a c a t t g

8400 160 165 170 175 RLDRWEPELNEAIPNDERDT AmpR RLDRWEPELNEAIPNDERDT t c g c c t t g a t c g t t g g g a a c c g g a g c t g a a t g a a g c c a t a c c a a a c g a c g a g c g t g a c a c a g c g g a a c t a g c a a c c c t t g g c c t c g a c t t a c t t c g g t a t g g t t t g c t g c t c g c a c t g t g

FspI 8460 180 185 190 195 TMPVAMATTLRKLLTGELLT AmpR TMPVAMATTLRKLLTGELLT c a c g a t g c c t g t a g c a a t g g c a a c a a c g t t g c g c a a a c t a t t a a c t g g c g a a c t a c t t a c g t g c t a c g g a c a t c g t t a c c g t t g t t g c a a c g c g t t t g a t a a t t g a c c g c t t g a t g a a t g

8520 200 205 210 215 LASRQQLIDWMEADKVAGPL AmpR LASRQQLIDWMEADKVAGPL t c t a g c t t c c c g g c a a c a a t t a a t a g a c t g g a t g g a g g c g g a t a a a g t t g c a g g a c c a c t a g a t c g a a g g g c c g t t g t t a a t t a t c t g a c c t a c c t c c g c c t a t t t c a a c g t c c t g g t g a

8580 220 225 230 235 LRSALPAGWFIADKSGAGER AmpR LRSALPAGWFIADKSGAGER t c t g c g c t c g g c c c t t c c g g c t g g c t g g t t t a t t g c t g a t a a a t c t g g a g c c g g t g a g c g a g a c g c g a g c c g g g a a g g c c g a c c g a c c a a a t a a c g a c t a t t t a g a c c t c g g c c a c t c g c

8640 240 245 250 255 GSRGIIAALGPDGKPSRIVV AmpR GSRGIIAALGPDGKPSRIVV t g g g t c t c g c g g t a t c a t t g c a g c a c t g g g g c c a g a t g g t a a g c c c t c c c g t a t c g t a g t a c c c a g a g c g c c a t a g t a a c g t c g t g a c c c c g g t c t a c c a t t c g g g a g g g c a t a g c a t c a

8700 260 265 270 275 IYTTGSQATMDERNRQIAEI AmpR IYTTGSQATMDERNRQIAEI t a t c t a c a c g a c g g g g a g t c a g g c a a c t a t g g a t g a a c g a a a t a g a c a g a t c g c t g a g a t a t a g a t g t g c t g c c c c t c a g t c c g t t g a t a c c t a c t t g c t t t a t c t g t c t a g c g a c t c t a

8760 280 285 GASLIKHW* AmpR GASLIKHW* a g g t g c c t c a c t g a t t a a g c a t t g g t a a c t g t c a g a c c a a g t t t a c t c a t a t a t a c t t t a t c c a c g g a g t g a c t a a t t c g t a a c c a t t g a c a g t c t g g t t c a a a t g a g t a t a t a t g a a a t

8820 g a t t g a t t t a a a a c t t c a t t t t t a a t t t a a a a g g a t c t a g g t g a a g a t c c t t t t t g a t a a c t a a c t a a a t t t t g a a g t a a a a a t t a a a t t t t c c t a g a t c c a c t t c t a g g a a a a a c t a t t

8880 t c t c a t g a c c a a a a t c c c t t a a c g t g a g t t t t c g t t c c a c t g a g c g t c a g a c c c c g t a g a a g a g t a c t g g t t t t a g g g a a t t g c a c t c a a a a g c a a g g t g a c t c g c a g t c t g g g g c a t c t

8940 ori a a a g a t c a a a g g a t c t t c t t g a g a t c c t t t t t t t c t g c g c g t a a t c t g c t g c t t g c a a a c t t t c t a g t t t c c t a g a a g a a c t c t a g g a a a a a a a g a c g c g c a t t a g a c g a c g a a c g t t t g

9000 ori a a a a a a a c c a c c g c t a c c a g c g g t g g t t t g t t t g c c g g a t c a a g a g c t a c c a a c t c t t t t t t t t t t t g g t g g c g a t g g t c g c c a c c a a a c a a a c g g c c t a g t t c t c g a t g g t t g a g a a a a

9060 ori t c c g a a g g t a a c t g g c t t c a g c a g a g c g c a g a t a c c a a a t a c t g t t c t t c t a g t g t a g c c a g g c t t c c a t t g a c c g a a g t c g t c t c g c g t c t a t g g t t t a t g a c a a g a a g a t c a c a t c g g

9120 ori g t a g t t a g g c c a c c a c t t c a a g a a c t c t g t a g c a c c g c c t a c a t a c c t c g c t c t g c t a a t c a t c a a t c c g g t g g t g a a g t t c t t g a g a c a t c g t g g c g g a t g t a t g g a g c g a g a c g a t t a

9180 ori c c t g t t a c c a g t g g c t g c t g c c a g t g g c g a t a a g t c g t g t c t t a c c g g g t t g g a c t c a a g g g a c a a t g g t c a c c g a c g a c g g t c a c c g c t a t t c a g c a c a g a a t g g c c c a a c c t g a g t t c

9240 ori a c g a t a g t t a c c g g a t a a g g c g c a g c g g t c g g g c t g a a c g g g g g g t t c g t g c a c a c a g c c t g c t a t c a a t g g c c t a t t c c g c g t c g c c a g c c c g a c t t g c c c c c c a a g c a c g t g t g t c g g

9300 ori c a g c t t g g a g c g a a c g a c c t a c a c c g a a c t g a g a t a c c t a c a g c g t g a g c t a t g a g a a a g g t c g a a c c t c g c t t g c t g g a t g t g g c t t g a c t c t a t g g a t g t c g c a c t c g a t a c t c t t t c

9360 ori c g c c a c g c t t c c c g a a g g g a g a a a g g c g g a c a g g t a t c c g g t a a g c g g c a g g g t c g g a a c g c g g t g c g a a g g g c t t c c c t c t t t c c g c c t g t c c a t a g g c c a t t c g c c g t c c c a g c c t t g

9420 ori G G G A A A C G C C T G G T A T C T T T pBR322ori-F a g g a g a g c g c a c g a g g g a g c t t c c a g g g g g a a a c g c c t g g t a t c t t t a t a g t c c t g t c g g t c c t c t c g c g t g c t c c c t c g a a g g t c c c c c t t t g c g g a c c a t a g a a a t a t c a g g a c a g c c

9480 ori g t t t c g c c a c c t c t g a c t t g a g c g t c g a t t t t t g t g a t g c t c g t c a g g g g g g c g g a g c c t c a a a g c g g t g g a g a c t g a a c t c g c a g c t a a a a a c a c t a c g a g c a g t c c c c c c g c c t c g g a

9540 ori a t g g a a a a a c g c c a g c a a c g c g g c c t t t t t a c g g t t c c t g g c c t t t t g c t g g c c t t t t g c t a c c t t t t t g c g g t c g t t g c g c c g g a a a a a t g c c a a g g a c c g g a a a a c g a c c g g a a a a c g

9600 t c a c a t g t t c t t t c c t g c g t t a t c c c c t g a t t c t g t g g a t a a c c g t a t t a c c g c c t t t g a a g t g t a c a a g a a a g g a c g c a a t a g g g g a c t a a g a c a c c t a t t g g c a t a a t g g c g g a a a c t

9660 A G C G A G T C A G T G A G C G A G L4440 g t g a g c t g a t a c c g c t c g c c g c a g c c g a a c g a c c g a g c g c a g c g a g t c a g t g a g c g a g g a c a c t c g a c t a t g g c g a g c g g c g t c g g c t t g c t g g c t c g c g t c g c t c a g t c a c t c g c t c c t

9720 a g c g g a a g a g c g c c c a a t a c g c a a a c c g c c t c t c c c c g c g c g t t g g c c g a t t c a t t a a t g t c g c c t t c t c g c g g g t t a t g c g t t t g g c g g a g a g g g g c g c g c a a c c g g c t a a g t a a t t a c

SfiI 9780 c a g c a a g c t c a t g g c t g a c t a a t t t t t t t t a t t t a t g c a g a g g c c g a g g c c g c c t c g g c c g t c g t t c g a g t a c c g a c t g a t t a a a a a a a a t a a a t a c g t c t c c g g c t c c g g c g g a g c c g g

9840 t c t g a g c t a t t c c a g a a g t a g t g a g g a g g c t t t t t t g g a g g c c t a g g c t t t t g c a a a a a g a g a c t c g a t a a g g t c t t c a t c a c t c c t c c g a a a a a a c c t c c g g a t c c g a a a a c g t t t t t c

9900 CAP binding site c t c c c c g t g g c a c g a c a g g t t t c c c g a c t g g a a a g c g g g c a g t g a g c g c a a c g c a a t t a a g a g g g g c a c c g t g c t g t c c a a a g g g c t g a c c t t t c g c c c g t c a c t c g c g t t g c g t t a a t t

9960 -35 -10 CAP binding site lac promoter t g t g a g t t a g c t c a c t c a t t a g g c a c c c c a g g c t t t a c a c t t t a t g c t t c c g g c t c g t a t a c a c t c a a t c g a g t g a g t a a t c c g t g g g g t c c g a a a t g t g a a a t a c g a a g g c c g a g c a t a

10,020 -10 lac promoter lac operator M13 rev A G C G G A T A A C A A T T T C A C A C A G G M13/pUC Reverse C A G G A A A C A G C T A T G A C M13 Reverse g t t g t g t g g a a t t g t g a g c g g a t a a c a a t t t c a c a c a g g a a a c a g c t a t g a c a t g a t t a c c a a c a c a c c t t a a c a c t c g c c t a t t g t t a a a g t g t g t c c t t t g t c g a t a c t g t a c t a a t g

10,080 G T G G T T T G T C C A A A C T C A T EBV-rev g a a t t t c a c a a a t a a a g c a t t t t t t t c a c t g c a t t c t a g t t g t g g t t t g t c c a a a c t c a t c t t a a a g t g t t t a t t t c g t a a a a a a a g t g a c g t a a g a t c a a c a c c a a a c a g g t t t g a g t a

10,140 C EBV-rev c a a t g t a t c t t a t c a t g t c t g g a t c a a c t g g a t a a c t c a a g c t a a c c a a a a t c a t c c c a a g t t a c a t a g a a t a g t a c a g a c c t a g t t g a c c t a t t g a g t t c g a t t g g t t t t a g t a g g g t t

10,200 a c t t c c c a c c c c a t a c c c t a t t a c c a c t g c c a a t t a c c t g t g g t t t c a t t t a c a t t c c t c t g a a g g g t g g g g t a t g g g a t a a t g g t g a c g g t t a a t g g a c a c c a a a g t a a a t g t a a g g a g

10,260 t g a a t t a t t t t c a t t t t a a a g a a a t t g t a t t t g t t a a a t a t g t a c t a c a a a c t t a g t a g t a c t t a a t a a a a g t a a a a t t t c t t t a a c a t a a a c a a t t t a t a c a t g a t g t t t g a a t c a t c a

10,320 3' LTR t g g a a g g g c t a a t t c a c t c c c a a a g a a g a c a a g a t a t c c t t g a t c t g t g g a t c t a c c a c a a c c t t c c c g a t t a a g t g a g g g t t t c t t c t g t t c t a t a g g a a c t a g a c a c c t a g a t g g t g t

10,380 3' LTR c a c a a g g c t a c t t c c c t g a t t a g c a g a a c t a c a c a c c a g g g c c a g g g g t c a g a t a t c c a c g t g t t c c g a t g a a g g g a c t a a t c g t c t t g a t g t g t g g t c c c g g t c c c c a g t c t a t a g g t g

10,440 3' LTR t g a c c t t t g g a t g g t g c t a c a a g c t a g t a c c a g t t g a g c c a g a t a a g g t a g a a g a g g c c a a c t g g a a a c c t a c c a c g a t g t t c g a t c a t g g t c a a c t c g g t c t a t t c c a t c t t c t c c g g t

10,500 3' LTR a t a a a g g a g a g a a c a c c a g c t t g t t a c a c c c t g t g a g c c t g c a t g g g a t g g a t g a c c c g g t a t t t c c t c t c t t g t g g t c g a a c a a t g t g g g a c a c t c g g a c g t a c c c t a c c t a c t g g g c c

10,560 3' LTR a g a g a g a a g t g t t a g a g t g g a g g t t t g a c a g c c g c c t a g c a t t t c a t c a c g t g g c c c g a g t c t c t c t t c a c a a t c t c a c c t c c a a a c t g t c g g c g g a t c g t a a a g t a g t g c a c c g g g c t c

10,620 3' LTR a g c t g c a t c c g g a g t a c t t c a a g a a c t g c t g a t a t c g a g c t t g c t a c a a g g g a c t t t c c g t c g a c g t a g g c c t c a t g a a g t t c t t g a c g a c t a t a g c t c g a a c g a t g t t c c c t g a a a g g c

10,680 3' LTR c t g g g g a c t t t c c a g g g a g g c g t g g c c t g g g c g g g a c t g g g g a g t g g c g a g c c c t c a g a t g a c c c c t g a a a g g t c c c t c c g c a c c g g a c c c g c c c t g a c c c c t c a c c g c t c g g g a g t c t a

10,740 3' LTR c c t g c a t a t a a g c a g c t g c t t t t t g c c t g t a c t g g g t c t c t c t g g t t a g a c c a g a t c t g a g g a c g t a t a t t c g t c g a c g a a a a a c g g a c a t g a c c c a g a g a g a c c a a t c t g g t c t a g a c t

10,800 3' LTR g c c t g g g a g c t c t c t g g c t a a c t a g g g a a c c c a c t g c t t a a g c c t c a a t a a a g c t t g c c t c g g a c c c t c g a g a g a c c g a t t g a t c c c t t g g g t g a c g a a t t c g g a g t t a t t t c g a a c g g a

10,860 3' LTR t g a g t g c t t c a a g t a g t g t g t g c c c g t c t g t t g t g t g a c t c t g g t a a c t a g a g a t c c c t c a c t c a c g a a g t t c a t c a c a c a c g g g c a g a c a a c a c a c t g a g a c c a t t g a t c t c t a g g g a g

10,920 3' LTR a g a c c c t t t t a g t c a g t g t g g a a a a t c t c t a g c a g t g g c g c c c g a a c a g g g a c t t g a a a g t c t g g g a a a a t c a g t c a c a c c t t t t a g a g a t c g t c a c c g c g g g c t t g t c c c t g a a c t t t c

10,980 HIV-1 Ψ c g a a a g g g a a a c c a g a g g a g c t c t c t c g a c g c a g g a c t c g g c t t g c t g a a g c g c g c a c g g g c t t t c c c t t t g g t c t c c t c g a g a g a g c t g c g t c c t g a g c c g a a c g a c t t c g c g c g t g c c

11,040 HIV-1 Ψ c a a g a g g c g a g g g g c g g c g a c t g g t g a g t a c g c c a a a a a t t t t g a c t a g c g g a g g c t a g a g t t c t c c g c t c c c c g c c g c t g a c c a c t c a t g c g g t t t t t a a a a c t g a t c g c c t c c g a t c t

NruI* 11,100 HIV-1 Ψ a g g a g a g a g a t g g g t g c g a g a g c g t c a g t a t t a a g c g g g g g a g a a t t a g a t c g c g a t g g g t c c t c t c t c t a c c c a c g c t c t c g c a g t c a t a a t t c g c c c c c t c t t a a t c t a g c g c t a c c c

11,160 a a a a a a t t c g g t t a a g g c c a g g g g g a a a g a a a a a a t a t a a a t t a a a a c a t a t a g t a t g g g t t t t t t a a g c c a a t t c c g g t c c c c c t t t c t t t t t t a t a t t t a a t t t t g t a t a t c a t a c c c

11,220 c a a g c a g g g a g c t a g a a c g a t t c g c a g t t a a t c c t g g c c t g t t a g a a a c a t c a g a a g g c t g t t c g t c c c t c g a t c t t g c t a a g c g t c a a t t a g g a c c g g a c a a t c t t t g t a g t c t t c c g a

11,280 *I g t a g a c a a a t a c t g g g a c a g c t a c a a c c a t c c c t t c a g a c a g g a t c a g a a g a a c t t a g a t c a t c t g t t t a t g a c c c t g t c g a t g t t g g t a g g g a a g t c t g t c c t a g t c t t c t t g a a t c t a

11,340 MIYYLLLGRNHADFSLSLLC c a t t a t a t a a t a c a g t a g c a a c c c t c t a t t g t g t g c a t c a a a g g a t a g a g a t a a a a g a c a g t a a t a t a t t a t g t c a t c g t t g g g a g a t a a c a c a c g t a g t t t c c t a t c t c t a t t t t c t g t

11,400 WPLKLCSLPLAFCFYSWRVA c c a a g g a a g c t t t a g a c a a g a t a g a g g a a g a g c a a a a c a a a a g t a a g a c c a c c g c a c a g c g g t t c c t t c g a a a t c t g t t c t a t c t c c t t c t c g t t t t g t t t t c a t t c t g g t g g c g t g t c g

11,460 MRDNWRS LPRGSIKLGPPPSILSLQLL a a g c g g c c g g c c g c t g a t c t t c a g a c c t g g a g g a g g a g a t a t g a g g g a c a a t t g g a g a a g t t c g c c g g c c g g c g a c t a g a a g t c t g g a c c t c c t c c t c t a t a c t c c c t g t t a a c c t c t t c

11,520 ELYKYKVVKIEPLGVAPTKA SNYLYLTTFISGNPTAGVLA t g a a t t a t a t a a a t a t a a a g t a g t a a a a a t t g a a c c a t t a g g a g t a g c a c c c a c c a a g g c a c t t a a t a t a t t t a t a t t t c a t c a t t t t t a a c t t g g t a a t c c t c a t c g t g g g t g g t t c c g

11,580 RRE KRRVVQREKRAVGIGALFLG FLLTTCLSFLATPIPAKNRP a a a g a g a a g a g t g g t g c a g a g a g a a a a a a g a g c a g t g g g a a t a g g a g c t t t g t t c c t t g g t t t c t c t t c t c a c c a c g t c t c t c t t t t t t c t c g t c a c c c t t a t c c t c g a a a c a a g g a a c c

11,640 RRE FLGAAGSTMGAASMTLTVQA NKPAAPLVIPAADIVSVTCA g t t c t t g g g a g c a g c a g g a a g c a c t a t g g g c g c a g c g t c a a t g a c g c t g a c g g t a c a g g c c a a g a a c c c t c g t c g t c c t t c g t g a t a c c c g c g t c g c a g t t a c t g c g a c t g c c a t g t c c g

11,700 RRE RQLLSGIVQQQNNLLRAIEA LCNNDPITCCCFLKSLAISA c a g a c a a t t a t t g t c t g g t a t a g t g c a g c a g c a g a a c a a t t t g c t g a g g g c t a t t g a g g c g t c t g t t a a t a a c a g a c c a t a t c a c g t c g t c g t c t t g t t a a a c g a c t c c c g a t a a c t c c g

11,760 RRE QQHLLQLTVWGIKQLQARIL CCCRNCSVTQPM g c a a c a g c a t c t g t t g c a a c t c a c a g t c t g g g g c a t c a a g c a g c t c c a g g c a a g a a t c c t c g t t g t c g t a g a c a a c g t t g a g t g t c a g a c c c c g t a g t t c g t c g a g g t c c g t t c t t a g g a

11,820 RRE AVERYLKDQQLLGIWGCSGK g g c t g t g g a a a g a t a c c t a a a g g a t c a a c a g c t c c t g g g g a t t t g g g g t t g c t c t g g a a a c c g a c a c c t t t c t a t g g a t t t c c t a g t t g t c g a g g a c c c c t a a a c c c c a a c g a g a c c t t t

11,880 LICTTAVPWNASWSNKSLEQ a c t c a t t t g c a c c a c t g c t g t g c c t t g g a a t g c t a g t t g g a g t a a t a a a t c t c t g g a a c a t g a g t a a a c g t g g t g a c g a c a c g g a a c c t t a c g a t c a a c c t c a t t a t t t a g a g a c c t t g t

11,940 IWNHTTWMEWDREINNYTSL g a t t t g g a a t c a c a c g a c c t g g a t g g a g t g g g a c a g a g a a a t t a a c a a t t a c a c a a g c t t c t a a a c c t t a g t g t g c t g g a c c t a c c t c a c c c t g t c t c t t t a a t t g t t a a t g t g t t c g a a

12,000 1 5 KNEQELL gp41 peptide IHSLIEESQNQQEKNEQELL a a t a c a c t c c t t a a t t g a a g a a t c g c a a a a c c a g c a a g a a a a g a a t g a a c a a g a a t t a t t t t a t g t g a g g a a t t a a c t t c t t a g c g t t t t g g t c g t t c t t t t c t t a c t t g t t c t t a a t a a

12,060 10 15 ELDKWASL (in frame with gp41 peptide) WNWFNITNWLWY gp41 peptide ELDKWASLWNWFNITNWLWY g g a a t t a g a t a a a t g g g c a a g t t t g t g g a a t t g g t t t a a c a t a a c a a a t t g g c t g t g g t a c c t t a a t c t a t t t a c c c g t t c a a a c a c c t t a a c c a a a t t g t a t t g t t t a a c c g a c a c c a t

3ʹ 5ʹ 12,110 (in frame with gp41 peptide) IKLFIMIVGGLVGLRI IKLFIMIVGGLVGLRI t a t a a a a t t a t t c a t a a t g a t a g t a g g a g g c t t g g t a g g t t t a a g a a t a g a t a t t t t a a t a a g t a t t a c t a t c a t c c t c c g a a c c a t c c a a a t t c t t a t c

Restriction Enzymes

Instructions: By default, all cutters are shown. Filter on number of cut sites or search by enzyme name.

Filter

Features

Primers

BLAST

BLAST (Basic Local Alignment Search Tool) finds regions of similarity between biological sequences. Click on the buttons below to submit a BLAST search to NCBI. The results will appear in a new window. See your recent BLAST results on NCBI's website.

  • Nucleotide-Nucleotide BLAST (BLASTN)

  • Translated Nucleotide-Protein BLAST (BLASTX)

  • Sequence alignment using BLAST (BLAST2)

Sequence Analyzer Guide

Map

Displays a graphical map based on nucleotide sequence data labeled with restriction enzymes, plasmid features, ORFs (theoretical open reading frames) and primers. Hovering over data labels will display additional information (e.g. cut site)

To select a portion of sequence, click one location on the plasmid and then a second location to display the sequence between the two locations.

Sequence

Displays both strands of base paired nucleotide sequences with annotated enzymes, plasmid features, ORFs (theoretical open reading frames) and primers. Hovering over data labels will display additional information (e.g. cut site).

To select a portion of sequence, click one location on the sequence and then a second location to display the sequence between the two locations.

Enzymes

List of restriction enzymes that can cut a given nucleotide sequence. Table lists enzyme name and the sequence location of the cut.

Features

List of common features detected in a given nucleotide sequence. Table lists feature name, location, size, color used to indicate its position on the map, and direction (if relevant).

Primers

List of commonly used primers detected in a given nucleotide sequence. Table lists primer name, sequence, length, binding site location, and direction.

BLAST

Use Basic Local Alignment Search Tool (BLAST) via the NCBI website to determine similarity between a given sequence and nucleotide (BLASTN) or protein (BLASTX) sequence databases. Additionally, align a custom nucleotide sequence against a given sequence using BLAST2.

File Downloads

GenBank

File contains the nucleotide sequence and annotated features in GenBank flat file format. Open the file with a text editor or plasmid mapping software to view the sequence.

SnapGene

File contains the nucleotide sequence and enhanced annotations from SnapGene Server. Open the file with SnapGene software or the free Viewer to view the plasmid map, sequence, and perform additional sequence analysis.