Addgene pHAGE-EGFR-P596S Sequencing Result - Sequence Analyzer Skip to main content
Addgene

Sequence Analyzer: pHAGE-EGFR-P596S Sequencing Result


Map View

M13 rev M13 Reverse (11,846 .. 11,862) lac operator M13/pUC Reverse (11,827 .. 11,849) CAP binding site SfiI (11,625) L4440 (11,491 .. 11,508) pBR322ori-F (11,238 .. 11,257) FspI (10,282) PvuI (10,136) pBRforEco (9527 .. 9545) PacI (9490) WPRE-R (8565 .. 8585) BsaBI * (8510) CsiI - SexAI * (8466) Puro-F (8395 .. 8415) RsrII (8015) BsiWI (7955) Puro-R (7899 .. 7918) KpnI (7757) Acc65I (7753) IRES-F (7706 .. 7725) IRES reverse (7479 .. 7496) MluI (7295) BstBI (7251) XbaI (7234) PspXI (7228) NotI (7222) T7 (7157 .. 7176) T7 promoter T3 promoter NsiI (6410) EBV-rev (11,912 .. 11,931) NruI * (833) KflI (1934) SpeI (2212) EF1a-F (3349 .. 3369) pHAGE-EGFR-P596S 12,122 bp

Sequence View

5ʹ 3ʹ 60 3' LTR t g g a a g g g c t a a t t c a c t c c c a a a g a a g a c a a g a t a t c c t t g a t c t g t g g a t c t a c c a c a a c c t t c c c g a t t a a g t g a g g g t t t c t t c t g t t c t a t a g g a a c t a g a c a c c t a g a t g g t g t

120 3' LTR c a c a a g g c t a c t t c c c t g a t t a g c a g a a c t a c a c a c c a g g g c c a g g g g t c a g a t a t c c a c g t g t t c c g a t g a a g g g a c t a a t c g t c t t g a t g t g t g g t c c c g g t c c c c a g t c t a t a g g t g

180 3' LTR t g a c c t t t g g a t g g t g c t a c a a g c t a g t a c c a g t t g a g c c a g a t a a g g t a g a a g a g g c c a a c t g g a a a c c t a c c a c g a t g t t c g a t c a t g g t c a a c t c g g t c t a t t c c a t c t t c t c c g g t

240 3' LTR a t a a a g g a g a g a a c a c c a g c t t g t t a c a c c c t g t g a g c c t g c a t g g g a t g g a t g a c c c g g t a t t t c c t c t c t t g t g g t c g a a c a a t g t g g g a c a c t c g g a c g t a c c c t a c c t a c t g g g c c

300 3' LTR a g a g a g a a g t g t t a g a g t g g a g g t t t g a c a g c c g c c t a g c a t t t c a t c a c g t g g c c c g a g t c t c t c t t c a c a a t c t c a c c t c c a a a c t g t c g g c g g a t c g t a a a g t a g t g c a c c g g g c t c

360 3' LTR a g c t g c a t c c g g a g t a c t t c a a g a a c t g c t g a t a t c g a g c t t g c t a c a a g g g a c t t t c c g t c g a c g t a g g c c t c a t g a a g t t c t t g a c g a c t a t a g c t c g a a c g a t g t t c c c t g a a a g g c

420 3' LTR c t g g g g a c t t t c c a g g g a g g c g t g g c c t g g g c g g g a c t g g g g a g t g g c g a g c c c t c a g a t g a c c c c t g a a a g g t c c c t c c g c a c c g g a c c c g c c c t g a c c c c t c a c c g c t c g g g a g t c t a

480 3' LTR c c t g c a t a t a a g c a g c t g c t t t t t g c c t g t a c t g g g t c t c t c t g g t t a g a c c a g a t c t g a g g a c g t a t a t t c g t c g a c g a a a a a c g g a c a t g a c c c a g a g a g a c c a a t c t g g t c t a g a c t

540 3' LTR g c c t g g g a g c t c t c t g g c t a a c t a g g g a a c c c a c t g c t t a a g c c t c a a t a a a g c t t g c c t c g g a c c c t c g a g a g a c c g a t t g a t c c c t t g g g t g a c g a a t t c g g a g t t a t t t c g a a c g g a

600 3' LTR t g a g t g c t t c a a g t a g t g t g t g c c c g t c t g t t g t g t g a c t c t g g t a a c t a g a g a t c c c t c a c t c a c g a a g t t c a t c a c a c a c g g g c a g a c a a c a c a c t g a g a c c a t t g a t c t c t a g g g a g

660 3' LTR a g a c c c t t t t a g t c a g t g t g g a a a a t c t c t a g c a g t g g c g c c c g a a c a g g g a c t t g a a a g t c t g g g a a a a t c a g t c a c a c c t t t t a g a g a t c g t c a c c g c g g g c t t g t c c c t g a a c t t t c

720 HIV-1 Ψ c g a a a g g g a a a c c a g a g g a g c t c t c t c g a c g c a g g a c t c g g c t t g c t g a a g c g c g c a c g g g c t t t c c c t t t g g t c t c c t c g a g a g a g c t g c g t c c t g a g c c g a a c g a c t t c g c g c g t g c c

780 HIV-1 Ψ c a a g a g g c g a g g g g c g g c g a c t g g t g a g t a c g c c a a a a a t t t t g a c t a g c g g a g g c t a g a g t t c t c c g c t c c c c g c c g c t g a c c a c t c a t g c g g t t t t t a a a a c t g a t c g c c t c c g a t c t

NruI* 840 HIV-1 Ψ a g g a g a g a g a t g g g t g c g a g a g c g t c a g t a t t a a g c g g g g g a g a a t t a g a t c g c g a t g g g t c c t c t c t c t a c c c a c g c t c t c g c a g t c a t a a t t c g c c c c c t c t t a a t c t a g c g c t a c c c

900 a a a a a a t t c g g t t a a g g c c a g g g g g a a a g a a a a a a t a t a a a t t a a a a c a t a t a g t a t g g g t t t t t t a a g c c a a t t c c g g t c c c c c t t t c t t t t t t a t a t t t a a t t t t g t a t a t c a t a c c c

960 c a a g c a g g g a g c t a g a a c g a t t c g c a g t t a a t c c t g g c c t g t t a g a a a c a t c a g a a g g c t g t t c g t c c c t c g a t c t t g c t a a g c g t c a a t t a g g a c c g g a c a a t c t t t g t a g t c t t c c g a

1020 *I g t a g a c a a a t a c t g g g a c a g c t a c a a c c a t c c c t t c a g a c a g g a t c a g a a g a a c t t a g a t c a t c t g t t t a t g a c c c t g t c g a t g t t g g t a g g g a a g t c t g t c c t a g t c t t c t t g a a t c t a

1080 MIYYLLLGRNHADFSLSLLC c a t t a t a t a a t a c a g t a g c a a c c c t c t a t t g t g t g c a t c a a a g g a t a g a g a t a a a a g a c a g t a a t a t a t t a t g t c a t c g t t g g g a g a t a a c a c a c g t a g t t t c c t a t c t c t a t t t t c t g t

1140 WPLKLCSLPLAFCFYSWRVA c c a a g g a a g c t t t a g a c a a g a t a g a g g a a g a g c a a a a c a a a a g t a a g a c c a c c g c a c a g c g g t t c c t t c g a a a t c t g t t c t a t c t c c t t c t c g t t t t g t t t t c a t t c t g g t g g c g t g t c g

1200 MRDNWRS LPRGSIKLGPPPSILSLQLL a a g c g g c c g g c c g c t g a t c t t c a g a c c t g g a g g a g g a g a t a t g a g g g a c a a t t g g a g a a g t t c g c c g g c c g g c g a c t a g a a g t c t g g a c c t c c t c c t c t a t a c t c c c t g t t a a c c t c t t c

1260 ELYKYKVVKIEPLGVAPTKA SNYLYLTTFISGNPTAGVLA t g a a t t a t a t a a a t a t a a a g t a g t a a a a a t t g a a c c a t t a g g a g t a g c a c c c a c c a a g g c a c t t a a t a t a t t t a t a t t t c a t c a t t t t t a a c t t g g t a a t c c t c a t c g t g g g t g g t t c c g

1320 RRE KRRVVQREKRAVGIGALFLG FLLTTCLSFLATPIPAKNRP a a a g a g a a g a g t g g t g c a g a g a g a a a a a a g a g c a g t g g g a a t a g g a g c t t t g t t c c t t g g t t t c t c t t c t c a c c a c g t c t c t c t t t t t t c t c g t c a c c c t t a t c c t c g a a a c a a g g a a c c

1380 RRE FLGAAGSTMGAASMTLTVQA NKPAAPLVIPAADIVSVTCA g t t c t t g g g a g c a g c a g g a a g c a c t a t g g g c g c a g c g t c a a t g a c g c t g a c g g t a c a g g c c a a g a a c c c t c g t c g t c c t t c g t g a t a c c c g c g t c g c a g t t a c t g c g a c t g c c a t g t c c g

1440 RRE RQLLSGIVQQQNNLLRAIEA LCNNDPITCCCFLKSLAISA c a g a c a a t t a t t g t c t g g t a t a g t g c a g c a g c a g a a c a a t t t g c t g a g g g c t a t t g a g g c g t c t g t t a a t a a c a g a c c a t a t c a c g t c g t c g t c t t g t t a a a c g a c t c c c g a t a a c t c c g

1500 RRE QQHLLQLTVWGIKQLQARIL CCCRNCSVTQPM g c a a c a g c a t c t g t t g c a a c t c a c a g t c t g g g g c a t c a a g c a g c t c c a g g c a a g a a t c c t c g t t g t c g t a g a c a a c g t t g a g t g t c a g a c c c c g t a g t t c g t c g a g g t c c g t t c t t a g g a

1560 RRE AVERYLKDQQLLGIWGCSGK g g c t g t g g a a a g a t a c c t a a a g g a t c a a c a g c t c c t g g g g a t t t g g g g t t g c t c t g g a a a c c g a c a c c t t t c t a t g g a t t t c c t a g t t g t c g a g g a c c c c t a a a c c c c a a c g a g a c c t t t

1620 LICTTAVPWNASWSNKSLEQ a c t c a t t t g c a c c a c t g c t g t g c c t t g g a a t g c t a g t t g g a g t a a t a a a t c t c t g g a a c a t g a g t a a a c g t g g t g a c g a c a c g g a a c c t t a c g a t c a a c c t c a t t a t t t a g a g a c c t t g t

1680 IWNHTTWMEWDREINNYTSL g a t t t g g a a t c a c a c g a c c t g g a t g g a g t g g g a c a g a g a a a t t a a c a a t t a c a c a a g c t t c t a a a c c t t a g t g t g c t g g a c c t a c c t c a c c c t g t c t c t t t a a t t g t t a a t g t g t t c g a a

1740 1 5 KNEQELL gp41 peptide IHSLIEESQNQQEKNEQELL a a t a c a c t c c t t a a t t g a a g a a t c g c a a a a c c a g c a a g a a a a g a a t g a a c a a g a a t t a t t t t a t g t g a g g a a t t a a c t t c t t a g c g t t t t g g t c g t t c t t t t c t t a c t t g t t c t t a a t a a

1800 10 15 ELDKWASL (in frame with gp41 peptide) WNWFNITNWLWY gp41 peptide ELDKWASLWNWFNITNWLWY g g a a t t a g a t a a a t g g g c a a g t t t g t g g a a t t g g t t t a a c a t a a c a a a t t g g c t g t g g t a c c t t a a t c t a t t t a c c c g t t c a a a c a c c t t a a c c a a a t t g t a t t g t t t a a c c g a c a c c a t

1860 (in frame with gp41 peptide) IKLFIMIVGGLVGLRIVFAV IKLFIMIVGGLVGLRIVFAV t a t a a a a t t a t t c a t a a t g a t a g t a g g a g g c t t g g t a g g t t t a a g a a t a g t t t t t g c t g t a t a t t t t a a t a a g t a t t a c t a t c a t c c t c c g a a c c a t c c a a a t t c t t a t c a a a a a c g a c a

1920 (in frame with gp41 peptide) LSIVNRVRQGYSPLSFQTHL LSIVNRVRQGYSPLSFQTHL a c t t t c t a t a g t g a a t a g a g t t a g g c a g g g a t a t t c a c c a t t a t c g t t t c a g a c c c a c c t t g a a a g a t a t c a c t t a t c t c a a t c c g t c c c t a t a a g t g g t a a t a g c a a a g t c t g g g t g g a

KflI 1980 (in frame with gp41 peptide) PTPRGPDRPEGIEEEGGERD PTPRGPDRPEGIEEEGGERD c c c a a c c c c g a g g g g a c c c g a c a g g c c c g a a g g a a t a g a a g a a g a a g g t g g a g a g a g a g a g g g t t g g g g c t c c c c t g g g c t g t c c g g g c t t c c t t a t c t t c t t c t t c c a c c t c t c t c t c t

2040 (in frame with gp41 peptide) RDRSIRLVNGSRRYRRIHKW RDRSIRLVNGSRRYRRIHKW c a g a g a c a g a t c c a t t c g a t t a g t g a a c g g a t c t c g a c g g t a t c g c c g a a t t c a c a a a t g g t c t c t g t c t a g g t a a g c t a a t c a c t t g c c t a g a g c t g c c a t a g c g g c t t a a g t g t t t a c

2100 (in frame with gp41 peptide) QYSSTILKEKGGLGGTVQGK cPPT/CTS QYSSTILKEKGGLGGTVQGK g c a g t a t t c a t c c a c a a t t t t a a a a g a a a a g g g g g g a t t g g g g g g t a c a g t g c a g g g g a a c g t c a t a a g t a g g t g t t a a a a t t t t c t t t t c c c c c c t a a c c c c c c a t g t c a c g t c c c c t t

2160 E* cPPT/CTS E* a g a a t a g t a g a c a t a a t a g c a a c a g a c a t a c a a a c t a a a g a a t t a c a a a a a c a a a t t a c a t c t t a t c a t c t g t a t t a t c g t t g t c t g t a t g t t t g a t t t c t t a a t g t t t t t g t t t a a t g t

SpeI 2220 cPPT/CTS a a a a t t c a a a a t t t t c g g g t t t a t t a c a g g g a c a g c a g a g a t c c a g t t t g g a c t a g t c g t t t t t a a g t t t t a a a a g c c c a a a t a a t g t c c c t g t c g t c t c t a g g t c a a a c c t g a t c a g c a

2280 EF-1 α promoter g a g g c t c c g g t g c c c g t c a g t g g g c a g a g c g c a c a t c g c c c a c a g t c c c c g a g a a g t t g g c t c c g a g g c c a c g g g c a g t c a c c c g t c t c g c g t g t a g c g g g t g t c a g g g g c t c t t c a a c c

2340 EF-1 α promoter g g g g a g g g g t c g g c a a t t g a a c c g g t g c c t a g a g a a g g t g g c g c g g g g t a a a c t g g g a a a c c c c t c c c c a g c c g t t a a c t t g g c c a c g g a t c t c t t c c a c c g c g c c c c a t t t g a c c c t t t

2400 EF-1 α promoter g t g a t g t c g t g t a c t g g c t c c g c c t t t t t c c c g a g g g t g g g g g a g a a c c g t a t a t a a g t g c a c t a c a g c a c a t g a c c g a g g c g g a a a a a g g g c t c c c a c c c c c t c t t g g c a t a t a t t c a c

2460 EF-1 α promoter EF-1 α intron A c a g t a g t c g c c g t g a a c g t t c t t t t t c g c a a c g g g t t t g c c g c c a g a a c a c a g g t a a g t g g t c a t c a g c g g c a c t t g c a a g a a a a a g c g t t g c c c a a a c g g c g g t c t t g t g t c c a t t c a c

2520 EF-1 α promoter EF-1 α intron A c c g t g t g t g g t t c c c g c g g g c c t g g c c t c t t t a c g g g t t a t g g c c c t t g c g t g c c t t g a a g g c a c a c a c c a a g g g c g c c c g g a c c g g a g a a a t g c c c a a t a c c g g g a a c g c a c g g a a c t t

2580 EF-1 α promoter EF-1 α intron A *KWRAATRSEQDRAEPQFHT t t a c t t c c a c c t g g c t g c a g t a c g t g a t t c t t g a t c c c g a g c t t c g g g t t g g a a g t g g g t a a t g a a g g t g g a c c g a c g t c a t g c a c t a a g a a c t a g g g c t c g a a g c c c a a c c t t c a c c c a

2640 EF-1 α promoter EF-1 α intron A PSNSAKRKLLGKAEHKLQPR g g g a g a g t t c g a g g c c t t g c g c t t a a g g a g c c c c t t c g c c t c g t g c t t g a g t t g a g g c c t c c c t c t c a a g c t c c g g a a c g c g a a t t c c t c g g g g a a g c g g a g c a c g a a c t c a a c t c c g g a

2700 EF-1 α promoter EF-1 α intron A AQASPGGRAFRTAGERRDRQ g g c c t g g g c g c t g g g g c c g c c g c g t g c g a a t c t g g t g g c a c c t t c g c g c c t g t c t c g c t g c c g g a c c c g c g a c c c c g g c g g c g c a c g c t t a g a c c a c c g t g g a a g c g c g g a c a g a g c g a c

2760 EF-1 α promoter EF-1 α intron A KRYTELWKFNKIVQQSAKKR c t t t c g a t a a g t c t c t a g c c a t t t a a a a t t t t t g a t g a c c t g c t g c g a c g c t t t t t t t c t g a a a g c t a t t c a g a g a t c g g t a a a t t t t a a a a a c t a c t g g a c g a c g c t g c g a a a a a a a g a

2820 EF-1 α promoter EF-1 α intron A ALYDQLHPGLDACQYKPKQP g g c a a g a t a g t c t t g t a a a t g c g g g c c a a g a t c t g c a c a c t g g t a t t t c g g t t t t t g g g g c c g t t c t a t c a g a a c a t t t a c g c c c g g t t c t a g a c g t g t g a c c a t a a a g c c a a a a a c c c c

2880 EF-1 α promoter EF-1 α intron A RPRRRPGHTGACMNPSAPGA c c g c g g g c g g c g a c g g g g c c c g t g c g t c c c a g c g c a c a t g t t c g g c g a g g c g g g g c c t g c g g c g c c c g c c g c t g c c c c g g g c a c g c a g g g t c g c g t g t a c a a g c c g c t c c g c c c c g g a c g

2940 EF-1 α promoter EF-1 α intron A LAAVSFRVPTTELQGAQEPA g a g c g c g g c c a c c g a g a a t c g g a c g g g g g t a g t c t c a a g c t g g c c g g c c t g c t c t g g t g c c t c g c g c c g g t g g c t c t t a g c c t g c c c c c a t c a g a g t t c g a c c g g c c g g a c g a g a c c a c g

3000 EF-1 α promoter EF-1 α intron A QGRAATYRGARPPLAPGTPV c t g g c c t c g c g c c g c c g t g t a t c g c c c c g c c c t g g g c g g c a a g g c t g g c c c g g t c g g c a c g a c c g g a g c g c g g c g g c a c a t a g c g g g g c g g g a c c c g c c g t t c c g a c c g g g c c a g c c g t g

3060 EF-1 α promoter EF-1 α intron A LQTLPFIAAERGQQLSSLIS c a g t t g c g t g a g c g g a a a g a t g g c c g c t t c c c g g c c c t g c t g c a g g g a g c t c a a a a t g g a g t c a a c g c a c t c g c c t t t c t a c c g g c g a a g g g c c g g g a c g a c g t c c c t c g a g t t t t a c c t

3120 EF-1 α promoter EF-1 α intron A SAASPLAPPHTVWVFSFPRE g g a c g c g g c g c t c g g g a g a g c g g g c g g g t g a g t c a c c c a c a c a a a g g a a a a g g g c c t t t c c c t g c g c c g c g a g c c c t c t c g c c c g c c c a c t c a g t g g g t g t g t t t c c t t t t c c c g g a a a g

3180 EF-1 α promoter EF-1 α intron A TRLRRKM c g t c c t c a g c c g t c g c t t c a t g t g a c t c c a c g g a g t a c c g g g c g c c g t c c a g g c a c c t c g g c a g g a g t c g g c a g c g a a g t a c a c t g a g g t g c c t c a t g g c c c g c g g c a g g t c c g t g g a g c

3240 EF-1 α promoter EF-1 α intron A a t t a g t t c t c g a g c t t t t g g a g t a c g t c g t c t t t a g g t t g g g g g g a g g g g t t t t a t g c g a t a a t c a a g a g c t c g a a a a c c t c a t g c a g c a g a a a t c c a a c c c c c c t c c c c a a a a t a c g c t

3300 EF-1 α promoter EF-1 α intron A t g g a g t t t c c c c a c a c t g a g t g g g t g g a g a c t g a a g t t a g g c c a g c t t g g c a c t t g a t g t a c c t c a a a g g g g t g t g a c t c a c c c a c c t c t g a c t t c a a t c c g g t c g a a c c g t g a a c t a c a

3360 EF-1 α promoter EF-1 α intron A T C A A G C C T C A G A EF1a-F a a t t c t c c t t g g a a t t t g c c c t t t t t g a g t t t g g a t c t t g g t t c a t t c t c a a g c c t c a g a t t a a g a g g a a c c t t a a a c g g g a a a a a c t c a a a c c t a g a a c c a a g t a a g a g t t c g g a g t c t

3420 EF-1 α promoter EF-1 α intron A C A G T G G T T C EF1a-F c a g t g g t t c a a a g t t t t t t t c t t c c a t t t c a g g t g t c g t g a a g c g g c c c t g c a g a t a t c a g t c a c c a a g t t t c a a a a a a a g a a g g t a a a g t c c a c a g c a c t t c g c c g g g a c g t c t a t a g t

3480 1 5 10 signal sequence MRPSGTAGAAL attB1 EGFR MRPSGTAGAAL a c a a g t t t g t a c a a a a a a g c a g g c a c c a t g c g a c c c t c c g g g a c g g c c g g g g c a g c g c t c t g t t c a a a c a t g t t t t t t c g t c c g t g g t a c g c t g g g a g g c c c t g c c g g c c c c g t c g c g a g

3540 15 20 signal sequence LALLAALCPASRA 25 30 extracellular domain LEEKKVC EGFR LALLAALCPASRALEEKKVC c t g g c g c t g c t g g c t g c g c t c t g c c c g g c g a g t c g g g c t c t g g a g g a a a a g a a a g t t t g c g a c c g c g a c g a c c g a c g c g a g a c g g g c c g c t c a g c c c g a g a c c t c c t t t t c t t t c a a a c g

3600 35 40 45 50 extracellular domain QGTSNKLTQLGTFEDHFLSL EGFR QGTSNKLTQLGTFEDHFLSL c a a g g c a c g a g t a a c a a g c t c a c g c a g t t g g g c a c t t t t g a a g a t c a t t t t c t c a g c c t c g t t c c g t g c t c a t t g t t c g a g t g c g t c a a c c c g t g a a a a c t t c t a g t a a a a g a g t c g g a g

3660 55 60 65 70 extracellular domain QRMFNNCEVVLGNLEITYVQ EGFR QRMFNNCEVVLGNLEITYVQ c a g a g g a t g t t c a a t a a c t g t g a g g t g g t c c t t g g g a a t t t g g a a a t t a c c t a t g t g c a g g t c t c c t a c a a g t t a t t g a c a c t c c a c c a g g a a c c c t t a a a c c t t t a a t g g a t a c a c g t c

3720 75 80 85 90 extracellular domain RNYDLSFLKTIQEVAGYVLI EGFR RNYDLSFLKTIQEVAGYVLI a g g a a t t a t g a t c t t t c c t t c t t a a a g a c c a t c c a g g a g g t g g c t g g t t a t g t c c t c a t t t c c t t a a t a c t a g a a a g g a a g a a t t t c t g g t a g g t c c t c c a c c g a c c a a t a c a g g a g t a a

3780 95 100 105 110 extracellular domain ALNTVERIPLENLQIIRGNM EGFR ALNTVERIPLENLQIIRGNM g c c c t c a a c a c a g t g g a g c g a a t t c c t t t g g a a a a c c t g c a g a t c a t c a g a g g a a a t a t g c g g g a g t t g t g t c a c c t c g c t t a a g g a a a c c t t t t g g a c g t c t a g t a g t c t c c t t t a t a c

3840 115 120 125 130 extracellular domain YYENSYALAVLSNYDANKTG EGFR YYENSYALAVLSNYDANKTG t a c t a c g a a a a t t c c t a t g c c t t a g c a g t c t t a t c t a a c t a t g a t g c a a a t a a a a c c g g a a t g a t g c t t t t a a g g a t a c g g a a t c g t c a g a a t a g a t t g a t a c t a c g t t t a t t t t g g c c t

3900 135 140 145 150 extracellular domain LKELPMRNLQEILHGAVRFS EGFR LKELPMRNLQEILHGAVRFS c t g a a g g a g c t g c c c a t g a g a a a t t t a c a g g a a a t c c t g c a t g g c g c c g t g c g g t t c a g c g a c t t c c t c g a c g g g t a c t c t t t a a a t g t c c t t t a g g a c g t a c c g c g g c a c g c c a a g t c g

3960 155 160 165 170 extracellular domain NNPALCNVESIQWRDIVSSD EGFR NNPALCNVESIQWRDIVSSD a a c a a c c c t g c c c t g t g c a a c g t g g a g a g c a t c c a g t g g c g g g a c a t a g t c a g c a g t g a c t t g t t g g g a c g g g a c a c g t t g c a c c t c t c g t a g g t c a c c g c c c t g t a t c a g t c g t c a c t g

4020 175 180 185 190 extracellular domain FLSNMSMDFQNHLGSCQKCD EGFR FLSNMSMDFQNHLGSCQKCD t t t c t c a g c a a c a t g t c g a t g g a c t t c c a g a a c c a c c t g g g c a g c t g c c a a a a g t g t g a t a a a g a g t c g t t g t a c a g c t a c c t g a a g g t c t t g g t g g a c c c g t c g a c g g t t t t c a c a c t a

4080 195 200 205 210 extracellular domain PSCPNGSCWGAGEENCQKLT EGFR PSCPNGSCWGAGEENCQKLT c c a a g c t g t c c c a a t g g g a g c t g c t g g g g t g c a g g a g a g g a g a a c t g c c a g a a a c t g a c c g g t t c g a c a g g g t t a c c c t c g a c g a c c c c a c g t c c t c t c c t c t t g a c g g t c t t t g a c t g g

4140 215 220 225 230 extracellular domain KIICAQQCSGRCRGKSPSDC EGFR KIICAQQCSGRCRGKSPSDC a a a a t c a t c t g t g c c c a g c a g t g c t c c g g g c g c t g c c g t g g c a a g t c c c c c a g t g a c t g c t t t t a g t a g a c a c g g g t c g t c a c g a g g c c c g c g a c g g c a c c g t t c a g g g g g t c a c t g a c g

4200 235 240 245 250 extracellular domain CHNQCAAGCTGPRESDCLVC EGFR CHNQCAAGCTGPRESDCLVC t g c c a c a a c c a g t g t g c t g c a g g c t g c a c a g g c c c c c g g g a g a g c g a c t g c c t g g t c t g c a c g g t g t t g g t c a c a c g a c g t c c g a c g t g t c c g g g g g c c c t c t c g c t g a c g g a c c a g a c g

4260 255 260 265 270 extracellular domain RKFRDEATCKDTCPPLMLYN EGFR RKFRDEATCKDTCPPLMLYN c g c a a a t t c c g a g a c g a a g c c a c g t g c a a g g a c a c c t g c c c c c c a c t c a t g c t c t a c a a c g c g t t t a a g g c t c t g c t t c g g t g c a c g t t c c t g t g g a c g g g g g g t g a g t a c g a g a t g t t g

4320 275 280 285 290 extracellular domain PTTYQMDVNPEGKYSFGATC EGFR PTTYQMDVNPEGKYSFGATC c c c a c c a c g t a c c a g a t g g a t g t g a a c c c c g a g g g c a a a t a c a g c t t t g g t g c c a c c t g c g g g t g g t g c a t g g t c t a c c t a c a c t t g g g g c t c c c g t t t a t g t c g a a a c c a c g g t g g a c g

4380 295 300 305 310 extracellular domain VKKCPRNYVVTDHGSCVRAC EGFR VKKCPRNYVVTDHGSCVRAC g t g a a g a a g t g t c c c c g t a a t t a t g t g g t g a c a g a t c a c g g c t c g t g c g t c c g a g c c t g t c a c t t c t t c a c a g g g g c a t t a a t a c a c c a c t g t c t a g t g c c g a g c a c g c a g g c t c g g a c a

4440 315 320 325 330 extracellular domain GADSYEMEEDGVRKCKKCEG EGFR GADSYEMEEDGVRKCKKCEG g g g g c c g a c a g c t a t g a g a t g g a g g a a g a c g g c g t c c g c a a g t g t a a g a a g t g c g a a g g g c c c c g g c t g t c g a t a c t c t a c c t c c t t c t g c c g c a g g c g t t c a c a t t c t t c a c g c t t c c c

4500 335 340 345 350 extracellular domain PCRKVCNGIGIGEFKDSLSI EGFR PCRKVCNGIGIGEFKDSLSI c c t t g c c g c a a a g t g t g t a a c g g a a t a g g t a t t g g t g a a t t t a a a g a c t c a c t c t c c a t a g g a a c g g c g t t t c a c a c a t t g c c t t a t c c a t a a c c a c t t a a a t t t c t g a g t g a g a g g t a t

4560 355 360 365 370 extracellular domain NATNIKHFKNCTSISGDLHI EGFR NATNIKHFKNCTSISGDLHI a a t g c t a c g a a t a t t a a a c a c t t c a a a a a c t g c a c c t c c a t c a g t g g c g a t c t c c a c a t c t t a c g a t g c t t a t a a t t t g t g a a g t t t t t g a c g t g g a g g t a g t c a c c g c t a g a g g t g t a g

4620 375 380 385 390 extracellular domain LPVAFRGDSFTHTPPLDPQE EGFR LPVAFRGDSFTHTPPLDPQE c t g c c g g t g g c a t t t a g g g g t g a c t c c t t c a c a c a t a c t c c t c c t c t g g a t c c a c a g g a a g a c g g c c a c c g t a a a t c c c c a c t g a g g a a g t g t g t a t g a g g a g g a g a c c t a g g t g t c c t t

4680 395 400 405 410 extracellular domain LDILKTVKEITGFLLIQAWP EGFR LDILKTVKEITGFLLIQAWP c t g g a t a t t c t g a a a a c c g t a a a g g a a a t c a c a g g g t t t t t g c t g a t t c a g g c t t g g c c t g a c c t a t a a g a c t t t t g g c a t t t c c t t t a g t g t c c c a a a a a c g a c t a a g t c c g a a c c g g a

4740 415 420 425 430 extracellular domain ENRTDLHAFENLEIIRGRTK EGFR ENRTDLHAFENLEIIRGRTK g a a a a c a g g a c g g a c c t c c a t g c c t t t g a g a a c c t a g a a a t c a t a c g c g g c a g g a c c a a g c t t t t g t c c t g c c t g g a g g t a c g g a a a c t c t t g g a t c t t t a g t a t g c g c c g t c c t g g t t c

4800 435 440 445 450 extracellular domain QHGQFSLAVVSLNITSLGLR EGFR QHGQFSLAVVSLNITSLGLR c a a c a t g g t c a g t t t t c t c t t g c a g t c g t c a g c c t g a a c a t a a c a t c c t t g g g a t t a c g c g t t g t a c c a g t c a a a a g a g a a c g t c a g c a g t c g g a c t t g t a t t g t a g g a a c c c t a a t g c g

4860 455 460 465 470 extracellular domain SLKEISDGDVIISGNKNLCY EGFR SLKEISDGDVIISGNKNLCY *SVFIQAI t c c c t c a a g g a g a t a a g t g a t g g a g a t g t g a t a a t t t c a g g a a a c a a a a a t t t g t g c t a t a g g g a g t t c c t c t a t t c a c t a c c t c t a c a c t a t t a a a g t c c t t t g t t t t t a a a c a c g a t a

4920 475 480 485 490 extracellular domain ANTINWKKLFGTSGQKTKII EGFR ANTINWKKLFGTSGQKTKII CICYVPFFQKPGGTLFGFNY g c a a a t a c a a t a a a c t g g a a a a a a c t g t t t g g g a c c t c c g g t c a g a a a a c c a a a a t t a t a c g t t t a t g t t a t t t g a c c t t t t t t g a c a a a c c c t g g a g g c c a g t c t t t t g g t t t t a a t a t

4980 495 500 505 510 extracellular domain SNRGENSCKATGQVCHALCS EGFR SNRGENSCKATGQVCHALCS AVSTFVAALGCALDAMGQAG a g c a a c a g a g g t g a a a a c a g c t g c a a g g c c a c a g g c c a g g t c t g c c a t g c c t t g t g c t c c t c g t t g t c t c c a c t t t t g t c g a c g t t c c g g t g t c c g g t c c a g a c g g t a c g g a a c a c g a g g

5040 515 520 525 530 extracellular domain PEGCWGPEPRDCVSCRNVSR EGFR PEGCWGPEPRDCVSCRNVSR GLAAPARLGPVADRAPIDAS c c c g a g g g c t g c t g g g g c c c g g a g c c c a g g g a c t g c g t c t c t t g c c g g a a t g t c a g c c g a g g g c t c c c g a c g a c c c c g g g c c t c g g g t c c c t g a c g c a g a g a a c g g c c t t a c a g t c g g c t

5100 535 540 545 550 extracellular domain GRECVDKCNLLEGEPREFVE EGFR GRECVDKCNLLEGEPREFVE APFAHVLAVKQLTLWPLKHL g g c a g g g a a t g c g t g g a c a a g t g c a a c c t t c t g g a g g g t g a g c c a a g g g a g t t t g t g g a g c c g t c c c t t a c g c a c c t g t t c a c g t t g g a a g a c c t c c c a c t c g g t t c c c t c a a a c a c c t c

5160 555 560 565 570 extracellular domain NSECIQCHPECLPQAMNITC EGFR NSECIQCHPECLPQAMNITC VRLAYLAVWLAQRLGHVDGA a a c t c t g a g t g c a t a c a g t g c c a c c c a g a g t g c c t g c c t c a g g c c a t g a a c a t c a c c t g c t t g a g a c t c a c g t a t g t c a c g g t g g g t c t c a c g g a c g g a g t c c g g t a c t t g t a g t g g a c g

5220 575 580 585 590 extracellular domain TGRGPDNCIQCAHYIDGPHC EGFR TGRGPDNCIQCAHYIDGPHC CSPSWVVTDLTGVVNVAGVA a c a g g a c g g g g a c c a g a c a a c t g t a t c c a g t g t g c c c a c t a c a t t g a c g g c c c c c a c t g c t g t c c t g c c c c t g g t c t g t t g a c a t a g g t c a c a c g g g t g a t g t a a c t g c c g g g g g t g a c g

5280 595 600 605 610 extracellular domain VKTCSAGVMGENNTLVWKYA EGFR VKTCSAGVMGENNTLVWKYA DLGARCSDHSFVVGQDPLVC g t c a a g a c c t g c t c g g c a g g a g t c a t g g g a g a a a a c a a c a c c c t g g t c t g g a a g t a c g c a c a g t t c t g g a c g a g c c g t c c t c a g t a c c c t c t t t t g t t g t g g g a c c a g a c c t t c a t g c g t

5340 615 620 625 630 extracellular domain DAGHVCHLCHPNCTYGCTGP EGFR DAGHVCHLCHPNCTYGCTGP VGAMHAVQAM g a c g c c g g c c a t g t g t g c c a c c t g t g c c a t c c a a a c t g c a c c t a c g g a t g c a c t g g g c c a c t g c g g c c g g t a c a c a c g g t g g a c a c g g t a g g t t t g a c g t g g a t g c c t a c g t g a c c c g g t

5400 635 640 645 extracellular domain GLEGCPTNGPKIPS 650 transmembrane region IATGMV EGFR GLEGCPTNGPKIPSIATGMV g g t c t t g a a g g c t g t c c a a c g a a t g g g c c t a a g a t c c c g t c c a t c g c c a c t g g g a t g g t g c c a g a a c t t c c g a c a g g t t g c t t a c c c g g a t t c t a g g g c a g g t a g c g g t g a c c c t a c c a c

5460 655 660 665 transmembrane region GALLLLLVVALGIGLFM 670 cytoplasmic do... RRR EGFR GALLLLLVVALGIGLFMRRR g g g g c c c t c c t c t t g c t g c t g g t g g t g g c c c t g g g g a t c g g c c t c t t c a t g c g a a g g c g c c c c c g g g a g g a g a a c g a c g a c c a c c a c c g g g a c c c c t a g c c g g a g a a g t a c g c t t c c g c g

5520 675 680 685 690 cytoplasmic domain HIVRKRTLRRLLQERELVEP EGFR HIVRKRTLRRLLQERELVEP c a c a t c g t t c g g a a g c g c a c g c t g c g g a g g c t g c t g c a g g a g a g g g a g c t t g t g g a g c c t g t g t a g c a a g c c t t c g c g t g c g a c g c c t c c g a c g a c g t c c t c t c c c t c g a a c a c c t c g g a

5580 695 700 705 710 cytoplasmic domain LTPSGEAPNQALLRILKETE EGFR LTPSGEAPNQALLRILKETE c t t a c a c c c a g t g g a g a a g c t c c c a a c c a a g c t c t c t t g a g g a t c t t g a a g g a a a c t g a a g a a t g t g g g t c a c c t c t t c g a g g g t t g g t t c g a g a g a a c t c c t a g a a c t t c c t t t g a c t t

5640 715 720 725 730 cytoplasmic domain FKKIKVLGSGAFGTVYKGLW EGFR FKKIKVLGSGAFGTVYKGLW t t c a a a a a g a t c a a a g t g c t g g g c t c c g g t g c g t t c g g c a c g g t g t a t a a g g g a c t c t g g a a g t t t t t c t a g t t t c a c g a c c c g a g g c c a c g c a a g c c g t g c c a c a t a t t c c c t g a g a c c

5700 735 740 745 750 cytoplasmic domain IPEGEKVKIPVAIKELREAT EGFR IPEGEKVKIPVAIKELREAT a t c c c a g a a g g t g a g a a a g t t a a a a t t c c c g t c g c t a t c a a g g a a t t a a g a g a a g c a a c a t a g g g t c t t c c a c t c t t t c a a t t t t a a g g g c a g c g a t a g t t c c t t a a t t c t c t t c g t t g t

5760 755 760 765 770 cytoplasmic domain SPKANKEILDEAYVMASVDN EGFR SPKANKEILDEAYVMASVDN t c t c c g a a a g c c a a c a a g g a a a t c c t c g a t g a a g c c t a c g t g a t g g c c a g c g t g g a c a a c a g a g g c t t t c g g t t g t t c c t t t a g g a g c t a c t t c g g a t g c a c t a c c g g t c g c a c c t g t t g

5820 775 780 785 790 cytoplasmic domain PHVCRLLGICLTSTVQLITQ EGFR PHVCRLLGICLTSTVQLITQ c c c c a c g t g t g c c g c c t g c t g g g c a t c t g c c t c a c c t c c a c c g t g c a g c t c a t c a c g c a g g g g g t g c a c a c g g c g g a c g a c c c g t a g a c g g a g t g g a g g t g g c a c g t c g a g t a g t g c g t c

5880 795 800 805 810 cytoplasmic domain LMPFGCLLDYVREHKDNIGS EGFR LMPFGCLLDYVREHKDNIGS c t c a t g c c c t t c g g c t g c c t c c t g g a c t a t g t c c g g g a a c a c a a a g a c a a t a t t g g c t c c g a g t a c g g g a a g c c g a c g g a g g a c c t g a t a c a g g c c c t t g t g t t t c t g t t a t a a c c g a g g

5940 815 820 825 830 cytoplasmic domain QYLLNWCVQIAKGMNYLEDR EGFR QYLLNWCVQIAKGMNYLEDR c a g t a c c t g c t c a a c t g g t g t g t g c a g a t c g c a a a g g g c a t g a a c t a c t t g g a g g a c c g t g t c a t g g a c g a g t t g a c c a c a c a c g t c t a g c g t t t c c c g t a c t t g a t g a a c c t c c t g g c a

6000 835 840 845 850 cytoplasmic domain RLVHRDLAARNVLVKTPQHV EGFR RLVHRDLAARNVLVKTPQHV c g c t t g g t g c a c c g c g a c c t g g c a g c c a g g a a c g t a c t g g t g a a a a c a c c g c a g c a t g t c g c g a a c c a c g t g g c g c t g g a c c g t c g g t c c t t g c a t g a c c a c t t t t g t g g c g t c g t a c a g

6060 855 860 865 870 cytoplasmic domain KITDFGLAKLLGAEEKEYHA EGFR KITDFGLAKLLGAEEKEYHA a a g a t c a c a g a t t t t g g g c t g g c c a a a c t g c t g g g t g c g g a a g a g a a a g a a t a c c a t g c a t t c t a g t g t c t a a a a c c c g a c c g g t t t g a c g a c c c a c g c c t t c t c t t t c t t a t g g t a c g t

6120 875 880 885 890 cytoplasmic domain EGGKVPIKWMALESILHRIY EGFR EGGKVPIKWMALESILHRIY g a a g g a g g c a a a g t g c c t a t c a a g t g g a t g g c a t t g g a a t c a a t t t t a c a c a g a a t c t a t c t t c c t c c g t t t c a c g g a t a g t t c a c c t a c c g t a a c c t t a g t t a a a a t g t g t c t t a g a t a

6180 895 900 905 910 cytoplasmic domain THQSDVWSYGVTVWELMTFG EGFR THQSDVWSYGVTVWELMTFG a c c c a c c a g a g t g a t g t c t g g a g c t a c g g g g t g a c c g t t t g g g a g t t g a t g a c c t t t g g a t g g g t g g t c t c a c t a c a g a c c t c g a t g c c c c a c t g g c a a a c c c t c a a c t a c t g g a a a c c t

6240 915 920 925 930 cytoplasmic domain SKPYDGIPASEISSILEKGE EGFR SKPYDGIPASEISSILEKGE t c c a a g c c a t a t g a c g g a a t c c c t g c c a g c g a g a t c t c c t c c a t c c t g g a g a a a g g a g a a a g g t t c g g t a t a c t g c c t t a g g g a c g g t c g c t c t a g a g g a g g t a g g a c c t c t t t c c t c t t

6300 935 940 945 950 cytoplasmic domain RLPQPPICTIDVYMIMVKCW EGFR RLPQPPICTIDVYMIMVKCW c g c c t c c c t c a g c c a c c c a t a t g t a c c a t c g a t g t c t a c a t g a t c a t g g t c a a g t g c t g g g c g g a g g g a g t c g g t g g g t a t a c a t g g t a g c t a c a g a t g t a c t a g t a c c a g t t c a c g a c c

6360 955 960 965 970 cytoplasmic domain MIDADSRPKFRELIIEFSKM EGFR MIDADSRPKFRELIIEFSKM a t g a t a g a c g c a g a t a g t c g c c c a a a g t t c c g t g a g t t g a t c a t c g a a t t c t c c a a a a t g t a c t a t c t g c g t c t a t c a g c g g g t t t c a a g g c a c t c a a c t a g t a g c t t a a g a g g t t t t a c

NsiI 6420 975 980 985 990 cytoplasmic domain ARDPQRYLVIQGDERMHLPS EGFR ARDPQRYLVIQGDERMHLPS g c c c g a g a c c c c c a g c g c t a c c t t g t c a t t c a g g g g g a t g a a a g a a t g c a t t t g c c a a g t c g g g c t c t g g g g g t c g c g a t g g a a c a g t a a g t c c c c c t a c t t t c t t a c g t a a a c g g t t c a

6480 995 1000 1005 1010 cytoplasmic domain PTDSNFYRALMDEEDMDDVV EGFR PTDSNFYRALMDEEDMDDVV c c t a c a g a c t c c a a c t t c t a c c g t g c c c t g a t g g a t g a a g a a g a c a t g g a c g a c g t g g t g g g a t g t c t g a g g t t g a a g a t g g c a c g g g a c t a c c t a c t t c t t c t g t a c c t g c t g c a c c a c

6540 1015 1020 1025 1030 cytoplasmic domain DADEYLIPQQGFFSSPSTSR EGFR DADEYLIPQQGFFSSPSTSR g a t g c c g a c g a g t a c c t c a t c c c a c a g c a g g g c t t c t t c a g c a g c c c c t c c a c g t c a c g g c t a c g g c t g c t c a t g g a g t a g g g t g t c g t c c c g a a g a a g t c g t c g g g g a g g t g c a g t g c c

6600 1035 1040 1045 1050 cytoplasmic domain TPLLSSLSATSNNSTVACID EGFR TPLLSSLSATSNNSTVACID a c t c c c c t c c t g a g c t c t c t g a g t g c a a c c a g c a a c a a t t c c a c c g t g g c t t g c a t t g a t t g a g g g g a g g a c t c g a g a g a c t c a c g t t g g t c g t t g t t a a g g t g g c a c c g a a c g t a a c t a

6660 1055 1060 1065 1070 cytoplasmic domain RNGLQSCPIKEDSFLQRYSS EGFR RNGLQSCPIKEDSFLQRYSS a g a a a t g g g c t g c a a a g c t g t c c c a t c a a g g a a g a c a g c t t c t t g c a g c g a t a c a g c t c a t c t t t a c c c g a c g t t t c g a c a g g g t a g t t c c t t c t g t c g a a g a a c g t c g c t a t g t c g a g t

6720 1075 1080 1085 1090 cytoplasmic domain DPTGALTEDSIDDTFLPVPE EGFR DPTGALTEDSIDDTFLPVPE g a c c c c a c a g g c g c c t t g a c t g a g g a c a g c a t a g a c g a c a c c t t c c t c c c a g t g c c t g a a c t g g g g t g t c c g c g g a a c t g a c t c c t g t c g t a t c t g c t g t g g a a g g a g g g t c a c g g a c t t

6780 1095 1100 1105 1110 cytoplasmic domain YINQSVPKRPAGSVQNPVYH EGFR YINQSVPKRPAGSVQNPVYH t a c a t a a a c c a g t c c g t t c c c a a a a g g c c c g c t g g c t c t g t g c a g a a t c c t g t c t a t c a c a t g t a t t t g g t c a g g c a a g g g t t t t c c g g g c g a c c g a g a c a c g t c t t a g g a c a g a t a g t g

6840 1115 1120 1125 1130 cytoplasmic domain NQPLNPAPSRDPHYQDPHST EGFR NQPLNPAPSRDPHYQDPHST a a t c a g c c t c t g a a c c c c g c g c c c a g c a g a g a c c c a c a c t a c c a g g a c c c c c a c a g c a c t t t a g t c g g a g a c t t g g g g c g c g g g t c g t c t c t g g g t g t g a t g g t c c t g g g g g t g t c g t g a

6900 1135 1140 1145 1150 cytoplasmic domain AVGNPEYLNTVQPTCVNSTF EGFR AVGNPEYLNTVQPTCVNSTF g c a g t g g g c a a c c c c g a g t a t c t c a a c a c t g t c c a g c c c a c c t g t g t c a a c a g c a c a t t c c g t c a c c c g t t g g g g c t c a t a g a g t t g t g a c a g g t c g g g t g g a c a c a g t t g t c g t g t a a g

6960 1155 1160 1165 1170 cytoplasmic domain DSPAHWAQKGSHQISLDNPD EGFR DSPAHWAQKGSHQISLDNPD g a c a g c c c t g c c c a c t g g g c c c a g a a a g g c a g c c a c c a a a t t a g c c t g g a c a a c c c t g a c c t g t c g g g a c g g g t g a c c c g g g t c t t t c c g t c g g t g g t t t a a t c g g a c c t g t t g g g a c t g

7020 1175 1180 1185 1190 cytoplasmic domain YQQDFFPKEAKPNGIFKGST EGFR YQQDFFPKEAKPNGIFKGST t a c c a g c a g g a c t t c t t t c c c a a g g a a g c c a a g c c a a a t g g c a t c t t t a a g g g c t c c a c a a t g g t c g t c c t g a a g a a a g g g t t c c t t c g g t t c g g t t t a c c g t a g a a a t t c c c g a g g t g t

7080 1195 1200 1205 1210 cytoplasmic domain AENAEYLRVAPQSSEFIGA* EGFR AENAEYLRVAPQSSEFIGA* g c t g a a a a t g c a g a a t a c c t a a g g g t c g c g c c a c a a a g c a g t g a a t t t a t t g g a g c a t a g c g a c t t t t a c g t c t t a t g g a t t c c c a g c g c g g t g t t t c g t c a c t t a a a t a a c c t c g t a t c

7140 attB2 T3 promoter a a c c c a g c t t t c t t g t a c a a a g t g g t g a t a t c c a a t t a a c c c t c a c t a a a g g g a t g t a t a t t g g g t c g a a a g a a c a t g t t t c a c c a c t a t a g g t t a a t t g g g a g t g a t t t c c c t a c a t a t

7200 T7 promoter attB4 G G G A T A T C A C T C A G C A T A A T T7 a t g a t g t g t g t a a a t t c c c t a t a g t g a g t c g t a t t a c c a c c c a a c t t t t c t a t a c a a a g t t a c t a c a c a c a t t t a a g g g a t a t c a c t c a g c a t a a t g g t g g g t t g a a a a g a t a t g t t t c a

BstBIXbaIPspXINotI 7260 1 GK attB4 V5 tag g g t t g a t a t c c a g c a c a g t g g c g g c c g c t c g a g t c t a g a g g g c c c g c g g t t c g a a g g t a a c c a a c t a t a g g t c g t g t c a c c g c c g g c g a g c t c a g a t c t c c c g g g c g c c a a g c t t c c a t t

MluI 7320 5 10 PIPNPLLGLDST RTG* V5 tag IRES g c c t a t c c c t a a c c c t c t c c t c g g t c t c g a t t c t a c g c g t a c c g g t t a g t a a t g a g a t c c c g g a t a g g g a t t g g g a g a g g a g c c a g a g c t a a g a t g c g c a t g g c c a a t c a t t a c t c t a g g

7380 IRES IRES c t c c c c c c c c c c t a a c g t t a c t g g c c g a a g c c g c t t g g a a t a a g g c c g g t g t g c g t t t g t g a g g g g g g g g g g a t t g c a a t g a c c g g c t t c g g c g a a c c t t a t t c c g g c c a c a c g c a a a c a

7440 IRES IRES c t a t a t g t t a t t t t c c a c c a t a t t g c c g t c t t t t g g c a a t g t g a g g g c c c g g a a a c c t g g g a t a t a c a a t a a a a g g t g g t a t a a c g g c a g a a a a c c g t t a c a c t c c c g g g c c t t t g g a c c

7500 IRES IRES G A G A G C G G T T T C C T T A C G IRES reverse c c c t g t c t t c t t g a c g a g c a t t c c t a g g g g t c t t t c c c c t c t c g c c a a a g g a a t g c a a g g g g g a c a g a a g a a c t g c t c g t a a g g a t c c c c a g a a a g g g g a g a g c g g t t t c c t t a c g t t c c

7560 IRES IRES t c t g t t g a a t g t c g t g a a g g a a g c a g t t c c t c t g g a a g c t t c t t g a a g a c a a a c a a c g t c a g a c a a c t t a c a g c a c t t c c t t c g t c a a g g a g a c c t t c g a a g a a c t t c t g t t t g t t g c a g

7620 IRES IRES t g t a g c g a c c c t t t g c a g g c a g c g g a a c c c c c c a c c t g g c g a c a g g t g c c t c t g c g g c c a a c a t c g c t g g g a a a c g t c c g t c g c c t t g g g g g g t g g a c c g c t g t c c a c g g a g a c g c c g g t

7680 IRES IRES a a a g c c a c g t g t a t a a g a t a c a c c t g c a a a g g c g g c a c a a c c c c a g t g c c a c g t t g t g a g t t t c g g t g c a c a t a t t c t a t g t g g a c g t t t c c g c c g t g t t g g g g t c a c g g t g c a a c a c t c

7740 IRES IRES T G G C T C T C C T C A A G C G T A T T IRES-F t t g g a t a g t t g t g g a a a g a g t c a a a t g g c t c t c c t c a a g c g t a t t c a a c a a g g g g c t g a a a a c c t a t c a a c a c c t t t c t c a g t t t a c c g a g a g g a g t t c g c a t a a g t t g t t c c c c g a c t t

Acc65IKpnI 7800 IRES IRES g g a t g c c c a g a a g g t a c c c c a t t g t a t g g g a t c t g a t c t g g g g c c t c g g t g c a c a t g c t t c c t a c g g g t c t t c c a t g g g g t a a c a t a c c c t a g a c t a g a c c c c g g a g c c a c g t g t a c g a a

7860 IRES IRES t a c a t g t g t t t a g t c g a g g t t a a a a a a a c g t c t a g g c c c c c c g a a c c a c g g g g a c g t g g t a t g t a c a c a a a t c a g c t c c a a t t t t t t t g c a g a t c c g g g g g g c t t g g t g c c c c t g c a c c a

7920 1 5 MTEYKPT IRES PuroR IRES T A C T G G C T C A T G T T C G G G T G Puro-R MATHMTEYKPT t t t c c t t t g a a a a a c a c g a t g a t a a t a t g g c c a c a c a t a t g a c c g a g t a c a a g c c c a c g g a a a g g a a a c t t t t t g t g c t a c t a t t a t a c c g g t g t g t a t a c t g g c t c a t g t t c g g g t g c c

BsiWI 7980 10 15 20 25 VRLATRDDVPRAVRTLAAAF PuroR VRLATRDDVPRAVRTLAAAF t g c g c c t c g c c a c c c g c g a c g a c g t c c c c a g g g c c g t a c g c a c c c t c g c c g c c g c g t t c g a c g c g g a g c g g t g g g c g c t g c t g c a g g g g t c c c g g c a t g c g t g g g a g c g g c g g c g c a a g c

RsrII 8040 30 35 40 45 ADYPATRHTVDPDRHIERVT PuroR ADYPATRHTVDPDRHIERVT c c g a c t a c c c c g c c a c g c g c c a c a c c g t c g a t c c g g a c c g c c a c a t c g a g c g g g t c a c c g g g c t g a t g g g g c g g t g c g c g g t g t g g c a g c t a g g c c t g g c g g t g t a g c t c g c c c a g t g g c

8100 50 55 60 65 ELQELFLTRVGLDIGKVWVA PuroR ELQELFLTRVGLDIGKVWVA a g c t g c a a g a a c t c t t c c t c a c g c g c g t c g g g c t c g a c a t c g g c a a g g t g t g g g t c g c g g t c g a c g t t c t t g a g a a g g a g t g c g c g c a g c c c g a g c t g t a g c c g t t c c a c a c c c a g c g c c

8160 70 75 80 85 DDGAAVAVWTTPESVEAGAV PuroR DDGAAVAVWTTPESVEAGAV a c g a c g g c g c c g c g g t g g c g g t c t g g a c c a c g c c g g a g a g c g t c g a a g c g g g g g c g g t g t t g c t g c c g c g g c g c c a c c g c c a g a c c t g g t g c g g c c t c t c g c a g c t t c g c c c c c g c c a c a

8220 90 95 100 105 FAEIGPRMAELSGSRLAAQQ PuroR FAEIGPRMAELSGSRLAAQQ t c g c c g a g a t c g g c c c g c g c a t g g c c g a g t t g a g c g g t t c c c g g c t g g c c g c g c a g c a a c a g c g g c t c t a g c c g g g c g c g t a c c g g c t c a a c t c g c c a a g g g c c g a c c g g c g c g t c g t t g

8280 110 115 120 125 QMEGLLAPHRPKEPAWFLAT PuroR QMEGLLAPHRPKEPAWFLAT a g a t g g a a g g c c t c c t g g c g c c g c a c c g g c c c a a g g a g c c c g c g t g g t t c c t g g c c a c c g t c t a c c t t c c g g a g g a c c g c g g c g t g g c c g g g t t c c t c g g g c g c a c c a a g g a c c g g t g g c

8340 130 135 140 145 VGVSPDHQGKGLGSAVVLPG PuroR VGVSPDHQGKGLGSAVVLPG t c g g c g t c t c g c c c g a c c a c c a g g g c a a g g g t c t g g g c a g c g c c g t c g t g c t c c c c g g a g a g c c g c a g a g c g g g c t g g t g g t c c c g t t c c c a g a c c c g t c g c g g c a g c a c g a g g g g c c t c

8400 150 155 160 165 VEAAERAGVPAFLETSAPRN PuroR G C A A C C Puro-F VEAAERAGVPAFLETSAPRN t g g a g g c g g c c g a g c g c g c c g g g g t g c c c g c c t t c c t g g a g a c c t c c g c g c c c c g c a a c c a c c t c c g c c g g c t c g c g c g g c c c c a c g g g c g g a a g g a c c t c t g g a g g c g c g g g g c g t t g g

8460 170 175 180 185 LPFYERLGFTVTADVEVPEG PuroR T C C C C T T C T A C G A G C Puro-F LPFYERLGFTVTADVEVPEG t c c c c t t c t a c g a g c g g c t c g g c t t c a c c g t c a c c g c c g a c g t c g a g g t g c c c g a a g g a c a g g g g a a g a t g c t c g c c g a g c c g a a g t g g c a g t g g c g g c t g c a g c t c c a c g g g c t t c c t g

BsaBI*SexAI*CsiI 8520 190 195 200 PRTWCMTRKPGA* PuroR WPRE PRTWCMTRKPGA* c g c g c a c c t g g t g c a t g a c c c g c a a g c c c g g t g c c t a a a t c g a t a g a t c c t a a t c a a c c t g c g c g t g g a c c a c g t a c t g g g c g t t c g g g c c a c g g a t t t a g c t a t c t a g g a t t a g t t g g a

8580 WPRE A C A A C G A G G A A A A T G C WPRE-R c t g g a t t a c a a a a t t t g t g a a a g a t t g a c t g g t a t t c t t a a c t a t g t t g c t c c t t t t a c g g a c c t a a t g t t t t a a a c a c t t t c t a a c t g a c c a t a a g a a t t g a t a c a a c g a g g a a a a t g c

8640 WPRE G A T A C WPRE-R MPLYHAIASRMAF MLLLPVWLS c t a t g t g g a t a c g c t g c t t t a a t g c c t t t g t a t c a t g c t a t t g c t t c c c g t a t g g c t t t c g a t a c a c c t a t g c g a c g a a a t t a c g g a a a c a t a g t a c g a t a a c g a a g g g c a t a c c g a a a g

8700 WPRE IFSSLYKSWLLSLYEELWPV FSPPCINPGCCLFMRSCGPL a t t t t c t c c t c c t t g t a t a a a t c c t g g t t g c t g t c t c t t t a t g a g g a g t t g t g g c c c g t t t a a a a g a g g a g g a a c a t a t t t a g g a c c a a c g a c a g a g a a a t a c t c c t c a a c a c c g g g c a a

8760 WPRE VRQRGVVCTVFADATPTGWG SGNVAWCALCLLTQPPLVGA g t c a g g c a a c g t g g c g t g g t g t g c a c t g t g t t t g c t g a c g c a a c c c c c a c t g g t t g g g g c c a g t c c g t t g c a c c g c a c c a c a c g t g a c a c a a a c g a c t g c g t t g g g g g t g a c c a a c c c c g

8820 WPRE IATTCQLLSGTFAFPLPIAT LPPPVSSFPGLSLSPSLLPR a t t g c c a c c a c c t g t c a g c t c c t t t c c g g g a c t t t c g c t t t c c c c c t c c c t a t t g c c a c g t a a c g g t g g t g g a c a g t c g a g g a a a g g c c c t g a a a g c g a a a g g g g g a g g g a t a a c g g t g c

8880 WPRE AELIAACLARCWTGARLLGT RNSSPPALPAAGQGLGCWAL g c g g a a c t c a t c g c c g c c t g c c t t g c c c g c t g c t g g a c a g g g g c t c g g c t g t t g g g c a c t c g c c t t g a g t a g c g g c g g a c g g a a c g g g c g a c g a c c t g t c c c c g a g c c g a c a a c c c g t g a

8940 WPRE (in frame with Factor Xa site) *RGKRPQEGTN DNSVVLSGKSSSFPWLLACV TIPWCCRGNHRPFLGCSPVL g a c a a t t c c g t g g t g t t g t c g g g g a a a t c a t c g t c c t t t c c t t g g c t g c t c g c c t g t g t t c t g t t a a g g c a c c a c a a c a g c c c c t t t a g t a g c a g g a a a g g a a c c g a c g a g c g g a c a c a a

9000 WPRE (in frame with Factor Xa site) GGPNQAPRGEAVDR 1 RGEI Factor Xa site ATWILRGTSFCYVPSALNPA PPGFCAGRPSATSLRPSIQR g c c a c c t g g a t t c t g c g c g g g a c g t c c t t c t g c t a c g t c c c t t c g g c c c t c a a t c c a g c g c g g t g g a c c t a a g a c g c g c c c t g c a g g a a g a c g a t g c a g g g a a g c c g g g a g t t a g g t c g c

9060 WPRE DLPSRGLLPALRPLPRLRLR TFLPAACCRLCGLFRVFAFA g a c c t t c c t t c c c g c g g c c t g c t g c c g g c t c t g c g g c c t c t t c c g c g t c t t c g c c t t c g c c t g g a a g g a a g g g c g c c g g a c g a c g g c c g a g a c g c c g g a g a a g g c g c a g a a g c g g a a g c g

9120 WPRE PQTSRISLWAASPPEIL* LRRVGSPFGPPPRLRSFKTN c c t c a g a c g a g t c g g a t c t c c c t t t g g g c c g c c t c c c c g c c t g a g a t c c t t t a a g a c c a a g g a g t c t g c t c a g c c t a g a g g g a a a c c c g g c g g a g g g g c g g a c t c t a g g a a a t t c t g g t t

9180 3' LTR ( Δ U3) DLQGSCRS* t g a c t t a c a a g g c a g c t g t a g a t c t t a g c c a c t t t t t a a a a g a a a a g g g g g g a c t g g a a g a c t g a a t g t t c c g t c g a c a t c t a g a a t c g g t g a a a a a t t t t c t t t t c c c c c c t g a c c t t c

9240 3' LTR ( Δ U3) g g c t a a t t c a c t c c c a a c g a a g a c a a g a t c t g c t t t t t g c t t g t a c t g g g t c t c t c t g g t c c g a t t a a g t g a g g g t t g c t t c t g t t c t a g a c g a a a a a c g a a c a t g a c c c a g a g a g a c c a

9300 3' LTR ( Δ U3) t a g a c c a g a t c t g a g c c t g g g a g c t c t c t g g c t a a c t a g g g a a c c c a c t g c t t a a g c c t c a t c t g g t c t a g a c t c g g a c c c t c g a g a g a c c g a t t g a t c c c t t g g g t g a c g a a t t c g g a g

9360 3' LTR ( Δ U3) a a t a a a g c t t g c c t t g a g t g c t t c a a g t a g t g t g t g c c c g t c t g t t g t g t g a c t c t g g t a t t a t t t c g a a c g g a a c t c a c g a a g t t c a t c a c a c a c g g g c a g a c a a c a c a c t g a g a c c a t

9420 3' LTR ( Δ U3) a c t a g a g a t c c c t c a g a c c c t t t t a g t c a g t g t g g a a a a t c t c t a g c a g t a g t a g t t c a t t g a t c t c t a g g g a g t c t g g g a a a a t c a g t c a c a c c t t t t a g a g a t c g t c a t c a t c a a g t a

9480 g t c a t c t t a t t a t t c a g t a t t t a t a a c t t g c a a a g a a a t g a a t a t c a g a g a g t g a g a g g c c a g t a g a a t a a t a a g t c a t a a a t a t t g a a c g t t t c t t t a c t t a t a g t c t c t c a c t c t c c g

PacI 9540 C G G A G C A C T A T G C G pBRforEco c c g g g t t a a t t a a g g a a a g g g c t a g a t c a t t c t t g a a g a c g a a a g g g c c t c g t g a t a c g c g g c c c a a t t a a t t c c t t t c c c g a t c t a g t a a g a a c t t c t g c t t t c c c g g a g c a c t a t g c g

9600 G A T A A pBRforEco c t a t t t t t a t a g g t t a a t g t c a t g a t a a t a a t g g t t t c t t a g a c g t c a g g t g g c a c t t t t g a t a a a a a t a t c c a a t t a c a g t a c t a t t a t t a c c a a a g a a t c t g c a g t c c a c c g t g a a a a

9660 AmpR promoter c g g g g a a a t g t g c g c g g a a c c c c t a t t t g t t t a t t t t t c t a a a t a c a t t c a a a t a t g t a t g c c c c t t t a c a c g c g c c t t g g g g a t a a a c a a a t a a a a a g a t t t a t g t a a g t t t a t a c a t a

9720 1 sig... M AmpR promoter AmpR M c c g c t c a t g a g a c a a t a a c c c t g a t a a a t g c t t c a a t a a t a t t g a a a a a g g a a g a g t a t g g g c g a g t a c t c t g t t a t t g g g a c t a t t t a c g a a g t t a t t a t a a c t t t t t c c t t c t c a t a c

9780 5 10 15 20 signal sequence SIQHFRVALIPFFAAFCLPV AmpR SIQHFRVALIPFFAAFCLPV a g t a t t c a a c a t t t c c g t g t c g c c c t t a t t c c c t t t t t t g c g g c a t t t t g c c t t c c t g t t t c a t a a g t t g t a a a g g c a c a g c g g g a a t a a g g g a a a a a a c g c c g t a a a a c g g a a g g a c a a

9840 signal se... FA 25 30 35 40 HPETLVKVKDAEDQLGAR AmpR FAHPETLVKVKDAEDQLGAR t t t g c t c a c c c a g a a a c g c t g g t g a a a g t a a a a g a t g c t g a a g a t c a g t t g g g t g c a c g a a a a c g a g t g g g t c t t t g c g a c c a c t t t c a t t t t c t a c g a c t t c t a g t c a a c c c a c g t g c t

9900 45 50 55 60 VGYIELDLNSGKILESFRPE AmpR VGYIELDLNSGKILESFRPE g t g g g t t a c a t c g a a c t g g a t c t c a a c a g c g g t a a g a t c c t t g a g a g t t t t c g c c c c g a a c a c c c a a t g t a g c t t g a c c t a g a g t t g t c g c c a t t c t a g g a a c t c t c a a a a g c g g g g c t t

9960 65 70 75 80 ERFPMMSTFKVLLCGAVLSR AmpR ERFPMMSTFKVLLCGAVLSR g a a c g t t t t c c a a t g a t g a g c a c t t t t a a a g t t c t g c t a t g t g g c g c g g t a t t a t c c c g t c t t g c a a a a g g t t a c t a c t c g t g a a a a t t t c a a g a c g a t a c a c c g c g c c a t a a t a g g g c a

10,020 85 90 95 100 VDAGQEQLGRRIHYSQNDLV AmpR VDAGQEQLGRRIHYSQNDLV g t t g a c g c c g g g c a a g a g c a a c t c g g t c g c c g c a t a c a c t a t t c t c a g a a t g a c t t g g t t c a a c t g c g g c c c g t t c t c g t t g a g c c a g c g g c g t a t g t g a t a a g a g t c t t a c t g a a c c a a

10,080 105 110 115 120 EYSPVTEKHLTDGMTVRELC AmpR EYSPVTEKHLTDGMTVRELC g a g t a c t c a c c a g t c a c a g a a a a g c a t c t t a c g g a t g g c a t g a c a g t a a g a g a a t t a t g c c t c a t g a g t g g t c a g t g t c t t t t c g t a g a a t g c c t a c c g t a c t g t c a t t c t c t t a a t a c g

PvuI 10,140 125 130 135 140 SAAITMSDNTAANLLLTTIG AmpR SAAITMSDNTAANLLLTTIG a g t g c t g c c a t a a c c a t g a g t g a t a a c a c t g c g g c c a a c t t a c t t c t g a c a a c g a t c g g a t c a c g a c g g t a t t g g t a c t c a c t a t t g t g a c g c c g g t t g a a t g a a g a c t g t t g c t a g c c t

10,200 145 150 155 160 GPKELTAFLHNMGDHVTRLD AmpR GPKELTAFLHNMGDHVTRLD g g a c c g a a g g a g c t a a c c g c t t t t t t g c a c a a c a t g g g g g a t c a t g t a a c t c g c c t t g a t c c t g g c t t c c t c g a t t g g c g a a a a a a c g t g t t g t a c c c c c t a g t a c a t t g a g c g g a a c t a

10,260 165 170 175 180 RWEPELNEAIPNDERDTTMP AmpR RWEPELNEAIPNDERDTTMP c g t t g g g a a c c g g a g c t g a a t g a a g c c a t a c c a a a c g a c g a g c g t g a c a c c a c g a t g c c t g c a a c c c t t g g c c t c g a c t t a c t t c g g t a t g g t t t g c t g c t c g c a c t g t g g t g c t a c g g a

FspI 10,320 185 190 195 200 VAMATTLRKLLTGELLTLAS AmpR VAMATTLRKLLTGELLTLAS g t a g c a a t g g c a a c a a c g t t g c g c a a a c t a t t a a c t g g c g a a c t a c t t a c t c t a g c t t c c c a t c g t t a c c g t t g t t g c a a c g c g t t t g a t a a t t g a c c g c t t g a t g a a t g a g a t c g a a g g

10,380 205 210 215 220 RQQLIDWMEADKVAGPLLRS AmpR RQQLIDWMEADKVAGPLLRS c g g c a a c a a t t a a t a g a c t g g a t g g a g g c g g a t a a a g t t g c a g g a c c a c t t c t g c g c t c g g c c g t t g t t a a t t a t c t g a c c t a c c t c c g c c t a t t t c a a c g t c c t g g t g a a g a c g c g a g c

10,440 225 230 235 240 ALPAGWFIADKSGAGERGSR AmpR ALPAGWFIADKSGAGERGSR g c c c t t c c g g c t g g c t g g t t t a t t g c t g a t a a a t c t g g a g c c g g t g a g c g t g g g t c t c g c c g g g a a g g c c g a c c g a c c a a a t a a c g a c t a t t t a g a c c t c g g c c a c t c g c a c c c a g a g c g

10,500 245 250 255 260 GIIAALGPDGKPSRIVVIYT AmpR GIIAALGPDGKPSRIVVIYT g g t a t c a t t g c a g c a c t g g g g c c a g a t g g t a a g c c c t c c c g t a t c g t a g t t a t c t a c a c g c c a t a g t a a c g t c g t g a c c c c g g t c t a c c a t t c g g g a g g g c a t a g c a t c a a t a g a t g t g c

10,560 265 270 275 280 TGSQATMDERNRQIAEIGAS AmpR TGSQATMDERNRQIAEIGAS a c g g g g a g t c a g g c a a c t a t g g a t g a a c g a a a t a g a c a g a t c g c t g a g a t a g g t g c c t c a t g c c c c t c a g t c c g t t g a t a c c t a c t t g c t t t a t c t g t c t a g c g a c t c t a t c c a c g g a g t

10,620 285 LIKHW* AmpR LIKHW* c t g a t t a a g c a t t g g t a a c t g t c a g a c c a a g t t t a c t c a t a t a t a c t t t a g a t t g a t t t a g a c t a a t t c g t a a c c a t t g a c a g t c t g g t t c a a a t g a g t a t a t a t g a a a t c t a a c t a a a t

10,680 a a a c t t c a t t t t t a a t t t a a a a g g a t c t a g g t g a a g a t c c t t t t t g a t a a t c t c a t g a c c t t t g a a g t a a a a a t t a a a t t t t c c t a g a t c c a c t t c t a g g a a a a a c t a t t a g a g t a c t g g

10,740 a a a a t c c c t t a a c g t g a g t t t t c g t t c c a c t g a g c g t c a g a c c c c g t a g a a a a g a t c a a a t t t t a g g g a a t t g c a c t c a a a a g c a a g g t g a c t c g c a g t c t g g g g c a t c t t t t c t a g t t t

10,800 ori g g a t c t t c t t g a g a t c c t t t t t t t c t g c g c g t a a t c t g c t g c t t g c a a a c a a a a a a a c c a c c t a g a a g a a c t c t a g g a a a a a a a g a c g c g c a t t a g a c g a c g a a c g t t t g t t t t t t t g g t

10,860 ori c c g c t a c c a g c g g t g g t t t g t t t g c c g g a t c a a g a g c t a c c a a c t c t t t t t c c g a a g g t a g g c g a t g g t c g c c a c c a a a c a a a c g g c c t a g t t c t c g a t g g t t g a g a a a a a g g c t t c c a t

10,920 ori a c t g g c t t c a g c a g a g c g c a g a t a c c a a a t a c t g t t c t t c t a g t g t a g c c g t a g t t a g g c t g a c c g a a g t c g t c t c g c g t c t a t g g t t t a t g a c a a g a a g a t c a c a t c g g c a t c a a t c c g

10,980 ori c a c c a c t t c a a g a a c t c t g t a g c a c c g c c t a c a t a c c t c g c t c t g c t a a t c c t g t t a c c a g t g g t g a a g t t c t t g a g a c a t c g t g g c g g a t g t a t g g a g c g a g a c g a t t a g g a c a a t g g t

11,040 ori g t g g c t g c t g c c a g t g g c g a t a a g t c g t g t c t t a c c g g g t t g g a c t c a a g a c g a t a g t t a c a c c g a c g a c g g t c a c c g c t a t t c a g c a c a g a a t g g c c c a a c c t g a g t t c t g c t a t c a a t

11,100 ori c c g g a t a a g g c g c a g c g g t c g g g c t g a a c g g g g g g t t c g t g c a c a c a g c c c a g c t t g g a g g g c c t a t t c c g c g t c g c c a g c c c g a c t t g c c c c c c a a g c a c g t g t g t c g g g t c g a a c c t c

11,160 ori c g a a c g a c c t a c a c c g a a c t g a g a t a c c t a c a g c g t g a g c t a t g a g a a a g c g c c a c g c t t g c t t g c t g g a t g t g g c t t g a c t c t a t g g a t g t c g c a c t c g a t a c t c t t t c g c g g t g c g a a

11,220 ori c c c g a a g g g a g a a a g g c g g a c a g g t a t c c g g t a a g c g g c a g g g t c g g a a c a g g a g a g c g c g g g c t t c c c t c t t t c c g c c t g t c c a t a g g c c a t t c g c c g t c c c a g c c t t g t c c t c t c g c g

11,280 ori G G G A A A C G C C T G G T A T C T T T pBR322ori-F a c g a g g g a g c t t c c a g g g g g a a a c g c c t g g t a t c t t t a t a g t c c t g t c g g g t t t c g c c a c t g c t c c c t c g a a g g t c c c c c t t t g c g g a c c a t a g a a a t a t c a g g a c a g c c c a a a g c g g t g

11,340 ori c t c t g a c t t g a g c g t c g a t t t t t g t g a t g c t c g t c a g g g g g g c g g a g c c t a t g g a a a a a c g a g a c t g a a c t c g c a g c t a a a a a c a c t a c g a g c a g t c c c c c c g c c t c g g a t a c c t t t t t g

11,400 g c c a g c a a c g c g g c c t t t t t a c g g t t c c t g g c c t t t t g c t g g c c t t t t g c t c a c a t g t t c c g g t c g t t g c g c c g g a a a a a t g c c a a g g a c c g g a a a a c g a c c g g a a a a c g a g t g t a c a a g

11,460 t t t c c t g c g t t a t c c c c t g a t t c t g t g g a t a a c c g t a t t a c c g c c t t t g a g t g a g c t g a t a a a g g a c g c a a t a g g g g a c t a a g a c a c c t a t t g g c a t a a t g g c g g a a a c t c a c t c g a c t a

11,520 A G C G A G T C A G T G A G C G A G L4440 a c c g c t c g c c g c a g c c g a a c g a c c g a g c g c a g c g a g t c a g t g a g c g a g g a a g c g g a a g a g t g g c g a g c g g c g t c g g c t t g c t g g c t c g c g t c g c t c a g t c a c t c g c t c c t t c g c c t t c t c

11,580 c g c c c a a t a c g c a a a c c g c c t c t c c c c g c g c g t t g g c c g a t t c a t t a a t g c a g c a a g c t c g c g g g t t a t g c g t t t g g c g g a g a g g g g c g c g c a a c c g g c t a a g t a a t t a c g t c g t t c g a g

SfiI 11,640 a t g g c t g a c t a a t t t t t t t t a t t t a t g c a g a g g c c g a g g c c g c c t c g g c c t c t g a g c t a t t a c c g a c t g a t t a a a a a a a a t a a a t a c g t c t c c g g c t c c g g c g g a g c c g g a g a c t c g a t a

11,700 t c c a g a a g t a g t g a g g a g g c t t t t t t g g a g g c c t a g g c t t t t g c a a a a a g c t c c c c g t g g a g g t c t t c a t c a c t c c t c c g a a a a a a c c t c c g g a t c c g a a a a c g t t t t t c g a g g g g c a c c

11,760 CAP binding site c a c g a c a g g t t t c c c g a c t g g a a a g c g g g c a g t g a g c g c a a c g c a a t t a a t g t g a g t t a g g t g c t g t c c a a a g g g c t g a c c t t t c g c c c g t c a c t c g c g t t g c g t t a a t t a c a c t c a a t c

11,820 -35 -10 CAP binding site lac promoter c t c a c t c a t t a g g c a c c c c a g g c t t t a c a c t t t a t g c t t c c g g c t c g t a t g t t g t g t g g a g a g t g a g t a a t c c g t g g g g t c c g a a a t g t g a a a t a c g a a g g c c g a g c a t a c a a c a c a c c t

11,880 lac operator M13 rev A G C G G A T A A C A A T T T C A C A C A G G M13/pUC Reverse C A G G A A A C A G C T A T G A C M13 Reverse a t t g t g a g c g g a t a a c a a t t t c a c a c a g g a a a c a g c t a t g a c a t g a t t a c g a a t t t c a c a t a a c a c t c g c c t a t t g t t a a a g t g t g t c c t t t g t c g a t a c t g t a c t a a t g c t t a a a g t g t

11,940 G T G G T T T G T C C A A A C T C A T C EBV-rev a a t a a a g c a t t t t t t t c a c t g c a t t c t a g t t g t g g t t t g t c c a a a c t c a t c a a t g t a t c t t t a t t t c g t a a a a a a a g t g a c g t a a g a t c a a c a c c a a a c a g g t t t g a g t a g t t a c a t a g a

12,000 t a t c a t g t c t g g a t c a a c t g g a t a a c t c a a g c t a a c c a a a a t c a t c c c a a a c t t c c c a c c a t a g t a c a g a c c t a g t t g a c c t a t t g a g t t c g a t t g g t t t t a g t a g g g t t t g a a g g g t g g

12,060 c c a t a c c c t a t t a c c a c t g c c a a t t a c c t g t g g t t t c a t t t a c t c t a a a c c t g t g a t t c c g g t a t g g g a t a a t g g t g a c g g t t a a t g g a c a c c a a a g t a a a t g a g a t t t g g a c a c t a a g g

12,120 t c t g a a t t a t t t t c a t t t t a a a g a a a t t g t a t t t g t t a a a t a t g t a c t a c a a a c t t a g t a a g a c t t a a t a a a a g t a a a a t t t c t t t a a c a t a a a c a a t t t a t a c a t g a t g t t t g a a t c a t

3ʹ 5ʹ 12,122 g t c a

Restriction Enzymes

Instructions: By default, all cutters are shown. Filter on number of cut sites or search by enzyme name.

Filter

Features

Primers

BLAST

BLAST (Basic Local Alignment Search Tool) finds regions of similarity between biological sequences. Click on the buttons below to submit a BLAST search to NCBI. The results will appear in a new window. See your recent BLAST results on NCBI's website.

  • Nucleotide-Nucleotide BLAST (BLASTN)

  • Translated Nucleotide-Protein BLAST (BLASTX)

  • Sequence alignment using BLAST (BLAST2)

Sequence Analyzer Guide

Map

Displays a graphical map based on nucleotide sequence data labeled with restriction enzymes, plasmid features, ORFs (theoretical open reading frames) and primers. Hovering over data labels will display additional information (e.g. cut site)

To select a portion of sequence, click one location on the plasmid and then a second location to display the sequence between the two locations.

Sequence

Displays both strands of base paired nucleotide sequences with annotated enzymes, plasmid features, ORFs (theoretical open reading frames) and primers. Hovering over data labels will display additional information (e.g. cut site).

To select a portion of sequence, click one location on the sequence and then a second location to display the sequence between the two locations.

Enzymes

List of restriction enzymes that can cut a given nucleotide sequence. Table lists enzyme name and the sequence location of the cut.

Features

List of common features detected in a given nucleotide sequence. Table lists feature name, location, size, color used to indicate its position on the map, and direction (if relevant).

Primers

List of commonly used primers detected in a given nucleotide sequence. Table lists primer name, sequence, length, binding site location, and direction.

BLAST

Use Basic Local Alignment Search Tool (BLAST) via the NCBI website to determine similarity between a given sequence and nucleotide (BLASTN) or protein (BLASTX) sequence databases. Additionally, align a custom nucleotide sequence against a given sequence using BLAST2.

File Downloads

GenBank

File contains the nucleotide sequence and annotated features in GenBank flat file format. Open the file with a text editor or plasmid mapping software to view the sequence.

SnapGene

File contains the nucleotide sequence and enhanced annotations from SnapGene Server. Open the file with SnapGene software or the free Viewer to view the plasmid map, sequence, and perform additional sequence analysis.