DISC1
exons by Chris
Carter is licensed under a Creative
Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
.........
Back to BLASTS |
Viral insertions |
Examples of mechanisms of action |
AIDS |
PolygenicBlog on the relations between genes, risk factors and autoimmunity |
DISC1 was exonified from the UCSC Genome browser track Thank you, Kirsty
Every human protein is constructed from multiple viral inserts: This is illustrated for a single protein, DISC1
Exon1 >chr1.2096.6_prot length=37 SRDCLPPAACFRRRRLARRPGYMRSSTGPGIGFLSPA | GENE ID: 6372476 201phi2-1p197 | gp197; similar to phiKZ gp118 and EL gp76 [Pseudomonas phage 201phi2-1] (10 or fewer PubMed links) Score = 29.5 bits (62), Expect = 3.0, Method: Composition-based stats. Identities = 13/27 (48%), Positives = 13/27 (48%), Gaps = 10/27 (37%) Query 11 FRRRRLARRPGYMRSSTGP-GIGFLSP 36 FRR MR T P GI FLSP Sbjct 380 FRR---------MRNYTAPRGITFLSP 397 >gb|AAN17399.1| sigma C protein [Avian orthoreovirus] Length=245 Score = 26.1 bits (54), Expect = 40, Method: Composition-based stats. Identities = 8/11 (72%), Positives = 8/11 (72%), Gaps = 0/11 (0%) Query 18 RRPGYMRSSTG 28 RR YM SSTG Sbjct 207 RRTDYMMSSTG 217 >gb|AAM68138.1| glycoprotein [Borna disease virus] gb|AAM68144.1| glycoprotein [Borna disease virus] Length=503 Query 5 LPPAACF-RRRRLAR 18 LP A F RRRRL R Sbjct 488 LP--ASFARRRRLGR 500 GENE ID: 5142024 TNSV_gp113 | calyx/polyhedrin envelope protein [Trichoplusia ni SNPV] (10 or fewer PubMed links) Query 2 RDCLPPAACFRRR 14 RDCLP RRR Sbjct 100 RDCLP-----RRR 107 | |
Exon 2: >chr1.2096.7_prot length=71 FIRLSLGSAGERGEAEGCPPSREAESHCQSPQEMGAKAASLDGPHEDPRC LSRPFSLLATRVSADLAQA AR | >gb|ABR27363.1| polyprotein [Hepatitis C virus] Length=3011 Query 17 GCPPSREAESH 27 GCPP +AESH Sbjct 2375 GCPPDSDAESH 2385 GENE ID: 8207486 PSS2_gp077 | tail tape measure protein [Marine cyanobacterial siphovirus PSS2] Score = 24.4 bits (50), Expect = 471, Method: Composition-based stats. Identities = 8/10 (80%), Positives = 8/10 (80%), Gaps = 0/10 (0%) Query 2 IRLSLGSAGE 11 IRL LG AGE Sbjct 261 IRLTLGIAGE 270 GENE ID: 921127 THVgp136 | t112 [Tupaiid herpesvirus 1] Score = 24.0 bits (49), Expect = 512, Method: Composition-based stats. Identities = 7/8 (87%), Positives = 7/8 (87%), Gaps = 0/8 (0%) Query 42 DGPHEDPR 49 DGPHED R Sbjct 155 DGPHEDRR 162 GENE ID: 1259714 Bxz1_gp69 | gp69 [Mycobacterium phage Bxz1] (10 or fewer PubMed links) Score = 23.1 bits (47), Expect = 939, Method: Composition-based stats. Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) Query 63 SADLAQA 69 SADLAQA Sbjct 17 SADLAQA 23 GENE ID: 4955109 RGDVS3CP | core capsid protein [Rice gall dwarf virus] (10 or fewer PubMed links) Score = 22.7 bits (46), Expect = 1170, Method: Composition-based stats. Identities = 8/16 (50%), Positives = 9/16 (56%), Gaps = 5/16 (31%) Query 1 FIRLSLGSAGERGEAE 16 F+R ERGEAE Sbjct 853 FVR-----SNERGEAE 863 >gb|AAC68884.1| outer capsid protein VP4 [Human rotavirus strain NnB1] Length=775 Score = 29.1 bits (61), Expect = 15, Method: Composition-based stats. Identities = 11/18 (61%), Positives = 11/18 (61%), Gaps = 5/18 (27%) Query 46 EDPRCLSRPFSLLATRVS 63 EDP PFS L TRVS Sbjct 432 EDP-----PFSILRTRVS 444 | |
Exon 3 >chr1.2096.8_prot length=18 KWEPVLRDCLLRNRRQME | ||
Exon 4 >chr1.2096.9_prot length=14 KLQEDAVEN DDYDK | GENE ID: 5130597 Q54_gp11 | ERF recombination protein [Lactococcus phage Q54] (10 or fewer PubMed links) Score = 26.5 bits (55), Expect = 23 Identities = 7/8 (87%), Positives = 8/8 (100%), Gaps = 0/8 (0%) Query 4 EDAVENDD 11 EDAVEN+D Sbjct 120 EDAVENND 127 >gb|ADJ55329.1| gp56 dCTPase [Enterobacteria phage RB16] Length=177 Score = 25.2 bits (52), Expect = 56 Identities = 6/10 (60%), Positives = 10/10 (100%), Gaps = 0/10 (0%) Query 1 KLQEDAVEND 10 KLQ+DA+E++ Sbjct 76 KLQDDAIEDE 85GENE ID: 3162006 MIMI_R890 | hypothetical protein [Acanthamoeba polyphaga mimivirus] (10 or fewer PubMed links) Score = 24.8 bits (51), Expect = 75 Identities = 7/13 (53%), Positives = 10/13 (76%), Gaps = 0/13 (0%) Query 2 LQEDAVENDDYDK 14 L++D +DDYDK Sbjct 103 LEDDPLTDDDYDK 115 | |
Exon 5 >chr1.2096.16_prot length=50 ETLQQRLEDLEQEKISLHFQLPSRQPALSSFLGHLAAQVQAALRRGATQQ | ||
Exon 6 >chr1.2097.1_prot length=21 DSLHVSITRRDWLLQEKQQLQDSLHVSITRRDWLLQEKQQLQ | GENE ID: 6334528 GPC | glycoprotein precursor [Parana virus] (10 or fewer PubMed links) Score = 26.1 bits (54), Expect = 30 Identities = 9/12 (75%), Positives = 9/12 (75%), Gaps = 1/12 (8%) Query 9 RRDWLLQEKQQL 20 R DWLL E QQL Sbjct 413 RNDWLL-ESQQL 423GENE ID: 3161875 MIMI_L112 | ankyrin containing protein [Acanthamoeba polyphaga mimivirus] (10 or fewer PubMed links) Score = 26.1 bits (54), Expect = 30 Identities = 7/9 (77%), Positives = 8/9 (88%), Gaps = 0/9 (0%) Query 7 ITRRDWLLQ 15 ITRRDW L+ Sbjct 308 ITRRDWALE 316>dbj|BAD00047.1| Fusion protein, Feo [Hepatitis C virus]Length=832 Score = 21.4 bits (43), Expect = 772 Identities = 5/5 (100%), Positives = 5/5 (100%), Gaps = 0/5 (0%) Query 10 RDWLL 14 RDWLL Sbjct 657 RDWLL 661GENE ID: 6334588 3 | portal protein [Mycobacterium phage BPs] Score = 21.4 bits (43), Expect = 772 Identities = 8/11 (72%), Positives = 8/11 (72%), Gaps = 1/11 (9%) Query 1 DSLHVSITRRD 11 DSL SIT RD Sbjct 164 DSL-LSITSRD 173GENE ID: 8997344 112 | hypothetical protein [Klebsiella phage KP15] Score = 21.4 bits (43), Expect = 772 Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%) Query 2 SLHVSI 7 SLHVSI Sbjct 2 SLHVSI 7 | |
Exon 7 >chr1.2097.3_prot length=78 KEIEALQARMFVLEAKDQQLRREIEEQEQQLQWQGCDLTPLVGQLSLGQLQEVSKALQDTLASAGQIPFHAEPPETIR | ||
Exon 8 >chr1.2097.6_prot length=18 LQERIKSLNLSLKEITTK LQERIKSLNLSLKEITTK | >gb|ABD75545.1| spike glycoprotein [Human coronavirus HKU1] Length=1351 Query start position Subject start position Score = 31.6 bits (67), Expect = 0.67 Identities = 12/19 (63%), Positives = 12/19 (63%), Gaps = 3/19 (15%) Query 2 QERIKSLN---LSLKEITT 17 QE IKSLN LKEI T Sbjct 1269 QESIKSLNSSFINLKEIGT 1287 GENE ID: 4157810 Qyrzulap12 | gp12 [Mycobacterium phage Qyrzula] (10 or fewer PubMed links) Score = 24.8 bits (51), Expect = 74 Identities = 10/13 (76%), Positives = 10/13 (76%), Gaps = 3/13 (23%) Query 9 NL-SLKEI--TTK 18 NL SLKEI TTK Sbjct 214 NLDSLKEIVRTTK 226GENE ID: 3431466 ORF-97 p87/vp80 | P87/VP80 [Chrysodeixis chalcites nucleopolyhedrovirus] (10 or fewer PubMed links) Score = 24.8 bits (51), Expect = 74 Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%) Query 4 RIKSLNL 10 RIKSLNL Sbjct 425 RIKSLNL 431 | |
Exon 9 >chr1.2097.8_prot length=34 VCMSEKFCSTLRKKVNDIETQLPALLEAKMHAIS | ||
Exon 10 >chr1.2097.10_prot length=62 NHFWTAKDLTEEIRSLTSEREGLEGLLSKLLVLSSRNVKKLGSVKEDYNRLRREVEHQETAY | ||
Exon 11 >chr1.2099.4_prot length=16 CKCPLLGKVWEADLEA | ||
Exon 12 >chr1.2099.13_prot length=5 REQTK REQTK
| >gb|ACI26571.1| hemagglutinin [Influenza A virus (A/Siena/10/1990(H3N2))] Length=566 Score = 19.3 bits (38), Expect = 1739 Identities = 5/5 (100%), Positives = 5/5 (100%), Gaps = 0/5 (0%) Query 1 REQTK 5 REQTK Sbjct 205 REQTK 209
| |