Please cite these papers and this site if using these data. Thank you

 

Creative Commons Licence
DISC1 exons by Chris Carter is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.

 .........      

Back to BLASTS

Viral insertions

Examples of mechanisms of action

AIDS

     

PolygenicBlog on the relations between genes, risk factors and autoimmunity

    
DISC1 was exonified from  the UCSC Genome browser track Thank you, Kirsty

Every human protein is constructed from multiple viral inserts: This is illustrated for a single protein, DISC1

 

Other examples are shown below

Viral Homologues of different DISC1 exons
Exon1 
>chr1.2096.6_prot length=37
SRDCLPPAACFRRRRLARRPGYMRSSTGPGIGFLSPA


 GENE ID: 6372476 201phi2-1p197 | gp197; similar to phiKZ gp118 and EL gp76
[Pseudomonas phage 201phi2-1] (10 or fewer PubMed links)

 Score = 29.5 bits (62),  Expect = 3.0, Method: Composition-based stats.
 Identities = 13/27 (48%), Positives = 13/27 (48%), Gaps = 10/27 (37%)

Query  11   FRRRRLARRPGYMRSSTGP-GIGFLSP  36
            FRR         MR  T P GI FLSP
Sbjct  380  FRR---------MRNYTAPRGITFLSP  397

>gb|AAN17399.1|  sigma C protein [Avian orthoreovirus]
Length=245

 Score = 26.1 bits (54),  Expect =    40, Method: Composition-based stats.
 Identities = 8/11 (72%), Positives = 8/11 (72%), Gaps = 0/11 (0%)

Query  18   RRPGYMRSSTG  28
            RR  YM SSTG
Sbjct  207  RRTDYMMSSTG  217

>gb|AAM68138.1|  glycoprotein [Borna disease virus]
 gb|AAM68144.1|  glycoprotein [Borna disease virus]
Length=503


Query  5    LPPAACF-RRRRLAR  18
            LP  A F RRRRL R
Sbjct  488  LP--ASFARRRRLGR  500

GENE ID: 5142024 TNSV_gp113 | calyx/polyhedrin envelope protein
[Trichoplusia ni SNPV] (10 or fewer PubMed links)


Query  2    RDCLPPAACFRRR  14
            RDCLP     RRR
Sbjct  100  RDCLP-----RRR  107


Exon 2: >chr1.2096.7_prot length=71 FIRLSLGSAGERGEAEGCPPSREAESHCQSPQEMGAKAASLDGPHEDPRC LSRPFSLLATRVSADLAQA AR
>gb|ABR27363.1|  polyprotein [Hepatitis C virus]
Length=3011

 
Query  17    GCPPSREAESH  27
             GCPP  +AESH
Sbjct  2375  GCPPDSDAESH  2385

 GENE ID: 8207486 PSS2_gp077 | tail tape measure protein
[Marine cyanobacterial siphovirus PSS2]

 Score = 24.4 bits (50),  Expect =   471, Method: Composition-based stats.
 Identities = 8/10 (80%), Positives = 8/10 (80%), Gaps = 0/10 (0%)

Query  2    IRLSLGSAGE  11
            IRL LG AGE
Sbjct  261  IRLTLGIAGE  270
GENE ID: 921127 THVgp136 | t112 [Tupaiid herpesvirus 1]

 Score = 24.0 bits (49),  Expect =   512, Method: Composition-based stats.
 Identities = 7/8 (87%), Positives = 7/8 (87%), Gaps = 0/8 (0%)

Query  42   DGPHEDPR  49
            DGPHED R 
Sbjct  155  DGPHEDRR  162

 GENE ID: 1259714 Bxz1_gp69 | gp69 [Mycobacterium phage Bxz1]
(10 or fewer PubMed links)

 Score = 23.1 bits (47),  Expect =   939, Method: Composition-based stats.
 Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%)

Query  63  SADLAQA  69
           SADLAQA
Sbjct  17  SADLAQA  23
 GENE ID: 4955109 RGDVS3CP | core capsid protein [Rice gall dwarf virus]
(10 or fewer PubMed links)

 Score = 22.7 bits (46),  Expect =  1170, Method: Composition-based stats.
 Identities = 8/16 (50%), Positives = 9/16 (56%), Gaps = 5/16 (31%)

Query  1    FIRLSLGSAGERGEAE  16
            F+R       ERGEAE
Sbjct  853  FVR-----SNERGEAE  863
>gb|AAC68884.1|  outer capsid protein VP4 [Human rotavirus strain NnB1]
Length=775

 Score = 29.1 bits (61),  Expect =    15, Method: Composition-based stats.
 Identities = 11/18 (61%), Positives = 11/18 (61%), Gaps = 5/18 (27%)

Query  46   EDPRCLSRPFSLLATRVS  63
            EDP     PFS L TRVS
Sbjct  432  EDP-----PFSILRTRVS  444
Exon 3 >chr1.2096.8_prot length=18 KWEPVLRDCLLRNRRQME
Exon 4 >chr1.2096.9_prot length=14 KLQEDAVEN DDYDK
 GENE ID: 5130597 Q54_gp11 | ERF recombination protein [Lactococcus phage Q54]
(10 or fewer PubMed links)

 Score = 26.5 bits (55),  Expect =    23
 Identities = 7/8 (87%), Positives = 8/8 (100%), Gaps = 0/8 (0%)

Query  4    EDAVENDD  11
            EDAVEN+D
Sbjct  120  EDAVENND  127
>gb|ADJ55329.1|  gp56 dCTPase [Enterobacteria phage RB16]
Length=177

 Score = 25.2 bits (52),  Expect =    56
 Identities = 6/10 (60%), Positives = 10/10 (100%), Gaps = 0/10 (0%)

Query  1   KLQEDAVEND  10
           KLQ+DA+E++
Sbjct  76  KLQDDAIEDE  85

 GENE ID: 3162006 MIMI_R890 | hypothetical protein
[Acanthamoeba polyphaga mimivirus] (10 or fewer PubMed links)

 Score = 24.8 bits (51),  Expect =    75
 Identities = 7/13 (53%), Positives = 10/13 (76%), Gaps = 0/13 (0%)

Query  2    LQEDAVENDDYDK  14
            L++D   +DDYDK
Sbjct  103  LEDDPLTDDDYDK  115
Exon 5 >chr1.2096.16_prot length=50 ETLQQRLEDLEQEKISLHFQLPSRQPALSSFLGHLAAQVQAALRRGATQQ

Exon 6 >chr1.2097.1_prot length=21

DSLHVSITRRDWLLQEKQQLQDSLHVSITRRDWLLQEKQQLQ

 
 GENE ID: 6334528 GPC | glycoprotein precursor [Parana virus]
(10 or fewer PubMed links)

 Score = 26.1 bits (54),  Expect =    30
 Identities = 9/12 (75%), Positives = 9/12 (75%), Gaps = 1/12 (8%)

Query  9    RRDWLLQEKQQL  20
            R DWLL E QQL
Sbjct  413  RNDWLL-ESQQL  423
 GENE ID: 3161875 MIMI_L112 | ankyrin containing protein
[Acanthamoeba polyphaga mimivirus] (10 or fewer PubMed links)

 Score = 26.1 bits (54),  Expect =    30
 Identities = 7/9 (77%), Positives = 8/9 (88%), Gaps = 0/9 (0%)

Query  7    ITRRDWLLQ  15
            ITRRDW L+
Sbjct  308  ITRRDWALE  316
>dbj|BAD00047.1|  Fusion protein, Feo [Hepatitis C virus]
Length=832 Score = 21.4 bits (43), Expect = 772 Identities = 5/5 (100%), Positives = 5/5 (100%), Gaps = 0/5 (0%) Query 10 RDWLL 14 RDWLL Sbjct 657 RDWLL 661
 GENE ID: 6334588 3 | portal protein [Mycobacterium phage BPs]

 Score = 21.4 bits (43),  Expect =   772
 Identities = 8/11 (72%), Positives = 8/11 (72%), Gaps = 1/11 (9%)

Query  1    DSLHVSITRRD  11
            DSL  SIT RD
Sbjct  164  DSL-LSITSRD  173
GENE ID: 8997344 112 | hypothetical protein [Klebsiella phage KP15]

 Score = 21.4 bits (43),  Expect =   772
 Identities = 6/6 (100%), Positives = 6/6 (100%), Gaps = 0/6 (0%)

Query  2  SLHVSI  7
          SLHVSI
Sbjct  2  SLHVSI  7


Exon 7 >chr1.2097.3_prot length=78 KEIEALQARMFVLEAKDQQLRREIEEQEQQLQWQGCDLTPLVGQLSLGQLQEVSKALQDTLASAGQIPFHAEPPETIR

Exon 8 >chr1.2097.6_prot length=18

LQERIKSLNLSLKEITTK

LQERIKSLNLSLKEITTK

>gb|ABD75545.1|  spike glycoprotein [Human coronavirus HKU1]
Length=1351


                                                                                                                    Query start position  Subject start position
 Score = 31.6 bits (67),  Expect = 0.67
 Identities = 12/19 (63%), Positives = 12/19 (63%), Gaps = 3/19 (15%)

Query  2     QERIKSLN---LSLKEITT  17
             QE IKSLN     LKEI T
Sbjct  1269  QESIKSLNSSFINLKEIGT  1287
 GENE ID: 4157810 Qyrzulap12 | gp12 [Mycobacterium phage Qyrzula]
(10 or fewer PubMed links)

 Score = 24.8 bits (51),  Expect =    74
 Identities = 10/13 (76%), Positives = 10/13 (76%), Gaps = 3/13 (23%)

Query  9    NL-SLKEI--TTK  18
            NL SLKEI  TTK
Sbjct  214  NLDSLKEIVRTTK  226
 GENE ID: 3431466 ORF-97 p87/vp80 | P87/VP80
[Chrysodeixis chalcites nucleopolyhedrovirus] (10 or fewer PubMed links)

 Score = 24.8 bits (51),  Expect =    74
 Identities = 7/7 (100%), Positives = 7/7 (100%), Gaps = 0/7 (0%)

Query  4    RIKSLNL  10
            RIKSLNL
Sbjct  425  RIKSLNL  431
Exon 9 >chr1.2097.8_prot length=34 VCMSEKFCSTLRKKVNDIETQLPALLEAKMHAIS
Exon 10 >chr1.2097.10_prot length=62 NHFWTAKDLTEEIRSLTSEREGLEGLLSKLLVLSSRNVKKLGSVKEDYNRLRREVEHQETAY
Exon 11 >chr1.2099.4_prot length=16 CKCPLLGKVWEADLEA

Exon 12 >chr1.2099.13_prot length=5 REQTK

REQTK

>gb|ACI26571.1|  hemagglutinin [Influenza A virus (A/Siena/10/1990(H3N2))]

Length=566 Score = 19.3 bits (38), Expect = 1739 Identities = 5/5 (100%), Positives = 5/5 (100%), Gaps = 0/5 (0%) Query 1 REQTK 5

REQTK

Sbjct 205 REQTK 209

 

web counter
View Stats