CSPI06G21170 (gene) Wild cucumber (PI 183967)

NameCSPI06G21170
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionU4/U6 small nuclear ribonucleoprotein PRP4-like protein
LocationChr6 : 19193653 .. 19198986 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAGAAAAACCTAGGTAAGGCCGTACCCGCTCTTTTACTTCCTCACTCTCGGGGAGACTATCTTTTCTTCACGCCGGTTCCCTCCCTTACGCACCTCCTTGCCACCGCCGCCGTTGGTATAGTCTCTCTTCCTGCTTTGTCCGTCTTCTACCAGTTTTTTACTGCTATACATTTGGCACTCCCGTCACCCCTTGTCCGGTTCAAAGTTCACCTGTCCACGAGGGCAATACTGTATCATCTCCGGCAACTGGAAATGTTTTGACGGTACGCATCCCTTGCTAGATCCGAATCTCCCTCCATTTCTCTTTCTCTTTCTCTTTCTCCGGTAACTTTAGTTATTTCACTCATCTATTTCTCTATTGATTGGCCTTTTCTGGTGCTGTAACAGTGATGGTTCTTTTTTTTCCTGGTTCGTGTTATTACTTCACTGTGAACATGATGGGAGAAATTAGTGTTTATGTAACTACGAGAGTCTCCAAAGTGTGTATTGCAGTGTCGTGTTTAGAGAAGAAAAGGGATTTTGTTTCTATTTTCTTGAATTGTATCTTTCTTCAAATGTCTGTCCGATAATCTGTTACAGGATCGGCGCAGCCTCAGAATGTTACTTACTGTTTGAATTTCTTGAATTGTATCTTTCTTCAAATGGTTTGATTTTTTTTTTCCCCCTTTACTTTTGAAGGTCTCCTGTTAACAGTGAAATTCTCGTTTTACTATTAGTGCTAGACTTTGTTTCTCATTTGTGTTATCTTAACTAATTTTTGGGAGCTTGATTGTAGAGAAGAAATGAAAAGAGGAGCTTAATATCTAGGATAAGCTTGTGTTGATTAGAAGCTTCATTTCCGAGCAAGCTATAATGGAAATTGATGATCAAAACCCTGCATCAACTGCTGCGGAATCTCCTGAAACCCTTCCTGGTGGTGAAAATGAGGAACTTGATATTCCAGCCGAACCAACTCAACCTGCAGCTACATCAGTTATTCCACCTTCAATTGTCCCTGCCATTGCTCCTATCCCCCCTCCAATTATCCGTCCATTGGCTCCTCTTCCGAGCCGACCTCCCCTCTTTAGGCCCCCTGTAACACAAAATGGTGAGCTGAGAACAAGTGACTCAGACTCGGAACATGATGAATTGGCTCCTTCCCGAACAGCTCCAGGTTCAACTGCAGAGTATGAAATCTCAGAAGAGAGCAGGCAGGCTAGGGAGCGTCATGAAAAAGCCATGCAGGAATTTTTGATGAAGCGTCGTGCTTCTGCTCTAGCAGTGCCTACTAATGACATGGCTGTTCGAGCTCGTCTTCGACGTCTTGGTGAGCCCATAACTCTTTTTGGAGAAAGAGAAATGGAAAGACGGGACAGGTTGCGGTCGATAATGGCACGATTGGATGCTGAAGGGCAGTTAGAGAAGCTCATGAAAGTTCACGAGGAAGAGGAGGCTGCAGCTACTGGTGGAACTGAGGAGGCTGAGGAGGAAGTGCTTCAGTATCCCTTTTATACTGAGGGGTCCAAAGCTCTCTTGGATGCAAGAATTGATATTGCAAAGTATTCCATCCTAAGAGCATCTTCACGCCTTGAGCGTGCAAAGAGGAAACGGGATGACCCAGATGAAGATGTAGAAGCGGAAATGGATTGGGCTCTGAGGCAGGCTGAAAGTTTGGTCCTGGACTGTAGTGAGATAGGAGATGATCGACCACTTTCTGGTTGTTCTTTCTCGTCTGATGGAAAATTTCTTGCCACTAGGTAAATCTTCTCACTTTGCAGTTAATGATTTTTAGCTACTGTACCAGTTTCCCCTTTTTAATATCCATCCCCGTTTCACCTCTTGGTAGAAGGGGGAAAAACGAAAAAAGAAAATATAATAGATCTTTCTGGCTATAGATTAGCTATTATCTTCCTTAGCTAAACTTCCTCAGTTGAACACGAAACCAAAGATAATATAGAAATTAATGTTTTTCCATTTTTCATTCCAGTACGTGATAAGCTTTACAGGCTTTGTATTTAGCTTTCATTTGCTACTGGATTTGCATTTGATGGTTTCTGTATTTTTTTTTCTTGTTTCATTTCAATAGTTCATTGAGTGGAGTTGCAAAATTGTGGAGCATGCCTCAAGTAAGGAAGGTTTCCAACTTTAAGGGACACACAGAGCGTGTTACTGATGTAATGTTTTCTCCAGTGAACGAGTGTTTAGCAACTGCCTCTGCTGACCGAACTGCAAGGTTGTGGTCTGCGGAAGGATCTCTACTTAAAACATTTGAGGGCCATCTAGACCGCCTTGCACGAATTGCCTTCCACCCGTCGGGCAAGTACTTGGGCACAACTAGCTTTGACAAGACCTGGAGATTATGGGATGTTGAAACTGGTGTAGAATTACTTCTTCAAGAAGGTCACAGTAGAAGTGTCTACGGGATAGCCTTCCACCATGATGGATCCTTGGTATCATCTTGTGGACTTGATGCGCTTGCTCGCGTTTGGGATCTTCGAACTGGTAGAAGTGTTCTTGCCTTGGAAGGCCACGTCAAGCCAGTAAAGCTCTCTCCTTTCTTTCGATGGGATTGTAGAACATTTGCTTTTTTAATAAATTGGCTTGTTTGCTTCTCACTGATTCCTTGTTTTTCAATGGATGTCATATGTCTATGATGTTATAACCAGGTCCTTGGAGTTAGTTTTTCACCCAACGGTTATCATTTAGCTACTGGTGGAGAAGATAACACTTGTCGAATATGGGATTTAAGGAAGAAAAAATCTCTTTACATAATACCTGCACATTCAAACTTAGTTTCACAGGTGAAATATGAACCTCAGGAGGGATATTTCTTGGTTACTGCATCATTTGATATGACAGCAAAGGTAATCTTGAATACTGCCAGGATGATATAATATTGTTCAATTCTTAATATTTTCATTTGATTGTTGCTTAATCTTACTTGTTGTCTAAGGTTTCACTATCATATGCTCAACTTTTTTTTTCTGCAGATTTGGTCTGCTCGAGATTTTAAGCCTGTGAAGACACTCTCTGGTCACGAAGCAAAAGTTACATCTTTGGATATAATTTCAGGTTAGTTATATAACAACTCCTAGGTCATTACTATTATTGCGTTATTTACACAGATTGCCACCTAAAGATTTGTAATTTTCACCGGTAATTAGCCTTATTTAGGCACAATCTCAGATTGAACTAATTAACAAAAATATCTTTCTGAGGTGGAACCATATTATGTCATTGATGAGCTGGAATTCGGGATCCTATTTTATTAGCTCTAGTGGTGAAAATGGACAGCTTTTTCTTGGTCTAAAATTAAGTGCTCTTTCAAACATTATGGGTTATCGATTTTAATTACTGGTTCACGATCTCTTCTTTGATCACCTTTAGATTCTTGAGATATCCTCCTTATTTCATTTTGAAATATTCCTTTTACAAAAAAAAAAGAGGTTTCGGACCTTCGGTGGCGGGCGATGGCCATGTTAGCTCCAGTGAGAGAGAGCAAACGAGTGAGAGAGAAAGGGGACTAACCAATGAGTGGTACAACCAGTAGGGAAGGGGGAACTAAAACCAACTAGTGATGGAGAGAAGGGGAAGAAGTACCTTTTGGTTTTATTGGGAGAAGTGTTGAGCTCGAGTGAGAGGGAGATAAACAAAATGGTTGTTGATTGAAGTCTCATCTCTGCAATTTATGAAAGAGAGAAAAAAGGAAAGAAAGAATGACTTGAAGTCTTTAACCTTCAAAAAATTGTGTGTCTATATATATAGATAGATAGGTATATGAGATTGATTCAGTTCAGGTTGAATCAAAGTTCTTTTCAAGTATGCCCAATTCTTTTGTTTATTTTTGTGATCTTGTACAATTCACAGCGGAGTAACCAACCCAAGTTCTAATGACTCAACTTTTTTCATCAACATAGATTGGAAGGTTTATATTGAGAGAAGTGAATGAATATAGGAGGGAATTAGAAAAAACGTGCAACAGAAAGAGGGGATCAGACTAACTACGTGGACTCCAGTCCAAATGAACAAGACCTAGGTCATAAGTACAAAAAGAACTACTGATTGACGCCCACAAGGACATGTTAGACCGTTCAAGAGATTTTACTGTAGCTTAGCTCTTTCCTTCGTTGTTTGGTGGCCTCTTCTTTGGCTACCTGTTGGGTGTTCTCACTTTAGAAGGTTAAAAATTCTAAAAAAGGTAAATCTTTTTGTGGTGAATTCTTAAAAGTAGTGCCAGTATGGTTGGACTATACCAGGGGAAGGGGATCTAGAACACCTTTAGGGTGGTGTTCCTATGGCTTAGAACAAATTCTTTTTCCCGGATGATGGATGTTAGTTCAATCCAAGGTTGTTAAATGATCTTTTGGGCACTGGAAAGTATTTTAGTAATCTTTGTAATACGATGTTGGTTTTTTCTCCAATCTTCACCCTTGCTGTAATATTTCAGCCCATTATTGTTTATGTGAAATTTCAAATAAGACCAACTCAACCAGAATATGGTTAAGGATTTAGGAAATTATTAAAGGATACTGGACTGTTTTTGTGACAAGTAAAACGTTGGAAGGTGTTTGAATGTAATTTGCTTGTCTTTAAGGACGCAACCATAAAAGTTATCAACAGTTTTTCTTTGGTTTTAATATGAATGGTTACTGATACATCTTTAACACTAAGCTCTAAGTTTGTGCATTGTATGCTTAAAATACTTTTTGGAGCTTTATTCTGCATATATGCTTCTGATACCCTGATATTGTTCTTGTCAATAACAGATGGACAGTGTATTGCAACCGTCTCACATGATCGGACCATAAAGCTCTGGTCTGTTAATAGTAAAGACATTCAGACTATGGACGTTGATTGACTTTTCATAGTACCAACATGGGAGATGTTTCACGACCTTACTTGAACAGGGTTCTGTTCTATAAATTGTTCGTCTCGTCCTTGAGTACTAAGAGGCATGAGAGATGTTTCGAAACTTTGTGGCCTCAGGTGATAAAGAGCTTGGATCAGATGATCTCTGTAACATCATGTATCTGATTGAAATTTTTTAGGGAAAACCCCTGTACTTTTTTTTTTTTTTACCTAGAAGAGAAATGGAAGGATTATATCAATGGAGGACATAGAGAAAAAGGGAGATTGGGATATTTCCCGCATGTTGAGATTAAAAATGCACCAAAATTAGAGGTAAGAATTGATAAGTAAATTACATGTAGTTTTCTTTAGATTTCCTTTTGGTTAGAAATGGTTTGTGCACCGACTACCATAGTGATTATTCGGTTAAAATACAATTTCAGGTAAGCCTAATTGGACCAATTAGACTCTAACCAAATCTATAAATTTATCAATTGCTTTTCCG

mRNA sequence

ATGGAAATTGATGATCAAAACCCTGCATCAACTGCTGCGGAATCTCCTGAAACCCTTCCTGGTGGTGAAAATGAGGAACTTGATATTCCAGCCGAACCAACTCAACCTGCAGCTACATCAGTTATTCCACCTTCAATTGTCCCTGCCATTGCTCCTATCCCCCCTCCAATTATCCGTCCATTGGCTCCTCTTCCGAGCCGACCTCCCCTCTTTAGGCCCCCTGTAACACAAAATGGTGAGCTGAGAACAAGTGACTCAGACTCGGAACATGATGAATTGGCTCCTTCCCGAACAGCTCCAGGTTCAACTGCAGAGTATGAAATCTCAGAAGAGAGCAGGCAGGCTAGGGAGCGTCATGAAAAAGCCATGCAGGAATTTTTGATGAAGCGTCGTGCTTCTGCTCTAGCAGTGCCTACTAATGACATGGCTGTTCGAGCTCGTCTTCGACGTCTTGGTGAGCCCATAACTCTTTTTGGAGAAAGAGAAATGGAAAGACGGGACAGGTTGCGGTCGATAATGGCACGATTGGATGCTGAAGGGCAGTTAGAGAAGCTCATGAAAGTTCACGAGGAAGAGGAGGCTGCAGCTACTGGTGGAACTGAGGAGGCTGAGGAGGAAGTGCTTCAGTATCCCTTTTATACTGAGGGGTCCAAAGCTCTCTTGGATGCAAGAATTGATATTGCAAAGTATTCCATCCTAAGAGCATCTTCACGCCTTGAGCGTGCAAAGAGGAAACGGGATGACCCAGATGAAGATGTAGAAGCGGAAATGGATTGGGCTCTGAGGCAGGCTGAAAGTTTGGTCCTGGACTGTAGTGAGATAGGAGATGATCGACCACTTTCTGGTTGTTCTTTCTCGTCTGATGGAAAATTTCTTGCCACTAGTTCATTGAGTGGAGTTGCAAAATTGTGGAGCATGCCTCAAGTAAGGAAGGTTTCCAACTTTAAGGGACACACAGAGCGTGTTACTGATGTAATGTTTTCTCCAGTGAACGAGTGTTTAGCAACTGCCTCTGCTGACCGAACTGCAAGGTTGTGGTCTGCGGAAGGATCTCTACTTAAAACATTTGAGGGCCATCTAGACCGCCTTGCACGAATTGCCTTCCACCCGTCGGGCAAGTACTTGGGCACAACTAGCTTTGACAAGACCTGGAGATTATGGGATGTTGAAACTGGTGTAGAATTACTTCTTCAAGAAGGTCACAGTAGAAGTGTCTACGGGATAGCCTTCCACCATGATGGATCCTTGGTATCATCTTGTGGACTTGATGCGCTTGCTCGCGTTTGGGATCTTCGAACTGGTAGAAGTGTTCTTGCCTTGGAAGGCCACGTCAAGCCAGTCCTTGGAGTTAGTTTTTCACCCAACGGTTATCATTTAGCTACTGGTGGAGAAGATAACACTTGTCGAATATGGGATTTAAGGAAGAAAAAATCTCTTTACATAATACCTGCACATTCAAACTTAGTTTCACAGGTGAAATATGAACCTCAGGAGGGATATTTCTTGGTTACTGCATCATTTGATATGACAGCAAAGATTTGGTCTGCTCGAGATTTTAAGCCTGTGAAGACACTCTCTGGTCACGAAGCAAAAGTTACATCTTTGGATATAATTTCAGATGGACAGTGTATTGCAACCGTCTCACATGATCGGACCATAAAGCTCTGGTCTGTTAATAGTAAAGACATTCAGACTATGGACGTTGATTGA

Coding sequence (CDS)

ATGGAAATTGATGATCAAAACCCTGCATCAACTGCTGCGGAATCTCCTGAAACCCTTCCTGGTGGTGAAAATGAGGAACTTGATATTCCAGCCGAACCAACTCAACCTGCAGCTACATCAGTTATTCCACCTTCAATTGTCCCTGCCATTGCTCCTATCCCCCCTCCAATTATCCGTCCATTGGCTCCTCTTCCGAGCCGACCTCCCCTCTTTAGGCCCCCTGTAACACAAAATGGTGAGCTGAGAACAAGTGACTCAGACTCGGAACATGATGAATTGGCTCCTTCCCGAACAGCTCCAGGTTCAACTGCAGAGTATGAAATCTCAGAAGAGAGCAGGCAGGCTAGGGAGCGTCATGAAAAAGCCATGCAGGAATTTTTGATGAAGCGTCGTGCTTCTGCTCTAGCAGTGCCTACTAATGACATGGCTGTTCGAGCTCGTCTTCGACGTCTTGGTGAGCCCATAACTCTTTTTGGAGAAAGAGAAATGGAAAGACGGGACAGGTTGCGGTCGATAATGGCACGATTGGATGCTGAAGGGCAGTTAGAGAAGCTCATGAAAGTTCACGAGGAAGAGGAGGCTGCAGCTACTGGTGGAACTGAGGAGGCTGAGGAGGAAGTGCTTCAGTATCCCTTTTATACTGAGGGGTCCAAAGCTCTCTTGGATGCAAGAATTGATATTGCAAAGTATTCCATCCTAAGAGCATCTTCACGCCTTGAGCGTGCAAAGAGGAAACGGGATGACCCAGATGAAGATGTAGAAGCGGAAATGGATTGGGCTCTGAGGCAGGCTGAAAGTTTGGTCCTGGACTGTAGTGAGATAGGAGATGATCGACCACTTTCTGGTTGTTCTTTCTCGTCTGATGGAAAATTTCTTGCCACTAGTTCATTGAGTGGAGTTGCAAAATTGTGGAGCATGCCTCAAGTAAGGAAGGTTTCCAACTTTAAGGGACACACAGAGCGTGTTACTGATGTAATGTTTTCTCCAGTGAACGAGTGTTTAGCAACTGCCTCTGCTGACCGAACTGCAAGGTTGTGGTCTGCGGAAGGATCTCTACTTAAAACATTTGAGGGCCATCTAGACCGCCTTGCACGAATTGCCTTCCACCCGTCGGGCAAGTACTTGGGCACAACTAGCTTTGACAAGACCTGGAGATTATGGGATGTTGAAACTGGTGTAGAATTACTTCTTCAAGAAGGTCACAGTAGAAGTGTCTACGGGATAGCCTTCCACCATGATGGATCCTTGGTATCATCTTGTGGACTTGATGCGCTTGCTCGCGTTTGGGATCTTCGAACTGGTAGAAGTGTTCTTGCCTTGGAAGGCCACGTCAAGCCAGTCCTTGGAGTTAGTTTTTCACCCAACGGTTATCATTTAGCTACTGGTGGAGAAGATAACACTTGTCGAATATGGGATTTAAGGAAGAAAAAATCTCTTTACATAATACCTGCACATTCAAACTTAGTTTCACAGGTGAAATATGAACCTCAGGAGGGATATTTCTTGGTTACTGCATCATTTGATATGACAGCAAAGATTTGGTCTGCTCGAGATTTTAAGCCTGTGAAGACACTCTCTGGTCACGAAGCAAAAGTTACATCTTTGGATATAATTTCAGATGGACAGTGTATTGCAACCGTCTCACATGATCGGACCATAAAGCTCTGGTCTGTTAATAGTAAAGACATTCAGACTATGGACGTTGATTGA
BLAST of CSPI06G21170 vs. Swiss-Prot
Match: PRP4L_ARATH (U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Arabidopsis thaliana GN=LIS PE=2 SV=1)

HSP 1 Score: 709.9 bits (1831), Expect = 2.2e-203
Identity = 366/575 (63.65%), Postives = 442/575 (76.87%), Query Frame = 1

Query: 1   MEIDDQNPASTAAESPETLPGGENEELDIPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60
           ME +  +  S AA +  + P    +   +P     P    V+PPS  P +APIP   + P
Sbjct: 1   MEPNKDDNVSLAATAQISAPPVLQDASSLPGFSAIPP---VVPPSFPPPMAPIP---MMP 60

Query: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120
             P+ +RPP FRPPV+QNG ++TSDSDSE D+              EISEES+Q RER E
Sbjct: 61  HPPV-ARPPTFRPPVSQNGGVKTSDSDSESDD-----------EHIEISEESKQVRERQE 120

Query: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180
           KA+Q+ L+KRRA+A+AVPTND AVR RLRRLGEPITLFGE+EMERR RL  ++ R D  G
Sbjct: 121 KALQDLLVKRRAAAMAVPTNDKAVRDRLRRLGEPITLFGEQEMERRARLTQLLTRYDING 180

Query: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240
           QL+KL+K HEE+        EE ++EVL+YPF+TEG K L +ARI+IAK+S+ RA+ R++
Sbjct: 181 QLDKLVKDHEEDVTP----KEEVDDEVLEYPFFTEGPKELREARIEIAKFSVKRAAVRIQ 240

Query: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300
           RAKR+RDDPDED++AE  WAL+ A+ + LDCS  GDDRPL+GCSFS DGK LAT SLSGV
Sbjct: 241 RAKRRRDDPDEDMDAETKWALKHAKHMALDCSNFGDDRPLTGCSFSRDGKILATCSLSGV 300

Query: 301 AKLWSMPQV-RKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGH 360
            KLW MPQV   ++  K H ER TDV+FSPV++CLATASADRTA+LW  +G+LL+TFEGH
Sbjct: 301 TKLWEMPQVTNTIAVLKDHKERATDVVFSPVDDCLATASADRTAKLWKTDGTLLQTFEGH 360

Query: 361 LDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSS 420
           LDRLAR+AFHPSGKYLGTTS+DKTWRLWD+ TG ELLLQEGHSRSVYGIAF  DG+L +S
Sbjct: 361 LDRLARVAFHPSGKYLGTTSYDKTWRLWDINTGAELLLQEGHSRSVYGIAFQQDGALAAS 420

Query: 421 CGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSL 480
           CGLD+LARVWDLRTGRS+L  +GH+KPV  V+FSPNGYHLA+GGEDN CRIWDLR +KSL
Sbjct: 421 CGLDSLARVWDLRTGRSILVFQGHIKPVFSVNFSPNGYHLASGGEDNQCRIWDLRMRKSL 480

Query: 481 YIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIIS 540
           YIIPAH+NLVSQVKYEPQEGYFL TAS+DM   IWS RDF  VK+L+GHE+KV SLDI +
Sbjct: 481 YIIPAHANLVSQVKYEPQEGYFLATASYDMKVNIWSGRDFSLVKSLAGHESKVASLDITA 540

Query: 541 DGQCIATVSHDRTIKLWSVNSKD-----IQTMDVD 570
           D  CIATVSHDRTIKLW+ +  D      +TMD+D
Sbjct: 541 DSSCIATVSHDRTIKLWTSSGNDDEDEEKETMDID 553

BLAST of CSPI06G21170 vs. Swiss-Prot
Match: PRP4_PONAB (U4/U6 small nuclear ribonucleoprotein Prp4 OS=Pongo abelii GN=PRPF4 PE=2 SV=1)

HSP 1 Score: 415.2 bits (1066), Expect = 1.1e-114
Identity = 215/459 (46.84%), Postives = 297/459 (64.71%), Query Frame = 1

Query: 107 EISEESRQARERHEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERR 166
           E+ E      ER  + + EF  ++RA  + V T+D  V+A LR LGEPITLFGE   ERR
Sbjct: 70  EVFEIEEHISERQAEVLAEFERRKRARQINVSTDDSEVKACLRALGEPITLFGEGPAERR 129

Query: 167 DRLRSIMARLDAEGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARID 226
           +RLR+I++ +  +     L K  +++E +      +  +E  Q  +Y EG  +L  AR+ 
Sbjct: 130 ERLRNILSVVGTDA----LKKTKKDDEKS------KKSKEEYQQTWYHEGPNSLKVARLW 189

Query: 227 IAKYSILRASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFS 286
           IA YS+ RA  RLE A+  ++ P+    ++M    +   SL   CS+IGDDRP+S C FS
Sbjct: 190 IANYSLPRAMKRLEEARLHKEIPETTRASQMQELHKSLRSLNNFCSQIGDDRPISYCHFS 249

Query: 287 SDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNEC--------LATAS 346
            + K LAT+  SG+ KLWS+P    +   +GH   V  ++F P +          LA+ +
Sbjct: 250 PNSKMLATACWSGLCKLWSVPDCNLLHTLRGHNTNVGAIVFHPKSTVSLDQKDVNLASCA 309

Query: 347 ADRTARLWSAEGSL-LKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLL 406
           AD + +LWS +    +   EGH  R+AR+ +HPSG++LGTT +D++WRLWD+E   E+L 
Sbjct: 310 ADGSVKLWSLDSDEPVADIEGHTVRVARVMWHPSGRFLGTTCYDRSWRLWDLEAQEEILH 369

Query: 407 QEGHSRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGY 466
           QEGHS  VY IAFH DGSL  + GLDA  RVWDLRTGR ++ LEGH+K + G++FSPNGY
Sbjct: 370 QEGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWDLRTGRCIMFLEGHLKEIYGINFSPNGY 429

Query: 467 HLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSAR 526
           H+ATG  DNTC++WDLR+++ +Y IPAH NLV+ VK+EP  G FL+T ++D TAKIW+  
Sbjct: 430 HIATGSGDNTCKVWDLRQRRCVYTIPAHQNLVTGVKFEPIHGNFLLTGAYDNTAKIWTHP 489

Query: 527 DFKPVKTLSGHEAKVTSLDIISDGQCIATVSHDRTIKLW 557
            + P+KTL+GHE KV  LDI SDGQ IAT S+DRT KLW
Sbjct: 490 GWSPLKTLAGHEGKVMGLDISSDGQLIATCSYDRTFKLW 518

BLAST of CSPI06G21170 vs. Swiss-Prot
Match: PRP4_HUMAN (U4/U6 small nuclear ribonucleoprotein Prp4 OS=Homo sapiens GN=PRPF4 PE=1 SV=2)

HSP 1 Score: 415.2 bits (1066), Expect = 1.1e-114
Identity = 215/459 (46.84%), Postives = 297/459 (64.71%), Query Frame = 1

Query: 107 EISEESRQARERHEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERR 166
           E+ E      ER  + + EF  ++RA  + V T+D  V+A LR LGEPITLFGE   ERR
Sbjct: 71  EVFEIEEHISERQAEVLAEFERRKRARQINVSTDDSEVKACLRALGEPITLFGEGPAERR 130

Query: 167 DRLRSIMARLDAEGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARID 226
           +RLR+I++ +  +     L K  +++E +      +  +E  Q  +Y EG  +L  AR+ 
Sbjct: 131 ERLRNILSVVGTDA----LKKTKKDDEKS------KKSKEEYQQTWYHEGPNSLKVARLW 190

Query: 227 IAKYSILRASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFS 286
           IA YS+ RA  RLE A+  ++ P+    ++M    +   SL   CS+IGDDRP+S C FS
Sbjct: 191 IANYSLPRAMKRLEEARLHKEIPETTRTSQMQELHKSLRSLNNFCSQIGDDRPISYCHFS 250

Query: 287 SDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNEC--------LATAS 346
            + K LAT+  SG+ KLWS+P    +   +GH   V  ++F P +          LA+ +
Sbjct: 251 PNSKMLATACWSGLCKLWSVPDCNLLHTLRGHNTNVGAIVFHPKSTVSLDPKDVNLASCA 310

Query: 347 ADRTARLWSAEGSL-LKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLL 406
           AD + +LWS +    +   EGH  R+AR+ +HPSG++LGTT +D++WRLWD+E   E+L 
Sbjct: 311 ADGSVKLWSLDSDEPVADIEGHTVRVARVMWHPSGRFLGTTCYDRSWRLWDLEAQEEILH 370

Query: 407 QEGHSRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGY 466
           QEGHS  VY IAFH DGSL  + GLDA  RVWDLRTGR ++ LEGH+K + G++FSPNGY
Sbjct: 371 QEGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWDLRTGRCIMFLEGHLKEIYGINFSPNGY 430

Query: 467 HLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSAR 526
           H+ATG  DNTC++WDLR+++ +Y IPAH NLV+ VK+EP  G FL+T ++D TAKIW+  
Sbjct: 431 HIATGSGDNTCKVWDLRQRRCVYTIPAHQNLVTGVKFEPIHGNFLLTGAYDNTAKIWTHP 490

Query: 527 DFKPVKTLSGHEAKVTSLDIISDGQCIATVSHDRTIKLW 557
            + P+KTL+GHE KV  LDI SDGQ IAT S+DRT KLW
Sbjct: 491 GWSPLKTLAGHEGKVMGLDISSDGQLIATCSYDRTFKLW 519

BLAST of CSPI06G21170 vs. Swiss-Prot
Match: PRP4_BOVIN (U4/U6 small nuclear ribonucleoprotein Prp4 OS=Bos taurus GN=PRPF4 PE=2 SV=1)

HSP 1 Score: 415.2 bits (1066), Expect = 1.1e-114
Identity = 215/459 (46.84%), Postives = 297/459 (64.71%), Query Frame = 1

Query: 107 EISEESRQARERHEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERR 166
           E+ E      ER  + + EF  ++RA  + V T+D  V+A LR LGEPITLFGE   ERR
Sbjct: 70  EVFEIEEHISERQAEVLAEFERRKRARQINVSTDDSEVKACLRALGEPITLFGEGPAERR 129

Query: 167 DRLRSIMARLDAEGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARID 226
           +RLR+I++ +  +     L K  +++E +      +  +E  Q  +Y EG  +L  AR+ 
Sbjct: 130 ERLRNILSVVGTDA----LKKTKKDDEKS------KKSKEEYQQTWYHEGPHSLKVARLW 189

Query: 227 IAKYSILRASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFS 286
           IA YS+ RA  RLE A+  ++ P+    ++M    +   SL   CS+IGDDRP+S C FS
Sbjct: 190 IANYSLPRAMKRLEEARLHKEIPETTRTSQMQELHKSLRSLNNFCSQIGDDRPISYCHFS 249

Query: 287 SDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNEC--------LATAS 346
            + K LAT+  SG+ KLWS+P    +   +GH   V  ++F P +          LA+ +
Sbjct: 250 PNSKMLATACWSGLCKLWSVPDCNLLHTLRGHNTNVGAIVFHPKSTVSLDQKDVNLASCA 309

Query: 347 ADRTARLWSAEGSL-LKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLL 406
           AD + +LWS +    +   EGH  R+AR+ +HPSG++LGTT +D++WRLWD+E   E+L 
Sbjct: 310 ADGSVKLWSLDSDEPVADIEGHTVRVARVTWHPSGRFLGTTCYDRSWRLWDLEAQEEILH 369

Query: 407 QEGHSRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGY 466
           QEGHS  VY IAFH DGSL  + GLDA  RVWDLRTGR ++ LEGH+K + G++FSPNGY
Sbjct: 370 QEGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWDLRTGRCIMFLEGHLKEIYGINFSPNGY 429

Query: 467 HLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSAR 526
           H+ATG  DNTC++WDLR+++ +Y IPAH NLV+ VK+EP  G FL+T ++D TAKIW+  
Sbjct: 430 HIATGSGDNTCKVWDLRQRRCVYTIPAHQNLVTGVKFEPIHGNFLLTGAYDNTAKIWTHP 489

Query: 527 DFKPVKTLSGHEAKVTSLDIISDGQCIATVSHDRTIKLW 557
            + P+KTL+GHE KV  LDI SDGQ IAT S+DRT KLW
Sbjct: 490 GWSPLKTLAGHEGKVMGLDISSDGQLIATCSYDRTFKLW 518

BLAST of CSPI06G21170 vs. Swiss-Prot
Match: PRP4_MOUSE (U4/U6 small nuclear ribonucleoprotein Prp4 OS=Mus musculus GN=Prpf4 PE=1 SV=1)

HSP 1 Score: 414.5 bits (1064), Expect = 1.9e-114
Identity = 215/459 (46.84%), Postives = 297/459 (64.71%), Query Frame = 1

Query: 107 EISEESRQARERHEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERR 166
           E+ E      ER  + + EF  ++RA  + V T+D  V+A LR LGEPITLFGE   ERR
Sbjct: 70  EVFEIEEHISERQAEVLAEFERRKRARQINVSTDDSEVKACLRALGEPITLFGEGPAERR 129

Query: 167 DRLRSIMARLDAEGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARID 226
           +RLR+I++ +  +     L K  +++E +      +  +E  Q  +Y EG  +L  AR+ 
Sbjct: 130 ERLRNILSVVGTDA----LKKTKKDDEKS------KKSKEEYQQTWYHEGPNSLKVARLW 189

Query: 227 IAKYSILRASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFS 286
           IA YS+ RA  RLE A+  ++ P+    ++M    +   SL   CS+IGDDRP+S C FS
Sbjct: 190 IANYSLPRAMKRLEEARLHKEIPETTRTSQMQELHKSLRSLNNFCSQIGDDRPISYCHFS 249

Query: 287 SDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNEC--------LATAS 346
            + K LAT+  SG+ KLWS+P    +   +GH   V  ++F P +          LA+ +
Sbjct: 250 PNSKMLATACWSGLCKLWSVPDCSLLHTLRGHNTNVGAIVFHPKSTVSLDQKDVNLASCA 309

Query: 347 ADRTARLWSAEGSL-LKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLL 406
           AD + +LWS +    +   EGH  R+AR+ +HPSG++LGTT +D++WRLWD+E   E+L 
Sbjct: 310 ADGSVKLWSLDSDEPVADIEGHTVRVARVMWHPSGRFLGTTCYDRSWRLWDLEAQEEILH 369

Query: 407 QEGHSRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGY 466
           QEGHS  VY IAFH DGSL  + GLDA  RVWDLRTGR ++ LEGH+K + G++FSPNGY
Sbjct: 370 QEGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWDLRTGRCIMFLEGHLKEIYGINFSPNGY 429

Query: 467 HLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSAR 526
           H+ATG  DNTC++WDLR+++ +Y IPAH NLV+ VK+EP  G FL+T ++D TAKIW+  
Sbjct: 430 HIATGSGDNTCKVWDLRQRRCVYTIPAHQNLVTGVKFEPIHGDFLLTGAYDNTAKIWTHP 489

Query: 527 DFKPVKTLSGHEAKVTSLDIISDGQCIATVSHDRTIKLW 557
            + P+KTL+GHE KV  LDI SDGQ IAT S+DRT KLW
Sbjct: 490 GWSPLKTLAGHEGKVMGLDISSDGQLIATCSYDRTFKLW 518

BLAST of CSPI06G21170 vs. TrEMBL
Match: A0A0A0KHD8_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G404170 PE=4 SV=1)

HSP 1 Score: 1108.6 bits (2866), Expect = 0.0e+00
Identity = 568/569 (99.82%), Postives = 568/569 (99.82%), Query Frame = 1

Query: 1   MEIDDQNPASTAAESPETLPGGENEELDIPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60
           MEIDDQNPASTAAESPETLPGGENEELD PAEPTQPAATSVIPPSIVPAIAPIPPPIIRP
Sbjct: 1   MEIDDQNPASTAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60

Query: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120
           LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE
Sbjct: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120

Query: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180
           KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG
Sbjct: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180

Query: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240
           QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE
Sbjct: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240

Query: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300
           RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV
Sbjct: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300

Query: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 360
           AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL
Sbjct: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 360

Query: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 420
           DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC
Sbjct: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 420

Query: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480
           GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY
Sbjct: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480

Query: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540
           IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD
Sbjct: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540

Query: 541 GQCIATVSHDRTIKLWSVNSKDIQTMDVD 570
           GQCIATVSHDRTIKLWSVNSKDIQTMDVD
Sbjct: 541 GQCIATVSHDRTIKLWSVNSKDIQTMDVD 569

BLAST of CSPI06G21170 vs. TrEMBL
Match: F6HLM8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0007g05930 PE=4 SV=1)

HSP 1 Score: 863.2 bits (2229), Expect = 1.7e-247
Identity = 442/577 (76.60%), Postives = 486/577 (84.23%), Query Frame = 1

Query: 1   MEIDDQNPASTAAESPETLPGGENEELDI----PAEPTQPAATSVIPPSIVPAIAPIPP- 60
           M++D++NP S +   P      +   +      P    QP   S++PP IVP IAPIP  
Sbjct: 83  MDVDEENPVSVSLAEPSAAVTDDQTPVSTIDPTPLPAMQPIIPSLVPPPIVPPIAPIPSV 142

Query: 61  --PIIRPLAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESR 120
             PI+RPLAPLP RPP+ RPP+ QNGE+R SDSDS+ D+   ++ A GS  EYEISEESR
Sbjct: 143 SAPILRPLAPLPVRPPVLRPPLPQNGEMRASDSDSDRDDSGRAQAASGSAVEYEISEESR 202

Query: 121 QARERHEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIM 180
           Q RER EKA QEFLMKRRASALAVPTNDMAVR RLRRLGEPITLFGEREMERRDRLR IM
Sbjct: 203 QFRERQEKAKQEFLMKRRASALAVPTNDMAVRTRLRRLGEPITLFGEREMERRDRLRMIM 262

Query: 181 ARLDAEGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSIL 240
           A+LDAEGQLEKLMK HEEEEAAA    EE EEE LQYPFYTEGSK+LL+AR++IAKYSI 
Sbjct: 263 AKLDAEGQLEKLMKAHEEEEAAAPVAMEEVEEETLQYPFYTEGSKSLLEARVEIAKYSIK 322

Query: 241 RASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLA 300
           RA+SRL RA+RKRDDPDED++AEMDW L++A SLVLDCSEIGDDRPLSGCSFS DGK LA
Sbjct: 323 RAASRLYRARRKRDDPDEDLDAEMDWVLKEAGSLVLDCSEIGDDRPLSGCSFSHDGKLLA 382

Query: 301 TSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLL 360
             +LSGVAK+WSMPQV KVS  KGHTER TDV FSP    LATASADRTARLW++EGSLL
Sbjct: 383 ACALSGVAKIWSMPQVNKVSALKGHTERATDVAFSPALNHLATASADRTARLWNSEGSLL 442

Query: 361 KTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHD 420
           KTFEGHLDRLARIAFHPSGKYLGT SFDKTWRLWDVETG ELLLQEGHSRSVYGI+FH D
Sbjct: 443 KTFEGHLDRLARIAFHPSGKYLGTASFDKTWRLWDVETGEELLLQEGHSRSVYGISFHRD 502

Query: 421 GSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDL 480
           GSL +SCGLDAL RVWDLR+GRS+LALEGHVKPVLG+ FSPNGYHLATG EDNTCRIWDL
Sbjct: 503 GSLAASCGLDALGRVWDLRSGRSILALEGHVKPVLGICFSPNGYHLATGAEDNTCRIWDL 562

Query: 481 RKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVT 540
           RKKKSLY+IPAHSNLVSQVK+EPQEGYFLVTAS+DMTAK+WSARDFKPVKTLSGHEAKVT
Sbjct: 563 RKKKSLYVIPAHSNLVSQVKFEPQEGYFLVTASYDMTAKVWSARDFKPVKTLSGHEAKVT 622

Query: 541 SLDIISDGQCIATVSHDRTIKLW-SVNSKDIQTMDVD 570
           SLDI  DG CIATVSHDRTIKLW S   +  + MD+D
Sbjct: 623 SLDITEDGHCIATVSHDRTIKLWSSAEIEKEKAMDID 659

BLAST of CSPI06G21170 vs. TrEMBL
Match: A5ATG0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_016440 PE=4 SV=1)

HSP 1 Score: 859.8 bits (2220), Expect = 1.9e-246
Identity = 444/570 (77.89%), Postives = 483/570 (84.74%), Query Frame = 1

Query: 4    DDQNPASTAAESPETLPGGENEELDIPAEPTQPAATSVIPPSIVPAIAPIPP---PIIRP 63
            DDQ P ST   +P            +PA   QP   S++PP IVP IAPIP    PI+RP
Sbjct: 533  DDQTPVSTIDPTP------------LPA--MQPIIPSLVPPPIVPPIAPIPSVSAPILRP 592

Query: 64   LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 123
            LAPLP RPP+ RPP+ QNGE+R SDSDS+ D+   ++ A GS  EYEISEESRQ RER E
Sbjct: 593  LAPLPVRPPVLRPPLPQNGEMRASDSDSDRDDSGRAQAASGSAVEYEISEESRQFRERQE 652

Query: 124  KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 183
            KA QEFLMKRRASALAVPTNDMAVR RLRRLGEPITLFGEREMERRDRLR IMA+LDAEG
Sbjct: 653  KAKQEFLMKRRASALAVPTNDMAVRTRLRRLGEPITLFGEREMERRDRLRMIMAKLDAEG 712

Query: 184  QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 243
            QLEKLMK HEEEEAAA    EE EEE LQYPFYTEGSK+LL+AR++IAKYSI RA+SRL 
Sbjct: 713  QLEKLMKAHEEEEAAAPVAMEEVEEETLQYPFYTEGSKSLLEARVEIAKYSIKRAASRLY 772

Query: 244  RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 303
            RA+RKRDDPDED++AEMDW L++A SLVLDCSEIGDDRPLSGCSFS DGK LA  +LSGV
Sbjct: 773  RARRKRDDPDEDLDAEMDWVLKEAGSLVLDCSEIGDDRPLSGCSFSHDGKLLAACALSGV 832

Query: 304  AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 363
            AK+WSMPQV KVS  KGHTER TDV FSP    LATASADRTARLW++EGSLLKTFEGHL
Sbjct: 833  AKIWSMPQVNKVSALKGHTERATDVAFSPALNHLATASADRTARLWNSEGSLLKTFEGHL 892

Query: 364  DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 423
            DRLARIAFHPSGKYLGT SFDKTWRLWDVETG ELLLQEGHSRSVYGI+FH DGSL +SC
Sbjct: 893  DRLARIAFHPSGKYLGTASFDKTWRLWDVETGEELLLQEGHSRSVYGISFHRDGSLAASC 952

Query: 424  GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 483
            GLDAL RVWDLR+GRS+LALEGHVKPVLG+ FSPNGYHLATG EDNTCRIWDLRKKKSLY
Sbjct: 953  GLDALGRVWDLRSGRSILALEGHVKPVLGICFSPNGYHLATGAEDNTCRIWDLRKKKSLY 1012

Query: 484  IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 543
            +IPAHSNLVSQVK+EPQEGYFLVTAS+DMTAK+WSARDFKPVKTLSGHEAKVTSLDI  D
Sbjct: 1013 VIPAHSNLVSQVKFEPQEGYFLVTASYDMTAKVWSARDFKPVKTLSGHEAKVTSLDITED 1072

Query: 544  GQCIATVSHDRTIKLW-SVNSKDIQTMDVD 570
            G CIATVSHDRTIKLW S   +  + MD+D
Sbjct: 1073 GHCIATVSHDRTIKLWSSAEIEKEKAMDID 1088

BLAST of CSPI06G21170 vs. TrEMBL
Match: W9RUJ5_9ROSA (U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Morus notabilis GN=L484_015499 PE=4 SV=1)

HSP 1 Score: 856.3 bits (2211), Expect = 2.1e-245
Identity = 452/615 (73.50%), Postives = 506/615 (82.28%), Query Frame = 1

Query: 1   MEIDDQNP-ASTAAESPETLPGGENEELDIPA------EPTQPAATSVIPPS-------- 60
           ME+D++NP +ST  E+   +  G+     +PA      +P  P    VIPP         
Sbjct: 1   MEVDEENPPSSTPVEASSLVDDGQTV---VPAVNSTAIQPIPPIIPPVIPPPVVPPVAPP 60

Query: 61  IVPAIAPIP----PPIIRPLAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPG 120
           IVPA+APIP    PP++RPLAPLP RPP+ RPPV QNGE+RTSDSDS+ D+  P RTAPG
Sbjct: 61  IVPAMAPIPTLPTPPVLRPLAPLPIRPPIPRPPVPQNGEMRTSDSDSD-DDSDPGRTAPG 120

Query: 121 STAEYEISEESRQARERHEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGER 180
           ST EYEISEESRQ RER EKAM E +MKRRASALAVPTNDMAVRARLRRLGEPITLFGER
Sbjct: 121 STQEYEISEESRQVRERQEKAMLELMMKRRASALAVPTNDMAVRARLRRLGEPITLFGER 180

Query: 181 EMERRDRLRSIMARLDAEGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALL 240
           EMERRDRLR +MA+LDAEGQLEKLMK HEEEEAAA+   EEAEEE+LQYPFYTEGSKALL
Sbjct: 181 EMERRDRLRMLMAKLDAEGQLEKLMKAHEEEEAAASATGEEAEEEMLQYPFYTEGSKALL 240

Query: 241 DARIDIAKYSILRASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLS 300
           DARIDIAKYSI+RA++RL+RA+RKRDDPDEDV+AEMDWAL+QA SL LDCSEIGDDRPLS
Sbjct: 241 DARIDIAKYSIVRAATRLQRAQRKRDDPDEDVDAEMDWALKQAGSLALDCSEIGDDRPLS 300

Query: 301 GCSFSSDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADR 360
           GCS S DGKFLAT SL+GVAKLWSMP+V+KVS  KGHTER+TDV FSP +  +AT SADR
Sbjct: 301 GCSLSRDGKFLATCSLTGVAKLWSMPKVQKVSTLKGHTERLTDVKFSPTDNLIATGSADR 360

Query: 361 TARLWSAEGSLLKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGH 420
           TARLW+ EG  LKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWD++TGVELLLQEG 
Sbjct: 361 TARLWNTEGFHLKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDIDTGVELLLQEGQ 420

Query: 421 SRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLAT 480
           SRSVYGI FH DGSL +SCGLD+L RVWDLRTGRS+LALEGHVKPVLG+SFS NGY+LA+
Sbjct: 421 SRSVYGIDFHQDGSLAASCGLDSLVRVWDLRTGRSILALEGHVKPVLGLSFSANGYYLAS 480

Query: 481 GGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKP 540
           GGEDNTCRIWDLRKKKS+Y+IPAHSNL+SQVK+EPQEGYFL+TAS+DMTAKIWSARDFKP
Sbjct: 481 GGEDNTCRIWDLRKKKSVYMIPAHSNLISQVKFEPQEGYFLITASYDMTAKIWSARDFKP 540

Query: 541 VKTLSGHEAKVTSLD--------------------------IISDGQCIATVSHDRTIKL 570
           VKTLSGHEAKVTSLD                          +++DG C+ATVSHDRTIKL
Sbjct: 541 VKTLSGHEAKVTSLDVAGGRSHSPEKKFKQLCSRFNKSVLLVMADGNCVATVSHDRTIKL 600

BLAST of CSPI06G21170 vs. TrEMBL
Match: B9H9H5_POPTR (Transducin family protein OS=Populus trichocarpa GN=POPTR_0006s04430g PE=4 SV=2)

HSP 1 Score: 847.0 bits (2187), Expect = 1.3e-242
Identity = 433/577 (75.04%), Postives = 494/577 (85.62%), Query Frame = 1

Query: 1   MEIDDQNPA--STAAESPETLPGGENEELDI----PAEPTQPAATSVIPPSI--VPAIAP 60
           M  +++NPA  +   ES   +P   ++EL I    P +PT P    VIP SI  +P+IAP
Sbjct: 1   MAAEEENPALSNLPEESSAVVP---DDELMIDNSSPIQPTPPIIPPVIPSSIPVLPSIAP 60

Query: 61  IPPPIIRPLAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEES 120
           IP    RPLAPLP RPP  RPP  QNGE+RTSDSDS+ +EL+P+ T PGST  YEISE S
Sbjct: 61  IPIVPPRPLAPLPIRPPATRPPGVQNGEMRTSDSDSDQEELSPTGTTPGSTGGYEISEAS 120

Query: 121 RQARERHEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSI 180
           R  RER +KAMQEF+MK+RA+ALAVPTNDMAVR RLRRLGEPITLFGEREMERRDRLR +
Sbjct: 121 RLVRERQQKAMQEFMMKKRAAALAVPTNDMAVRTRLRRLGEPITLFGEREMERRDRLRML 180

Query: 181 MARLDAEGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSI 240
           MA+LD+EGQLEKLMKVHEEEEAA+T   E+AEEE +QYPFYTEGSK LLDARIDIAKYSI
Sbjct: 181 MAKLDSEGQLEKLMKVHEEEEAASTAAAEDAEEEFVQYPFYTEGSKELLDARIDIAKYSI 240

Query: 241 LRASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFL 300
            +A+ RL+RA+RKRDDPDED +AE+DW+L QAESL L+CSE+GDDRPLSGCSFS DG+ L
Sbjct: 241 SKAALRLQRARRKRDDPDEDEDAEIDWSLNQAESLSLNCSELGDDRPLSGCSFSCDGEML 300

Query: 301 ATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSL 360
           AT SLSGVAK+WS+PQV KVSN KGH ER TDV FSPV+  LATASADRTARLW+ +GSL
Sbjct: 301 ATCSLSGVAKIWSVPQVTKVSNLKGHMERATDVAFSPVHNHLATASADRTARLWNTDGSL 360

Query: 361 LKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHH 420
           L  FEGHLDRLAR+AFHPSGKYLGTTSFDKTWRLWD+++GVELLLQEGHSRS+YGIAFHH
Sbjct: 361 LMKFEGHLDRLARVAFHPSGKYLGTTSFDKTWRLWDIDSGVELLLQEGHSRSIYGIAFHH 420

Query: 421 DGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWD 480
           DGSL +SCGLDALARVWDLRTGRS++A EGHVKP+LG+SFSPNGYHLATGGEDNTCRIWD
Sbjct: 421 DGSLAASCGLDALARVWDLRTGRSIMAFEGHVKPLLGISFSPNGYHLATGGEDNTCRIWD 480

Query: 481 LRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKV 540
           LRKKKSLY+IPAHSNLVSQVK+EPQEGY+LVT+S+DMTAK+WS RDFK VKTLS HEAKV
Sbjct: 481 LRKKKSLYVIPAHSNLVSQVKFEPQEGYYLVTSSYDMTAKVWSGRDFKHVKTLSAHEAKV 540

Query: 541 TSLDIISDGQCIATVSHDRTIKLWSVNSKDIQTMDVD 570
           TSLDI +DG+ IATVSHDRTIKLWS  S +   M+V+
Sbjct: 541 TSLDISADGRLIATVSHDRTIKLWSSRSNEKDAMEVE 574

BLAST of CSPI06G21170 vs. TAIR10
Match: AT2G41500.1 (AT2G41500.1 WD-40 repeat family protein / small nuclear ribonucleoprotein Prp4p-related)

HSP 1 Score: 709.9 bits (1831), Expect = 1.3e-204
Identity = 366/575 (63.65%), Postives = 442/575 (76.87%), Query Frame = 1

Query: 1   MEIDDQNPASTAAESPETLPGGENEELDIPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60
           ME +  +  S AA +  + P    +   +P     P    V+PPS  P +APIP   + P
Sbjct: 1   MEPNKDDNVSLAATAQISAPPVLQDASSLPGFSAIPP---VVPPSFPPPMAPIP---MMP 60

Query: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120
             P+ +RPP FRPPV+QNG ++TSDSDSE D+              EISEES+Q RER E
Sbjct: 61  HPPV-ARPPTFRPPVSQNGGVKTSDSDSESDD-----------EHIEISEESKQVRERQE 120

Query: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180
           KA+Q+ L+KRRA+A+AVPTND AVR RLRRLGEPITLFGE+EMERR RL  ++ R D  G
Sbjct: 121 KALQDLLVKRRAAAMAVPTNDKAVRDRLRRLGEPITLFGEQEMERRARLTQLLTRYDING 180

Query: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240
           QL+KL+K HEE+        EE ++EVL+YPF+TEG K L +ARI+IAK+S+ RA+ R++
Sbjct: 181 QLDKLVKDHEEDVTP----KEEVDDEVLEYPFFTEGPKELREARIEIAKFSVKRAAVRIQ 240

Query: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300
           RAKR+RDDPDED++AE  WAL+ A+ + LDCS  GDDRPL+GCSFS DGK LAT SLSGV
Sbjct: 241 RAKRRRDDPDEDMDAETKWALKHAKHMALDCSNFGDDRPLTGCSFSRDGKILATCSLSGV 300

Query: 301 AKLWSMPQV-RKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGH 360
            KLW MPQV   ++  K H ER TDV+FSPV++CLATASADRTA+LW  +G+LL+TFEGH
Sbjct: 301 TKLWEMPQVTNTIAVLKDHKERATDVVFSPVDDCLATASADRTAKLWKTDGTLLQTFEGH 360

Query: 361 LDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSS 420
           LDRLAR+AFHPSGKYLGTTS+DKTWRLWD+ TG ELLLQEGHSRSVYGIAF  DG+L +S
Sbjct: 361 LDRLARVAFHPSGKYLGTTSYDKTWRLWDINTGAELLLQEGHSRSVYGIAFQQDGALAAS 420

Query: 421 CGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSL 480
           CGLD+LARVWDLRTGRS+L  +GH+KPV  V+FSPNGYHLA+GGEDN CRIWDLR +KSL
Sbjct: 421 CGLDSLARVWDLRTGRSILVFQGHIKPVFSVNFSPNGYHLASGGEDNQCRIWDLRMRKSL 480

Query: 481 YIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIIS 540
           YIIPAH+NLVSQVKYEPQEGYFL TAS+DM   IWS RDF  VK+L+GHE+KV SLDI +
Sbjct: 481 YIIPAHANLVSQVKYEPQEGYFLATASYDMKVNIWSGRDFSLVKSLAGHESKVASLDITA 540

Query: 541 DGQCIATVSHDRTIKLWSVNSKD-----IQTMDVD 570
           D  CIATVSHDRTIKLW+ +  D      +TMD+D
Sbjct: 541 DSSCIATVSHDRTIKLWTSSGNDDEDEEKETMDID 553

BLAST of CSPI06G21170 vs. TAIR10
Match: AT2G05720.1 (AT2G05720.1 Transducin/WD40 repeat-like superfamily protein)

HSP 1 Score: 207.6 bits (527), Expect = 2.0e-53
Identity = 115/280 (41.07%), Postives = 159/280 (56.79%), Query Frame = 1

Query: 216 GSKALLDARIDIAKYSILRASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIG 275
           G   L +ARI+I K  I RA+ R++R  R+R+DPDED  AE   AL+  + +VL  S+ G
Sbjct: 2   GPTELREARIEITKDFIKRAALRIQRENRRRNDPDEDKNAETKLALKHCKDMVLGSSKFG 61

Query: 276 DDRPLSGCSFSSDGKFLATSSLSGVAKLWSMPQV-RKVSNFKGHTERVTDVMFSPV-NEC 335
           DDRPL+GCS S DGK L T SLSGV KLW +PQV  K+   KGH E VTDV+FS V +EC
Sbjct: 62  DDRPLTGCSLSRDGKILVTCSLSGVPKLWEVPQVTNKIVVLKGHKEHVTDVVFSSVDDEC 121

Query: 336 LATASADRTARLWSAEGSLLKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGV 395
           LATAS DRT ++W  +G+LL+TF+                   ++ FD   R+WD+ T  
Sbjct: 122 LATASTDRTEKIWKTDGTLLQTFK------------------ASSGFDSLARVWDLRTAR 181

Query: 396 ELLLQEGHSRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFS 455
            +L+ +GH + V  + F  +G  ++S G D   R+WDLR  + +  +  HV  V  V + 
Sbjct: 182 NILIFQGHIKQVLSVDFSPNGYHLASGGEDNQCRIWDLRMRKLLYIIPAHVNLVSQVKYE 241

Query: 456 P-NGYHLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQV 493
           P   Y LAT   D    IW  R    +  +  H + V+ +
Sbjct: 242 PQERYFLATASHDMNVNIWSGRDFSLVKSLVGHESKVASL 263

BLAST of CSPI06G21170 vs. TAIR10
Match: AT3G49660.1 (AT3G49660.1 Transducin/WD40 repeat-like superfamily protein)

HSP 1 Score: 171.0 bits (432), Expect = 2.1e-42
Identity = 98/290 (33.79%), Postives = 146/290 (50.34%), Query Frame = 1

Query: 277 DRPLSGCSFSSDGKFLATSSLSGVAKLWSM-----PQVRKVSNFKGHTERVTDVMFSPVN 336
           +R +S   FSSDG+ LA++S     + +++     P    V  F GH   ++DV FS   
Sbjct: 24  NRAVSSVKFSSDGRLLASASADKTIRTYTINTINDPIAEPVQEFTGHENGISDVAFSSDA 83

Query: 337 ECLATASADRTARLWSAE-GSLLKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVE 396
             + +AS D+T +LW  E GSL+KT  GH +    + F+P    + + SFD+T R+WDV 
Sbjct: 84  RFIVSASDDKTLKLWDVETGSLIKTLIGHTNYAFCVNFNPQSNMIVSGSFDETVRIWDVT 143

Query: 397 TGVELLLQEGHSRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLAL-EGHVKPVLG 456
           TG  L +   HS  V  + F+ DGSL+ S   D L R+WD  TG  V  L +    PV  
Sbjct: 144 TGKCLKVLPAHSDPVTAVDFNRDGSLIVSSSYDGLCRIWDSGTGHCVKTLIDDENPPVSF 203

Query: 457 VSFSPNGYHLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVK--YEPQEGYFLVTASF 516
           V FSPNG  +  G  DNT R+W++   K L     H N    +   +    G  +V+ S 
Sbjct: 204 VRFSPNGKFILVGTLDNTLRLWNISSAKFLKTYTGHVNAQYCISSAFSVTNGKRIVSGSE 263

Query: 517 DMTAKIWSARDFKPVKTLSGHEAKVTSLDIISDGQCIATVSHDRTIKLWS 558
           D    +W     K ++ L GH   V ++        IA+ S D+T+++W+
Sbjct: 264 DNCVHMWELNSKKLLQKLEGHTETVMNVACHPTENLIASGSLDKTVRIWT 313

BLAST of CSPI06G21170 vs. TAIR10
Match: AT4G02730.1 (AT4G02730.1 Transducin/WD40 repeat-like superfamily protein)

HSP 1 Score: 148.3 bits (373), Expect = 1.5e-35
Identity = 82/260 (31.54%), Postives = 139/260 (53.46%), Query Frame = 1

Query: 310 RKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEG-SLLKTFEGHLDRLARIAF 369
           R +   +GHT  ++ V FS     LA+AS D+T  LWSA   SL+  +EGH   ++ +A+
Sbjct: 34  RHLKTLEGHTAAISCVKFSNDGNLLASASVDKTMILWSATNYSLIHRYEGHSSGISDLAW 93

Query: 370 HPSGKYLGTTSFDKTWRLWDVETGVELL-LQEGHSRSVYGIAFHHDGSLVSSCGLDALAR 429
                Y  + S D T R+WD  +  E L +  GH+  V+ + F+   +L+ S   D   R
Sbjct: 94  SSDSHYTCSASDDCTLRIWDARSPYECLKVLRGHTNFVFCVNFNPPSNLIVSGSFDETIR 153

Query: 430 VWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSL-YIIPAHS 489
           +W+++TG+ V  ++ H  P+  V F+ +G  + +   D +C+IWD ++   L  +I   S
Sbjct: 154 IWEVKTGKCVRMIKAHSMPISSVHFNRDGSLIVSASHDGSCKIWDAKEGTCLKTLIDDKS 213

Query: 490 NLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKV---TSLDIISDGQC 549
             VS  K+ P  G F++ A+ D T K+ +    K +K  +GH  KV   TS   +++G+ 
Sbjct: 214 PAVSFAKFSP-NGKFILVATLDSTLKLSNYATGKFLKVYTGHTNKVFCITSAFSVTNGKY 273

Query: 550 IATVSHDRTIKLWSVNSKDI 564
           I + S D  + LW + +++I
Sbjct: 274 IVSGSEDNCVYLWDLQARNI 292

BLAST of CSPI06G21170 vs. TAIR10
Match: AT2G33340.1 (AT2G33340.1 MOS4-associated complex 3B)

HSP 1 Score: 143.3 bits (360), Expect = 4.7e-34
Identity = 86/288 (29.86%), Postives = 145/288 (50.35%), Query Frame = 1

Query: 292 LATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEG- 351
           +AT  +   A L+  P  + +S   GH+++VT V F   ++ + TASAD+T R+W   G 
Sbjct: 237 IATGGVDATAVLFDRPSGQILSTLTGHSKKVTSVKFVGDSDLVLTASADKTVRIWRNPGD 296

Query: 352 ---SLLKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSV-- 411
              +   T   H   +  +  HP+ KY  + S D TW  +D+ +G  L      S++V  
Sbjct: 297 GNYACGYTLNDHSAEVRAVTVHPTNKYFVSASLDGTWCFYDLSSGSCLAQVSDDSKNVDY 356

Query: 412 YGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGED 471
              AFH DG ++ +    ++ ++WD+++  +V   +GH   V  +SFS NGY LAT  ED
Sbjct: 357 TAAAFHPDGLILGTGTSQSVVKIWDVKSQANVAKFDGHTGEVTAISFSENGYFLATAAED 416

Query: 472 NTCRIWDLRKKKSL-YIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSAR-DFKPVK 531
              R+WDLRK ++    + A +N    V+++P   Y  + AS     +  S + ++  +K
Sbjct: 417 GV-RLWDLRKLRNFKSFLSADAN---SVEFDPSGSYLGIAASDIKVYQTASVKAEWNLIK 476

Query: 532 TLS--GHEAKVTSLDIISDGQCIATVSHDRTIKLWSVNSKDIQTMDVD 570
           TL       K T +   SD Q +A  S DR ++++ +   +   +D D
Sbjct: 477 TLPDLSGTGKATCVKFGSDAQYVAVGSMDRNLRIFGLPGDEKANVDDD 520

BLAST of CSPI06G21170 vs. NCBI nr
Match: gi|449458027|ref|XP_004146749.1| (PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis sativus])

HSP 1 Score: 1108.6 bits (2866), Expect = 0.0e+00
Identity = 568/569 (99.82%), Postives = 568/569 (99.82%), Query Frame = 1

Query: 1   MEIDDQNPASTAAESPETLPGGENEELDIPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60
           MEIDDQNPASTAAESPETLPGGENEELD PAEPTQPAATSVIPPSIVPAIAPIPPPIIRP
Sbjct: 1   MEIDDQNPASTAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60

Query: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120
           LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE
Sbjct: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120

Query: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180
           KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG
Sbjct: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180

Query: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240
           QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE
Sbjct: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240

Query: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300
           RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV
Sbjct: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300

Query: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 360
           AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL
Sbjct: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 360

Query: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 420
           DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC
Sbjct: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 420

Query: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480
           GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY
Sbjct: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480

Query: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540
           IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD
Sbjct: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540

Query: 541 GQCIATVSHDRTIKLWSVNSKDIQTMDVD 570
           GQCIATVSHDRTIKLWSVNSKDIQTMDVD
Sbjct: 541 GQCIATVSHDRTIKLWSVNSKDIQTMDVD 569

BLAST of CSPI06G21170 vs. NCBI nr
Match: gi|659129524|ref|XP_008464716.1| (PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis melo])

HSP 1 Score: 1099.3 bits (2842), Expect = 0.0e+00
Identity = 563/569 (98.95%), Postives = 565/569 (99.30%), Query Frame = 1

Query: 1   MEIDDQNPASTAAESPETLPGGENEELDIPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60
           MEIDDQNPASTAAESPETLPGGENEELD PAEPTQPAATSVIPPSIVPAIAPIPPPIIRP
Sbjct: 1   MEIDDQNPASTAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60

Query: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120
           LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAP+RTAPGSTAEYEISEESRQARERHE
Sbjct: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPARTAPGSTAEYEISEESRQARERHE 120

Query: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180
           KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG
Sbjct: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180

Query: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240
           QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE
Sbjct: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240

Query: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300
           RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV
Sbjct: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300

Query: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 360
           AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWS EGSLLKTFEGHL
Sbjct: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSPEGSLLKTFEGHL 360

Query: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 420
           DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFH DGSLVSSC
Sbjct: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHQDGSLVSSC 420

Query: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480
           GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY
Sbjct: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480

Query: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540
           IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD
Sbjct: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540

Query: 541 GQCIATVSHDRTIKLWSVNSKDIQTMDVD 570
           GQCIATVSHDRTIKLWSVNSKD QTMD+D
Sbjct: 541 GQCIATVSHDRTIKLWSVNSKDKQTMDID 569

BLAST of CSPI06G21170 vs. NCBI nr
Match: gi|731400356|ref|XP_010653924.1| (PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Vitis vinifera])

HSP 1 Score: 863.2 bits (2229), Expect = 2.5e-247
Identity = 442/577 (76.60%), Postives = 486/577 (84.23%), Query Frame = 1

Query: 1   MEIDDQNPASTAAESPETLPGGENEELDI----PAEPTQPAATSVIPPSIVPAIAPIPP- 60
           M++D++NP S +   P      +   +      P    QP   S++PP IVP IAPIP  
Sbjct: 1   MDVDEENPVSVSLAEPSAAVTDDQTPVSTIDPTPLPAMQPIIPSLVPPPIVPPIAPIPSV 60

Query: 61  --PIIRPLAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESR 120
             PI+RPLAPLP RPP+ RPP+ QNGE+R SDSDS+ D+   ++ A GS  EYEISEESR
Sbjct: 61  SAPILRPLAPLPVRPPVLRPPLPQNGEMRASDSDSDRDDSGRAQAASGSAVEYEISEESR 120

Query: 121 QARERHEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIM 180
           Q RER EKA QEFLMKRRASALAVPTNDMAVR RLRRLGEPITLFGEREMERRDRLR IM
Sbjct: 121 QFRERQEKAKQEFLMKRRASALAVPTNDMAVRTRLRRLGEPITLFGEREMERRDRLRMIM 180

Query: 181 ARLDAEGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSIL 240
           A+LDAEGQLEKLMK HEEEEAAA    EE EEE LQYPFYTEGSK+LL+AR++IAKYSI 
Sbjct: 181 AKLDAEGQLEKLMKAHEEEEAAAPVAMEEVEEETLQYPFYTEGSKSLLEARVEIAKYSIK 240

Query: 241 RASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLA 300
           RA+SRL RA+RKRDDPDED++AEMDW L++A SLVLDCSEIGDDRPLSGCSFS DGK LA
Sbjct: 241 RAASRLYRARRKRDDPDEDLDAEMDWVLKEAGSLVLDCSEIGDDRPLSGCSFSHDGKLLA 300

Query: 301 TSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLL 360
             +LSGVAK+WSMPQV KVS  KGHTER TDV FSP    LATASADRTARLW++EGSLL
Sbjct: 301 ACALSGVAKIWSMPQVNKVSALKGHTERATDVAFSPALNHLATASADRTARLWNSEGSLL 360

Query: 361 KTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHD 420
           KTFEGHLDRLARIAFHPSGKYLGT SFDKTWRLWDVETG ELLLQEGHSRSVYGI+FH D
Sbjct: 361 KTFEGHLDRLARIAFHPSGKYLGTASFDKTWRLWDVETGEELLLQEGHSRSVYGISFHRD 420

Query: 421 GSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDL 480
           GSL +SCGLDAL RVWDLR+GRS+LALEGHVKPVLG+ FSPNGYHLATG EDNTCRIWDL
Sbjct: 421 GSLAASCGLDALGRVWDLRSGRSILALEGHVKPVLGICFSPNGYHLATGAEDNTCRIWDL 480

Query: 481 RKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVT 540
           RKKKSLY+IPAHSNLVSQVK+EPQEGYFLVTAS+DMTAK+WSARDFKPVKTLSGHEAKVT
Sbjct: 481 RKKKSLYVIPAHSNLVSQVKFEPQEGYFLVTASYDMTAKVWSARDFKPVKTLSGHEAKVT 540

Query: 541 SLDIISDGQCIATVSHDRTIKLW-SVNSKDIQTMDVD 570
           SLDI  DG CIATVSHDRTIKLW S   +  + MD+D
Sbjct: 541 SLDITEDGHCIATVSHDRTIKLWSSAEIEKEKAMDID 577

BLAST of CSPI06G21170 vs. NCBI nr
Match: gi|147819065|emb|CAN64891.1| (hypothetical protein VITISV_016440 [Vitis vinifera])

HSP 1 Score: 859.8 bits (2220), Expect = 2.8e-246
Identity = 444/570 (77.89%), Postives = 483/570 (84.74%), Query Frame = 1

Query: 4    DDQNPASTAAESPETLPGGENEELDIPAEPTQPAATSVIPPSIVPAIAPIPP---PIIRP 63
            DDQ P ST   +P            +PA   QP   S++PP IVP IAPIP    PI+RP
Sbjct: 533  DDQTPVSTIDPTP------------LPA--MQPIIPSLVPPPIVPPIAPIPSVSAPILRP 592

Query: 64   LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 123
            LAPLP RPP+ RPP+ QNGE+R SDSDS+ D+   ++ A GS  EYEISEESRQ RER E
Sbjct: 593  LAPLPVRPPVLRPPLPQNGEMRASDSDSDRDDSGRAQAASGSAVEYEISEESRQFRERQE 652

Query: 124  KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 183
            KA QEFLMKRRASALAVPTNDMAVR RLRRLGEPITLFGEREMERRDRLR IMA+LDAEG
Sbjct: 653  KAKQEFLMKRRASALAVPTNDMAVRTRLRRLGEPITLFGEREMERRDRLRMIMAKLDAEG 712

Query: 184  QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 243
            QLEKLMK HEEEEAAA    EE EEE LQYPFYTEGSK+LL+AR++IAKYSI RA+SRL 
Sbjct: 713  QLEKLMKAHEEEEAAAPVAMEEVEEETLQYPFYTEGSKSLLEARVEIAKYSIKRAASRLY 772

Query: 244  RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 303
            RA+RKRDDPDED++AEMDW L++A SLVLDCSEIGDDRPLSGCSFS DGK LA  +LSGV
Sbjct: 773  RARRKRDDPDEDLDAEMDWVLKEAGSLVLDCSEIGDDRPLSGCSFSHDGKLLAACALSGV 832

Query: 304  AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 363
            AK+WSMPQV KVS  KGHTER TDV FSP    LATASADRTARLW++EGSLLKTFEGHL
Sbjct: 833  AKIWSMPQVNKVSALKGHTERATDVAFSPALNHLATASADRTARLWNSEGSLLKTFEGHL 892

Query: 364  DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 423
            DRLARIAFHPSGKYLGT SFDKTWRLWDVETG ELLLQEGHSRSVYGI+FH DGSL +SC
Sbjct: 893  DRLARIAFHPSGKYLGTASFDKTWRLWDVETGEELLLQEGHSRSVYGISFHRDGSLAASC 952

Query: 424  GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 483
            GLDAL RVWDLR+GRS+LALEGHVKPVLG+ FSPNGYHLATG EDNTCRIWDLRKKKSLY
Sbjct: 953  GLDALGRVWDLRSGRSILALEGHVKPVLGICFSPNGYHLATGAEDNTCRIWDLRKKKSLY 1012

Query: 484  IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 543
            +IPAHSNLVSQVK+EPQEGYFLVTAS+DMTAK+WSARDFKPVKTLSGHEAKVTSLDI  D
Sbjct: 1013 VIPAHSNLVSQVKFEPQEGYFLVTASYDMTAKVWSARDFKPVKTLSGHEAKVTSLDITED 1072

Query: 544  GQCIATVSHDRTIKLW-SVNSKDIQTMDVD 570
            G CIATVSHDRTIKLW S   +  + MD+D
Sbjct: 1073 GHCIATVSHDRTIKLWSSAEIEKEKAMDID 1088

BLAST of CSPI06G21170 vs. NCBI nr
Match: gi|703122993|ref|XP_010102702.1| (U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Morus notabilis])

HSP 1 Score: 856.3 bits (2211), Expect = 3.1e-245
Identity = 452/615 (73.50%), Postives = 506/615 (82.28%), Query Frame = 1

Query: 1   MEIDDQNP-ASTAAESPETLPGGENEELDIPA------EPTQPAATSVIPPS-------- 60
           ME+D++NP +ST  E+   +  G+     +PA      +P  P    VIPP         
Sbjct: 1   MEVDEENPPSSTPVEASSLVDDGQTV---VPAVNSTAIQPIPPIIPPVIPPPVVPPVAPP 60

Query: 61  IVPAIAPIP----PPIIRPLAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPG 120
           IVPA+APIP    PP++RPLAPLP RPP+ RPPV QNGE+RTSDSDS+ D+  P RTAPG
Sbjct: 61  IVPAMAPIPTLPTPPVLRPLAPLPIRPPIPRPPVPQNGEMRTSDSDSD-DDSDPGRTAPG 120

Query: 121 STAEYEISEESRQARERHEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGER 180
           ST EYEISEESRQ RER EKAM E +MKRRASALAVPTNDMAVRARLRRLGEPITLFGER
Sbjct: 121 STQEYEISEESRQVRERQEKAMLELMMKRRASALAVPTNDMAVRARLRRLGEPITLFGER 180

Query: 181 EMERRDRLRSIMARLDAEGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALL 240
           EMERRDRLR +MA+LDAEGQLEKLMK HEEEEAAA+   EEAEEE+LQYPFYTEGSKALL
Sbjct: 181 EMERRDRLRMLMAKLDAEGQLEKLMKAHEEEEAAASATGEEAEEEMLQYPFYTEGSKALL 240

Query: 241 DARIDIAKYSILRASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLS 300
           DARIDIAKYSI+RA++RL+RA+RKRDDPDEDV+AEMDWAL+QA SL LDCSEIGDDRPLS
Sbjct: 241 DARIDIAKYSIVRAATRLQRAQRKRDDPDEDVDAEMDWALKQAGSLALDCSEIGDDRPLS 300

Query: 301 GCSFSSDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADR 360
           GCS S DGKFLAT SL+GVAKLWSMP+V+KVS  KGHTER+TDV FSP +  +AT SADR
Sbjct: 301 GCSLSRDGKFLATCSLTGVAKLWSMPKVQKVSTLKGHTERLTDVKFSPTDNLIATGSADR 360

Query: 361 TARLWSAEGSLLKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGH 420
           TARLW+ EG  LKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWD++TGVELLLQEG 
Sbjct: 361 TARLWNTEGFHLKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDIDTGVELLLQEGQ 420

Query: 421 SRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLAT 480
           SRSVYGI FH DGSL +SCGLD+L RVWDLRTGRS+LALEGHVKPVLG+SFS NGY+LA+
Sbjct: 421 SRSVYGIDFHQDGSLAASCGLDSLVRVWDLRTGRSILALEGHVKPVLGLSFSANGYYLAS 480

Query: 481 GGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKP 540
           GGEDNTCRIWDLRKKKS+Y+IPAHSNL+SQVK+EPQEGYFL+TAS+DMTAKIWSARDFKP
Sbjct: 481 GGEDNTCRIWDLRKKKSVYMIPAHSNLISQVKFEPQEGYFLITASYDMTAKIWSARDFKP 540

Query: 541 VKTLSGHEAKVTSLD--------------------------IISDGQCIATVSHDRTIKL 570
           VKTLSGHEAKVTSLD                          +++DG C+ATVSHDRTIKL
Sbjct: 541 VKTLSGHEAKVTSLDVAGGRSHSPEKKFKQLCSRFNKSVLLVMADGNCVATVSHDRTIKL 600

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
PRP4L_ARATH2.2e-20363.65U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Arabidopsis thaliana ... [more]
PRP4_PONAB1.1e-11446.84U4/U6 small nuclear ribonucleoprotein Prp4 OS=Pongo abelii GN=PRPF4 PE=2 SV=1[more]
PRP4_HUMAN1.1e-11446.84U4/U6 small nuclear ribonucleoprotein Prp4 OS=Homo sapiens GN=PRPF4 PE=1 SV=2[more]
PRP4_BOVIN1.1e-11446.84U4/U6 small nuclear ribonucleoprotein Prp4 OS=Bos taurus GN=PRPF4 PE=2 SV=1[more]
PRP4_MOUSE1.9e-11446.84U4/U6 small nuclear ribonucleoprotein Prp4 OS=Mus musculus GN=Prpf4 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KHD8_CUCSA0.0e+0099.82Uncharacterized protein OS=Cucumis sativus GN=Csa_6G404170 PE=4 SV=1[more]
F6HLM8_VITVI1.7e-24776.60Putative uncharacterized protein OS=Vitis vinifera GN=VIT_08s0007g05930 PE=4 SV=... [more]
A5ATG0_VITVI1.9e-24677.89Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_016440 PE=4 SV=1[more]
W9RUJ5_9ROSA2.1e-24573.50U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Morus notabilis GN=L4... [more]
B9H9H5_POPTR1.3e-24275.04Transducin family protein OS=Populus trichocarpa GN=POPTR_0006s04430g PE=4 SV=2[more]
Match NameE-valueIdentityDescription
AT2G41500.11.3e-20463.65 WD-40 repeat family protein / small nuclear ribonucleoprotein Prp4p-... [more]
AT2G05720.12.0e-5341.07 Transducin/WD40 repeat-like superfamily protein[more]
AT3G49660.12.1e-4233.79 Transducin/WD40 repeat-like superfamily protein[more]
AT4G02730.11.5e-3531.54 Transducin/WD40 repeat-like superfamily protein[more]
AT2G33340.14.7e-3429.86 MOS4-associated complex 3B[more]
Match NameE-valueIdentityDescription
gi|449458027|ref|XP_004146749.1|0.0e+0099.82PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis sati... [more]
gi|659129524|ref|XP_008464716.1|0.0e+0098.95PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis melo... [more]
gi|731400356|ref|XP_010653924.1|2.5e-24776.60PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Vitis vinife... [more]
gi|147819065|emb|CAN64891.1|2.8e-24677.89hypothetical protein VITISV_016440 [Vitis vinifera][more]
gi|703122993|ref|XP_010102702.1|3.1e-24573.50U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Morus notabilis][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR001680WD40_repeat
IPR014906PRP4-like
IPR015943WD40/YVTN_repeat-like_dom_sf
IPR017986WD40_repeat_dom
IPR019775WD40_repeat_CS
IPR020472G-protein_beta_WD-40_rep
IPR027106Prp4
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008380 RNA splicing
biological_process GO:0006468 protein phosphorylation
biological_process GO:0008150 biological_process
cellular_component GO:0030529 intracellular ribonucleoprotein complex
cellular_component GO:0019013 viral nucleocapsid
cellular_component GO:0005575 cellular_component
molecular_function GO:0005515 protein binding
molecular_function GO:0005524 ATP binding
molecular_function GO:0004672 protein kinase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI06G21170.1CSPI06G21170.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001680WD40 repeatPFAMPF00400WD40coord: 520..557
score: 1.5E-6coord: 312..347
score: 1.2E-4coord: 437..472
score: 6.0E-9coord: 399..430
score: 5.4E-4coord: 350..388
score: 1.
IPR001680WD40 repeatSMARTSM00320WD40_4coord: 518..557
score: 3.0E-10coord: 475..515
score: 2.1E-4coord: 391..430
score: 2.5E-6coord: 264..305
score: 0.021coord: 433..472
score: 4.4E-11coord: 308..347
score: 1.3E-8coord: 349..388
score: 1.
IPR001680WD40 repeatPROFILEPS50082WD_REPEATS_2coord: 315..347
score: 13.516coord: 525..566
score: 14.017coord: 356..397
score: 13.483coord: 398..439
score: 14.251coord: 482..524
score: 11.879coord: 440..481
score: 16
IPR014906Pre-mRNA processing factor 4 (PRP4)-likePFAMPF08799PRP4coord: 144..171
score: 9.3
IPR014906Pre-mRNA processing factor 4 (PRP4)-likeSMARTSM00500pr04_2coord: 139..192
score: 5.3
IPR014906Pre-mRNA processing factor 4 (PRP4)-likeunknownSSF158230PRP4-likecoord: 121..177
score: 7.72
IPR015943WD40/YVTN repeat-like-containing domainGENE3DG3DSA:2.130.10.10coord: 379..561
score: 2.3E-58coord: 275..378
score: 8.7
IPR017986WD40-repeat-containing domainPROFILEPS50294WD_REPEATS_REGIONcoord: 273..566
score: 6
IPR017986WD40-repeat-containing domainunknownSSF50978WD40 repeat-likecoord: 273..556
score: 5.49
IPR019775WD40 repeat, conserved sitePROSITEPS00678WD_REPEATS_1coord: 459..473
score: -coord: 375..389
score: -coord: 417..431
scor
IPR020472G-protein beta WD-40 repeatPRINTSPR00320GPROTEINBRPTcoord: 334..348
score: 2.2E-6coord: 544..558
score: 2.2E-6coord: 459..473
score: 2.
IPR027106U4/U6 small nuclear ribonucleoprotein Prp4PANTHERPTHR19846WD40 REPEAT PROTEINcoord: 4..563
score:
NoneNo IPR availableunknownCoilCoilcoord: 175..195
scor
NoneNo IPR availablePANTHERPTHR19846:SF0U4/U6 SMALL NUCLEAR RIBONUCLEOPROTEIN PRP4coord: 4..563
score: