CsGy6G020435.1 (mRNA) Cucumber (Gy14) v2.1

Overview
Name: CsGy6G020435.1
Type: mRNA
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: U4/U6 small nuclear ribonucleoprotein PRP4-like protein
Location: Gy14Chr6: 21200565 .. 21206275 (+)
Sequence length: 2932
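The location string in the overview can be turned into a genomic span with a few lines of Python. This is a minimal sketch: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), which this record does not state explicitly.

```python
import re

def parse_location(text: str):
    """Parse a location such as 'Gy14Chr6: 21200565 .. 21206275 (+)'.

    Returns (sequence id, start, end, strand). The pattern is written for
    the layout shown in this record and is an assumption, not a standard.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Gy14Chr6: 21200565 .. 21206275 (+)")

# Assuming 1-based inclusive coordinates, the span covers end - start + 1 bases.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the feature spans 5711 genomic bases, compared with the listed sequence length of 2932 for the spliced transcript.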
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TAAAAAAAGAAAAACCTAGGTAAGGCCTTACCCGCTCTTTTACTTCCTCACTCTCGGAGAGACTATCCTTTCTTCACGCCGGTTCCCTCCCTTACGCACCTCCTTGCCACCGCCGCCGTTGGTATAGTCTCTCTTCCCGCTTTGTCCGTCTTCTACCAGTTTTTTACTGCTATACATTTGGCACTCCCGTCATCCCTTGTCCGGTTCAAAGTTCACCTGTGCACGAGGGCAATACTGTATCATCTCCGTCAACTGGAAATGTTTTGACGGTACGCATCCCTTGCTAGATCCGAATCTCCCTCCATTTCTCTTTCTCTTTCTCTTTCTCCGGTAACTTTAGTTATTTCACTCATCTATTTCTCTATTGATTGGCCTTTTCTGGTGCTGTAACAGTGATGGTTCTTTTTTTTCCTGGTTCGTGTTATTACTGCACTGTGAACATGATGGGAGAAATTATTGTTTATGTAACTACGAGAGTCTCCAAAGTGTGTATTGCAGTGTCGTGTTTAGAGAAGAAAAGGGATTTTGTTTCTATTTTCTTGAACTGTATCTTTCTTCAAATGTCTGTCCGATAATCTGTTATAGGATCGGCGCAGCCTCAGAATGTTACTTACTGTTTGAATTTCTTGAATTGTATCTTTCTTCAAATGGTTTGATTTTTTTTTCCCCCTTTACTTTTGAAGGTCTCCTGTTAACAGTGAAATTCTCGTTTTACTATTAGTGTTAGACTTTGTTTCTCATTTGTGTTATGTTAACTAATTTTTGGGAGCTTGATTGTAGAGAAGAAATGAAAAGAGGAGCTTAATATCTAGGATAAGCTTGTGTTGATTAGAAGCTTCATTTCCGAGCAAGCTATAATGGAAATTGATGATCAAAACCCATCAACTGCTGCGGAATCTCCTGAAACCCTTCCTGGTGGTGAAAATGAGGAACTTGATAATCCAGCCGAACCAACTCAACCTGCAGCTACATCAGTTATTCCACCTTCAATTGTCCCTGCCATTGCTCCTATCCCCCCTCCAATTATCCGTCCATTGGCTCCTCTTCCGAGCCGACCTCCCCTCTTTAGGCCCCCTGTAACACAAAATGGTGAGCTGAGAACAAGTGACTCAGACTCGGAACATGATGAATTGGCTCCTTCCCGAACAGCTCCAGGTTCAACTGCAGAGTATGAAATCTCAGAAGAGAGCAGGCAAGCTAGGGAGCGTCATGAAAAAGCCATGCAGGAATTTTTGATGAAGCGTCGTGCTTCTGCTCTAGCAGTGCCTACTAATGACATGGCTGTTCGAGCTCGTCTTCGACGTCTTGGTGAGCCCATAACTCTTTTTGGAGAAAGAGAAATGGAAAGACGGGACAGGTTGCGGTCGATAATGGCACGATTGGATGCTGAAGGGCAGTTAGAGAAGCTCATGAAAGTTCACGAGGAAGAGGAGGCTGCAGCTACTGGTGGAACTGAGGAGGCTGAGGAGGAAGTGCTTCAGTATCCCTTTTATACTGAGGGGTCCAAAGCTCTCTTGGATGCAAGAATTGATATTGCAAAGTATTCCATCCTAAGAGCATCTTCACGCCTGGAGCGTGCAAAGAGGAAACGGGATGACCCAGATGAAGATGTGGAAGCGGAAATGGATTGGGCTCTGAGACAGGCTGAAAGTTTGGTCCTGGACTGTAGTGAGATAGGAGATGATCGACCACTTTCTGGTTGTTCTTTCTCGTCTGATGGAAAATTTCTTGCCACTAGGTATCTTCTCACTTTGCAGTTAATGATTTTTAGCTACTGTACCAGTTTCCCCTTTTTAATATCCATCCCCATTTCGCCTCTTGGTAGAAGGGGGAAAAACGAAAAAAGAAAATATAATAGATCTTTCTGGCTATAGATTAGCTATTATCTTCCTACTTAGCTAAACTTCCTCAGTTGAACACGAAACCAAAGATAATATAGAAATTAATGTTTTTCCATTTTTCATTCCAGTACGTGATAAGCTTTACAGGCTTTGTATTTA
GCTTTCATTTGCTACTGGCTTTGCATTTGATGGTTTCTGTATTTTTTTTTCTTGTTTCATTTCTATAGTTCACTGAGTGGAGTTGCAAAGTTGTGGAGCATGCCTCAAGTAAGGAAGGTTTCCAACTTTAAGGGACACACAGAGCGTGTTACTGATGTAATGTTTTCTCCAGTGAACGAGTGTTTAGCAACTGCCTCTGCTGACCGAACTGCAAGGTTGTGGTCTGCGGAAGGATCTCTACTTAAAACATTTGAGGGCCATCTAGACCGCCTTGCACGAATTGCCTTCCACCCGTCGGGCAAGTACTTGGGCACAACTAGCTTTGACAAGACCTGGAGATTATGGGATGTTGAAACTGGTGTAGAATTACTTCTTCAAGAAGGTCACAGTAGAAGTGTCTATGGGATAGCCTTCCACCATGATGGATCCTTGGTATCATCTTGTGGACTTGATGCGCTTGCTCGCGTTTGGGATCTTCGAACTGGTAGAAGTGTTCTTGCCTTGGAAGGCCACGTCAAGCCAGTAAAGCTCTCTCCTTTCTTTCGATGGGATTGTAGAACATTTGCTTTTTTAATAAATTGGCTTGTTTGCTTCTCACTGATTCCTTGTCATATGTCTATGATGTTATAACCAGGTCCTTGGAGTTAGTTTTTCACCCAACGGTTATCATTTAGCTACTGGTGGAGAAGATAACACTTGTCGAATATGGGATTTAAGGAAGAAAAAATCTCTTTACATAATACCTGCACATTCAAACTTAGTTTCACAGGTGAAATATGAACCTCAGGAGGGATATTTCTTGGTTACTGCATCATTTGATATGACAGCAAAGGTAATCTTGAATACTGCCAGGATGATATAATATTGTTCAATTCTTAATATTTTCATTTGATTGTTGCTTAATCTCACTTGTTGTCTAAGGTTTCACTATCATATGCTCAACTTTTTTTTTCTGCAGATTTGGTCTGCTCGAGATTTTAAGCCTGTGAAGACACTCTCTGGTCACGAAGCAAAAGTTACATCTTTGGATATAATTTCAGGTTAGTTATATAACAACTCCTAGGTCATTTACTATTATTGCGTTGTTTACACAGATTGCCACCTAAAGATTTGTAATTTTCACCGGTAATTAGCCTTATTTAGGCACAATCTCAGATTGAACTAATTAACAAAAATATCTTTCTGAGGTGGAACCATATTATGTCATTGATGAGCTGGAATTCGGGATCCTATTTTATTAGCTCTAGTGGTGAAAATGGACAGCTTTTTCTTGGTCTAAAATTAAGTGCTCTTTCAAACATTATAGGTTATCGATTTTAGTTACTGGTTCACGATCTCTTCTTTGATCACCTTTAGATTCTTGAGATATCCTCCTTATTTCATTTTGAAATATTCCTTTTACACAAAAAAAAAAAGAGGTTTCAGACCTTCGGTGGCGGGCGATGGCCATGTTAGCTCCAGTGAGAGAGAGCAAACGAGTGAGAGAGAAAGGGGACTAACCAATGAGTGGTACAACCAGTAGGGAAGGGGGAACTAAAACCAACTAGTGATGGAGAGAAGGGGAAGAAGTACCTTTTGGTTTTATTGGGAGAAGTGTTGAGCTCGAGTGAGAGGGAGATAAACAAAATGGTTGTTGATTGAAGTCTCATCTCTGCAATTTATGAAAGAGAGAAAAAAGGAAAGAAAGAATGACTTGAAGTCTTTAACCTTCAAAAAATTGTGTGTGTCTATATATATAGATAGATAGGTATATGAGATTGATTCAGTTCAGGTTGAATCAAAGTTCTTTTCAAGTATGCCCAATTCTTTTGTTTATTTTTGTGATCTTGTACAATTCACAGCGGAGTAACCAACCCAAGTTCTAATTACTCAACTTTTTTCATCAACATAGATTGGAAGGTTTATATTGAGAGAAGTGAATGAATATAGGAGGGAATTAGAAAAAACGTGCAACAGAAAGAGGGGATCAGACTAACTACGTGGACTCCAGTCCAAATGAA
CAAGACCTAGGTCATAAGTACAAAAAGAACTACTGATTGACGCCCACAAGGACATGTTAGGCCGTTCAAGAGATTTTACTGTAGCTTAGCTTTTTCCTTCGTTGTTTGGTGGCCTCTTCTTTGGCTACCTGTTGGGTGTTCTCACTTTAGAAGGTTAAAAATTCTAAAAAAGGTAAATCTTTTTGTGGTGAATTCTTAAAAGTAGTGCCAGTATGGTTGGACTATACCAGGGGAAGGGGATCTAGAACACCTTTAGGGTGGTGTTCCTATGGCTTAGAACAAATTCTTTTTCCCGGATGATGGATGTTAGTTCAATCCAAGGTTGTTAAATGATCTTTTGGGCACTGGAAAGTATTTCAGTAATCTTTGATATACGGTGTTGGTTTTTTCTCTAATCTTCACCCTTGCTGTAATATTTCAGCCCATTATTGTTTATGTGAAATTTCAAATAAGACCAACTCAACCAGAATATGGTTAAGGATTTAGGAAATTATTAAAGGATACTGGACTGTTTTTGTGACAAGTAAAACGTTGGAAGGTGTTTGAATGTAATTTGCTTGTCTTTAAGGACACAACCATAAAAGTTATCAACAGTTTTTCTTTGGTTTTAATATGAATGGTTACTGATACATCTTTAACACTAAGCTCTAAGTTTGTGCATTGTATGCTTAAAATACTTTTTGGAGCTTTATTCTGCATATATGTTTCTGATACCCTGATATTGTTCTTGTCAATAACAGACGGACAGTGTATTGCAACCGTCTCACATGATCGGGCCATAAAGCTCTGGTCTGTTAATAGTAAAGACATTCAGACTATGGACGTTGATTGACTTTTCATAGTACCAACATGGGAGATGTTTCACGACCTTACTTGATCAGGGTTCTGTTCTATAAATTGTTCGTCTCGTCCTTGAGTACTAAAGAGGCATGAGAGATGTTTCGAAACTTTGTGGCCTCAGGTGATAAAGAGCTTGGATCAGATGATCTCTGTAACATCATGTATCTGATTGAAATTTTTTAGGGAAAACCCCTGTACTTTTTTTTTCCTTGAAGAGAAATGGAAGGATTATATCAATGGAGGACATAGAGAAAAAGGGAGATTGGGATATTTCCCGCATGTTGAGATTAAAAATGCACTAAAATTAGAGGTAAGAATTGATAAGTAAATTACATGTAGTTTTCTTTAGATTTCCTTTTGGTTAGAAATGGTTTGTGCACCGACTACCATAGTGATTATTCGGTTAAAATACAATTTCAGGTAAGCCTAATTGGACCAATTAGACTCTAACCAAAACTATAAATTTATCAATTGCTTTTCCGTCTTAATCTCTGTTGAGTTCGATAAGCAAATGTTTTTTAATTAACTCTAAATTGAATAAGTTGGTGGTGAAGATGATCACATTCCCTATTTGGATGTGTGCACAATTCTCGAAAATCTTAGTTATGTTGAATTGAAATAACCGAAAATGTATCGATTAAGTATGGAAGAGAATTATGCTTTTTTTGTTGGATTTGGTTGTAACAGTATGATCTAGAATTTTAGATTTTCATTTCAATACGAAATCAGAAGTAAATGAAATAAGAAAATACATGCGAGTTGATAAGTGATGAGAACAAGAAAAAGGAATTGAAGAGAGAGAGAGAGAGAGAGGTTAGGAAAAAGAAATTAAAATTGTCTTTGTTTGTAATGGGAAACCCCAAACACGTGC

mRNA sequence

TAAAAAAAGAAAAACCTAGGTAAGGCCTTACCCGCTCTTTTACTTCCTCACTCTCGGAGAGACTATCCTTTCTTCACGCCGGTTCCCTCCCTTACGCACCTCCTTGCCACCGCCGCCGTTGGTATAGTCTCTCTTCCCGCTTTGTCCGTCTTCTACCAGTTTTTTACTGCTATACATTTGGCACTCCCGTCATCCCTTGTCCGGTTCAAAGTTCACCTGTGCACGAGGGCAATACTGTATCATCTCCGTCAACTGGAAATGTTTTGACGAGAAGAAATGAAAAGAGGAGCTTAATATCTAGGATAAGCTTGTGTTGATTAGAAGCTTCATTTCCGAGCAAGCTATAATGGAAATTGATGATCAAAACCCATCAACTGCTGCGGAATCTCCTGAAACCCTTCCTGGTGGTGAAAATGAGGAACTTGATAATCCAGCCGAACCAACTCAACCTGCAGCTACATCAGTTATTCCACCTTCAATTGTCCCTGCCATTGCTCCTATCCCCCCTCCAATTATCCGTCCATTGGCTCCTCTTCCGAGCCGACCTCCCCTCTTTAGGCCCCCTGTAACACAAAATGGTGAGCTGAGAACAAGTGACTCAGACTCGGAACATGATGAATTGGCTCCTTCCCGAACAGCTCCAGGTTCAACTGCAGAGTATGAAATCTCAGAAGAGAGCAGGCAAGCTAGGGAGCGTCATGAAAAAGCCATGCAGGAATTTTTGATGAAGCGTCGTGCTTCTGCTCTAGCAGTGCCTACTAATGACATGGCTGTTCGAGCTCGTCTTCGACGTCTTGGTGAGCCCATAACTCTTTTTGGAGAAAGAGAAATGGAAAGACGGGACAGGTTGCGGTCGATAATGGCACGATTGGATGCTGAAGGGCAGTTAGAGAAGCTCATGAAAGTTCACGAGGAAGAGGAGGCTGCAGCTACTGGTGGAACTGAGGAGGCTGAGGAGGAAGTGCTTCAGTATCCCTTTTATACTGAGGGGTCCAAAGCTCTCTTGGATGCAAGAATTGATATTGCAAAGTATTCCATCCTAAGAGCATCTTCACGCCTGGAGCGTGCAAAGAGGAAACGGGATGACCCAGATGAAGATGTGGAAGCGGAAATGGATTGGGCTCTGAGACAGGCTGAAAGTTTGGTCCTGGACTGTAGTGAGATAGGAGATGATCGACCACTTTCTGGTTGTTCTTTCTCGTCTGATGGAAAATTTCTTGCCACTAGTTCACTGAGTGGAGTTGCAAAGTTGTGGAGCATGCCTCAAGTAAGGAAGGTTTCCAACTTTAAGGGACACACAGAGCGTGTTACTGATGTAATGTTTTCTCCAGTGAACGAGTGTTTAGCAACTGCCTCTGCTGACCGAACTGCAAGGTTGTGGTCTGCGGAAGGATCTCTACTTAAAACATTTGAGGGCCATCTAGACCGCCTTGCACGAATTGCCTTCCACCCGTCGGGCAAGTACTTGGGCACAACTAGCTTTGACAAGACCTGGAGATTATGGGATGTTGAAACTGGTGTAGAATTACTTCTTCAAGAAGGTCACAGTAGAAGTGTCTATGGGATAGCCTTCCACCATGATGGATCCTTGGTATCATCTTGTGGACTTGATGCGCTTGCTCGCGTTTGGGATCTTCGAACTGGTAGAAGTGTTCTTGCCTTGGAAGGCCACGTCAAGCCAGTCCTTGGAGTTAGTTTTTCACCCAACGGTTATCATTTAGCTACTGGTGGAGAAGATAACACTTGTCGAATATGGGATTTAAGGAAGAAAAAATCTCTTTACATAATACCTGCACATTCAAACTTAGTTTCACAGGTGAAATATGAACCTCAGGAGGGATATTTCTTGGTTACTGCATCATTTGATATGACAGCAAAGATTTGGTCTGCTCGAGATTTTAAGCCTGTGAAGACACTCTCTGGTCACGAAGCAAAAGTTACATCTTTGGATATAATTTCAGACGGACAGTGTATTGCAACCGTCTCACATGATCGGGCCA
TAAAGCTCTGGTCTGTTAATAGTAAAGACATTCAGACTATGGACGTTGATTGACTTTTCATAGTACCAACATGGGAGATGTTTCACGACCTTACTTGATCAGGGTTCTGTTCTATAAATTGTTCGTCTCGTCCTTGAGTACTAAAGAGGCATGAGAGATGTTTCGAAACTTTGTGGCCTCAGGTGATAAAGAGCTTGGATCAGATGATCTCTGTAACATCATGTATCTGATTGAAATTTTTTAGGGAAAACCCCTGTACTTTTTTTTTCCTTGAAGAGAAATGGAAGGATTATATCAATGGAGGACATAGAGAAAAAGGGAGATTGGGATATTTCCCGCATGTTGAGATTAAAAATGCACTAAAATTAGAGGTAAGAATTGATAAGTAAATTACATGTAGTTTTCTTTAGATTTCCTTTTGGTTAGAAATGGTTTGTGCACCGACTACCATAGTGATTATTCGGTTAAAATACAATTTCAGGTAAGCCTAATTGGACCAATTAGACTCTAACCAAAACTATAAATTTATCAATTGCTTTTCCGTCTTAATCTCTGTTGAGTTCGATAAGCAAATGTTTTTTAATTAACTCTAAATTGAATAAGTTGGTGGTGAAGATGATCACATTCCCTATTTGGATGTGTGCACAATTCTCGAAAATCTTAGTTATGTTGAATTGAAATAACCGAAAATGTATCGATTAAGTATGGAAGAGAATTATGCTTTTTTTGTTGGATTTGGTTGTAACAGTATGATCTAGAATTTTAGATTTTCATTTCAATACGAAATCAGAAGTAAATGAAATAAGAAAATACATGCGAGTTGATAAGTGATGAGAACAAGAAAAAGGAATTGAAGAGAGAGAGAGAGAGAGAGGTTAGGAAAAAGAAATTAAAATTGTCTTTGTTTGTAATGGGAAACCCCAAACACGTGC

Coding sequence (CDS)

ATGGAAATTGATGATCAAAACCCATCAACTGCTGCGGAATCTCCTGAAACCCTTCCTGGTGGTGAAAATGAGGAACTTGATAATCCAGCCGAACCAACTCAACCTGCAGCTACATCAGTTATTCCACCTTCAATTGTCCCTGCCATTGCTCCTATCCCCCCTCCAATTATCCGTCCATTGGCTCCTCTTCCGAGCCGACCTCCCCTCTTTAGGCCCCCTGTAACACAAAATGGTGAGCTGAGAACAAGTGACTCAGACTCGGAACATGATGAATTGGCTCCTTCCCGAACAGCTCCAGGTTCAACTGCAGAGTATGAAATCTCAGAAGAGAGCAGGCAAGCTAGGGAGCGTCATGAAAAAGCCATGCAGGAATTTTTGATGAAGCGTCGTGCTTCTGCTCTAGCAGTGCCTACTAATGACATGGCTGTTCGAGCTCGTCTTCGACGTCTTGGTGAGCCCATAACTCTTTTTGGAGAAAGAGAAATGGAAAGACGGGACAGGTTGCGGTCGATAATGGCACGATTGGATGCTGAAGGGCAGTTAGAGAAGCTCATGAAAGTTCACGAGGAAGAGGAGGCTGCAGCTACTGGTGGAACTGAGGAGGCTGAGGAGGAAGTGCTTCAGTATCCCTTTTATACTGAGGGGTCCAAAGCTCTCTTGGATGCAAGAATTGATATTGCAAAGTATTCCATCCTAAGAGCATCTTCACGCCTGGAGCGTGCAAAGAGGAAACGGGATGACCCAGATGAAGATGTGGAAGCGGAAATGGATTGGGCTCTGAGACAGGCTGAAAGTTTGGTCCTGGACTGTAGTGAGATAGGAGATGATCGACCACTTTCTGGTTGTTCTTTCTCGTCTGATGGAAAATTTCTTGCCACTAGTTCACTGAGTGGAGTTGCAAAGTTGTGGAGCATGCCTCAAGTAAGGAAGGTTTCCAACTTTAAGGGACACACAGAGCGTGTTACTGATGTAATGTTTTCTCCAGTGAACGAGTGTTTAGCAACTGCCTCTGCTGACCGAACTGCAAGGTTGTGGTCTGCGGAAGGATCTCTACTTAAAACATTTGAGGGCCATCTAGACCGCCTTGCACGAATTGCCTTCCACCCGTCGGGCAAGTACTTGGGCACAACTAGCTTTGACAAGACCTGGAGATTATGGGATGTTGAAACTGGTGTAGAATTACTTCTTCAAGAAGGTCACAGTAGAAGTGTCTATGGGATAGCCTTCCACCATGATGGATCCTTGGTATCATCTTGTGGACTTGATGCGCTTGCTCGCGTTTGGGATCTTCGAACTGGTAGAAGTGTTCTTGCCTTGGAAGGCCACGTCAAGCCAGTCCTTGGAGTTAGTTTTTCACCCAACGGTTATCATTTAGCTACTGGTGGAGAAGATAACACTTGTCGAATATGGGATTTAAGGAAGAAAAAATCTCTTTACATAATACCTGCACATTCAAACTTAGTTTCACAGGTGAAATATGAACCTCAGGAGGGATATTTCTTGGTTACTGCATCATTTGATATGACAGCAAAGATTTGGTCTGCTCGAGATTTTAAGCCTGTGAAGACACTCTCTGGTCACGAAGCAAAAGTTACATCTTTGGATATAATTTCAGACGGACAGTGTATTGCAACCGTCTCACATGATCGGGCCATAAAGCTCTGGTCTGTTAATAGTAAAGACATTCAGACTATGGACGTTGATTGA
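Because the CDS is a contiguous part of the spliced mRNA, the lengths of the untranslated regions can be estimated by locating the CDS inside the mRNA sequence. The sketch below is illustrative only: `utr_lengths` is a hypothetical helper, and the two strings are short toy stand-ins, not the full sequences from this record.

```python
def utr_lengths(mrna: str, cds: str) -> tuple[int, int]:
    """Return (5' UTR length, 3' UTR length) for a CDS found within an mRNA."""
    start = mrna.find(cds)  # first occurrence of the CDS in the mRNA
    if start < 0:
        raise ValueError("CDS not found in mRNA")
    return start, len(mrna) - start - len(cds)

# Toy fragments (assumption: shortened for illustration, not the real record).
cds_fragment = "ATGGAAATTGATGATTGA"
mrna_fragment = "GCAAGCTATA" + cds_fragment + "CTTTTCATAG"

five_prime, three_prime = utr_lengths(mrna_fragment, cds_fragment)
print(five_prime, three_prime)
```

To apply this to the record, paste the full mRNA and CDS sequences in place of the toy fragments.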

Protein sequence

MEIDDQNPSTAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRPLAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISDGQCIATVSHDRAIKLWSVNSKDIQTMDVD*
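The protein sequence above is the conceptual translation of the CDS. The following sketch shows how that translation can be reproduced, assuming the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 13 codons and last 6 codons of the CDS shown in this record.
cds_start = "ATGGAAATTGATGATCAAAACCCATCAACTGCTGCGGAA"
cds_end = "ACTATGGACGTTGATTGA"

print(translate(cds_start))  # start of the protein
print(translate(cds_end))    # end of the protein, including the stop
```

The two fragments translate to `MEIDDQNPSTAAE` and `TMDVD*`, matching the beginning and end of the protein sequence listed above.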
Homology
BLAST of CsGy6G020435.1 vs. ExPASy Swiss-Prot
Match: O22212 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Arabidopsis thaliana OX=3702 GN=LIS PE=2 SV=1)

HSP 1 Score: 708.0 bits (1826), Expect = 8.7e-203
Identity = 366/570 (64.21%), Positives = 439/570 (77.02%), Query Frame = 0

Query: 5   DQNPSTAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRPLAPLP 64
           D N S AA +  + P       D  + P   A   V+PPS  P +APIP   + P  P+ 
Sbjct: 6   DDNVSLAATAQISAPPVLQ---DASSLPGFSAIPPVVPPSFPPPMAPIP---MMPHPPV- 65

Query: 65  SRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHEKAMQE 124
           +RPP FRPPV+QNG ++TSDSDSE D+              EISEES+Q RER EKA+Q+
Sbjct: 66  ARPPTFRPPVSQNGGVKTSDSDSESDD-----------EHIEISEESKQVRERQEKALQD 125

Query: 125 FLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEGQLEKL 184
            L+KRRA+A+AVPTND AVR RLRRLGEPITLFGE+EMERR RL  ++ R D  GQL+KL
Sbjct: 126 LLVKRRAAAMAVPTNDKAVRDRLRRLGEPITLFGEQEMERRARLTQLLTRYDINGQLDKL 185

Query: 185 MKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLERAKRK 244
           +K HEE+        EE ++EVL+YPF+TEG K L +ARI+IAK+S+ RA+ R++RAKR+
Sbjct: 186 VKDHEEDVTP----KEEVDDEVLEYPFFTEGPKELREARIEIAKFSVKRAAVRIQRAKRR 245

Query: 245 RDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGVAKLWS 304
           RDDPDED++AE  WAL+ A+ + LDCS  GDDRPL+GCSFS DGK LAT SLSGV KLW 
Sbjct: 246 RDDPDEDMDAETKWALKHAKHMALDCSNFGDDRPLTGCSFSRDGKILATCSLSGVTKLWE 305

Query: 305 MPQV-RKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHLDRLA 364
           MPQV   ++  K H ER TDV+FSPV++CLATASADRTA+LW  +G+LL+TFEGHLDRLA
Sbjct: 306 MPQVTNTIAVLKDHKERATDVVFSPVDDCLATASADRTAKLWKTDGTLLQTFEGHLDRLA 365

Query: 365 RIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSCGLDA 424
           R+AFHPSGKYLGTTS+DKTWRLWD+ TG ELLLQEGHSRSVYGIAF  DG+L +SCGLD+
Sbjct: 366 RVAFHPSGKYLGTTSYDKTWRLWDINTGAELLLQEGHSRSVYGIAFQQDGALAASCGLDS 425

Query: 425 LARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLYIIPA 484
           LARVWDLRTGRS+L  +GH+KPV  V+FSPNGYHLA+GGEDN CRIWDLR +KSLYIIPA
Sbjct: 426 LARVWDLRTGRSILVFQGHIKPVFSVNFSPNGYHLASGGEDNQCRIWDLRMRKSLYIIPA 485

Query: 485 HSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISDGQCI 544
           H+NLVSQVKYEPQEGYFL TAS+DM   IWS RDF  VK+L+GHE+KV SLDI +D  CI
Sbjct: 486 HANLVSQVKYEPQEGYFLATASYDMKVNIWSGRDFSLVKSLAGHESKVASLDITADSSCI 545

Query: 545 ATVSHDRAIKLWSVNSKD-----IQTMDVD 569
           ATVSHDR IKLW+ +  D      +TMD+D
Sbjct: 546 ATVSHDRTIKLWTSSGNDDEDEEKETMDID 553
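The identity figures reported for each hit are counts of matching columns over the aligned length. As a rough sketch (the helper `alignment_stats` is hypothetical and not part of BLAST), the final block of the alignment above can be checked column by column, and the reported percentage can be recomputed from the 366/570 counts.

```python
def alignment_stats(query: str, sbjct: str) -> dict:
    """Count aligned columns, identical residues, and gapped columns."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    pairs = list(zip(query, sbjct))
    identities = sum(1 for q, s in pairs if q == s and q != "-")
    gaps = sum(1 for q, s in pairs if q == "-" or s == "-")
    return {"columns": len(pairs), "identities": identities, "gaps": gaps}

# Last alignment block of the O22212 hit (query 545 onward).
query_block = "ATVSHDRAIKLWSVNSKD-----IQTMDVD"
sbjct_block = "ATVSHDRTIKLWTSSGNDDEDEEKETMDID"
stats = alignment_stats(query_block, sbjct_block)

# Percent identity for the whole HSP, from the counts in the header line.
reported_identity = round(100 * 366 / 570, 2)
print(stats, reported_identity)
```

For this block the count is 16 identities and 5 gap columns over 30 aligned columns, and the header counts reproduce the reported 64.21%.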

BLAST of CsGy6G020435.1 vs. ExPASy Swiss-Prot
Match: Q3MHE2 (U4/U6 small nuclear ribonucleoprotein Prp4 OS=Bos taurus OX=9913 GN=PRPF4 PE=2 SV=1)

HSP 1 Score: 413.7 bits (1062), Expect = 3.4e-114
Identity = 214/459 (46.62%), Positives = 296/459 (64.49%), Query Frame = 0

Query: 106 EISEESRQARERHEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERR 165
           E+ E      ER  + + EF  ++RA  + V T+D  V+A LR LGEPITLFGE   ERR
Sbjct: 70  EVFEIEEHISERQAEVLAEFERRKRARQINVSTDDSEVKACLRALGEPITLFGEGPAERR 129

Query: 166 DRLRSIMARLDAEGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARID 225
           +RLR+I++ +  +     L K  +++E +      +  +E  Q  +Y EG  +L  AR+ 
Sbjct: 130 ERLRNILSVVGTDA----LKKTKKDDEKS------KKSKEEYQQTWYHEGPHSLKVARLW 189

Query: 226 IAKYSILRASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFS 285
           IA YS+ RA  RLE A+  ++ P+    ++M    +   SL   CS+IGDDRP+S C FS
Sbjct: 190 IANYSLPRAMKRLEEARLHKEIPETTRTSQMQELHKSLRSLNNFCSQIGDDRPISYCHFS 249

Query: 286 SDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNEC--------LATAS 345
            + K LAT+  SG+ KLWS+P    +   +GH   V  ++F P +          LA+ +
Sbjct: 250 PNSKMLATACWSGLCKLWSVPDCNLLHTLRGHNTNVGAIVFHPKSTVSLDQKDVNLASCA 309

Query: 346 ADRTARLWSAEG-SLLKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLL 405
           AD + +LWS +    +   EGH  R+AR+ +HPSG++LGTT +D++WRLWD+E   E+L 
Sbjct: 310 ADGSVKLWSLDSDEPVADIEGHTVRVARVTWHPSGRFLGTTCYDRSWRLWDLEAQEEILH 369

Query: 406 QEGHSRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGY 465
           QEGHS  VY IAFH DGSL  + GLDA  RVWDLRTGR ++ LEGH+K + G++FSPNGY
Sbjct: 370 QEGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWDLRTGRCIMFLEGHLKEIYGINFSPNGY 429

Query: 466 HLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSAR 525
           H+ATG  DNTC++WDLR+++ +Y IPAH NLV+ VK+EP  G FL+T ++D TAKIW+  
Sbjct: 430 HIATGSGDNTCKVWDLRQRRCVYTIPAHQNLVTGVKFEPIHGNFLLTGAYDNTAKIWTHP 489

Query: 526 DFKPVKTLSGHEAKVTSLDIISDGQCIATVSHDRAIKLW 556
            + P+KTL+GHE KV  LDI SDGQ IAT S+DR  KLW
Sbjct: 490 GWSPLKTLAGHEGKVMGLDISSDGQLIATCSYDRTFKLW 518

BLAST of CsGy6G020435.1 vs. ExPASy Swiss-Prot
Match: O43172 (U4/U6 small nuclear ribonucleoprotein Prp4 OS=Homo sapiens OX=9606 GN=PRPF4 PE=1 SV=2)

HSP 1 Score: 413.7 bits (1062), Expect = 3.4e-114
Identity = 214/459 (46.62%), Positives = 296/459 (64.49%), Query Frame = 0

Query: 106 EISEESRQARERHEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERR 165
           E+ E      ER  + + EF  ++RA  + V T+D  V+A LR LGEPITLFGE   ERR
Sbjct: 71  EVFEIEEHISERQAEVLAEFERRKRARQINVSTDDSEVKACLRALGEPITLFGEGPAERR 130

Query: 166 DRLRSIMARLDAEGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARID 225
           +RLR+I++ +  +     L K  +++E +      +  +E  Q  +Y EG  +L  AR+ 
Sbjct: 131 ERLRNILSVVGTDA----LKKTKKDDEKS------KKSKEEYQQTWYHEGPNSLKVARLW 190

Query: 226 IAKYSILRASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFS 285
           IA YS+ RA  RLE A+  ++ P+    ++M    +   SL   CS+IGDDRP+S C FS
Sbjct: 191 IANYSLPRAMKRLEEARLHKEIPETTRTSQMQELHKSLRSLNNFCSQIGDDRPISYCHFS 250

Query: 286 SDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNEC--------LATAS 345
            + K LAT+  SG+ KLWS+P    +   +GH   V  ++F P +          LA+ +
Sbjct: 251 PNSKMLATACWSGLCKLWSVPDCNLLHTLRGHNTNVGAIVFHPKSTVSLDPKDVNLASCA 310

Query: 346 ADRTARLWSAEG-SLLKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLL 405
           AD + +LWS +    +   EGH  R+AR+ +HPSG++LGTT +D++WRLWD+E   E+L 
Sbjct: 311 ADGSVKLWSLDSDEPVADIEGHTVRVARVMWHPSGRFLGTTCYDRSWRLWDLEAQEEILH 370

Query: 406 QEGHSRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGY 465
           QEGHS  VY IAFH DGSL  + GLDA  RVWDLRTGR ++ LEGH+K + G++FSPNGY
Sbjct: 371 QEGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWDLRTGRCIMFLEGHLKEIYGINFSPNGY 430

Query: 466 HLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSAR 525
           H+ATG  DNTC++WDLR+++ +Y IPAH NLV+ VK+EP  G FL+T ++D TAKIW+  
Sbjct: 431 HIATGSGDNTCKVWDLRQRRCVYTIPAHQNLVTGVKFEPIHGNFLLTGAYDNTAKIWTHP 490

Query: 526 DFKPVKTLSGHEAKVTSLDIISDGQCIATVSHDRAIKLW 556
            + P+KTL+GHE KV  LDI SDGQ IAT S+DR  KLW
Sbjct: 491 GWSPLKTLAGHEGKVMGLDISSDGQLIATCSYDRTFKLW 519

BLAST of CsGy6G020435.1 vs. ExPASy Swiss-Prot
Match: Q5NVD0 (U4/U6 small nuclear ribonucleoprotein Prp4 OS=Pongo abelii OX=9601 GN=PRPF4 PE=2 SV=1)

HSP 1 Score: 413.7 bits (1062), Expect = 3.4e-114
Identity = 214/459 (46.62%), Positives = 296/459 (64.49%), Query Frame = 0

Query: 106 EISEESRQARERHEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERR 165
           E+ E      ER  + + EF  ++RA  + V T+D  V+A LR LGEPITLFGE   ERR
Sbjct: 70  EVFEIEEHISERQAEVLAEFERRKRARQINVSTDDSEVKACLRALGEPITLFGEGPAERR 129

Query: 166 DRLRSIMARLDAEGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARID 225
           +RLR+I++ +  +     L K  +++E +      +  +E  Q  +Y EG  +L  AR+ 
Sbjct: 130 ERLRNILSVVGTDA----LKKTKKDDEKS------KKSKEEYQQTWYHEGPNSLKVARLW 189

Query: 226 IAKYSILRASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFS 285
           IA YS+ RA  RLE A+  ++ P+    ++M    +   SL   CS+IGDDRP+S C FS
Sbjct: 190 IANYSLPRAMKRLEEARLHKEIPETTRASQMQELHKSLRSLNNFCSQIGDDRPISYCHFS 249

Query: 286 SDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNEC--------LATAS 345
            + K LAT+  SG+ KLWS+P    +   +GH   V  ++F P +          LA+ +
Sbjct: 250 PNSKMLATACWSGLCKLWSVPDCNLLHTLRGHNTNVGAIVFHPKSTVSLDQKDVNLASCA 309

Query: 346 ADRTARLWSAEG-SLLKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLL 405
           AD + +LWS +    +   EGH  R+AR+ +HPSG++LGTT +D++WRLWD+E   E+L 
Sbjct: 310 ADGSVKLWSLDSDEPVADIEGHTVRVARVMWHPSGRFLGTTCYDRSWRLWDLEAQEEILH 369

Query: 406 QEGHSRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGY 465
           QEGHS  VY IAFH DGSL  + GLDA  RVWDLRTGR ++ LEGH+K + G++FSPNGY
Sbjct: 370 QEGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWDLRTGRCIMFLEGHLKEIYGINFSPNGY 429

Query: 466 HLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSAR 525
           H+ATG  DNTC++WDLR+++ +Y IPAH NLV+ VK+EP  G FL+T ++D TAKIW+  
Sbjct: 430 HIATGSGDNTCKVWDLRQRRCVYTIPAHQNLVTGVKFEPIHGNFLLTGAYDNTAKIWTHP 489

Query: 526 DFKPVKTLSGHEAKVTSLDIISDGQCIATVSHDRAIKLW 556
            + P+KTL+GHE KV  LDI SDGQ IAT S+DR  KLW
Sbjct: 490 GWSPLKTLAGHEGKVMGLDISSDGQLIATCSYDRTFKLW 518

BLAST of CsGy6G020435.1 vs. ExPASy Swiss-Prot
Match: Q9DAW6 (U4/U6 small nuclear ribonucleoprotein Prp4 OS=Mus musculus OX=10090 GN=Prpf4 PE=1 SV=1)

HSP 1 Score: 412.9 bits (1060), Expect = 5.8e-114
Identity = 214/459 (46.62%), Positives = 296/459 (64.49%), Query Frame = 0

Query: 106 EISEESRQARERHEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERR 165
           E+ E      ER  + + EF  ++RA  + V T+D  V+A LR LGEPITLFGE   ERR
Sbjct: 70  EVFEIEEHISERQAEVLAEFERRKRARQINVSTDDSEVKACLRALGEPITLFGEGPAERR 129

Query: 166 DRLRSIMARLDAEGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARID 225
           +RLR+I++ +  +     L K  +++E +      +  +E  Q  +Y EG  +L  AR+ 
Sbjct: 130 ERLRNILSVVGTDA----LKKTKKDDEKS------KKSKEEYQQTWYHEGPNSLKVARLW 189

Query: 226 IAKYSILRASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFS 285
           IA YS+ RA  RLE A+  ++ P+    ++M    +   SL   CS+IGDDRP+S C FS
Sbjct: 190 IANYSLPRAMKRLEEARLHKEIPETTRTSQMQELHKSLRSLNNFCSQIGDDRPISYCHFS 249

Query: 286 SDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNEC--------LATAS 345
            + K LAT+  SG+ KLWS+P    +   +GH   V  ++F P +          LA+ +
Sbjct: 250 PNSKMLATACWSGLCKLWSVPDCSLLHTLRGHNTNVGAIVFHPKSTVSLDQKDVNLASCA 309

Query: 346 ADRTARLWSAEG-SLLKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLL 405
           AD + +LWS +    +   EGH  R+AR+ +HPSG++LGTT +D++WRLWD+E   E+L 
Sbjct: 310 ADGSVKLWSLDSDEPVADIEGHTVRVARVMWHPSGRFLGTTCYDRSWRLWDLEAQEEILH 369

Query: 406 QEGHSRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGY 465
           QEGHS  VY IAFH DGSL  + GLDA  RVWDLRTGR ++ LEGH+K + G++FSPNGY
Sbjct: 370 QEGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWDLRTGRCIMFLEGHLKEIYGINFSPNGY 429

Query: 466 HLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSAR 525
           H+ATG  DNTC++WDLR+++ +Y IPAH NLV+ VK+EP  G FL+T ++D TAKIW+  
Sbjct: 430 HIATGSGDNTCKVWDLRQRRCVYTIPAHQNLVTGVKFEPIHGDFLLTGAYDNTAKIWTHP 489

Query: 526 DFKPVKTLSGHEAKVTSLDIISDGQCIATVSHDRAIKLW 556
            + P+KTL+GHE KV  LDI SDGQ IAT S+DR  KLW
Sbjct: 490 GWSPLKTLAGHEGKVMGLDISSDGQLIATCSYDRTFKLW 518

BLAST of CsGy6G020435.1 vs. NCBI nr
Match: XP_004146749.1 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis sativus] >XP_031742738.1 U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis sativus] >KGN47802.1 hypothetical protein Csa_003570 [Cucumis sativus])

HSP 1 Score: 1095 bits (2833), Expect = 0.0
Identity = 567/569 (99.65%), Positives = 567/569 (99.65%), Query Frame = 0

Query: 1   MEIDDQNP-STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60
           MEIDDQNP STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP
Sbjct: 1   MEIDDQNPASTAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60

Query: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120
           LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE
Sbjct: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120

Query: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180
           KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG
Sbjct: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180

Query: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240
           QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE
Sbjct: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240

Query: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300
           RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV
Sbjct: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300

Query: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 360
           AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL
Sbjct: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 360

Query: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 420
           DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC
Sbjct: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 420

Query: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480
           GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY
Sbjct: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480

Query: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540
           IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD
Sbjct: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540

Query: 541 GQCIATVSHDRAIKLWSVNSKDIQTMDVD 568
           GQCIATVSHDR IKLWSVNSKDIQTMDVD
Sbjct: 541 GQCIATVSHDRTIKLWSVNSKDIQTMDVD 569

BLAST of CsGy6G020435.1 vs. NCBI nr
Match: XP_008464716.1 (PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis melo] >XP_016903247.1 PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis melo] >XP_016903248.1 PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis melo] >XP_016903249.1 PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis melo])

HSP 1 Score: 1086 bits (2809), Expect = 0.0
Identity = 562/569 (98.77%), Positives = 564/569 (99.12%), Query Frame = 0

Query: 1   MEIDDQNP-STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60
           MEIDDQNP STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP
Sbjct: 1   MEIDDQNPASTAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60

Query: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120
           LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAP+RTAPGSTAEYEISEESRQARERHE
Sbjct: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPARTAPGSTAEYEISEESRQARERHE 120

Query: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180
           KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG
Sbjct: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180

Query: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240
           QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE
Sbjct: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240

Query: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300
           RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV
Sbjct: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300

Query: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 360
           AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWS EGSLLKTFEGHL
Sbjct: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSPEGSLLKTFEGHL 360

Query: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 420
           DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFH DGSLVSSC
Sbjct: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHQDGSLVSSC 420

Query: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480
           GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY
Sbjct: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480

Query: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540
           IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD
Sbjct: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540

Query: 541 GQCIATVSHDRAIKLWSVNSKDIQTMDVD 568
           GQCIATVSHDR IKLWSVNSKD QTMD+D
Sbjct: 541 GQCIATVSHDRTIKLWSVNSKDKQTMDID 569

BLAST of CsGy6G020435.1 vs. NCBI nr
Match: XP_038885479.1 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Benincasa hispida] >XP_038885480.1 U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Benincasa hispida])

HSP 1 Score: 1070 bits (2766), Expect = 0.0
Identity = 551/569 (96.84%), Positives = 560/569 (98.42%), Query Frame = 0

Query: 1   MEIDDQNP-STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60
           ME+DDQNP STAAESPETLPGGENE+LDNPAEP QPAATSVIPPS+VP+IAPIPPPIIRP
Sbjct: 1   MEVDDQNPASTAAESPETLPGGENEDLDNPAEPIQPAATSVIPPSVVPSIAPIPPPIIRP 60

Query: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120
           LAPLPSRPP FRPPVTQNGE+RTSDSDSEHDELAPSRTA GSTAEYE+SEESRQ RER E
Sbjct: 61  LAPLPSRPPHFRPPVTQNGEMRTSDSDSEHDELAPSRTAGGSTAEYEVSEESRQVRERQE 120

Query: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180
           KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG
Sbjct: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180

Query: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240
           QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRA+SRLE
Sbjct: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRAASRLE 240

Query: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300
           RAKRKRDDPDEDVEAEMDWALRQA SLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV
Sbjct: 241 RAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300

Query: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 360
           AKLWSMPQVRKVSNFKGHTERVTDV+FSPVNECLATASADRTARLWSAEGSLLKTFEGHL
Sbjct: 301 AKLWSMPQVRKVSNFKGHTERVTDVIFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 360

Query: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 420
           DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC
Sbjct: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 420

Query: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480
           GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY
Sbjct: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480

Query: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540
           IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD
Sbjct: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540

Query: 541 GQCIATVSHDRAIKLWSVNSKDIQTMDVD 568
           GQCIATVSHDR IKLWSVNSKD QTMD+D
Sbjct: 541 GQCIATVSHDRTIKLWSVNSKDEQTMDID 569

BLAST of CsGy6G020435.1 vs. NCBI nr
Match: KAA0040989.1 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis melo var. makuwa] >TYK20339.1 U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 1062 bits (2746), Expect = 0.0
Identity = 553/569 (97.19%), Positives = 555/569 (97.54%), Query Frame = 0

Query: 1   MEIDDQNP-STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60
           MEIDDQNP STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP
Sbjct: 1   MEIDDQNPASTAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60

Query: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120
           LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAP+RTAPGSTAEYEISEESRQARERHE
Sbjct: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPARTAPGSTAEYEISEESRQARERHE 120

Query: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180
           KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG
Sbjct: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180

Query: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240
           QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE
Sbjct: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240

Query: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300
           RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSF         SSLSGV
Sbjct: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSF---------SSLSGV 300

Query: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 360
           AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWS EGSLLKTFEGHL
Sbjct: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSPEGSLLKTFEGHL 360

Query: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 420
           DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFH DGSLVSSC
Sbjct: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHQDGSLVSSC 420

Query: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480
           GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY
Sbjct: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480

Query: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540
           IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD
Sbjct: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540

Query: 541 GQCIATVSHDRAIKLWSVNSKDIQTMDVD 568
           GQCIATVSHDR IKLWSVNSKD QTMD+D
Sbjct: 541 GQCIATVSHDRTIKLWSVNSKDKQTMDID 560
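
The percentages in each HSP summary line are simply the identity and positive counts divided by the alignment length. A minimal Python sketch of that arithmetic follows; the helper name `parse_hsp_summary` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern accepts any label beginning with "Pos" for the second field.

```python
import re

# Hypothetical pattern for an HSP summary line as printed on this page.
# "Pos\w+" is deliberately loose so that label spelling variants still match.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Pos\w+ = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_summary(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, length, positives, pos_length = (int(g) for g in match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": length,
        # Percentages are recomputed rather than copied from the line.
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / pos_length, 2),
    }


summary = parse_hsp_summary(
    "Identity = 553/569 (97.19%), Positives = 555/569 (97.54%), Query Frame = 0"
)
print(summary["identity_pct"], summary["positive_pct"])
```

Recomputing the percentages instead of trusting the printed values gives a cheap consistency check when many hits are parsed in bulk.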

BLAST of CsGy6G020435.1 vs. NCBI nr
Match: XP_022999661.1 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucurbita maxima])

HSP 1 Score: 1043 bits (2697), Expect = 0.0
Identity = 536/570 (94.04%), Positives = 554/570 (97.19%), Query Frame = 0

Query: 1   MEIDDQNP-STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPII-R 60
           M++DDQNP STAAESPE LPGGENE+LDNPAEP QPAAT+VIP SIVP+IAPIPPP+I R
Sbjct: 1   MDVDDQNPASTAAESPEILPGGENEDLDNPAEPMQPAATTVIPSSIVPSIAPIPPPLITR 60

Query: 61  PLAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERH 120
           PLAPLPSRP LFRPPV QNGE+RTSDSDSEHDELAPSR   GSTAEYE+SEESRQ RER 
Sbjct: 61  PLAPLPSRPLLFRPPVAQNGEMRTSDSDSEHDELAPSRATQGSTAEYEVSEESRQVRERQ 120

Query: 121 EKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAE 180
           EKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAE
Sbjct: 121 EKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAE 180

Query: 181 GQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRL 240
           GQLEKL+KVHEEEEAAATGGTEEAEEEVLQYPFYTEG KALLDARIDIAKYSI+RA+SRL
Sbjct: 181 GQLEKLLKVHEEEEAAATGGTEEAEEEVLQYPFYTEGPKALLDARIDIAKYSIVRAASRL 240

Query: 241 ERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSG 300
           ERAKRKRDDPDEDVEAEMDWALRQAESL+LDCSEIGDDRPLSGCSFSSDGKFLATSSLSG
Sbjct: 241 ERAKRKRDDPDEDVEAEMDWALRQAESLILDCSEIGDDRPLSGCSFSSDGKFLATSSLSG 300

Query: 301 VAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGH 360
           VAK+WSMPQVRKVSNFKGHTERVTDV+FSPVNECLATASADRTARLWSAEGSLL+TFEGH
Sbjct: 301 VAKMWSMPQVRKVSNFKGHTERVTDVIFSPVNECLATASADRTARLWSAEGSLLRTFEGH 360

Query: 361 LDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSS 420
           LDRLARIAFHPSGKYLGTTSFDKTWRLWD+ETGVELLLQEGHSRSVYGI FHHDGSLVSS
Sbjct: 361 LDRLARIAFHPSGKYLGTTSFDKTWRLWDIETGVELLLQEGHSRSVYGIDFHHDGSLVSS 420

Query: 421 CGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSL 480
           CGLDALARVWDLRTGRSVLALEGHVKPVLGV+FSPNGYHLATGGEDNTCRIWDLRKKKSL
Sbjct: 421 CGLDALARVWDLRTGRSVLALEGHVKPVLGVNFSPNGYHLATGGEDNTCRIWDLRKKKSL 480

Query: 481 YIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIIS 540
           YIIPAHSNL+SQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIIS
Sbjct: 481 YIIPAHSNLISQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIIS 540

Query: 541 DGQCIATVSHDRAIKLWSVNSKDIQTMDVD 568
           DGQCIATVSHDR IKLWSVNSKD QTMDVD
Sbjct: 541 DGQCIATVSHDRTIKLWSVNSKDEQTMDVD 570
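
The identity count reported for an HSP can be cross-checked against the aligned rows themselves: an identity is a column in which the Query and Sbjct residues are the same. The Python sketch below counts such columns for the last block of the alignment above; the function name `count_identities` is a hypothetical helper, and it assumes both rows have already been stripped of their labels and coordinates.

```python
def count_identities(query, subject):
    """Count alignment columns where both rows carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    # Gap characters ('-') never count as identities.
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")


# Final block of the alignment above (residues only).
query = "DGQCIATVSHDRAIKLWSVNSKDIQTMDVD"
sbjct = "DGQCIATVSHDRTIKLWSVNSKDEQTMDVD"

identical = count_identities(query, sbjct)
print(identical, "of", len(query), "columns identical")
```

Summing this count over every block of an alignment should reproduce the numerator of the "Identity" field, assuming the blocks are transcribed without loss.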

BLAST of CsGy6G020435.1 vs. ExPASy TrEMBL
Match: A0A0A0KHD8 (WD_REPEATS_REGION domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G404170 PE=4 SV=1)

HSP 1 Score: 1095 bits (2833), Expect = 0.0
Identity = 567/569 (99.65%), Positives = 567/569 (99.65%), Query Frame = 0

Query: 1   MEIDDQNP-STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60
           MEIDDQNP STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP
Sbjct: 1   MEIDDQNPASTAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60

Query: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120
           LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE
Sbjct: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120

Query: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180
           KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG
Sbjct: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180

Query: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240
           QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE
Sbjct: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240

Query: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300
           RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV
Sbjct: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300

Query: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 360
           AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL
Sbjct: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 360

Query: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 420
           DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC
Sbjct: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 420

Query: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480
           GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY
Sbjct: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480

Query: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540
           IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD
Sbjct: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540

Query: 541 GQCIATVSHDRAIKLWSVNSKDIQTMDVD 568
           GQCIATVSHDR IKLWSVNSKDIQTMDVD
Sbjct: 541 GQCIATVSHDRTIKLWSVNSKDIQTMDVD 569

BLAST of CsGy6G020435.1 vs. ExPASy TrEMBL
Match: A0A1S4E4U8 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Cucumis melo OX=3656 GN=LOC103502534 PE=4 SV=1)

HSP 1 Score: 1086 bits (2809), Expect = 0.0
Identity = 562/569 (98.77%), Positives = 564/569 (99.12%), Query Frame = 0

Query: 1   MEIDDQNP-STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60
           MEIDDQNP STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP
Sbjct: 1   MEIDDQNPASTAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60

Query: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120
           LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAP+RTAPGSTAEYEISEESRQARERHE
Sbjct: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPARTAPGSTAEYEISEESRQARERHE 120

Query: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180
           KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG
Sbjct: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180

Query: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240
           QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE
Sbjct: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240

Query: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300
           RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV
Sbjct: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300

Query: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 360
           AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWS EGSLLKTFEGHL
Sbjct: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSPEGSLLKTFEGHL 360

Query: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 420
           DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFH DGSLVSSC
Sbjct: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHQDGSLVSSC 420

Query: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480
           GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY
Sbjct: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480

Query: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540
           IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD
Sbjct: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540

Query: 541 GQCIATVSHDRAIKLWSVNSKDIQTMDVD 568
           GQCIATVSHDR IKLWSVNSKD QTMD+D
Sbjct: 541 GQCIATVSHDRTIKLWSVNSKDKQTMDID 569

BLAST of CsGy6G020435.1 vs. ExPASy TrEMBL
Match: A0A5A7TI50 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold228G00300 PE=4 SV=1)

HSP 1 Score: 1062 bits (2746), Expect = 0.0
Identity = 553/569 (97.19%), Positives = 555/569 (97.54%), Query Frame = 0

Query: 1   MEIDDQNP-STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60
           MEIDDQNP STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP
Sbjct: 1   MEIDDQNPASTAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRP 60

Query: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHE 120
           LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAP+RTAPGSTAEYEISEESRQARERHE
Sbjct: 61  LAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPARTAPGSTAEYEISEESRQARERHE 120

Query: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180
           KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG
Sbjct: 121 KAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEG 180

Query: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240
           QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE
Sbjct: 181 QLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLE 240

Query: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGV 300
           RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSF         SSLSGV
Sbjct: 241 RAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSF---------SSLSGV 300

Query: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHL 360
           AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWS EGSLLKTFEGHL
Sbjct: 301 AKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSPEGSLLKTFEGHL 360

Query: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSC 420
           DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFH DGSLVSSC
Sbjct: 361 DRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHQDGSLVSSC 420

Query: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480
           GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY
Sbjct: 421 GLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLY 480

Query: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540
           IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD
Sbjct: 481 IIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISD 540

Query: 541 GQCIATVSHDRAIKLWSVNSKDIQTMDVD 568
           GQCIATVSHDR IKLWSVNSKD QTMD+D
Sbjct: 541 GQCIATVSHDRTIKLWSVNSKDKQTMDID 560

BLAST of CsGy6G020435.1 vs. ExPASy TrEMBL
Match: A0A6J1KDQ7 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Cucurbita maxima OX=3661 GN=LOC111493950 PE=4 SV=1)

HSP 1 Score: 1043 bits (2697), Expect = 0.0
Identity = 536/570 (94.04%), Positives = 554/570 (97.19%), Query Frame = 0

Query: 1   MEIDDQNP-STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPII-R 60
           M++DDQNP STAAESPE LPGGENE+LDNPAEP QPAAT+VIP SIVP+IAPIPPP+I R
Sbjct: 1   MDVDDQNPASTAAESPEILPGGENEDLDNPAEPMQPAATTVIPSSIVPSIAPIPPPLITR 60

Query: 61  PLAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERH 120
           PLAPLPSRP LFRPPV QNGE+RTSDSDSEHDELAPSR   GSTAEYE+SEESRQ RER 
Sbjct: 61  PLAPLPSRPLLFRPPVAQNGEMRTSDSDSEHDELAPSRATQGSTAEYEVSEESRQVRERQ 120

Query: 121 EKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAE 180
           EKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAE
Sbjct: 121 EKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAE 180

Query: 181 GQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRL 240
           GQLEKL+KVHEEEEAAATGGTEEAEEEVLQYPFYTEG KALLDARIDIAKYSI+RA+SRL
Sbjct: 181 GQLEKLLKVHEEEEAAATGGTEEAEEEVLQYPFYTEGPKALLDARIDIAKYSIVRAASRL 240

Query: 241 ERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSG 300
           ERAKRKRDDPDEDVEAEMDWALRQAESL+LDCSEIGDDRPLSGCSFSSDGKFLATSSLSG
Sbjct: 241 ERAKRKRDDPDEDVEAEMDWALRQAESLILDCSEIGDDRPLSGCSFSSDGKFLATSSLSG 300

Query: 301 VAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGH 360
           VAK+WSMPQVRKVSNFKGHTERVTDV+FSPVNECLATASADRTARLWSAEGSLL+TFEGH
Sbjct: 301 VAKMWSMPQVRKVSNFKGHTERVTDVIFSPVNECLATASADRTARLWSAEGSLLRTFEGH 360

Query: 361 LDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSS 420
           LDRLARIAFHPSGKYLGTTSFDKTWRLWD+ETGVELLLQEGHSRSVYGI FHHDGSLVSS
Sbjct: 361 LDRLARIAFHPSGKYLGTTSFDKTWRLWDIETGVELLLQEGHSRSVYGIDFHHDGSLVSS 420

Query: 421 CGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSL 480
           CGLDALARVWDLRTGRSVLALEGHVKPVLGV+FSPNGYHLATGGEDNTCRIWDLRKKKSL
Sbjct: 421 CGLDALARVWDLRTGRSVLALEGHVKPVLGVNFSPNGYHLATGGEDNTCRIWDLRKKKSL 480

Query: 481 YIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIIS 540
           YIIPAHSNL+SQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIIS
Sbjct: 481 YIIPAHSNLISQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIIS 540

Query: 541 DGQCIATVSHDRAIKLWSVNSKDIQTMDVD 568
           DGQCIATVSHDR IKLWSVNSKD QTMDVD
Sbjct: 541 DGQCIATVSHDRTIKLWSVNSKDEQTMDVD 570

BLAST of CsGy6G020435.1 vs. ExPASy TrEMBL
Match: A0A6J1G322 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Cucurbita moschata OX=3662 GN=LOC111450343 PE=4 SV=1)

HSP 1 Score: 1040 bits (2689), Expect = 0.0
Identity = 534/570 (93.68%), Positives = 553/570 (97.02%), Query Frame = 0

Query: 1   MEIDDQNP-STAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPII-R 60
           M++DDQNP STAAESPE LPGGENE+LDNPAEP QPAAT+VIP SIVP+IAPIPPP+I R
Sbjct: 1   MDVDDQNPASTAAESPEILPGGENEDLDNPAEPMQPAATTVIPSSIVPSIAPIPPPLITR 60

Query: 61  PLAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERH 120
           PLAPLPSRP LFRPPV QNGE+RTSDSDSEHDELAPSR   GSTAEYE+SEESRQ RER 
Sbjct: 61  PLAPLPSRPLLFRPPVAQNGEMRTSDSDSEHDELAPSRATQGSTAEYEVSEESRQVRERQ 120

Query: 121 EKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAE 180
           EKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAE
Sbjct: 121 EKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAE 180

Query: 181 GQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRL 240
           GQLEKL+KVHEEEEAAATGGTEEAEEEVLQYPFYTEG KALLDARIDIAKYS++RA+SRL
Sbjct: 181 GQLEKLLKVHEEEEAAATGGTEEAEEEVLQYPFYTEGPKALLDARIDIAKYSVVRAASRL 240

Query: 241 ERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSG 300
           ERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSG
Sbjct: 241 ERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSG 300

Query: 301 VAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGH 360
           VAK+WSMPQVRKVSNF GHTERVTDV+FSPVNECLATASADRTARLWSAEGSLL+TFEGH
Sbjct: 301 VAKMWSMPQVRKVSNFNGHTERVTDVIFSPVNECLATASADRTARLWSAEGSLLRTFEGH 360

Query: 361 LDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSS 420
           LDRLARIAFHPSGKYLGTTSFDKTWRLWD+ETGVELLLQEGHSRSVYGI FHHDGSLVSS
Sbjct: 361 LDRLARIAFHPSGKYLGTTSFDKTWRLWDIETGVELLLQEGHSRSVYGIDFHHDGSLVSS 420

Query: 421 CGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSL 480
           CGLDALARVWDLRTGRSVLALEGHVKPVLGV+FSPNGYHLATGGEDNTCRIWDLRKK+SL
Sbjct: 421 CGLDALARVWDLRTGRSVLALEGHVKPVLGVNFSPNGYHLATGGEDNTCRIWDLRKKRSL 480

Query: 481 YIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIIS 540
           YIIPAHSNL+SQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIIS
Sbjct: 481 YIIPAHSNLISQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIIS 540

Query: 541 DGQCIATVSHDRAIKLWSVNSKDIQTMDVD 568
           DGQCIATVSHDR IKLWSVNSKD QTMDVD
Sbjct: 541 DGQCIATVSHDRTIKLWSVNSKDEQTMDVD 570

BLAST of CsGy6G020435.1 vs. TAIR 10
Match: AT2G41500.1 (WD-40 repeat family protein / small nuclear ribonucleoprotein Prp4p-related )

HSP 1 Score: 708.0 bits (1826), Expect = 6.2e-204
Identity = 366/570 (64.21%), Positives = 439/570 (77.02%), Query Frame = 0

Query: 5   DQNPSTAAESPETLPGGENEELDNPAEPTQPAATSVIPPSIVPAIAPIPPPIIRPLAPLP 64
           D N S AA +  + P       D  + P   A   V+PPS  P +APIP   + P  P+ 
Sbjct: 6   DDNVSLAATAQISAPPVLQ---DASSLPGFSAIPPVVPPSFPPPMAPIP---MMPHPPV- 65

Query: 65  SRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARERHEKAMQE 124
           +RPP FRPPV+QNG ++TSDSDSE D+              EISEES+Q RER EKA+Q+
Sbjct: 66  ARPPTFRPPVSQNGGVKTSDSDSESDD-----------EHIEISEESKQVRERQEKALQD 125

Query: 125 FLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEGQLEKL 184
            L+KRRA+A+AVPTND AVR RLRRLGEPITLFGE+EMERR RL  ++ R D  GQL+KL
Sbjct: 126 LLVKRRAAAMAVPTNDKAVRDRLRRLGEPITLFGEQEMERRARLTQLLTRYDINGQLDKL 185

Query: 185 MKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSRLERAKRK 244
           +K HEE+        EE ++EVL+YPF+TEG K L +ARI+IAK+S+ RA+ R++RAKR+
Sbjct: 186 VKDHEEDVTP----KEEVDDEVLEYPFFTEGPKELREARIEIAKFSVKRAAVRIQRAKRR 245

Query: 245 RDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGVAKLWS 304
           RDDPDED++AE  WAL+ A+ + LDCS  GDDRPL+GCSFS DGK LAT SLSGV KLW 
Sbjct: 246 RDDPDEDMDAETKWALKHAKHMALDCSNFGDDRPLTGCSFSRDGKILATCSLSGVTKLWE 305

Query: 305 MPQV-RKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEGHLDRLA 364
           MPQV   ++  K H ER TDV+FSPV++CLATASADRTA+LW  +G+LL+TFEGHLDRLA
Sbjct: 306 MPQVTNTIAVLKDHKERATDVVFSPVDDCLATASADRTAKLWKTDGTLLQTFEGHLDRLA 365

Query: 365 RIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSCGLDA 424
           R+AFHPSGKYLGTTS+DKTWRLWD+ TG ELLLQEGHSRSVYGIAF  DG+L +SCGLD+
Sbjct: 366 RVAFHPSGKYLGTTSYDKTWRLWDINTGAELLLQEGHSRSVYGIAFQQDGALAASCGLDS 425

Query: 425 LARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLYIIPA 484
           LARVWDLRTGRS+L  +GH+KPV  V+FSPNGYHLA+GGEDN CRIWDLR +KSLYIIPA
Sbjct: 426 LARVWDLRTGRSILVFQGHIKPVFSVNFSPNGYHLASGGEDNQCRIWDLRMRKSLYIIPA 485

Query: 485 HSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIISDGQCI 544
           H+NLVSQVKYEPQEGYFL TAS+DM   IWS RDF  VK+L+GHE+KV SLDI +D  CI
Sbjct: 486 HANLVSQVKYEPQEGYFLATASYDMKVNIWSGRDFSLVKSLAGHESKVASLDITADSSCI 545

Query: 545 ATVSHDRAIKLWSVNSKD-----IQTMDVD 569
           ATVSHDR IKLW+ +  D      +TMD+D
Sbjct: 546 ATVSHDRTIKLWTSSGNDDEDEEKETMDID 553

BLAST of CsGy6G020435.1 vs. TAIR 10
Match: AT2G05720.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 311.6 bits (797), Expect = 1.3e-84
Identity = 168/327 (51.38%), Positives = 200/327 (61.16%), Query Frame = 0

Query: 215 GSKALLDARIDIAKYSILRASSRLERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIG 274
           G   L +ARI+I K  I RA+ R++R  R+R+DPDED  AE   AL+  + +VL  S+ G
Sbjct: 2   GPTELREARIEITKDFIKRAALRIQRENRRRNDPDEDKNAETKLALKHCKDMVLGSSKFG 61

Query: 275 DDRPLSGCSFSSDGKFLATSSLSGVAKLWSMPQV-RKVSNFKGHTERVTDVMFSPV-NEC 334
           DDRPL+GCS S DGK L T SLSGV KLW +PQV  K+   KGH E VTDV+FS V +EC
Sbjct: 62  DDRPLTGCSLSRDGKILVTCSLSGVPKLWEVPQVTNKIVVLKGHKEHVTDVVFSSVDDEC 121

Query: 335 LATASADRTARLWSAEGSLLKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGV 394
           LATAS DRT ++W  +G+LL+TF+                                    
Sbjct: 122 LATASTDRTEKIWKTDGTLLQTFK------------------------------------ 181

Query: 395 ELLLQEGHSRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFS 454
                                   +S G D+LARVWDLRT R++L  +GH+K VL V FS
Sbjct: 182 ------------------------ASSGFDSLARVWDLRTARNILIFQGHIKQVLSVDFS 241

Query: 455 PNGYHLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKI 514
           PNGYHLA+GGEDN CRIWDLR +K LYIIPAH NLVSQVKYEPQE YFL TAS DM   I
Sbjct: 242 PNGYHLASGGEDNQCRIWDLRMRKLLYIIPAHVNLVSQVKYEPQERYFLATASHDMNVNI 268

Query: 515 WSARDFKPVKTLSGHEAKVTSLDIISD 540
           WS RDF  VK+L GHE+KV SLDI  D
Sbjct: 302 WSGRDFSLVKSLVGHESKVASLDIAVD 268

BLAST of CsGy6G020435.1 vs. TAIR 10
Match: AT3G49660.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 169.5 bits (428), Expect = 7.9e-42
Identity = 97/290 (33.45%), Positives = 145/290 (50.00%), Query Frame = 0

Query: 276 DRPLSGCSFSSDGKFLATSSLSGVAKLWSM-----PQVRKVSNFKGHTERVTDVMFSPVN 335
           +R +S   FSSDG+ LA++S     + +++     P    V  F GH   ++DV FS   
Sbjct: 24  NRAVSSVKFSSDGRLLASASADKTIRTYTINTINDPIAEPVQEFTGHENGISDVAFSSDA 83

Query: 336 ECLATASADRTARLWSAE-GSLLKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVE 395
             + +AS D+T +LW  E GSL+KT  GH +    + F+P    + + SFD+T R+WDV 
Sbjct: 84  RFIVSASDDKTLKLWDVETGSLIKTLIGHTNYAFCVNFNPQSNMIVSGSFDETVRIWDVT 143

Query: 396 TGVELLLQEGHSRSVYGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLAL-EGHVKPVLG 455
           TG  L +   HS  V  + F+ DGSL+ S   D L R+WD  TG  V  L +    PV  
Sbjct: 144 TGKCLKVLPAHSDPVTAVDFNRDGSLIVSSSYDGLCRIWDSGTGHCVKTLIDDENPPVSF 203

Query: 456 VSFSPNGYHLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVK--YEPQEGYFLVTASF 515
           V FSPNG  +  G  DNT R+W++   K L     H N    +   +    G  +V+ S 
Sbjct: 204 VRFSPNGKFILVGTLDNTLRLWNISSAKFLKTYTGHVNAQYCISSAFSVTNGKRIVSGSE 263

Query: 516 DMTAKIWSARDFKPVKTLSGHEAKVTSLDIISDGQCIATVSHDRAIKLWS 557
           D    +W     K ++ L GH   V ++        IA+ S D+ +++W+
Sbjct: 264 DNCVHMWELNSKKLLQKLEGHTETVMNVACHPTENLIASGSLDKTVRIWT 313

BLAST of CsGy6G020435.1 vs. TAIR 10
Match: AT4G02730.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 149.1 bits (375), Expect = 1.1e-35
Identity = 82/260 (31.54%), Positives = 139/260 (53.46%), Query Frame = 0

Query: 309 RKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEG-SLLKTFEGHLDRLARIAF 368
           R +   +GHT  ++ V FS     LA+AS D+T  LWSA   SL+  +EGH   ++ +A+
Sbjct: 34  RHLKTLEGHTAAISCVKFSNDGNLLASASVDKTMILWSATNYSLIHRYEGHSSGISDLAW 93

Query: 369 HPSGKYLGTTSFDKTWRLWDVETGVELL-LQEGHSRSVYGIAFHHDGSLVSSCGLDALAR 428
                Y  + S D T R+WD  +  E L +  GH+  V+ + F+   +L+ S   D   R
Sbjct: 94  SSDSHYTCSASDDCTLRIWDARSPYECLKVLRGHTNFVFCVNFNPPSNLIVSGSFDETIR 153

Query: 429 VWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSL-YIIPAHS 488
           +W+++TG+ V  ++ H  P+  V F+ +G  + +   D +C+IWD ++   L  +I   S
Sbjct: 154 IWEVKTGKCVRMIKAHSMPISSVHFNRDGSLIVSASHDGSCKIWDAKEGTCLKTLIDDKS 213

Query: 489 NLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKV---TSLDIISDGQC 548
             VS  K+ P  G F++ A+ D T K+ +    K +K  +GH  KV   TS   +++G+ 
Sbjct: 214 PAVSFAKFSP-NGKFILVATLDSTLKLSNYATGKFLKVYTGHTNKVFCITSAFSVTNGKY 273

Query: 549 IATVSHDRAIKLWSVNSKDI 563
           I + S D  + LW + +++I
Sbjct: 274 IVSGSEDNCVYLWDLQARNI 292

BLAST of CsGy6G020435.1 vs. TAIR 10
Match: AT2G33340.1 (MOS4-associated complex 3B )

HSP 1 Score: 142.9 bits (359), Expect = 7.9e-34
Identity = 86/288 (29.86%), Positives = 145/288 (50.35%), Query Frame = 0

Query: 291 LATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEG- 350
           +AT  +   A L+  P  + +S   GH+++VT V F   ++ + TASAD+T R+W   G 
Sbjct: 237 IATGGVDATAVLFDRPSGQILSTLTGHSKKVTSVKFVGDSDLVLTASADKTVRIWRNPGD 296

Query: 351 ---SLLKTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSV-- 410
              +   T   H   +  +  HP+ KY  + S D TW  +D+ +G  L      S++V  
Sbjct: 297 GNYACGYTLNDHSAEVRAVTVHPTNKYFVSASLDGTWCFYDLSSGSCLAQVSDDSKNVDY 356

Query: 411 YGIAFHHDGSLVSSCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGED 470
              AFH DG ++ +    ++ ++WD+++  +V   +GH   V  +SFS NGY LAT  ED
Sbjct: 357 TAAAFHPDGLILGTGTSQSVVKIWDVKSQANVAKFDGHTGEVTAISFSENGYFLATAAED 416

Query: 471 NTCRIWDLRKKKSL-YIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSAR-DFKPVK 530
              R+WDLRK ++    + A +N    V+++P   Y  + AS     +  S + ++  +K
Sbjct: 417 GV-RLWDLRKLRNFKSFLSADAN---SVEFDPSGSYLGIAASDIKVYQTASVKAEWNLIK 476

Query: 531 TLS--GHEAKVTSLDIISDGQCIATVSHDRAIKLWSVNSKDIQTMDVD 569
           TL       K T +   SD Q +A  S DR ++++ +   +   +D D
Sbjct: 477 TLPDLSGTGKATCVKFGSDAQYVAVGSMDRNLRIFGLPGDEKANVDDD 520

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
O22212	8.7e-203	64.21	U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Arabidopsis thaliana ...
Q3MHE2	3.4e-114	46.62	U4/U6 small nuclear ribonucleoprotein Prp4 OS=Bos taurus OX=9913 GN=PRPF4 PE=2 S...
O43172	3.4e-114	46.62	U4/U6 small nuclear ribonucleoprotein Prp4 OS=Homo sapiens OX=9606 GN=PRPF4 PE=1...
Q5NVD0	3.4e-114	46.62	U4/U6 small nuclear ribonucleoprotein Prp4 OS=Pongo abelii OX=9601 GN=PRPF4 PE=2...
Q9DAW6	5.8e-114	46.62	U4/U6 small nuclear ribonucleoprotein Prp4 OS=Mus musculus OX=10090 GN=Prpf4 PE=...

Match Name	E-value	Identity (%)	Description
XP_004146749.1	0.0	99.65	U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis sativus] >XP_03...
XP_008464716.1	0.0	98.77	PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis melo...
XP_038885479.1	0.0	96.84	U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Benincasa hispida] >XP_...
KAA0040989.1	0.0	97.19	U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis melo var. makuw...
XP_022999661.1	0.0	94.04	U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucurbita maxima]

Match Name	E-value	Identity (%)	Description
A0A0A0KHD8	0.0	99.65	WD_REPEATS_REGION domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G...
A0A1S4E4U8	0.0	98.77	U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Cucumis melo OX=3656 ...
A0A5A7TI50	0.0	97.19	U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Cucumis melo var. mak...
A0A6J1KDQ7	0.0	94.04	U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Cucurbita maxima OX=3...
A0A6J1G322	0.0	93.68	U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Cucurbita moschata OX...

Match Name	E-value	Identity (%)	Description
AT2G41500.1	6.2e-204	64.21	WD-40 repeat family protein / small nuclear ribonucleoprotein Prp4p-related
AT2G05720.1	1.3e-84	51.38	Transducin/WD40 repeat-like superfamily protein
AT3G49660.1	7.9e-42	33.45	Transducin/WD40 repeat-like superfamily protein
AT4G02730.1	1.1e-35	31.54	Transducin/WD40 repeat-like superfamily protein
AT2G33340.1	7.9e-34	29.86	MOS4-associated complex 3B
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 174..194
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 46..71
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1..118
None	No IPR available	PANTHER	PTHR19846	WD40 REPEAT PROTEIN	coord: 103..562
None	No IPR available	PANTHER	PTHR19846:SF3	BNAC04G01920D PROTEIN	coord: 103..562
None	No IPR available	PROSITE	PS50294	WD_REPEATS_REGION	coord: 314..346, score: 11.207411
None	No IPR available	PROSITE	PS50294	WD_REPEATS_REGION	coord: 524..559, score: 10.917412
None	No IPR available	PROSITE	PS50294	WD_REPEATS_REGION	coord: 355..391, score: 10.917412
None	No IPR available	PROSITE	PS50294	WD_REPEATS_REGION	coord: 397..434, score: 10.917412
None	No IPR available	PROSITE	PS50294	WD_REPEATS_REGION	coord: 439..473, score: 13.2374
None	No IPR available	PROSITE	PS50294	WD_REPEATS_REGION	coord: 481..514, score: 9.151057
None	No IPR available	CDD	cd00200	WD40	coord: 278..556, e-value: 5.26211E-90, score: 277.294
IPR020472	G-protein beta WD-40 repeat	PRINTS	PR00320	GPROTEINBRPT	coord: 374..388, score: 37.95; coord: 333..347, score: 36.13; coord: 458..472, score: 38.39
IPR014906	Pre-mRNA processing factor 4 (PRP4)-like	SMART	SM00500	pr04_2	coord: 138..191, e-value: 5.3E-18, score: 75.8
IPR014906	Pre-mRNA processing factor 4 (PRP4)-like	PFAM	PF08799	PRP4	coord: 143..170, e-value: 1.1E-12, score: 47.2
IPR001680	WD40 repeat	SMART	SM00320	WD40_4	coord: 263..304, e-value: 0.021, score: 24.0; coord: 348..387, e-value: 1.1E-7, score: 41.6; coord: 307..346, e-value: 1.3E-8, score: 44.6; coord: 390..429, e-value: 2.5E-6, score: 37.0; coord: 474..514, e-value: 2.1E-4, score: 30.6; coord: 517..556, e-value: 4.4E-9, score: 46.2; coord: 432..471, e-value: 4.4E-11, score: 52.8
IPR001680	WD40 repeat	PFAM	PF00400	WD40	coord: 519..556, e-value: 1.7E-5, score: 25.4; coord: 398..429, e-value: 5.3E-4, score: 20.7; coord: 436..471, e-value: 5.4E-9, score: 36.5; coord: 311..346, e-value: 1.0E-4, score: 23.0; coord: 349..387, e-value: 1.3E-4, score: 22.7
IPR001680	WD40 repeat	PROSITE	PS50082	WD_REPEATS_2	coord: 355..396, score: 13.482671
IPR001680	WD40 repeat	PROSITE	PS50082	WD_REPEATS_2	coord: 314..346, score: 13.516088
IPR001680	WD40 repeat	PROSITE	PS50082	WD_REPEATS_2	coord: 439..480, score: 16.958164
IPR001680	WD40 repeat	PROSITE	PS50082	WD_REPEATS_2	coord: 524..565, score: 13.549507
IPR001680	WD40 repeat	PROSITE	PS50082	WD_REPEATS_2	coord: 397..438, score: 14.251289
IPR001680	WD40 repeat	PROSITE	PS50082	WD_REPEATS_2	coord: 481..523, score: 11.878597
IPR036285	PRP4-like superfamily	GENE3D	4.10.280.110		coord: 109..185, e-value: 1.6E-14, score: 55.1
IPR036285	PRP4-like superfamily	SUPERFAMILY	158230	PRP4-like	coord: 120..176
IPR015943	WD40/YVTN repeat-like-containing domain superfamily	GENE3D	2.130.10.10		coord: 256..439, e-value: 1.2E-49, score: 170.9
IPR015943	WD40/YVTN repeat-like-containing domain superfamily	GENE3D	2.130.10.10		coord: 440..567, e-value: 2.8E-39, score: 137.0
IPR019775	WD40 repeat, conserved site	PROSITE	PS00678	WD_REPEATS_1	coord: 416..430
IPR019775	WD40 repeat, conserved site	PROSITE	PS00678	WD_REPEATS_1	coord: 458..472
IPR019775	WD40 repeat, conserved site	PROSITE	PS00678	WD_REPEATS_1	coord: 374..388
IPR036322	WD40-repeat-containing domain superfamily	SUPERFAMILY	50978	WD40 repeat-like	coord: 272..555
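
Each InterPro match above is located by a `coord: start..end` pair. The Python sketch below shows one way such pairs could be pulled out of the text and ordered along the protein, using the SMART SM00320 (WD40) coordinates listed above as input. The helper names are hypothetical, and the length calculation assumes the coordinates are 1-based, inclusive amino-acid positions, which this page does not state explicitly.

```python
import re

# Matches "coord: 263..304" style fragments as they appear in the table.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Return every (start, end) pair found in a block of annotation text."""
    return [(int(start), int(end)) for start, end in COORD_RE.findall(text)]


def span_length(start, end):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# SMART SM00320 (WD40) alignments, in the order given in the table.
smart_wd40 = (
    "coord: 263..304 coord: 348..387 coord: 307..346 coord: 390..429 "
    "coord: 474..514 coord: 517..556 coord: 432..471"
)

repeats = sorted(parse_coords(smart_wd40))
for start, end in repeats:
    print(start, end, span_length(start, end))
```

Sorting by start position makes it easy to see that the SMART matches tile the C-terminal region as consecutive, non-overlapping repeats.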

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name	Unique Name	Type
CsGy6G020435	CsGy6G020435	gene


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CsGy6G020435.1.utr5p1	CsGy6G020435.1.utr5p1	five_prime_UTR
CsGy6G020435.1.utr5p2	CsGy6G020435.1.utr5p2	five_prime_UTR


The following exon feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CsGy6G020435.1.exon1	CsGy6G020435.1.exon1	exon
CsGy6G020435.1.exon2	CsGy6G020435.1.exon2	exon
CsGy6G020435.1.exon3	CsGy6G020435.1.exon3	exon
CsGy6G020435.1.exon4	CsGy6G020435.1.exon4	exon
CsGy6G020435.1.exon5	CsGy6G020435.1.exon5	exon
CsGy6G020435.1.exon6	CsGy6G020435.1.exon6	exon


The following CDS feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
cds.CsGy6G020435.1	cds.CsGy6G020435.1	CDS
cds.CsGy6G020435.1	cds.CsGy6G020435.1_2	CDS
cds.CsGy6G020435.1	cds.CsGy6G020435.1_3	CDS
cds.CsGy6G020435.1	cds.CsGy6G020435.1_4	CDS
cds.CsGy6G020435.1	cds.CsGy6G020435.1_5	CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name	Unique Name	Type
CsGy6G020435.1.utr3p1	CsGy6G020435.1.utr3p1	three_prime_UTR


The following polypeptide feature(s) derive from this mRNA:

Feature Name	Unique Name	Type
CsGy6G020435.1	CsGy6G020435.1-protein	polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0000398	mRNA splicing, via spliceosome
cellular_component	GO:0046540	U4/U6 x U5 tri-snRNP complex
molecular_function	GO:0005515	protein binding
molecular_function	GO:0030621	U4 snRNA binding
molecular_function	GO:0017070	U6 snRNA binding
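
For downstream use, the GO assignments above are often easier to handle when grouped by category. The following Python sketch does this for the five terms listed; the tuple layout and the function name `group_by_category` are illustrative assumptions rather than an export format offered by this database.

```python
from collections import defaultdict

# (category, accession, term name) exactly as listed in the GO table above.
go_rows = [
    ("biological_process", "GO:0000398", "mRNA splicing, via spliceosome"),
    ("cellular_component", "GO:0046540", "U4/U6 x U5 tri-snRNP complex"),
    ("molecular_function", "GO:0005515", "protein binding"),
    ("molecular_function", "GO:0030621", "U4 snRNA binding"),
    ("molecular_function", "GO:0017070", "U6 snRNA binding"),
]


def group_by_category(rows):
    """Map each GO category to the list of accessions assigned under it."""
    grouped = defaultdict(list)
    for category, accession, _name in rows:
        grouped[category].append(accession)
    return dict(grouped)


for category, accessions in group_by_category(go_rows).items():
    print(category, ", ".join(accessions))
```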