CmaCh04G017210 (gene) Cucurbita maxima (Rimu) v1.1

Overview
Name: CmaCh04G017210
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu) v1.1)
Description: U4/U6 small nuclear ribonucleoprotein Prp31 homolog
Location: Cma_Chr04: 8656339 .. 8659980 (+)
RNA-Seq Expression: CmaCh04G017210
Synteny: CmaCh04G017210
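
The Location value in the overview can be split into its parts and turned into a span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'seqid: start .. end (strand)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("Cma_Chr04: 8656339 .. 8659980 (+)")
print(seqid, strand, span_length(start, end))  # Cma_Chr04 + 3642
```

Under that assumption the feature spans 3642 positions on the plus strand of Cma_Chr04.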
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGACGTTTAGGTTTCAGCACTTTCCTTTGTGCGCTGGTTTATTTTAGTTCTTACGTTCTTGCGTGTAAACTCAAATTGACCTATCTAGGAATTGTGGACTTTGGAGTGTGCTATTAAATCTGTCGCTTAATTCTTTATCAGCTGCTAATTTTGTGTTTCTAATCGTTCTTCCTTGAATTATAGAGTTTCTCAAGAGATAGCTTGTTCTGCCTACAGGGAGCATGTATATAAGAATTTTCTGAGTTAGCTTATAATTTATATGGTTGGTAAAGGAGAACTTGATTACTTTCTCTCTCGGATGTAATTTATTAAACTAATGACCAAGTTTTGAATCGTTACACTTCTTTTCTACTTTATTTCATACATTCATTAGGTTTTTAAACTTGCTATTGTGCTTCTTCTTTCAAATGTTCTTTAAGATCTATCTTGAAAATGTGAGTTATCTTGTCTATTAATTGTCATATTATTTTCTTCTGTTTGTAGGCTACTTTGGCTGATTCTTTTCTAGCAGATCTTGATGAACTCTCTGATGAAGACAATTTTCTGGTTAGCAAATGTTTTCTTACCTTTCTCTCTGGAAGATCAATTCATCATTATTTTGCTTGGTAGGAGTGTGAATCCATCTACCTTATATTTTGAGGCAATAGTTATTGGATGATTACAGGATGAAGAAGATGCTGATGCCGAAAATATGGAAGAGGATATTGATGGGGATCTTGCAGATCTAGAAAGCCTTAATTATGATGATCTGGATAGTGTATCAAAATTGCAGAAAACACAGAGATACAATGATATTATGCAGGTTAGTTTACATGAAAGTTTCCTTATTGACCTTGGTGTGGCTTGAATAGAAAGGGGTTAGAAGAGAGGCAATGAATATATTCAATCCTTGCTGTCTTGTCTATTATTCTAACCTTTTAGTGAGTGATAGGTTTTGGGATGTTGTATAATTATTGGATCTTTTGCAATATATCTCTCATGAATAATGATGTTCACTGCACTTCCCCCTCGATTTGATTGGATTTGTTGGCATCTGAATTACAGAAAGTGGCAGATGCATTGGAGATGGATTCTAATGTCTCAAATCAGGGATTTGTATTAGAAGATGATCCTGAGTACCAACTGATTGTAGAGTGTAATGCCTTGTCAGTGGATATTGAGAATGAGATTATTATTATTCACAATTTTATACGTGATAAATATCGACTAAAATTTCCAGAGCTTGAATCACTTGTGCACCATCCAATCGATTATGCTCGAGTTGTTAAGAAGATTGGGAATGAAATGGATTTGACTCTTGTAGATTTAGAAGGGCTTTTACCCTCTGCTGTCATTATGGTTGTCTCTGTTACAGCATCTACCACAAGTGGCAAGCCACTTCCGGAGGAAATTCTCCAGAAAACAATTGATGCATGTGATCGAGCTCTTACTTTAGATTCAGCAAAGAAAATGGTCCTTAATTTTGTTGAAAGTAGAATGGGATATATTGCACCAAATCTTTCAGCGATAGTTGGAAGTGCTGTTGCAGCAAAACTAATGGGAACTGCTGGTGGCCTTGTTGCCTTAGCTAAGATGCCTGCTTGCAATGTTCAGCTTCTTGGTGCAAAGAAAAAAAACCTTGCTGGGTTTTCCACAGCGACCTCACAATTTCGAGTAGGTTATATTGAGCAAACAGAGATATTCCAATCAACACCTCCTCCTTTGAAGATGCGTGCTTGTCGACTCATAGCGGCGAAGTCAACACTTGCAGCACGAGTTGACTCCACCAGGGGTGACCCTACAGGGAAGACGGGTAGAGTCTTCAAAGATGAGATCCTTAAGAAAATTGAGAAGTGGCAAGAACCTCCACCTGCAAAACAACCAAAACCTCTTCCTGTCCCTGATTCTGAACCCAAAAAGAAGAGAGGTGGCCGTCGATTAAGGAAGATGAAGGAAAGGTATATGTTACATTTGTCTTTCATCTCTCTAGCAATAGAAGCTTTGATCTGTTGCACTTT
TCCGTTGTTCATGGATTGATATGGATTTCCTTGTCAGGTATGCAACGACGGAGATGAGGAAGCTAGCTAACAGGATGCAGTTTGGGGTGCCTGAAGAGAGTTCTTTAGGTAAAACATTCTTTCATTTCTGAATAATCTGGATTGATCAAATTGATAGTTTTTCTTCTGCTACGTTTTGTGTTACGCATTGGAATGAGATGCAATGTTATGCTGTTCAGCAAGTTTGAGTTGTCATGTTGTCTGGTGAACCAGAAAATTTTCTGAATTGCAATAGTAACATCATAGCCGTCTTGAGTCTGTCCTATTACAGGATTATTTCTTATCAAACGTAGTTACATTTTTGTCTGCATACCAGCTGTTTCTTACAATTGTCCTGAACGGAAAAAGTAGTTGCTTGAGGTATTCTGCTGTAATATTCTGAATAAGCATTTCACTTCTTGATTTTTCATGCCACAACAGGAGATGGATTGGGGGAAGGGTATGGAATGCTTGGTCAGGCTGGGAGTGGCAAGTTGCGTGTGTCAGCTGCTCAGAGCAAGCTTGCCGCAAAAGTTGTTAAGAAGTAAGTCAAGTCATTGAGCTGATCCTAATAACCACACCCACCCATCCACTTGCGCGCGCACACACGCAAGTAATAATGAAAAAAGAAAAGAAAGAAAAAAGAATAATAACTATAACAAGGAGTGGAAAATGATATTTGTAGCATAACATTCGCTTCTTCATCTGTTATCCTTTACTTATCTTCTTATCACGAATAGGTTCAAGGAAAAACGCTATGGAAGCAGTGGTGCTACATCCGGGCTGACCTCAAGTTTGGCATTTACTCCAGTACAAGTCAGTATTTTTTGTCCTCCTCACCATCTCCTCCTATGTTATCTAAATTTGCCTCGGAAGTACTTGTTTTATTCATGACACTGCCTGGGATGCTTAATATCAAGGTTGTGAAAATTTCCTGTTTTCCATGTAGGGAATCGAGCTGTCGAATCCTCAGGCCCATTTGAACCAGCTAGGCGGTGGAACTCAAAGCACCTACTTTTCTGAAACAGGAACATTTTCAAAGATCAGGAAAAACTGAGTCATTAGACCATCAAGCTTTATGTTTCGTGCCCATGCTGTAATCATACATGGATATGGCTTACGATTAGGCTTTGTAATTCAGTACTAACAGAAAGCCACAAGAGTTTATGTTTTTATAACTCAGAATGTTTCTTATTCCTGCTTGCTTGATCTTTTTTTGGCTTTAGTTGACTGATTAGTGTTATCATTTTGGCCGGCGTTGATATTATATGGGATGCCATACACTATAACTGGTGGATTTGTCTTGGTCAGTCTCTTATTTAGTGCCATCAAATAATAGCTAGTAGTACTTCCTGTCTGATTTCCTCTAAGTTGAAAATCATAGAAACTTATGCCATACCATTCTCTGTATTCCCATGAGAGGAAGATTAGTACCACTGTGGGCTTTGATTTGGCATCTATGATATGCGGCTATGCTCTTCTCTTAATGTATCTCTCATTTCATTCAGGAATGGCTATCATATATCACTGATTTGGGTAGTTCAAAAAGGTTGCATCTATGGGTGAAGCTTTCTTGCCCTATGTAATGTAATATCTTTCAATTTTCTTTGACCCGACCAATCCG

mRNA sequence

ATGCGACGTTTAGGTTTCAGCACTTTCCTTTGTGCGCTGGCTACTTTGGCTGATTCTTTTCTAGCAGATCTTGATGAACTCTCTGATGAAGACAATTTTCTGGATGAAGAAGATGCTGATGCCGAAAATATGGAAGAGGATATTGATGGGGATCTTGCAGATCTAGAAAGCCTTAATTATGATGATCTGGATAGTGTATCAAAATTGCAGAAAACACAGAGATACAATGATATTATGCAGAAAGTGGCAGATGCATTGGAGATGGATTCTAATGTCTCAAATCAGGGATTTGTATTAGAAGATGATCCTGAGTACCAACTGATTGTAGAGTGTAATGCCTTGTCAGTGGATATTGAGAATGAGATTATTATTATTCACAATTTTATACGTGATAAATATCGACTAAAATTTCCAGAGCTTGAATCACTTGTGCACCATCCAATCGATTATGCTCGAGTTGTTAAGAAGATTGGGAATGAAATGGATTTGACTCTTGTAGATTTAGAAGGGCTTTTACCCTCTGCTGTCATTATGGTTGTCTCTGTTACAGCATCTACCACAAGTGGCAAGCCACTTCCGGAGGAAATTCTCCAGAAAACAATTGATGCATGTGATCGAGCTCTTACTTTAGATTCAGCAAAGAAAATGGTCCTTAATTTTGTTGAAAGTAGAATGGGATATATTGCACCAAATCTTTCAGCGATAGTTGGAAGTGCTGTTGCAGCAAAACTAATGGGAACTGCTGGTGGCCTTGTTGCCTTAGCTAAGATGCCTGCTTGCAATGTTCAGCTTCTTGGTGCAAAGAAAAAAAACCTTGCTGGGTTTTCCACAGCGACCTCACAATTTCGAGTAGGTTATATTGAGCAAACAGAGATATTCCAATCAACACCTCCTCCTTTGAAGATGCGTGCTTGTCGACTCATAGCGGCGAAGTCAACACTTGCAGCACGAGTTGACTCCACCAGGGGTGACCCTACAGGGAAGACGGGTAGAGTCTTCAAAGATGAGATCCTTAAGAAAATTGAGAAGTGGCAAGAACCTCCACCTGCAAAACAACCAAAACCTCTTCCTGTCCCTGATTCTGAACCCAAAAAGAAGAGAGGTGGCCGTCGATTAAGGAAGATGAAGGAAAGGTATGCAACGACGGAGATGAGGAAGCTAGCTAACAGGATGCAGTTTGGGGTGCCTGAAGAGAGTTCTTTAGGAGATGGATTGGGGGAAGGGTATGGAATGCTTGGTCAGGCTGGGAGTGGCAAGTTGCGTGTGTCAGCTGCTCAGAGCAAGCTTGCCGCAAAAGTTGTTAAGAAGTTCAAGGAAAAACGCTATGGAAGCAGTGGTGCTACATCCGGGCTGACCTCAAGTTTGGCATTTACTCCAGTACAAGGAATCGAGCTGTCGAATCCTCAGGCCCATTTGAACCAGCTAGGCGGTGGAACTCAAAGCACCTACTTTTCTGAAACAGGAACATTTTCAAAGATCAGGAAAAACTGAGTCATTAGACCATCAAGCTTTATGTTTCGTGCCCATGCTGTAATCATACATGGATATGGCTTACGATTAGGCTTTGTAATTCAGTACTAACAGAAAGCCACAAGAGTTTATGTTTTTATAACTCAGAATGTTTCTTATTCCTGCTTGCTTGATCTTTTTTTGGCTTTAGTTGACTGATTAGTGTTATCATTTTGGCCGGCGTTGATATTATATGGGATGCCATACACTATAACTGGTGGATTTGTCTTGGTCAGTCTCTTATTTAGTGCCATCAAATAATAGCTAGTAGTACTTCCTGTCTGATTTCCTCTAAGTTGAAAATCATAGAAACTTATGCCATACCATTCTCTGTATTCCCATGAGAGGAAGATTAGTACCACTGTGGGCTTTGATTTGGCATCTATGATATGCGGCTATGCTCTTCTCTTAATGTATCTCTCATTTCATTCAGGAATGGCTATCATATATCACTGATTTGGGTAGTTCAAAAAGGTTGCATCTATGGGTGA
AGCTTTCTTGCCCTATGTAATGTAATATCTTTCAATTTTCTTTGACCCGACCAATCCG

Coding sequence (CDS)

ATGCGACGTTTAGGTTTCAGCACTTTCCTTTGTGCGCTGGCTACTTTGGCTGATTCTTTTCTAGCAGATCTTGATGAACTCTCTGATGAAGACAATTTTCTGGATGAAGAAGATGCTGATGCCGAAAATATGGAAGAGGATATTGATGGGGATCTTGCAGATCTAGAAAGCCTTAATTATGATGATCTGGATAGTGTATCAAAATTGCAGAAAACACAGAGATACAATGATATTATGCAGAAAGTGGCAGATGCATTGGAGATGGATTCTAATGTCTCAAATCAGGGATTTGTATTAGAAGATGATCCTGAGTACCAACTGATTGTAGAGTGTAATGCCTTGTCAGTGGATATTGAGAATGAGATTATTATTATTCACAATTTTATACGTGATAAATATCGACTAAAATTTCCAGAGCTTGAATCACTTGTGCACCATCCAATCGATTATGCTCGAGTTGTTAAGAAGATTGGGAATGAAATGGATTTGACTCTTGTAGATTTAGAAGGGCTTTTACCCTCTGCTGTCATTATGGTTGTCTCTGTTACAGCATCTACCACAAGTGGCAAGCCACTTCCGGAGGAAATTCTCCAGAAAACAATTGATGCATGTGATCGAGCTCTTACTTTAGATTCAGCAAAGAAAATGGTCCTTAATTTTGTTGAAAGTAGAATGGGATATATTGCACCAAATCTTTCAGCGATAGTTGGAAGTGCTGTTGCAGCAAAACTAATGGGAACTGCTGGTGGCCTTGTTGCCTTAGCTAAGATGCCTGCTTGCAATGTTCAGCTTCTTGGTGCAAAGAAAAAAAACCTTGCTGGGTTTTCCACAGCGACCTCACAATTTCGAGTAGGTTATATTGAGCAAACAGAGATATTCCAATCAACACCTCCTCCTTTGAAGATGCGTGCTTGTCGACTCATAGCGGCGAAGTCAACACTTGCAGCACGAGTTGACTCCACCAGGGGTGACCCTACAGGGAAGACGGGTAGAGTCTTCAAAGATGAGATCCTTAAGAAAATTGAGAAGTGGCAAGAACCTCCACCTGCAAAACAACCAAAACCTCTTCCTGTCCCTGATTCTGAACCCAAAAAGAAGAGAGGTGGCCGTCGATTAAGGAAGATGAAGGAAAGGTATGCAACGACGGAGATGAGGAAGCTAGCTAACAGGATGCAGTTTGGGGTGCCTGAAGAGAGTTCTTTAGGAGATGGATTGGGGGAAGGGTATGGAATGCTTGGTCAGGCTGGGAGTGGCAAGTTGCGTGTGTCAGCTGCTCAGAGCAAGCTTGCCGCAAAAGTTGTTAAGAAGTTCAAGGAAAAACGCTATGGAAGCAGTGGTGCTACATCCGGGCTGACCTCAAGTTTGGCATTTACTCCAGTACAAGGAATCGAGCTGTCGAATCCTCAGGCCCATTTGAACCAGCTAGGCGGTGGAACTCAAAGCACCTACTTTTCTGAAACAGGAACATTTTCAAAGATCAGGAAAAACTGA

Protein sequence

MRRLGFSTFLCALATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKTQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDKYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPLPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLVALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRLRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAKVVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSKIRKN
Homology
BLAST of CmaCh04G017210 vs. ExPASy Swiss-Prot
Match: Q8RXN6 (U4/U6 small nuclear ribonucleoprotein Prp31 homolog OS=Arabidopsis thaliana OX=3702 GN=PRP31 PE=1 SV=1)

HSP 1 Score: 690.6 bits (1781), Expect = 1.3e-197
Identity = 371/485 (76.49%), Positives = 417/485 (85.98%), Query Frame = 0

Query: 13  LATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 72
           +ATL DSFLADLDELSD +  LDE D D    EED+D D+ADLE+LNYDDLD+VSKLQK+
Sbjct: 1   MATLEDSFLADLDELSDNEAELDENDGDVGKEEEDVDMDMADLETLNYDDLDNVSKLQKS 60

Query: 73  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 132
           QRY DIM KV +AL  DS+ + +G VLEDDPEY+LIV+CN LSVDIENEI+I+HNFI+DK
Sbjct: 61  QRYADIMHKVEEALGKDSDGAEKGTVLEDDPEYKLIVDCNQLSVDIENEIVIVHNFIKDK 120

Query: 133 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 192
           Y+LKF ELESLVHHPIDYA VVKKIGNE DL LVDL  LLPSA+IMVVSVTA TT G  L
Sbjct: 121 YKLKFQELESLVHHPIDYACVVKKIGNETDLALVDLADLLPSAIIMVVSVTALTTKGSAL 180

Query: 193 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 252
           PE++LQK ++ACDRAL LDSA+K VL FVES+MG IAPNLSAIVGSAVAAKLMGTAGGL 
Sbjct: 181 PEDVLQKVLEACDRALDLDSARKKVLEFVESKMGSIAPNLSAIVGSAVAAKLMGTAGGLS 240

Query: 253 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 312
           ALAKMPACNVQ+LG K+KNLAGFS+ATSQ RVGY+EQTEI+QSTPP L+ RA RL+AAKS
Sbjct: 241 ALAKMPACNVQVLGHKRKNLAGFSSATSQSRVGYLEQTEIYQSTPPGLQARAGRLVAAKS 300

Query: 313 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 372
           TLAARVD+TRGDP G +G+ F++EI KKIEKWQEPPPA+QPKPLPVPDSEPKK+RGGRRL
Sbjct: 301 TLAARVDATRGDPLGISGKAFREEIRKKIEKWQEPPPARQPKPLPVPDSEPKKRRGGRRL 360

Query: 373 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLA-- 432
           RKMKERY  T+MRKLANRM FG PEESSLGDGLGEGYGMLGQAGS +LRVS+  SKL   
Sbjct: 361 RKMKERYQVTDMRKLANRMAFGTPEESSLGDGLGEGYGMLGQAGSNRLRVSSVPSKLKIN 420

Query: 433 AKVVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTF 492
           AKV KK KE++Y     TSGLTSSLAFTPVQGIEL NPQ  L  LG GTQSTYFSE+GTF
Sbjct: 421 AKVAKKLKERQYAGGATTSGLTSSLAFTPVQGIELCNPQQALG-LGSGTQSTYFSESGTF 480

Query: 493 SKIRK 496
           SK++K
Sbjct: 481 SKLKK 484
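
The percentages reported in the HSP summary for this hit follow directly from the match counts and the alignment length. A minimal sketch of that arithmetic, using the counts reported for the Q8RXN6 hit (371 identical and 417 positive positions over 485 aligned columns); `percent` is a hypothetical helper name.

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to two decimals."""
    return round(100.0 * matches / aligned_length, 2)

aligned_length = 485  # alignment columns, including gap columns
print(percent(371, aligned_length))  # identical positions -> 76.49
print(percent(417, aligned_length))  # positive-scoring positions -> 85.98
```

The denominator is the alignment length rather than the length of either protein, which is why the same query gives different denominators (484 to 496) across the hits on this page.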

BLAST of CmaCh04G017210 vs. ExPASy Swiss-Prot
Match: Q8CCF0 (U4/U6 small nuclear ribonucleoprotein Prp31 OS=Mus musculus OX=10090 GN=Prpf31 PE=1 SV=3)

HSP 1 Score: 382.9 bits (982), Expect = 5.6e-105
Identity = 233/496 (46.98%), Positives = 328/496 (66.13%), Query Frame = 0

Query: 15  TLADSFLADLDELSDED---NFLDEEDADA-ENMEEDIDGDLADLESLNYDDLDSVSKLQ 74
           +LAD  LADL+E ++E+   ++ +EE+  A E+++E+   DL+       D + S++KL 
Sbjct: 2   SLADELLADLEEAAEEEEGGSYGEEEEEPAIEDVQEETQLDLSG------DSVKSIAKLW 61

Query: 75  KTQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIR 134
            ++ + +IM K+ + +   +NVS     +E  PEY++IV+ N L+V+IENE+ IIH FIR
Sbjct: 62  DSKMFAEIMMKIEEYISKQANVSEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIR 121

Query: 135 DKYRLKFPELESLVHHPIDYARVVKKIGNEMD--LTLVDLEGLLPSAVIMVVSVTASTTS 194
           DKY  +FPELESLV + +DY R VK++GN +D      +L+ +L +A IMVVSVTASTT 
Sbjct: 122 DKYSKRFPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQ 181

Query: 195 GKPLPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTA 254
           G+ L +E L++  +ACD AL L+++K  +  +VESRM +IAPNLS I+G++ AAK+MG A
Sbjct: 182 GQQLSDEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVA 241

Query: 255 GGLVALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLI 314
           GGL  L+KMPACN+ LLGA++K L+GFS+ +     GYI  ++I QS PP L+ +A RL+
Sbjct: 242 GGLTNLSKMPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLV 301

Query: 315 AAKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRG 374
           AAK TLAARVDS      GK G   KDEI +K +KWQEPPP KQ KPLP P    +KKRG
Sbjct: 302 AAKCTLAARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRG 361

Query: 375 GRRLRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLR---VSAA 434
           GRR RKMKER   TE+RK ANRM FG  EE +  + LG   G LG++GSG++R   V+ A
Sbjct: 362 GRRYRKMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVRQTQVNEA 421

Query: 435 QSKLAAKVVKKFKEKR---YGSSGA----TSGLTSSLAFTPVQGIELSNPQAHLNQLGGG 494
                +K +++  +K+   YG        +SG  SS+AFTP+QG+E+ NPQA   ++   
Sbjct: 422 TKARISKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEA 481
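
The Swiss-Prot and TrEMBL "Match:" lines on this page carry an accession followed by a description with `OS=`, `OX=`, `GN=`, `PE=` and `SV=` fields. The sketch below shows one way to split such a line into a dictionary; `parse_match` is a hypothetical helper, and it assumes the fields appear with exactly these two-letter keys, as they do in the lines shown here.

```python
import re

def parse_match(text):
    """Split 'ACCESSION (name OS=... OX=... GN=... PE=... SV=...)' into fields."""
    m = re.fullmatch(r"(\S+) \((.*)\)", text.strip())
    if m is None:
        raise ValueError(f"unrecognised match line: {text!r}")
    accession, body = m.groups()
    # Split before each KEY= token; the first piece is the protein name.
    pieces = re.split(r"\s(?=(?:OS|OX|GN|PE|SV)=)", body)
    fields = {"accession": accession, "name": pieces[0]}
    for piece in pieces[1:]:
        key, _, value = piece.partition("=")
        fields[key] = value
    return fields

hit = parse_match(
    "Q8CCF0 (U4/U6 small nuclear ribonucleoprotein Prp31 "
    "OS=Mus musculus OX=10090 GN=Prpf31 PE=1 SV=3)"
)
print(hit["accession"], hit["OS"], hit["GN"])  # Q8CCF0 Mus musculus Prpf31
```

The NCBI nr match lines use a different layout (bracketed organism names, several entries joined by `>`), so this sketch does not apply to them.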

BLAST of CmaCh04G017210 vs. ExPASy Swiss-Prot
Match: Q5U5C5 (U4/U6 small nuclear ribonucleoprotein Prp31 OS=Xenopus laevis OX=8355 GN=prpf31 PE=2 SV=1)

HSP 1 Score: 379.4 bits (973), Expect = 6.2e-104
Identity = 231/495 (46.67%), Positives = 324/495 (65.45%), Query Frame = 0

Query: 15  TLADSFLADLDELS--DEDNFLDEEDADA-ENMEEDIDGDLADLESLNYDDLDSVSKLQK 74
           +LAD  LADL+E +  +E+N +DE+D +  E ++E++  D      LN + + S++KL  
Sbjct: 2   SLADELLADLEEAAEEEEENLIDEDDLETIEEVDEEMQVD------LNAESVKSIAKLSD 61

Query: 75  TQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRD 134
           ++ +++I+ K+   ++     S     +E  PEY++IV+ N L+V+IENE+ IIH FIRD
Sbjct: 62  SKLFSEILLKIEGYIQKQPKASEVMGPVEAAPEYKVIVDANNLTVEIENELNIIHKFIRD 121

Query: 135 KYRLKFPELESLVHHPIDYARVVKKIGNEMD--LTLVDLEGLLPSAVIMVVSVTASTTSG 194
           KY  +FPELESLV + +DY R VK++GN +D      +L+ +L +A IMVVSVTASTT G
Sbjct: 122 KYSKRFPELESLVPNALDYIRTVKELGNNLDKCKNNENLQQILTNATIMVVSVTASTTQG 181

Query: 195 KPLPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAG 254
           + L +E L++  +ACD AL L+ +K  +  +VESRM +IAPNLS IVG++ AAK+MG AG
Sbjct: 182 QQLTDEELERIEEACDMALELNQSKHRIYEYVESRMSFIAPNLSIIVGASTAAKIMGIAG 241

Query: 255 GLVALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIA 314
           GL  L+KMPACNV LLGA++K L GFS+ +     GYI  +EI QS P  L  +A RL++
Sbjct: 242 GLTNLSKMPACNVMLLGAQRKTLTGFSSTSVLPHTGYIYHSEIVQSLPSDLHRKAARLVS 301

Query: 315 AKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGG 374
           AK TLA+RVDS   +P GK G   K+EI +K +KWQEPPP KQ KPLP P    +KKRGG
Sbjct: 302 AKCTLASRVDSFHENPEGKIGYDLKEEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGG 361

Query: 375 RRLRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLR---VSAAQ 434
           RR RKMKER   TE+RK ANRM FG  EE +  + LG   G LG++GSG++R   V+ A 
Sbjct: 362 RRYRKMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRIRQAQVNEAT 421

Query: 435 SKLAAKVVKKFKEKR---YGSSGA----TSGLTSSLAFTPVQGIELSNPQAHLNQLGGGT 494
               +K +++  +K+   YG        +SG  SS+AFTP+QG+E+ NPQA   ++    
Sbjct: 422 KARISKTLQRTLQKQSVVYGGKSTVRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEAN 481

BLAST of CmaCh04G017210 vs. ExPASy Swiss-Prot
Match: Q8WWY3 (U4/U6 small nuclear ribonucleoprotein Prp31 OS=Homo sapiens OX=9606 GN=PRPF31 PE=1 SV=2)

HSP 1 Score: 379.0 bits (972), Expect = 8.1e-104
Identity = 231/496 (46.57%), Positives = 326/496 (65.73%), Query Frame = 0

Query: 15  TLADSFLADLDELSDED---NFLDEEDADA-ENMEEDIDGDLADLESLNYDDLDSVSKLQ 74
           +LAD  LADL+E ++E+   ++ +EE+  A E+++E+   DL+       D + +++KL 
Sbjct: 2   SLADELLADLEEAAEEEEGGSYGEEEEEPAIEDVQEETQLDLSG------DSVKTIAKLW 61

Query: 75  KTQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIR 134
            ++ + +IM K+ + +   +  S     +E  PEY++IV+ N L+V+IENE+ IIH FIR
Sbjct: 62  DSKMFAEIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIR 121

Query: 135 DKYRLKFPELESLVHHPIDYARVVKKIGNEMD--LTLVDLEGLLPSAVIMVVSVTASTTS 194
           DKY  +FPELESLV + +DY R VK++GN +D      +L+ +L +A IMVVSVTASTT 
Sbjct: 122 DKYSKRFPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQ 181

Query: 195 GKPLPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTA 254
           G+ L EE L++  +ACD AL L+++K  +  +VESRM +IAPNLS I+G++ AAK+MG A
Sbjct: 182 GQQLSEEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVA 241

Query: 255 GGLVALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLI 314
           GGL  L+KMPACN+ LLGA++K L+GFS+ +     GYI  ++I QS PP L+ +A RL+
Sbjct: 242 GGLTNLSKMPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLV 301

Query: 315 AAKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRG 374
           AAK TLAARVDS      GK G   KDEI +K +KWQEPPP KQ KPLP P    +KKRG
Sbjct: 302 AAKCTLAARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRG 361

Query: 375 GRRLRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLR---VSAA 434
           GRR RKMKER   TE+RK ANRM FG  EE +  + LG   G LG++GSG++R   V+ A
Sbjct: 362 GRRYRKMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVRQTQVNEA 421

Query: 435 QSKLAAKVVKKFKEKR---YGSSGA----TSGLTSSLAFTPVQGIELSNPQAHLNQLGGG 494
                +K +++  +K+   YG        +SG  SS+AFTP+QG+E+ NPQA   ++   
Sbjct: 422 TKARISKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEA 481

BLAST of CmaCh04G017210 vs. ExPASy Swiss-Prot
Match: Q6NVP6 (U4/U6 small nuclear ribonucleoprotein Prp31 OS=Xenopus tropicalis OX=8364 GN=prpf31 PE=2 SV=1)

HSP 1 Score: 376.7 bits (966), Expect = 4.0e-103
Identity = 230/495 (46.46%), Positives = 323/495 (65.25%), Query Frame = 0

Query: 15  TLADSFLADLDELS--DEDNFLDEEDADA-ENMEEDIDGDLADLESLNYDDLDSVSKLQK 74
           +LAD  LADL+E +  +E+N +DE+D +  E ++E++  D      LN + + S++KL  
Sbjct: 2   SLADELLADLEEAAEEEEENLIDEDDLETIEEVQEEMQVD------LNAESVKSIAKLSD 61

Query: 75  TQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRD 134
           ++ +++I+ K+   ++     S     +E  PEY++IV+ N L+V+IENE+ IIH FIRD
Sbjct: 62  SKLFSEILLKIDGYIKKQPKASEVMGPVEAAPEYKVIVDANNLTVEIENELNIIHKFIRD 121

Query: 135 KYRLKFPELESLVHHPIDYARVVKKIGNEMD--LTLVDLEGLLPSAVIMVVSVTASTTSG 194
           KY  +FPELESLV + +DY R VK++GN +D      +L+ +L +A IMVVSVTASTT G
Sbjct: 122 KYSKRFPELESLVPNALDYIRTVKELGNNLDKCKNNENLQQILTNATIMVVSVTASTTQG 181

Query: 195 KPLPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAG 254
           + L +E L++  +ACD AL L+ +K  +  +VESRM +IAPNLS IVG++ AAK+MG AG
Sbjct: 182 QQLTDEELERIEEACDMALELNQSKHRIYEYVESRMSFIAPNLSIIVGASTAAKIMGIAG 241

Query: 255 GLVALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIA 314
           GL  L+KMPACNV LLGA++K L+GFS+ +     GYI  ++I QS PP L  +A RL++
Sbjct: 242 GLTNLSKMPACNVMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLHRKAARLVS 301

Query: 315 AKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGG 374
           AK TLAARVDS      GK G   K+EI +K +KWQEPPP KQ KPLP P    +KKRGG
Sbjct: 302 AKCTLAARVDSFHESSEGKVGYDLKEEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGG 361

Query: 375 RRLRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLR---VSAAQ 434
           RR RKMKER   TE+RK ANRM F   EE +  + LG   G LG++GSG++R   V+ A 
Sbjct: 362 RRYRKMKERLGLTEIRKQANRMSFAEIEEDAYQEDLGFSLGHLGKSGSGRIRQAQVNEAT 421

Query: 435 SKLAAKVVKKFKEKR---YGSSGA----TSGLTSSLAFTPVQGIELSNPQAHLNQLGGGT 494
               +K +++  +K+   YG        +SG  SS+AFTP+QG+E+ NPQA   ++    
Sbjct: 422 KARISKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEAN 481

BLAST of CmaCh04G017210 vs. ExPASy TrEMBL
Match: A0A6J1KRI6 (U4/U6 small nuclear ribonucleoprotein Prp31 homolog isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111497587 PE=3 SV=1)

HSP 1 Score: 909.4 bits (2349), Expect = 6.4e-261
Identity = 483/484 (99.79%), Positives = 484/484 (100.00%), Query Frame = 0

Query: 13  LATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 72
           +ATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT
Sbjct: 1   MATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 60

Query: 73  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 132
           QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK
Sbjct: 61  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 120

Query: 133 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 192
           YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL
Sbjct: 121 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 180

Query: 193 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 252
           PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV
Sbjct: 181 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 240

Query: 253 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 312
           ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS
Sbjct: 241 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 300

Query: 313 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 372
           TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL
Sbjct: 301 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 360

Query: 373 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 432
           RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK
Sbjct: 361 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 420

Query: 433 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 492
           VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK
Sbjct: 421 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 480

Query: 493 IRKN 497
           IRKN
Sbjct: 481 IRKN 484

BLAST of CmaCh04G017210 vs. ExPASy TrEMBL
Match: A0A6J1GXP7 (U4/U6 small nuclear ribonucleoprotein Prp31 homolog isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111458320 PE=3 SV=1)

HSP 1 Score: 909.4 bits (2349), Expect = 6.4e-261
Identity = 483/484 (99.79%), Positives = 484/484 (100.00%), Query Frame = 0

Query: 13  LATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 72
           +ATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT
Sbjct: 1   MATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 60

Query: 73  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 132
           QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK
Sbjct: 61  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 120

Query: 133 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 192
           YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL
Sbjct: 121 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 180

Query: 193 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 252
           PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV
Sbjct: 181 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 240

Query: 253 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 312
           ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS
Sbjct: 241 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 300

Query: 313 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 372
           TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL
Sbjct: 301 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 360

Query: 373 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 432
           RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK
Sbjct: 361 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 420

Query: 433 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 492
           VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK
Sbjct: 421 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 480

Query: 493 IRKN 497
           IRKN
Sbjct: 481 IRKN 484

BLAST of CmaCh04G017210 vs. ExPASy TrEMBL
Match: A0A6J1DBC9 (U4/U6 small nuclear ribonucleoprotein Prp31 homolog isoform X1 OS=Momordica charantia OX=3673 GN=LOC111019139 PE=3 SV=1)

HSP 1 Score: 887.9 bits (2293), Expect = 2.0e-254
Identity = 471/484 (97.31%), Positives = 475/484 (98.14%), Query Frame = 0

Query: 13  LATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 72
           +ATLADSFLADLDELSDEDNFLDEED DAENMEEDIDGDLADLESLNYDDLDSVSKLQKT
Sbjct: 1   MATLADSFLADLDELSDEDNFLDEEDVDAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 60

Query: 73  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 132
           QRYNDIMQKV  AL+ DSN+SNQG VLEDDPEYQLIVECN+LSVDIENEIIIIHNFIRDK
Sbjct: 61  QRYNDIMQKVEGALQKDSNISNQGLVLEDDPEYQLIVECNSLSVDIENEIIIIHNFIRDK 120

Query: 133 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 192
           YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL
Sbjct: 121 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 180

Query: 193 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 252
           PEEILQKTIDACDRAL LDSAKK VLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL 
Sbjct: 181 PEEILQKTIDACDRALALDSAKKKVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLA 240

Query: 253 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 312
           ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS
Sbjct: 241 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 300

Query: 313 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 372
           TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL
Sbjct: 301 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 360

Query: 373 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 432
           RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK
Sbjct: 361 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 420

Query: 433 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 492
           VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLG GTQSTYFSETGTFSK
Sbjct: 421 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGSGTQSTYFSETGTFSK 480

Query: 493 IRKN 497
           IRKN
Sbjct: 481 IRKN 484

BLAST of CmaCh04G017210 vs. ExPASy TrEMBL
Match: A0A6J1G0W2 (U4/U6 small nuclear ribonucleoprotein Prp31 homolog OS=Cucurbita moschata OX=3662 GN=LOC111449657 PE=3 SV=1)

HSP 1 Score: 885.2 bits (2286), Expect = 1.3e-253
Identity = 468/484 (96.69%), Positives = 477/484 (98.55%), Query Frame = 0

Query: 13  LATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 72
           +ATLADSFLADLDELSDEDNF DEED DAENMEEDIDGDLADLESLNYDDLD+VSKLQK+
Sbjct: 1   MATLADSFLADLDELSDEDNFPDEEDVDAENMEEDIDGDLADLESLNYDDLDNVSKLQKS 60

Query: 73  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 132
           QRY+DIMQKV DAL+ DSN+SNQG VLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK
Sbjct: 61  QRYSDIMQKVEDALQKDSNISNQGLVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 120

Query: 133 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 192
           YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL
Sbjct: 121 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 180

Query: 193 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 252
           PEEILQKTIDACDRAL LDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL 
Sbjct: 181 PEEILQKTIDACDRALALDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLG 240

Query: 253 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 312
           +LAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRL++AKS
Sbjct: 241 SLAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLVSAKS 300

Query: 313 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 372
           TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL
Sbjct: 301 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 360

Query: 373 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 432
           RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK
Sbjct: 361 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 420

Query: 433 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 492
           VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK
Sbjct: 421 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 480

Query: 493 IRKN 497
           IRKN
Sbjct: 481 IRKN 484

BLAST of CmaCh04G017210 vs. ExPASy TrEMBL
Match: A0A6J1HSU1 (U4/U6 small nuclear ribonucleoprotein Prp31 homolog OS=Cucurbita maxima OX=3661 GN=LOC111466452 PE=3 SV=1)

HSP 1 Score: 878.6 bits (2269), Expect = 1.2e-251
Identity = 464/484 (95.87%), Positives = 475/484 (98.14%), Query Frame = 0

Query: 13  LATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 72
           +ATLADSFLADLDELSDEDNF DEED DAENMEEDIDGDLADLESLNYDDLD+VSKLQK+
Sbjct: 1   MATLADSFLADLDELSDEDNFPDEEDVDAENMEEDIDGDLADLESLNYDDLDNVSKLQKS 60

Query: 73  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 132
           QRY+DIMQKV DAL+ DSN+SNQG VLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK
Sbjct: 61  QRYSDIMQKVEDALQKDSNISNQGLVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 120

Query: 133 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 192
           YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL
Sbjct: 121 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 180

Query: 193 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 252
           PEEILQKTIDACDRAL LDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL 
Sbjct: 181 PEEILQKTIDACDRALALDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLA 240

Query: 253 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 312
           +LAKMPACNVQLLGAKKKNLAGFSTATSQFR+GYIEQTEIFQSTPP LKMRACRL++AKS
Sbjct: 241 SLAKMPACNVQLLGAKKKNLAGFSTATSQFRLGYIEQTEIFQSTPPHLKMRACRLVSAKS 300

Query: 313 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 372
           TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL
Sbjct: 301 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 360

Query: 373 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 432
           RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK
Sbjct: 361 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 420

Query: 433 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 492
           VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAH+NQLG GTQSTYFSETGTFSK
Sbjct: 421 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHMNQLGSGTQSTYFSETGTFSK 480

Query: 493 IRKN 497
           IRKN
Sbjct: 481 IRKN 484

BLAST of CmaCh04G017210 vs. NCBI nr
Match: XP_022956658.1 (U4/U6 small nuclear ribonucleoprotein Prp31 homolog isoform X1 [Cucurbita moschata] >XP_023004221.1 U4/U6 small nuclear ribonucleoprotein Prp31 homolog isoform X1 [Cucurbita maxima] >XP_023526610.1 U4/U6 small nuclear ribonucleoprotein Prp31 homolog isoform X1 [Cucurbita pepo subsp. pepo] >KAG6601489.1 U4/U6 small nuclear ribonucleoprotein Prp31-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 909.4 bits (2349), Expect = 1.3e-260
Identity = 483/484 (99.79%), Positives = 484/484 (100.00%), Query Frame = 0

Query: 13  LATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 72
           +ATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT
Sbjct: 1   MATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 60

Query: 73  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 132
           QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK
Sbjct: 61  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 120

Query: 133 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 192
           YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL
Sbjct: 121 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 180

Query: 193 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 252
           PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV
Sbjct: 181 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 240

Query: 253 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 312
           ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS
Sbjct: 241 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 300

Query: 313 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 372
           TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL
Sbjct: 301 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 360

Query: 373 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 432
           RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK
Sbjct: 361 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 420

Query: 433 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 492
           VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK
Sbjct: 421 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 480

Query: 493 IRKN 497
           IRKN
Sbjct: 481 IRKN 484

BLAST of CmaCh04G017210 vs. NCBI nr
Match: KAG7032269.1 (U4/U6 small nuclear ribonucleoprotein Prp31-like protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 908.7 bits (2347), Expect = 2.2e-260
Identity = 483/487 (99.18%), Positives = 485/487 (99.59%), Query Frame = 0

Query: 10  LCALATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKL 69
           + + ATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKL
Sbjct: 60  ISSAATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKL 119

Query: 70  QKTQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFI 129
           QKTQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFI
Sbjct: 120 QKTQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFI 179

Query: 130 RDKYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSG 189
           RDKYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSG
Sbjct: 180 RDKYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSG 239

Query: 190 KPLPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAG 249
           KPLPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAG
Sbjct: 240 KPLPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAG 299

Query: 250 GLVALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIA 309
           GLVALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIA
Sbjct: 300 GLVALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIA 359

Query: 310 AKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGG 369
           AKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGG
Sbjct: 360 AKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGG 419

Query: 370 RRLRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKL 429
           RRLRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKL
Sbjct: 420 RRLRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKL 479

Query: 430 AAKVVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGT 489
           AAKVVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGT
Sbjct: 480 AAKVVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGT 539

Query: 490 FSKIRKN 497
           FSKIRKN
Sbjct: 540 FSKIRKN 546

BLAST of CmaCh04G017210 vs. NCBI nr
Match: XP_022151138.1 (U4/U6 small nuclear ribonucleoprotein Prp31 homolog isoform X1 [Momordica charantia])

HSP 1 Score: 887.9 bits (2293), Expect = 4.1e-254
Identity = 471/484 (97.31%), Positives = 475/484 (98.14%), Query Frame = 0

Query: 13  LATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 72
           +ATLADSFLADLDELSDEDNFLDEED DAENMEEDIDGDLADLESLNYDDLDSVSKLQKT
Sbjct: 1   MATLADSFLADLDELSDEDNFLDEEDVDAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 60

Query: 73  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 132
           QRYNDIMQKV  AL+ DSN+SNQG VLEDDPEYQLIVECN+LSVDIENEIIIIHNFIRDK
Sbjct: 61  QRYNDIMQKVEGALQKDSNISNQGLVLEDDPEYQLIVECNSLSVDIENEIIIIHNFIRDK 120

Query: 133 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 192
           YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL
Sbjct: 121 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 180

Query: 193 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 252
           PEEILQKTIDACDRAL LDSAKK VLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL 
Sbjct: 181 PEEILQKTIDACDRALALDSAKKKVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLA 240

Query: 253 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 312
           ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS
Sbjct: 241 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 300

Query: 313 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 372
           TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL
Sbjct: 301 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 360

Query: 373 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 432
           RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK
Sbjct: 361 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 420

Query: 433 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 492
           VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLG GTQSTYFSETGTFSK
Sbjct: 421 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGSGTQSTYFSETGTFSK 480

Query: 493 IRKN 497
           IRKN
Sbjct: 481 IRKN 484

BLAST of CmaCh04G017210 vs. NCBI nr
Match: XP_022945419.1 (U4/U6 small nuclear ribonucleoprotein Prp31 homolog [Cucurbita moschata] >XP_022945420.1 U4/U6 small nuclear ribonucleoprotein Prp31 homolog [Cucurbita moschata] >KAG6573747.1 U4/U6 small nuclear ribonucleoprotein Prp31-like protein, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 885.2 bits (2286), Expect = 2.6e-253
Identity = 468/484 (96.69%), Positives = 477/484 (98.55%), Query Frame = 0

Query: 13  LATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 72
           +ATLADSFLADLDELSDEDNF DEED DAENMEEDIDGDLADLESLNYDDLD+VSKLQK+
Sbjct: 1   MATLADSFLADLDELSDEDNFPDEEDVDAENMEEDIDGDLADLESLNYDDLDNVSKLQKS 60

Query: 73  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 132
           QRY+DIMQKV DAL+ DSN+SNQG VLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK
Sbjct: 61  QRYSDIMQKVEDALQKDSNISNQGLVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 120

Query: 133 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 192
           YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL
Sbjct: 121 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 180

Query: 193 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 252
           PEEILQKTIDACDRAL LDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL 
Sbjct: 181 PEEILQKTIDACDRALALDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLG 240

Query: 253 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 312
           +LAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRL++AKS
Sbjct: 241 SLAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLVSAKS 300

Query: 313 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 372
           TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL
Sbjct: 301 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 360

Query: 373 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 432
           RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK
Sbjct: 361 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 420

Query: 433 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 492
           VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK
Sbjct: 421 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 480

Query: 493 IRKN 497
           IRKN
Sbjct: 481 IRKN 484

BLAST of CmaCh04G017210 vs. NCBI nr
Match: XP_038892283.1 (U4/U6 small nuclear ribonucleoprotein Prp31 homolog isoform X1 [Benincasa hispida])

HSP 1 Score: 884.4 bits (2284), Expect = 4.5e-253
Identity = 468/484 (96.69%), Positives = 476/484 (98.35%), Query Frame = 0

Query: 13  LATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 72
           +ATLADSFLADLDELSDEDNF+ +EDADAENMEEDIDGDLADLESLNYDDLDSVSKLQ+T
Sbjct: 1   MATLADSFLADLDELSDEDNFVAKEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQQT 60

Query: 73  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 132
           QRYNDIMQKV DAL  DSN+SNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK
Sbjct: 61  QRYNDIMQKVEDALRTDSNISNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 120

Query: 133 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 192
           YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL
Sbjct: 121 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 180

Query: 193 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 252
           P+EILQKTIDACDRAL LDSAKKMVLNFVESRMG+IAPNLSAIVGSAVAAKLMGTAGGL 
Sbjct: 181 PDEILQKTIDACDRALALDSAKKMVLNFVESRMGFIAPNLSAIVGSAVAAKLMGTAGGLA 240

Query: 253 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 312
           ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLI+AKS
Sbjct: 241 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLISAKS 300

Query: 313 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 372
           TLAARVDSTRGDPTGKTGR FKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL
Sbjct: 301 TLAARVDSTRGDPTGKTGRAFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 360

Query: 373 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 432
           RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK
Sbjct: 361 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 420

Query: 433 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 492
           VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLG GTQSTYFSETGTFSK
Sbjct: 421 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGSGTQSTYFSETGTFSK 480

Query: 493 IRKN 497
           IRKN
Sbjct: 481 IRKN 484

BLAST of CmaCh04G017210 vs. TAIR 10
Match: AT1G60170.1 (pre-mRNA processing ribonucleoprotein binding region-containing protein )

HSP 1 Score: 690.6 bits (1781), Expect = 8.9e-199
Identity = 371/485 (76.49%), Positives = 417/485 (85.98%), Query Frame = 0

Query: 13  LATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 72
           +ATL DSFLADLDELSD +  LDE D D    EED+D D+ADLE+LNYDDLD+VSKLQK+
Sbjct: 1   MATLEDSFLADLDELSDNEAELDENDGDVGKEEEDVDMDMADLETLNYDDLDNVSKLQKS 60

Query: 73  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 132
           QRY DIM KV +AL  DS+ + +G VLEDDPEY+LIV+CN LSVDIENEI+I+HNFI+DK
Sbjct: 61  QRYADIMHKVEEALGKDSDGAEKGTVLEDDPEYKLIVDCNQLSVDIENEIVIVHNFIKDK 120

Query: 133 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 192
           Y+LKF ELESLVHHPIDYA VVKKIGNE DL LVDL  LLPSA+IMVVSVTA TT G  L
Sbjct: 121 YKLKFQELESLVHHPIDYACVVKKIGNETDLALVDLADLLPSAIIMVVSVTALTTKGSAL 180

Query: 193 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 252
           PE++LQK ++ACDRAL LDSA+K VL FVES+MG IAPNLSAIVGSAVAAKLMGTAGGL 
Sbjct: 181 PEDVLQKVLEACDRALDLDSARKKVLEFVESKMGSIAPNLSAIVGSAVAAKLMGTAGGLS 240

Query: 253 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 312
           ALAKMPACNVQ+LG K+KNLAGFS+ATSQ RVGY+EQTEI+QSTPP L+ RA RL+AAKS
Sbjct: 241 ALAKMPACNVQVLGHKRKNLAGFSSATSQSRVGYLEQTEIYQSTPPGLQARAGRLVAAKS 300

Query: 313 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 372
           TLAARVD+TRGDP G +G+ F++EI KKIEKWQEPPPA+QPKPLPVPDSEPKK+RGGRRL
Sbjct: 301 TLAARVDATRGDPLGISGKAFREEIRKKIEKWQEPPPARQPKPLPVPDSEPKKRRGGRRL 360

Query: 373 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLA-- 432
           RKMKERY  T+MRKLANRM FG PEESSLGDGLGEGYGMLGQAGS +LRVS+  SKL   
Sbjct: 361 RKMKERYQVTDMRKLANRMAFGTPEESSLGDGLGEGYGMLGQAGSNRLRVSSVPSKLKIN 420

Query: 433 AKVVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTF 492
           AKV KK KE++Y     TSGLTSSLAFTPVQGIEL NPQ  L  LG GTQSTYFSE+GTF
Sbjct: 421 AKVAKKLKERQYAGGATTSGLTSSLAFTPVQGIELCNPQQALG-LGSGTQSTYFSESGTF 480

Query: 493 SKIRK 496
           SK++K
Sbjct: 481 SKLKK 484

BLAST of CmaCh04G017210 vs. TAIR 10
Match: AT1G70400.1 (CONTAINS InterPro DOMAIN/s: NOSIC (InterPro:IPR012976); BEST Arabidopsis thaliana protein match is: pre-mRNA processing ribonucleoprotein binding region-containing protein (TAIR:AT1G60170.1); Has 479 Blast hits to 479 proteins in 178 species: Archae - 0; Bacteria - 0; Metazoa - 125; Fungi - 138; Plants - 124; Viruses - 0; Other Eukaryotes - 92 (source: NCBI BLink). )

HSP 1 Score: 211.5 bits (537), Expect = 1.6e-54
Identity = 117/172 (68.02%), Positives = 136/172 (79.07%), Query Frame = 0

Query: 51  DLADLESLNYDDLDSVSKLQKTQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVE 110
           D+ +L +L YDDLDSVSKLQK++RY DIMQ+V +ALE        G VLE     +LIV+
Sbjct: 2   DMTELNTLTYDDLDSVSKLQKSRRYADIMQQVEEALE--------GSVLE---YKKLIVD 61

Query: 111 CNALSVDIENEIIIIHNFIRDKYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEG 170
           C  L VDIENEI+I+ NFIRDKYR+KF ELE LV HPIDYARVVK+IGNEMDL LVDLEG
Sbjct: 62  CKQLLVDIENEIVIVQNFIRDKYRVKFQELELLVPHPIDYARVVKRIGNEMDLKLVDLEG 121

Query: 171 LLPSAVIMVVSVTASTTSGKPLPEEILQKTIDACDRALTLDSAKKMVLNFVE 223
           LLPSA+IMV+ VTA TT G  LPE++L KTIDACDRAL LDSA+K VL FV+
Sbjct: 122 LLPSAMIMVLLVTALTTKGNQLPEDVLLKTIDACDRALDLDSARKKVLEFVD 162

BLAST of CmaCh04G017210 vs. TAIR 10
Match: AT1G70400.2 (CONTAINS InterPro DOMAIN/s: NOSIC (InterPro:IPR012976); BEST Arabidopsis thaliana protein match is: pre-mRNA processing ribonucleoprotein binding region-containing protein (TAIR:AT1G60170.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 204.1 bits (518), Expect = 2.5e-52
Identity = 113/165 (68.48%), Positives = 131/165 (79.39%), Query Frame = 0

Query: 51  DLADLESLNYDDLDSVSKLQKTQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVE 110
           D+ +L +L YDDLDSVSKLQK++RY DIMQ+V +ALE        G VLE     +LIV+
Sbjct: 2   DMTELNTLTYDDLDSVSKLQKSRRYADIMQQVEEALE--------GSVLE---YKKLIVD 61

Query: 111 CNALSVDIENEIIIIHNFIRDKYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEG 170
           C  L VDIENEI+I+ NFIRDKYR+KF ELE LV HPIDYARVVK+IGNEMDL LVDLEG
Sbjct: 62  CKQLLVDIENEIVIVQNFIRDKYRVKFQELELLVPHPIDYARVVKRIGNEMDLKLVDLEG 121

Query: 171 LLPSAVIMVVSVTASTTSGKPLPEEILQKTIDACDRALTLDSAKK 216
           LLPSA+IMV+ VTA TT G  LPE++L KTIDACDRAL LDSA+K
Sbjct: 122 LLPSAMIMVLLVTALTTKGNQLPEDVLLKTIDACDRALDLDSARK 155

BLAST of CmaCh04G017210 vs. TAIR 10
Match: AT1G70400.3 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: egg cell; CONTAINS InterPro DOMAIN/s: NOSIC (InterPro:IPR012976); BEST Arabidopsis thaliana protein match is: pre-mRNA processing ribonucleoprotein binding region-containing protein (TAIR:AT1G60170.1); Has 484 Blast hits to 484 proteins in 190 species: Archae - 0; Bacteria - 0; Metazoa - 149; Fungi - 134; Plants - 124; Viruses - 0; Other Eukaryotes - 77 (source: NCBI BLink). )

HSP 1 Score: 204.1 bits (518), Expect = 2.5e-52
Identity = 113/165 (68.48%), Positives = 131/165 (79.39%), Query Frame = 0

Query: 51  DLADLESLNYDDLDSVSKLQKTQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVE 110
           D+ +L +L YDDLDSVSKLQK++RY DIMQ+V +ALE        G VLE     +LIV+
Sbjct: 2   DMTELNTLTYDDLDSVSKLQKSRRYADIMQQVEEALE--------GSVLE---YKKLIVD 61

Query: 111 CNALSVDIENEIIIIHNFIRDKYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEG 170
           C  L VDIENEI+I+ NFIRDKYR+KF ELE LV HPIDYARVVK+IGNEMDL LVDLEG
Sbjct: 62  CKQLLVDIENEIVIVQNFIRDKYRVKFQELELLVPHPIDYARVVKRIGNEMDLKLVDLEG 121

Query: 171 LLPSAVIMVVSVTASTTSGKPLPEEILQKTIDACDRALTLDSAKK 216
           LLPSA+IMV+ VTA TT G  LPE++L KTIDACDRAL LDSA+K
Sbjct: 122 LLPSAMIMVLLVTALTTKGNQLPEDVLLKTIDACDRALDLDSARK 155

BLAST of CmaCh04G017210 vs. TAIR 10
Match: AT3G05060.1 (NOP56-like pre RNA processing ribonucleoprotein )

HSP 1 Score: 102.1 bits (253), Expect = 1.4e-21
Identity = 70/245 (28.57%), Positives = 120/245 (48.98%), Query Frame = 0

Query: 85  ALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDKYRLKFPELESLV 144
           +L +  +++        D    +I++   L  D++ E+      +R+ Y   FPEL  ++
Sbjct: 138 SLGLSHSLARYKLKFSSDKVDTMIIQAIGLLDDLDKELNTYAMRVREWYGWHFPELAKII 197

Query: 145 HHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPLPEEILQKTIDAC 204
              I YA+ VK +GN ++   +D   +L   +   +   A  + G  + +  L    + C
Sbjct: 198 SDNILYAKSVKLMGNRVNAAKLDFSEILADEIEADLKDAAVISMGTEVSDLDLLHIRELC 257

Query: 205 DRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLVALAKMPACNVQL 264
           D+ L+L   +  + ++++SRM  IAPNL+A+VG  V A+L+   G L+ L+K P   VQ+
Sbjct: 258 DQVLSLSEYRAQLYDYLKSRMNTIAPNLTALVGELVGARLISHGGSLLNLSKQPGSTVQI 317

Query: 265 LGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKSTLAARVDSTRGD 324
           LGA+K       T  +  + G I    +     P  K +  R +AAK+ LA RVD+  GD
Sbjct: 318 LGAEKALFRALKTKHATPKYGLIFHASLVGQAAPKHKGKISRSLAAKTVLAIRVDAL-GD 377

Query: 325 PTGKT 330
               T
Sbjct: 378 SQDNT 381
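
The percentages on each HSP statistics line follow directly from the raw counts (matches divided by alignment length). The following is a minimal Python sketch for rechecking them. The function name `check_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern only targets the line layout shown on this page.

```python
import re

# Pattern for the statistics line that follows each "HSP n Score" line.
# "Posi?tives" accepts both the "Positives" and "Postives" spellings.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_stats(line):
    """Return (identity_pct, positives_pct) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    # Percentage = matching positions / alignment length, to two decimals.
    identity_pct = round(100 * int(ident) / int(ident_len), 2)
    positives_pct = round(100 * int(pos) / int(pos_len), 2)
    return identity_pct, positives_pct


# Statistics line of the AT1G60170.1 hit reported above.
line = "Identity = 371/485 (76.49%), Positives = 417/485 (85.98%), Query Frame = 0"
print(check_hsp_stats(line))
```

Comparing the returned pair with the percentages printed in parentheses is a quick way to spot a line that was damaged during copying.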

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8RXN61.3e-19776.49U4/U6 small nuclear ribonucleoprotein Prp31 homolog OS=Arabidopsis thaliana OX=3... [more]
Q8CCF05.6e-10546.98U4/U6 small nuclear ribonucleoprotein Prp31 OS=Mus musculus OX=10090 GN=Prpf31 P... [more]
Q5U5C56.2e-10446.67U4/U6 small nuclear ribonucleoprotein Prp31 OS=Xenopus laevis OX=8355 GN=prpf31 ... [more]
Q8WWY38.1e-10446.57U4/U6 small nuclear ribonucleoprotein Prp31 OS=Homo sapiens OX=9606 GN=PRPF31 PE... [more]
Q6NVP64.0e-10346.46U4/U6 small nuclear ribonucleoprotein Prp31 OS=Xenopus tropicalis OX=8364 GN=prp... [more]
Match Name | E-value | Identity | Description
A0A6J1KRI6 | 6.4e-261 | 99.79 | U4/U6 small nuclear ribonucleoprotein Prp31 homolog isoform X1 OS=Cucurbita maxi...
A0A6J1GXP7 | 6.4e-261 | 99.79 | U4/U6 small nuclear ribonucleoprotein Prp31 homolog isoform X1 OS=Cucurbita mosc...
A0A6J1DBC9 | 2.0e-254 | 97.31 | U4/U6 small nuclear ribonucleoprotein Prp31 homolog isoform X1 OS=Momordica char...
A0A6J1G0W2 | 1.3e-253 | 96.69 | U4/U6 small nuclear ribonucleoprotein Prp31 homolog OS=Cucurbita moschata OX=366...
A0A6J1HSU1 | 1.2e-251 | 95.87 | U4/U6 small nuclear ribonucleoprotein Prp31 homolog OS=Cucurbita maxima OX=3661 ...
Match Name | E-value | Identity | Description
XP_022956658.1 | 1.3e-260 | 99.79 | U4/U6 small nuclear ribonucleoprotein Prp31 homolog isoform X1 [Cucurbita moscha...
KAG7032269.1 | 2.2e-260 | 99.18 | U4/U6 small nuclear ribonucleoprotein Prp31-like protein, partial [Cucurbita arg...
XP_022151138.1 | 4.1e-254 | 97.31 | U4/U6 small nuclear ribonucleoprotein Prp31 homolog isoform X1 [Momordica charan...
XP_022945419.1 | 2.6e-253 | 96.69 | U4/U6 small nuclear ribonucleoprotein Prp31 homolog [Cucurbita moschata] >XP_022...
XP_038892283.1 | 4.5e-253 | 96.69 | U4/U6 small nuclear ribonucleoprotein Prp31 homolog isoform X1 [Benincasa hispid...
Match Name | E-value | Identity | Description
AT1G60170.1 | 8.9e-199 | 76.49 | pre-mRNA processing ribonucleoprotein binding region-containing protein
AT1G70400.1 | 1.6e-54 | 68.02 | CONTAINS InterPro DOMAIN/s: NOSIC (InterPro:IPR012976); BEST Arabidopsis thalian...
AT1G70400.2 | 2.5e-52 | 68.48 | CONTAINS InterPro DOMAIN/s: NOSIC (InterPro:IPR012976); BEST Arabidopsis thalian...
AT1G70400.3 | 2.5e-52 | 68.48 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT3G05060.1 | 1.4e-21 | 28.57 | NOP56-like pre RNA processing ribonucleoprotein
InterPro
Analysis Name: InterPro Annotations of Cucurbita maxima (Rimu) v1.1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR012976 | NOSIC | SMART | SM00931 | NOSIC_2 | coord: 107..159; e-value: 6.4E-23; score: 92.1
IPR042239 | Nop, C-terminal domain | GENE3D | 1.10.246.90 | Nop domain | coord: 229..354; e-value: 2.0E-40; score: 139.4
None | No IPR available | GENE3D | 1.10.287.4070 |  | coord: 90..228; e-value: 1.8E-56; score: 191.9
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 342..373
None | No IPR available | PANTHER | PTHR13904:SF0 | U4/U6 SMALL NUCLEAR RIBONUCLEOPROTEIN PRP31 | coord: 14..493
IPR019175 | Prp31 C-terminal | PFAM | PF09785 | Prp31_C | coord: 351..468; e-value: 4.0E-43; score: 147.3
IPR002687 | Nop domain | PFAM | PF01798 | Nop | coord: 115..343; e-value: 5.9E-73; score: 245.1
IPR002687 | Nop domain | PROSITE | PS51358 | NOP | coord: 228..346; score: 37.437477
IPR027105 | U4/U6 small nuclear ribonucleoprotein Prp31 | PANTHER | PTHR13904 | PRE-MRNA SPLICING FACTOR PRP31 | coord: 14..493
IPR036070 | Nop domain superfamily | SUPERFAMILY | 89124 | Nop domain | coord: 100..346
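
The `coord:` values in the InterPro table can be used to ask which predicted domains cover a given residue. The following is a minimal Python sketch, assuming the coordinates are 1-based, inclusive residue positions on the predicted protein (the page does not state this). The names `HITS`, `span`, `span_length`, and `hits_covering` are hypothetical helpers introduced only for illustration, and only four of the table rows are copied in.

```python
import re

# (source, source term, alignment) for a few rows of the InterPro table above.
HITS = [
    ("SMART", "SM00931", "coord: 107..159"),
    ("PFAM", "PF01798", "coord: 115..343"),
    ("PFAM", "PF09785", "coord: 351..468"),
    ("MOBIDB_LITE", "mobidb-lite", "coord: 342..373"),
]

COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def span(alignment):
    """Parse 'coord: start..end' into an (int, int) tuple."""
    m = COORD.search(alignment)
    if m is None:
        raise ValueError("no coord field found")
    return int(m.group(1)), int(m.group(2))


def span_length(alignment):
    """Number of residues in the span, assuming inclusive coordinates."""
    start, end = span(alignment)
    return end - start + 1


def hits_covering(position, hits=HITS):
    """Source terms whose span contains the given residue position."""
    covering = []
    for _source, term, alignment in hits:
        start, end = span(alignment)
        if start <= position <= end:
            covering.append(term)
    return covering


print(span_length("coord: 107..159"))
print(hits_covering(120))
```

Under the inclusive-coordinate assumption, the same helpers extend to the remaining rows by appending them to `HITS`.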

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CmaCh04G017210.1 | CmaCh04G017210.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0000244 | spliceosomal tri-snRNP complex assembly
biological_process | GO:0000398 | mRNA splicing, via spliceosome
cellular_component | GO:0071011 | precatalytic spliceosome
cellular_component | GO:0005687 | U4 snRNP
cellular_component | GO:0046540 | U4/U6 x U5 tri-snRNP complex
molecular_function | GO:0003723 | RNA binding