CmaCh04G017210 (gene) Cucurbita maxima (Rimu)

Name: CmaCh04G017210
Type: gene
Organism: Cucurbita maxima (Rimu)
Description: U4/U6 small nuclear ribonucleoprotein Prp31
Location: Cma_Chr04 : 8656339 .. 8659980 (+)
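
The location field can be read programmatically. The sketch below is a minimal illustration, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual GFF convention), which the page itself does not state.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into its four parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

seqid, start, end, strand = parse_location("Cma_Chr04 : 8656339 .. 8659980 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 3642 bases on the plus strand of Cma_Chr04.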
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGACGTTTAGGTTTCAGCACTTTCCTTTGTGCGCTGGTTTATTTTAGTTCTTACGTTCTTGCGTGTAAACTCAAATTGACCTATCTAGGAATTGTGGACTTTGGAGTGTGCTATTAAATCTGTCGCTTAATTCTTTATCAGCTGCTAATTTTGTGTTTCTAATCGTTCTTCCTTGAATTATAGAGTTTCTCAAGAGATAGCTTGTTCTGCCTACAGGGAGCATGTATATAAGAATTTTCTGAGTTAGCTTATAATTTATATGGTTGGTAAAGGAGAACTTGATTACTTTCTCTCTCGGATGTAATTTATTAAACTAATGACCAAGTTTTGAATCGTTACACTTCTTTTCTACTTTATTTCATACATTCATTAGGTTTTTAAACTTGCTATTGTGCTTCTTCTTTCAAATGTTCTTTAAGATCTATCTTGAAAATGTGAGTTATCTTGTCTATTAATTGTCATATTATTTTCTTCTGTTTGTAGGCTACTTTGGCTGATTCTTTTCTAGCAGATCTTGATGAACTCTCTGATGAAGACAATTTTCTGGTTAGCAAATGTTTTCTTACCTTTCTCTCTGGAAGATCAATTCATCATTATTTTGCTTGGTAGGAGTGTGAATCCATCTACCTTATATTTTGAGGCAATAGTTATTGGATGATTACAGGATGAAGAAGATGCTGATGCCGAAAATATGGAAGAGGATATTGATGGGGATCTTGCAGATCTAGAAAGCCTTAATTATGATGATCTGGATAGTGTATCAAAATTGCAGAAAACACAGAGATACAATGATATTATGCAGGTTAGTTTACATGAAAGTTTCCTTATTGACCTTGGTGTGGCTTGAATAGAAAGGGGTTAGAAGAGAGGCAATGAATATATTCAATCCTTGCTGTCTTGTCTATTATTCTAACCTTTTAGTGAGTGATAGGTTTTGGGATGTTGTATAATTATTGGATCTTTTGCAATATATCTCTCATGAATAATGATGTTCACTGCACTTCCCCCTCGATTTGATTGGATTTGTTGGCATCTGAATTACAGAAAGTGGCAGATGCATTGGAGATGGATTCTAATGTCTCAAATCAGGGATTTGTATTAGAAGATGATCCTGAGTACCAACTGATTGTAGAGTGTAATGCCTTGTCAGTGGATATTGAGAATGAGATTATTATTATTCACAATTTTATACGTGATAAATATCGACTAAAATTTCCAGAGCTTGAATCACTTGTGCACCATCCAATCGATTATGCTCGAGTTGTTAAGAAGATTGGGAATGAAATGGATTTGACTCTTGTAGATTTAGAAGGGCTTTTACCCTCTGCTGTCATTATGGTTGTCTCTGTTACAGCATCTACCACAAGTGGCAAGCCACTTCCGGAGGAAATTCTCCAGAAAACAATTGATGCATGTGATCGAGCTCTTACTTTAGATTCAGCAAAGAAAATGGTCCTTAATTTTGTTGAAAGTAGAATGGGATATATTGCACCAAATCTTTCAGCGATAGTTGGAAGTGCTGTTGCAGCAAAACTAATGGGAACTGCTGGTGGCCTTGTTGCCTTAGCTAAGATGCCTGCTTGCAATGTTCAGCTTCTTGGTGCAAAGAAAAAAAACCTTGCTGGGTTTTCCACAGCGACCTCACAATTTCGAGTAGGTTATATTGAGCAAACAGAGATATTCCAATCAACACCTCCTCCTTTGAAGATGCGTGCTTGTCGACTCATAGCGGCGAAGTCAACACTTGCAGCACGAGTTGACTCCACCAGGGGTGACCCTACAGGGAAGACGGGTAGAGTCTTCAAAGATGAGATCCTTAAGAAAATTGAGAAGTGGCAAGAACCTCCACCTGCAAAACAACCAAAACCTCTTCCTGTCCCTGATTCTGAACCCAAAAAGAAGAGAGGTGGCCGTCGATTAAGGAAGATGAAGGAAAGGTATATGTTACATTTGTCTTTCATCTCTCTAGCAATAGAAGCTTTGATCTGTTGCACTTT
TCCGTTGTTCATGGATTGATATGGATTTCCTTGTCAGGTATGCAACGACGGAGATGAGGAAGCTAGCTAACAGGATGCAGTTTGGGGTGCCTGAAGAGAGTTCTTTAGGTAAAACATTCTTTCATTTCTGAATAATCTGGATTGATCAAATTGATAGTTTTTCTTCTGCTACGTTTTGTGTTACGCATTGGAATGAGATGCAATGTTATGCTGTTCAGCAAGTTTGAGTTGTCATGTTGTCTGGTGAACCAGAAAATTTTCTGAATTGCAATAGTAACATCATAGCCGTCTTGAGTCTGTCCTATTACAGGATTATTTCTTATCAAACGTAGTTACATTTTTGTCTGCATACCAGCTGTTTCTTACAATTGTCCTGAACGGAAAAAGTAGTTGCTTGAGGTATTCTGCTGTAATATTCTGAATAAGCATTTCACTTCTTGATTTTTCATGCCACAACAGGAGATGGATTGGGGGAAGGGTATGGAATGCTTGGTCAGGCTGGGAGTGGCAAGTTGCGTGTGTCAGCTGCTCAGAGCAAGCTTGCCGCAAAAGTTGTTAAGAAGTAAGTCAAGTCATTGAGCTGATCCTAATAACCACACCCACCCATCCACTTGCGCGCGCACACACGCAAGTAATAATGAAAAAAGAAAAGAAAGAAAAAAGAATAATAACTATAACAAGGAGTGGAAAATGATATTTGTAGCATAACATTCGCTTCTTCATCTGTTATCCTTTACTTATCTTCTTATCACGAATAGGTTCAAGGAAAAACGCTATGGAAGCAGTGGTGCTACATCCGGGCTGACCTCAAGTTTGGCATTTACTCCAGTACAAGTCAGTATTTTTTGTCCTCCTCACCATCTCCTCCTATGTTATCTAAATTTGCCTCGGAAGTACTTGTTTTATTCATGACACTGCCTGGGATGCTTAATATCAAGGTTGTGAAAATTTCCTGTTTTCCATGTAGGGAATCGAGCTGTCGAATCCTCAGGCCCATTTGAACCAGCTAGGCGGTGGAACTCAAAGCACCTACTTTTCTGAAACAGGAACATTTTCAAAGATCAGGAAAAACTGAGTCATTAGACCATCAAGCTTTATGTTTCGTGCCCATGCTGTAATCATACATGGATATGGCTTACGATTAGGCTTTGTAATTCAGTACTAACAGAAAGCCACAAGAGTTTATGTTTTTATAACTCAGAATGTTTCTTATTCCTGCTTGCTTGATCTTTTTTTGGCTTTAGTTGACTGATTAGTGTTATCATTTTGGCCGGCGTTGATATTATATGGGATGCCATACACTATAACTGGTGGATTTGTCTTGGTCAGTCTCTTATTTAGTGCCATCAAATAATAGCTAGTAGTACTTCCTGTCTGATTTCCTCTAAGTTGAAAATCATAGAAACTTATGCCATACCATTCTCTGTATTCCCATGAGAGGAAGATTAGTACCACTGTGGGCTTTGATTTGGCATCTATGATATGCGGCTATGCTCTTCTCTTAATGTATCTCTCATTTCATTCAGGAATGGCTATCATATATCACTGATTTGGGTAGTTCAAAAAGGTTGCATCTATGGGTGAAGCTTTCTTGCCCTATGTAATGTAATATCTTTCAATTTTCTTTGACCCGACCAATCCG

mRNA sequence

ATGCGACGTTTAGGTTTCAGCACTTTCCTTTGTGCGCTGGCTACTTTGGCTGATTCTTTTCTAGCAGATCTTGATGAACTCTCTGATGAAGACAATTTTCTGGATGAAGAAGATGCTGATGCCGAAAATATGGAAGAGGATATTGATGGGGATCTTGCAGATCTAGAAAGCCTTAATTATGATGATCTGGATAGTGTATCAAAATTGCAGAAAACACAGAGATACAATGATATTATGCAGAAAGTGGCAGATGCATTGGAGATGGATTCTAATGTCTCAAATCAGGGATTTGTATTAGAAGATGATCCTGAGTACCAACTGATTGTAGAGTGTAATGCCTTGTCAGTGGATATTGAGAATGAGATTATTATTATTCACAATTTTATACGTGATAAATATCGACTAAAATTTCCAGAGCTTGAATCACTTGTGCACCATCCAATCGATTATGCTCGAGTTGTTAAGAAGATTGGGAATGAAATGGATTTGACTCTTGTAGATTTAGAAGGGCTTTTACCCTCTGCTGTCATTATGGTTGTCTCTGTTACAGCATCTACCACAAGTGGCAAGCCACTTCCGGAGGAAATTCTCCAGAAAACAATTGATGCATGTGATCGAGCTCTTACTTTAGATTCAGCAAAGAAAATGGTCCTTAATTTTGTTGAAAGTAGAATGGGATATATTGCACCAAATCTTTCAGCGATAGTTGGAAGTGCTGTTGCAGCAAAACTAATGGGAACTGCTGGTGGCCTTGTTGCCTTAGCTAAGATGCCTGCTTGCAATGTTCAGCTTCTTGGTGCAAAGAAAAAAAACCTTGCTGGGTTTTCCACAGCGACCTCACAATTTCGAGTAGGTTATATTGAGCAAACAGAGATATTCCAATCAACACCTCCTCCTTTGAAGATGCGTGCTTGTCGACTCATAGCGGCGAAGTCAACACTTGCAGCACGAGTTGACTCCACCAGGGGTGACCCTACAGGGAAGACGGGTAGAGTCTTCAAAGATGAGATCCTTAAGAAAATTGAGAAGTGGCAAGAACCTCCACCTGCAAAACAACCAAAACCTCTTCCTGTCCCTGATTCTGAACCCAAAAAGAAGAGAGGTGGCCGTCGATTAAGGAAGATGAAGGAAAGGTATGCAACGACGGAGATGAGGAAGCTAGCTAACAGGATGCAGTTTGGGGTGCCTGAAGAGAGTTCTTTAGGAGATGGATTGGGGGAAGGGTATGGAATGCTTGGTCAGGCTGGGAGTGGCAAGTTGCGTGTGTCAGCTGCTCAGAGCAAGCTTGCCGCAAAAGTTGTTAAGAAGTTCAAGGAAAAACGCTATGGAAGCAGTGGTGCTACATCCGGGCTGACCTCAAGTTTGGCATTTACTCCAGTACAAGGAATCGAGCTGTCGAATCCTCAGGCCCATTTGAACCAGCTAGGCGGTGGAACTCAAAGCACCTACTTTTCTGAAACAGGAACATTTTCAAAGATCAGGAAAAACTGAGTCATTAGACCATCAAGCTTTATGTTTCGTGCCCATGCTGTAATCATACATGGATATGGCTTACGATTAGGCTTTGTAATTCAGTACTAACAGAAAGCCACAAGAGTTTATGTTTTTATAACTCAGAATGTTTCTTATTCCTGCTTGCTTGATCTTTTTTTGGCTTTAGTTGACTGATTAGTGTTATCATTTTGGCCGGCGTTGATATTATATGGGATGCCATACACTATAACTGGTGGATTTGTCTTGGTCAGTCTCTTATTTAGTGCCATCAAATAATAGCTAGTAGTACTTCCTGTCTGATTTCCTCTAAGTTGAAAATCATAGAAACTTATGCCATACCATTCTCTGTATTCCCATGAGAGGAAGATTAGTACCACTGTGGGCTTTGATTTGGCATCTATGATATGCGGCTATGCTCTTCTCTTAATGTATCTCTCATTTCATTCAGGAATGGCTATCATATATCACTGATTTGGGTAGTTCAAAAAGGTTGCATCTATGGGTGA
AGCTTTCTTGCCCTATGTAATGTAATATCTTTCAATTTTCTTTGACCCGACCAATCCG

Coding sequence (CDS)

ATGCGACGTTTAGGTTTCAGCACTTTCCTTTGTGCGCTGGCTACTTTGGCTGATTCTTTTCTAGCAGATCTTGATGAACTCTCTGATGAAGACAATTTTCTGGATGAAGAAGATGCTGATGCCGAAAATATGGAAGAGGATATTGATGGGGATCTTGCAGATCTAGAAAGCCTTAATTATGATGATCTGGATAGTGTATCAAAATTGCAGAAAACACAGAGATACAATGATATTATGCAGAAAGTGGCAGATGCATTGGAGATGGATTCTAATGTCTCAAATCAGGGATTTGTATTAGAAGATGATCCTGAGTACCAACTGATTGTAGAGTGTAATGCCTTGTCAGTGGATATTGAGAATGAGATTATTATTATTCACAATTTTATACGTGATAAATATCGACTAAAATTTCCAGAGCTTGAATCACTTGTGCACCATCCAATCGATTATGCTCGAGTTGTTAAGAAGATTGGGAATGAAATGGATTTGACTCTTGTAGATTTAGAAGGGCTTTTACCCTCTGCTGTCATTATGGTTGTCTCTGTTACAGCATCTACCACAAGTGGCAAGCCACTTCCGGAGGAAATTCTCCAGAAAACAATTGATGCATGTGATCGAGCTCTTACTTTAGATTCAGCAAAGAAAATGGTCCTTAATTTTGTTGAAAGTAGAATGGGATATATTGCACCAAATCTTTCAGCGATAGTTGGAAGTGCTGTTGCAGCAAAACTAATGGGAACTGCTGGTGGCCTTGTTGCCTTAGCTAAGATGCCTGCTTGCAATGTTCAGCTTCTTGGTGCAAAGAAAAAAAACCTTGCTGGGTTTTCCACAGCGACCTCACAATTTCGAGTAGGTTATATTGAGCAAACAGAGATATTCCAATCAACACCTCCTCCTTTGAAGATGCGTGCTTGTCGACTCATAGCGGCGAAGTCAACACTTGCAGCACGAGTTGACTCCACCAGGGGTGACCCTACAGGGAAGACGGGTAGAGTCTTCAAAGATGAGATCCTTAAGAAAATTGAGAAGTGGCAAGAACCTCCACCTGCAAAACAACCAAAACCTCTTCCTGTCCCTGATTCTGAACCCAAAAAGAAGAGAGGTGGCCGTCGATTAAGGAAGATGAAGGAAAGGTATGCAACGACGGAGATGAGGAAGCTAGCTAACAGGATGCAGTTTGGGGTGCCTGAAGAGAGTTCTTTAGGAGATGGATTGGGGGAAGGGTATGGAATGCTTGGTCAGGCTGGGAGTGGCAAGTTGCGTGTGTCAGCTGCTCAGAGCAAGCTTGCCGCAAAAGTTGTTAAGAAGTTCAAGGAAAAACGCTATGGAAGCAGTGGTGCTACATCCGGGCTGACCTCAAGTTTGGCATTTACTCCAGTACAAGGAATCGAGCTGTCGAATCCTCAGGCCCATTTGAACCAGCTAGGCGGTGGAACTCAAAGCACCTACTTTTCTGAAACAGGAACATTTTCAAAGATCAGGAAAAACTGA

Protein sequence

MRRLGFSTFLCALATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKTQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDKYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPLPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLVALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRLRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAKVVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSKIRKN
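
The protein sequence is the conceptual translation of the CDS. The sketch below shows that relationship on the first and last few codons of the CDS. It assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative, not taken from the database.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        # Ambiguous bases such as N are not handled and raise KeyError.
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGCGACGTTTAGGTTTC"))  # first six codons of the CDS
print(translate("ATCAGGAAAAACTGA"))     # last five codons, ending in the TGA stop
```

The two calls print `MRRLGF` and `IRKN`, matching the start and end of the protein sequence above.
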
BLAST of CmaCh04G017210 vs. Swiss-Prot
Match: PRP31_MOUSE (U4/U6 small nuclear ribonucleoprotein Prp31 OS=Mus musculus GN=Prpf31 PE=1 SV=3)

HSP 1 Score: 382.9 bits (982), Expect = 5.4e-105
Identity = 233/496 (46.98%), Positives = 328/496 (66.13%), Query Frame = 1

Query: 15  TLADSFLADLDELSDED---NFLDEEDADA-ENMEEDIDGDLADLESLNYDDLDSVSKLQ 74
           +LAD  LADL+E ++E+   ++ +EE+  A E+++E+   DL+       D + S++KL 
Sbjct: 2   SLADELLADLEEAAEEEEGGSYGEEEEEPAIEDVQEETQLDLSG------DSVKSIAKLW 61

Query: 75  KTQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIR 134
            ++ + +IM K+ + +   +NVS     +E  PEY++IV+ N L+V+IENE+ IIH FIR
Sbjct: 62  DSKMFAEIMMKIEEYISKQANVSEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIR 121

Query: 135 DKYRLKFPELESLVHHPIDYARVVKKIGNEMDLTL--VDLEGLLPSAVIMVVSVTASTTS 194
           DKY  +FPELESLV + +DY R VK++GN +D      +L+ +L +A IMVVSVTASTT 
Sbjct: 122 DKYSKRFPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQ 181

Query: 195 GKPLPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTA 254
           G+ L +E L++  +ACD AL L+++K  +  +VESRM +IAPNLS I+G++ AAK+MG A
Sbjct: 182 GQQLSDEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVA 241

Query: 255 GGLVALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLI 314
           GGL  L+KMPACN+ LLGA++K L+GFS+ +     GYI  ++I QS PP L+ +A RL+
Sbjct: 242 GGLTNLSKMPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLV 301

Query: 315 AAKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRG 374
           AAK TLAARVDS      GK G   KDEI +K +KWQEPPP KQ KPLP P    +KKRG
Sbjct: 302 AAKCTLAARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRG 361

Query: 375 GRRLRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLR---VSAA 434
           GRR RKMKER   TE+RK ANRM FG  EE +  + LG   G LG++GSG++R   V+ A
Sbjct: 362 GRRYRKMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVRQTQVNEA 421

Query: 435 QSKLAAKVVKKFKEKR---YGSSGA----TSGLTSSLAFTPVQGIELSNPQAHLNQLGGG 494
                +K +++  +K+   YG        +SG  SS+AFTP+QG+E+ NPQA   ++   
Sbjct: 422 TKARISKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEA 481
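
The percentages on each HSP statistics line follow directly from the match counts: identical (or positive-scoring) positions divided by the alignment length. As a minimal sketch, the snippet below parses one such line and recomputes both values. The helper names `parse_stats` and `percent` are hypothetical, and the pattern accepts either spelling of the "Positives" label.

```python
import re

# Matches e.g. "Identity = 233/496 (46.98%)"; "Posi?tives" tolerates "Postives".
STAT_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def parse_stats(line):
    """Return {'identity': (matches, length, pct), 'positives': (...)}."""
    stats = {}
    for label, num, den, pct in STAT_RE.findall(line):
        key = "identity" if label == "Identity" else "positives"
        stats[key] = (int(num), int(den), float(pct))
    return stats

def percent(num, den):
    # Percentage of aligned positions, rounded to two decimals as in the report.
    return round(100.0 * num / den, 2)

line = "Identity = 233/496 (46.98%), Positives = 328/496 (66.13%), Query Frame = 1"
for key, (num, den, reported) in parse_stats(line).items():
    print(key, percent(num, den), reported)
```

For the PRP31_MOUSE hit, 233/496 and 328/496 reproduce the reported 46.98% and 66.13%.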

BLAST of CmaCh04G017210 vs. Swiss-Prot
Match: PRP31_XENLA (U4/U6 small nuclear ribonucleoprotein Prp31 OS=Xenopus laevis GN=prpf31 PE=2 SV=1)

HSP 1 Score: 379.4 bits (973), Expect = 6.0e-104
Identity = 231/495 (46.67%), Positives = 324/495 (65.45%), Query Frame = 1

Query: 15  TLADSFLADLDELSDED--NFLDEEDADA-ENMEEDIDGDLADLESLNYDDLDSVSKLQK 74
           +LAD  LADL+E ++E+  N +DE+D +  E ++E++  DL      N + + S++KL  
Sbjct: 2   SLADELLADLEEAAEEEEENLIDEDDLETIEEVDEEMQVDL------NAESVKSIAKLSD 61

Query: 75  TQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRD 134
           ++ +++I+ K+   ++     S     +E  PEY++IV+ N L+V+IENE+ IIH FIRD
Sbjct: 62  SKLFSEILLKIEGYIQKQPKASEVMGPVEAAPEYKVIVDANNLTVEIENELNIIHKFIRD 121

Query: 135 KYRLKFPELESLVHHPIDYARVVKKIGNEMDLTL--VDLEGLLPSAVIMVVSVTASTTSG 194
           KY  +FPELESLV + +DY R VK++GN +D      +L+ +L +A IMVVSVTASTT G
Sbjct: 122 KYSKRFPELESLVPNALDYIRTVKELGNNLDKCKNNENLQQILTNATIMVVSVTASTTQG 181

Query: 195 KPLPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAG 254
           + L +E L++  +ACD AL L+ +K  +  +VESRM +IAPNLS IVG++ AAK+MG AG
Sbjct: 182 QQLTDEELERIEEACDMALELNQSKHRIYEYVESRMSFIAPNLSIIVGASTAAKIMGIAG 241

Query: 255 GLVALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIA 314
           GL  L+KMPACNV LLGA++K L GFS+ +     GYI  +EI QS P  L  +A RL++
Sbjct: 242 GLTNLSKMPACNVMLLGAQRKTLTGFSSTSVLPHTGYIYHSEIVQSLPSDLHRKAARLVS 301

Query: 315 AKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGG 374
           AK TLA+RVDS   +P GK G   K+EI +K +KWQEPPP KQ KPLP P    +KKRGG
Sbjct: 302 AKCTLASRVDSFHENPEGKIGYDLKEEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGG 361

Query: 375 RRLRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLR---VSAAQ 434
           RR RKMKER   TE+RK ANRM FG  EE +  + LG   G LG++GSG++R   V+ A 
Sbjct: 362 RRYRKMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRIRQAQVNEAT 421

Query: 435 SKLAAKVVKKFKEKR---YGSSGA----TSGLTSSLAFTPVQGIELSNPQAHLNQLGGGT 494
               +K +++  +K+   YG        +SG  SS+AFTP+QG+E+ NPQA   ++    
Sbjct: 422 KARISKTLQRTLQKQSVVYGGKSTVRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEAN 481

BLAST of CmaCh04G017210 vs. Swiss-Prot
Match: PRP31_HUMAN (U4/U6 small nuclear ribonucleoprotein Prp31 OS=Homo sapiens GN=PRPF31 PE=1 SV=2)

HSP 1 Score: 379.0 bits (972), Expect = 7.8e-104
Identity = 231/496 (46.57%), Positives = 326/496 (65.73%), Query Frame = 1

Query: 15  TLADSFLADLDELSDED---NFLDEEDADA-ENMEEDIDGDLADLESLNYDDLDSVSKLQ 74
           +LAD  LADL+E ++E+   ++ +EE+  A E+++E+   DL+       D + +++KL 
Sbjct: 2   SLADELLADLEEAAEEEEGGSYGEEEEEPAIEDVQEETQLDLSG------DSVKTIAKLW 61

Query: 75  KTQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIR 134
            ++ + +IM K+ + +   +  S     +E  PEY++IV+ N L+V+IENE+ IIH FIR
Sbjct: 62  DSKMFAEIMMKIEEYISKQAKASEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKFIR 121

Query: 135 DKYRLKFPELESLVHHPIDYARVVKKIGNEMDLTL--VDLEGLLPSAVIMVVSVTASTTS 194
           DKY  +FPELESLV + +DY R VK++GN +D      +L+ +L +A IMVVSVTASTT 
Sbjct: 122 DKYSKRFPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIMVVSVTASTTQ 181

Query: 195 GKPLPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTA 254
           G+ L EE L++  +ACD AL L+++K  +  +VESRM +IAPNLS I+G++ AAK+MG A
Sbjct: 182 GQQLSEEELERLEEACDMALELNASKHRIYEYVESRMSFIAPNLSIIIGASTAAKIMGVA 241

Query: 255 GGLVALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLI 314
           GGL  L+KMPACN+ LLGA++K L+GFS+ +     GYI  ++I QS PP L+ +A RL+
Sbjct: 242 GGLTNLSKMPACNIMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLRRKAARLV 301

Query: 315 AAKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRG 374
           AAK TLAARVDS      GK G   KDEI +K +KWQEPPP KQ KPLP P    +KKRG
Sbjct: 302 AAKCTLAARVDSFHESTEGKVGYELKDEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRG 361

Query: 375 GRRLRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLR---VSAA 434
           GRR RKMKER   TE+RK ANRM FG  EE +  + LG   G LG++GSG++R   V+ A
Sbjct: 362 GRRYRKMKERLGLTEIRKQANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVRQTQVNEA 421

Query: 435 QSKLAAKVVKKFKEKR---YGSSGA----TSGLTSSLAFTPVQGIELSNPQAHLNQLGGG 494
                +K +++  +K+   YG        +SG  SS+AFTP+QG+E+ NPQA   ++   
Sbjct: 422 TKARISKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEA 481

BLAST of CmaCh04G017210 vs. Swiss-Prot
Match: PRP31_XENTR (U4/U6 small nuclear ribonucleoprotein Prp31 OS=Xenopus tropicalis GN=prpf31 PE=2 SV=1)

HSP 1 Score: 376.7 bits (966), Expect = 3.9e-103
Identity = 230/495 (46.46%), Positives = 323/495 (65.25%), Query Frame = 1

Query: 15  TLADSFLADLDELSDED--NFLDEEDADA-ENMEEDIDGDLADLESLNYDDLDSVSKLQK 74
           +LAD  LADL+E ++E+  N +DE+D +  E ++E++  DL      N + + S++KL  
Sbjct: 2   SLADELLADLEEAAEEEEENLIDEDDLETIEEVQEEMQVDL------NAESVKSIAKLSD 61

Query: 75  TQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRD 134
           ++ +++I+ K+   ++     S     +E  PEY++IV+ N L+V+IENE+ IIH FIRD
Sbjct: 62  SKLFSEILLKIDGYIKKQPKASEVMGPVEAAPEYKVIVDANNLTVEIENELNIIHKFIRD 121

Query: 135 KYRLKFPELESLVHHPIDYARVVKKIGNEMDLTL--VDLEGLLPSAVIMVVSVTASTTSG 194
           KY  +FPELESLV + +DY R VK++GN +D      +L+ +L +A IMVVSVTASTT G
Sbjct: 122 KYSKRFPELESLVPNALDYIRTVKELGNNLDKCKNNENLQQILTNATIMVVSVTASTTQG 181

Query: 195 KPLPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAG 254
           + L +E L++  +ACD AL L+ +K  +  +VESRM +IAPNLS IVG++ AAK+MG AG
Sbjct: 182 QQLTDEELERIEEACDMALELNQSKHRIYEYVESRMSFIAPNLSIIVGASTAAKIMGIAG 241

Query: 255 GLVALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIA 314
           GL  L+KMPACNV LLGA++K L+GFS+ +     GYI  ++I QS PP L  +A RL++
Sbjct: 242 GLTNLSKMPACNVMLLGAQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPDLHRKAARLVS 301

Query: 315 AKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGG 374
           AK TLAARVDS      GK G   K+EI +K +KWQEPPP KQ KPLP P    +KKRGG
Sbjct: 302 AKCTLAARVDSFHESSEGKVGYDLKEEIERKFDKWQEPPPVKQVKPLPAPLDGQRKKRGG 361

Query: 375 RRLRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLR---VSAAQ 434
           RR RKMKER   TE+RK ANRM F   EE +  + LG   G LG++GSG++R   V+ A 
Sbjct: 362 RRYRKMKERLGLTEIRKQANRMSFAEIEEDAYQEDLGFSLGHLGKSGSGRIRQAQVNEAT 421

Query: 435 SKLAAKVVKKFKEKR---YGSSGA----TSGLTSSLAFTPVQGIELSNPQAHLNQLGGGT 494
               +K +++  +K+   YG        +SG  SS+AFTP+QG+E+ NPQA   ++    
Sbjct: 422 KARISKTLQRTLQKQSVVYGGKSTIRDRSSGTASSVAFTPLQGLEIVNPQAAEKKVAEAN 481

BLAST of CmaCh04G017210 vs. Swiss-Prot
Match: PRP31_DANRE (U4/U6 small nuclear ribonucleoprotein Prp31 OS=Danio rerio GN=prpf31 PE=2 SV=1)

HSP 1 Score: 370.9 bits (951), Expect = 2.1e-101
Identity = 224/502 (44.62%), Positives = 317/502 (63.15%), Query Frame = 1

Query: 15  TLADSFLADLDELSDEDNFL---DEEDADAENMEEDIDGDLADL-ESLNYD-----DLDS 74
           +LAD  LADL+E  +ED      +E ++D E  E  +DG L D+ E +  D      + S
Sbjct: 2   SLADELLADLEEAGEEDGLYPGGEEGESDGEPGERQVDGGLEDIPEEMEVDYSSTESVTS 61

Query: 75  VSKLQKTQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIII 134
           ++KL+ ++ + +IM K++  +      S     +E DPEY+LIV  N L+V+I+NE+ II
Sbjct: 62  IAKLRHSKPFAEIMDKISHYVGNQRKNSEVSGPVEADPEYRLIVAANNLTVEIDNELNII 121

Query: 135 HNFIRDKYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVD--LEGLLPSAVIMVVSVT 194
           H F+RDKY  +FPELESLV + +DY R VK++GN ++    +  L+ +L +A IMVVSVT
Sbjct: 122 HKFVRDKYSKRFPELESLVPNALDYIRTVKELGNNLEKCKNNETLQQILTNATIMVVSVT 181

Query: 195 ASTTSGKPLPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAK 254
           ASTT G  L ++ LQ+  +ACD AL L+ +K  +  +VESRM +IAPNLS IVG++ AAK
Sbjct: 182 ASTTQGTMLGDDELQRLEEACDMALELNQSKHRIYEYVESRMSFIAPNLSIIVGASTAAK 241

Query: 255 LMGTAGGLVALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMR 314
           +MG AGGL  L+KMPACN+ LLGA+++ L+GFS+ +     GYI   ++ Q+ PP L+ +
Sbjct: 242 IMGVAGGLTNLSKMPACNLMLLGAQRRTLSGFSSTSLLPHTGYIYHCDVVQTLPPDLRRK 301

Query: 315 ACRLIAAKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEP 374
           A RL++AK TLA+RVDS      GK G   K+EI +K +KWQEPPP KQ KPLP P    
Sbjct: 302 AARLVSAKCTLASRVDSFHESADGKVGYDLKEEIERKFDKWQEPPPVKQVKPLPAPLDGQ 361

Query: 375 KKKRGGRRLRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVS 434
           +KKRGGRR RKMKER   TE+RK ANRM F   E+ +  + LG   G LG++GSG++R +
Sbjct: 362 RKKRGGRRYRKMKERLGLTEIRKHANRMTFAEIEDDAYQEDLGFSLGQLGKSGSGRVRQA 421

Query: 435 AAQSKLAAKVVKKF------KEKRYGSSGA----TSGLTSSLAFTPVQGIELSNPQAHLN 494
                  A++ K        +   YG        +SG +SS+AFTP+QG+E+ NPQA   
Sbjct: 422 QVNDSTKARISKSLQRTLQKQSMTYGGKSTVRDRSSGTSSSVAFTPLQGLEIVNPQAAEK 481

Query: 495 QLGGGTQSTYFSETGTFSKIRK 496
           ++    Q  YFS    F K+++
Sbjct: 482 KVAEANQK-YFSNMAEFLKVKR 502

BLAST of CmaCh04G017210 vs. TrEMBL
Match: A0A0A0KSD2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_5G602130 PE=4 SV=1)

HSP 1 Score: 872.1 bits (2252), Expect = 3.3e-250
Identity = 463/484 (95.66%), Positives = 471/484 (97.31%), Query Frame = 1

Query: 13  LATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 72
           +AT ADSFLADLDELSDED F  E  ADAENMEEDIDGDLADLESLNY+DLDSVSKLQKT
Sbjct: 1   MATFADSFLADLDELSDEDKFQGEAGADAENMEEDIDGDLADLESLNYEDLDSVSKLQKT 60

Query: 73  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 132
           QRYNDIMQKV DAL+ DSN+SNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK
Sbjct: 61  QRYNDIMQKVEDALQTDSNISNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 120

Query: 133 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 192
           YRLKFPELESLVHHPIDYARVVKKIGNE+DLTLVDLEGLLPSAVIMVVSVTASTTSGKPL
Sbjct: 121 YRLKFPELESLVHHPIDYARVVKKIGNEVDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 180

Query: 193 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 252
           PEEILQKTIDACDRAL LDSAKKMVL FVESRMG+IAPNLSAIVGSAVAAKLMGTAGGL 
Sbjct: 181 PEEILQKTIDACDRALALDSAKKMVLTFVESRMGHIAPNLSAIVGSAVAAKLMGTAGGLA 240

Query: 253 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 312
           ALAKMPACNVQLLGAK+KNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLI+AKS
Sbjct: 241 ALAKMPACNVQLLGAKRKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLISAKS 300

Query: 313 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 372
           TLAARVDST GDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL
Sbjct: 301 TLAARVDSTMGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 360

Query: 373 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 432
           RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK
Sbjct: 361 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 420

Query: 433 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 492
           VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLG GTQSTYFSETGTFSK
Sbjct: 421 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGSGTQSTYFSETGTFSK 480

Query: 493 IRKN 497
           IRKN
Sbjct: 481 IRKN 484

BLAST of CmaCh04G017210 vs. TrEMBL
Match: A0A067K434_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17722 PE=4 SV=1)

HSP 1 Score: 826.2 bits (2133), Expect = 2.1e-236
Identity = 434/484 (89.67%), Positives = 461/484 (95.25%), Query Frame = 1

Query: 13  LATLADSFLADLDELSDED-NFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQK 72
           +ATLADSFLADLDELSD D + ++E+D DA NMEED+DGD+AD+E+LNYDDLDSVSKLQK
Sbjct: 1   MATLADSFLADLDELSDNDADLVEEDDVDAGNMEEDVDGDMADIEALNYDDLDSVSKLQK 60

Query: 73  TQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRD 132
           TQRYNDIMQKV DALE  S++SNQG VLEDDPEYQLIVECNALSVDIENEIIIIHNFIRD
Sbjct: 61  TQRYNDIMQKVEDALEKGSDISNQGMVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRD 120

Query: 133 KYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKP 192
           KYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSA+IMVVSVTASTTSGKP
Sbjct: 121 KYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAIIMVVSVTASTTSGKP 180

Query: 193 LPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL 252
           LPEE+LQKTIDACDRAL LDSAKK VL+FVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL
Sbjct: 181 LPEEVLQKTIDACDRALALDSAKKKVLDFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL 240

Query: 253 VALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAK 312
            ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTE+FQ+TPP L+MRACRL+AAK
Sbjct: 241 SALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEVFQTTPPALRMRACRLLAAK 300

Query: 313 STLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRR 372
           STLAARVDSTRGDP+G+TGR  ++EI KKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRR
Sbjct: 301 STLAARVDSTRGDPSGRTGRALREEIHKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRR 360

Query: 373 LRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAA 432
           LRKMKERYA T+MRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVS  QSKLAA
Sbjct: 361 LRKMKERYAVTDMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSIGQSKLAA 420

Query: 433 KVVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFS 492
           KV KKFKEK YGSSGATSGLTSSLAFTPVQGIEL+NPQAH +QLG GTQSTYFSETGTFS
Sbjct: 421 KVAKKFKEKNYGSSGATSGLTSSLAFTPVQGIELTNPQAHAHQLGSGTQSTYFSETGTFS 480

Query: 493 KIRK 496
           KI++
Sbjct: 481 KIKR 484

BLAST of CmaCh04G017210 vs. TrEMBL
Match: M5W5L0_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004930mg PE=4 SV=1)

HSP 1 Score: 808.1 bits (2086), Expect = 5.8e-231
Identity = 426/484 (88.02%), Positives = 458/484 (94.63%), Query Frame = 1

Query: 13  LATLADSFLADLDELSD-EDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQK 72
           +ATLADSFLADLDELSD E + + E+DADA NMEEDIDGDLADLE+LNYDDLDSVSKLQK
Sbjct: 1   MATLADSFLADLDELSDNEADVIVEDDADAGNMEEDIDGDLADLETLNYDDLDSVSKLQK 60

Query: 73  TQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRD 132
           TQRY DIMQKV +ALE  S++S+ G VLEDDPEYQLIV+CNALSVDIENEI+IIHNFIRD
Sbjct: 61  TQRYTDIMQKVEEALEKGSDMSSHGIVLEDDPEYQLIVDCNALSVDIENEIVIIHNFIRD 120

Query: 133 KYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKP 192
           KYR KFPELESLVHHPIDYARVVKKIGNEMD+TLVDLEGLLPSA+IMVVSVTASTTSGKP
Sbjct: 121 KYRPKFPELESLVHHPIDYARVVKKIGNEMDVTLVDLEGLLPSAIIMVVSVTASTTSGKP 180

Query: 193 LPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL 252
           LPEE+L KT +ACDRAL LDS+KK VL+FVESRMG+IAPNLSAIVGSAVAAKLMGTAGGL
Sbjct: 181 LPEEVLTKTNEACDRALALDSSKKKVLDFVESRMGFIAPNLSAIVGSAVAAKLMGTAGGL 240

Query: 253 VALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAK 312
           V+LAKMPACNVQLLGAK+KNLAGFSTATSQFRVGY+EQTEIFQ+TPP L+MRACRL+AAK
Sbjct: 241 VSLAKMPACNVQLLGAKRKNLAGFSTATSQFRVGYVEQTEIFQTTPPSLRMRACRLLAAK 300

Query: 313 STLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRR 372
           STLAARVDSTRGDP+G TGR F++EI KKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRR
Sbjct: 301 STLAARVDSTRGDPSGNTGRAFREEIRKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRR 360

Query: 373 LRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAA 432
           LRKMKERYA T+MRKLANRMQFG+PEESSLGDGLGEGYGMLGQAGSGKLRVS  QSKLAA
Sbjct: 361 LRKMKERYAITDMRKLANRMQFGIPEESSLGDGLGEGYGMLGQAGSGKLRVSMGQSKLAA 420

Query: 433 KVVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFS 492
           KV KKFKEK YGSSGATSGLTSSLAFTPVQGIELSNPQAH +QLGGGTQSTYFSETGTFS
Sbjct: 421 KVAKKFKEKNYGSSGATSGLTSSLAFTPVQGIELSNPQAHAHQLGGGTQSTYFSETGTFS 480

Query: 493 KIRK 496
           KI++
Sbjct: 481 KIKR 484

BLAST of CmaCh04G017210 vs. TrEMBL
Match: A0A0S3RVE2_PHAAN (Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.04G197000 PE=4 SV=1)

HSP 1 Score: 804.3 bits (2076), Expect = 8.4e-230
Identity = 424/484 (87.60%), Positives = 454/484 (93.80%), Query Frame = 1

Query: 13  LATLADSFLADLDELSD-EDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQK 72
           +ATLADSFLADLDELSD E   L+  D DA +MEED+DGDLADLE+LNYDDLDSVSKLQK
Sbjct: 1   MATLADSFLADLDELSDNEAEILENNDVDAADMEEDVDGDLADLENLNYDDLDSVSKLQK 60

Query: 73  TQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRD 132
           TQRY DI+QKV +AL+M S+VS QG  LEDDPEYQLIV+CNALSVDIENEI+IIHNFIRD
Sbjct: 61  TQRYIDIIQKVEEALKMGSDVSTQGLDLEDDPEYQLIVDCNALSVDIENEIVIIHNFIRD 120

Query: 133 KYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKP 192
           KYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSA+IMVVSVTASTT+GKP
Sbjct: 121 KYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAIIMVVSVTASTTTGKP 180

Query: 193 LPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL 252
           LPEE+L KT++ACDRAL LDSAKK VL+FVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL
Sbjct: 181 LPEEVLSKTVEACDRALDLDSAKKKVLDFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL 240

Query: 253 VALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAK 312
            +LAKMPACNVQLLGAKKKNLAGFSTATSQFRVGY+EQTEIFQ+TPP L+MRACRL+AAK
Sbjct: 241 ASLAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYLEQTEIFQTTPPSLRMRACRLLAAK 300

Query: 313 STLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRR 372
           STL ARVDS +GDP+G TGR FKDEI KKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRR
Sbjct: 301 STLVARVDSIQGDPSGNTGRAFKDEIHKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRR 360

Query: 373 LRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAA 432
           LRKMKERYA T+MRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVS AQSKLAA
Sbjct: 361 LRKMKERYAITDMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSVAQSKLAA 420

Query: 433 KVVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFS 492
           KV K+FKEK YGSSGATSGLTSSLAFTPVQGIEL+NPQAH +QLG GTQSTYFSETGTFS
Sbjct: 421 KVAKRFKEKNYGSSGATSGLTSSLAFTPVQGIELTNPQAHAHQLGSGTQSTYFSETGTFS 480

Query: 493 KIRK 496
           KI++
Sbjct: 481 KIKR 484

BLAST of CmaCh04G017210 vs. TrEMBL
Match: A0A0L9U659_PHAAN (Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan03g152400 PE=4 SV=1)

HSP 1 Score: 803.9 bits (2075), Expect = 1.1e-229
Identity = 424/483 (87.78%), Positives = 453/483 (93.79%), Query Frame = 1

Query: 14  ATLADSFLADLDELSD-EDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 73
           ATLADSFLADLDELSD E   L+  D DA +MEED+DGDLADLE+LNYDDLDSVSKLQKT
Sbjct: 22  ATLADSFLADLDELSDNEAEILENNDVDAADMEEDVDGDLADLENLNYDDLDSVSKLQKT 81

Query: 74  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 133
           QRY DI+QKV +AL+M S+VS QG  LEDDPEYQLIV+CNALSVDIENEI+IIHNFIRDK
Sbjct: 82  QRYIDIIQKVEEALKMGSDVSTQGLDLEDDPEYQLIVDCNALSVDIENEIVIIHNFIRDK 141

Query: 134 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 193
           YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSA+IMVVSVTASTT+GKPL
Sbjct: 142 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAIIMVVSVTASTTTGKPL 201

Query: 194 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 253
           PEE+L KT++ACDRAL LDSAKK VL+FVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL 
Sbjct: 202 PEEVLSKTVEACDRALDLDSAKKKVLDFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLA 261

Query: 254 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 313
           +LAKMPACNVQLLGAKKKNLAGFSTATSQFRVGY+EQTEIFQ+TPP L+MRACRL+AAKS
Sbjct: 262 SLAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYLEQTEIFQTTPPSLRMRACRLLAAKS 321

Query: 314 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 373
           TL ARVDS +GDP+G TGR FKDEI KKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL
Sbjct: 322 TLVARVDSIQGDPSGNTGRAFKDEIHKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 381

Query: 374 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 433
           RKMKERYA T+MRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVS AQSKLAAK
Sbjct: 382 RKMKERYAITDMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSVAQSKLAAK 441

Query: 434 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 493
           V K+FKEK YGSSGATSGLTSSLAFTPVQGIEL+NPQAH +QLG GTQSTYFSETGTFSK
Sbjct: 442 VAKRFKEKNYGSSGATSGLTSSLAFTPVQGIELTNPQAHAHQLGSGTQSTYFSETGTFSK 501

Query: 494 IRK 496
           I++
Sbjct: 502 IKR 504

BLAST of CmaCh04G017210 vs. TAIR10
Match: AT1G60170.1 (AT1G60170.1 pre-mRNA processing ribonucleoprotein binding region-containing protein)

HSP 1 Score: 690.6 bits (1781), Expect = 6.8e-199
Identity = 371/485 (76.49%), Positives = 417/485 (85.98%), Query Frame = 1

Query: 13  LATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 72
           +ATL DSFLADLDELSD +  LDE D D    EED+D D+ADLE+LNYDDLD+VSKLQK+
Sbjct: 1   MATLEDSFLADLDELSDNEAELDENDGDVGKEEEDVDMDMADLETLNYDDLDNVSKLQKS 60

Query: 73  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 132
           QRY DIM KV +AL  DS+ + +G VLEDDPEY+LIV+CN LSVDIENEI+I+HNFI+DK
Sbjct: 61  QRYADIMHKVEEALGKDSDGAEKGTVLEDDPEYKLIVDCNQLSVDIENEIVIVHNFIKDK 120

Query: 133 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 192
           Y+LKF ELESLVHHPIDYA VVKKIGNE DL LVDL  LLPSA+IMVVSVTA TT G  L
Sbjct: 121 YKLKFQELESLVHHPIDYACVVKKIGNETDLALVDLADLLPSAIIMVVSVTALTTKGSAL 180

Query: 193 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 252
           PE++LQK ++ACDRAL LDSA+K VL FVES+MG IAPNLSAIVGSAVAAKLMGTAGGL 
Sbjct: 181 PEDVLQKVLEACDRALDLDSARKKVLEFVESKMGSIAPNLSAIVGSAVAAKLMGTAGGLS 240

Query: 253 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 312
           ALAKMPACNVQ+LG K+KNLAGFS+ATSQ RVGY+EQTEI+QSTPP L+ RA RL+AAKS
Sbjct: 241 ALAKMPACNVQVLGHKRKNLAGFSSATSQSRVGYLEQTEIYQSTPPGLQARAGRLVAAKS 300

Query: 313 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 372
           TLAARVD+TRGDP G +G+ F++EI KKIEKWQEPPPA+QPKPLPVPDSEPKK+RGGRRL
Sbjct: 301 TLAARVDATRGDPLGISGKAFREEIRKKIEKWQEPPPARQPKPLPVPDSEPKKRRGGRRL 360

Query: 373 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLA-- 432
           RKMKERY  T+MRKLANRM FG PEESSLGDGLGEGYGMLGQAGS +LRVS+  SKL   
Sbjct: 361 RKMKERYQVTDMRKLANRMAFGTPEESSLGDGLGEGYGMLGQAGSNRLRVSSVPSKLKIN 420

Query: 433 AKVVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTF 492
           AKV KK KE++Y     TSGLTSSLAFTPVQGIEL NPQ  L  LG GTQSTYFSE+GTF
Sbjct: 421 AKVAKKLKERQYAGGATTSGLTSSLAFTPVQGIELCNPQQALG-LGSGTQSTYFSESGTF 480

Query: 493 SKIRK 496
           SK++K
Sbjct: 481 SKLKK 484

BLAST of CmaCh04G017210 vs. TAIR10
Match: AT1G70400.3 (AT1G70400.3 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 203.8 bits (517), Expect = 2.5e-52
Identity = 113/165 (68.48%), Positives = 131/165 (79.39%), Query Frame = 1

Query: 51  DLADLESLNYDDLDSVSKLQKTQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVE 110
           D+ +L +L YDDLDSVSKLQK++RY DIMQ+V +ALE        G VLE     +LIV+
Sbjct: 2   DMTELNTLTYDDLDSVSKLQKSRRYADIMQQVEEALE--------GSVLEYK---KLIVD 61

Query: 111 CNALSVDIENEIIIIHNFIRDKYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEG 170
           C  L VDIENEI+I+ NFIRDKYR+KF ELE LV HPIDYARVVK+IGNEMDL LVDLEG
Sbjct: 62  CKQLLVDIENEIVIVQNFIRDKYRVKFQELELLVPHPIDYARVVKRIGNEMDLKLVDLEG 121

Query: 171 LLPSAVIMVVSVTASTTSGKPLPEEILQKTIDACDRALTLDSAKK 216
           LLPSA+IMV+ VTA TT G  LPE++L KTIDACDRAL LDSA+K
Sbjct: 122 LLPSAMIMVLLVTALTTKGNQLPEDVLLKTIDACDRALDLDSARK 155
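
The Identity and Positives counts reported in each HSP header can be recomputed from the middle line of the alignment: a letter marks an identical residue pair and a "+" marks a conservative substitution. The following is a minimal sketch, assuming the three alignment rows have already been stripped of their "Query:"/"Sbjct:" labels and coordinates and padded to equal length. The names hsp_stats and percent are illustrative only and are not part of any BLAST tool.

```python
def hsp_stats(query: str, midline: str, sbjct: str) -> tuple[int, int, int]:
    """Return (identities, positives, alignment_length) for one alignment segment."""
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows must have equal length")
    # A letter on the midline marks an identical residue pair.
    identities = sum(ch.isalpha() for ch in midline)
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


def percent(part: int, whole: int) -> float:
    """Percentage rounded to two decimals, as shown in the HSP headers."""
    return round(100.0 * part / whole, 2)


# Short closing segment of an alignment shown earlier in this section.
ident, pos, length = hsp_stats("SKIRK", "SK++K", "SKLKK")
print(ident, pos, length)   # 3 5 5

# Header values of the AT1G70400.3 HSP: 113/165 identities, 131/165 positives.
print(percent(113, 165))    # 68.48
print(percent(131, 165))    # 79.39
```

Summing the per-segment counts over all segments of an HSP should give the numerators in its header, provided the midline whitespace survives copying intact.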

BLAST of CmaCh04G017210 vs. TAIR10
Match: AT3G05060.1 (AT3G05060.1 NOP56-like pre RNA processing ribonucleoprotein)

HSP 1 Score: 102.1 bits (253), Expect = 1.0e-21
Identity = 70/245 (28.57%), Positives = 120/245 (48.98%), Query Frame = 1

Query: 85  ALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDKYRLKFPELESLV 144
           +L +  +++        D    +I++   L  D++ E+      +R+ Y   FPEL  ++
Sbjct: 138 SLGLSHSLARYKLKFSSDKVDTMIIQAIGLLDDLDKELNTYAMRVREWYGWHFPELAKII 197

Query: 145 HHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPLPEEILQKTIDAC 204
              I YA+ VK +GN ++   +D   +L   +   +   A  + G  + +  L    + C
Sbjct: 198 SDNILYAKSVKLMGNRVNAAKLDFSEILADEIEADLKDAAVISMGTEVSDLDLLHIRELC 257

Query: 205 DRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLVALAKMPACNVQL 264
           D+ L+L   +  + ++++SRM  IAPNL+A+VG  V A+L+   G L+ L+K P   VQ+
Sbjct: 258 DQVLSLSEYRAQLYDYLKSRMNTIAPNLTALVGELVGARLISHGGSLLNLSKQPGSTVQI 317

Query: 265 LGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKSTLAARVDSTRGD 324
           LGA+K       T  +  + G I    +     P  K +  R +AAK+ LA RVD+  GD
Sbjct: 318 LGAEKALFRALKTKHATPKYGLIFHASLVGQAAPKHKGKISRSLAAKTVLAIRVDAL-GD 377

Query: 325 PTGKT 330
               T
Sbjct: 378 SQDNT 381

BLAST of CmaCh04G017210 vs. TAIR10
Match: AT5G27120.1 (AT5G27120.1 NOP56-like pre RNA processing ribonucleoprotein)

HSP 1 Score: 99.4 bits (246), Expect = 6.7e-21
Identity = 80/291 (27.49%), Positives = 138/291 (47.42%), Query Frame = 1

Query: 85  ALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDKYRLKFPELESLV 144
           +L +  +++        D    +I++   L  D++ E+      +R+ +   FPEL  +V
Sbjct: 137 SLGLSHSLARYKLKFSSDKVDTMIIQAIGLLDDLDKELNTYAMRVREWFGWHFPELAKIV 196

Query: 145 HHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPLPEEILQKTIDAC 204
              I YA+ VK +GN ++   +D   +L   +   +   A  + G  + +  L    + C
Sbjct: 197 QDNILYAKAVKLMGNRINAAKLDFSEILADEIEAELKEAAVISMGTEVSDLDLLHIRELC 256

Query: 205 DRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLVALAKMPACNVQL 264
           D+ L+L   +  + ++++SRM  IAPNL+A+VG  V A+L+   G L+ LAK P   VQ+
Sbjct: 257 DQVLSLAEYRAQLYDYLKSRMNTIAPNLTALVGELVGARLISHGGSLLNLAKQPGSTVQI 316

Query: 265 LGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKSTLAARVDS---T 324
           LGA+K       T  +  + G I    +     P  K +  R +AAKS LA R D+   +
Sbjct: 317 LGAEKALFRALKTKHATPKYGLIFHASVVGQAAPKNKGKISRSLAAKSVLAIRCDALGDS 376

Query: 325 RGDPTGKTGRVFKDEILKKIE---KWQEPPPAKQPKPLPVPDSEPKKKRGG 370
           + +  G   R+  +  L+ +E     +    AK    + V D + KK  GG
Sbjct: 377 QDNTMGVENRLKLEARLRTLEGKDLGRLSGSAKGKPKIEVYDKDKKKGSGG 427

BLAST of CmaCh04G017210 vs. TAIR10
Match: AT5G27140.1 (AT5G27140.1 NOP56-like pre RNA processing ribonucleoprotein)

HSP 1 Score: 91.3 bits (225), Expect = 1.8e-18
Identity = 75/281 (26.69%), Positives = 137/281 (48.75%), Query Frame = 1

Query: 107 LIVECNALSVDIENEIIIIHNFIRDKYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLV 166
           +I+   +L  D++ E+      + + Y L FPEL ++V   I YA+VVK +GN ++   +
Sbjct: 129 MIILSISLLDDLDKELNTYTTSVCELYGLHFPELANIVQDNILYAKVVKLMGNRINAATL 188

Query: 167 DLEGLLPSAVIMVVSVTASTTSGKPLPEEILQKTIDACDRALTLDSAKKMVLNFVESRMG 226
           D   +L   V   +   +  ++   + +  L    + CD+ L++   K ++ + ++++M 
Sbjct: 189 DFSEILADEVEAELKEASMVSTRTEVSDLDLMHIQELCDQVLSIAEDKTLLCDDLKNKMN 248

Query: 227 YIAPNLSAIVGSAVAAKLMGTAGGLVALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGY 286
            IAPNL+A+VG  V A+L+   G L  L+K+P   +Q+LGA+K       T  +  + G 
Sbjct: 249 KIAPNLTALVGELVGARLISHCGSLWNLSKLPWSTIQILGAEKTLYKALKTKQATPKYGL 308

Query: 287 IEQTEIFQSTPPPLKMRACRLIAAKSTLAARVD---STRGDPTGKTGRVFKDEILKKIE- 346
           I    + +   P  K +  R +AAKS LA R D   + + +  G   R+  +  L+ +E 
Sbjct: 309 IYHAPLVRQAAPENKGKIARSLAAKSALAIRCDAFGNGQDNTMGVESRLKLEARLRNLEG 368

Query: 347 ------KWQEPPPAKQPKPLPVPDSEPKKKRGGRRLRKMKE 378
                 + +E    K  K     + EPK +   ++ +K  E
Sbjct: 369 GDLGACEEEEEVNDKDTKKEADDEEEPKTEECSKKRKKEAE 409

BLAST of CmaCh04G017210 vs. NCBI nr
Match: gi|659090826|ref|XP_008446223.1| (PREDICTED: U4/U6 small nuclear ribonucleoprotein Prp31 isoform X1 [Cucumis melo])

HSP 1 Score: 876.3 bits (2263), Expect = 2.5e-251
Identity = 466/484 (96.28%), Positives = 473/484 (97.73%), Query Frame = 1

Query: 13  LATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 72
           +ATLADSFLADLDELSDED F  E  ADAENMEEDIDGDLADLESLNY+DLDSVSKLQKT
Sbjct: 1   MATLADSFLADLDELSDEDKFQGEAGADAENMEEDIDGDLADLESLNYEDLDSVSKLQKT 60

Query: 73  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 132
           QRYNDIMQKV DAL+ DSN+SNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK
Sbjct: 61  QRYNDIMQKVEDALQTDSNISNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 120

Query: 133 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 192
           YRLKFPELESLVHHPIDYARVVKKIGNE+DLTLVDLEGLLPSAVIMVVSVTASTTSGKPL
Sbjct: 121 YRLKFPELESLVHHPIDYARVVKKIGNEVDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 180

Query: 193 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 252
           PEEILQKTIDACDRAL LDSAKKMVL FVESRMG+IAPNLSAIVGSAVAAKLMGTAGGLV
Sbjct: 181 PEEILQKTIDACDRALALDSAKKMVLTFVESRMGHIAPNLSAIVGSAVAAKLMGTAGGLV 240

Query: 253 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 312
           ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLI+AKS
Sbjct: 241 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLISAKS 300

Query: 313 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 372
           TLAARVDST GDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL
Sbjct: 301 TLAARVDSTMGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 360

Query: 373 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 432
           RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK
Sbjct: 361 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 420

Query: 433 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 492
           VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLG GTQSTYFSETGTFSK
Sbjct: 421 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGSGTQSTYFSETGTFSK 480

Query: 493 IRKN 497
           IRKN
Sbjct: 481 IRKN 484

BLAST of CmaCh04G017210 vs. NCBI nr
Match: gi|778705094|ref|XP_011655634.1| (PREDICTED: U4/U6 small nuclear ribonucleoprotein Prp31 isoform X1 [Cucumis sativus])

HSP 1 Score: 872.1 bits (2252), Expect = 4.7e-250
Identity = 463/484 (95.66%), Positives = 471/484 (97.31%), Query Frame = 1

Query: 13  LATLADSFLADLDELSDEDNFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQKT 72
           +AT ADSFLADLDELSDED F  E  ADAENMEEDIDGDLADLESLNY+DLDSVSKLQKT
Sbjct: 1   MATFADSFLADLDELSDEDKFQGEAGADAENMEEDIDGDLADLESLNYEDLDSVSKLQKT 60

Query: 73  QRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 132
           QRYNDIMQKV DAL+ DSN+SNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK
Sbjct: 61  QRYNDIMQKVEDALQTDSNISNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRDK 120

Query: 133 YRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 192
           YRLKFPELESLVHHPIDYARVVKKIGNE+DLTLVDLEGLLPSAVIMVVSVTASTTSGKPL
Sbjct: 121 YRLKFPELESLVHHPIDYARVVKKIGNEVDLTLVDLEGLLPSAVIMVVSVTASTTSGKPL 180

Query: 193 PEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGLV 252
           PEEILQKTIDACDRAL LDSAKKMVL FVESRMG+IAPNLSAIVGSAVAAKLMGTAGGL 
Sbjct: 181 PEEILQKTIDACDRALALDSAKKMVLTFVESRMGHIAPNLSAIVGSAVAAKLMGTAGGLA 240

Query: 253 ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAKS 312
           ALAKMPACNVQLLGAK+KNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLI+AKS
Sbjct: 241 ALAKMPACNVQLLGAKRKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLISAKS 300

Query: 313 TLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 372
           TLAARVDST GDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL
Sbjct: 301 TLAARVDSTMGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRRL 360

Query: 373 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 432
           RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK
Sbjct: 361 RKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAAK 420

Query: 433 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFSK 492
           VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLG GTQSTYFSETGTFSK
Sbjct: 421 VVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGSGTQSTYFSETGTFSK 480

Query: 493 IRKN 497
           IRKN
Sbjct: 481 IRKN 484

BLAST of CmaCh04G017210 vs. NCBI nr
Match: gi|659090828|ref|XP_008446224.1| (PREDICTED: U4/U6 small nuclear ribonucleoprotein Prp31 isoform X2 [Cucumis melo])

HSP 1 Score: 836.3 bits (2159), Expect = 2.9e-239
Identity = 441/453 (97.35%), Positives = 447/453 (98.68%), Query Frame = 1

Query: 44  MEEDIDGDLADLESLNYDDLDSVSKLQKTQRYNDIMQKVADALEMDSNVSNQGFVLEDDP 103
           MEEDIDGDLADLESLNY+DLDSVSKLQKTQRYNDIMQKV DAL+ DSN+SNQGFVLEDDP
Sbjct: 1   MEEDIDGDLADLESLNYEDLDSVSKLQKTQRYNDIMQKVEDALQTDSNISNQGFVLEDDP 60

Query: 104 EYQLIVECNALSVDIENEIIIIHNFIRDKYRLKFPELESLVHHPIDYARVVKKIGNEMDL 163
           EYQLIVECNALSVDIENEIIIIHNFIRDKYRLKFPELESLVHHPIDYARVVKKIGNE+DL
Sbjct: 61  EYQLIVECNALSVDIENEIIIIHNFIRDKYRLKFPELESLVHHPIDYARVVKKIGNEVDL 120

Query: 164 TLVDLEGLLPSAVIMVVSVTASTTSGKPLPEEILQKTIDACDRALTLDSAKKMVLNFVES 223
           TLVDLEGLLPSAVIMVVSVTASTTSGKPLPEEILQKTIDACDRAL LDSAKKMVL FVES
Sbjct: 121 TLVDLEGLLPSAVIMVVSVTASTTSGKPLPEEILQKTIDACDRALALDSAKKMVLTFVES 180

Query: 224 RMGYIAPNLSAIVGSAVAAKLMGTAGGLVALAKMPACNVQLLGAKKKNLAGFSTATSQFR 283
           RMG+IAPNLSAIVGSAVAAKLMGTAGGLVALAKMPACNVQLLGAKKKNLAGFSTATSQFR
Sbjct: 181 RMGHIAPNLSAIVGSAVAAKLMGTAGGLVALAKMPACNVQLLGAKKKNLAGFSTATSQFR 240

Query: 284 VGYIEQTEIFQSTPPPLKMRACRLIAAKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEK 343
           VGYIEQTEIFQSTPPPLKMRACRLI+AKSTLAARVDST GDPTGKTGRVFKDEILKKIEK
Sbjct: 241 VGYIEQTEIFQSTPPPLKMRACRLISAKSTLAARVDSTMGDPTGKTGRVFKDEILKKIEK 300

Query: 344 WQEPPPAKQPKPLPVPDSEPKKKRGGRRLRKMKERYATTEMRKLANRMQFGVPEESSLGD 403
           WQEPPPAKQPKPLPVPDSEPKKKRGGRRLRKMKERYATTEMRKLANRMQFGVPEESSLGD
Sbjct: 301 WQEPPPAKQPKPLPVPDSEPKKKRGGRRLRKMKERYATTEMRKLANRMQFGVPEESSLGD 360

Query: 404 GLGEGYGMLGQAGSGKLRVSAAQSKLAAKVVKKFKEKRYGSSGATSGLTSSLAFTPVQGI 463
           GLGEGYGMLGQAGSGKLRVSAAQSKLAAKVVKKFKEKRYGSSGATSGLTSSLAFTPVQGI
Sbjct: 361 GLGEGYGMLGQAGSGKLRVSAAQSKLAAKVVKKFKEKRYGSSGATSGLTSSLAFTPVQGI 420

Query: 464 ELSNPQAHLNQLGGGTQSTYFSETGTFSKIRKN 497
           ELSNPQAHLNQLG GTQSTYFSETGTFSKIRKN
Sbjct: 421 ELSNPQAHLNQLGSGTQSTYFSETGTFSKIRKN 453

BLAST of CmaCh04G017210 vs. NCBI nr
Match: gi|778705097|ref|XP_011655635.1| (PREDICTED: U4/U6 small nuclear ribonucleoprotein Prp31 isoform X2 [Cucumis sativus])

HSP 1 Score: 833.6 bits (2152), Expect = 1.9e-238
Identity = 439/453 (96.91%), Positives = 446/453 (98.45%), Query Frame = 1

Query: 44  MEEDIDGDLADLESLNYDDLDSVSKLQKTQRYNDIMQKVADALEMDSNVSNQGFVLEDDP 103
           MEEDIDGDLADLESLNY+DLDSVSKLQKTQRYNDIMQKV DAL+ DSN+SNQGFVLEDDP
Sbjct: 1   MEEDIDGDLADLESLNYEDLDSVSKLQKTQRYNDIMQKVEDALQTDSNISNQGFVLEDDP 60

Query: 104 EYQLIVECNALSVDIENEIIIIHNFIRDKYRLKFPELESLVHHPIDYARVVKKIGNEMDL 163
           EYQLIVECNALSVDIENEIIIIHNFIRDKYRLKFPELESLVHHPIDYARVVKKIGNE+DL
Sbjct: 61  EYQLIVECNALSVDIENEIIIIHNFIRDKYRLKFPELESLVHHPIDYARVVKKIGNEVDL 120

Query: 164 TLVDLEGLLPSAVIMVVSVTASTTSGKPLPEEILQKTIDACDRALTLDSAKKMVLNFVES 223
           TLVDLEGLLPSAVIMVVSVTASTTSGKPLPEEILQKTIDACDRAL LDSAKKMVL FVES
Sbjct: 121 TLVDLEGLLPSAVIMVVSVTASTTSGKPLPEEILQKTIDACDRALALDSAKKMVLTFVES 180

Query: 224 RMGYIAPNLSAIVGSAVAAKLMGTAGGLVALAKMPACNVQLLGAKKKNLAGFSTATSQFR 283
           RMG+IAPNLSAIVGSAVAAKLMGTAGGL ALAKMPACNVQLLGAK+KNLAGFSTATSQFR
Sbjct: 181 RMGHIAPNLSAIVGSAVAAKLMGTAGGLAALAKMPACNVQLLGAKRKNLAGFSTATSQFR 240

Query: 284 VGYIEQTEIFQSTPPPLKMRACRLIAAKSTLAARVDSTRGDPTGKTGRVFKDEILKKIEK 343
           VGYIEQTEIFQSTPPPLKMRACRLI+AKSTLAARVDST GDPTGKTGRVFKDEILKKIEK
Sbjct: 241 VGYIEQTEIFQSTPPPLKMRACRLISAKSTLAARVDSTMGDPTGKTGRVFKDEILKKIEK 300

Query: 344 WQEPPPAKQPKPLPVPDSEPKKKRGGRRLRKMKERYATTEMRKLANRMQFGVPEESSLGD 403
           WQEPPPAKQPKPLPVPDSEPKKKRGGRRLRKMKERYATTEMRKLANRMQFGVPEESSLGD
Sbjct: 301 WQEPPPAKQPKPLPVPDSEPKKKRGGRRLRKMKERYATTEMRKLANRMQFGVPEESSLGD 360

Query: 404 GLGEGYGMLGQAGSGKLRVSAAQSKLAAKVVKKFKEKRYGSSGATSGLTSSLAFTPVQGI 463
           GLGEGYGMLGQAGSGKLRVSAAQSKLAAKVVKKFKEKRYGSSGATSGLTSSLAFTPVQGI
Sbjct: 361 GLGEGYGMLGQAGSGKLRVSAAQSKLAAKVVKKFKEKRYGSSGATSGLTSSLAFTPVQGI 420

Query: 464 ELSNPQAHLNQLGGGTQSTYFSETGTFSKIRKN 497
           ELSNPQAHLNQLG GTQSTYFSETGTFSKIRKN
Sbjct: 421 ELSNPQAHLNQLGSGTQSTYFSETGTFSKIRKN 453

BLAST of CmaCh04G017210 vs. NCBI nr
Match: gi|802718753|ref|XP_012085347.1| (PREDICTED: U4/U6 small nuclear ribonucleoprotein Prp31 [Jatropha curcas])

HSP 1 Score: 826.2 bits (2133), Expect = 3.0e-236
Identity = 434/484 (89.67%), Positives = 461/484 (95.25%), Query Frame = 1

Query: 13  LATLADSFLADLDELSDED-NFLDEEDADAENMEEDIDGDLADLESLNYDDLDSVSKLQK 72
           +ATLADSFLADLDELSD D + ++E+D DA NMEED+DGD+AD+E+LNYDDLDSVSKLQK
Sbjct: 1   MATLADSFLADLDELSDNDADLVEEDDVDAGNMEEDVDGDMADIEALNYDDLDSVSKLQK 60

Query: 73  TQRYNDIMQKVADALEMDSNVSNQGFVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRD 132
           TQRYNDIMQKV DALE  S++SNQG VLEDDPEYQLIVECNALSVDIENEIIIIHNFIRD
Sbjct: 61  TQRYNDIMQKVEDALEKGSDISNQGMVLEDDPEYQLIVECNALSVDIENEIIIIHNFIRD 120

Query: 133 KYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAVIMVVSVTASTTSGKP 192
           KYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSA+IMVVSVTASTTSGKP
Sbjct: 121 KYRLKFPELESLVHHPIDYARVVKKIGNEMDLTLVDLEGLLPSAIIMVVSVTASTTSGKP 180

Query: 193 LPEEILQKTIDACDRALTLDSAKKMVLNFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL 252
           LPEE+LQKTIDACDRAL LDSAKK VL+FVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL
Sbjct: 181 LPEEVLQKTIDACDRALALDSAKKKVLDFVESRMGYIAPNLSAIVGSAVAAKLMGTAGGL 240

Query: 253 VALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEIFQSTPPPLKMRACRLIAAK 312
            ALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTE+FQ+TPP L+MRACRL+AAK
Sbjct: 241 SALAKMPACNVQLLGAKKKNLAGFSTATSQFRVGYIEQTEVFQTTPPALRMRACRLLAAK 300

Query: 313 STLAARVDSTRGDPTGKTGRVFKDEILKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRR 372
           STLAARVDSTRGDP+G+TGR  ++EI KKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRR
Sbjct: 301 STLAARVDSTRGDPSGRTGRALREEIHKKIEKWQEPPPAKQPKPLPVPDSEPKKKRGGRR 360

Query: 373 LRKMKERYATTEMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSAAQSKLAA 432
           LRKMKERYA T+MRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVS  QSKLAA
Sbjct: 361 LRKMKERYAVTDMRKLANRMQFGVPEESSLGDGLGEGYGMLGQAGSGKLRVSIGQSKLAA 420

Query: 433 KVVKKFKEKRYGSSGATSGLTSSLAFTPVQGIELSNPQAHLNQLGGGTQSTYFSETGTFS 492
           KV KKFKEK YGSSGATSGLTSSLAFTPVQGIEL+NPQAH +QLG GTQSTYFSETGTFS
Sbjct: 421 KVAKKFKEKNYGSSGATSGLTSSLAFTPVQGIELTNPQAHAHQLGSGTQSTYFSETGTFS 480

Query: 493 KIRK 496
           KI++
Sbjct: 481 KIKR 484

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
PRP31_MOUSE  5.4e-105  46.98         U4/U6 small nuclear ribonucleoprotein Prp31 OS=Mus musculus GN=Prpf31 PE=1 SV=3
PRP31_XENLA  6.0e-104  46.67         U4/U6 small nuclear ribonucleoprotein Prp31 OS=Xenopus laevis GN=prpf31 PE=2 SV=...
PRP31_HUMAN  7.8e-104  46.57         U4/U6 small nuclear ribonucleoprotein Prp31 OS=Homo sapiens GN=PRPF31 PE=1 SV=2
PRP31_XENTR  3.9e-103  46.46         U4/U6 small nuclear ribonucleoprotein Prp31 OS=Xenopus tropicalis GN=prpf31 PE=2...
PRP31_DANRE  2.1e-101  44.62         U4/U6 small nuclear ribonucleoprotein Prp31 OS=Danio rerio GN=prpf31 PE=2 SV=1

Match Name        E-value   Identity (%)  Description
A0A0A0KSD2_CUCSA  3.3e-250  95.66         Uncharacterized protein OS=Cucumis sativus GN=Csa_5G602130 PE=4 SV=1
A0A067K434_JATCU  2.1e-236  89.67         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_17722 PE=4 SV=1
M5W5L0_PRUPE      5.8e-231  88.02         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa004930mg PE=4 SV=1
A0A0S3RVE2_PHAAN  8.4e-230  87.60         Uncharacterized protein OS=Vigna angularis var. angularis GN=Vigan.04G197000 PE=...
A0A0L9U659_PHAAN  1.1e-229  87.78         Uncharacterized protein OS=Phaseolus angularis GN=LR48_Vigan03g152400 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT1G60170.1  6.8e-199  76.49         pre-mRNA processing ribonucleoprotein binding region-containing prot...
AT1G70400.3  2.5e-52   68.48         FUNCTIONS IN: molecular_function unknown
AT3G05060.1  1.0e-21   28.57         NOP56-like pre RNA processing ribonucleoprotein
AT5G27120.1  6.7e-21   27.49         NOP56-like pre RNA processing ribonucleoprotein
AT5G27140.1  1.8e-18   26.69         NOP56-like pre RNA processing ribonucleoprotein

Match Name                        E-value   Identity (%)  Description
gi|659090826|ref|XP_008446223.1|  2.5e-251  96.28         PREDICTED: U4/U6 small nuclear ribonucleoprotein Prp31 isoform X1 [Cucumis melo]
gi|778705094|ref|XP_011655634.1|  4.7e-250  95.66         PREDICTED: U4/U6 small nuclear ribonucleoprotein Prp31 isoform X1 [Cucumis sativ...
gi|659090828|ref|XP_008446224.1|  2.9e-239  97.35         PREDICTED: U4/U6 small nuclear ribonucleoprotein Prp31 isoform X2 [Cucumis melo]
gi|778705097|ref|XP_011655635.1|  1.9e-238  96.91         PREDICTED: U4/U6 small nuclear ribonucleoprotein Prp31 isoform X2 [Cucumis sativ...
gi|802718753|ref|XP_012085347.1|  3.0e-236  89.67         PREDICTED: U4/U6 small nuclear ribonucleoprotein Prp31 [Jatropha curcas]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR002687  Nop_dom
IPR012976  NOSIC
IPR019175  Prp31_C
IPR027105  Prp31
Vocabulary: Biological Process
Term        Definition
GO:0000398  mRNA splicing, via spliceosome
GO:0000244  spliceosomal tri-snRNP complex assembly
Vocabulary: Cellular Component
Term        Definition
GO:0046540  U4/U6 x U5 tri-snRNP complex
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006346 methylation-dependent chromatin silencing
biological_process GO:0009409 response to cold
biological_process GO:0009845 seed germination
biological_process GO:0000244 spliceosomal tri-snRNP complex assembly
biological_process GO:0000398 mRNA splicing, via spliceosome
cellular_component GO:0046540 U4/U6 x U5 tri-snRNP complex
cellular_component GO:0019013 viral nucleocapsid
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh04G017210.1  CmaCh04G017210.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term   IPR Description                              Source   Source Term  Source Description              Alignment
IPR002687  Nop domain                                   PFAM     PF01798      Nop                             coord: 115..343; score: 1.8
IPR002687  Nop domain                                   PROFILE  PS51358      NOP                             coord: 228..346; score: 37
IPR002687  Nop domain                                   unknown  SSF89124     Nop domain                      coord: 100..346; score: 7.06
IPR012976  NOSIC                                        SMART    SM00931      NOSIC_2                         coord: 107..159; score: 6.4
IPR019175  Prp31 C-terminal                             PFAM     PF09785      Prp31_C                         coord: 351..468; score: 1.1
IPR027105  U4/U6 small nuclear ribonucleoprotein Prp31  PANTHER  PTHR13904    PRE-MRNA SPLICING FACTOR PRP31  coord: 14..496; score: 4.4E
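
The "coord: start..end" fields above can be turned into numeric ranges to compare where the annotated domains sit on the protein. The sketch below assumes the coordinates are 1-based, inclusive amino-acid positions, which this page does not state explicitly. The helper names parse_coord, span_length and overlap are hypothetical.

```python
import re

# Matches fields such as "coord: 115..343".
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text: str) -> tuple[int, int]:
    """Extract (start, end) from a 'coord: start..end' field."""
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coord field in {text!r}")
    return int(match.group(1)), int(match.group(2))


def span_length(span: tuple[int, int]) -> int:
    """Number of residues covered, assuming inclusive coordinates."""
    return span[1] - span[0] + 1


def overlap(a: tuple[int, int], b: tuple[int, int]) -> int:
    """Number of residues shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


nop_pfam = parse_coord("coord: 115..343")     # PF01798, Nop
nop_profile = parse_coord("coord: 228..346")  # PS51358, NOP
prp31_c = parse_coord("coord: 351..468")      # PF09785, Prp31_C

print(span_length(nop_pfam))           # 229
print(overlap(nop_pfam, nop_profile))  # 116
print(overlap(nop_pfam, prp31_c))      # 0
```

Under the inclusive-coordinate assumption, the two Nop domain hits overlap substantially, while the Prp31 C-terminal hit lies entirely downstream of them.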

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                 Block
CmaCh04G017210  CmaCh18G008060  Cucurbita maxima (Rimu)  cmacmaB404
The following block(s) are covering this gene:
Gene            Organism                   Block
CmaCh04G017210  Cucurbita maxima (Rimu)    cmacmaB189
CmaCh04G017210  Cucurbita maxima (Rimu)    cmacmaB540
CmaCh04G017210  Cucurbita moschata (Rifu)  cmacmoB710
CmaCh04G017210  Cucurbita moschata (Rifu)  cmacmoB747
CmaCh04G017210  Cucurbita pepo (Zucchini)  cmacpeB704
CmaCh04G017210  Silver-seed gourd          carcmaB0706
CmaCh04G017210  Wax gourd                  cmawgoB0891