Moc08g31960 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc08g31960
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionU4/U6 small nuclear ribonucleoprotein PRP4-like protein
Locationchr8: 23106024 .. 23110297 (-)
RNA-Seq ExpressionMoc08g31960
SyntenyMoc08g31960
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGTTGATGATCAAAACCCTGCATCAACTTCTGCAGAATCTCCCGAAACTCTTCCAGATGGTGAAAATGTGGTGGACCTTGACAGCCCATCAGAACCGATCCAACCTGCAGCTTCATCAGTTATTCCACCTCCAATTGTGCCTGCCATTGCTCCCATTCCTCCTCCTCCAGTTATCCGTCCATTGGCTCCTCTGCCAAGCCGACCTCCCCTCTTCAGACCCCCTGTAGCACAAAATGGTGAGTTGAGAACAAGTGACTCAGACTCGGAACATGATGATTTAGCCCCTTCGCGAGCAGCTCCAGGTTCAGCTGCAGATTATGAAGTGTCGGAAGAGAGCAGACAGGTGAGGGAGCGTCAGGAAAAGGCGATGCAGGAATTTCTTATGAAGCGTCGTGCTTCTGCTCTAGCAGTGCCTACTAATGACATGGCCGTTCGGGCTCGTCTTCGTCGTCTTGGTGAACCCATAACTCTTTTCGGGGAAAGAGAAATGGAAAGACGGGACAGGTTGCGGTCAATAATGGCGAGATTGGATGCTGAAGGGCAATTAGAGAAGCTGCTGAAAGCTCATGAGGAAGAGGAGGCTGCAGCTACTGCTGGAACTGAGGATGCTGAGGAGGAGGTGCTTCAGTATCCCTTTTATACTGAGGGGTCCAAAGCTCTCTTGAATACCAGACTTGATATTGCAAAGTATTCAATTGTGAGAGCAGCTTCGCGTCTTGAGCGTGCAAAGAGGAAAAGGGATGACCCAGATGAAGATGTGGAAGCAGAAATGGATTGGGCACTGAGACAGGCTGGAAGTTTGGTCCTTGACTGCAGTGAGATAGGAGATGATCGACCACTTTCTGGTTGTTCTTTCTCGTCTGATGGAAAATTCCTTGCCACTAGGTACCCTTCTAATCATTGCAGGTTAAAGATTTTTAGTTACTGATACCAGTTTGTCCTTGTTAGTATCATCGACCCTCAAACCTCCTGATAATAAAAGGGAAAAAAAAAGGAAAAGAAGACAAAAGAAAAAGGAAAAATAATTTGTCAGTTGAGAAACAAGATGCCAAAGATACATATAAGTGAAGGTTTAACCATTTTCCATCTCTAATACTTGATAGGCTTTGTAGTTTGTGCATAGTTTTACTGGTTTTGTTTGTGATGATTTTAGTATTTTTCTGCTTGTTTAATTTCTTTAGTTCATTAAGTGGAGTTGCAAAATTGTGGAGCATGCCTCAAGTAAGGAAGGTTTCCAACTTTAAGGGACACACGGAGCGTGTTACCGATGTAATGTTTTCTCCTGTGAATGAGTGTTTAGCAACTGCCTCTGCTGACCGAACTGCAAGGTTGTGGTCTGCTGAAGGATCTCTCCTTAGAACATTCGAGGGTCATCTGGATCGCCTTGCCCGAATTGCCTTCCATCCATCTGGGAAGTACTTGGGCACAACTAGCTTTGACAAGACTTGGAGATTATGGGACGTGGAAACTGGTGTAGAGTTGCTTCTTCAAGAAGGTCATAGTAGGAGTGTCTATGGGATAGCCTTTCACCATGATGGATCCTTAGTATCATCTTGTGGACTTGATGCACTTGCTCGTATTTGGGACCTTCGAACTGGCAGAAGTATTCTTGCGTTGGAGGGCCATGTCAAGCCAGTAAATATCTTTCCTTTATTTCGTTGTGATGGTAGAACATTTATTTTTAATAATACTGATTAGTTTGCTCCTTACTGATGCCTTGCTTTTCAATGGATTTCTTATGTCTATGATTTTCCAGGTTCTTGGAGTCAGCTTTTCACCCAATGGTTATCATTTAGCTACTGGTGGTGAAGATAACACTTGTAGAATATGGGACTTAAGGAAGAAAAAATCTCTATACATAATACCTGCACACTCGAACCTTGTTTCACAAGTGAAATATGAGCCTCAAGAGGGATACTTCTTGGTAACTGCATCATTTGATATGACAGCAAAGGTAATCTAATTGCTTGAACACCGTCACCATGAGAGTAATATCGTTCAAGTCTTGACATTTCAATTTGATTGTTGCTTCATCCCACTTTTAGTATAATGTTTCATTTTCTATCATATACTAAATTGTTTTTCCAGATTTGGTCGGCTCGAGATTTTAAGCCAGTGAAGACACTCTCTGGACACGAAGCAAAAGTTACATCTTTGGACATCATTGCAGGTTGGTTATTTACTGATTCCTAGGTCATTATCATTCACAATTTTTGAATCCTTCTAGGCTTCTATGGTTTTTTCCACAAATTACCTCTATTTAGCATGAACTTGCGTTGATAAACACCACTAACATCTTATGCTCTAATGCTTAAATCACTGGAGATAATCTAACGGGTGTGGGGAGTGGTAGGTAAAATCATAGAAATTGTTGCTTATGCTATACTCATATCTTTGCTTAGTCATGAAGTTCTTGCTATAATTTCTGATAGATTATTTTCCATGGGAAAGTGTCTTTAGAAGTTTCAACTTTGCTACTGTGGAGCTGGTTCTTGTAGCTCTGTTGCAAAGAATGGATGGCATAGACAGTAGTTACATAGCACCATATGCTTTTTATAAAAATGTTTTAATTTTTCAATGGCAATAATTTATTGGTATGATGAACTGATGTGCTTGAAAGCACATGCTAAGTGTTAGGGATGATAATCTGTTGGGTTCGATATCATTTGCCTAGTCTGCAATCTTCTACAAGATAGGGCATTGCTGGGATTTCCCGACTGCAAAATCTGCAGACCTATCTAAGAAAACCCATTGCTCCTGGATGGAATTTCCGGGCCACATGTCTCCAGAGCAGAACCTGAGAAAATCTGAACTGAAGTCTCAAGAACCCAAAAGAACTAATCTTATATAAGTCTAAATTTGTCTAAGAAATTAACAGCTACTTCCTGAGAAAAGTCTTGACGACTACGAGATCAAGAACAATTGAAATTACGTAGTTTTGGGCAGGGTTGGGAAAAGAGAGTTGCTAGAATAGTTGAATGATTTTAGTAGTTTTTGACCTTGTATTACAAATATAAAGGGTCAAACTTTTTGAAGAGAAAATTTGGAAGTTTGGATGTTAGACTGCTTGTGGTGTTCATTTTGGGTGTAGAAAACAAAATAAAATTTGAATGTATACTTTCAAGTCATGACAAATTGGTTAGCTTTTAAGAACCGTTTGACACAGCCGAAAATTTGTTTTTAGAAATTGTTTTTGAGGTTGTCAATGGGTAGCTTGTAAATGTTAAGCTTATTATAAACTATATTATGAGAATTTTGGCTCCATATTACAAGCTTTTTCTTATGAACTTTATGGTTGAAAAGTAACAACCACATGTTCAACGATAAATTTGCCTACTTCGACAGTTTTTTGGATTCTGTTTATGTTCTTGCTTTATCATGGTGTAAATGCACTCTTGCAAATTATTGCTTAAATGATCTTTTAGCTATTTAGGGGGCTTTTTTGTAACATCCATTAGCATGGTGTTGGCTTTCTTCTGCCCTTTATATATATATAAATATTTTATCACATCAATGTTATGTGATTTTTGACTGAATACTAACTTCAATTGTTTATAAGAATAGGCCACCCATTATAACCATCATGGTATGACCTAGTGGTCAATACGGGCCAATGAAAATAGCGAAGGATTTAGAGGGAATGAGTTCAAACAATGGTGGCCACCTACCTAAAGATTTAATATCCTACGAGTTATCTTGACAACCAAATGTAGTAGGTCAAAGCGAACCCTTTCATTCCGAATTAGAAGTCTGATGGTTATCTCGTGAGATTAGTCAAGGTGTGTGCAAGTTGGTCTGAACACTCACGGTTATCAAATATAAAAAAAAGCCACCCGGGAGAACAAATTAAGGAAAAATAAAGTCTTTAGGAAATTTGAAGGACACATCGAAACTTTCTTGGAGAACAGTAGAATGTTAGAGGGTGTCTGAGTGGAATTCTCTTGGCTTTATCTCAATCATATAAGCGTCCAGTCAGTTTTCTTTGGTTTTAAGGTGAATGGTCACCGATGCATCATTAACAGTTAGCTCTGAGTAAGTTTATATATGCATATGGCCTTTGTATGCTGATGTTAAAATACTTTTTGGAATTTGCTTCTCTATTATCTAAACATTCTTATGGATTCGAAAAGTTGTCATATTCTGCTGATATTCAAATATTGTTTTTGTTCGATGATAGACGGGCAGGGTATCGCAACAGTCTCACACGATCGGACCATAAAGCTCTGGTCTGTTAATAATAAAGACGAACAGACCATGGATGTCGATTGA

mRNA sequence

ATGGAAGTTGATGATCAAAACCCTGCATCAACTTCTGCAGAATCTCCCGAAACTCTTCCAGATGGTGAAAATGTGGTGGACCTTGACAGCCCATCAGAACCGATCCAACCTGCAGCTTCATCAGTTATTCCACCTCCAATTGTGCCTGCCATTGCTCCCATTCCTCCTCCTCCAGTTATCCGTCCATTGGCTCCTCTGCCAAGCCGACCTCCCCTCTTCAGACCCCCTGTAGCACAAAATGGTGAGTTGAGAACAAGTGACTCAGACTCGGAACATGATGATTTAGCCCCTTCGCGAGCAGCTCCAGGTTCAGCTGCAGATTATGAAGTGTCGGAAGAGAGCAGACAGGTGAGGGAGCGTCAGGAAAAGGCGATGCAGGAATTTCTTATGAAGCGTCGTGCTTCTGCTCTAGCAGTGCCTACTAATGACATGGCCGTTCGGGCTCGTCTTCGTCGTCTTGGTGAACCCATAACTCTTTTCGGGGAAAGAGAAATGGAAAGACGGGACAGGTTGCGGTCAATAATGGCGAGATTGGATGCTGAAGGGCAATTAGAGAAGCTGCTGAAAGCTCATGAGGAAGAGGAGGCTGCAGCTACTGCTGGAACTGAGGATGCTGAGGAGGAGGTGCTTCAGTATCCCTTTTATACTGAGGGGTCCAAAGCTCTCTTGAATACCAGACTTGATATTGCAAAGTATTCAATTGTGAGAGCAGCTTCGCGTCTTGAGCGTGCAAAGAGGAAAAGGGATGACCCAGATGAAGATGTGGAAGCAGAAATGGATTGGGCACTGAGACAGGCTGGAAGTTTGGTCCTTGACTGCAGTGAGATAGGAGATGATCGACCACTTTCTGGTTGTTCTTTCTCGTCTGATGGAAAATTCCTTGCCACTAGTTCATTAAGTGGAGTTGCAAAATTGTGGAGCATGCCTCAAGTAAGGAAGGTTTCCAACTTTAAGGGACACACGGAGCGTGTTACCGATGTAATGTTTTCTCCTGTGAATGAGTGTTTAGCAACTGCCTCTGCTGACCGAACTGCAAGGTTGTGGTCTGCTGAAGGATCTCTCCTTAGAACATTCGAGGGTCATCTGGATCGCCTTGCCCGAATTGCCTTCCATCCATCTGGGAAGTACTTGGGCACAACTAGCTTTGACAAGACTTGGAGATTATGGGACGTGGAAACTGGTGTAGAGTTGCTTCTTCAAGAAGGTCATAGTAGGAGTGTCTATGGGATAGCCTTTCACCATGATGGATCCTTAGTATCATCTTGTGGACTTGATGCACTTGCTCGTATTTGGGACCTTCGAACTGGCAGAAGTATTCTTGCGTTGGAGGGCCATGTCAAGCCAGTTCTTGGAGTCAGCTTTTCACCCAATGGTTATCATTTAGCTACTGGTGGTGAAGATAACACTTGTAGAATATGGGACTTAAGGAAGAAAAAATCTCTATACATAATACCTGCACACTCGAACCTTGTTTCACAAGTGAAATATGAGCCTCAAGAGGGATACTTCTTGGTAACTGCATCATTTGATATGACAGCAAAGATTTGGTCGGCTCGAGATTTTAAGCCAGTGAAGACACTCTCTGGACACGAAGCAAAAGTTACATCTTTGGACATCATTGCAGACGGGCAGGGTATCGCAACAGTCTCACACGATCGGACCATAAAGCTCTGGTCTGTTAATAATAAAGACGAACAGACCATGGATGTCGATTGA

Coding sequence (CDS)

ATGGAAGTTGATGATCAAAACCCTGCATCAACTTCTGCAGAATCTCCCGAAACTCTTCCAGATGGTGAAAATGTGGTGGACCTTGACAGCCCATCAGAACCGATCCAACCTGCAGCTTCATCAGTTATTCCACCTCCAATTGTGCCTGCCATTGCTCCCATTCCTCCTCCTCCAGTTATCCGTCCATTGGCTCCTCTGCCAAGCCGACCTCCCCTCTTCAGACCCCCTGTAGCACAAAATGGTGAGTTGAGAACAAGTGACTCAGACTCGGAACATGATGATTTAGCCCCTTCGCGAGCAGCTCCAGGTTCAGCTGCAGATTATGAAGTGTCGGAAGAGAGCAGACAGGTGAGGGAGCGTCAGGAAAAGGCGATGCAGGAATTTCTTATGAAGCGTCGTGCTTCTGCTCTAGCAGTGCCTACTAATGACATGGCCGTTCGGGCTCGTCTTCGTCGTCTTGGTGAACCCATAACTCTTTTCGGGGAAAGAGAAATGGAAAGACGGGACAGGTTGCGGTCAATAATGGCGAGATTGGATGCTGAAGGGCAATTAGAGAAGCTGCTGAAAGCTCATGAGGAAGAGGAGGCTGCAGCTACTGCTGGAACTGAGGATGCTGAGGAGGAGGTGCTTCAGTATCCCTTTTATACTGAGGGGTCCAAAGCTCTCTTGAATACCAGACTTGATATTGCAAAGTATTCAATTGTGAGAGCAGCTTCGCGTCTTGAGCGTGCAAAGAGGAAAAGGGATGACCCAGATGAAGATGTGGAAGCAGAAATGGATTGGGCACTGAGACAGGCTGGAAGTTTGGTCCTTGACTGCAGTGAGATAGGAGATGATCGACCACTTTCTGGTTGTTCTTTCTCGTCTGATGGAAAATTCCTTGCCACTAGTTCATTAAGTGGAGTTGCAAAATTGTGGAGCATGCCTCAAGTAAGGAAGGTTTCCAACTTTAAGGGACACACGGAGCGTGTTACCGATGTAATGTTTTCTCCTGTGAATGAGTGTTTAGCAACTGCCTCTGCTGACCGAACTGCAAGGTTGTGGTCTGCTGAAGGATCTCTCCTTAGAACATTCGAGGGTCATCTGGATCGCCTTGCCCGAATTGCCTTCCATCCATCTGGGAAGTACTTGGGCACAACTAGCTTTGACAAGACTTGGAGATTATGGGACGTGGAAACTGGTGTAGAGTTGCTTCTTCAAGAAGGTCATAGTAGGAGTGTCTATGGGATAGCCTTTCACCATGATGGATCCTTAGTATCATCTTGTGGACTTGATGCACTTGCTCGTATTTGGGACCTTCGAACTGGCAGAAGTATTCTTGCGTTGGAGGGCCATGTCAAGCCAGTTCTTGGAGTCAGCTTTTCACCCAATGGTTATCATTTAGCTACTGGTGGTGAAGATAACACTTGTAGAATATGGGACTTAAGGAAGAAAAAATCTCTATACATAATACCTGCACACTCGAACCTTGTTTCACAAGTGAAATATGAGCCTCAAGAGGGATACTTCTTGGTAACTGCATCATTTGATATGACAGCAAAGATTTGGTCGGCTCGAGATTTTAAGCCAGTGAAGACACTCTCTGGACACGAAGCAAAAGTTACATCTTTGGACATCATTGCAGACGGGCAGGGTATCGCAACAGTCTCACACGATCGGACCATAAAGCTCTGGTCTGTTAATAATAAAGACGAACAGACCATGGATGTCGATTGA

Protein sequence

MEVDDQNPASTSAESPETLPDGENVVDLDSPSEPIQPAASSVIPPPIVPAIAPIPPPPVIRPLAPLPSRPPLFRPPVAQNGELRTSDSDSEHDDLAPSRAAPGSAADYEVSEESRQVRERQEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDAEGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLDIAKYSIVRAASRLERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLRTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIIADGQGIATVSHDRTIKLWSVNNKDEQTMDVD
Homology
BLAST of Moc08g31960 vs. NCBI nr
Match: XP_022151320.1 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Momordica charantia])

HSP 1 Score: 1110.1 bits (2870), Expect = 0.0e+00
Identity = 571/571 (100.00%), Postives = 571/571 (100.00%), Query Frame = 0

Query: 1   MEVDDQNPASTSAESPETLPDGENVVDLDSPSEPIQPAASSVIPPPIVPAIAPIPPPPVI 60
           MEVDDQNPASTSAESPETLPDGENVVDLDSPSEPIQPAASSVIPPPIVPAIAPIPPPPVI
Sbjct: 1   MEVDDQNPASTSAESPETLPDGENVVDLDSPSEPIQPAASSVIPPPIVPAIAPIPPPPVI 60

Query: 61  RPLAPLPSRPPLFRPPVAQNGELRTSDSDSEHDDLAPSRAAPGSAADYEVSEESRQVRER 120
           RPLAPLPSRPPLFRPPVAQNGELRTSDSDSEHDDLAPSRAAPGSAADYEVSEESRQVRER
Sbjct: 61  RPLAPLPSRPPLFRPPVAQNGELRTSDSDSEHDDLAPSRAAPGSAADYEVSEESRQVRER 120

Query: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180
           QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA
Sbjct: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180

Query: 181 EGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLDIAKYSIVRAASR 240
           EGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLDIAKYSIVRAASR
Sbjct: 181 EGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLDIAKYSIVRAASR 240

Query: 241 LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300
           LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS
Sbjct: 241 LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300

Query: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLRTFEG 360
           GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLRTFEG
Sbjct: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLRTFEG 360

Query: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS 420
           HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS
Sbjct: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS 420

Query: 421 SCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480
           SCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS
Sbjct: 421 SCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480

Query: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540
           LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII
Sbjct: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540

Query: 541 ADGQGIATVSHDRTIKLWSVNNKDEQTMDVD 572
           ADGQGIATVSHDRTIKLWSVNNKDEQTMDVD
Sbjct: 541 ADGQGIATVSHDRTIKLWSVNNKDEQTMDVD 571

BLAST of Moc08g31960 vs. NCBI nr
Match: XP_038885479.1 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Benincasa hispida] >XP_038885480.1 U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Benincasa hispida])

HSP 1 Score: 1046.6 bits (2705), Expect = 7.9e-302
Identity = 535/571 (93.70%), Postives = 557/571 (97.55%), Query Frame = 0

Query: 1   MEVDDQNPASTSAESPETLPDGENVVDLDSPSEPIQPAASSVIPPPIVPAIAPIPPPPVI 60
           MEVDDQNPAST+AESPETLP GEN  DLD+P+EPIQPAA+SVIPP +VP+IAPI PPP+I
Sbjct: 1   MEVDDQNPASTAAESPETLPGGEN-EDLDNPAEPIQPAATSVIPPSVVPSIAPI-PPPII 60

Query: 61  RPLAPLPSRPPLFRPPVAQNGELRTSDSDSEHDDLAPSRAAPGSAADYEVSEESRQVRER 120
           RPLAPLPSRPP FRPPV QNGE+RTSDSDSEHD+LAPSR A GS A+YEVSEESRQVRER
Sbjct: 61  RPLAPLPSRPPHFRPPVTQNGEMRTSDSDSEHDELAPSRTAGGSTAEYEVSEESRQVRER 120

Query: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180
           QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA
Sbjct: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180

Query: 181 EGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLDIAKYSIVRAASR 240
           EGQLEKL+K HEEEEAAAT GTE+AEEEVLQYPFYTEGSKALL+ R+DIAKYSI+RAASR
Sbjct: 181 EGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRAASR 240

Query: 241 LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300
           LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS
Sbjct: 241 LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300

Query: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLRTFEG 360
           GVAKLWSMPQVRKVSNFKGHTERVTDV+FSPVNECLATASADRTARLWSAEGSLL+TFEG
Sbjct: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVIFSPVNECLATASADRTARLWSAEGSLLKTFEG 360

Query: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS 420
           HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS
Sbjct: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS 420

Query: 421 SCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480
           SCGLDALAR+WDLRTGRS+LALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS
Sbjct: 421 SCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480

Query: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540
           LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII
Sbjct: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540

Query: 541 ADGQGIATVSHDRTIKLWSVNNKDEQTMDVD 572
           +DGQ IATVSHDRTIKLWSVN+KDEQTMD+D
Sbjct: 541 SDGQCIATVSHDRTIKLWSVNSKDEQTMDID 569

BLAST of Moc08g31960 vs. NCBI nr
Match: XP_004146749.1 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis sativus] >XP_031742738.1 U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis sativus] >KGN47802.1 hypothetical protein Csa_003570 [Cucumis sativus])

HSP 1 Score: 1042.0 bits (2693), Expect = 1.9e-300
Identity = 533/571 (93.35%), Postives = 554/571 (97.02%), Query Frame = 0

Query: 1   MEVDDQNPASTSAESPETLPDGENVVDLDSPSEPIQPAASSVIPPPIVPAIAPIPPPPVI 60
           ME+DDQNPAST+AESPETLP GEN  +LD+P+EP QPAA+SVIPP IVPAIAPI PPP+I
Sbjct: 1   MEIDDQNPASTAAESPETLPGGEN-EELDNPAEPTQPAATSVIPPSIVPAIAPI-PPPII 60

Query: 61  RPLAPLPSRPPLFRPPVAQNGELRTSDSDSEHDDLAPSRAAPGSAADYEVSEESRQVRER 120
           RPLAPLPSRPPLFRPPV QNGELRTSDSDSEHD+LAPSR APGS A+YE+SEESRQ RER
Sbjct: 61  RPLAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARER 120

Query: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180
            EKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA
Sbjct: 121 HEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180

Query: 181 EGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLDIAKYSIVRAASR 240
           EGQLEKL+K HEEEEAAAT GTE+AEEEVLQYPFYTEGSKALL+ R+DIAKYSI+RA+SR
Sbjct: 181 EGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSR 240

Query: 241 LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300
           LERAKRKRDDPDEDVEAEMDWALRQA SLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS
Sbjct: 241 LERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300

Query: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLRTFEG 360
           GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLL+TFEG
Sbjct: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEG 360

Query: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS 420
           HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS
Sbjct: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS 420

Query: 421 SCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480
           SCGLDALAR+WDLRTGRS+LALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS
Sbjct: 421 SCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480

Query: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540
           LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII
Sbjct: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540

Query: 541 ADGQGIATVSHDRTIKLWSVNNKDEQTMDVD 572
           +DGQ IATVSHDRTIKLWSVN+KD QTMDVD
Sbjct: 541 SDGQCIATVSHDRTIKLWSVNSKDIQTMDVD 569

BLAST of Moc08g31960 vs. NCBI nr
Match: XP_008464716.1 (PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis melo] >XP_016903247.1 PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis melo] >XP_016903248.1 PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis melo] >XP_016903249.1 PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis melo])

HSP 1 Score: 1036.9 bits (2680), Expect = 6.3e-299
Identity = 529/571 (92.64%), Postives = 553/571 (96.85%), Query Frame = 0

Query: 1   MEVDDQNPASTSAESPETLPDGENVVDLDSPSEPIQPAASSVIPPPIVPAIAPIPPPPVI 60
           ME+DDQNPAST+AESPETLP GEN  +LD+P+EP QPAA+SVIPP IVPAIAPI PPP+I
Sbjct: 1   MEIDDQNPASTAAESPETLPGGEN-EELDNPAEPTQPAATSVIPPSIVPAIAPI-PPPII 60

Query: 61  RPLAPLPSRPPLFRPPVAQNGELRTSDSDSEHDDLAPSRAAPGSAADYEVSEESRQVRER 120
           RPLAPLPSRPPLFRPPV QNGELRTSDSDSEHD+LAP+R APGS A+YE+SEESRQ RER
Sbjct: 61  RPLAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPARTAPGSTAEYEISEESRQARER 120

Query: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180
            EKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA
Sbjct: 121 HEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180

Query: 181 EGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLDIAKYSIVRAASR 240
           EGQLEKL+K HEEEEAAAT GTE+AEEEVLQYPFYTEGSKALL+ R+DIAKYSI+RA+SR
Sbjct: 181 EGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSR 240

Query: 241 LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300
           LERAKRKRDDPDEDVEAEMDWALRQA SLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS
Sbjct: 241 LERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300

Query: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLRTFEG 360
           GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWS EGSLL+TFEG
Sbjct: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSPEGSLLKTFEG 360

Query: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS 420
           HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFH DGSLVS
Sbjct: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHQDGSLVS 420

Query: 421 SCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480
           SCGLDALAR+WDLRTGRS+LALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS
Sbjct: 421 SCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480

Query: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540
           LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII
Sbjct: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540

Query: 541 ADGQGIATVSHDRTIKLWSVNNKDEQTMDVD 572
           +DGQ IATVSHDRTIKLWSVN+KD+QTMD+D
Sbjct: 541 SDGQCIATVSHDRTIKLWSVNSKDKQTMDID 569

BLAST of Moc08g31960 vs. NCBI nr
Match: XP_022999661.1 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucurbita maxima])

HSP 1 Score: 1032.7 bits (2669), Expect = 1.2e-297
Identity = 527/571 (92.29%), Postives = 552/571 (96.67%), Query Frame = 0

Query: 1   MEVDDQNPASTSAESPETLPDGENVVDLDSPSEPIQPAASSVIPPPIVPAIAPIPPPPVI 60
           M+VDDQNPAST+AESPE LP GEN  DLD+P+EP+QPAA++VIP  IVP+IAPIPPP + 
Sbjct: 1   MDVDDQNPASTAAESPEILPGGEN-EDLDNPAEPMQPAATTVIPSSIVPSIAPIPPPLIT 60

Query: 61  RPLAPLPSRPPLFRPPVAQNGELRTSDSDSEHDDLAPSRAAPGSAADYEVSEESRQVRER 120
           RPLAPLPSRP LFRPPVAQNGE+RTSDSDSEHD+LAPSRA  GS A+YEVSEESRQVRER
Sbjct: 61  RPLAPLPSRPLLFRPPVAQNGEMRTSDSDSEHDELAPSRATQGSTAEYEVSEESRQVRER 120

Query: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180
           QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA
Sbjct: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180

Query: 181 EGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLDIAKYSIVRAASR 240
           EGQLEKLLK HEEEEAAAT GTE+AEEEVLQYPFYTEG KALL+ R+DIAKYSIVRAASR
Sbjct: 181 EGQLEKLLKVHEEEEAAATGGTEEAEEEVLQYPFYTEGPKALLDARIDIAKYSIVRAASR 240

Query: 241 LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300
           LERAKRKRDDPDEDVEAEMDWALRQA SL+LDCSEIGDDRPLSGCSFSSDGKFLATSSLS
Sbjct: 241 LERAKRKRDDPDEDVEAEMDWALRQAESLILDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300

Query: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLRTFEG 360
           GVAK+WSMPQVRKVSNFKGHTERVTDV+FSPVNECLATASADRTARLWSAEGSLLRTFEG
Sbjct: 301 GVAKMWSMPQVRKVSNFKGHTERVTDVIFSPVNECLATASADRTARLWSAEGSLLRTFEG 360

Query: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS 420
           HLDRLARIAFHPSGKYLGTTSFDKTWRLWD+ETGVELLLQEGHSRSVYGI FHHDGSLVS
Sbjct: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDIETGVELLLQEGHSRSVYGIDFHHDGSLVS 420

Query: 421 SCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480
           SCGLDALAR+WDLRTGRS+LALEGHVKPVLGV+FSPNGYHLATGGEDNTCRIWDLRKKKS
Sbjct: 421 SCGLDALARVWDLRTGRSVLALEGHVKPVLGVNFSPNGYHLATGGEDNTCRIWDLRKKKS 480

Query: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540
           LYIIPAHSNL+SQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII
Sbjct: 481 LYIIPAHSNLISQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540

Query: 541 ADGQGIATVSHDRTIKLWSVNNKDEQTMDVD 572
           +DGQ IATVSHDRTIKLWSVN+KDEQTMDVD
Sbjct: 541 SDGQCIATVSHDRTIKLWSVNSKDEQTMDVD 570

BLAST of Moc08g31960 vs. ExPASy Swiss-Prot
Match: O22212 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Arabidopsis thaliana OX=3702 GN=LIS PE=2 SV=1)

HSP 1 Score: 706.8 bits (1823), Expect = 1.9e-202
Identity = 361/549 (65.76%), Postives = 429/549 (78.14%), Query Frame = 0

Query: 29  DSPSEPIQPAASSVIPPPIVPAIAPIPPPPVIRPLAPLPSRPPLFRPPVAQNGELRTSDS 88
           D+ S P   A   V+PP   P +APIP  P      P  +RPP FRPPV+QNG ++TSDS
Sbjct: 25  DASSLPGFSAIPPVVPPSFPPPMAPIPMMP-----HPPVARPPTFRPPVSQNGGVKTSDS 84

Query: 89  DSEHDDLAPSRAAPGSAADYEVSEESRQVRERQEKAMQEFLMKRRASALAVPTNDMAVRA 148
           DSE DD              E+SEES+QVRERQEKA+Q+ L+KRRA+A+AVPTND AVR 
Sbjct: 85  DSESDD-----------EHIEISEESKQVRERQEKALQDLLVKRRAAAMAVPTNDKAVRD 144

Query: 149 RLRRLGEPITLFGEREMERRDRLRSIMARLDAEGQLEKLLKAHEEEEAAATAGTEDAEEE 208
           RLRRLGEPITLFGE+EMERR RL  ++ R D  GQL+KL+K HEE+        E+ ++E
Sbjct: 145 RLRRLGEPITLFGEQEMERRARLTQLLTRYDINGQLDKLVKDHEED----VTPKEEVDDE 204

Query: 209 VLQYPFYTEGSKALLNTRLDIAKYSIVRAASRLERAKRKRDDPDEDVEAEMDWALRQAGS 268
           VL+YPF+TEG K L   R++IAK+S+ RAA R++RAKR+RDDPDED++AE  WAL+ A  
Sbjct: 205 VLEYPFFTEGPKELREARIEIAKFSVKRAAVRIQRAKRRRDDPDEDMDAETKWALKHAKH 264

Query: 269 LVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGVAKLWSMPQV-RKVSNFKGHTERVTDV 328
           + LDCS  GDDRPL+GCSFS DGK LAT SLSGV KLW MPQV   ++  K H ER TDV
Sbjct: 265 MALDCSNFGDDRPLTGCSFSRDGKILATCSLSGVTKLWEMPQVTNTIAVLKDHKERATDV 324

Query: 329 MFSPVNECLATASADRTARLWSAEGSLLRTFEGHLDRLARIAFHPSGKYLGTTSFDKTWR 388
           +FSPV++CLATASADRTA+LW  +G+LL+TFEGHLDRLAR+AFHPSGKYLGTTS+DKTWR
Sbjct: 325 VFSPVDDCLATASADRTAKLWKTDGTLLQTFEGHLDRLARVAFHPSGKYLGTTSYDKTWR 384

Query: 389 LWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSCGLDALARIWDLRTGRSILALEGHVK 448
           LWD+ TG ELLLQEGHSRSVYGIAF  DG+L +SCGLD+LAR+WDLRTGRSIL  +GH+K
Sbjct: 385 LWDINTGAELLLQEGHSRSVYGIAFQQDGALAASCGLDSLARVWDLRTGRSILVFQGHIK 444

Query: 449 PVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTA 508
           PV  V+FSPNGYHLA+GGEDN CRIWDLR +KSLYIIPAH+NLVSQVKYEPQEGYFL TA
Sbjct: 445 PVFSVNFSPNGYHLASGGEDNQCRIWDLRMRKSLYIIPAHANLVSQVKYEPQEGYFLATA 504

Query: 509 SFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIIADGQGIATVSHDRTIKLWSVNNKDE-- 568
           S+DM   IWS RDF  VK+L+GHE+KV SLDI AD   IATVSHDRTIKLW+ +  D+  
Sbjct: 505 SYDMKVNIWSGRDFSLVKSLAGHESKVASLDITADSSCIATVSHDRTIKLWTSSGNDDED 553

Query: 569 ---QTMDVD 572
              +TMD+D
Sbjct: 565 EEKETMDID 553

BLAST of Moc08g31960 vs. ExPASy Swiss-Prot
Match: Q3MHE2 (U4/U6 small nuclear ribonucleoprotein Prp4 OS=Bos taurus OX=9913 GN=PRPF4 PE=2 SV=1)

HSP 1 Score: 416.4 bits (1069), Expect = 5.3e-115
Identity = 217/459 (47.28%), Postives = 298/459 (64.92%), Query Frame = 0

Query: 109 EVSEESRQVRERQEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERR 168
           EV E    + ERQ + + EF  ++RA  + V T+D  V+A LR LGEPITLFGE   ERR
Sbjct: 70  EVFEIEEHISERQAEVLAEFERRKRARQINVSTDDSEVKACLRALGEPITLFGEGPAERR 129

Query: 169 DRLRSIMARLDAEGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLD 228
           +RLR+I++ +  +  L+K  K  E+ + +         +E  Q  +Y EG  +L   RL 
Sbjct: 130 ERLRNILSVVGTDA-LKKTKKDDEKSKKS---------KEEYQQTWYHEGPHSLKVARLW 189

Query: 229 IAKYSIVRAASRLERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFS 288
           IA YS+ RA  RLE A+  ++ P+    ++M    +   SL   CS+IGDDRP+S C FS
Sbjct: 190 IANYSLPRAMKRLEEARLHKEIPETTRTSQMQELHKSLRSLNNFCSQIGDDRPISYCHFS 249

Query: 289 SDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNEC--------LATAS 348
            + K LAT+  SG+ KLWS+P    +   +GH   V  ++F P +          LA+ +
Sbjct: 250 PNSKMLATACWSGLCKLWSVPDCNLLHTLRGHNTNVGAIVFHPKSTVSLDQKDVNLASCA 309

Query: 349 ADRTARLWSAEG-SLLRTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLL 408
           AD + +LWS +    +   EGH  R+AR+ +HPSG++LGTT +D++WRLWD+E   E+L 
Sbjct: 310 ADGSVKLWSLDSDEPVADIEGHTVRVARVTWHPSGRFLGTTCYDRSWRLWDLEAQEEILH 369

Query: 409 QEGHSRSVYGIAFHHDGSLVSSCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGY 468
           QEGHS  VY IAFH DGSL  + GLDA  R+WDLRTGR I+ LEGH+K + G++FSPNGY
Sbjct: 370 QEGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWDLRTGRCIMFLEGHLKEIYGINFSPNGY 429

Query: 469 HLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSAR 528
           H+ATG  DNTC++WDLR+++ +Y IPAH NLV+ VK+EP  G FL+T ++D TAKIW+  
Sbjct: 430 HIATGSGDNTCKVWDLRQRRCVYTIPAHQNLVTGVKFEPIHGNFLLTGAYDNTAKIWTHP 489

Query: 529 DFKPVKTLSGHEAKVTSLDIIADGQGIATVSHDRTIKLW 559
            + P+KTL+GHE KV  LDI +DGQ IAT S+DRT KLW
Sbjct: 490 GWSPLKTLAGHEGKVMGLDISSDGQLIATCSYDRTFKLW 518

BLAST of Moc08g31960 vs. ExPASy Swiss-Prot
Match: O43172 (U4/U6 small nuclear ribonucleoprotein Prp4 OS=Homo sapiens OX=9606 GN=PRPF4 PE=1 SV=2)

HSP 1 Score: 416.4 bits (1069), Expect = 5.3e-115
Identity = 217/459 (47.28%), Postives = 298/459 (64.92%), Query Frame = 0

Query: 109 EVSEESRQVRERQEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERR 168
           EV E    + ERQ + + EF  ++RA  + V T+D  V+A LR LGEPITLFGE   ERR
Sbjct: 71  EVFEIEEHISERQAEVLAEFERRKRARQINVSTDDSEVKACLRALGEPITLFGEGPAERR 130

Query: 169 DRLRSIMARLDAEGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLD 228
           +RLR+I++ +  +  L+K  K  E+ + +         +E  Q  +Y EG  +L   RL 
Sbjct: 131 ERLRNILSVVGTDA-LKKTKKDDEKSKKS---------KEEYQQTWYHEGPNSLKVARLW 190

Query: 229 IAKYSIVRAASRLERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFS 288
           IA YS+ RA  RLE A+  ++ P+    ++M    +   SL   CS+IGDDRP+S C FS
Sbjct: 191 IANYSLPRAMKRLEEARLHKEIPETTRTSQMQELHKSLRSLNNFCSQIGDDRPISYCHFS 250

Query: 289 SDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNEC--------LATAS 348
            + K LAT+  SG+ KLWS+P    +   +GH   V  ++F P +          LA+ +
Sbjct: 251 PNSKMLATACWSGLCKLWSVPDCNLLHTLRGHNTNVGAIVFHPKSTVSLDPKDVNLASCA 310

Query: 349 ADRTARLWSAEG-SLLRTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLL 408
           AD + +LWS +    +   EGH  R+AR+ +HPSG++LGTT +D++WRLWD+E   E+L 
Sbjct: 311 ADGSVKLWSLDSDEPVADIEGHTVRVARVMWHPSGRFLGTTCYDRSWRLWDLEAQEEILH 370

Query: 409 QEGHSRSVYGIAFHHDGSLVSSCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGY 468
           QEGHS  VY IAFH DGSL  + GLDA  R+WDLRTGR I+ LEGH+K + G++FSPNGY
Sbjct: 371 QEGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWDLRTGRCIMFLEGHLKEIYGINFSPNGY 430

Query: 469 HLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSAR 528
           H+ATG  DNTC++WDLR+++ +Y IPAH NLV+ VK+EP  G FL+T ++D TAKIW+  
Sbjct: 431 HIATGSGDNTCKVWDLRQRRCVYTIPAHQNLVTGVKFEPIHGNFLLTGAYDNTAKIWTHP 490

Query: 529 DFKPVKTLSGHEAKVTSLDIIADGQGIATVSHDRTIKLW 559
            + P+KTL+GHE KV  LDI +DGQ IAT S+DRT KLW
Sbjct: 491 GWSPLKTLAGHEGKVMGLDISSDGQLIATCSYDRTFKLW 519

BLAST of Moc08g31960 vs. ExPASy Swiss-Prot
Match: Q5NVD0 (U4/U6 small nuclear ribonucleoprotein Prp4 OS=Pongo abelii OX=9601 GN=PRPF4 PE=2 SV=1)

HSP 1 Score: 416.4 bits (1069), Expect = 5.3e-115
Identity = 217/459 (47.28%), Postives = 298/459 (64.92%), Query Frame = 0

Query: 109 EVSEESRQVRERQEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERR 168
           EV E    + ERQ + + EF  ++RA  + V T+D  V+A LR LGEPITLFGE   ERR
Sbjct: 70  EVFEIEEHISERQAEVLAEFERRKRARQINVSTDDSEVKACLRALGEPITLFGEGPAERR 129

Query: 169 DRLRSIMARLDAEGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLD 228
           +RLR+I++ +  +  L+K  K  E+ + +         +E  Q  +Y EG  +L   RL 
Sbjct: 130 ERLRNILSVVGTDA-LKKTKKDDEKSKKS---------KEEYQQTWYHEGPNSLKVARLW 189

Query: 229 IAKYSIVRAASRLERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFS 288
           IA YS+ RA  RLE A+  ++ P+    ++M    +   SL   CS+IGDDRP+S C FS
Sbjct: 190 IANYSLPRAMKRLEEARLHKEIPETTRASQMQELHKSLRSLNNFCSQIGDDRPISYCHFS 249

Query: 289 SDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNEC--------LATAS 348
            + K LAT+  SG+ KLWS+P    +   +GH   V  ++F P +          LA+ +
Sbjct: 250 PNSKMLATACWSGLCKLWSVPDCNLLHTLRGHNTNVGAIVFHPKSTVSLDQKDVNLASCA 309

Query: 349 ADRTARLWSAEG-SLLRTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLL 408
           AD + +LWS +    +   EGH  R+AR+ +HPSG++LGTT +D++WRLWD+E   E+L 
Sbjct: 310 ADGSVKLWSLDSDEPVADIEGHTVRVARVMWHPSGRFLGTTCYDRSWRLWDLEAQEEILH 369

Query: 409 QEGHSRSVYGIAFHHDGSLVSSCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGY 468
           QEGHS  VY IAFH DGSL  + GLDA  R+WDLRTGR I+ LEGH+K + G++FSPNGY
Sbjct: 370 QEGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWDLRTGRCIMFLEGHLKEIYGINFSPNGY 429

Query: 469 HLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSAR 528
           H+ATG  DNTC++WDLR+++ +Y IPAH NLV+ VK+EP  G FL+T ++D TAKIW+  
Sbjct: 430 HIATGSGDNTCKVWDLRQRRCVYTIPAHQNLVTGVKFEPIHGNFLLTGAYDNTAKIWTHP 489

Query: 529 DFKPVKTLSGHEAKVTSLDIIADGQGIATVSHDRTIKLW 559
            + P+KTL+GHE KV  LDI +DGQ IAT S+DRT KLW
Sbjct: 490 GWSPLKTLAGHEGKVMGLDISSDGQLIATCSYDRTFKLW 518

BLAST of Moc08g31960 vs. ExPASy Swiss-Prot
Match: Q9DAW6 (U4/U6 small nuclear ribonucleoprotein Prp4 OS=Mus musculus OX=10090 GN=Prpf4 PE=1 SV=1)

HSP 1 Score: 415.6 bits (1067), Expect = 9.0e-115
Identity = 217/459 (47.28%), Postives = 298/459 (64.92%), Query Frame = 0

Query: 109 EVSEESRQVRERQEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERR 168
           EV E    + ERQ + + EF  ++RA  + V T+D  V+A LR LGEPITLFGE   ERR
Sbjct: 70  EVFEIEEHISERQAEVLAEFERRKRARQINVSTDDSEVKACLRALGEPITLFGEGPAERR 129

Query: 169 DRLRSIMARLDAEGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLD 228
           +RLR+I++ +  +  L+K  K  E+ + +         +E  Q  +Y EG  +L   RL 
Sbjct: 130 ERLRNILSVVGTDA-LKKTKKDDEKSKKS---------KEEYQQTWYHEGPNSLKVARLW 189

Query: 229 IAKYSIVRAASRLERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFS 288
           IA YS+ RA  RLE A+  ++ P+    ++M    +   SL   CS+IGDDRP+S C FS
Sbjct: 190 IANYSLPRAMKRLEEARLHKEIPETTRTSQMQELHKSLRSLNNFCSQIGDDRPISYCHFS 249

Query: 289 SDGKFLATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNEC--------LATAS 348
            + K LAT+  SG+ KLWS+P    +   +GH   V  ++F P +          LA+ +
Sbjct: 250 PNSKMLATACWSGLCKLWSVPDCSLLHTLRGHNTNVGAIVFHPKSTVSLDQKDVNLASCA 309

Query: 349 ADRTARLWSAEG-SLLRTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLL 408
           AD + +LWS +    +   EGH  R+AR+ +HPSG++LGTT +D++WRLWD+E   E+L 
Sbjct: 310 ADGSVKLWSLDSDEPVADIEGHTVRVARVMWHPSGRFLGTTCYDRSWRLWDLEAQEEILH 369

Query: 409 QEGHSRSVYGIAFHHDGSLVSSCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGY 468
           QEGHS  VY IAFH DGSL  + GLDA  R+WDLRTGR I+ LEGH+K + G++FSPNGY
Sbjct: 370 QEGHSMGVYDIAFHQDGSLAGTGGLDAFGRVWDLRTGRCIMFLEGHLKEIYGINFSPNGY 429

Query: 469 HLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSAR 528
           H+ATG  DNTC++WDLR+++ +Y IPAH NLV+ VK+EP  G FL+T ++D TAKIW+  
Sbjct: 430 HIATGSGDNTCKVWDLRQRRCVYTIPAHQNLVTGVKFEPIHGDFLLTGAYDNTAKIWTHP 489

Query: 529 DFKPVKTLSGHEAKVTSLDIIADGQGIATVSHDRTIKLW 559
            + P+KTL+GHE KV  LDI +DGQ IAT S+DRT KLW
Sbjct: 490 GWSPLKTLAGHEGKVMGLDISSDGQLIATCSYDRTFKLW 518

BLAST of Moc08g31960 vs. ExPASy TrEMBL
Match: A0A6J1DAU7 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Momordica charantia OX=3673 GN=LOC111019283 PE=4 SV=1)

HSP 1 Score: 1110.1 bits (2870), Expect = 0.0e+00
Identity = 571/571 (100.00%), Postives = 571/571 (100.00%), Query Frame = 0

Query: 1   MEVDDQNPASTSAESPETLPDGENVVDLDSPSEPIQPAASSVIPPPIVPAIAPIPPPPVI 60
           MEVDDQNPASTSAESPETLPDGENVVDLDSPSEPIQPAASSVIPPPIVPAIAPIPPPPVI
Sbjct: 1   MEVDDQNPASTSAESPETLPDGENVVDLDSPSEPIQPAASSVIPPPIVPAIAPIPPPPVI 60

Query: 61  RPLAPLPSRPPLFRPPVAQNGELRTSDSDSEHDDLAPSRAAPGSAADYEVSEESRQVRER 120
           RPLAPLPSRPPLFRPPVAQNGELRTSDSDSEHDDLAPSRAAPGSAADYEVSEESRQVRER
Sbjct: 61  RPLAPLPSRPPLFRPPVAQNGELRTSDSDSEHDDLAPSRAAPGSAADYEVSEESRQVRER 120

Query: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180
           QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA
Sbjct: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180

Query: 181 EGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLDIAKYSIVRAASR 240
           EGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLDIAKYSIVRAASR
Sbjct: 181 EGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLDIAKYSIVRAASR 240

Query: 241 LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300
           LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS
Sbjct: 241 LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300

Query: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLRTFEG 360
           GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLRTFEG
Sbjct: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLRTFEG 360

Query: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS 420
           HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS
Sbjct: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS 420

Query: 421 SCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480
           SCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS
Sbjct: 421 SCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480

Query: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540
           LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII
Sbjct: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540

Query: 541 ADGQGIATVSHDRTIKLWSVNNKDEQTMDVD 572
           ADGQGIATVSHDRTIKLWSVNNKDEQTMDVD
Sbjct: 541 ADGQGIATVSHDRTIKLWSVNNKDEQTMDVD 571

BLAST of Moc08g31960 vs. ExPASy TrEMBL
Match: A0A0A0KHD8 (WD_REPEATS_REGION domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G404170 PE=4 SV=1)

HSP 1 Score: 1042.0 bits (2693), Expect = 9.4e-301
Identity = 533/571 (93.35%), Postives = 554/571 (97.02%), Query Frame = 0

Query: 1   MEVDDQNPASTSAESPETLPDGENVVDLDSPSEPIQPAASSVIPPPIVPAIAPIPPPPVI 60
           ME+DDQNPAST+AESPETLP GEN  +LD+P+EP QPAA+SVIPP IVPAIAPI PPP+I
Sbjct: 1   MEIDDQNPASTAAESPETLPGGEN-EELDNPAEPTQPAATSVIPPSIVPAIAPI-PPPII 60

Query: 61  RPLAPLPSRPPLFRPPVAQNGELRTSDSDSEHDDLAPSRAAPGSAADYEVSEESRQVRER 120
           RPLAPLPSRPPLFRPPV QNGELRTSDSDSEHD+LAPSR APGS A+YE+SEESRQ RER
Sbjct: 61  RPLAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPSRTAPGSTAEYEISEESRQARER 120

Query: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180
            EKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA
Sbjct: 121 HEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180

Query: 181 EGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLDIAKYSIVRAASR 240
           EGQLEKL+K HEEEEAAAT GTE+AEEEVLQYPFYTEGSKALL+ R+DIAKYSI+RA+SR
Sbjct: 181 EGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSR 240

Query: 241 LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300
           LERAKRKRDDPDEDVEAEMDWALRQA SLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS
Sbjct: 241 LERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300

Query: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLRTFEG 360
           GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLL+TFEG
Sbjct: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLKTFEG 360

Query: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS 420
           HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS
Sbjct: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS 420

Query: 421 SCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480
           SCGLDALAR+WDLRTGRS+LALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS
Sbjct: 421 SCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480

Query: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540
           LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII
Sbjct: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540

Query: 541 ADGQGIATVSHDRTIKLWSVNNKDEQTMDVD 572
           +DGQ IATVSHDRTIKLWSVN+KD QTMDVD
Sbjct: 541 SDGQCIATVSHDRTIKLWSVNSKDIQTMDVD 569

BLAST of Moc08g31960 vs. ExPASy TrEMBL
Match: A0A1S4E4U8 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Cucumis melo OX=3656 GN=LOC103502534 PE=4 SV=1)

HSP 1 Score: 1036.9 bits (2680), Expect = 3.0e-299
Identity = 529/571 (92.64%), Postives = 553/571 (96.85%), Query Frame = 0

Query: 1   MEVDDQNPASTSAESPETLPDGENVVDLDSPSEPIQPAASSVIPPPIVPAIAPIPPPPVI 60
           ME+DDQNPAST+AESPETLP GEN  +LD+P+EP QPAA+SVIPP IVPAIAPI PPP+I
Sbjct: 1   MEIDDQNPASTAAESPETLPGGEN-EELDNPAEPTQPAATSVIPPSIVPAIAPI-PPPII 60

Query: 61  RPLAPLPSRPPLFRPPVAQNGELRTSDSDSEHDDLAPSRAAPGSAADYEVSEESRQVRER 120
           RPLAPLPSRPPLFRPPV QNGELRTSDSDSEHD+LAP+R APGS A+YE+SEESRQ RER
Sbjct: 61  RPLAPLPSRPPLFRPPVTQNGELRTSDSDSEHDELAPARTAPGSTAEYEISEESRQARER 120

Query: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180
            EKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA
Sbjct: 121 HEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180

Query: 181 EGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLDIAKYSIVRAASR 240
           EGQLEKL+K HEEEEAAAT GTE+AEEEVLQYPFYTEGSKALL+ R+DIAKYSI+RA+SR
Sbjct: 181 EGQLEKLMKVHEEEEAAATGGTEEAEEEVLQYPFYTEGSKALLDARIDIAKYSILRASSR 240

Query: 241 LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300
           LERAKRKRDDPDEDVEAEMDWALRQA SLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS
Sbjct: 241 LERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300

Query: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLRTFEG 360
           GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWS EGSLL+TFEG
Sbjct: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSPEGSLLKTFEG 360

Query: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS 420
           HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFH DGSLVS
Sbjct: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHQDGSLVS 420

Query: 421 SCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480
           SCGLDALAR+WDLRTGRS+LALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS
Sbjct: 421 SCGLDALARVWDLRTGRSVLALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480

Query: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540
           LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII
Sbjct: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540

Query: 541 ADGQGIATVSHDRTIKLWSVNNKDEQTMDVD 572
           +DGQ IATVSHDRTIKLWSVN+KD+QTMD+D
Sbjct: 541 SDGQCIATVSHDRTIKLWSVNSKDKQTMDID 569

BLAST of Moc08g31960 vs. ExPASy TrEMBL
Match: A0A6J1KDQ7 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Cucurbita maxima OX=3661 GN=LOC111493950 PE=4 SV=1)

HSP 1 Score: 1032.7 bits (2669), Expect = 5.7e-298
Identity = 527/571 (92.29%), Postives = 552/571 (96.67%), Query Frame = 0

Query: 1   MEVDDQNPASTSAESPETLPDGENVVDLDSPSEPIQPAASSVIPPPIVPAIAPIPPPPVI 60
           M+VDDQNPAST+AESPE LP GEN  DLD+P+EP+QPAA++VIP  IVP+IAPIPPP + 
Sbjct: 1   MDVDDQNPASTAAESPEILPGGEN-EDLDNPAEPMQPAATTVIPSSIVPSIAPIPPPLIT 60

Query: 61  RPLAPLPSRPPLFRPPVAQNGELRTSDSDSEHDDLAPSRAAPGSAADYEVSEESRQVRER 120
           RPLAPLPSRP LFRPPVAQNGE+RTSDSDSEHD+LAPSRA  GS A+YEVSEESRQVRER
Sbjct: 61  RPLAPLPSRPLLFRPPVAQNGEMRTSDSDSEHDELAPSRATQGSTAEYEVSEESRQVRER 120

Query: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180
           QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA
Sbjct: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180

Query: 181 EGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLDIAKYSIVRAASR 240
           EGQLEKLLK HEEEEAAAT GTE+AEEEVLQYPFYTEG KALL+ R+DIAKYSIVRAASR
Sbjct: 181 EGQLEKLLKVHEEEEAAATGGTEEAEEEVLQYPFYTEGPKALLDARIDIAKYSIVRAASR 240

Query: 241 LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300
           LERAKRKRDDPDEDVEAEMDWALRQA SL+LDCSEIGDDRPLSGCSFSSDGKFLATSSLS
Sbjct: 241 LERAKRKRDDPDEDVEAEMDWALRQAESLILDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300

Query: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLRTFEG 360
           GVAK+WSMPQVRKVSNFKGHTERVTDV+FSPVNECLATASADRTARLWSAEGSLLRTFEG
Sbjct: 301 GVAKMWSMPQVRKVSNFKGHTERVTDVIFSPVNECLATASADRTARLWSAEGSLLRTFEG 360

Query: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS 420
           HLDRLARIAFHPSGKYLGTTSFDKTWRLWD+ETGVELLLQEGHSRSVYGI FHHDGSLVS
Sbjct: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDIETGVELLLQEGHSRSVYGIDFHHDGSLVS 420

Query: 421 SCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480
           SCGLDALAR+WDLRTGRS+LALEGHVKPVLGV+FSPNGYHLATGGEDNTCRIWDLRKKKS
Sbjct: 421 SCGLDALARVWDLRTGRSVLALEGHVKPVLGVNFSPNGYHLATGGEDNTCRIWDLRKKKS 480

Query: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540
           LYIIPAHSNL+SQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII
Sbjct: 481 LYIIPAHSNLISQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540

Query: 541 ADGQGIATVSHDRTIKLWSVNNKDEQTMDVD 572
           +DGQ IATVSHDRTIKLWSVN+KDEQTMDVD
Sbjct: 541 SDGQCIATVSHDRTIKLWSVNSKDEQTMDVD 570

BLAST of Moc08g31960 vs. ExPASy TrEMBL
Match: A0A6J1G322 (U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Cucurbita moschata OX=3662 GN=LOC111450343 PE=4 SV=1)

HSP 1 Score: 1029.6 bits (2661), Expect = 4.8e-297
Identity = 525/571 (91.94%), Postives = 551/571 (96.50%), Query Frame = 0

Query: 1   MEVDDQNPASTSAESPETLPDGENVVDLDSPSEPIQPAASSVIPPPIVPAIAPIPPPPVI 60
           M+VDDQNPAST+AESPE LP GEN  DLD+P+EP+QPAA++VIP  IVP+IAPIPPP + 
Sbjct: 1   MDVDDQNPASTAAESPEILPGGEN-EDLDNPAEPMQPAATTVIPSSIVPSIAPIPPPLIT 60

Query: 61  RPLAPLPSRPPLFRPPVAQNGELRTSDSDSEHDDLAPSRAAPGSAADYEVSEESRQVRER 120
           RPLAPLPSRP LFRPPVAQNGE+RTSDSDSEHD+LAPSRA  GS A+YEVSEESRQVRER
Sbjct: 61  RPLAPLPSRPLLFRPPVAQNGEMRTSDSDSEHDELAPSRATQGSTAEYEVSEESRQVRER 120

Query: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180
           QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA
Sbjct: 121 QEKAMQEFLMKRRASALAVPTNDMAVRARLRRLGEPITLFGEREMERRDRLRSIMARLDA 180

Query: 181 EGQLEKLLKAHEEEEAAATAGTEDAEEEVLQYPFYTEGSKALLNTRLDIAKYSIVRAASR 240
           EGQLEKLLK HEEEEAAAT GTE+AEEEVLQYPFYTEG KALL+ R+DIAKYS+VRAASR
Sbjct: 181 EGQLEKLLKVHEEEEAAATGGTEEAEEEVLQYPFYTEGPKALLDARIDIAKYSVVRAASR 240

Query: 241 LERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300
           LERAKRKRDDPDEDVEAEMDWALRQA SLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS
Sbjct: 241 LERAKRKRDDPDEDVEAEMDWALRQAESLVLDCSEIGDDRPLSGCSFSSDGKFLATSSLS 300

Query: 301 GVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEGSLLRTFEG 360
           GVAK+WSMPQVRKVSNF GHTERVTDV+FSPVNECLATASADRTARLWSAEGSLLRTFEG
Sbjct: 301 GVAKMWSMPQVRKVSNFNGHTERVTDVIFSPVNECLATASADRTARLWSAEGSLLRTFEG 360

Query: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSVYGIAFHHDGSLVS 420
           HLDRLARIAFHPSGKYLGTTSFDKTWRLWD+ETGVELLLQEGHSRSVYGI FHHDGSLVS
Sbjct: 361 HLDRLARIAFHPSGKYLGTTSFDKTWRLWDIETGVELLLQEGHSRSVYGIDFHHDGSLVS 420

Query: 421 SCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKS 480
           SCGLDALAR+WDLRTGRS+LALEGHVKPVLGV+FSPNGYHLATGGEDNTCRIWDLRKK+S
Sbjct: 421 SCGLDALARVWDLRTGRSVLALEGHVKPVLGVNFSPNGYHLATGGEDNTCRIWDLRKKRS 480

Query: 481 LYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540
           LYIIPAHSNL+SQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII
Sbjct: 481 LYIIPAHSNLISQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKVTSLDII 540

Query: 541 ADGQGIATVSHDRTIKLWSVNNKDEQTMDVD 572
           +DGQ IATVSHDRTIKLWSVN+KDEQTMDVD
Sbjct: 541 SDGQCIATVSHDRTIKLWSVNSKDEQTMDVD 570

BLAST of Moc08g31960 vs. TAIR 10
Match: AT2G41500.1 (WD-40 repeat family protein / small nuclear ribonucleoprotein Prp4p-related )

HSP 1 Score: 706.8 bits (1823), Expect = 1.4e-203
Identity = 361/549 (65.76%), Postives = 429/549 (78.14%), Query Frame = 0

Query: 29  DSPSEPIQPAASSVIPPPIVPAIAPIPPPPVIRPLAPLPSRPPLFRPPVAQNGELRTSDS 88
           D+ S P   A   V+PP   P +APIP  P      P  +RPP FRPPV+QNG ++TSDS
Sbjct: 25  DASSLPGFSAIPPVVPPSFPPPMAPIPMMP-----HPPVARPPTFRPPVSQNGGVKTSDS 84

Query: 89  DSEHDDLAPSRAAPGSAADYEVSEESRQVRERQEKAMQEFLMKRRASALAVPTNDMAVRA 148
           DSE DD              E+SEES+QVRERQEKA+Q+ L+KRRA+A+AVPTND AVR 
Sbjct: 85  DSESDD-----------EHIEISEESKQVRERQEKALQDLLVKRRAAAMAVPTNDKAVRD 144

Query: 149 RLRRLGEPITLFGEREMERRDRLRSIMARLDAEGQLEKLLKAHEEEEAAATAGTEDAEEE 208
           RLRRLGEPITLFGE+EMERR RL  ++ R D  GQL+KL+K HEE+        E+ ++E
Sbjct: 145 RLRRLGEPITLFGEQEMERRARLTQLLTRYDINGQLDKLVKDHEED----VTPKEEVDDE 204

Query: 209 VLQYPFYTEGSKALLNTRLDIAKYSIVRAASRLERAKRKRDDPDEDVEAEMDWALRQAGS 268
           VL+YPF+TEG K L   R++IAK+S+ RAA R++RAKR+RDDPDED++AE  WAL+ A  
Sbjct: 205 VLEYPFFTEGPKELREARIEIAKFSVKRAAVRIQRAKRRRDDPDEDMDAETKWALKHAKH 264

Query: 269 LVLDCSEIGDDRPLSGCSFSSDGKFLATSSLSGVAKLWSMPQV-RKVSNFKGHTERVTDV 328
           + LDCS  GDDRPL+GCSFS DGK LAT SLSGV KLW MPQV   ++  K H ER TDV
Sbjct: 265 MALDCSNFGDDRPLTGCSFSRDGKILATCSLSGVTKLWEMPQVTNTIAVLKDHKERATDV 324

Query: 329 MFSPVNECLATASADRTARLWSAEGSLLRTFEGHLDRLARIAFHPSGKYLGTTSFDKTWR 388
           +FSPV++CLATASADRTA+LW  +G+LL+TFEGHLDRLAR+AFHPSGKYLGTTS+DKTWR
Sbjct: 325 VFSPVDDCLATASADRTAKLWKTDGTLLQTFEGHLDRLARVAFHPSGKYLGTTSYDKTWR 384

Query: 389 LWDVETGVELLLQEGHSRSVYGIAFHHDGSLVSSCGLDALARIWDLRTGRSILALEGHVK 448
           LWD+ TG ELLLQEGHSRSVYGIAF  DG+L +SCGLD+LAR+WDLRTGRSIL  +GH+K
Sbjct: 385 LWDINTGAELLLQEGHSRSVYGIAFQQDGALAASCGLDSLARVWDLRTGRSILVFQGHIK 444

Query: 449 PVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTA 508
           PV  V+FSPNGYHLA+GGEDN CRIWDLR +KSLYIIPAH+NLVSQVKYEPQEGYFL TA
Sbjct: 445 PVFSVNFSPNGYHLASGGEDNQCRIWDLRMRKSLYIIPAHANLVSQVKYEPQEGYFLATA 504

Query: 509 SFDMTAKIWSARDFKPVKTLSGHEAKVTSLDIIADGQGIATVSHDRTIKLWSVNNKDE-- 568
           S+DM   IWS RDF  VK+L+GHE+KV SLDI AD   IATVSHDRTIKLW+ +  D+  
Sbjct: 505 SYDMKVNIWSGRDFSLVKSLAGHESKVASLDITADSSCIATVSHDRTIKLWTSSGNDDED 553

Query: 569 ---QTMDVD 572
              +TMD+D
Sbjct: 565 EEKETMDID 553

BLAST of Moc08g31960 vs. TAIR 10
Match: AT2G05720.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 310.1 bits (793), Expect = 3.8e-84
Identity = 167/327 (51.07%), Postives = 197/327 (60.24%), Query Frame = 0

Query: 218 GSKALLNTRLDIAKYSIVRAASRLERAKRKRDDPDEDVEAEMDWALRQAGSLVLDCSEIG 277
           G   L   R++I K  I RAA R++R  R+R+DPDED  AE   AL+    +VL  S+ G
Sbjct: 2   GPTELREARIEITKDFIKRAALRIQRENRRRNDPDEDKNAETKLALKHCKDMVLGSSKFG 61

Query: 278 DDRPLSGCSFSSDGKFLATSSLSGVAKLWSMPQV-RKVSNFKGHTERVTDVMFSPV-NEC 337
           DDRPL+GCS S DGK L T SLSGV KLW +PQV  K+   KGH E VTDV+FS V +EC
Sbjct: 62  DDRPLTGCSLSRDGKILVTCSLSGVPKLWEVPQVTNKIVVLKGHKEHVTDVVFSSVDDEC 121

Query: 338 LATASADRTARLWSAEGSLLRTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGV 397
           LATAS DRT ++W  +G+LL+TF+                                    
Sbjct: 122 LATASTDRTEKIWKTDGTLLQTFK------------------------------------ 181

Query: 398 ELLLQEGHSRSVYGIAFHHDGSLVSSCGLDALARIWDLRTGRSILALEGHVKPVLGVSFS 457
                                   +S G D+LAR+WDLRT R+IL  +GH+K VL V FS
Sbjct: 182 ------------------------ASSGFDSLARVWDLRTARNILIFQGHIKQVLSVDFS 241

Query: 458 PNGYHLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKI 517
           PNGYHLA+GGEDN CRIWDLR +K LYIIPAH NLVSQVKYEPQE YFL TAS DM   I
Sbjct: 242 PNGYHLASGGEDNQCRIWDLRMRKLLYIIPAHVNLVSQVKYEPQERYFLATASHDMNVNI 268

Query: 518 WSARDFKPVKTLSGHEAKVTSLDIIAD 543
           WS RDF  VK+L GHE+KV SLDI  D
Sbjct: 302 WSGRDFSLVKSLVGHESKVASLDIAVD 268

BLAST of Moc08g31960 vs. TAIR 10
Match: AT3G49660.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 168.7 bits (426), Expect = 1.4e-41
Identity = 97/290 (33.45%), Postives = 146/290 (50.34%), Query Frame = 0

Query: 279 DRPLSGCSFSSDGKFLATSSLSGVAKLWSM-----PQVRKVSNFKGHTERVTDVMFSPVN 338
           +R +S   FSSDG+ LA++S     + +++     P    V  F GH   ++DV FS   
Sbjct: 24  NRAVSSVKFSSDGRLLASASADKTIRTYTINTINDPIAEPVQEFTGHENGISDVAFSSDA 83

Query: 339 ECLATASADRTARLWSAE-GSLLRTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVE 398
             + +AS D+T +LW  E GSL++T  GH +    + F+P    + + SFD+T R+WDV 
Sbjct: 84  RFIVSASDDKTLKLWDVETGSLIKTLIGHTNYAFCVNFNPQSNMIVSGSFDETVRIWDVT 143

Query: 399 TGVELLLQEGHSRSVYGIAFHHDGSLVSSCGLDALARIWDLRTGRSILAL-EGHVKPVLG 458
           TG  L +   HS  V  + F+ DGSL+ S   D L RIWD  TG  +  L +    PV  
Sbjct: 144 TGKCLKVLPAHSDPVTAVDFNRDGSLIVSSSYDGLCRIWDSGTGHCVKTLIDDENPPVSF 203

Query: 459 VSFSPNGYHLATGGEDNTCRIWDLRKKKSLYIIPAHSNLVSQVK--YEPQEGYFLVTASF 518
           V FSPNG  +  G  DNT R+W++   K L     H N    +   +    G  +V+ S 
Sbjct: 204 VRFSPNGKFILVGTLDNTLRLWNISSAKFLKTYTGHVNAQYCISSAFSVTNGKRIVSGSE 263

Query: 519 DMTAKIWSARDFKPVKTLSGHEAKVTSLDIIADGQGIATVSHDRTIKLWS 560
           D    +W     K ++ L GH   V ++        IA+ S D+T+++W+
Sbjct: 264 DNCVHMWELNSKKLLQKLEGHTETVMNVACHPTENLIASGSLDKTVRIWT 313

BLAST of Moc08g31960 vs. TAIR 10
Match: AT4G02730.1 (Transducin/WD40 repeat-like superfamily protein )

HSP 1 Score: 144.8 bits (364), Expect = 2.1e-34
Identity = 81/259 (31.27%), Postives = 136/259 (52.51%), Query Frame = 0

Query: 312 RKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEG-SLLRTFEGHLDRLARIAF 371
           R +   +GHT  ++ V FS     LA+AS D+T  LWSA   SL+  +EGH   ++ +A+
Sbjct: 34  RHLKTLEGHTAAISCVKFSNDGNLLASASVDKTMILWSATNYSLIHRYEGHSSGISDLAW 93

Query: 372 HPSGKYLGTTSFDKTWRLWDVETGVELL-LQEGHSRSVYGIAFHHDGSLVSSCGLDALAR 431
                Y  + S D T R+WD  +  E L +  GH+  V+ + F+   +L+ S   D   R
Sbjct: 94  SSDSHYTCSASDDCTLRIWDARSPYECLKVLRGHTNFVFCVNFNPPSNLIVSGSFDETIR 153

Query: 432 IWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGEDNTCRIWDLRKKKSL-YIIPAHS 491
           IW+++TG+ +  ++ H  P+  V F+ +G  + +   D +C+IWD ++   L  +I   S
Sbjct: 154 IWEVKTGKCVRMIKAHSMPISSVHFNRDGSLIVSASHDGSCKIWDAKEGTCLKTLIDDKS 213

Query: 492 NLVSQVKYEPQEGYFLVTASFDMTAKIWSARDFKPVKTLSGHEAKV---TSLDIIADGQG 551
             VS  K+ P  G F++ A+ D T K+ +    K +K  +GH  KV   TS   + +G+ 
Sbjct: 214 PAVSFAKFSP-NGKFILVATLDSTLKLSNYATGKFLKVYTGHTNKVFCITSAFSVTNGKY 273

Query: 552 IATVSHDRTIKLWSVNNKD 565
           I + S D  + LW +  ++
Sbjct: 274 IVSGSEDNCVYLWDLQARN 291

BLAST of Moc08g31960 vs. TAIR 10
Match: AT2G33340.1 (MOS4-associated complex 3B )

HSP 1 Score: 142.1 bits (357), Expect = 1.4e-33
Identity = 85/288 (29.51%), Postives = 146/288 (50.69%), Query Frame = 0

Query: 294 LATSSLSGVAKLWSMPQVRKVSNFKGHTERVTDVMFSPVNECLATASADRTARLWSAEG- 353
           +AT  +   A L+  P  + +S   GH+++VT V F   ++ + TASAD+T R+W   G 
Sbjct: 237 IATGGVDATAVLFDRPSGQILSTLTGHSKKVTSVKFVGDSDLVLTASADKTVRIWRNPGD 296

Query: 354 ---SLLRTFEGHLDRLARIAFHPSGKYLGTTSFDKTWRLWDVETGVELLLQEGHSRSV-- 413
              +   T   H   +  +  HP+ KY  + S D TW  +D+ +G  L      S++V  
Sbjct: 297 GNYACGYTLNDHSAEVRAVTVHPTNKYFVSASLDGTWCFYDLSSGSCLAQVSDDSKNVDY 356

Query: 414 YGIAFHHDGSLVSSCGLDALARIWDLRTGRSILALEGHVKPVLGVSFSPNGYHLATGGED 473
              AFH DG ++ +    ++ +IWD+++  ++   +GH   V  +SFS NGY LAT  ED
Sbjct: 357 TAAAFHPDGLILGTGTSQSVVKIWDVKSQANVAKFDGHTGEVTAISFSENGYFLATAAED 416

Query: 474 NTCRIWDLRKKKSL-YIIPAHSNLVSQVKYEPQEGYFLVTASFDMTAKIWSAR-DFKPVK 533
              R+WDLRK ++    + A +N    V+++P   Y  + AS     +  S + ++  +K
Sbjct: 417 GV-RLWDLRKLRNFKSFLSADAN---SVEFDPSGSYLGIAASDIKVYQTASVKAEWNLIK 476

Query: 534 TLS--GHEAKVTSLDIIADGQGIATVSHDRTIKLWSVNNKDEQTMDVD 572
           TL       K T +   +D Q +A  S DR ++++ +   ++  +D D
Sbjct: 477 TLPDLSGTGKATCVKFGSDAQYVAVGSMDRNLRIFGLPGDEKANVDDD 520

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022151320.10.0e+00100.00U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Momordica charantia][more]
XP_038885479.17.9e-30293.70U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Benincasa hispida] >XP_... [more]
XP_004146749.11.9e-30093.35U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis sativus] >XP_03... [more]
XP_008464716.16.3e-29992.64PREDICTED: U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucumis melo... [more]
XP_022999661.11.2e-29792.29U4/U6 small nuclear ribonucleoprotein PRP4-like protein [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
O222121.9e-20265.76U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Arabidopsis thaliana ... [more]
Q3MHE25.3e-11547.28U4/U6 small nuclear ribonucleoprotein Prp4 OS=Bos taurus OX=9913 GN=PRPF4 PE=2 S... [more]
O431725.3e-11547.28U4/U6 small nuclear ribonucleoprotein Prp4 OS=Homo sapiens OX=9606 GN=PRPF4 PE=1... [more]
Q5NVD05.3e-11547.28U4/U6 small nuclear ribonucleoprotein Prp4 OS=Pongo abelii OX=9601 GN=PRPF4 PE=2... [more]
Q9DAW69.0e-11547.28U4/U6 small nuclear ribonucleoprotein Prp4 OS=Mus musculus OX=10090 GN=Prpf4 PE=... [more]
Match NameE-valueIdentityDescription
A0A6J1DAU70.0e+00100.00U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Momordica charantia O... [more]
A0A0A0KHD89.4e-30193.35WD_REPEATS_REGION domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G... [more]
A0A1S4E4U83.0e-29992.64U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Cucumis melo OX=3656 ... [more]
A0A6J1KDQ75.7e-29892.29U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Cucurbita maxima OX=3... [more]
A0A6J1G3224.8e-29791.94U4/U6 small nuclear ribonucleoprotein PRP4-like protein OS=Cucurbita moschata OX... [more]
Match NameE-valueIdentityDescription
AT2G41500.11.4e-20365.76WD-40 repeat family protein / small nuclear ribonucleoprotein Prp4p-related [more]
AT2G05720.13.8e-8451.07Transducin/WD40 repeat-like superfamily protein [more]
AT3G49660.11.4e-4133.45Transducin/WD40 repeat-like superfamily protein [more]
AT4G02730.12.1e-3431.27Transducin/WD40 repeat-like superfamily protein [more]
AT2G33340.11.4e-3329.51MOS4-associated complex 3B [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 177..208
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 38..74
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..121
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..18
NoneNo IPR availablePANTHERPTHR19846WD40 REPEAT PROTEINcoord: 106..564
NoneNo IPR availablePANTHERPTHR19846:SF3BNAC04G01920D PROTEINcoord: 106..564
NoneNo IPR availablePROSITEPS50294WD_REPEATS_REGIONcoord: 484..517
score: 9.151057
NoneNo IPR availablePROSITEPS50294WD_REPEATS_REGIONcoord: 358..394
score: 10.917412
NoneNo IPR availablePROSITEPS50294WD_REPEATS_REGIONcoord: 400..437
score: 10.943775
NoneNo IPR availablePROSITEPS50294WD_REPEATS_REGIONcoord: 317..349
score: 11.207411
NoneNo IPR availablePROSITEPS50294WD_REPEATS_REGIONcoord: 442..476
score: 13.2374
NoneNo IPR availablePROSITEPS50294WD_REPEATS_REGIONcoord: 527..562
score: 11.049229
NoneNo IPR availableCDDcd00200WD40coord: 281..559
e-value: 5.00431E-92
score: 282.301
IPR020472G-protein beta WD-40 repeatPRINTSPR00320GPROTEINBRPTcoord: 546..560
score: 31.77
coord: 336..350
score: 36.13
coord: 461..475
score: 40.13
IPR014906Pre-mRNA processing factor 4 (PRP4)-likeSMARTSM00500pr04_2coord: 141..194
e-value: 4.3E-17
score: 72.8
IPR014906Pre-mRNA processing factor 4 (PRP4)-likePFAMPF08799PRP4coord: 146..173
e-value: 1.1E-12
score: 47.2
IPR001680WD40 repeatSMARTSM00320WD40_4coord: 266..307
e-value: 0.014
score: 24.6
coord: 477..517
e-value: 2.1E-4
score: 30.6
coord: 435..474
e-value: 5.0E-11
score: 52.6
coord: 351..390
e-value: 7.9E-8
score: 42.0
coord: 520..559
e-value: 4.1E-9
score: 46.3
coord: 310..349
e-value: 1.3E-8
score: 44.6
coord: 393..432
e-value: 2.1E-6
score: 37.3
IPR001680WD40 repeatPFAMPF00400WD40coord: 440..474
e-value: 6.0E-9
score: 36.4
coord: 401..432
e-value: 2.9E-4
score: 21.6
coord: 522..559
e-value: 1.4E-5
score: 25.7
coord: 352..390
e-value: 7.9E-5
score: 23.4
coord: 314..349
e-value: 1.0E-4
score: 23.0
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 400..441
score: 14.251289
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 358..399
score: 13.482671
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 484..526
score: 11.878597
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 317..349
score: 13.516088
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 442..483
score: 16.958164
IPR001680WD40 repeatPROSITEPS50082WD_REPEATS_2coord: 527..568
score: 13.917108
IPR036285PRP4-like superfamilyGENE3D4.10.280.110coord: 112..186
e-value: 2.7E-14
score: 54.4
IPR036285PRP4-like superfamilySUPERFAMILY158230PRP4-likecoord: 123..179
IPR015943WD40/YVTN repeat-like-containing domain superfamilyGENE3D2.130.10.10coord: 261..440
e-value: 2.1E-49
score: 170.1
IPR015943WD40/YVTN repeat-like-containing domain superfamilyGENE3D2.130.10.10coord: 441..570
e-value: 2.1E-39
score: 137.4
IPR019775WD40 repeat, conserved sitePROSITEPS00678WD_REPEATS_1coord: 377..391
IPR019775WD40 repeat, conserved sitePROSITEPS00678WD_REPEATS_1coord: 419..433
IPR019775WD40 repeat, conserved sitePROSITEPS00678WD_REPEATS_1coord: 461..475
IPR036322WD40-repeat-containing domain superfamilySUPERFAMILY50978WD40 repeat-likecoord: 276..558

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc08g31960.1Moc08g31960.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0000398 mRNA splicing, via spliceosome
cellular_component GO:0046540 U4/U6 x U5 tri-snRNP complex
molecular_function GO:0005515 protein binding
molecular_function GO:0030621 U4 snRNA binding
molecular_function GO:0017070 U6 snRNA binding