Lsi01G009630 (gene) Bottle gourd (USVL1VR-Ls) v1

Overview
NameLsi01G009630
Typegene
OrganismLagenaria siceraria (Bottle gourd (USVL1VR-Ls) v1)
DescriptionU1 small nuclear ribonucleoprotein A isoform X2
Locationchr01: 7813381 .. 7821070 (+)
RNA-Seq ExpressionLsi01G009630
SyntenyLsi01G009630
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
TGTGAGCCCATTTGAGCGACGATGCTTTTATCAGAATTGAAGCAATCCGACCCAAAATTCAGGGCGTCCAGCTCTCTCTCTCTCGCTACTTCGTCAGAGACTCGGAGAGAACAGCACTCCGATGGACGACATGGCGAGCTACTATCCGCAACCGCAACCACCGCCCCAGCCTTCTGGTCTTGAACCTCCTCACTATCCCTATTACCAGGTGCCTCCTCCTCCTCCTTCAGCACCGCCGTCTCAGCATTACCTCTCTCAACACCCGCCCACCTTCGCTTCCTATGGCTTACCTTTTTTACCTCACGCAGCCTCCATCAATGAGGTCCGAACCCTATTCATAGCTGGCCTTCCTGAAGATGTCAAGCCCCGAGAAATTTACAATCTCTTCCGGGAGTTTCCTGGATACGAGTCCTCTCATCTTCGGACCCCCACGCAGACGACCCAGGTGTTGTACTTTGCTTTTCTGTTTTGCAAAATTGTTCATTCTTTCGGAATTTTCTTTTTCTTCTGAATGTGAATAGTTTCTAATTTGGGTGCCTGGATGCTTACTCATGTAAATGCGCTTAGCTTCTAGAGTCTGACGTCTGAGTAAATTCGTGGGCATTTAGTTGCTTTAGTATCAAAGATGGAGATGGGTGCTGGGAGAGCAAACAAACTTCTACATTGGTTAGGGAAGGAAGTGATCACAAACAAAGACCAGATGATTGGGTATATAAATGTAGACAATTATCGTTGATACGAGAACTTCTGAGTGGTTCCAAAACTAAAGCCACGAGGGTTTATGCCAAAAATGGACAATATCATACCATTGTGAAAATAATGAAGATAGGTGAAGTATTGTTGTCTTTATCAATTAAAAACTGGTTCATAGACCATTTTGGTTAGGTTACTGTACGTTTCTGGCTTTTGAAGAAGATCAAGATGATAATGGTTAAAAGAAATCAGCTTGGACCTGGTTTATTGGGGCTCATTTAAAATTCATCTTAACTAGATATTTAATGATTGAATACATTGCAGCCATTTATCCAAGATTTTTTTTGGCACCCAAATGTTACAGTGTTAGGCAAGACACAAAGTGTGTCCAACTAGTCTGAACACTTACTAAGGTAAAAAACACAAGAAATACAAAGTTAAAAGTAAACCACTCAAATGAAAAGAAATACGAAGTAAACCTAATTCTCAAAACTTAATTCAATACCTTGAAACTCATCAATGCCTACTGTGAGATTGTTAGATAAACTATAAATGTGCTGTGTCAAATTCTCACAGAAATGTCATTATGAATGTTGATACTGATGCTTATAGCCTAATACCAATGCCGATATGAAATCTGATGCGTACGTGACTCCATTGGCCTAGTTTTAGTGGGATGCAATTTTTTTAGATTAACAAAAAATTCCTTAGTAACACCCTTCACCCATCATTATTCCACAATGCTATGGAGCACGAGCACCTCAGTAGTCAATATTCGCACCATTAAACGCAAGTTAGAATTAGAAGGCAGAAATTTGGGTTTCACCTATACTACCCACAGCTAAATTCAAGTTAGTATCCAACTTCTAATCCCTTAATTTGATTCGAAGGCCATCCACCAAGTTTGCACCTTTGCTCCCCTTCACAAGTGGATTCTTCTTCCGACAACTATAAAAATGGTTGTAGAAGTATAGGTCGCCCCCTACCTTGAGAGTCATGCTTCAGTACAACCCAAATAATGAAATTATCTTGCTGCCTCGAAATTGGTGACCTTTGTCATGTTTGTAACTATATCTTTTGCTCTGCTCCTTTTGTGAACCTTCTATTAGATGATATAATATTAAACTAGTCTTCACCCATAAGCTTATTTATTTATTTATTTTTTATTAACACCTTCCTCAATGAAACCTCTTTACCTATCAATGACAATGAAAGTTGTGTCCTACCACTTTGTTGTTAGAATTTTTTTGGTACAAAATTATTGTTTAGTAACTTCAAAGGCACATTTCGATCACAATGTTTCACGTGTAATAATATACTATATTAAGTGATGGAGGTCACTTCTCTCTTGTCTATGCTTGAGGAGTTTTTTTTAGGGGTGGGAGAAGGGATGTTTGCTTTTAGAGGCCAACCTTCCAGAGGGATTCTCTTGTGAATCATTCTTTCATTATTTATTGGACCCTGCTCCCACTAGTGAGCCTTTTTTTTTTAATAAAAAAAATCTTTTTCCCCTGCTCTTTGGAGGATTATGAGATCCCAAGGAAAATGTAGTTCTTTGCTAGACAAGTTTTACATGGAAGAGCTAACACTTTGGACGGGCTTTCGAGGAGGATGCTCACTTTAATTGGGTCGTTTTGTTGTATTCTCTATTGGAAGGTGGAAGAAAACCTAGATCACATTCTTTGGAGATGTAAGTTTACTAGTTTATTTGGGCTTATTTCTTCCAAACGTTTGAGTTCTTGCCTAGTTGTCTGTGTCAGACACTTGGTCACGTATAACACACTTGTTAGTATAAAAGATGTGTTAGACACTAGTTACACACAGTCAAATTAGAACCAATATTTGTTCGATATGTATCAAACACTTGTTAAGTATACTAAGTAGACACATATATGACAAGAATAATAAATTTGGAGAGTGAAATATATCCAAGTAATTTTTTAAGCATATAAATCCATAGACTTGTTGATTTTTAATTTCTTCTTGTTTAAAAATGATATATATTTTAGGTTTAGGTCCTATTTTAGTCCCTGAACTTTGAATCTTGTTCTATTTTGGTCCCTAAACTTTCAAAAACGTTCGTTTTAGTCCCTAAACTTTAAAAAAGTGTTTATTTTAGTGCCTACTATTAAATTTATGTTAACTCTTCAACAGAAATGTAAGATAGCTTGTATTTTTGCATGATTAGGCTTTAGATGTATTGAGCAAGATGAATGTGAAGTAAAATGCCAACAATCAATACGTCAAAATACTGAACGACCAAAATTTTGAACTAAAAATATTTTTAAATTCTCCCGCTTTGCCACCTTATCCAAAAATGAAATTCTAGATTTTTTTAGCGTAACTAACTTATTTGGTAGCCTAGTCAACCAAAAATACAAGTCATCTCGCCAATTTGTTAAAGGGTTAACAAAATCTTAATATAGTAGGGACCAAAATAAGCACTTTTTAAAAGTTCAAGGACTAAAACGAACATTTTTACAAGTTTAGGGACCAAAATAATGAACGTCGTGTTTGTGTCTTAAATTTTTTAAAATTGACGTGTTGCTATATTTGTGTCGAGTCGTATCTTTGTCTTGTATTCGTATCTATGCTTCTTAGGTTTTTGCGAGCTCGACATAAGGACATTACCGACACGGTCGGGAAATTCTTCTTCCATTCACCTTTTTGTGAGAATAGTTGTTTTTTGTGGTTTGCTGGGGGTGTGCTATTTTGTGGGTCTTTGTTGAGAACAGAATAACAAAATGTTTAGAGGGTTGGAGAGGGATCCTAGTAATATTTGGTCCCTTGTTAGATTTCATGTTTTTCTTTGGGCTCTAATTTAGAAGATCTTTTGTAACTATTCCTTAAGTACTATTTTGTATAGTTGGAGTGCGTTTATTCCGAAGATCTTTTTTTTTCCCCTTCTTCTTCTTTCAATCTTTTCGTATGCCTTTATATTCTTTCATTTGGTTTTCATGGAAGTGGTTATTATTATTTCTTTCTTTCTTTTGTACCATTTAATTGGATGATAGTGGCTTGTACTTTAGATGTATGCATTTTGTTGTCAGGACAGGAAACTAAAACATTTTTTGGCAATGCCTAAACATGATTGCATTGTGTTATTAGTATTTAGTATCCATGTTTAAGAGTTCTAAAGTTGGTCAATATAATCTCAGGTCCTGGGATGGTCAGGGTGGTCATACGTATTAGTCTCTGGATTGTGATCAAGTTAAGATGTGTGGAGACCATAACATAAACTGAGATATTGACATCCCATGTTTTCAACTGCAAATACTATGCTAATCTGTAGTTTCAAATCCAGTTAATATCGAATAGAAAGTGAAAGCTTGGGGACCTTTGTTGGTAAAGGCAATTTACCAGCTTTCCATATTCTCTGACCAACAACCTACATTGCTGCAATTAATTATAGCAAATTTATGCAAGTACTCTTCAGATACGTGTACTGAGTGCCTATGACTGTCACTTGAACTTTTAAAAGATTATCGTATTCTTTTTCCACTCTCATGAAATATCTTGTGTATACAGCATACCTATCACCGCTTGTGTTGATGGTAGACCTTATATTTTATTGATTAGCGTTTTGCACAGTTTTTCTTACAAATTTTTTGGTGTGGTTACAGCCATTTGCATTTGCTGTATTCTCGGACCAGCAGTCTGCCGTTGGTGCAATGCATGCTGTAAATGTAAGTACTTGTTCCCTGCTTCTGTATTCCAAGTTCTTTGGTTAAGAAAATCATTTCAATGGTCAAAGGAAGTACATAAAAGGGACGTGAAAGAAGGGCATGATGGGTTGGGGAACGAAGTAAGAAGAAAAATAAAGAAGAGAGGTTGCGATCAAGTGGGTGGTGGAAGAACGAGAAGATCTTAAATTTAGATTTGATTCAGATCCGTGAGAGGAGAGGGAGAGAAGAGGTGGCAACGGCGGATGGGCGGCAACTTAGTGGAGAAAGAGGAAGGGGGCCAGAGGGAAGGGCCAAAGGGGGTGGGTGGAGGTCAAGAGAGTTTTTTTGTTTGTCTATAAATTATTATTATTTTTATTATTTTAAAATTATTTTTATAATGCCACGTCATTTCTGTAATGGAGCTGTTAACCAAGGGACTAAATTGAACTATTTCCTATGCTTTGAGGACTTGATTGTTCAATTTTAAAACTTAGGAACTAAATTGAGACAAACCCTAAACCTTATGTACCAAAAAAGTAATTTACCCTTCATTTTTCTTTCTTCTGAATTTTTGCTTTCAATTGGTTGGTCGTTCTTTTGAAGCTGCTAATAATAATAATTGTAACAAGGGGTTAGAAAGTGTTTTGGGTCTCCTCCTGAGCAGCTTATAAAATTGGAAATTGCAAGGATGAAGATAACAAGATTTTTACTCTGAGTAATGTATCACATTAAATTATTAAAACAATTGGTTGCTCTTTGTTGTAGGGCATGGTGTTTGATCTTGAGAAGCAGTCAGTACTGTTTGTTGATTTGGCTAAATCCAATTCAAGATCAAAACGGACGAGGACAGGTACATATCTTAATGGCTGTCCTACTATGATTCTATGAATTTATATTTCTATTTTATTTAGTGGTATAACTATAACCAGTTTGAATATTTTTTGATGTGTTCATGAAATGCTTAGAGGATGAAAGATATGGATCGGATAAGAAAGCTAAAGTATCCATCTTTTCAAGGAGTACTCCTGATCCCGGTAAGATGATTGCTTAGCACTTTTTATGCTTGTAAGCTATTTCTTATACAAGCTCCTTTTCTTTAAAGAAAATAAATAATTAAATTCTGTCCTTTTGATGCCCCTTACGGTATCTATTTTTTATGCTTATTTTTAATGTATAAATCTGTCCCCTCTGATTTATTTGAGGATGATTATGTGCTGCCCGTACAAACTCTTGGGGTAAGGAGTAATTGATTAGTCACGCAGAGGACACATTATTCTAAAAGCTCATTTTGCACATGTTTTGCACGCTTGGGCACTAGCAGGTCTTGGCAGCACTCACATGTCTGGAATGGGTAATTCTGCTTACAACACGATTGGTTATCCATCTGCACAAAGGTCATCTGCACTACTTAGCACATCTTTATTATTGAAATGGTTAATTACAAGCTTATGTGAAGCTTATACATTATTGTCAACAGTCATCACTAGTAGAGTGTCCAAGGGCTTCAGTAAAAAGCTCTTACGATATGCCCTGACTCCGGTGTTAGGTTTTTAACTTTTGTATACTTGCTTCGTTATGTTTCTTCTTCTAAGTTCTAACTAACTAGTACTTCCTATCATCAGCCATGGAAGCTTTGATAACAAAACTGTAAATGATACAGTGGCTGCAAATGTGGTAAGGCCCCTTTTCTGTTACAATCCATGATTCTTCAGTCTCTATAAGCTGCCATACATCCTCTCTTTCTGTCCCAAAAGCATTATATGACCAATAATTGAATTACTTATTCTAATATAACTAGCAAGTAATTTTTGCACTTAATTATATAGCAAATTGGGATAACAATTCACATCATTAGCTAAAATAACTTTAATTTTCATTTTCTGTCCAGAAGAGTGGCTTTGAGTGTAGATAGGTTGTGGGAGGATGTGTGGATATGGACACCCTTTAATGCCTTCCCATGGGTTCATAGTTCCAAATTGTTTTCTCCGAATTACCAAAAGTTGAATTCTTTCTTGTAGGTCGCCTTTTATTCCACACTTCTCTTTTGGTCTTTTTATGGGTTTCCTTTTGTACGTTCAGTGAAAACTGAAAAGTTAATTTTCCCATTACAAAATAATTATAACTATATAGGCAGGCTTAACTTATTGAGAACAAGCATAGGGGGAAATTTATTTACCTTGAACTTCTACAATAATATTTGCACGTATTTTTTTGGTTTTTTATCGTTTCTTTTCGAAGGGAGGGAATTCTGGGAATAAGGCTGCAACCATATTAAATAAAGGAAAAAGCTTCAACCATAATAAACTGGTGTATTTGAAGATTTTCCCATGAACAGATAATGTATTAGTGTATGCATTGCTTTACAATTCTGCCATGACATTTTTTATATTTAGGCATAAATAAAGTCACCTTGTGTTATAGCAAAAGTATTTGATTTATGTGTGAATTACCATTCGTAGCACAATGTATTTGGTGAATCCTGTTAACTTCGGGTGAATTATTCTTCATTTCCAGATTCCTCAAAATCCCCCATGTCCAACACTTTTTGTGGCAAATCTAGGGCCAAGTTGCACCGAGCAAGAGCTTATTCAAATTTTTTCAAGGTTGGTAATTTTACTGCACGTTATTTAGGTCATGTTCTGAACATCTAAGAACTTGCATTATCTGTTTATTTGATAATTCCTACCAGATGCCCGGGCTTCTTAAAACTAAAGATGCAGAGCACATATGGGGCCCCAGTTGCTTTTGTTGATTTTCAGGTTAGCAGAGCCCTTCCTGTTCTCAATATTATTTTTCTGTAGTTGATAAATCTTTTCCTACTTGGCATTGAAATTCATGAATCAGACTCCATTCTGTGGCTTTTTTCCGTTAATTATGTGCAGACTATCTGCACTACCGACATTATCACCCTTGTGGCTTTATATTCTTATGGATTTTTATATCCTTGACCATTCACAAGGCTTGTTGTTTTTAGATACTCCAGGGAACTGTTTTAATGGTGTAAATAATATTTGGAAGTTGCCTTCTGGTGTGTTGCTTAAATTGTTACATGTTCAGGCCATAAAATCTTTCACATAGTAGGGATTTAAAGCGACACTAAAGTTTTTTTAGATGGAAAAGGTATCTGGGTCTTCCATTGTTATGTTGTGTGAAATGTGAAGTTGCAATGTGTGCAGGACACTGCCTGTTCAACTGGAGCTCTGAACCATCTGCAAGGCTCAATTCTGTACTCATCACCTCCTGGGGAGGGCATGCGATTGGAGTATCCTTTTACAAGATTTTTATTCTTTACTATTGTACTGCTACATCATGTCTTCAAATGTTTTGCTTAA

mRNA sequence

TGTGAGCCCATTTGAGCGACGATGCTTTTATCAGAATTGAAGCAATCCGACCCAAAATTCAGGGCGTCCAGCTCTCTCTCTCTCGCTACTTCGTCAGAGACTCGGAGAGAACAGCACTCCGATGGACGACATGGCGAGCTACTATCCGCAACCGCAACCACCGCCCCAGCCTTCTGGTCTTGAACCTCCTCACTATCCCTATTACCAGGTGCCTCCTCCTCCTCCTTCAGCACCGCCGTCTCAGCATTACCTCTCTCAACACCCGCCCACCTTCGCTTCCTATGGCTTACCTTTTTTACCTCACGCAGCCTCCATCAATGAGGTCCGAACCCTATTCATAGCTGGCCTTCCTGAAGATGTCAAGCCCCGAGAAATTTACAATCTCTTCCGGGAGTTTCCTGGATACGAGTCCTCTCATCTTCGGACCCCCACGCAGACGACCCAGCCATTTGCATTTGCTGTATTCTCGGACCAGCAGTCTGCCGTTGGTGCAATGCATGCTGTAAATGGCATGGTGTTTGATCTTGAGAAGCAGTCAGTACTGTTTGTTGATTTGGCTAAATCCAATTCAAGATCAAAACGGACGAGGACAGAGGATGAAAGATATGGATCGGATAAGAAAGCTAAAGTATCCATCTTTTCAAGGAGTACTCCTGATCCCGGTCTTGGCAGCACTCACATGTCTGGAATGGGTAATTCTGCTTACAACACGATTGGTTATCCATCTGCACAAAGCCATGGAAGCTTTGATAACAAAACTGTAAATGATACAGTGGCTGCAAATGTGATTCCTCAAAATCCCCCATGTCCAACACTTTTTGTGGCAAATCTAGGGCCAAGTTGCACCGAGCAAGAGCTTATTCAAATTTTTTCAAGATGCCCGGGCTTCTTAAAACTAAAGATGCAGAGCACATATGGGGCCCCAGTTGCTTTTGTTGATTTTCAGGACACTGCCTGTTCAACTGGAGCTCTGAACCATCTGCAAGGCTCAATTCTGTACTCATCACCTCCTGGGGAGGGCATGCGATTGGAGTATCCTTTTACAAGATTTTTATTCTTTACTATTGTACTGCTACATCATGTCTTCAAATGTTTTGCTTAA

Coding sequence (CDS)

ATGGACGACATGGCGAGCTACTATCCGCAACCGCAACCACCGCCCCAGCCTTCTGGTCTTGAACCTCCTCACTATCCCTATTACCAGGTGCCTCCTCCTCCTCCTTCAGCACCGCCGTCTCAGCATTACCTCTCTCAACACCCGCCCACCTTCGCTTCCTATGGCTTACCTTTTTTACCTCACGCAGCCTCCATCAATGAGGTCCGAACCCTATTCATAGCTGGCCTTCCTGAAGATGTCAAGCCCCGAGAAATTTACAATCTCTTCCGGGAGTTTCCTGGATACGAGTCCTCTCATCTTCGGACCCCCACGCAGACGACCCAGCCATTTGCATTTGCTGTATTCTCGGACCAGCAGTCTGCCGTTGGTGCAATGCATGCTGTAAATGGCATGGTGTTTGATCTTGAGAAGCAGTCAGTACTGTTTGTTGATTTGGCTAAATCCAATTCAAGATCAAAACGGACGAGGACAGAGGATGAAAGATATGGATCGGATAAGAAAGCTAAAGTATCCATCTTTTCAAGGAGTACTCCTGATCCCGGTCTTGGCAGCACTCACATGTCTGGAATGGGTAATTCTGCTTACAACACGATTGGTTATCCATCTGCACAAAGCCATGGAAGCTTTGATAACAAAACTGTAAATGATACAGTGGCTGCAAATGTGATTCCTCAAAATCCCCCATGTCCAACACTTTTTGTGGCAAATCTAGGGCCAAGTTGCACCGAGCAAGAGCTTATTCAAATTTTTTCAAGATGCCCGGGCTTCTTAAAACTAAAGATGCAGAGCACATATGGGGCCCCAGTTGCTTTTGTTGATTTTCAGGACACTGCCTGTTCAACTGGAGCTCTGAACCATCTGCAAGGCTCAATTCTGTACTCATCACCTCCTGGGGAGGGCATGCGATTGGAGTATCCTTTTACAAGATTTTTATTCTTTACTATTGTACTGCTACATCATGTCTTCAAATGTTTTGCTTAA

Protein sequence

MDDMASYYPQPQPPPQPSGLEPPHYPYYQVPPPPPSAPPSQHYLSQHPPTFASYGLPFLPHAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQSVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRSTPDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGEGMRLEYPFTRFLFFTIVLLHHVFKCFA
Homology
BLAST of Lsi01G009630 vs. ExPASy Swiss-Prot
Match: Q8VC52 (RNA-binding protein with multiple splicing 2 OS=Mus musculus OX=10090 GN=Rbpms2 PE=1 SV=1)

HSP 1 Score: 80.9 bits (198), Expect = 3.0e-14
Identity = 39/90 (43.33%), Postives = 60/90 (66.67%), Query Frame = 0

Query: 67  EVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMH 126
           EVRTLF++GLP D+KPRE+Y LFR F GYE S ++  ++  QP  F +F  +  A  A +
Sbjct: 23  EVRTLFVSGLPVDIKPRELYLLFRPFKGYEGSLIKLTSR--QPVGFVIFDSRAGAEAAKN 82

Query: 127 AVNGMVFDLEKQSVLFVDLAKSNSRSKRTR 157
           A+NG+ FD E    L ++ AK+N++  +++
Sbjct: 83  ALNGIRFDPENPQTLRLEFAKANTKMAKSK 110

BLAST of Lsi01G009630 vs. ExPASy Swiss-Prot
Match: Q6ZRY4 (RNA-binding protein with multiple splicing 2 OS=Homo sapiens OX=9606 GN=RBPMS2 PE=1 SV=1)

HSP 1 Score: 80.5 bits (197), Expect = 3.9e-14
Identity = 39/90 (43.33%), Postives = 59/90 (65.56%), Query Frame = 0

Query: 67  EVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMH 126
           EVRTLF++GLP D+KPRE+Y LFR F GYE S ++   +  QP  F +F  +  A  A +
Sbjct: 29  EVRTLFVSGLPVDIKPRELYLLFRPFKGYEGSLIKLTAR--QPVGFVIFDSRAGAEAAKN 88

Query: 127 AVNGMVFDLEKQSVLFVDLAKSNSRSKRTR 157
           A+NG+ FD E    L ++ AK+N++  +++
Sbjct: 89  ALNGIRFDPENPQTLRLEFAKANTKMAKSK 116

BLAST of Lsi01G009630 vs. ExPASy Swiss-Prot
Match: Q6DH13 (RNA-binding protein, mRNA-processing factor 2a OS=Danio rerio OX=7955 GN=rbpms2a PE=1 SV=1)

HSP 1 Score: 80.1 bits (196), Expect = 5.1e-14
Identity = 39/90 (43.33%), Postives = 59/90 (65.56%), Query Frame = 0

Query: 67  EVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMH 126
           EVRTLF++GLP D+KPRE+Y LFR F GYE S ++  ++  QP  F  F  +  A  A +
Sbjct: 18  EVRTLFVSGLPVDIKPRELYLLFRPFKGYEGSLIKLTSK--QPVGFVTFDSRSGAEAAKN 77

Query: 127 AVNGMVFDLEKQSVLFVDLAKSNSRSKRTR 157
           A+NG+ FD E    L ++ AK+N++  +++
Sbjct: 78  ALNGIRFDPESPQTLRLEFAKANTKMAKSK 105

BLAST of Lsi01G009630 vs. ExPASy Swiss-Prot
Match: Q9W6I1 (RNA-binding protein with multiple splicing 2 OS=Gallus gallus OX=9031 GN=RBPMS2 PE=1 SV=1)

HSP 1 Score: 79.7 bits (195), Expect = 6.7e-14
Identity = 39/90 (43.33%), Postives = 59/90 (65.56%), Query Frame = 0

Query: 67  EVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMH 126
           EVRTLF++GLP D+KPRE+Y LFR F GYE S ++  ++  QP  F  F  +  A  A +
Sbjct: 20  EVRTLFVSGLPVDIKPRELYLLFRPFKGYEGSLIKLTSK--QPVGFVTFDSRAGAEAAKN 79

Query: 127 AVNGMVFDLEKQSVLFVDLAKSNSRSKRTR 157
           A+NG+ FD E    L ++ AK+N++  +++
Sbjct: 80  ALNGIRFDPENPQTLRLEFAKANTKMAKSK 107

BLAST of Lsi01G009630 vs. ExPASy Swiss-Prot
Match: Q9YGP5 (RNA-binding protein with multiple splicing 2 OS=Xenopus laevis OX=8355 GN=rbpms2 PE=2 SV=1)

HSP 1 Score: 79.3 bits (194), Expect = 8.7e-14
Identity = 39/90 (43.33%), Postives = 59/90 (65.56%), Query Frame = 0

Query: 67  EVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMH 126
           EVRTLF++GLP D+KPRE+Y LFR F GYE S ++  ++  QP  F  F ++  A  A +
Sbjct: 18  EVRTLFVSGLPIDIKPRELYLLFRPFKGYEGSLIKLTSK--QPVGFVTFDNRAGAEAAKN 77

Query: 127 AVNGMVFDLEKQSVLFVDLAKSNSRSKRTR 157
           A+NG+ FD E    L ++ AK+N++  + +
Sbjct: 78  ALNGIRFDPENPQTLRLEFAKANTKMAKNK 105

BLAST of Lsi01G009630 vs. ExPASy TrEMBL
Match: A0A1S3B1E9 (protein WHI4 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103484791 PE=4 SV=1)

HSP 1 Score: 589.0 bits (1517), Expect = 1.2e-164
Identity = 295/309 (95.47%), Postives = 297/309 (96.12%), Query Frame = 0

Query: 1   MDDMASYYPQPQPPPQPSGLEPPHYPYYQVPPPPPSAPPSQHYLSQHPPTFASYGLPFLP 60
           MDDM SYYP PQPPPQPSGLEP HYPYYQV PPPPSAPPSQHYLSQHPPTFASYGLP LP
Sbjct: 40  MDDMTSYYPPPQPPPQPSGLEPSHYPYYQV-PPPPSAPPSQHYLSQHPPTFASYGLPLLP 99

Query: 61  HAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS 120
           H  SINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS
Sbjct: 100 HTTSINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS 159

Query: 121 AVGAMHAVNGMVFDLEKQSVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRSTPDP 180
           AVGAMHAVNGMVFDLEKQSVL+VDLAKSNSRSKRTRTEDERYGSDKKAKVSI SRSTPDP
Sbjct: 160 AVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSIISRSTPDP 219

Query: 181 GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPS 240
           GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGP 
Sbjct: 220 GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPG 279

Query: 241 CTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGEG 300
           CTEQELIQIF RCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGEG
Sbjct: 280 CTEQELIQIFLRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGEG 339

Query: 301 MRLEYPFTR 310
           MRLEY  +R
Sbjct: 340 MRLEYAKSR 347

BLAST of Lsi01G009630 vs. ExPASy TrEMBL
Match: A0A6J1GD02 (protein WHI4 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111453048 PE=4 SV=1)

HSP 1 Score: 585.1 bits (1507), Expect = 1.8e-163
Identity = 294/312 (94.23%), Postives = 301/312 (96.47%), Query Frame = 0

Query: 1   MDDMASYY---PQPQPPPQPSGLEPPHYPYYQVPPPPPSAPPSQHYLSQHPPTFASYGLP 60
           MDD+ SYY   PQPQPP QPSGLE PHYPYYQVPPPP SAP SQHYL+QHPPTFASYGLP
Sbjct: 1   MDDITSYYAPPPQPQPPSQPSGLEAPHYPYYQVPPPPSSAPSSQHYLAQHPPTFASYGLP 60

Query: 61  FLPHAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSD 120
           FLPHAASINEVRTLFIAGLP+DVKPREIYNLFREFPGYESSHLR+PTQTTQPFAFAVFSD
Sbjct: 61  FLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSD 120

Query: 121 QQSAVGAMHAVNGMVFDLEKQSVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRST 180
           QQSAVGAMHAVNGMVFDLEKQSVL+VDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRST
Sbjct: 121 QQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRST 180

Query: 181 PDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANL 240
           PDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFD+KTVNDTVAANV PQNPPCPTLFVANL
Sbjct: 181 PDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDSKTVNDTVAANVTPQNPPCPTLFVANL 240

Query: 241 GPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPP 300
           GPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPP
Sbjct: 241 GPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPP 300

Query: 301 GEGMRLEYPFTR 310
           GEGMRLEY  +R
Sbjct: 301 GEGMRLEYAKSR 312

BLAST of Lsi01G009630 vs. ExPASy TrEMBL
Match: A0A5D3CNE3 (U1 small nuclear ribonucleoprotein A isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G004640 PE=4 SV=1)

HSP 1 Score: 584.3 bits (1505), Expect = 3.1e-163
Identity = 292/304 (96.05%), Postives = 294/304 (96.71%), Query Frame = 0

Query: 1   MDDMASYYPQPQPPPQPSGLEPPHYPYYQVPPPPPSAPPSQHYLSQHPPTFASYGLPFLP 60
           MDDM SYYP PQPPPQPSGLEP HYPYYQV PPPPSAPPSQHYLSQHPPTFASYGLP LP
Sbjct: 41  MDDMTSYYPPPQPPPQPSGLEPSHYPYYQV-PPPPSAPPSQHYLSQHPPTFASYGLPLLP 100

Query: 61  HAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS 120
           H  SINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS
Sbjct: 101 HTTSINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS 160

Query: 121 AVGAMHAVNGMVFDLEKQSVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRSTPDP 180
           AVGAMHAVNGMVFDLEKQSVL+VDLAKSNSRSKRTRTEDERYGSDKKAKVSI SRSTPDP
Sbjct: 161 AVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSIISRSTPDP 220

Query: 181 GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPS 240
           GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGP 
Sbjct: 221 GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPG 280

Query: 241 CTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGEG 300
           CTEQELIQIF RCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGEG
Sbjct: 281 CTEQELIQIFLRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGEG 340

Query: 301 MRLE 305
           MRL+
Sbjct: 341 MRLD 343

BLAST of Lsi01G009630 vs. ExPASy TrEMBL
Match: A0A1S4DUH9 (U1 small nuclear ribonucleoprotein A isoform X1 OS=Cucumis melo OX=3656 GN=LOC103484791 PE=4 SV=1)

HSP 1 Score: 584.3 bits (1505), Expect = 3.1e-163
Identity = 295/310 (95.16%), Postives = 297/310 (95.81%), Query Frame = 0

Query: 1   MDDMASYYPQPQPPPQPSGLEPPHYPYYQVPPPPPSAPPSQHYLSQHPPTFASYGLPFLP 60
           MDDM SYYP PQPPPQPSGLEP HYPYYQV PPPPSAPPSQHYLSQHPPTFASYGLP LP
Sbjct: 40  MDDMTSYYPPPQPPPQPSGLEPSHYPYYQV-PPPPSAPPSQHYLSQHPPTFASYGLPLLP 99

Query: 61  HAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS 120
           H  SINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS
Sbjct: 100 HTTSINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS 159

Query: 121 AVGAMHAVNGMVFDLEKQSVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRSTPDP 180
           AVGAMHAVNGMVFDLEKQSVL+VDLAKSNSRSKRTRTEDERYGSDKKAKVSI SRSTPDP
Sbjct: 160 AVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSIISRSTPDP 219

Query: 181 -GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGP 240
            GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGP
Sbjct: 220 AGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGP 279

Query: 241 SCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGE 300
            CTEQELIQIF RCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGE
Sbjct: 280 GCTEQELIQIFLRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGE 339

Query: 301 GMRLEYPFTR 310
           GMRLEY  +R
Sbjct: 340 GMRLEYAKSR 348

BLAST of Lsi01G009630 vs. ExPASy TrEMBL
Match: A0A6J1IQG5 (protein WHI4 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111477875 PE=4 SV=1)

HSP 1 Score: 583.9 bits (1504), Expect = 4.0e-163
Identity = 294/312 (94.23%), Postives = 299/312 (95.83%), Query Frame = 0

Query: 1   MDDMASYY---PQPQPPPQPSGLEPPHYPYYQVPPPPPSAPPSQHYLSQHPPTFASYGLP 60
           MDD+ SYY   PQPQPP QPSGLE PHYPYYQVPPPP SAP SQHYL+QHPPTFASYGLP
Sbjct: 1   MDDITSYYAPPPQPQPPSQPSGLEAPHYPYYQVPPPPSSAPSSQHYLAQHPPTFASYGLP 60

Query: 61  FLPHAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSD 120
           FLPHAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLR+PTQTTQPFAFAVFSD
Sbjct: 61  FLPHAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSD 120

Query: 121 QQSAVGAMHAVNGMVFDLEKQSVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRST 180
           QQSAVGAMHAVNGMVFDLEKQSVL+VDLAKSNSRSKR RTEDERYGSDKKAKVSIFSR T
Sbjct: 121 QQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRMRTEDERYGSDKKAKVSIFSRRT 180

Query: 181 PDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANL 240
           PDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANV PQNPPCPTLFVANL
Sbjct: 181 PDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVTPQNPPCPTLFVANL 240

Query: 241 GPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPP 300
           GPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPP
Sbjct: 241 GPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPP 300

Query: 301 GEGMRLEYPFTR 310
           GEGMRLEY  +R
Sbjct: 301 GEGMRLEYAKSR 312

BLAST of Lsi01G009630 vs. NCBI nr
Match: XP_038881694.1 (protein WHI4 isoform X2 [Benincasa hispida])

HSP 1 Score: 607.1 bits (1564), Expect = 9.2e-170
Identity = 301/309 (97.41%), Postives = 305/309 (98.71%), Query Frame = 0

Query: 1   MDDMASYYPQPQPPPQPSGLEPPHYPYYQVPPPPPSAPPSQHYLSQHPPTFASYGLPFLP 60
           MDDMASYYPQPQPPPQPSGLEPPHYP+YQVPPPP SAPPSQHYLSQHPPTFASYGLPFLP
Sbjct: 1   MDDMASYYPQPQPPPQPSGLEPPHYPFYQVPPPPSSAPPSQHYLSQHPPTFASYGLPFLP 60

Query: 61  HAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS 120
           HAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS
Sbjct: 61  HAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS 120

Query: 121 AVGAMHAVNGMVFDLEKQSVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRSTPDP 180
           A+GAMHAVNGMVFDLEKQSVL+VDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRSTPDP
Sbjct: 121 AIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRSTPDP 180

Query: 181 GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPS 240
           GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPS
Sbjct: 181 GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPS 240

Query: 241 CTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGEG 300
           CTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDT CSTGALNHLQGSILYSSPPGEG
Sbjct: 241 CTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTGCSTGALNHLQGSILYSSPPGEG 300

Query: 301 MRLEYPFTR 310
           MRLEY  +R
Sbjct: 301 MRLEYAKSR 309

BLAST of Lsi01G009630 vs. NCBI nr
Match: XP_038881693.1 (protein WHI4 isoform X1 [Benincasa hispida])

HSP 1 Score: 602.4 bits (1552), Expect = 2.3e-168
Identity = 301/310 (97.10%), Postives = 305/310 (98.39%), Query Frame = 0

Query: 1   MDDMASYYPQPQPPPQPSGLEPPHYPYYQVPPPPPSAPPSQHYLSQHPPTFASYGLPFLP 60
           MDDMASYYPQPQPPPQPSGLEPPHYP+YQVPPPP SAPPSQHYLSQHPPTFASYGLPFLP
Sbjct: 1   MDDMASYYPQPQPPPQPSGLEPPHYPFYQVPPPPSSAPPSQHYLSQHPPTFASYGLPFLP 60

Query: 61  HAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS 120
           HAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS
Sbjct: 61  HAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS 120

Query: 121 AVGAMHAVNGMVFDLEKQSVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRSTPDP 180
           A+GAMHAVNGMVFDLEKQSVL+VDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRSTPDP
Sbjct: 121 AIGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRSTPDP 180

Query: 181 -GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGP 240
            GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGP
Sbjct: 181 AGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGP 240

Query: 241 SCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGE 300
           SCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDT CSTGALNHLQGSILYSSPPGE
Sbjct: 241 SCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTGCSTGALNHLQGSILYSSPPGE 300

Query: 301 GMRLEYPFTR 310
           GMRLEY  +R
Sbjct: 301 GMRLEYAKSR 310

BLAST of Lsi01G009630 vs. NCBI nr
Match: XP_008440306.2 (PREDICTED: protein WHI4 isoform X2 [Cucumis melo])

HSP 1 Score: 589.0 bits (1517), Expect = 2.6e-164
Identity = 295/309 (95.47%), Postives = 297/309 (96.12%), Query Frame = 0

Query: 1   MDDMASYYPQPQPPPQPSGLEPPHYPYYQVPPPPPSAPPSQHYLSQHPPTFASYGLPFLP 60
           MDDM SYYP PQPPPQPSGLEP HYPYYQV PPPPSAPPSQHYLSQHPPTFASYGLP LP
Sbjct: 40  MDDMTSYYPPPQPPPQPSGLEPSHYPYYQV-PPPPSAPPSQHYLSQHPPTFASYGLPLLP 99

Query: 61  HAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS 120
           H  SINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS
Sbjct: 100 HTTSINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQS 159

Query: 121 AVGAMHAVNGMVFDLEKQSVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRSTPDP 180
           AVGAMHAVNGMVFDLEKQSVL+VDLAKSNSRSKRTRTEDERYGSDKKAKVSI SRSTPDP
Sbjct: 160 AVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSIISRSTPDP 219

Query: 181 GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPS 240
           GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGP 
Sbjct: 220 GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANLGPG 279

Query: 241 CTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGEG 300
           CTEQELIQIF RCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGEG
Sbjct: 280 CTEQELIQIFLRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPPGEG 339

Query: 301 MRLEYPFTR 310
           MRLEY  +R
Sbjct: 340 MRLEYAKSR 347

BLAST of Lsi01G009630 vs. NCBI nr
Match: XP_023543723.1 (protein WHI4 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 585.9 bits (1509), Expect = 2.2e-163
Identity = 294/312 (94.23%), Postives = 301/312 (96.47%), Query Frame = 0

Query: 1   MDDMASYY---PQPQPPPQPSGLEPPHYPYYQVPPPPPSAPPSQHYLSQHPPTFASYGLP 60
           MDD+ SYY   PQPQPP QPSGLE PHYPYYQVPPPP SAP SQHYL+QHPPTFASYGLP
Sbjct: 1   MDDITSYYAPPPQPQPPSQPSGLEAPHYPYYQVPPPPSSAPSSQHYLAQHPPTFASYGLP 60

Query: 61  FLPHAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSD 120
           FLPHAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLR+PTQTTQPFAFAVFSD
Sbjct: 61  FLPHAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSD 120

Query: 121 QQSAVGAMHAVNGMVFDLEKQSVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRST 180
           QQSAVGAMHAVNGMVFDLEKQSVL+VDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRST
Sbjct: 121 QQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRST 180

Query: 181 PDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANL 240
           PDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFD+KTVNDTVAANV PQNPPCPTLFVANL
Sbjct: 181 PDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDSKTVNDTVAANVTPQNPPCPTLFVANL 240

Query: 241 GPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPP 300
           GPSCTEQEL+QIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPP
Sbjct: 241 GPSCTEQELVQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPP 300

Query: 301 GEGMRLEYPFTR 310
           GEGMRLEY  +R
Sbjct: 301 GEGMRLEYAKSR 312

BLAST of Lsi01G009630 vs. NCBI nr
Match: XP_022949757.1 (protein WHI4 isoform X2 [Cucurbita moschata])

HSP 1 Score: 585.1 bits (1507), Expect = 3.7e-163
Identity = 294/312 (94.23%), Postives = 301/312 (96.47%), Query Frame = 0

Query: 1   MDDMASYY---PQPQPPPQPSGLEPPHYPYYQVPPPPPSAPPSQHYLSQHPPTFASYGLP 60
           MDD+ SYY   PQPQPP QPSGLE PHYPYYQVPPPP SAP SQHYL+QHPPTFASYGLP
Sbjct: 1   MDDITSYYAPPPQPQPPSQPSGLEAPHYPYYQVPPPPSSAPSSQHYLAQHPPTFASYGLP 60

Query: 61  FLPHAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSD 120
           FLPHAASINEVRTLFIAGLP+DVKPREIYNLFREFPGYESSHLR+PTQTTQPFAFAVFSD
Sbjct: 61  FLPHAASINEVRTLFIAGLPDDVKPREIYNLFREFPGYESSHLRSPTQTTQPFAFAVFSD 120

Query: 121 QQSAVGAMHAVNGMVFDLEKQSVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRST 180
           QQSAVGAMHAVNGMVFDLEKQSVL+VDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRST
Sbjct: 121 QQSAVGAMHAVNGMVFDLEKQSVLYVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRST 180

Query: 181 PDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVANL 240
           PDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFD+KTVNDTVAANV PQNPPCPTLFVANL
Sbjct: 181 PDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDSKTVNDTVAANVTPQNPPCPTLFVANL 240

Query: 241 GPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPP 300
           GPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPP
Sbjct: 241 GPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPP 300

Query: 301 GEGMRLEYPFTR 310
           GEGMRLEY  +R
Sbjct: 301 GEGMRLEYAKSR 312

BLAST of Lsi01G009630 vs. TAIR 10
Match: AT2G42240.2 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 288.1 bits (736), Expect = 8.8e-78
Identity = 176/326 (53.99%), Postives = 212/326 (65.03%), Query Frame = 0

Query: 1   MDDMASYYPQPQPPPQPSGLEPPHYPYYQVPPPPPSAPP----SQHYLSQHPPTFASYGL 60
           MDD+ +YY     P               VPPPPP   P    S H  S + PT  S G 
Sbjct: 1   MDDLEAYYSHYNLPA-------------MVPPPPPGVSPIPITSAH--SVYLPTHVSIG- 60

Query: 61  PFLPHAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFS 120
                  + +EVRTLF+AGLPEDVKPREIYNLFREFPGYE+SHLR+ +   +PFAFAVFS
Sbjct: 61  -------ARDEVRTLFVAGLPEDVKPREIYNLFREFPGYETSHLRS-SDGAKPFAFAVFS 120

Query: 121 DQQSAVGAMHAVNGMVFDLEKQSVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRS 180
           D QSAV  MHA+NGMVFDLEK S L +DLAKSN +SKR+RT+D   G +   K+  ++ +
Sbjct: 121 DLQSAVAVMHALNGMVFDLEKHSTLHIDLAKSNPKSKRSRTDD---GWESLKKLKSWN-T 180

Query: 181 TPDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVAN 240
           T + G GS    GM +SAYNTIGY  AQS G   N       +        PCPTLF+AN
Sbjct: 181 TTESGFGSFQTPGMSSSAYNTIGYSPAQSQG-IANVAGRAPTSRKPSKAADPCPTLFIAN 240

Query: 241 LGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSP 300
           +GP+CTE ELIQ+FSRC GFLKLK+Q TYG PVAFVDFQD +CS+ AL+ LQG++LYSS 
Sbjct: 241 MGPNCTEAELIQVFSRCRGFLKLKIQGTYGTPVAFVDFQDVSCSSEALHTLQGTVLYSSL 297

Query: 301 PGEGMRLEYPF---TRFLFFTIVLLH 320
            GE +RL+YP      FL F +V LH
Sbjct: 301 TGEVLRLQYPSLLPILFLCFLLVGLH 297

BLAST of Lsi01G009630 vs. TAIR 10
Match: AT2G42240.1 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 285.0 bits (728), Expect = 7.4e-77
Identity = 170/313 (54.31%), Postives = 206/313 (65.81%), Query Frame = 0

Query: 1   MDDMASYYPQPQPPPQPSGLEPPHYPYYQVPPPPPSAPP----SQHYLSQHPPTFASYGL 60
           MDD+ +YY     P               VPPPPP   P    S H  S + PT  S G 
Sbjct: 1   MDDLEAYYSHYNLPA-------------MVPPPPPGVSPIPITSAH--SVYLPTHVSIG- 60

Query: 61  PFLPHAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFS 120
                  + +EVRTLF+AGLPEDVKPREIYNLFREFPGYE+SHLR+ +   +PFAFAVFS
Sbjct: 61  -------ARDEVRTLFVAGLPEDVKPREIYNLFREFPGYETSHLRS-SDGAKPFAFAVFS 120

Query: 121 DQQSAVGAMHAVNGMVFDLEKQSVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRS 180
           D QSAV  MHA+NGMVFDLEK S L +DLAKSN +SKR+RT+D   G +   K+  ++ +
Sbjct: 121 DLQSAVAVMHALNGMVFDLEKHSTLHIDLAKSNPKSKRSRTDD---GWESLKKLKSWN-T 180

Query: 181 TPDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVAN 240
           T + G GS    GM +SAYNTIGY  AQS G   N       +        PCPTLF+AN
Sbjct: 181 TTESGFGSFQTPGMSSSAYNTIGYSPAQSQG-IANVAGRAPTSRKPSKAADPCPTLFIAN 240

Query: 241 LGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSP 300
           +GP+CTE ELIQ+FSRC GFLKLK+Q TYG PVAFVDFQD +CS+ AL+ LQG++LYSS 
Sbjct: 241 MGPNCTEAELIQVFSRCRGFLKLKIQGTYGTPVAFVDFQDVSCSSEALHTLQGTVLYSSL 284

Query: 301 PGEGMRLEYPFTR 310
            GE +RL+Y  +R
Sbjct: 301 TGEVLRLQYARSR 284

BLAST of Lsi01G009630 vs. TAIR 10
Match: AT2G42240.3 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 281.6 bits (719), Expect = 8.2e-76
Identity = 168/308 (54.55%), Postives = 203/308 (65.91%), Query Frame = 0

Query: 1   MDDMASYYPQPQPPPQPSGLEPPHYPYYQVPPPPPSAPP----SQHYLSQHPPTFASYGL 60
           MDD+ +YY     P               VPPPPP   P    S H  S + PT  S G 
Sbjct: 1   MDDLEAYYSHYNLPA-------------MVPPPPPGVSPIPITSAH--SVYLPTHVSIG- 60

Query: 61  PFLPHAASINEVRTLFIAGLPEDVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFS 120
                  + +EVRTLF+AGLPEDVKPREIYNLFREFPGYE+SHLR+ +   +PFAFAVFS
Sbjct: 61  -------ARDEVRTLFVAGLPEDVKPREIYNLFREFPGYETSHLRS-SDGAKPFAFAVFS 120

Query: 121 DQQSAVGAMHAVNGMVFDLEKQSVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRS 180
           D QSAV  MHA+NGMVFDLEK S L +DLAKSN +SKR+RT+D   G +   K+  ++ +
Sbjct: 121 DLQSAVAVMHALNGMVFDLEKHSTLHIDLAKSNPKSKRSRTDD---GWESLKKLKSWN-T 180

Query: 181 TPDPGLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTVAANVIPQNPPCPTLFVAN 240
           T + G GS    GM +SAYNTIGY  AQS G   N       +        PCPTLF+AN
Sbjct: 181 TTESGFGSFQTPGMSSSAYNTIGYSPAQSQG-IANVAGRAPTSRKPSKAADPCPTLFIAN 240

Query: 241 LGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSP 300
           +GP+CTE ELIQ+FSRC GFLKLK+Q TYG PVAFVDFQD +CS+ AL+ LQG++LYSS 
Sbjct: 241 MGPNCTEAELIQVFSRCRGFLKLKIQGTYGTPVAFVDFQDVSCSSEALHTLQGTVLYSSL 279

Query: 301 PGEGMRLE 305
            GE +RL+
Sbjct: 301 TGEVLRLQ 279

BLAST of Lsi01G009630 vs. TAIR 10
Match: AT3G13700.2 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 144.4 bits (363), Expect = 1.6e-34
Identity = 110/308 (35.71%), Postives = 151/308 (49.03%), Query Frame = 0

Query: 24  HYPY-----YQVPPPPPSAPPSQHYLSQHPPTFASYGLPFLPHAASINEVRTLFIAGLPE 83
           H PY     YQ+   P   PP    L+  P                   + TLF++GLP 
Sbjct: 4   HQPYDPFYVYQLHSHPHHLPPQLPLLADEP-----------------GAINTLFVSGLPN 63

Query: 84  DVKPREIYNLFREFPGYESSHLRTPTQTTQPFAFAVFSDQQSAVGAMHAVNGMVFDLEKQ 143
           DVK REI+NLFR   G+ES  L+   +  Q  AFA F+  + A+ AM+ +NG+ FD +  
Sbjct: 64  DVKAREIHNLFRRRHGFESCQLKYTGRGDQVVAFATFTSHRFALAAMNELNGVKFDPQTG 123

Query: 144 SVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRSTPDPGLGSTHMSGMGNSAYNTI 203
           S L ++LA+SNSR K      ER GS       +      +        S  G+S  + +
Sbjct: 124 SNLHIELARSNSRRK------ERPGS---GPYVVIDNRNKEISKSQDDQSDEGDSDPDEV 183

Query: 204 GYPSAQSHGSFDNKTVNDTV--AANVIPQNP-------------------PCPTLFVANL 263
                Q  G+ D+   NDT    A+  P +                     C TLF+ANL
Sbjct: 184 -----QEPGNSDSPKENDTTKSEADSEPDSKAPSANGHLEKASEGGSGARACSTLFIANL 243

Query: 264 GPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTACSTGALNHLQGSILYSSPP 306
           GP+CTE EL Q+ SR PGF  LK+++  G PVAF DF++   +T A+NHLQG++L SS  
Sbjct: 244 GPNCTEDELKQLLSRYPGFHILKIRARGGMPVAFADFEEIEQATDAMNHLQGNLLSSSDR 279

BLAST of Lsi01G009630 vs. TAIR 10
Match: AT3G13700.1 (RNA-binding (RRM/RBD/RNP motifs) family protein )

HSP 1 Score: 131.0 bits (328), Expect = 1.8e-30
Identity = 109/325 (33.54%), Postives = 149/325 (45.85%), Query Frame = 0

Query: 24  HYPY-----YQVPPPPPSAPPSQHYLSQHPPTFASYGLPFLPHAASINEVRTLFIAGLPE 83
           H PY     YQ+   P   PP    L+  P                   + TLF++GLP 
Sbjct: 4   HQPYDPFYVYQLHSHPHHLPPQLPLLADEP-----------------GAINTLFVSGLPN 63

Query: 84  DVKPREIYNLFREFPGYESSHLR------------------TPTQTTQPFAFAVFSDQQS 143
           DVK REI+NLFR   G+ES  L+                   P       AFA F+  + 
Sbjct: 64  DVKAREIHNLFRRRHGFESCQLKYTGRGDQVCKNLQFLFFFIPIFRKAVVAFATFTSHRF 123

Query: 144 AVGAMHAVNGMVFDLEKQSVLFVDLAKSNSRSKRTRTEDERYGSDKKAKVSIFSRSTPDP 203
           A+ AM+ +NG+ FD +  S L ++LA+SNSR K      ER GS       +      + 
Sbjct: 124 ALAAMNELNGVKFDPQTGSNLHIELARSNSRRK------ERPGS---GPYVVIDNRNKEI 183

Query: 204 GLGSTHMSGMGNSAYNTIGYPSAQSHGSFDNKTVNDTV--AANVIPQNP----------- 263
                  S  G+S  + +     Q  G+ D+   NDT    A+  P +            
Sbjct: 184 SKSQDDQSDEGDSDPDEV-----QEPGNSDSPKENDTTKSEADSEPDSKAPSANGHLEKA 243

Query: 264 --------PCPTLFVANLGPSCTEQELIQIFSRCPGFLKLKMQSTYGAPVAFVDFQDTAC 305
                    C TLF+ANLGP+CTE EL Q+ SR PGF  LK+++  G PVAF DF++   
Sbjct: 244 SEGGSGARACSTLFIANLGPNCTEDELKQLLSRYPGFHILKIRARGGMPVAFADFEEIEQ 296

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q8VC523.0e-1443.33RNA-binding protein with multiple splicing 2 OS=Mus musculus OX=10090 GN=Rbpms2 ... [more]
Q6ZRY43.9e-1443.33RNA-binding protein with multiple splicing 2 OS=Homo sapiens OX=9606 GN=RBPMS2 P... [more]
Q6DH135.1e-1443.33RNA-binding protein, mRNA-processing factor 2a OS=Danio rerio OX=7955 GN=rbpms2a... [more]
Q9W6I16.7e-1443.33RNA-binding protein with multiple splicing 2 OS=Gallus gallus OX=9031 GN=RBPMS2 ... [more]
Q9YGP58.7e-1443.33RNA-binding protein with multiple splicing 2 OS=Xenopus laevis OX=8355 GN=rbpms2... [more]
Match NameE-valueIdentityDescription
A0A1S3B1E91.2e-16495.47protein WHI4 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103484791 PE=4 SV=1[more]
A0A6J1GD021.8e-16394.23protein WHI4 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111453048 PE=4 SV=1[more]
A0A5D3CNE33.1e-16396.05U1 small nuclear ribonucleoprotein A isoform X2 OS=Cucumis melo var. makuwa OX=1... [more]
A0A1S4DUH93.1e-16395.16U1 small nuclear ribonucleoprotein A isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A6J1IQG54.0e-16394.23protein WHI4 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111477875 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
XP_038881694.19.2e-17097.41protein WHI4 isoform X2 [Benincasa hispida][more]
XP_038881693.12.3e-16897.10protein WHI4 isoform X1 [Benincasa hispida][more]
XP_008440306.22.6e-16495.47PREDICTED: protein WHI4 isoform X2 [Cucumis melo][more]
XP_023543723.12.2e-16394.23protein WHI4 isoform X2 [Cucurbita pepo subsp. pepo][more]
XP_022949757.13.7e-16394.23protein WHI4 isoform X2 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
AT2G42240.28.8e-7853.99RNA-binding (RRM/RBD/RNP motifs) family protein [more]
AT2G42240.17.4e-7754.31RNA-binding (RRM/RBD/RNP motifs) family protein [more]
AT2G42240.38.2e-7654.55RNA-binding (RRM/RBD/RNP motifs) family protein [more]
AT3G13700.21.6e-3435.71RNA-binding (RRM/RBD/RNP motifs) family protein [more]
AT3G13700.11.8e-3033.54RNA-binding (RRM/RBD/RNP motifs) family protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (USVL1VR-Ls) v1
Date Performed: 2021-10-18
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000504RNA recognition motif domainSMARTSM00360rrm1_1coord: 70..142
e-value: 2.9E-8
score: 43.5
coord: 231..300
e-value: 5.3E-5
score: 32.6
IPR000504RNA recognition motif domainPFAMPF00076RRM_1coord: 232..294
e-value: 4.9E-9
score: 35.9
coord: 71..134
e-value: 5.2E-10
score: 39.0
IPR000504RNA recognition motif domainPROSITEPS50102RRMcoord: 69..148
score: 12.065728
IPR000504RNA recognition motif domainPROSITEPS50102RRMcoord: 230..308
score: 10.564854
IPR012677Nucleotide-binding alpha-beta plait domain superfamilyGENE3D3.30.70.330coord: 65..158
e-value: 3.1E-20
score: 74.1
IPR012677Nucleotide-binding alpha-beta plait domain superfamilyGENE3D3.30.70.330coord: 219..306
e-value: 2.6E-16
score: 61.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..44
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 8..42
NoneNo IPR availablePANTHERPTHR10501:SF43RNA-BINDING PROTEIN-RELATEDcoord: 1..303
NoneNo IPR availablePANTHERPTHR10501U1 SMALL NUCLEAR RIBONUCLEOPROTEIN A/U2 SMALL NUCLEAR RIBONUCLEOPROTEIN Bcoord: 1..303
NoneNo IPR availableCDDcd12420RRM_RBPMS_likecoord: 69..144
e-value: 2.33309E-29
score: 105.815
NoneNo IPR availableCDDcd12245RRM_scw1_likecoord: 228..306
e-value: 5.41453E-39
score: 130.787
IPR035979RNA-binding domain superfamilySUPERFAMILY54928RNA-binding domain, RBDcoord: 68..294

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lsi01G009630.1Lsi01G009630.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003729 mRNA binding
molecular_function GO:0003676 nucleic acid binding