CmoCh02G016310.1 (mRNA) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh02G016310.1
TypemRNA
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionMSC domain-containing protein
LocationCmo_Chr02: 9401143 .. 9406378 (+)
Sequence length1672
RNA-Seq ExpressionCmoCh02G016310.1
SyntenyCmoCh02G016310.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AGAAAAAAAAAAAAAGGGAAAAAAAAGAGTAGTTTACTGATGTGGAAGGTTATTTTTGTCAAAACAACCAAGATAACCTAACCTACACCATAAATGGAGGGTATTTTGAGAAAATTAATTTTTGTACCGCTCACGAAGCCCAATTTTCCAAAATAAAAAAAAAGGAGGCAAAGGAACGAAGCTGGAGAGGGGACTGAGCGAGGCCGCGAAGAACGACGTTTATCGATGTCTTCAACTCCGAAGAGGCGAACGAAATTCAAGCATAATCTGAACTCTGATGTCGCTTCTAAACGCGATTCTTATGGTTCATCTTCTGCAGTGTTGCTGAATTCTATCAAGGGACCGCCTCGTGATTTCTTTCCCTCGAAGGATGATCTTACTAGGCTAATTACTGTACTTTTCATCGCCGGCTTGGTTTTTGTGAGTTGTAACTTCTTTGTATCTAGACTTGAAACTCGCCGCCCGAGGCCTTTCTGCGACTCCGACGCCGATTCTTTTGATTTGCTTTCTGGTAAGTCGTAGCATTGTTCAACCTTTCCTGTGTTTGGTGTTTAAAAACTCGAAGATGAGGAATTTGAAAATGAATTTTTTGAATGATGATTTCTAGCGTATGTTTGTTCTTTCTGAAGTAATGCTTTGTGTCTTCACAATTTTGGTGCACATTTTAGCTCATTTTTGGACGTTGCATTTAGTTTTTGTTCTTTCGTTTGCCAACAATAAATTCCGGTCACTTCAAATGGTTGTCTTATGCATAATGGCGCCATTAACTAGTACTTTCTGTTATGCACTCGTGCAAACTTGTGGAACCATAATATCATATCCTCTTTCATGGTAAAGCGAACATACTAAGATAAAGTCACTATCTCCTCTGATGAGAGAGACAAGATCAGCATCCGACGTAGAGCCCTTTGATAAACCTGATTAAGTCAGTGGTTAGCGATTGCAAAGAGAACGCTCCTTGATTAAGTCAGTGGTTAGTCGTCTGCAGAATGTTTGACTCGAGATCATCTTGTCATCCATTATATTGTTTAGAGGCGATAGATTAAAATTTGTTCTGTTTTATTCTTTGTTCATATTGCATCAACTTTTTTTTCTGTTAAAACTTGCAGGCCTTTTAGCTTTACCCTAACATTTGCTTCTTCTAACTTTCTGTTTATTTTATTTATGACTTCCAAGTTGAACGTAAAGCCAGTATAAGTGATTACATATGAAGTAATCTGTCGATGGTGATTAGTTTGGTTTTGTTCTGATAGTTCATACTTCTAACCAAAATATATATAACTAATTTCAGATGCTTGTGAGCCTTGTCCAAGTCATGGAGAATGCCATGAAGGTAAGTTGGAATGCGGTCATGGTTATAGAGAGCATGGTAGGTTATGTATAGAAGATGGAGTAATCAATAAAGCAGTTAAGAAACTTGTATGTGTGAGGTTTTTTCCCTGGTAACTGTAATTTTCTCTCTCCGTTTTCTTATTGGTCATAATTTCACACTTCTTTATCCTATTGTTGCAGTCAGAATGGCTAGAATCTCACCTCTGTGAAGCAAATGCCAAGTTTTTATGCGATGGAATTGGGATAGTTTGGGTTTGTTATCTCGGTGTCCCTATAAAAATTTCAGTTTTGTATTTATTTCCCTCTTCAATAGCACTCTAGCCTTGCTCCTTTTTGTTGTGGCGCCTCTATTTTGTTTCTCTCTTATTTAGTGTTAAGTTTAATGGTTCAAATTTAGTTGCACGTTTAATGTGGAAGGACCTAGTATTTTTGTTGTCAGCACACGGAGTTTTGATTTTAATTCATTATCAGGTTCAAGAGGATGCTATATGGGACGATCTCGATGGTAAAGCGCTGGTGGAAAACATTGATTCTGACAACACCACTATTATGTATGCAAAGAGCAAGGCATTGGAAACTATTGGTGGGTTATTCCAGGCACGGCAAAATGCTCTTGGGTATGCCATTCAACTTTTTGTATAAAGTCAGAGCCTTAGTGGAAACAATGTATCATTTGATTAAAATATGTGCTGATTGAATTTCTTTGCATCCTCCTCGTTGTGTAGTATCAAGGAATTGAAATGCCCAGATCACCTAGCTGAAAGTTACAAGCCTTTTACTTGCCGTATTCGTCACTGGGTTTTGCAGCATGCTTTTGTTGTTTTGCCAGTTTCCTTACTGGTATTCTACTCTACAAGGGAGCTTTGAACTGATTAGGTTTTAGTTGCAATATGTTGAAGCTAGTTGATGATATTGAGATGTACAGCTCGTGGGATGCACATGGTTACTATGGAAACTTTGCCGGAGACAATATCTAACTAATAGAGCTGAAGATCTGTACAACCAGGTATAAGCTATGGCTTTTTCAATTTTATTTTTCAAGTAGTCATTTTTCTTTGCGTGATTTGCTTTGTTGTTAGGTTGGTTCCTATTTGAAGCTAAAATTTTGTCCTCTCAGCATGCGTAGTCTGATTAACATTGTTAATTCACTACATAGATGGCAAATAAAAGATCTTCCAACTGATGAATTTTAGAATAGAAAATAATTGTAGCATGCATGCTTATTAATTCTTTTCCATGTGGTACTTTGTGATTGACCCTCCTGAAACAATTTCAGTTTCTTTCCTCTAAAGTTGAGACTTCGTTGAGATATGAAGTTTTATGCTATTCTCTAGTTTGCAATCATAAGTACAGAATTTTCAAGTTGTTGATCTCCCAATCAGGATTGTTTTAGAAGTTTATCACATTGAGTATTTGAAGTCCCAAAACTTTTTTTATATTTATTGACTAAAAAATTTCATCTTCTACTGTCAAGATAAATTGAAGTGTGACCAGATTTTTCACATACAGTTCGTGTGGTAATAAGAGAAACCCTTACCCCAAGACGTTTTGCCAACCCCGGTAGATTTTTTTCACTCTAATGCTAATCTCTCGGTCTTCATTACAGGTTTGCGAAATACTTGAGGAAAATGCTTTGATGTCAACGAGGAACAGTGGACAATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGAGAGAGGAAGGATCCTTTATTATGGAGAAAGGTATCACGTCCCTCCTTGACTCTCATTTTTTCTTATTTCATTTAGAATCATTACATTTTGTTTTTTGTCGCATGGATTTTTAACGTTTACTGAAGTTACAATCCGAGTGGTGGATGTAGTTCTGAATATTTGGCAATAGAGTTTGCTAAAATACTTAGGTCGTTCAAATTCATTGAAATGTACATAGAAAAATGTTTTCTCGATTCATATGCGAGATGGATTTTGATAGATGAATAATTCTTTTTGGATATAGCCATGGCTAAAATTTGGAAACCAATCTGAAATCCTAATTTTTTCCATCAACAAAGATTAAAGAGATCACATAATCTGTTTGTCCTCTCCCCATTCCAATTTATAATCACTTCATTTGATGTTCTTCTTGATGCTTTCCCTAATTAAGTTAAATTACAAGTTTAGTTCTTTACCTTTCATGTTTATTTGTAATTATCCTACTTGTTTTCTTGTTTCCTACATTAATGAAATTGTTTGCTTCCTGAAAAAACACTTCCATATTTGTGTTTTGGTCTCTAAACTTTAAAAAAAATATTTGTTAGGTTTTGAATTTAAATTTTATGTATAAAAGGTCTTTGATATATTAAAAAATTAACAAATGATTAGATATTAAACACAAAATTAAAGTACAAGGACTCATTGAACACAAATTTGAAAGTTTTGGGAATCATTAATACATAAGTTCAAGAATTTGTAAACACAAAATTAAAAGTTTATGGTCCTACTAAATATCCTTTTAAAGCTAAGGAACATGTTTGGCACAAACATGAAAGTTAAGGGATTAAATTTATGATCTTACCATTACTTTATTTTATGTAGGTAGAAGAGTTGGTTCAGGAAGATTCACGAATAGATCGGTACCCGAGATTGGTCAAGGGGGATGGAAAAGAAGTATGGGAATGGCAAGGTATGAAATATTTTTTCTTTTCATTTCTAATTCCTAATATAGAATGAGATAAACTCATCTATAAATAAAATCATGATGACTAGATATGGAACTCGTCCCTAGACAAAATTCATATATTATGCCAACAAAATTCATATATTATGCCAACATGCACCTAACTTTCCTCATGAGGTTGTGGGTTTGAATTCCCAACCTCACATGTGATTTAATATTCTAAAAAGGAAAATACGTATATTATAACAATATAAACATGGTTAATTTTTAACTTGGGATGATTCGTGCAAAATTCTTTTTGGTATACATTTATGCAGACACATTTAAAGGGTTAATTTTAAATTTGAAAAGTGGAGAGATAAAATAGAATATTAGGAACTTTTGGAACGCAGATGAATCGAGGACTTGCATTTAAACCCATCTTTTTAAGTTAGAAGCACTAAAAGCTTGGGGACTCAAACATGAATTGTTACAAGACAGAGCGAGCAAAACGATTCTTTTGGCGTTGTCATCTCCAGTACAAGTAAATCTGATATCCTTGTTCTTGTTGAACAGTAGAAGGCTCTTTAAGCTCTTCAAAGGAAAAGAGACTGGCTAGCAAATCCAGTTCCAGAATGGTGATGGGAGTAAATTCTGACGTAATATACTCAAAAATGGAGAACGGTGAGTTATATATATGCTTGATCGAGTTACAGAATCGGGTTTTTTCTCCATTCTCATGCTATGTCGACTTTCTTCCATTCATTTAGTTTTGAAAAATGACTAAACTAATGCACAGAGCTGAAGGCAGTAGTTTCGTGACATGGCGCTCGTCCATGAGGATATTGTAGCTTAGCTGGTTAATAGAAAGAATGAAAACAAGAGGTACAAGATTTAAAAGTCCATAATTATTTTGTTTGATTCTTGTGTGAAGTACTTGTTAAGCTGGTTCTGTTGCGACAACTGGGGCCTGTAGGGGTATGAGGTACTGTAATATTAATTCCTTTCCACTGTTCTTTCCATAGCACTTCGTTAATCTGCTTCTGTTGCGACAACTGGGGTCTGTAGGGGTATGAGGTACTGTAATATTAATCCCTTCAAACTGTTCTTTCCATAGCACTTTGTTAATCTGCTTCTGTTGCGACAACTGGGGCCTGTAGGGGTATGAGGTACTATAATATTATTCCCTTCTACTGTTCTTTCGATAGCACTATCTTGCCCCCTCCCACCTGGAGAATCCCATAGTATTTTAATGGACTGTTTGCGTG

mRNA sequence

AGAAAAAAAAAAAAAGGGAAAAAAAAGAGTAGTTTACTGATGTGGAAGGTTATTTTTGTCAAAACAACCAAGATAACCTAACCTACACCATAAATGGAGGGTATTTTGAGAAAATTAATTTTTGTACCGCTCACGAAGCCCAATTTTCCAAAATAAAAAAAAAGGAGGCAAAGGAACGAAGCTGGAGAGGGGACTGAGCGAGGCCGCGAAGAACGACGTTTATCGATGTCTTCAACTCCGAAGAGGCGAACGAAATTCAAGCATAATCTGAACTCTGATGTCGCTTCTAAACGCGATTCTTATGGTTCATCTTCTGCAGTGTTGCTGAATTCTATCAAGGGACCGCCTCGTGATTTCTTTCCCTCGAAGGATGATCTTACTAGGCTAATTACTGTACTTTTCATCGCCGGCTTGGTTTTTGTGAGTTGTAACTTCTTTGTATCTAGACTTGAAACTCGCCGCCCGAGGCCTTTCTGCGACTCCGACGCCGATTCTTTTGATTTGCTTTCTGATGCTTGTGAGCCTTGTCCAAGTCATGGAGAATGCCATGAAGGTAAGTTGGAATGCGGTCATGGTTATAGAGAGCATGGTAGGTTATGTATAGAAGATGGAGTAATCAATAAAGCAGTTAAGAAACTTTCAGAATGGCTAGAATCTCACCTCTGTGAAGCAAATGCCAAGTTTTTATGCGATGGAATTGGGATAGTTTGGGTTCAAGAGGATGCTATATGGGACGATCTCGATGGTAAAGCGCTGGTGGAAAACATTGATTCTGACAACACCACTATTATGTATGCAAAGAGCAAGGCATTGGAAACTATTGGTGGGTTATTCCAGGCACGGCAAAATGCTCTTGGTATCAAGGAATTGAAATGCCCAGATCACCTAGCTGAAAGTTACAAGCCTTTTACTTGCCGTATTCGTCACTGGGTTTTGCAGCATGCTTTTGTTGTTTTGCCAGTTTCCTTACTGCTCGTGGGATGCACATGGTTACTATGGAAACTTTGCCGGAGACAATATCTAACTAATAGAGCTGAAGATCTGTACAACCAGGTTTGCGAAATACTTGAGGAAAATGCTTTGATGTCAACGAGGAACAGTGGACAATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGAGAGAGGAAGGATCCTTTATTATGGAGAAAGGTAGAAGAGTTGGTTCAGGAAGATTCACGAATAGATCGGTACCCGAGATTGGTCAAGGGGGATGGAAAAGAAGTATGGGAATGGCAAGTAGAAGGCTCTTTAAGCTCTTCAAAGGAAAAGAGACTGGCTAGCAAATCCAGTTCCAGAATGGTGATGGGAGTAAATTCTGACGTAATATACTCAAAAATGGAGAACGAGCTGAAGGCAGTAGTTTCGTGACATGGCGCTCGTCCATGAGGATATTGTAGCTTAGCTGGTTAATAGAAAGAATGAAAACAAGAGGTACAAGATTTAAAAGTCCATAATTATTTTGTTTGATTCTTGTGTGAAGTACTTGTTAAGCTGGTTCTGTTGCGACAACTGGGGCCTGTAGGGGTATGAGGTACTATAATATTATTCCCTTCTACTGTTCTTTCGATAGCACTATCTTGCCCCCTCCCACCTGGAGAATCCCATAGTATTTTAATGGACTGTTTGCGTG

Coding sequence (CDS)

ATGTCTTCAACTCCGAAGAGGCGAACGAAATTCAAGCATAATCTGAACTCTGATGTCGCTTCTAAACGCGATTCTTATGGTTCATCTTCTGCAGTGTTGCTGAATTCTATCAAGGGACCGCCTCGTGATTTCTTTCCCTCGAAGGATGATCTTACTAGGCTAATTACTGTACTTTTCATCGCCGGCTTGGTTTTTGTGAGTTGTAACTTCTTTGTATCTAGACTTGAAACTCGCCGCCCGAGGCCTTTCTGCGACTCCGACGCCGATTCTTTTGATTTGCTTTCTGATGCTTGTGAGCCTTGTCCAAGTCATGGAGAATGCCATGAAGGTAAGTTGGAATGCGGTCATGGTTATAGAGAGCATGGTAGGTTATGTATAGAAGATGGAGTAATCAATAAAGCAGTTAAGAAACTTTCAGAATGGCTAGAATCTCACCTCTGTGAAGCAAATGCCAAGTTTTTATGCGATGGAATTGGGATAGTTTGGGTTCAAGAGGATGCTATATGGGACGATCTCGATGGTAAAGCGCTGGTGGAAAACATTGATTCTGACAACACCACTATTATGTATGCAAAGAGCAAGGCATTGGAAACTATTGGTGGGTTATTCCAGGCACGGCAAAATGCTCTTGGTATCAAGGAATTGAAATGCCCAGATCACCTAGCTGAAAGTTACAAGCCTTTTACTTGCCGTATTCGTCACTGGGTTTTGCAGCATGCTTTTGTTGTTTTGCCAGTTTCCTTACTGCTCGTGGGATGCACATGGTTACTATGGAAACTTTGCCGGAGACAATATCTAACTAATAGAGCTGAAGATCTGTACAACCAGGTTTGCGAAATACTTGAGGAAAATGCTTTGATGTCAACGAGGAACAGTGGACAATGTGAATCATGGGTTGTTGCTTCTAGGTTACGTGACCATCTTCTTTTGCCACGAGAGAGGAAGGATCCTTTATTATGGAGAAAGGTAGAAGAGTTGGTTCAGGAAGATTCACGAATAGATCGGTACCCGAGATTGGTCAAGGGGGATGGAAAAGAAGTATGGGAATGGCAAGTAGAAGGCTCTTTAAGCTCTTCAAAGGAAAAGAGACTGGCTAGCAAATCCAGTTCCAGAATGGTGATGGGAGTAAATTCTGACGTAATATACTCAAAAATGGAGAACGAGCTGAAGGCAGTAGTTTCGTGA

Protein sequence

MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFIAGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYREHGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVENIDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHAFVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSKEKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS
Homology
BLAST of CmoCh02G016310.1 vs. ExPASy TrEMBL
Match: A0A6J1H2A7 (uncharacterized protein LOC111459381 isoform X3 OS=Cucurbita moschata OX=3662 GN=LOC111459381 PE=4 SV=1)

HSP 1 Score: 806.2 bits (2081), Expect = 6.0e-230
Identity = 394/394 (100.00%), Postives = 394/394 (100.00%), Query Frame = 0

Query: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60
           MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI
Sbjct: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60

Query: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120
           AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE
Sbjct: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120

Query: 121 HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN 180
           HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN
Sbjct: 121 HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN 180

Query: 181 IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA 240
           IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA
Sbjct: 181 IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA 240

Query: 241 FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300
           FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Sbjct: 241 FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300

Query: 301 ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360
           ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK
Sbjct: 301 ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360

Query: 361 EKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS 395
           EKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS
Sbjct: 361 EKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS 394

BLAST of CmoCh02G016310.1 vs. ExPASy TrEMBL
Match: A0A6J1H1Y9 (uncharacterized protein LOC111459381 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111459381 PE=4 SV=1)

HSP 1 Score: 801.2 bits (2068), Expect = 1.9e-228
Identity = 394/396 (99.49%), Postives = 394/396 (99.49%), Query Frame = 0

Query: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60
           MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI
Sbjct: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60

Query: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120
           AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE
Sbjct: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120

Query: 121 HGRLCIEDGVINKAVKKL--SEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALV 180
           HGRLCIEDGVINKAVKKL  SEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALV
Sbjct: 121 HGRLCIEDGVINKAVKKLVCSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALV 180

Query: 181 ENIDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQ 240
           ENIDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQ
Sbjct: 181 ENIDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQ 240

Query: 241 HAFVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW 300
           HAFVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW
Sbjct: 241 HAFVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW 300

Query: 301 VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSS 360
           VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSS
Sbjct: 301 VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSS 360

Query: 361 SKEKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS 395
           SKEKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS
Sbjct: 361 SKEKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS 396

BLAST of CmoCh02G016310.1 vs. ExPASy TrEMBL
Match: A0A6J1H3U4 (uncharacterized protein LOC111459381 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111459381 PE=4 SV=1)

HSP 1 Score: 795.0 bits (2052), Expect = 1.4e-226
Identity = 393/396 (99.24%), Postives = 393/396 (99.24%), Query Frame = 0

Query: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60
           MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI
Sbjct: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60

Query: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120
           AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE
Sbjct: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120

Query: 121 HGRLCIEDGVINKAVKKL--SEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALV 180
           HGRLCIEDGVINKAVKKL  SEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALV
Sbjct: 121 HGRLCIEDGVINKAVKKLVCSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALV 180

Query: 181 ENIDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQ 240
           ENIDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQ
Sbjct: 181 ENIDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQ 240

Query: 241 HAFVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW 300
           HAFVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW
Sbjct: 241 HAFVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW 300

Query: 301 VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSS 360
           VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQ EGSLSS
Sbjct: 301 VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQ-EGSLSS 360

Query: 361 SKEKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS 395
           SKEKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS
Sbjct: 361 SKEKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS 395

BLAST of CmoCh02G016310.1 vs. ExPASy TrEMBL
Match: A0A6J1K9D6 (uncharacterized protein LOC111491297 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111491297 PE=4 SV=1)

HSP 1 Score: 765.8 bits (1976), Expect = 9.0e-218
Identity = 378/394 (95.94%), Postives = 382/394 (96.95%), Query Frame = 0

Query: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60
           MSSTPKRRTK+KHN NSDV  KRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI
Sbjct: 1   MSSTPKRRTKYKHNPNSDVTFKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60

Query: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120
           AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEG LEC HGYRE
Sbjct: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGTLECLHGYRE 120

Query: 121 HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN 180
           HGRLCIEDGVINKAVKKLSEWLE HLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN
Sbjct: 121 HGRLCIEDGVINKAVKKLSEWLEFHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN 180

Query: 181 IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA 240
            DSDNTTIMYAKSKALETIGGLFQAR+NALGIKELKCPDHLAESYKP TCRIRHWVLQHA
Sbjct: 181 -DSDNTTIMYAKSKALETIGGLFQARKNALGIKELKCPDHLAESYKPLTCRIRHWVLQHA 240

Query: 241 FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300
           F+VLPVSLLLVGCT LLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Sbjct: 241 FLVLPVSLLLVGCTRLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300

Query: 301 ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360
           ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK
Sbjct: 301 ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360

Query: 361 EKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS 395
            KRLASKSS+RM MGVNSDVIYSKMENE KAVVS
Sbjct: 361 AKRLASKSSARMAMGVNSDVIYSKMENEPKAVVS 393

BLAST of CmoCh02G016310.1 vs. ExPASy TrEMBL
Match: A0A6J1K380 (uncharacterized protein LOC111491297 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111491297 PE=4 SV=1)

HSP 1 Score: 760.8 bits (1963), Expect = 2.9e-216
Identity = 378/396 (95.45%), Postives = 382/396 (96.46%), Query Frame = 0

Query: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60
           MSSTPKRRTK+KHN NSDV  KRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI
Sbjct: 1   MSSTPKRRTKYKHNPNSDVTFKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60

Query: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120
           AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEG LEC HGYRE
Sbjct: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGTLECLHGYRE 120

Query: 121 HGRLCIEDGVINKAVKKL--SEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALV 180
           HGRLCIEDGVINKAVKKL  SEWLE HLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALV
Sbjct: 121 HGRLCIEDGVINKAVKKLVCSEWLEFHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALV 180

Query: 181 ENIDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQ 240
           EN DSDNTTIMYAKSKALETIGGLFQAR+NALGIKELKCPDHLAESYKP TCRIRHWVLQ
Sbjct: 181 EN-DSDNTTIMYAKSKALETIGGLFQARKNALGIKELKCPDHLAESYKPLTCRIRHWVLQ 240

Query: 241 HAFVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW 300
           HAF+VLPVSLLLVGCT LLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW
Sbjct: 241 HAFLVLPVSLLLVGCTRLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW 300

Query: 301 VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSS 360
           VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSS
Sbjct: 301 VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSS 360

Query: 361 SKEKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS 395
           SK KRLASKSS+RM MGVNSDVIYSKMENE KAVVS
Sbjct: 361 SKAKRLASKSSARMAMGVNSDVIYSKMENEPKAVVS 395

BLAST of CmoCh02G016310.1 vs. NCBI nr
Match: XP_022958030.1 (uncharacterized protein LOC111459381 isoform X3 [Cucurbita moschata])

HSP 1 Score: 806.2 bits (2081), Expect = 1.2e-229
Identity = 394/394 (100.00%), Postives = 394/394 (100.00%), Query Frame = 0

Query: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60
           MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI
Sbjct: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60

Query: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120
           AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE
Sbjct: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120

Query: 121 HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN 180
           HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN
Sbjct: 121 HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN 180

Query: 181 IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA 240
           IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA
Sbjct: 181 IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA 240

Query: 241 FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300
           FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Sbjct: 241 FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300

Query: 301 ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360
           ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK
Sbjct: 301 ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360

Query: 361 EKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS 395
           EKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS
Sbjct: 361 EKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS 394

BLAST of CmoCh02G016310.1 vs. NCBI nr
Match: KAG6606225.1 (hypothetical protein SDJN03_03542, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 804.7 bits (2077), Expect = 3.6e-229
Identity = 393/394 (99.75%), Postives = 393/394 (99.75%), Query Frame = 0

Query: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60
           MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI
Sbjct: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60

Query: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120
           AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE
Sbjct: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120

Query: 121 HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN 180
           HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN
Sbjct: 121 HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN 180

Query: 181 IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA 240
           IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA
Sbjct: 181 IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA 240

Query: 241 FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300
           FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Sbjct: 241 FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300

Query: 301 ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360
           ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK
Sbjct: 301 ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360

Query: 361 EKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS 395
           EKRLASKSSSRM MGVNSDVIYSKMENELKAVVS
Sbjct: 361 EKRLASKSSSRMAMGVNSDVIYSKMENELKAVVS 394

BLAST of CmoCh02G016310.1 vs. NCBI nr
Match: XP_022958028.1 (uncharacterized protein LOC111459381 isoform X1 [Cucurbita moschata])

HSP 1 Score: 801.2 bits (2068), Expect = 4.0e-228
Identity = 394/396 (99.49%), Postives = 394/396 (99.49%), Query Frame = 0

Query: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60
           MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI
Sbjct: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60

Query: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120
           AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE
Sbjct: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120

Query: 121 HGRLCIEDGVINKAVKKL--SEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALV 180
           HGRLCIEDGVINKAVKKL  SEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALV
Sbjct: 121 HGRLCIEDGVINKAVKKLVCSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALV 180

Query: 181 ENIDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQ 240
           ENIDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQ
Sbjct: 181 ENIDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQ 240

Query: 241 HAFVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW 300
           HAFVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW
Sbjct: 241 HAFVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW 300

Query: 301 VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSS 360
           VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSS
Sbjct: 301 VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSS 360

Query: 361 SKEKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS 395
           SKEKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS
Sbjct: 361 SKEKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS 396

BLAST of CmoCh02G016310.1 vs. NCBI nr
Match: XP_023533382.1 (uncharacterized protein LOC111795284 isoform X3 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 797.3 bits (2058), Expect = 5.8e-227
Identity = 391/396 (98.74%), Postives = 392/396 (98.99%), Query Frame = 0

Query: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60
           MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNS+KGPPRDFFPSKDDLTRLITVLFI
Sbjct: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSVKGPPRDFFPSKDDLTRLITVLFI 60

Query: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120
           AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE
Sbjct: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120

Query: 121 HGRLCIEDGVINKAVKKL--SEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALV 180
           HGRLCIEDGVINKAVKKL  SEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALV
Sbjct: 121 HGRLCIEDGVINKAVKKLVCSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALV 180

Query: 181 ENIDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQ 240
           ENIDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQ
Sbjct: 181 ENIDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQ 240

Query: 241 HAFVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW 300
           HAFVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW
Sbjct: 241 HAFVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESW 300

Query: 301 VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSS 360
           VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSS
Sbjct: 301 VVASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSS 360

Query: 361 SKEKRLASKSSSRMVMGVNSDVIYSKMENELKAVVS 395
           SKEKRLASKSSSRM MGVNSDVIYSKMENEL AVVS
Sbjct: 361 SKEKRLASKSSSRMAMGVNSDVIYSKMENELNAVVS 396

BLAST of CmoCh02G016310.1 vs. NCBI nr
Match: KAG7036172.1 (hypothetical protein SDJN02_02973 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 796.2 bits (2055), Expect = 1.3e-226
Identity = 386/393 (98.22%), Postives = 390/393 (99.24%), Query Frame = 0

Query: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60
           MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRL+TVLFI
Sbjct: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLVTVLFI 60

Query: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120
           AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE
Sbjct: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120

Query: 121 HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN 180
           HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN
Sbjct: 121 HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN 180

Query: 181 IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA 240
           IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA
Sbjct: 181 IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA 240

Query: 241 FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300
           FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV
Sbjct: 241 FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMSTRNSGQCESWVV 300

Query: 301 ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360
           ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK
Sbjct: 301 ASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSSK 360

Query: 361 EKRLASKSSSRMVMGVNSDVIYSKMENELKAVV 394
           EKRLASKSSSRM MGVNSDVIYSKMEN+ +A +
Sbjct: 361 EKRLASKSSSRMAMGVNSDVIYSKMENDAEATI 393

BLAST of CmoCh02G016310.1 vs. TAIR 10
Match: AT5G46560.1 (CONTAINS InterPro DOMAIN/s: Inner nuclear membrane protein MAN1 (InterPro:IPR018996); Has 58 Blast hits to 58 proteins in 29 species: Archae - 0; Bacteria - 4; Metazoa - 11; Fungi - 15; Plants - 20; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 337.4 bits (864), Expect = 1.5e-92
Identity = 166/364 (45.60%), Postives = 234/364 (64.29%), Query Frame = 0

Query: 1   MSSTPKRRTKFKHNLNSDVASKRDSYGSSSAVLLNSIKGPPRDFFPSKDDLTRLITVLFI 60
           M S P++R K      S+  + R    SSS+  + S+  PP+  FPSK +   L+ VL +
Sbjct: 1   MDSIPRKRPK------SETRTGRTPKSSSSSSPIRSMLEPPQSLFPSKGEFFTLLKVLLV 60

Query: 61  AGLVFVSCNFFVSRLETRRPRPFCDSDADSFDLLSDACEPCPSHGECHEGKLECGHGYRE 120
           A  V  +CNF    L +   + FCDS+ +  D   D CEPCP +GEC++GKL+C  GY+ 
Sbjct: 61  ACAVAFTCNFLSKSLSSNPSKSFCDSNFNPIDSDLDICEPCPINGECYQGKLQCNLGYKN 120

Query: 121 HGRLCIEDGVINKAVKKLSEWLESHLCEANAKFLCDGIGIVWVQEDAIWDDLDGKALVEN 180
              LC+EDG IN++ KKL  + E  +CE+ A   C G G +WV E+ +W +L   + + N
Sbjct: 121 QRNLCVEDGEINESTKKLVGYFERKVCESYAHNECYGTGTIWVPENDVWTELRSNSFLSN 180

Query: 181 IDSDNTTIMYAKSKALETIGGLFQARQNALGIKELKCPDHLAESYKPFTCRIRHWVLQHA 240
           +  D +   + K KA+E +  L + R N+ GI ELKCP+ +A+SYKP TCR+  W+L+H 
Sbjct: 181 L--DESAYNFLKGKAVEGVTELLEKRTNSNGIDELKCPESVAKSYKPLTCRLHQWILRHI 240

Query: 241 FVVLPVSLLLVGCTWLLWKLCRRQYLTNRAEDLYNQVCEILEENALMS-TRNSGQCESWV 300
            ++     +LVG   L  ++ R+Q  + R E+LY+QVC+ LEENA+ S +  +  CE WV
Sbjct: 241 LIISSSCAMLVGSAMLRRRIQRKQCFSRRVEELYDQVCDFLEENAVASNSAETSNCEPWV 300

Query: 301 VASRLRDHLLLPRERKDPLLWRKVEELVQEDSRIDRYPRLVKGDGKEVWEWQVEGSLSSS 360
           +AS LRD+LLLPRER+DPLLW KVEEL++EDSRIDRY +L+KG+ K VWEWQVEGSLS S
Sbjct: 301 IASWLRDYLLLPRERRDPLLWTKVEELIKEDSRIDRYEKLLKGEKKVVWEWQVEGSLSLS 356

Query: 361 KEKR 364
           K K+
Sbjct: 361 KLKK 356

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1H2A76.0e-230100.00uncharacterized protein LOC111459381 isoform X3 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1H1Y91.9e-22899.49uncharacterized protein LOC111459381 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1H3U41.4e-22699.24uncharacterized protein LOC111459381 isoform X2 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1K9D69.0e-21895.94uncharacterized protein LOC111491297 isoform X3 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1K3802.9e-21695.45uncharacterized protein LOC111491297 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
Match NameE-valueIdentityDescription
XP_022958030.11.2e-229100.00uncharacterized protein LOC111459381 isoform X3 [Cucurbita moschata][more]
KAG6606225.13.6e-22999.75hypothetical protein SDJN03_03542, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022958028.14.0e-22899.49uncharacterized protein LOC111459381 isoform X1 [Cucurbita moschata][more]
XP_023533382.15.8e-22798.74uncharacterized protein LOC111795284 isoform X3 [Cucurbita pepo subsp. pepo][more]
KAG7036172.11.3e-22698.22hypothetical protein SDJN02_02973 [Cucurbita argyrosperma subsp. argyrosperma][more]
Match NameE-valueIdentityDescription
AT5G46560.11.5e-9245.60CONTAINS InterPro DOMAIN/s: Inner nuclear membrane protein MAN1 (InterPro:IPR018... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR018996Man1/Src1, C-terminalPFAMPF09402MSCcoord: 94..350
e-value: 6.7E-14
score: 52.1
IPR041885MAN1, winged-helix domainGENE3D1.10.10.1180coord: 256..350
e-value: 2.2E-12
score: 49.1
IPR044780Heh2/Src1-likePANTHERPTHR47808INNER NUCLEAR MEMBRANE PROTEIN HEH2-RELATEDcoord: 48..370

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CmoCh02G016310CmoCh02G016310gene


The following exon feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh02G016310.1:exon:8368CmoCh02G016310.1:exon:8368exon
CmoCh02G016310.1:exon:8369CmoCh02G016310.1:exon:8369exon
CmoCh02G016310.1:exon:8370CmoCh02G016310.1:exon:8370exon
CmoCh02G016310.1:exon:8371CmoCh02G016310.1:exon:8371exon
CmoCh02G016310.1:exon:8372CmoCh02G016310.1:exon:8372exon
CmoCh02G016310.1:exon:8373CmoCh02G016310.1:exon:8373exon
CmoCh02G016310.1:exon:8374CmoCh02G016310.1:exon:8374exon
CmoCh02G016310.1:exon:8375CmoCh02G016310.1:exon:8375exon
CmoCh02G016310.1:exon:8376CmoCh02G016310.1:exon:8376exon
CmoCh02G016310.1:exon:8377CmoCh02G016310.1:exon:8377exon
CmoCh02G016310.1:exon:8378CmoCh02G016310.1:exon:8378exon


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh02G016310.1:five_prime_utrCmoCh02G016310.1:five_prime_utrfive_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh02G016310.1:cdsCmoCh02G016310.1:cdsCDS
CmoCh02G016310.1:cdsCmoCh02G016310.1:cds_2CDS
CmoCh02G016310.1:cdsCmoCh02G016310.1:cds_3CDS
CmoCh02G016310.1:cdsCmoCh02G016310.1:cds_4CDS
CmoCh02G016310.1:cdsCmoCh02G016310.1:cds_5CDS
CmoCh02G016310.1:cdsCmoCh02G016310.1:cds_6CDS
CmoCh02G016310.1:cdsCmoCh02G016310.1:cds_7CDS
CmoCh02G016310.1:cdsCmoCh02G016310.1:cds_8CDS
CmoCh02G016310.1:cdsCmoCh02G016310.1:cds_9CDS
CmoCh02G016310.1:cdsCmoCh02G016310.1:cds_10CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
CmoCh02G016310.1:three_prime_utrCmoCh02G016310.1:three_prime_utrthree_prime_UTR
CmoCh02G016310.1:three_prime_utrCmoCh02G016310.1:three_prime_utr_2three_prime_UTR


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CmoCh02G016310.1CmoCh02G016310.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0005637 nuclear inner membrane
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003682 chromatin binding