CmoCh10G008470 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh10G008470
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionUnknown protein
LocationCmo_Chr10: 4013840 .. 4017294 (-)
RNA-Seq ExpressionCmoCh10G008470
SyntenyCmoCh10G008470
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TATCAACAATTTGCTCTATGATCTATGGCCAACTTCTATTATTCAAAACCCAAACTCCATTCATCCACCACCTTTTTATTTCTTCTTTGCCTTCTACTTGCACGCCTACAACTTTCAAGTAAGTCTTTCTCTCTCTCGAGATTTGATCTCTTAAAATTATCGAGTTTGATACATAGAATCGAAAAATACATCTAAATTTTAAGTTTTGGTTAAACAATTCTTGAATATCGAACGGTTAGAGTTATAACGAGACTTACCGAATTTGATTCAGGTGCTAATCAGACATTCCCAACAAAAGTGAACCATAGTTATAGTCAAAGCAACATCCAAAGAATTCGACTTGACAATCCGACCCTTGGCTTTCTTGTGGAAAAGAAATTGTCATTGCTACTCAATTTGTACCCTTACTTGAGTTATATCAACACCCATAACATGCCTCTCAATTATGCACTTTTACAAGGACGTCGGCTATTGTTCATGGCAAATTTGTTTACCAAAATATGTTCCATGACTTTTTGGATGCTGTTTATTCTGTGTTGGACGAGTCCAATGGTGAAGAGGACTTTGTATCGGATAGTCGTTGGCTAACCGACGTTGTAGTAGAATTGTTTCAAGCAAATTTTCAAGAAGTTGCTAACATAAAACTATTCCAAGCAAAACACGCTGAACTACGGAGTTGTTATGATTGACCCACCAAAAAGAAGGTGCATCTTATCTTATTAATAATCACTTTCAATTCTGGTAAGTCTTTTTTAGTTATTCTATCCTCAAAGTCTTTATCATTTGAATATAGTATCGATTTACTCATATACTCCAAGTGTCACAACTTAGTATTTGGATAATAATTTGCAACAATCACATGATGGAAATAAAATTTACTTTGTTAAGCTTGATCATAGTTTCAGCTCTTAAGTAAAGTTCATGTAACAAGATTAAGAAAGCTAAAGGAAAATGTGAGATCCTACATCGATTGGAGAGAAGAACGAGTGCCAGCAATGACGCTGGACCCTAAACAGGAGTGGATTGTGAGATCCCTCATCAGTTAGGAGAAGAATGAAGCATTCTTTATAACGGCATGAAAACCTCTCCTTAGTAGATGTGTTTTTAAAACGGGAAAGAATCTAAAAAAGAAAAGCTCCCAGGGTGAGCATACCCTCACCTAGCTCACTAACCCATCACTTCAAGAATCTTCTCTAAACAATGCATAATATTAGGAGTTATGTGGGCTTATCCTTTAATAATCTCTCGACATAGACCACGAGACATCTGTCAAGAGAGAATACGTGAGTTAGAAGGCAAGGGTTTTTGATATACAGAAGAAGTAAGGAAAGGGAAGAAAAGGGCTAAATTTAAAGAGAAGCGGTGGAGTAGAAACCCGCCACGTTTAACTTGGTGGCAGGCTTCTGCAGGCATTCCTTTTCCTCTGCACAATCCACCACCGCCTTCCCTTGCATTCTCTTCAACAGCAGCTATTCTCTCTCCCTCTCTCTCTATCCGCCATGGAAGAAGAGCTTGTTTGATTATGGCCGATGAACCACCTGAGTTCATCAGGTTTTGTTTATTTTTCTTCATGTCTTAGCTTTTAGGATCTTTGATGTCAAGATTATGGATAAAAGTTTTCTTTGATTTGTCAGGATGGAGGGCTATCGCAGCATAGACTGGAATATGGAAGAACAACTATCCTCCGGTGATGGCCTGAGCAGTGAAAAGATATGCTCCGCCGTGCAAAAGGGTTGCAGTCTGGGGAAGAAGCTTCTCCTCACTGGTTTAGCCATATCCTCTGTTCCTGTGGTTCTTCCTCCATTGGTTATCATGTCAGCCTTTGGAATTGCAGCCTCAATACCCTATGGGGTTTTCCTCGCCTCTTATGCTTGTACTGAGACGATCATGAGTGTTTGGCTTCCGATTCCCCCGGCCCTGAAGCTCGACCGCGCTGACGAAGAGATTGTGGAGGAAGACATCTATGAAGATGAAGAAAAGCATATGATGGAAACAGCTAAGAGGGGTGAGAATTTGGATGATTTTGATATCGATGTGGTAGTAGTTCAGGGGGACGAAGAAAGCGAAACCGACATTGGAAGCAAAGGTCTGGCAGCAATTGAAGTGACCAATGTGGAATTTGAAGGAAATGGAGATATTGGAGATGAGGAAGAAGAGGAAGAAGAGTTGAAGGAAACTAGAGGTTTACTCGAAAGAATCAGGGATGAGGGACGAAGAGACAACGGTTTTGTTGATGAAAATGGAGGTGTTGAGGATGTTCGAGAGCTCGAGATTTCAATGGAGGACGAGAAACCAAGTGATTCTGTTGAAAAAAGTGTTCTAGGTTTGTTGAATGAAGTTGACTCTGCTGCTGTTTATCCTCATGAACACTACAGAACTTCTGAAGGTAATTCATTTCTTTTGTTTATAAGTTAAGAGTTAGAATTCAAGCCTATGTTTAGAACGCGAATCGGTTTATTATTCCTAGTTACGAGTTTGCTAATAACTCACGATATCTCTTTTCGGGTTGAGAGTTTGAATATCTAACCTTTGTCATCGGAATTAAAGGGGTCGGGTCGGCGAAATCTAGTGATGAAACAAATGCAATAACGACATTGACAAACAAGGCTGCCAAGTCTGAAGAAGCTGAGCAACTTCCTAAAGTAACAATGATTGATGTGATTGAATCTGATGAAGGTTTGTCTATATCAGAGGTGACTATTGAACACAAAGTTGAGGCAAATTTTCCACATAAAGATCATAGGACGTCTTCCAATGAGGTCAGTACATAATGTAAAGACATCGATTAAATCCGAATACCATGGCCTTGTTTCCTTCGTTCTTCGTTTACCGATCGAGCTAATGAATATGACTAACAAGTACTGAGTTTTAATTGATATCTGTTCTTGGCATCAACTTTGGCATACTCACTTTCTTGGGCTTGGATTGAATGTTTGAAATTCCTGAGTTTTCCCTTTTCTTTTTGCAAGGAACTATCAGGAGAGGTAAAGATAAGAGAAAAGATTGCTTCGATGAAGAAGATCGTAGGATACAAGGCTACCCCCCTCGGAACATACTTAGACGAAGTGAACGCTCTATATGCCTTCATCGGAGTCGAGCCACCTTCCCCGATGAAAGATTCTGCTAATGATGATGATATCAATCTACTTAACCAGAAGTTGCAGTTTCTCATGTCCATAGTAGGGGTCAAGTAGGGGTGTTAAAAGTTTGGCATTGAATAAGTTGCCATTTGTGCACTTTTGTTTGCCAATGGCAATATTTCCTTATGTTTGAGCTATGCACAGTTTGGTTTTGAAATTATGTACCAAATTTTGAACAGTTTATCTACTTTTTTCCCTCTTTCTGTTCAGTTCTTTTCATCCACAAGGGTTCTTGAATCACTCATGGGTGTTAGGAAGTGAAAAATCCAAGAGATCTATGAAATTTATAGCTTAAATGATGAAGAGTGATCGGAC

mRNA sequence

TATCAACAATTTGCTCTATGATCTATGGCCAACTTCTATTATTCAAAACCCAAACTCCATTCATCCACCACCTTTTTATTTCTTCTTTGCCTTCTACTTGCACGCCTACAACTTTCAAGTGCTAATCAGACATTCCCAACAAAAGTGAACCATAGTTATAGTCAAAGCAACATCCAAAGAATTCGACTTGACAATCCGACCCTTGGCTTTCTTGTGGAAAAGAAATTGTCATTGCTACTCAATTTGTACCCTTACTTGAGTTATATCAACACCCATAACATGCCTCTCAATTATGCACTTTTACAAGGACCTATTCTCTCTCCCTCTCTCTCTATCCGCCATGGAAGAAGAGCTTGTTTGATTATGGCCGATGAACCACCTGAGTTCATCAGGATGGAGGGCTATCGCAGCATAGACTGGAATATGGAAGAACAACTATCCTCCGGTGATGGCCTGAGCAGTGAAAAGATATGCTCCGCCGTGCAAAAGGGTTGCAGTCTGGGGAAGAAGCTTCTCCTCACTGGTTTAGCCATATCCTCTGTTCCTGTGGTTCTTCCTCCATTGGTTATCATGTCAGCCTTTGGAATTGCAGCCTCAATACCCTATGGGGTTTTCCTCGCCTCTTATGCTTGTACTGAGACGATCATGAGTGTTTGGCTTCCGATTCCCCCGGCCCTGAAGCTCGACCGCGCTGACGAAGAGATTGTGGAGGAAGACATCTATGAAGATGAAGAAAAGCATATGATGGAAACAGCTAAGAGGGGTGAGAATTTGGATGATTTTGATATCGATGTGGTAGTAGTTCAGGGGGACGAAGAAAGCGAAACCGACATTGGAAGCAAAGGTCTGGCAGCAATTGAAGTGACCAATGTGGAATTTGAAGGAAATGGAGATATTGGAGATGAGGAAGAAGAGGAAGAAGAGTTGAAGGAAACTAGAGGTTTACTCGAAAGAATCAGGGATGAGGGACGAAGAGACAACGGTTTTGTTGATGAAAATGGAGGTGTTGAGGATGTTCGAGAGCTCGAGATTTCAATGGAGGACGAGAAACCAAGTGATTCTGTTGAAAAAAGTGTTCTAGGTTTGTTGAATGAAGTTGACTCTGCTGCTGTTTATCCTCATGAACACTACAGAACTTCTGAAGGGGTCGGGTCGGCGAAATCTAGTGATGAAACAAATGCAATAACGACATTGACAAACAAGGCTGCCAAGTCTGAAGAAGCTGAGCAACTTCCTAAAGTAACAATGATTGATGTGATTGAATCTGATGAAGGTTTGTCTATATCAGAGGTGACTATTGAACACAAAGTTGAGGCAAATTTTCCACATAAAGATCATAGGACGTCTTCCAATGAGGAACTATCAGGAGAGGTAAAGATAAGAGAAAAGATTGCTTCGATGAAGAAGATCGTAGGATACAAGGCTACCCCCCTCGGAACATACTTAGACGAAGTGAACGCTCTATATGCCTTCATCGGAGTCGAGCCACCTTCCCCGATGAAAGATTCTGCTAATGATGATGATATCAATCTACTTAACCAGAAGTTGCAGTTTCTCATGTCCATAGTAGGGGTCAAGTAGGGGTGTTAAAAGTTTGGCATTGAATAAGTTGCCATTTGTGCACTTTTGTTTGCCAATGGCAATATTTCCTTATGTTTGAGCTATGCACAGTTTGGTTTTGAAATTATGTACCAAATTTTGAACAGTTTATCTACTTTTTTCCCTCTTTCTGTTCAGTTCTTTTCATCCACAAGGGTTCTTGAATCACTCATGGGTGTTAGGAAGTGAAAAATCCAAGAGATCTATGAAATTTATAGCTTAAATGATGAAGAGTGATCGGAC

Coding sequence (CDS)

ATGGCCAACTTCTATTATTCAAAACCCAAACTCCATTCATCCACCACCTTTTTATTTCTTCTTTGCCTTCTACTTGCACGCCTACAACTTTCAAGTGCTAATCAGACATTCCCAACAAAAGTGAACCATAGTTATAGTCAAAGCAACATCCAAAGAATTCGACTTGACAATCCGACCCTTGGCTTTCTTGTGGAAAAGAAATTGTCATTGCTACTCAATTTGTACCCTTACTTGAGTTATATCAACACCCATAACATGCCTCTCAATTATGCACTTTTACAAGGACCTATTCTCTCTCCCTCTCTCTCTATCCGCCATGGAAGAAGAGCTTGTTTGATTATGGCCGATGAACCACCTGAGTTCATCAGGATGGAGGGCTATCGCAGCATAGACTGGAATATGGAAGAACAACTATCCTCCGGTGATGGCCTGAGCAGTGAAAAGATATGCTCCGCCGTGCAAAAGGGTTGCAGTCTGGGGAAGAAGCTTCTCCTCACTGGTTTAGCCATATCCTCTGTTCCTGTGGTTCTTCCTCCATTGGTTATCATGTCAGCCTTTGGAATTGCAGCCTCAATACCCTATGGGGTTTTCCTCGCCTCTTATGCTTGTACTGAGACGATCATGAGTGTTTGGCTTCCGATTCCCCCGGCCCTGAAGCTCGACCGCGCTGACGAAGAGATTGTGGAGGAAGACATCTATGAAGATGAAGAAAAGCATATGATGGAAACAGCTAAGAGGGGTGAGAATTTGGATGATTTTGATATCGATGTGGTAGTAGTTCAGGGGGACGAAGAAAGCGAAACCGACATTGGAAGCAAAGGTCTGGCAGCAATTGAAGTGACCAATGTGGAATTTGAAGGAAATGGAGATATTGGAGATGAGGAAGAAGAGGAAGAAGAGTTGAAGGAAACTAGAGGTTTACTCGAAAGAATCAGGGATGAGGGACGAAGAGACAACGGTTTTGTTGATGAAAATGGAGGTGTTGAGGATGTTCGAGAGCTCGAGATTTCAATGGAGGACGAGAAACCAAGTGATTCTGTTGAAAAAAGTGTTCTAGGTTTGTTGAATGAAGTTGACTCTGCTGCTGTTTATCCTCATGAACACTACAGAACTTCTGAAGGGGTCGGGTCGGCGAAATCTAGTGATGAAACAAATGCAATAACGACATTGACAAACAAGGCTGCCAAGTCTGAAGAAGCTGAGCAACTTCCTAAAGTAACAATGATTGATGTGATTGAATCTGATGAAGGTTTGTCTATATCAGAGGTGACTATTGAACACAAAGTTGAGGCAAATTTTCCACATAAAGATCATAGGACGTCTTCCAATGAGGAACTATCAGGAGAGGTAAAGATAAGAGAAAAGATTGCTTCGATGAAGAAGATCGTAGGATACAAGGCTACCCCCCTCGGAACATACTTAGACGAAGTGAACGCTCTATATGCCTTCATCGGAGTCGAGCCACCTTCCCCGATGAAAGATTCTGCTAATGATGATGATATCAATCTACTTAACCAGAAGTTGCAGTTTCTCATGTCCATAGTAGGGGTCAAGTAG

Protein sequence

MANFYYSKPKLHSSTTFLFLLCLLLARLQLSSANQTFPTKVNHSYSQSNIQRIRLDNPTLGFLVEKKLSLLLNLYPYLSYINTHNMPLNYALLQGPILSPSLSIRHGRRACLIMADEPPEFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSVPVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIYEDEEKHMMETAKRGENLDDFDIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGDEEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVEDVRELEISMEDEKPSDSVEKSVLGLLNEVDSAAVYPHEHYRTSEGVGSAKSSDETNAITTLTNKAAKSEEAEQLPKVTMIDVIESDEGLSISEVTIEHKVEANFPHKDHRTSSNEELSGEVKIREKIASMKKIVGYKATPLGTYLDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK
Homology
BLAST of CmoCh10G008470 vs. ExPASy TrEMBL
Match: A0A6J1H9E7 (uncharacterized protein LOC111461753 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111461753 PE=4 SV=1)

HSP 1 Score: 766.9 bits (1979), Expect = 5.3e-218
Identity = 405/405 (100.00%), Postives = 405/405 (100.00%), Query Frame = 0

Query: 114 MADEPPEFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 173
           MADEPPEFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV
Sbjct: 1   MADEPPEFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60

Query: 174 PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 233
           PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY
Sbjct: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120

Query: 234 EDEEKHMMETAKRGENLDDFDIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGD 293
           EDEEKHMMETAKRGENLDDFDIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGD
Sbjct: 121 EDEEKHMMETAKRGENLDDFDIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGD 180

Query: 294 EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVEDVRELEISMEDEKPSDSVEKSVLG 353
           EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVEDVRELEISMEDEKPSDSVEKSVLG
Sbjct: 181 EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVEDVRELEISMEDEKPSDSVEKSVLG 240

Query: 354 LLNEVDSAAVYPHEHYRTSEGVGSAKSSDETNAITTLTNKAAKSEEAEQLPKVTMIDVIE 413
           LLNEVDSAAVYPHEHYRTSEGVGSAKSSDETNAITTLTNKAAKSEEAEQLPKVTMIDVIE
Sbjct: 241 LLNEVDSAAVYPHEHYRTSEGVGSAKSSDETNAITTLTNKAAKSEEAEQLPKVTMIDVIE 300

Query: 414 SDEGLSISEVTIEHKVEANFPHKDHRTSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 473
           SDEGLSISEVTIEHKVEANFPHKDHRTSSNEELSGEVKIREKIASMKKIVGYKATPLGTY
Sbjct: 301 SDEGLSISEVTIEHKVEANFPHKDHRTSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360

Query: 474 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 519
           LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK
Sbjct: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405

BLAST of CmoCh10G008470 vs. ExPASy TrEMBL
Match: A0A6J1J9E9 (uncharacterized protein LOC111484754 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111484754 PE=4 SV=1)

HSP 1 Score: 734.2 bits (1894), Expect = 3.8e-208
Identity = 389/405 (96.05%), Postives = 396/405 (97.78%), Query Frame = 0

Query: 114 MADEPPEFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 173
           MADEPPEFIRMEGYRSIDWN+EEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV
Sbjct: 1   MADEPPEFIRMEGYRSIDWNIEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60

Query: 174 PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 233
           PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEE+VEEDIY
Sbjct: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEMVEEDIY 120

Query: 234 EDEEKHMMETAKRGENLDDFDIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGD 293
           EDEEKHMMETAKRGENLDDFDIDVVVVQG EE ETDIGSKGLAAIEVTNVEFEGNGD GD
Sbjct: 121 EDEEKHMMETAKRGENLDDFDIDVVVVQGGEEGETDIGSKGLAAIEVTNVEFEGNGDNGD 180

Query: 294 EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVEDVRELEISMEDEKPSDSVEKSVLG 353
           EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGV+DVRELEIS+EDEKPSDSVE+SVLG
Sbjct: 181 EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVDDVRELEISIEDEKPSDSVEESVLG 240

Query: 354 LLNEVDSAAVYPHEHYRTSEGVGSAKSSDETNAITTLTNKAAKSEEAEQLPKVTMIDVIE 413
           LLNEVDSAAVYPH  YRTSEGVG AKSSDETNAITTL+NKAAKSEEAEQLPKVTMIDVIE
Sbjct: 241 LLNEVDSAAVYPHADYRTSEGVGWAKSSDETNAITTLSNKAAKSEEAEQLPKVTMIDVIE 300

Query: 414 SDEGLSISEVTIEHKVEANFPHKDHRTSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 473
           SDEGLSIS +TIEHKVEAN PHKDHR SSNEELSGEVKIREKIASMKKIVGYKATPLGTY
Sbjct: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360

Query: 474 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 519
           LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK
Sbjct: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405

BLAST of CmoCh10G008470 vs. ExPASy TrEMBL
Match: A0A6J1JHX4 (uncharacterized protein LOC111484755 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111484755 PE=4 SV=1)

HSP 1 Score: 730.3 bits (1884), Expect = 5.5e-207
Identity = 388/405 (95.80%), Postives = 394/405 (97.28%), Query Frame = 0

Query: 114 MADEPPEFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 173
           MADEPP FIRMEGYRSIDWN+EEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV
Sbjct: 1   MADEPPVFIRMEGYRSIDWNIEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60

Query: 174 PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 233
           PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEE+VEEDIY
Sbjct: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEMVEEDIY 120

Query: 234 EDEEKHMMETAKRGENLDDFDIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGD 293
           EDEEKHMMETAKRGENLDDFDIDVVVVQG EE ETDIGSKGLAAIEVTNVEFEGNGD GD
Sbjct: 121 EDEEKHMMETAKRGENLDDFDIDVVVVQGGEEGETDIGSKGLAAIEVTNVEFEGNGDNGD 180

Query: 294 EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVEDVRELEISMEDEKPSDSVEKSVLG 353
           EE EEEELKETRGLLERIRDEGRRDNGFVDENGGV+DVRELEIS+EDEKPSDSVEKSVLG
Sbjct: 181 EEGEEEELKETRGLLERIRDEGRRDNGFVDENGGVDDVRELEISIEDEKPSDSVEKSVLG 240

Query: 354 LLNEVDSAAVYPHEHYRTSEGVGSAKSSDETNAITTLTNKAAKSEEAEQLPKVTMIDVIE 413
           LLNEVDSAAVYPH  YRTSEGVG AKSSDETNAITTL+NKAAKSEEAEQLPKVTMIDVIE
Sbjct: 241 LLNEVDSAAVYPHADYRTSEGVGWAKSSDETNAITTLSNKAAKSEEAEQLPKVTMIDVIE 300

Query: 414 SDEGLSISEVTIEHKVEANFPHKDHRTSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 473
           SDEGLSIS +TIEHKVEAN PHKDHR SSNEELSGEVKIREKIASMKKIVGYKATPLGTY
Sbjct: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEELSGEVKIREKIASMKKIVGYKATPLGTY 360

Query: 474 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 519
           LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK
Sbjct: 361 LDEVNALYAFIGVEPPSPMKDSANDDDINLLNQKLQFLMSIVGVK 405

BLAST of CmoCh10G008470 vs. ExPASy TrEMBL
Match: A0A6J1HB22 (uncharacterized protein LOC111461753 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111461753 PE=4 SV=1)

HSP 1 Score: 625.2 bits (1611), Expect = 2.5e-175
Identity = 332/332 (100.00%), Postives = 332/332 (100.00%), Query Frame = 0

Query: 114 MADEPPEFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 173
           MADEPPEFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV
Sbjct: 1   MADEPPEFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60

Query: 174 PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 233
           PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY
Sbjct: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 120

Query: 234 EDEEKHMMETAKRGENLDDFDIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGD 293
           EDEEKHMMETAKRGENLDDFDIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGD
Sbjct: 121 EDEEKHMMETAKRGENLDDFDIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGD 180

Query: 294 EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVEDVRELEISMEDEKPSDSVEKSVLG 353
           EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVEDVRELEISMEDEKPSDSVEKSVLG
Sbjct: 181 EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVEDVRELEISMEDEKPSDSVEKSVLG 240

Query: 354 LLNEVDSAAVYPHEHYRTSEGVGSAKSSDETNAITTLTNKAAKSEEAEQLPKVTMIDVIE 413
           LLNEVDSAAVYPHEHYRTSEGVGSAKSSDETNAITTLTNKAAKSEEAEQLPKVTMIDVIE
Sbjct: 241 LLNEVDSAAVYPHEHYRTSEGVGSAKSSDETNAITTLTNKAAKSEEAEQLPKVTMIDVIE 300

Query: 414 SDEGLSISEVTIEHKVEANFPHKDHRTSSNEE 446
           SDEGLSISEVTIEHKVEANFPHKDHRTSSNEE
Sbjct: 301 SDEGLSISEVTIEHKVEANFPHKDHRTSSNEE 332

BLAST of CmoCh10G008470 vs. ExPASy TrEMBL
Match: A0A6J1JIH1 (uncharacterized protein LOC111484754 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111484754 PE=4 SV=1)

HSP 1 Score: 592.4 bits (1526), Expect = 1.8e-165
Identity = 316/332 (95.18%), Postives = 323/332 (97.29%), Query Frame = 0

Query: 114 MADEPPEFIRMEGYRSIDWNMEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 173
           MADEPPEFIRMEGYRSIDWN+EEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV
Sbjct: 1   MADEPPEFIRMEGYRSIDWNIEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSV 60

Query: 174 PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIY 233
           PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEE+VEEDIY
Sbjct: 61  PVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLPIPPALKLDRADEEMVEEDIY 120

Query: 234 EDEEKHMMETAKRGENLDDFDIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGD 293
           EDEEKHMMETAKRGENLDDFDIDVVVVQG EE ETDIGSKGLAAIEVTNVEFEGNGD GD
Sbjct: 121 EDEEKHMMETAKRGENLDDFDIDVVVVQGGEEGETDIGSKGLAAIEVTNVEFEGNGDNGD 180

Query: 294 EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVEDVRELEISMEDEKPSDSVEKSVLG 353
           EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGV+DVRELEIS+EDEKPSDSVE+SVLG
Sbjct: 181 EEEEEEELKETRGLLERIRDEGRRDNGFVDENGGVDDVRELEISIEDEKPSDSVEESVLG 240

Query: 354 LLNEVDSAAVYPHEHYRTSEGVGSAKSSDETNAITTLTNKAAKSEEAEQLPKVTMIDVIE 413
           LLNEVDSAAVYPH  YRTSEGVG AKSSDETNAITTL+NKAAKSEEAEQLPKVTMIDVIE
Sbjct: 241 LLNEVDSAAVYPHADYRTSEGVGWAKSSDETNAITTLSNKAAKSEEAEQLPKVTMIDVIE 300

Query: 414 SDEGLSISEVTIEHKVEANFPHKDHRTSSNEE 446
           SDEGLSIS +TIEHKVEAN PHKDHR SSNEE
Sbjct: 301 SDEGLSISAMTIEHKVEANSPHKDHRMSSNEE 332

BLAST of CmoCh10G008470 vs. TAIR 10
Match: AT5G36100.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G65090.3); Has 57 Blast hits to 49 proteins in 15 species: Archae - 0; Bacteria - 4; Metazoa - 6; Fungi - 0; Plants - 39; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 87.0 bits (214), Expect = 4.7e-17
Identity = 100/371 (26.95%), Postives = 157/371 (42.32%), Query Frame = 0

Query: 154 QKGCSLGKKLLLTGLAISSVPVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLP 213
           +KG S+GKK+L     + S P ++P LV+ S   + +S+PY  FL SY CTE +M   LP
Sbjct: 16  RKGVSVGKKVLAACFLVFSAPFLVPALVVASTIALISSLPYCFFLVSYVCTEKLMRKLLP 75

Query: 214 IPPALKLDRADEEIVEEDIYEDEEKHMMETAKRGENLDDFDIDV-----VVVQGDEESET 273
                   R D E+V   +++++  H       G+  D+    V     V+VQ +EE+  
Sbjct: 76  ANAF--SGRCDHEMV---LHQNKISH-------GDIYDEAVARVAISEPVLVQIEEETTI 135

Query: 274 DIGSKGLAAIEVTNVEFEGNGDIGDEEEEEEELKETRGLLERIRDEGRRDNGFVDENGGV 333
            I  +                      E+E+  KE +  LE IRDEG+ +          
Sbjct: 136 AIAYR----------------------EDEDMTKELKSWLESIRDEGKNNQSL------- 195

Query: 334 EDVRELEISMEDEKPSDSVEKSVLGLLNEVDSAAVYPHEHYRTSEGVGSAKSSDETNAIT 393
                                                   YR   GV   K  +E +   
Sbjct: 196 ----------------------------------------YR---GVILEKGFEEEDKDQ 255

Query: 394 TLTNKAAKSEEAEQLPKVTMIDVI-ESDEGLSISEVTIEHKVEANFPHKDHRTSSNEELS 453
           ++  + AKSE      +  + D++ +  E ++I E  +E         KD   SS   L 
Sbjct: 256 SIVPRDAKSENV----RAKLEDLLGKKQESVTIHEGELESTTSKTSREKDMEISSTTVLY 295

Query: 454 GEVKIREKIASMKKIVGYKATPLGTYLDEVNALYAFIGVEPPSPMKDSANDDDINLLNQK 513
            E +I  KI +++K+VGY  T   TY +E+ ALY F GVE P+    +  + DI  +++ 
Sbjct: 316 SEEQIWTKIEALRKVVGYNVTRSTTYSEELKALYMFTGVELPT---STLENQDIAKVSEG 295

Query: 514 LQFLMSIVGVK 519
           L FLMS++G+K
Sbjct: 376 LSFLMSVIGIK 295

BLAST of CmoCh10G008470 vs. TAIR 10
Match: AT1G65090.3 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G36100.1). )

HSP 1 Score: 86.3 bits (212), Expect = 8.0e-17
Identity = 108/387 (27.91%), Postives = 162/387 (41.86%), Query Frame = 0

Query: 134 MEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSVPVVLPPLVIMSAFGIAASIP 193
           MEE  S+    S +K      K  S+GKK+L  G+ +SS P+++P L + S     +S+P
Sbjct: 5   MEEYQSNE---SEDKRSWIWSKAVSVGKKVLTAGVVVSSAPLLVPSLFVASTLAFLSSVP 64

Query: 194 YGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIYEDEEKHMMETAKRGENLDDF 253
           + +FLA+YACT+ +MS  LP           EE       +D+E    E +K G      
Sbjct: 65  FCLFLANYACTQKVMSTLLP---------DTEETGGVGKEDDDESGFDEYSKIGHG---- 124

Query: 254 DIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGDEEEEEEELKETRGLLERIRD 313
                      E    +G   L   +   +  +        +E+EE  KE+  LLE+IRD
Sbjct: 125 -----------EGAAGVGEAALFRGKEEPIPIQ-------VKEDEEMAKESTSLLEKIRD 184

Query: 314 EGRRDNGFVDENGGVEDVRELEISMEDEKPSDSVEKSVLGLLNEVDSAAVYPHEHYRTSE 373
           EGR D                E +++D+K S +                           
Sbjct: 185 EGRTDK------------ETSERTLQDDKKSGN--------------------------- 244

Query: 374 GVGSAKSSDETNAITTLTNKAAKSEEAEQLPKVTMIDVIESDEGLSISEVTIEHKVEANF 433
                                AKSEE ++ P+       E+ E     E T   K+E + 
Sbjct: 245 ---------------------AKSEEVQEQPEKR-----EAPETRREGE-TGATKIETST 287

Query: 434 PHKDHRTSSNEELSGEVKIREKIASMKKIVGYKATPLGTYLDEVNALYAFIG-VEPPSPM 493
              D   SSNE  S E ++ E + +++K+VGY      T  +E+ ALY F G VEPP   
Sbjct: 305 GKDDEEISSNEVYS-EEQLWETMETLRKVVGYSVARSATCAEELKALYVFTGVVEPP--- 287

Query: 494 KDSANDD--DINLLNQKLQFLMSIVGV 518
           + S N D  DI  L  +L+FLMS++G+
Sbjct: 365 RSSLNQDTYDIAHLTIRLRFLMSVIGI 287

BLAST of CmoCh10G008470 vs. TAIR 10
Match: AT1G65090.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G36100.1); Has 1234 Blast hits to 904 proteins in 178 species: Archae - 0; Bacteria - 58; Metazoa - 431; Fungi - 95; Plants - 83; Viruses - 38; Other Eukaryotes - 529 (source: NCBI BLink). )

HSP 1 Score: 76.3 bits (186), Expect = 8.3e-14
Identity = 110/413 (26.63%), Postives = 171/413 (41.40%), Query Frame = 0

Query: 134 MEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSVPVVLPPLVIMSAFGIAASIP 193
           MEE  S+    S +K      K  S+GKK+L  G+ +SS P+++P L + S     +S+P
Sbjct: 5   MEEYQSNE---SEDKRSWIWSKAVSVGKKVLTAGVVVSSAPLLVPSLFVASTLAFLSSVP 64

Query: 194 YGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIYEDEEKHMMETAKRGENLDDF 253
           + +FLA+YACT+ +MS  LP           EE       +D+E    E +K G      
Sbjct: 65  FCLFLANYACTQKVMSTLLP---------DTEETGGVGKEDDDESGFDEYSKIGHG---- 124

Query: 254 DIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGDEEEEEEELKETRGLLERIRD 313
                      E    +G   L   +   +  +        +E+EE  KE+  LLE+IRD
Sbjct: 125 -----------EGAAGVGEAALFRGKEEPIPIQ-------VKEDEEMAKESTSLLEKIRD 184

Query: 314 EGRRDNGFVDE---------NGGVEDVREL-------EISMEDEKPSDSVEKSVLGLLNE 373
           EGR D    +          N   E+V+E        E   E E  +  +E S      E
Sbjct: 185 EGRTDKETSERTLQDDKKSGNAKSEEVQEQPEKREAPETRREGETGATKIETSTGKDDEE 244

Query: 374 VDSAAVYPHEHYRTSEGVGSAKSSDET---NAITTLTNKAAKSEEAEQLPKVTMIDVIES 433
           + S    P +    ++G G  K  + T          N+  K             D++E 
Sbjct: 245 ISSNE--PIDQASGAQGTGEEKRKNTTKKKKKTGRAGNRFLKCHTWSSSKLCGRCDLLEC 304

Query: 434 --DEGLSISEVTIEHKVEANFPHKDHRTSS-----NEELSGEVKIREKIASMKKIVGYKA 493
             D    +    I     +       + S      N ++  E ++ E + +++K+VGY  
Sbjct: 305 CFDRVDCVVRRVITCSALSLISEASVKMSRICMVLNLQVYSEEQLWETMETLRKVVGYSV 364

Query: 494 TPLGTYLDEVNALYAFIG-VEPPSPMKDSANDD--DINLLNQKLQFLMSIVGV 518
               T  +E+ ALY F G VEPP   + S N D  DI  L  +L+FLMS++G+
Sbjct: 365 ARSATCAEELKALYVFTGVVEPP---RSSLNQDTYDIAHLTIRLRFLMSVIGI 378

BLAST of CmoCh10G008470 vs. TAIR 10
Match: AT1G65090.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G36100.1); Has 1435 Blast hits to 1033 proteins in 192 species: Archae - 0; Bacteria - 61; Metazoa - 511; Fungi - 123; Plants - 100; Viruses - 42; Other Eukaryotes - 598 (source: NCBI BLink). )

HSP 1 Score: 58.5 bits (140), Expect = 1.8e-08
Identity = 57/185 (30.81%), Postives = 85/185 (45.95%), Query Frame = 0

Query: 134 MEEQLSSGDGLSSEKICSAVQKGCSLGKKLLLTGLAISSVPVVLPPLVIMSAFGIAASIP 193
           MEE  S+    S +K      K  S+GKK+L  G+ +SS P+++P L + S     +S+P
Sbjct: 5   MEEYQSNE---SEDKRSWIWSKAVSVGKKVLTAGVVVSSAPLLVPSLFVASTLAFLSSVP 64

Query: 194 YGVFLASYACTETIMSVWLPIPPALKLDRADEEIVEEDIYEDEEKHMMETAKRGENLDDF 253
           + +FLA+YACT+ +MS  LP           EE       +D+E    E +K G      
Sbjct: 65  FCLFLANYACTQKVMSTLLP---------DTEETGGVGKEDDDESGFDEYSKIGHG---- 124

Query: 254 DIDVVVVQGDEESETDIGSKGLAAIEVTNVEFEGNGDIGDEEEEEEELKETRGLLERIRD 313
                      E    +G   L   +   +  +        +E+EE  KE+  LLE+IRD
Sbjct: 125 -----------EGAAGVGEAALFRGKEEPIPIQ-------VKEDEEMAKESTSLLEKIRD 155

Query: 314 EGRRD 319
           EGR D
Sbjct: 185 EGRTD 155

BLAST of CmoCh10G008470 vs. TAIR 10
Match: AT5G36100.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G65090.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 50.8 bits (120), Expect = 3.7e-06
Identity = 69/247 (27.94%), Postives = 112/247 (45.34%), Query Frame = 0

Query: 154 QKGCSLGKKLLLTGLAISSVPVVLPPLVIMSAFGIAASIPYGVFLASYACTETIMSVWLP 213
           +KG S+GKK+L     + S P ++P LV+ S   + +S+PY  FL SY CTE +M   LP
Sbjct: 16  RKGVSVGKKVLAACFLVFSAPFLVPALVVASTIALISSLPYCFFLVSYVCTEKLMRKLLP 75

Query: 214 IPPALKLDRADEEIVEEDIYEDEEKHMMETAKRGENLDDFDIDV-----VVVQGDEESET 273
                   R D E+V   +++++  H       G+  D+    V     V+VQ +EE+  
Sbjct: 76  ANAF--SGRCDHEMV---LHQNKISH-------GDIYDEAVARVAISEPVLVQIEEETTI 135

Query: 274 DIGSKGLAAIEVTNVEFEGNGDIGDEEEEEEELKETRGLLERIRDEGRRD----NGFVDE 333
            I  +                      E+E+  KE +  LE IRDEG+ +     G + E
Sbjct: 136 AIAYR----------------------EDEDMTKELKSWLESIRDEGKNNQSLYRGVILE 195

Query: 334 NGGVEDVRELEISMEDEKPSDSVEKSVLGLLNEVDSAAVYPHEHYRTSEGVGSAKSSDET 392
            G  E+ ++  I   D K S++V   +  LL +    +V  HE    S    +++  D  
Sbjct: 196 KGFEEEDKDQSIVPRDAK-SENVRAKLEDLLGK-KQESVTIHEGELESTTSKTSREKDME 226

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1H9E75.3e-218100.00uncharacterized protein LOC111461753 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1J9E93.8e-20896.05uncharacterized protein LOC111484754 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1JHX45.5e-20795.80uncharacterized protein LOC111484755 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1HB222.5e-175100.00uncharacterized protein LOC111461753 isoform X2 OS=Cucurbita moschata OX=3662 GN... [more]
A0A6J1JIH11.8e-16595.18uncharacterized protein LOC111484754 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... [more]
Match NameE-valueIdentityDescription
AT5G36100.14.7e-1726.95unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT1G65090.38.0e-1727.91unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT1G65090.28.3e-1426.63unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT1G65090.11.8e-0830.81unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT5G36100.23.7e-0627.94unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 294..314
NoneNo IPR availablePANTHERPTHR37198NUCLEOLINcoord: 136..283
coord: 335..518

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh10G008470.1CmoCh10G008470.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016020 membrane