Cp4.1LG19g00090 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG19g00090
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionPlant protein 1589 of unknown function
LocationCp4.1LG19: 138670 .. 148864 (+)
RNA-Seq ExpressionCp4.1LG19g00090
SyntenyCp4.1LG19g00090
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TACACAGTTGTTGATCGCACAGTGCTCCTTTAGCTCGCCCTCTTCTCTTCCACAATGTAAAGCGAAAATTCCTGCTTTTCACCTCTTTCTCTCTCTAGGATGATAAATATTCCCATTTTTTTGCTGTGAAGATCTCCACTTCTCTTGGAATCAACCTACTAACTGGTAACTTCGATGAATTTTGCTTGTTTTCTCTACTCTTTTTCTTTTTGGCGTTTTCGATCCCTTGAATTTTGATTGTCTGCATATTTTATGCCTTCGAGGGTGCTGAGAGATAATAAAGACATAATTAAGCTTTCCAATGGTGCGTTTTTGTTAGGGTTTCGTTCTGTTCTGGATTTTATCCCGTTTGTGTGAATTGGGATTGGATGAAATTGGCTCCTCCTCGGGCGATCTTTCTGTTCGTGTTCTTGCGTGGGAACTGTAGCTTGTAATGGCTTTTTGTTGGCTGTGATTGGTGTTGAATTGTTAGCTGGAGTGGTGTCGAGGGCTGATTGTGACATTAAATGTTAAGATTAGGGTATAATTTGTGTTTGAGCAGTTCTGACATCACCTTGTTGCCGCCATTTCTTGCTACTTGTTTTGATTTTTTTAATTGGATATCTGTTTGGGATTGGGATCTTTATTTGTTTAATAAAATGCTGCTTAGCATTTTGTGATTTAAACTACATCACTATTTTTTAATTCTTATAAACTACATCAGTATCAATTGTTGTGATTTAAATGTGACTTCTTCATGTAAAGTTGTACTGTGAGTATACTTCAAAGTTTATGCTAGTTGCTGCTATCAATTTCTGTTGCCCAATTTTGTAAATTGTGACTTTTCTTGTGTTCCTTAATAAACTAAATAGAGGGCTCAAAAATTGGAAATTTTAATGGTTTTCCCCGTCATTAGGAAACACGAATTTTGTAAAAGGGTTGAAATACTAAAGTTTCCTGCATAAATAAGGCTTTCTTGACTGCTTTTGATGTATGTATCGATCTACTTCTCATTCGTTTGGTGTAAGATTCTTAGTTGTTTATGGTTGTTTACTAGATGCGAGGGTGGAATATTTGAGGCGAATACGATCTGCCAGTTGATTGAACAATTCAATTCTTATGCACTGGATTTGGAAACATGTCAACCGGAACTGCTAGACGGGTCTCACGTCAAGATATTCAACTGGTAAGCTGGTTTTTGAACTTACAAACACTTCACGACCCGCACATTCGTTCTAAATTGCTCATTTTTCCTTCTCTTTTCCAACCCAATAGATCCCAGGGAATTGGCTGAAAAGAAACTGATGCTCAAAAGCTAGATAACTGAATATTAGTTGTAGAAATAGATGACAGAGAAGTTCTAGACCATCATAAATAGTTTGGCCATGATCTTGAGGAACTTGTGAGGTTAGATGAATTGACTTTATAGGCTAATGAAACAAATTCCTTAATTTCCTTTCGGGATCGTGTTGTTGGTGTTGTCATATGAGATATTTGTGCCTGTACATGCCAATCTATGCTACATTTAGCCACTCAAAGTTGAAGTTACGCTGAAACATTTTGTCAGCCATGTATGCTATGAAGCTTGAATAATCTTTCTAGGCAAGTTGTCCGTGTCTATCAGTGACGTAATGGACATCCCGAAACTCTCACATTTGTTGGAAAGCTAGAGAATTGGATCTTTATTTTATCTAAAAGAATATTGTTTCATGAACATGACAAAAAAATATTAGAGAAGTGAAATAGCTTAGAACGAAAAATGTCGTGCTTATGTTTATATTTTAAGGTGACTAATATCATCCAGCTATTTATGTTTTTATTTTAGGTGCGAAGTCTTATAGAGCGATGCCTTCAGCTTGATATGAACCGAAAAGAAGTTGTGGAAACACTTTTGAATCATGAAAAAATTGACCCTAGTTTCACAGAGCATGGTAATCATGTATCAAATTTTTATTTCCATACATATTAATGGTCCCTTGTTTACTTGTTTAATCTGTGGGTGATAGATTTATGATATTTATCATAGTTTTGGTCACTTGTAGGCATTTGTGACTTTGCTTTTGGGACATAGTTGTTCTTACTTGTACACTCATCATCTTGTTCAAATAATCATTGATCAAGTCATGCGAGTATCTGAGTGGTCGAGCATTCAATCAACATTCTATTGAATGGATTATTGGATTTATTTTATTTTATTTTAATCTTAAAAAGATAAATTCTTTCTTGTTGATAAGCTCTTGTTACTCGTGTAATAGACCATTCTATTTCCTGACTATTTCCACTACACTAAGAATTGGCTCTTATCCATGCCAATTCCTCTTCTTTGAATCATACTGTGTGAGATCCCACATTGGTTGGAGAGGGGAACGAAGCATTCTTTTTAAGGGTGTGGAAACCTCTCCCTGGCTGACATGTTTTAAAATCTTGAAGGCAAGCTAGGAAGGAAAAGCCCAAAGAGAACAATATCTGCTAGCGGTGGGTTTGGACGGTTACACACTGCCAGTCTGTTCAAGCTCACTATTTCATTATTGGTACCTAAGGAATTCGTTGACATGGCCGAGCTTTAGGAATGACTAGAATACCATGTTAAAACTACCTTTAGACCTAAAAGTTTAAGCTAATGGGTTAGATCCCACATGGGTTGAAGCGAGGTACGAAACATTTCTTATAAGGATATGGAAACCTCTTCCTAGGAGATGTGTAATGGGGAGACCCATCAGGAAAAGCCCAAAAACCTTGAGAGGAAGCCCAGAAGGGAAAGCCCAAAGAGGACAATATTTGGATGAGTACTGGCTCTGTTGTGTAACTCTAAAAGACGTGTTTTAAACCTATGAGGCTGATGATGATACATAACGGCCCAAAGCGAACAATATCTGTTAGTGGTGGGCTTGGGCTGTTGCACGAAGAGAAAACAACCTATATGTCATATGTATTGCCTAGGTTAATAGCTAGGAAACTACGAGAGATAAGTATTTTAAAGGACAATTGAAACCTCCACATATCTCCACAATGGTATGAAACTGTCCACGGCTTTGCTTTTGGGTGCACCCAAAAGGCCTCATGGAGATTTGATTTTAAGTTTTGGTTCTTGCCGTGTTGAAGCTCAAAAATTTTCCTTTTCAATGTCTTTATATTGAATGGTTGTGCTCTAAGTCTCTAAGGCATGTTTGAAGATGCCTGTTGGTTAATAAAGATGGTTGTTTCACAATGTAACATCTTTCAAACCATTCAAGCAATTTTTCATTGAAGATTCCTAGGTTTTTTTCAAACCAAATGCTCTTTTGGAGTAGCTTTTAATGCCTTGCTTTAGTTTCCAACTTCACTCCCATTCACATTTGTTCAATATTGTCTTTAAAGCCTTTATCCAACGCTCGAGAAAAGGTAAAGATCTTAAATAAATGCTCCCAACACTTGCTATATTGTCAATGAAAGGATGATTTTAGGTGGATTGAAAAGAAAAAATTAGTAAATTTGAGACCCATGGTTGATAAGTGGACATAACAAAGCATACATGTAAGATTGTGAACAGTTTTAGTCCATCCTTTTGTGTTTACTTCCTCAATTGTTGTCAAAACCTTTTGCATTGCTTCTCTTCCAGTTTGGCAGAAGCTTGAAGAGGAGAATCAGGAATTCTTTAATGCATATTATCTGAGACTGATGGTGAAAAGCCAGATTATTGAATTCAACCGATTGCTTGAACAGCAAGCGAGAATGATGCACCAGATACATCCATGTGCTGTGACTGCGTTGTCTAGTTCTAATGGATCCCACGTCCAACCAAGTAAGACTGCTTTCCTCCTTGTTGGGAGCTTCATCTTCATTTCTTACTCGTTGCATTGTTATATATTGGATGTTAGCGTAGCTCTTTTGCTCTTTTGTGACTGAGGACAATTATATTATGTATGTTATTACATGCCTAAGTATGTCTCCTTCATTGTTTCTTAATTTCACAATTAGTGAGACATCTTGAAAGAGGAAGAGTTGCCTCATGGAAATTTTATGCAAAGACACTTTCTGTTCAAACCCCGATGCTTTCTACCTTGTGCTGAAACATCGGTTTTCTTTCTTGGAATTGAGCCTAGTGAGTACCATCTTGGCATTAAAATGAAGGAATTTTCTTAATATTAATCCAAGCCACGTCTCGTTTGTTTCTGAGCAACACTAAGTGATAAGGCAATCCTAATCCTCAGCTCAACGATCCAAGCTTTTGTTGAAGTAAGAACCCTTGAATCAAAGAAACTAACACCTTCTTATACAAAATTTGGATCAACCTAGAGATAAGACTAAAATTAAATTACCATCTAGAAGCAAGATTTGGAACCTTGTTCTAAATTGAGTCACTCCACAATCGAACTTGATCAAGGCTTGATTGAATGCCTCGGATGCAATCTACCACAAGACTTGCTTGAAATTTAGAAATGACAAAGATAATGTTCAAATTTCTAAGGCTGAAAGTCTCAACATGAGATTCATAAAAAATTTGCGTTTTACATACTAATTTGTGGGTATTTATAGTCTCTCCACGGAGGAACTCTCAACTCTTCGAATGGCTACTAAACGAAGTTAGTGGCCATGATTGGTAAGTTACTCCCCACTGCCCTTCAAATGTCCACTAAAATTTCCCACCCTTTTTCCCACTACAAGAAGTTATGGAATTAAATGGAATTAAAATAATAAAATAAAGTTTGCAAATAAATTATCAATCAAACCTCATTCTTCACTTCAATTGTGCAAATATTTACTTATGAAGCTCCAAATGATCCTTTCTCAATTGAATCAGTTTGTAACACATGAAATGAAGGTTGAACTGAGCCTATCCAAATTTGTAGATGAAGAATGAATGTTTGTTGAATCTTCTTGGCCTTGGATCGTGTAATAGGTCTAGTTGGAACATTTAGAACATTATAATCCTTATCACAAAGGAAGCCAAATGAATATGTATTATTGGACTTCATGCCCGAGCATACCTGTTGACTTCATGCCCGAACATGAGCTGTCAGTAAAGCTGGATTCTGTGTAATGCAGTTTGCGAGTTTGGATAATTTGGAAAGAGAGAAATAGGAGGATTTTACAATTATCGGAAAAGGATTGCAATTTAGTTGGGATAAACGTCCTTTTGTTTCATATAGTCAGGGCTCTCTTGATGGGAGTTCCTGTAATTATGTTTAATCATTGGTTTGTCCAAATTGGAGGGAGTTTATCATTATTAGCTTAAATTTCAAGTTTAGGCTTTGAAATTTTAGGATTGTGTCTATTTGGTTGGTCTATGTACATTAAAAAGTGTTTCACCTTTGTGTCAAATAGGTTCTTAAACTTTAAAAAATGTCTCACAAGTCCTTAAACTTTTAATAGGTTCCTTTATTTTCAATTTGACGTCCAATAGGTCTTTGACCTAGTTGACATTTTTTAAAATTCACAGGCTTACTCAACATAAAATTTAAAGTTGAGAAACTCATGAGATGTAAATTTCAATTTTATATCTAATAAGTCAGTTCATTTTTTTTTAAATTTTGAATATGTCCCGGAATGAATGGTTTCTTTCCCTATCCGGGATTAGTCCTGTCCTCAATGCCTTTCTTCTGAGCGCTTGTACAAGTTGTATCTGAAACTACATAAGAAGATAAAGTCCAAAGGAAAACGTGAAACCAAAAACCTCCGAAACTTTGGAAGTCTGAATACGCGCTAAAAGCATTTCCAACTTACAAGAGAGTACATATTTACTCCTGAACCTTAGACTCCATTCCTTGCTTTGAATTTCCTTACTTCTGCTTTCAAACCCCAAGGCCAGTTCTTTAAGTGATTCAGTTCCATTGCATCACCTAACTTAGCCCAACATAGTTAAATGGATAAAATACAGGATATAAATATGTTTTTTTCGATCTGGATAAAGCAGCTTCTCTCCTAAACAAAGGAAGAAACTCTGATTTATGATCATTTTTTTTCTTAAAGAATAGTAGGGTTTACAAGAATATGATTTATCTCAGAATTCCCCCTTTCTCCTTGATTCACAGAGATAAAATGGACTGGAGTCGAGGGAAAAACTTTTGTTTATCATTTCTAGAAAGACCACCCCGTAAACTCTCTCAGTAATCTTTTGTCTTCACGTCTCTTAAAAACATTCTGTCACCTTAATCAATCCATTCTTTTGGAGCCCCTGATGTCGGAGACAGGAGCCTCGGGTGTAGGTCCTGAGGCGAAGGGGTGCACAGAGAACTACCTTCGGTCTCTCTGGCGTGTCTTGCCTTAAATTCTCATTTCATTTATTTAACAAATGTTTTCGATATTTATGGAAAAAGAAAAGGCAAATAGGCTCCGAGGACAGATAGTTGAGCATCCATGATTCATGCATATCAATCATTGAATACTTAATTGATTTAGAATTTTCATTGTGGATGGACGAATATATTGTTAGTGTTTTATTTACAAAGCCACTTTCTATAATTTCTGGATCAGCATTGATTCAGTCACTTCAACTTTAATTGTTACATATTATAGTTCTTGATATTCAAAAACCTTTTTCCTTAATGAACTTGATTCCCGATTCATAAAAAAGGTATCATCAGATTCACTCATGAACTATTGATGTTACTGAATGGTTTTTCATTATCAACAACAAAAAAAGATACTAGAATAACATATTTTTCTCTCTCTAAATTAGTTCTAAAGGTGGAAGATCTTTTATTCACAAGGTATTGGTTCATCTTTGTTTTGATAATATTGTAGTTTCTATTTTGGAGATTCAAATAGAAGAGAAGTATGAGATCCCACATCAATTGGAGATTGGAGAGAGGAGCGAGTGCCAGCAAGGACGATGGGTCTCAAAGGGGGTGGATTGTGAGATCCCATATCGGTTGGAGAGGGGAACATTCTCTATAAGGGTGTGGAAATCTCTCCCTAGCAGACGCGTTTTAAAAACCTTAAGGGGAAACTTGAAAGGAAAAACCCAAAGAGGGCAATATTTGTTAGCAGTGAGCTTAGGCTATTATAAATGGTATCAGAGTCCCAAAGGGGGGTGGATTGTGAGATCTCATATCGGTTGGAGAGGGGAACAAAACATTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAGACACGTTTTAAAAACCTTAAGGGGAAACCAGAAAGGGAAAGCCCATAGAGAACAATATTTGCTAGCGGTGAGCTTAGGCTGTTATAAATGGTATCAGAGCCAGACACTGGGCGGTGTGCCAGCGAGAATGGTGATCCCGAAGGGGGGTGGACGCCGTGCGGCGTGCCAGTGAGGATGCTGGACCTCGAGAGGGGGTGGATTGTGAGATCCCACATCGATTGAAAAGGTAAACAAGTGCCAGAACGTTGGGCCTCGACGTGGGTGAATTGTGAGATCCTACATCGGTTGGAGAGGGGAACAAAGCATTCTTTATAAGGGTGTGGAAACTTTTCTCTAGCAGACGCGTTTTAAAAACCTTGAGGGTAAGCCTGAAAAGGAAAACCTAAAAAGGATAATATCTGCTAGCGGTGGGTGGGTGGGCTGGCTGTTACAAGAAGGTGGAAAGTGGAAAGTAATCTTGTGATGTTAGAGTTTATTCTATGCGTCTTACAAAATGCTTCATGAAATTATTATCTATCAAGTACCCCACGTTGTCAATATGTGATTCATACTGATGCAGATTTTCTATTCCACGGGATCTATAACCATTTTTATATATTGATGATCTTGTGAGGTTTTCATTCCTCAACTGTTATTTGGAGTTTGTCAATTCAATGTACCTTGTTTGAACAGTACCCCAGAGTTGCTACGCACCAGAGCATACAGGACCTGCTCTGAAACAAGATGACATAGACAACCCTGTTGGTGTCAGCTTAGGCAATGCATATTCTAATGGTACCCAACCTGTACACTCATCCATGCACACTGCCGTGGACATGTCTTCTCATGCCAGGAACGATGCTGCACCACAGAGCTCAAATATGGGTCTGTTTCAAGGAATGAATGGAGGGATGATCAAAGTAGAAACTGGATATTCAAACAGTTCTCCCTACATGTTCGGAACAGAGGGCAACGTCCTTGATGCACGTCAATCAATTGGTAATGCATCAGTTGCATCTTTTGCTAGTGTTGATTCCAACACACCATCCTTTAACGAATCGCTACTCGATCCGGATCCCTCTTCATTTGGGTTCATTAATCAAATTACCAGGAATTTTAGTCTCTCAGATCTGACAGCAGACTTTTCTCAGGGTTCAGGTACTTTTTTTTTTCTCATTTCCTATGTCAGTTTAATTTAGAATGAATTAGTCTCTTAATTTTCAGTTCTAAGAGATGTTCGAGAAGATTTCTAAATAAATGAATTCAGAACTTTTTCCCGAGGGAGAGCGAGTGGACCAGGACCATATAGATGTATTATGGGTCTTAACAGCATTACTTTGAACTTCAGATATACTAGAGAGCTATGGCAGATGTCCTTTTTTACCAACAGAAGCTGATAACATCCTTGATACTTGTGAGAATGGAGACCGTCTTGGTAATGATTTTCTATATCCGTTGATCGTTGCTTGTAATATTCAACAGGTGGTTATTGACTTAAAATTTCTTCGTATTTAAAGATGGTAAAGGGTTGGACAGTGTATCGAAAAGCTTGAGTTACGAAGAGTTCAGACACAATTAGCTGGTAAGTTATCTCGTACTTAATCGATCATTTTAAATGTATGCTTTTCTTTGACCGTTCTTTCAAAGTTGGTCCACTGAAATTGTGGGGGCTGCCCATTGATTCTCCAATGTGATGAAATCATGAAAGCACTTTTGTTTGGAATCCTATTATTTTTTACAAGAATAGGCTTCCTTGTGGGTCTACTGAAGTTTTTCATTGAGAAACTGTGTTATTTAAAAGTTTGGGAAAAAAAGACATCATGTTTACGCTTTATAGTTAGGCTGCCACCGTGGGAAGTAGAGGAAAAAAACTTGCGAGTCTGTAGGTTGCACATTCATGCCCATAATAGAATCGGCTGCTTGAGCAGCTGGGTGAATATCCTTGCAAGCTAGGCTGCCACCGTGGGAAGTAGAGGAAAAAAACTTGCGAGTCTGTAGGTTGCACATTCATGCCCATAATAGAATCGGCTGCTTGAGCAGCTGGGTGAATATCCTTGCAAGCTAGGCTGCCACCGTGGGAAGTAGAGGAAAAAAACTTGCGAATCTGTAGGTTGCACATTCATGCCCATAATAGAATCGGCTGCTTGAGCAGCTGGGTGAATATCCTTGCTATTGAATCTGCTTCAGTTAACGAAGAATTTTTAACCATGAAGATGGGAACGGTACACTANAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATCTGGGATTGACTTTCGATCTTGAACGATTCGTTTTCATCAATATACCCAGGTAAATGTAGGATGTACCAAGCAAGTTTAAGATGGACAACCTGCTGTAAACAATAGAGCATAGAAGGAGAATTCGTGACCGACATACCAAGCAGTCTTTACATATACCAACAATATACGAGTTTCCAGCTATAGGATTTTAGGAAATGAGATGTGAATTAGAATGCTTGGCTTCATGAAATGGTGGAAATATTAATCTGTATAATCATGTCAATTAACATGATTTTCTTTTCTTTGTTGCAGTGCAGGAGAAATGTATATTATGTAATGCATCTTCACAGACATTTGGAGCTTTCCAGCTGCTTAGTGATGAACTTTTAAGCAGTTGTATATGATAAAGGCACATGGGTTTGCTGCAACCTGATAATAGAGATGCATCGTATATCATCATTTTGTTGCTATACTTGCACAGATGATTGTACATTGCGTTGTAGGGCTGGAGAAGCTTCTTGAATCTGAAACTTACTGATCATATATGCGCCCGCCCGTTTCACTGACATCGGCGGGTAAGGAATTTTCCTTGAGAACACAGGCTCCTGATTGATTACTCTCAGTGATTCTGAAAACCTTGTAGCTTCCAAGTGATTAAAGTCTCTAAAACTAACCGTTGATGTTTGATATTTGATGTTTTGATTGATGATTGCGGTATTGGAAGTGATTATTGTAATGATCTGATTATTGTGTTGAGTGTAGATTGTAAGAGTGAAATACATCAAGCTTGTTGGCTTTTAGGCTTTTGACCGTACATTGAAGACAATATCTACTAGCAGTAAACTTGGATCATTATATTGTCTCTAGTT

mRNA sequence

TACACAGTTGTTGATCGCACAGTGCTCCTTTAGCTCGCCCTCTTCTCTTCCACAATGTAAAGCGAAAATTCCTGCTTTTCACCTCTTTCTCTCTCTAGGATGATAAATATTCCCATTTTTTTGCTGTGAAGATCTCCACTTCTCTTGGAATCAACCTACTAACTGATGCGAGGGTGGAATATTTGAGGCGAATACGATCTGCCAGTTGATTGAACAATTCAATTCTTATGCACTGGATTTGGAAACATGTCAACCGGAACTGCTAGACGGGTCTCACGTCAAGATATTCAACTGGTGCGAAGTCTTATAGAGCGATGCCTTCAGCTTGATATGAACCGAAAAGAAGTTGTGGAAACACTTTTGAATCATGAAAAAATTGACCCTAGTTTCACAGAGCATGTTTGGCAGAAGCTTGAAGAGGAGAATCAGGAATTCTTTAATGCATATTATCTGAGACTGATGGTGAAAAGCCAGATTATTGAATTCAACCGATTGCTTGAACAGCAAGCGAGAATGATGCACCAGATACATCCATGTGCTGTGACTGCGTTGTCTAGTTCTAATGGATCCCACGTCCAACCAATACCCCAGAGTTGCTACGCACCAGAGCATACAGGACCTGCTCTGAAACAAGATGACATAGACAACCCTGTTGGTGTCAGCTTAGGCAATGCATATTCTAATGGTACCCAACCTGTACACTCATCCATGCACACTGCCGTGGACATGTCTTCTCATGCCAGGAACGATGCTGCACCACAGAGCTCAAATATGGGTCTGTTTCAAGGAATGAATGGAGGGATGATCAAAGTAGAAACTGGATATTCAAACAGTTCTCCCTACATGTTCGGAACAGAGGGCAACGTCCTTGATGCACGTCAATCAATTGGTAATGCATCAGTTGCATCTTTTGCTAGTGTTGATTCCAACACACCATCCTTTAACGAATCGCTACTCGATCCGGATCCCTCTTCATTTGGGTTCATTAATCAAATTACCAGGAATTTTAGTCTCTCAGATCTGACAGCAGACTTTTCTCAGGGTTCAGATATACTAGAGAGCTATGGCAGATGTCCTTTTTTACCAACAGAAGCTGATAACATCCTTGATACTTGTGAGAATGGAGACCGTCTTGATGGTAAAGGGTTGGACAGTGTATCGAAAAGCTTGAGTTACGAAGAGTTCAGACACAATTAGCTGTGCAGGAGAAATGTATATTATGTAATGCATCTTCACAGACATTTGGAGCTTTCCAGCTGCTTAGTGATGAACTTTTAAGCAGTTGTATATGATAAAGGCACATGGGTTTGCTGCAACCTGATAATAGAGATGCATCGTATATCATCATTTTGTTGCTATACTTGCACAGATGATTGTACATTGCGTTGTAGGGCTGGAGAAGCTTCTTGAATCTGAAACTTACTGATCATATATGCGCCCGCCCGTTTCACTGACATCGGCGGGTAAGGAATTTTCCTTGAGAACACAGGCTCCTGATTGATTACTCTCAGTGATTCTGAAAACCTTGTAGCTTCCAAGTGATTAAAGTCTCTAAAACTAACCGTTGATGTTTGATATTTGATGTTTTGATTGATGATTGCGGTATTGGAAGTGATTATTGTAATGATCTGATTATTGTGTTGAGTGTAGATTGTAAGAGTGAAATACATCAAGCTTGTTGGCTTTTAGGCTTTTGACCGTACATTGAAGACAATATCTACTAGCAGTAAACTTGGATCATTATATTGTCTCTAGTT

Coding sequence (CDS)

ATGTCAACCGGAACTGCTAGACGGGTCTCACGTCAAGATATTCAACTGGTGCGAAGTCTTATAGAGCGATGCCTTCAGCTTGATATGAACCGAAAAGAAGTTGTGGAAACACTTTTGAATCATGAAAAAATTGACCCTAGTTTCACAGAGCATGTTTGGCAGAAGCTTGAAGAGGAGAATCAGGAATTCTTTAATGCATATTATCTGAGACTGATGGTGAAAAGCCAGATTATTGAATTCAACCGATTGCTTGAACAGCAAGCGAGAATGATGCACCAGATACATCCATGTGCTGTGACTGCGTTGTCTAGTTCTAATGGATCCCACGTCCAACCAATACCCCAGAGTTGCTACGCACCAGAGCATACAGGACCTGCTCTGAAACAAGATGACATAGACAACCCTGTTGGTGTCAGCTTAGGCAATGCATATTCTAATGGTACCCAACCTGTACACTCATCCATGCACACTGCCGTGGACATGTCTTCTCATGCCAGGAACGATGCTGCACCACAGAGCTCAAATATGGGTCTGTTTCAAGGAATGAATGGAGGGATGATCAAAGTAGAAACTGGATATTCAAACAGTTCTCCCTACATGTTCGGAACAGAGGGCAACGTCCTTGATGCACGTCAATCAATTGGTAATGCATCAGTTGCATCTTTTGCTAGTGTTGATTCCAACACACCATCCTTTAACGAATCGCTACTCGATCCGGATCCCTCTTCATTTGGGTTCATTAATCAAATTACCAGGAATTTTAGTCTCTCAGATCTGACAGCAGACTTTTCTCAGGGTTCAGATATACTAGAGAGCTATGGCAGATGTCCTTTTTTACCAACAGAAGCTGATAACATCCTTGATACTTGTGAGAATGGAGACCGTCTTGATGGTAAAGGGTTGGACAGTGTATCGAAAAGCTTGAGTTACGAAGAGTTCAGACACAATTAG

Protein sequence

MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEENQEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQSCYAPEHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHARNDAAPQSSNMGLFQGMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPDPSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKGLDSVSKSLSYEEFRHN
Homology
BLAST of Cp4.1LG19g00090 vs. NCBI nr
Match: XP_023518640.1 (uncharacterized protein LOC111782088 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 625 bits (1613), Expect = 6.52e-226
Identity = 316/316 (100.00%), Postives = 316/316 (100.00%), Query Frame = 0

Query: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60
           MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN
Sbjct: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60

Query: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQSCYAP 120
           QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQSCYAP
Sbjct: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQSCYAP 120

Query: 121 EHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHARNDAAPQSSNMGLFQ 180
           EHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHARNDAAPQSSNMGLFQ
Sbjct: 121 EHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHARNDAAPQSSNMGLFQ 180

Query: 181 GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD 240
           GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD
Sbjct: 181 GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD 240

Query: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKG 300
           PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKG
Sbjct: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKG 300

Query: 301 LDSVSKSLSYEEFRHN 316
           LDSVSKSLSYEEFRHN
Sbjct: 301 LDSVSKSLSYEEFRHN 316

BLAST of Cp4.1LG19g00090 vs. NCBI nr
Match: XP_022966393.1 (uncharacterized protein LOC111466056 isoform X1 [Cucurbita maxima] >XP_022966394.1 uncharacterized protein LOC111466056 isoform X1 [Cucurbita maxima] >XP_022966395.1 uncharacterized protein LOC111466056 isoform X1 [Cucurbita maxima])

HSP 1 Score: 612 bits (1579), Expect = 9.94e-221
Identity = 310/316 (98.10%), Postives = 312/316 (98.73%), Query Frame = 0

Query: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60
           MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN
Sbjct: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60

Query: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQSCYAP 120
           QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPI QSCYAP
Sbjct: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIHQSCYAP 120

Query: 121 EHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHARNDAAPQSSNMGLFQ 180
           EHTGPALKQDDID PVGVSLGNAYSNGTQPVHS+MHTAVDMSSHARNDAAPQSSNMGLFQ
Sbjct: 121 EHTGPALKQDDIDYPVGVSLGNAYSNGTQPVHSTMHTAVDMSSHARNDAAPQSSNMGLFQ 180

Query: 181 GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD 240
           GMNGGMIKVETGYSNSSPYMFGTEGN+LDARQSIGNASVASFASVDSNTPSFNESLLDPD
Sbjct: 181 GMNGGMIKVETGYSNSSPYMFGTEGNILDARQSIGNASVASFASVDSNTPSFNESLLDPD 240

Query: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKG 300
           PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGK 
Sbjct: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKR 300

Query: 301 LDSVSKSLSYEEFRHN 316
           LDSVS SLSYEEFRHN
Sbjct: 301 LDSVSNSLSYEEFRHN 316

BLAST of Cp4.1LG19g00090 vs. NCBI nr
Match: XP_022925165.1 (uncharacterized protein LOC111432488 isoform X1 [Cucurbita moschata] >XP_022925166.1 uncharacterized protein LOC111432488 isoform X1 [Cucurbita moschata])

HSP 1 Score: 612 bits (1578), Expect = 1.41e-220
Identity = 310/316 (98.10%), Postives = 313/316 (99.05%), Query Frame = 0

Query: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60
           MS+GTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN
Sbjct: 1   MSSGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60

Query: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQSCYAP 120
           QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPI QSCYAP
Sbjct: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIHQSCYAP 120

Query: 121 EHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHARNDAAPQSSNMGLFQ 180
           EHTGPALKQDDIDNPVGVSLGNAYSNGTQPV S+MHTAVDMSSHARNDAAPQSSNMGLFQ
Sbjct: 121 EHTGPALKQDDIDNPVGVSLGNAYSNGTQPVQSTMHTAVDMSSHARNDAAPQSSNMGLFQ 180

Query: 181 GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD 240
           GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD
Sbjct: 181 GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD 240

Query: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKG 300
           PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGK 
Sbjct: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKR 300

Query: 301 LDSVSKSLSYEEFRHN 316
           LDS+SKSLSYEEFRHN
Sbjct: 301 LDSLSKSLSYEEFRHN 316

BLAST of Cp4.1LG19g00090 vs. NCBI nr
Match: XP_038882081.1 (uncharacterized protein LOC120073356 [Benincasa hispida] >XP_038882082.1 uncharacterized protein LOC120073356 [Benincasa hispida])

HSP 1 Score: 585 bits (1508), Expect = 6.59e-210
Identity = 294/316 (93.04%), Postives = 305/316 (96.52%), Query Frame = 0

Query: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60
           MSTGT RR+SRQDIQLVRSLIERCLQLDM+RKEVVE LLNHEKIDPSFTEHVWQKLEEEN
Sbjct: 1   MSTGTVRRISRQDIQLVRSLIERCLQLDMSRKEVVEALLNHEKIDPSFTEHVWQKLEEEN 60

Query: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQSCYAP 120
           QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPI QSCYAP
Sbjct: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIHQSCYAP 120

Query: 121 EHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHARNDAAPQSSNMGLFQ 180
           EH+GP LKQDDID+PVGVSLGNAYSNGTQPV S+MHTAVDMSSHARND APQSSN+GLFQ
Sbjct: 121 EHSGPTLKQDDIDHPVGVSLGNAYSNGTQPVLSTMHTAVDMSSHARNDVAPQSSNVGLFQ 180

Query: 181 GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD 240
           GMNGGMIKVETGYSNSSPYM GTE NVLDARQSIGN SVASFASVDSNTPSFNESLLDPD
Sbjct: 181 GMNGGMIKVETGYSNSSPYMLGTESNVLDARQSIGNTSVASFASVDSNTPSFNESLLDPD 240

Query: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKG 300
           PSSFGFINQITRNFSLSDLTADFSQGSD+LE+Y RCPFLPTEADNILDTCENGDRLDGK 
Sbjct: 241 PSSFGFINQITRNFSLSDLTADFSQGSDMLETYARCPFLPTEADNILDTCENGDRLDGKR 300

Query: 301 LDSVSKSLSYEEFRHN 316
           LD+VS+SLSYE+FRHN
Sbjct: 301 LDNVSESLSYEDFRHN 316

BLAST of Cp4.1LG19g00090 vs. NCBI nr
Match: XP_031740142.1 (uncharacterized protein LOC101204957 isoform X1 [Cucumis sativus] >XP_031740143.1 uncharacterized protein LOC101204957 isoform X1 [Cucumis sativus] >KAE8649610.1 hypothetical protein Csa_012300 [Cucumis sativus])

HSP 1 Score: 582 bits (1500), Expect = 1.09e-208
Identity = 292/316 (92.41%), Postives = 305/316 (96.52%), Query Frame = 0

Query: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60
           MSTGT RR+ RQDIQLVRSLIERCLQLDM+RKEVVETLLN EKIDP FTEHVWQKLEEEN
Sbjct: 1   MSTGTVRRIPRQDIQLVRSLIERCLQLDMSRKEVVETLLNQEKIDPGFTEHVWQKLEEEN 60

Query: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQSCYAP 120
           +EFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGS VQPI QSCYAP
Sbjct: 61  REFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSQVQPIHQSCYAP 120

Query: 121 EHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHARNDAAPQSSNMGLFQ 180
           +HTGP LKQDDID+PVGVS+GNAYSNGTQPVHS++HTAVDMSSH RNDAAPQSSN+GLFQ
Sbjct: 121 KHTGPTLKQDDIDHPVGVSIGNAYSNGTQPVHSTLHTAVDMSSHTRNDAAPQSSNVGLFQ 180

Query: 181 GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD 240
           GMNGGMIKVETGYSNSS YMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD
Sbjct: 181 GMNGGMIKVETGYSNSSHYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD 240

Query: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKG 300
           PSSFGFINQITRNFSLSDLTADFSQGSD+LESYGRCPFLPTEADNILDTCENGDRLD K 
Sbjct: 241 PSSFGFINQITRNFSLSDLTADFSQGSDMLESYGRCPFLPTEADNILDTCENGDRLDSKR 300

Query: 301 LDSVSKSLSYEEFRHN 316
           LD+VS+SLSYE+FRHN
Sbjct: 301 LDNVSESLSYEDFRHN 316

BLAST of Cp4.1LG19g00090 vs. ExPASy TrEMBL
Match: A0A6J1HRZ6 (uncharacterized protein LOC111466056 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111466056 PE=4 SV=1)

HSP 1 Score: 612 bits (1579), Expect = 4.81e-221
Identity = 310/316 (98.10%), Postives = 312/316 (98.73%), Query Frame = 0

Query: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60
           MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN
Sbjct: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60

Query: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQSCYAP 120
           QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPI QSCYAP
Sbjct: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIHQSCYAP 120

Query: 121 EHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHARNDAAPQSSNMGLFQ 180
           EHTGPALKQDDID PVGVSLGNAYSNGTQPVHS+MHTAVDMSSHARNDAAPQSSNMGLFQ
Sbjct: 121 EHTGPALKQDDIDYPVGVSLGNAYSNGTQPVHSTMHTAVDMSSHARNDAAPQSSNMGLFQ 180

Query: 181 GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD 240
           GMNGGMIKVETGYSNSSPYMFGTEGN+LDARQSIGNASVASFASVDSNTPSFNESLLDPD
Sbjct: 181 GMNGGMIKVETGYSNSSPYMFGTEGNILDARQSIGNASVASFASVDSNTPSFNESLLDPD 240

Query: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKG 300
           PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGK 
Sbjct: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKR 300

Query: 301 LDSVSKSLSYEEFRHN 316
           LDSVS SLSYEEFRHN
Sbjct: 301 LDSVSNSLSYEEFRHN 316

BLAST of Cp4.1LG19g00090 vs. ExPASy TrEMBL
Match: A0A6J1EH54 (uncharacterized protein LOC111432488 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111432488 PE=4 SV=1)

HSP 1 Score: 612 bits (1578), Expect = 6.84e-221
Identity = 310/316 (98.10%), Postives = 313/316 (99.05%), Query Frame = 0

Query: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60
           MS+GTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN
Sbjct: 1   MSSGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60

Query: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQSCYAP 120
           QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPI QSCYAP
Sbjct: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIHQSCYAP 120

Query: 121 EHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHARNDAAPQSSNMGLFQ 180
           EHTGPALKQDDIDNPVGVSLGNAYSNGTQPV S+MHTAVDMSSHARNDAAPQSSNMGLFQ
Sbjct: 121 EHTGPALKQDDIDNPVGVSLGNAYSNGTQPVQSTMHTAVDMSSHARNDAAPQSSNMGLFQ 180

Query: 181 GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD 240
           GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD
Sbjct: 181 GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD 240

Query: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKG 300
           PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGK 
Sbjct: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKR 300

Query: 301 LDSVSKSLSYEEFRHN 316
           LDS+SKSLSYEEFRHN
Sbjct: 301 LDSLSKSLSYEEFRHN 316

BLAST of Cp4.1LG19g00090 vs. ExPASy TrEMBL
Match: A0A1S3C1F9 (uncharacterized protein LOC103495809 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103495809 PE=4 SV=1)

HSP 1 Score: 581 bits (1498), Expect = 1.07e-208
Identity = 290/316 (91.77%), Postives = 304/316 (96.20%), Query Frame = 0

Query: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60
           MSTGT RR+ RQDIQLVRSLIERCLQLDM+RKEVVETLLN EKIDP FTEHVWQKLEEEN
Sbjct: 1   MSTGTVRRIPRQDIQLVRSLIERCLQLDMSRKEVVETLLNQEKIDPGFTEHVWQKLEEEN 60

Query: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQSCYAP 120
           +EFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPI QSCYAP
Sbjct: 61  REFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIHQSCYAP 120

Query: 121 EHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHARNDAAPQSSNMGLFQ 180
           EHTGP LKQDDID+PVGVS+GN YSNGTQPVHS++HTAVDMSSH RNDAAPQ+SN+GLFQ
Sbjct: 121 EHTGPTLKQDDIDHPVGVSIGNVYSNGTQPVHSTLHTAVDMSSHTRNDAAPQTSNVGLFQ 180

Query: 181 GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD 240
           GMNGGMIKVETGYSNSS YMFGTEGNVLDA QSIGNASVASFASVDSNTPSFNESLLDPD
Sbjct: 181 GMNGGMIKVETGYSNSSHYMFGTEGNVLDAHQSIGNASVASFASVDSNTPSFNESLLDPD 240

Query: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKG 300
           PSSFGFINQITRNFSLSDLTADFSQGSD+LESYGRCPFLPTEADNI+DTCENGDRLD K 
Sbjct: 241 PSSFGFINQITRNFSLSDLTADFSQGSDMLESYGRCPFLPTEADNIIDTCENGDRLDSKR 300

Query: 301 LDSVSKSLSYEEFRHN 316
           LD+VS+SLSYE+FRHN
Sbjct: 301 LDNVSESLSYEDFRHN 316

BLAST of Cp4.1LG19g00090 vs. ExPASy TrEMBL
Match: A0A6J1DI93 (uncharacterized protein LOC111021330 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111021330 PE=4 SV=1)

HSP 1 Score: 575 bits (1483), Expect = 2.06e-206
Identity = 288/316 (91.14%), Postives = 303/316 (95.89%), Query Frame = 0

Query: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60
           MS+GT RRVSRQDIQLVRSLIERC+QLDM+RKEV ETLLNHE IDPSFTE VWQKLEEEN
Sbjct: 1   MSSGTVRRVSRQDIQLVRSLIERCIQLDMSRKEVAETLLNHENIDPSFTEQVWQKLEEEN 60

Query: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQSCYAP 120
           QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQP+ QSCYAP
Sbjct: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPVHQSCYAP 120

Query: 121 EHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHARNDAAPQSSNMGLFQ 180
           EHTGPALKQDDID+PVGVSLGNAYSNGTQPVHS+MHTAVDMSSH  NDAAPQSSN+GLFQ
Sbjct: 121 EHTGPALKQDDIDHPVGVSLGNAYSNGTQPVHSTMHTAVDMSSHGMNDAAPQSSNVGLFQ 180

Query: 181 GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD 240
           G+NGGMIK+ETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD
Sbjct: 181 GINGGMIKLETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD 240

Query: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKG 300
           PSSFGFI+ ITRNFSLSDLTA+FSQGSDIL+SY RCPFLP EADNILD CENGDRLD K 
Sbjct: 241 PSSFGFISPITRNFSLSDLTAEFSQGSDILDSYARCPFLPPEADNILDNCENGDRLDSKR 300

Query: 301 LDSVSKSLSYEEFRHN 316
           LD+VS+SLSYE+FRHN
Sbjct: 301 LDTVSESLSYEDFRHN 316

BLAST of Cp4.1LG19g00090 vs. ExPASy TrEMBL
Match: A0A6J1F3A3 (uncharacterized protein LOC111439400 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439400 PE=4 SV=1)

HSP 1 Score: 558 bits (1437), Expect = 2.11e-199
Identity = 282/316 (89.24%), Postives = 296/316 (93.67%), Query Frame = 0

Query: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60
           MST T  RVSRQDIQ VRSLIERCLQLDMNRKEVVETLLNHEKIDP FTEHVWQKLEEEN
Sbjct: 1   MSTRTVGRVSRQDIQFVRSLIERCLQLDMNRKEVVETLLNHEKIDPGFTEHVWQKLEEEN 60

Query: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQSCYAP 120
           QEFFNAYYLRLM+KSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPI Q+CYAP
Sbjct: 61  QEFFNAYYLRLMMKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIHQNCYAP 120

Query: 121 EHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHARNDAAPQSSNMGLFQ 180
           EHTGPALKQDD D+PVGVSLGN Y+NGTQPVHS++HT VD+ SHARNDAAPQSSN+GLFQ
Sbjct: 121 EHTGPALKQDDTDHPVGVSLGNVYTNGTQPVHSTIHTTVDLPSHARNDAAPQSSNVGLFQ 180

Query: 181 GMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNESLLDPD 240
           GMNGG+IKVETGYSNS PYMFGTE NVLDARQSIGNASVASFASVDSNT SFNESLLD D
Sbjct: 181 GMNGGIIKVETGYSNSPPYMFGTEDNVLDARQSIGNASVASFASVDSNTSSFNESLLDTD 240

Query: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGDRLDGKG 300
           PSSFGFINQITRNFSLSDLTADFSQGSDILESY RCP LPTEADNI+DT ENGDRLD K 
Sbjct: 241 PSSFGFINQITRNFSLSDLTADFSQGSDILESYARCPLLPTEADNIIDTHENGDRLDSKR 300

Query: 301 LDSVSKSLSYEEFRHN 316
           LD+VS+SLS+E+ RHN
Sbjct: 301 LDNVSESLSFEDSRHN 316

BLAST of Cp4.1LG19g00090 vs. TAIR 10
Match: AT3G10250.1 (Plant protein 1589 of unknown function )

HSP 1 Score: 323.6 bits (828), Expect = 1.8e-88
Identity = 170/320 (53.12%), Postives = 237/320 (74.06%), Query Frame = 0

Query: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60
           MS+GT RRVSRQDIQLV++LIERCLQL MN+KEVV+TLL   KI+P FTE VWQKLEEEN
Sbjct: 1   MSSGTVRRVSRQDIQLVQNLIERCLQLYMNQKEVVDTLLEQAKIEPGFTELVWQKLEEEN 60

Query: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQS--CY 120
           +EFF AYYLRLMVK QI+EFN+LLEQQ   M Q+HP  V ++ ++NGSH+Q + Q   CY
Sbjct: 61  REFFKAYYLRLMVKHQIMEFNKLLEQQVHHMRQMHPTGVASVQNTNGSHLQSMNQKQLCY 120

Query: 121 APEHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHARN-DAAP-----Q 180
             EHT  +LK +   +P+  SL NA+ NG+  +++++ +++++S+HAR  DA+P     Q
Sbjct: 121 PSEHTDQSLKSESAHHPMASSLSNAFLNGSSTLNTNVPSSINISTHARRVDASPNMLSSQ 180

Query: 181 SSNMGLFQGMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSF 240
           ++NM + QGMNGGMIK ET ++N + +M+G E N L+   ++G+ S+ +F++  +N P  
Sbjct: 181 TTNMPMMQGMNGGMIKSETAFTNPASFMYGGERNALEGHSAVGDTSIPNFSNESNNQP-L 240

Query: 241 NESLLDPDPSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCEN 300
           ++ LL+ + S+FGF+ QI RNFSLSDLTADFSQ S+ILESY R PFL   A+N LD+ + 
Sbjct: 241 SDPLLEAEASTFGFLGQIPRNFSLSDLTADFSQSSEILESYDRSPFLVPNAENFLDSRDR 300

Query: 301 GD-RLDGKGLDSVSKSLSYE 312
           G+ + D K LD++S+  SY+
Sbjct: 301 GEYQGDNKRLDTISEGFSYD 319

BLAST of Cp4.1LG19g00090 vs. TAIR 10
Match: AT3G10250.2 (Plant protein 1589 of unknown function )

HSP 1 Score: 323.6 bits (828), Expect = 1.8e-88
Identity = 170/320 (53.12%), Postives = 237/320 (74.06%), Query Frame = 0

Query: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60
           MS+GT RRVSRQDIQLV++LIERCLQL MN+KEVV+TLL   KI+P FTE VWQKLEEEN
Sbjct: 1   MSSGTVRRVSRQDIQLVQNLIERCLQLYMNQKEVVDTLLEQAKIEPGFTELVWQKLEEEN 60

Query: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQS--CY 120
           +EFF AYYLRLMVK QI+EFN+LLEQQ   M Q+HP  V ++ ++NGSH+Q + Q   CY
Sbjct: 61  REFFKAYYLRLMVKHQIMEFNKLLEQQVHHMRQMHPTGVASVQNTNGSHLQSMNQKQLCY 120

Query: 121 APEHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHARN-DAAP-----Q 180
             EHT  +LK +   +P+  SL NA+ NG+  +++++ +++++S+HAR  DA+P     Q
Sbjct: 121 PSEHTDQSLKSESAHHPMASSLSNAFLNGSSTLNTNVPSSINISTHARRVDASPNMLSSQ 180

Query: 181 SSNMGLFQGMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSF 240
           ++NM + QGMNGGMIK ET ++N + +M+G E N L+   ++G+ S+ +F++  +N P  
Sbjct: 181 TTNMPMMQGMNGGMIKSETAFTNPASFMYGGERNALEGHSAVGDTSIPNFSNESNNQP-L 240

Query: 241 NESLLDPDPSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCEN 300
           ++ LL+ + S+FGF+ QI RNFSLSDLTADFSQ S+ILESY R PFL   A+N LD+ + 
Sbjct: 241 SDPLLEAEASTFGFLGQIPRNFSLSDLTADFSQSSEILESYDRSPFLVPNAENFLDSRDR 300

Query: 301 GD-RLDGKGLDSVSKSLSYE 312
           G+ + D K LD++S+  SY+
Sbjct: 301 GEYQGDNKRLDTISEGFSYD 319

BLAST of Cp4.1LG19g00090 vs. TAIR 10
Match: AT5G04090.2 (Plant protein 1589 of unknown function )

HSP 1 Score: 278.9 bits (712), Expect = 5.2e-75
Identity = 165/323 (51.08%), Postives = 217/323 (67.18%), Query Frame = 0

Query: 1   MSTGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEEN 60
           MS+ T RRVSR+DIQLV++LIERCLQL MN+KEVV+TLL   KI+P FTE VWQKLEEEN
Sbjct: 1   MSSLTVRRVSREDIQLVQNLIERCLQLYMNQKEVVDTLLEQAKIEPGFTELVWQKLEEEN 60

Query: 61  QEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQPIPQSCYAP 120
           +EFF AYYLRLMVK QI+E+N LLEQQ   M Q+HP A  ++ + NGSHV P+ Q     
Sbjct: 61  REFFKAYYLRLMVKHQIMEYNELLEQQINHMRQMHPTAGASVRNRNGSHVPPMNQQQLLY 120

Query: 121 EHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHAR------NDAAPQSS 180
           E      K+ D  +P   +L + Y NG   +++++ + VD SSH+R      N  + Q++
Sbjct: 121 ER-----KEPDQSSP---NLSSPYLNGGSAINTNIPSYVDFSSHSRRVDPSPNSLSLQAT 180

Query: 181 NMGLFQGMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASVDSNTPSFNE 240
           NM L Q    GMIK ET Y N +PYM+G E     A+ ++G+ ++ASF++ DS+  S N+
Sbjct: 181 NMPLMQ----GMIKSETAYQNCAPYMYGGE-----AQSTVGDVTIASFSN-DSSNQSLND 240

Query: 241 SLLDPDPSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADNILDTCENGD 300
            L+DPD  +FG + QI +NFSLSDLTADFSQ SDILESY   PFL  +A+N LD+ E  +
Sbjct: 241 PLVDPDAPTFGSLGQIPQNFSLSDLTADFSQSSDILESYEGSPFLLADAENFLDSSERVE 300

Query: 301 RL-DGKGLDSVSKSLSYEEFRHN 317
              D + L ++S   SYE FR N
Sbjct: 301 HQGDHERLRTISSGFSYENFRSN 305

BLAST of Cp4.1LG19g00090 vs. TAIR 10
Match: AT5G04090.1 (Plant protein 1589 of unknown function )

HSP 1 Score: 214.9 bits (546), Expect = 9.1e-56
Identity = 129/272 (47.43%), Postives = 174/272 (63.97%), Query Frame = 0

Query: 52  VWQKLEEENQEFFNAYYLRLMVKSQIIEFNRLLEQQARMMHQIHPCAVTALSSSNGSHVQ 111
           VWQKLEEEN+EFF AYYLRLMVK QI+E+N LLEQQ   M Q+HP A  ++ + NGSHV 
Sbjct: 11  VWQKLEEENREFFKAYYLRLMVKHQIMEYNELLEQQINHMRQMHPTAGASVRNRNGSHVP 70

Query: 112 PIPQSCYAPEHTGPALKQDDIDNPVGVSLGNAYSNGTQPVHSSMHTAVDMSSHAR----- 171
           P+ Q     E      K+ D  +P   +L + Y NG   +++++ + VD SSH+R     
Sbjct: 71  PMNQQQLLYER-----KEPDQSSP---NLSSPYLNGGSAINTNIPSYVDFSSHSRRVDPS 130

Query: 172 -NDAAPQSSNMGLFQGMNGGMIKVETGYSNSSPYMFGTEGNVLDARQSIGNASVASFASV 231
            N  + Q++NM L Q    GMIK ET Y N +PYM+G E     A+ ++G+ ++ASF++ 
Sbjct: 131 PNSLSLQATNMPLMQ----GMIKSETAYQNCAPYMYGGE-----AQSTVGDVTIASFSN- 190

Query: 232 DSNTPSFNESLLDPDPSSFGFINQITRNFSLSDLTADFSQGSDILESYGRCPFLPTEADN 291
           DS+  S N+ L+DPD  +FG + QI +NFSLSDLTADFSQ SDILESY   PFL  +A+N
Sbjct: 191 DSSNQSLNDPLVDPDAPTFGSLGQIPQNFSLSDLTADFSQSSDILESYEGSPFLLADAEN 250

Query: 292 ILDTCENGDRL-DGKGLDSVSKSLSYEEFRHN 317
            LD+ E  +   D + L ++S   SYE FR N
Sbjct: 251 FLDSSERVEHQGDHERLRTISSGFSYENFRSN 264

BLAST of Cp4.1LG19g00090 vs. TAIR 10
Match: AT3G61700.1 (Plant protein 1589 of unknown function )

HSP 1 Score: 138.7 bits (348), Expect = 8.3e-33
Identity = 113/318 (35.53%), Postives = 160/318 (50.31%), Query Frame = 0

Query: 2   STGTARRVSRQDIQLVRSLIERCLQLDMNRKEVVETLLNHEKIDPSFTEHVWQKLEEENQ 61
           S+  +R+VSRQDI+LV++LIERCLQL MNR EVV+TLL   +IDP FT  VWQKLEEEN 
Sbjct: 43  SSNDSRKVSRQDIELVQNLIERCLQLYMNRDEVVKTLLTRARIDPGFTTLVWQKLEEENA 102

Query: 62  EFFNAYYLRLMVKSQIIEFNRLLEQQARMM-HQIHPCAVTALSSSNGSHVQPIPQSCYAP 121
           +FF AYY+RL +K QII FN LLE Q  +M +   P  V      NG H    P +    
Sbjct: 103 DFFRAYYIRLKLKKQIILFNHLLEHQYHLMKYPPGPPKVPLAPIQNGMH----PMA---- 162

Query: 122 EHTGPALKQDDIDNPVGVSLGNAYSNGTQ---PVHSSMHTAVDMSSHARNDAAPQSSNMG 181
                         PV + +G       Q   P H     A+ +SS    +  P  +N  
Sbjct: 163 --------------PVNMPMGYPVLQHPQMHVPGHPHHLDAMGVSSCHVVNGVPAPANFH 222

Query: 182 LFQGMNGGMIKVETGYSNSSPYMF----GTEGNVLDARQSIGNASVASFASVDSNTPSFN 241
             +      + ++T  ++++P +     G    ++ +  S+ ++    FA+ D +    +
Sbjct: 223 PLRMNTANDMVIDTTANDATPQVIPPNSGAMPEMVASPASVASSGHFPFAASDMSGMVMD 282

Query: 242 ESLLD--------PDPSSFG-------FINQITRNFSLSDLTADFSQGSDI--LESYGRC 295
            S+LD        PD    G         +QI  NFSLSDLTAD S   D+  L +Y   
Sbjct: 283 TSVLDSAFTSDVGPDGEGAGNSRDSLRSFDQIPWNFSLSDLTADLSNLGDLGALGNYPGS 338

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023518640.16.52e-226100.00uncharacterized protein LOC111782088 isoform X1 [Cucurbita pepo subsp. pepo][more]
XP_022966393.19.94e-22198.10uncharacterized protein LOC111466056 isoform X1 [Cucurbita maxima] >XP_022966394... [more]
XP_022925165.11.41e-22098.10uncharacterized protein LOC111432488 isoform X1 [Cucurbita moschata] >XP_0229251... [more]
XP_038882081.16.59e-21093.04uncharacterized protein LOC120073356 [Benincasa hispida] >XP_038882082.1 unchara... [more]
XP_031740142.11.09e-20892.41uncharacterized protein LOC101204957 isoform X1 [Cucumis sativus] >XP_031740143.... [more]
Match NameE-valueIdentityDescription
A0A6J1HRZ64.81e-22198.10uncharacterized protein LOC111466056 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... [more]
A0A6J1EH546.84e-22198.10uncharacterized protein LOC111432488 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
A0A1S3C1F91.07e-20891.77uncharacterized protein LOC103495809 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... [more]
A0A6J1DI932.06e-20691.14uncharacterized protein LOC111021330 isoform X1 OS=Momordica charantia OX=3673 G... [more]
A0A6J1F3A32.11e-19989.24uncharacterized protein LOC111439400 isoform X1 OS=Cucurbita moschata OX=3662 GN... [more]
Match NameE-valueIdentityDescription
AT3G10250.11.8e-8853.13Plant protein 1589 of unknown function [more]
AT3G10250.21.8e-8853.13Plant protein 1589 of unknown function [more]
AT5G04090.25.2e-7551.08Plant protein 1589 of unknown function [more]
AT5G04090.19.1e-5647.43Plant protein 1589 of unknown function [more]
AT3G61700.18.3e-3335.53Plant protein 1589 of unknown function [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR006476Conserved hypothetical protein CHP01589, plantTIGRFAMTIGR01589TIGR01589coord: 14..70
e-value: 3.3E-22
score: 76.2
IPR006476Conserved hypothetical protein CHP01589, plantPFAMPF09713A_thal_3526coord: 17..68
e-value: 1.8E-23
score: 82.6
IPR006476Conserved hypothetical protein CHP01589, plantPANTHERPTHR31871OS02G0137100 PROTEINcoord: 1..310
NoneNo IPR availablePANTHERPTHR31871:SF1OS02G0137100 PROTEINcoord: 1..310

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG19g00090.1Cp4.1LG19g00090.1mRNA