Cp4.1LG18g08920 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG18g08920
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionC2H2-type domain-containing protein
LocationCp4.1LG18: 8015177 .. 8023045 (-)
RNA-Seq ExpressionCp4.1LG18g08920
SyntenyCp4.1LG18g08920
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAACATACGAATGTTCCGTATTAAACACTGAATAAACGGTGGAATTTCCGCGCCACGCGTACGTGATTCGAGAGGCAAAGTTCGTGTTCTTCGCGGCTAGAATTCGAATCTCCAAGACCCCAATCGGGTCACAGCGGCTCGTAGCCGGGTTTTGCAACTTGGAGATCACCAGAGAAATGAAGAAAGCAACAACAGGCAGCATCGTTCTTCTTTTAGCTTTGCTTTCTCTTCAGGACTTTGTTCATTTTGCATTGGCGCTACCTGTTTCAGAGAGTAATCAGGTACCGAATCTGTCGTTATAGATACCTAAACACCCATACATCGTATGAGTTGATCTTGATGCGAAAACCATGGCTTGTGTCGTTTCTGTTTCTTACTACGATCCAGTGAGGTTTTTCTAATCCGTCTCTAGTCCTGGATTGAGCTTTTTGGATTTGGAATAGCCTCGACCAATTGCAATTTTATTGTCTGAGAAAATGAAAATTTTCTGTTTGTTATTTTTTAATCTGGAATAAGCTTTGCGAGGAAGATTTCGCAACTGAAAAATGTTCGAGTTAAAAATACTTTTCCTACTCAGTTATTCCTTGAATGTACTCTATTAGGTTATTTGATAGGTCTCTTAGACGGTTAGAGTTGTGTCAATTGGGTAACAAAATTGTCAATAATATCAGCATAACAATTAATCAAACTCTACGAATCAATCAGGTAAATACAGAGGAATTAGAGTGGGTTGAACTTTCTAGGATTTGCATCCACTTTCGCCACTGAAAATTAGACTCAAATCAGCAGATACAAAACAGAGCAATTTTTTGGCAATCCAACTCAGGAATCCCAAAACCCATGACCTCCTTTAAATTTTCATTCCCTAAATTAGCCAATGAACATGGCACTCCCTCCCAACAATCCTTCCTTTCGAACAAAGTACACTATAGAGCCTCCCCTAAGGCCGAAACAAAGTATACCCTCTGTTCTCTTTGACTACACCTTCGAGGCTCACAACTTCTTTGTTCGATATTTGAGGATTCTATTGACTTGGTTAAGTTAATAGCATGACTATGATACCATGTCAGGAATCACAAATTCCACAATGATATGATATTTTTCACTTCGAGTATAAGCTCTCGTGGCTTTGCTTTGGGCTTCCCCAAAGGGCCTCATACCATTGGAGATGTATTCCTTACTTATAAACCCATAATCATTCCCTAAATTAGCCTCATACCAATGGAGATGTATTCTTTACTTATAAACCCATAATCATTCCCTAAATTAGTCCATGAGCTTAAGCTCTTGAGTTGATTGGTTGATTGGTGATTTAATATTTGACTCTATCAGCTAATATGAAACCACGTTATGCAAAGAGGTCCGAGCCAATCAGTTTACAGTAATGTAACCGGGATAGTAGCCATTGTCAGCACTCACGTATAAGTTGGCCAAATTAACTGTCACTTAGACCATAAAAAATAACAGTTAGATTTCCGTTTACTGCGATCTGCTTGTCATGGAACTATAGAAATACTAGCGAAATGGATGCTGCCAGTTTCATTGCCCTTTATATTTGCTTACTTTGCTAATTCTTATCCTTACACTTCTATCTGGATATACTCTTCTTACCTTGGTCTTTTTCATTCATTTTAGGACGAGGAGCAATCTGCAACTACGAGGTAGCTCTTTTTCCCCATCACAGTGGACTCTCTCTCCCTCTCTCATGTTATAGCTTGGTCAAAACCCAAAGTTAGCTGACATGTGGGAGTATTGATTAATTGAACTGAGTTACTGGATTCATTTGATATACTGTATCTTCTGTGTATTAGTTCACCAAATAAGATGCACATACACACACACACATACGCATATATTTAAAAGAAACAAAAATTTTCATTGATTTGGTGGAAAGTCACCACAAAAACTTACACACCCCTATTTCAAGCTTATAAAAATGCACTACAATTTGAACTAATACAGAAACAGTATAATTACAAGCCTTAGGTAAGCAACACCATTAGTTGCATTGAGGATGAAGCCTAACCAAATTCATCCATCTTCCCACTGTTTTTAGTTTTCATAAATTCTTTAGCTTGCCAGCTTTCTCAGTACCAAAAGTATTTGAGGTAGATCACATGAGGAATACTCATGTTGGCTACTTCTAGTTTTACTTTATCCATTACAATATTGATATTGCATACAGAAGACTTCTCTACATTGATCTAATCTTAAAGCTGTCTTAAACATCTTTATTAATTTAAACATAAATTTCAGTAAAAACTTAGTTATCTCATAGCAAATGATTGCAAGGTCAGCAAACTGAGTTTTGAGAATTCTCTCTTAACCAGTGCAAAAACTTTCCAACAAGTCCTTTTGCATCTCTGGACTGGTCTTAGCTGCTCAGAGTATTTCCTATTCGTATAAACAAGGAGACAGGCTACCTTGTTTTAGCCCCTGTGCTGCCAAGAAATATTGAAAATTCAGTGTCTGAAATATATCCCATCATCCATTTCAGCAATTTACCTTCACATTCACATTTCTTCATCACCGCCAACCGAAACTCCCAATCCATGAAATGTATGTTTGTTCCGAGTCAATTTTAAAAATCCATTAACTTTTCTTCAACCATTTCACTTGCAATAGAATTGATGTAATTCCGAAGGAAATATGTTGATGCCCAATTTTGTATTTCAAGTTTCACAGTAACGTATTCTACATTCTCCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTTTTTTTTTTTACATTTTTGCTCCTCTATTTCTCTTTCTGTCACTCTCTCACTCTCACTAACCAATGCATCTTATTGTTATCTTTTGTGTGTCTCATCCTAGGCAAGCCAGTAGCTGCTGCCTTGTGTGTCTTATTTTTCAAACATATTATTCGGCACTTTTGGCTTCACTATTTAAATTTGTTTATGTAGACCTCTCGAGCAAGACAAAGAGCATTTTGAGGAAGTTCATTGTTCCAGAGAAAGAAGTAGGACAGCCTGGAATATTCTTGAGGAGGTACGGACTAAACTTTCTTTACCAGTTAAAGATTATGGACTCTACAGTCTGAAACAGACATGCTAATTGATGTTCTGTTGGTTCTACTTCAGCATTTGCTGCCGTTTATTGAGAAGGAGAATTATCAGGTTTCAACCAAGTGTAGGCTTCATCCAAACAATGATCTCTTCAGGGATCAGGAGCAGCACAAGATTCATTTTGATATAAATCATTGGCAGTGTGGATACTGTCGCAAAAGCTTTCGGGCGGAAAAATTTCTTGATAAGCATTTCGACAACAGACATTACGATCTTCTGAATGTTGTATGTTCTTCCTTCTTGTTATAATCATCGTTCATAATTATAGTATATTCTTTATAAGTTATCATATAACATTTGAATTTCAACCTTAAGAGGAGTATATGCCAATTACCACTTAGCTATACTCATGTTGGCCTAAAGTTATTATATATCATGGTTATCATTTTTGGGTCATGTCCTTGTATCTGCTGTTATTTGGAATTGGCCATTGCATAATGCATAGGTTTCTCGTAGTATCATCTATAATTACTTGGATTTCATGTTACAACAGACAGTGTGGCATTTGACTTTTCACTAACATGTGTCTGATGCTTCCTCTAATAATTGTAAATTAGAAGGTTTTACTTTCATGACAAAAATGGGTGAGTGTTTCAAGTATTTGAATATCAGTTAACAACAAATGATCTTATTGCAGTAATCCTAGAACTAACTGATCAGAAAAAGAACAATAGCAAGAGAAGAGTGCAGAAAAGGAGAGGATGACATATTTTCTGAAGAAAAATTCACAAAAATTCGCTCTGATTGGCTTGTATGAGGATAGTAGTGTAGTTCCAATGAAAGGTTAGTCGAAGTAGTCCCGTGTAGAAGCCGAGAAGGTCATGCTATCTTCAATCTCACTTTTCCCATCCATCTCTCCTGAGAAACGTAACAATTTCTCTCGAGTCAAGAAAGTCATGACATAATCAATCTCATCTATACTCATCCTTCTCACCTAAAATGACATGGAAATTTCTCTGAAACCATGGCTTCCCGAAAAATGCTCCATGGAGAAGTTTTTAATGCCATAAAAACTGCTCAATATCTGCATATATTAGGCTTTATATTCATGGTTCTATTTTTCTTCCCACAAGATATTTACCTGATGTCTACCTACATCTGAAGATTCTTGGACACTAGACGTATGTTAGCTTGGATTATTCACCTCAAGTAGCTCCTGTGCATTTTTATGAACTCAGTCTGTGTTATTTATTTGCTTGCTTTTGAGTTTGATTGCTGTTGTTTACCCATGCTATTAGTAAATGCAGAGTCACGGGAAGTGCTTAGCTGATTTATGTGGGGCTTTACATTGTGACCTAAAGATGGATATCAAGTCACGTAAATCTAAATGCAAGCCTGCAGCTGCAGCTAGGAACAAGCATTTATGTGAGGTTGCTTAGTGGTACACAAAATGCTTGCTAATTTCCTGTCTTCTCTTTTACGTATTGCATCGTTCATTTCTTTAAGTTCTTTAACAGAGTCTTGCTGATACTTGTTTTCCAATTAATGAGGGACCATCAGCAAGCCGTCTTCATGGTAAGTTTCTTCTCCTTCTCCCCACCCATCAAGCTCTTCTCCTTTTCCCTTTTTAGTTGTAATATCTGTTCTTTTCAAAGGTTCCTCATGGAATTGCTCATATTTTCTAATCTGATGCCTGAGGTACTCCGTAGAGCTATTTCTTCACCAATTCTGTGGAGCTCATTCTTGCACTGAGAAACAAAAGCCATTTTCTAGAGGAGCGGAGGTGTGCCTTTAAACATAAGCCATTTGCTGTAGATTGTTTGGTCATTACTTGCAATAAATATGTCAAGTTGGTATTCTGCTTGTTATGAAATTTTTGTTTCACCTTCAAATATGTCACTCCCTTGTGGAAAGCCATGGACTGGTGAGCCGTCCAATTTTATTTTATTTTTTTTTAAAAAAAAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAGTGAATGAACGAAAAAAGGGAATTGTTTATAGTTTCATATTCAAATTCTTCTTACACATTTGGAGTTTCATTCGATGTTTCTTTCTTATTGACTACTTTATGTCTGCAGAGGCAACCAGGCATCTTTTACATGGCATCTTCAATACTGATTTTGATGTTGCTACCAATTTTCTATGTCATTGTCTATTTGCACCGCAGGTACTTTCTTTTATCTTGTTGATGCCTCGTATGAATAATCTTCGCATGATCCGTAACCCCTTTTATTTATGGGCGTCTTTTCTAGAGAATCGAGAAACGGAATCGAAGTGCTTAGAAGAATCTCAAAGGCTGGACGTAAAACCAAACCCTTGTAACTGGTATCACCTTCAACCAAACTTCACTATTAGATTTTTAAGTTCGTCAACTCTGGGGCTCTAATTGTTTGTGGCCAAGGGTAGAAAATCATAGCCAAATGTTCTATAGCCATGGTGTCACCTAAATGTACCTCCGACATATCTTGCATTAAACTAAGTGACTTCTTTTTGTTCTCCGGCATTTGCATCTTTCTTTGTCATCATCTATTCTATTGCTGTTCAATTACTTTTTTTTTCTTTTTCTAGTAAAAGACTTGGAAGAAATGGAGGTTGATTACCAGTGTGAATGTATCGTATTCATGATTAGAAATCTATGAAAAATATAAACTTTTTTTTTTTCTTAATAACCACATCATTCCCTTGTTTGAATCGACCTTTTTCTCCACTAACATCATATGCTAGGTCAGTCTTGTTTTCAAGATTGCGTGGCATTTCATATATTGACAACAATTACTTCCAAGTATCAGCATCAACCTTGAAACGGCAATAGTACGTCTTTAATGAATCAATTTTCAGAGTGAACTCAGCCCCAACTGCCAACCTGATCAGAAGTTCATCCAGGATCGTCTCATCCAGCTTGCTGGTTTAGTCATACTTTGCCGCAACTCCCATGAAGAACAATCTATTATATGGTATGCAATTAGCTCAACTTTATAATCAGAACATTGATTTTGGACTCGAGAGCAATCTTGAACTTCCTCTAGGATTAATTCGCAAGATTCTCATGCCAGATGCAGGATCCTGCATTGTCTTAAAAAACTGGCAATCATTTAGTCCTCTTTACAATACCGGAAGTATGACTTGGAACACAAAATTTGGAATCAGGAAAGACCAGCCAAAGCAGGTTGGAATTCTAACATTCAGGGAGATATGACTCTAAATGCAAGAGAGATCAATTCGTTTGATAATAAAAACCCATCTGGATCACTAAGTCGAATGTGCCTCAGCTTCGTTAAACTATCATCTTTGCTCGATAGCAAGCCATTGCATTCATCAACTGTTAGAGCTCTCCTCTGCTCATCCAAGCAAACCACATGGCCACTTTTGATGTACGGTTCATAGGCCCTGCAAAGAGAATGTACAAGATGACCACCATAAGCTTTTCCAAAAGATTGTAAATCAAGGCCTGAGGAAGTTCTGAGAGAAAGCATCACAACATCCATAGCCATGTCCTTGATATCAACATCATTATCTCCATGGCAATCCACCAGTCCCTTCTCTAAGTTCTGTACATAACTGGTGTATTCCTTCAATTTCCGCGGCCTGGAGAACCTCAAACTACCGAGATAACTAGCCGCCCCTAAACCAAAACCATAGAAAGGATTGTTTTTCCAATAGGTAGAGTTGTGCTTACACTCAAACCCGCTCTTGCAGTAACTACTGATCTCATAATGGCTATAACCTCCCTCAGCGAGTGCCTTTGAAGCCATTCTGTAGAATCCAGCTGAATCTGTATCAGAAGGCAATGGAAATTCCCCTGGCTTGTACCTACACATGCCTATGTTCAAATTTCACATATAGCAACAATCTAAAATATTCATATCCAACACACTGCTACTTACAGTATCCCAAATTTTGTGTCTTCTTCAACTTGCAAATCATATACTGAAACATGAGTTGGTTGTGCTTCAATAGTGAGGCGTAAGCTCTCTTCCCACATCTCAGCTGTCTGGTGAGGTAGAGAAGATATGAGATCCATACTCCAGTTTTGAAGCCCACAAGACTTAATGATCTCAATAGCCTCATAAACTTCATCAACTCCATGAGCCCTTCCACAAGCCTTGAGTAACTCTTCCTGAAACGCCTGAACACCCAAAGACACTCTGTTCACATCCAACTTCATCAAGCCCTCCATTTTCTTCGCATCAAAAGTGCCGGGGTCCATTTCAATAGAGATTTCAGCATTCTTAGCCATCCCAAATTTCGACCCCAGCACATCTAAGATAACTGAGACGAGCCTCGGCGGCACAAGAGAAGGGGTGCCGCCTCCAAAGAAGACGGTTTCGAGGGGCCGGTCCGTTTGGAATTCTGATTTTGTGGCATTAATTTCTCGACAAAGCAACTCCACGTAGTCCCGAATTCGAGGGTCGTCATCGGTTTGGGAGGAGGAAGATCCCAAAGCGACGATTGGGAAGTCACAGTAGTGGCAGCGCTTTCGACAAAAAGGGAGGTGGATGTAGGCTGAAGTAGGGGGAATGGTAGTGGGGGTTTTGGTTGAGGCATTTTCTCGAACACTTGGCGTGATACGTGCATAAATATTGGGCATGGTGCACAGAGATGCTACTTTACGCATATTGGGCTTGCTTGATAAGGTCGAGAAAATGGGAAGAAAGCTTGATTTTAGAACGACCTGCTGAATGTTCATGGCAACGAGGAGAGGGAAGAGCGGAAGAGAGAGCGCTGCGGAGGAAGACAGTCGTGGATATTTGAACCAACCGGGGGAATTTATAGCGAACA

mRNA sequence

AAAAAACATACGAATGTTCCGTATTAAACACTGAATAAACGGTGGAATTTCCGCGCCACGCGTACGTGATTCGAGAGGCAAAGTTCGTGTTCTTCGCGGCTAGAATTCGAATCTCCAAGACCCCAATCGGGTCACAGCGGCTCGTAGCCGGGTTTTGCAACTTGGAGATCACCAGAGAAATGAAGAAAGCAACAACAGGCAGCATCGTTCTTCTTTTAGCTTTGCTTTCTCTTCAGGACTTTGTTCATTTTGCATTGGCGCTACCTGTTTCAGAGAGTAATCAGGACGAGGAGCAATCTGCAACTACGAGACCTCTCGAGCAAGACAAAGAGCATTTTGAGGAAGTTCATTGTTCCAGAGAAAGAAGTAGGACAGCCTGGAATATTCTTGAGGAGCATTTGCTGCCGTTTATTGAGAAGGAGAATTATCAGGTTTCAACCAAGTGTAGGCTTCATCCAAACAATGATCTCTTCAGGGATCAGGAGCAGCACAAGATTCATTTTGATATAAATCATTGGCAGTGTGGATACTGTCGCAAAAGCTTTCGGGCGGAAAAATTTCTTGATAAGCATTTCGACAACAGACATTACGATCTTCTGAATGTTAGTCACGGGAAGTGCTTAGCTGATTTATGTGGGGCTTTACATTGTGACCTAAAGATGGATATCAAGTCACGTAAATCTAAATGCAAGCCTGCAGCTGCAGCTAGGAACAAGCATTTATGTGAGAGTCTTGCTGATACTTGTTTTCCAATTAATGAGGGACCATCAGCAAGCCGTCTTCATGAGCTATTTCTTCACCAATTCTGTGGAGCTCATTCTTGCACTGAGAAACAAAAGCCATTTTCTAGAGGAGCGGAGAGGCAACCAGGCATCTTTTACATGGCATCTTCAATACTGATTTTGATGTTGCTACCAATTTTCTATGTCATTGTCTATTTGCACCGCAGAGAATCGAGAAACGGAATCGAAGTGCTTAGAAGAATCTCAAAGGCTGGACGTAAAACCAAACCCTTGTAACTGGTATCACCTTCAACCAAACTTCACTATTAGATTTTTAAGTTCGTCAACTCTGGGGCTCTAATTGTTTGTGGCCAAGGGTAGAAAATCATAGCCAAATGTTCTATAGCCATGGTGTCACCTAAATGTACCTCCGACATATCTTGCATTAAACTAAGTGACTTCTTTTTGTTCTCCGGCATTTGCATCTTTCTTTGTCATCATCTATTCTATTGCTGTTCAATTACTTTTTTTTTCTTTTTCTAGTAAAAGACTTGGAAGAAATGGAGGTTGATTACCAGTGTGAATGTATCGTATTCATGATTAGAAATCTATGAAAAATATAAACTTTTTTTTTTTCTTAATAACCACATCATTCCCTTGTTTGAATCGACCTTTTTCTCCACTAACATCATATGCTAGGTCAGTCTTGTTTTCAAGATTGCGTGGCATTTCATATATTGACAACAATTACTTCCAAGTATCAGCATCAACCTTGAAACGGCAATAGTACGTCTTTAATGAATCAATTTTCAGAGTGAACTCAGCCCCAACTGCCAACCTGATCAGAAGTTCATCCAGGATCGTCTCATCCAGCTTGCTGGTTTAGTCATACTTTGCCGCAACTCCCATGAAGAACAATCTATTATATGGTATGCAATTAGCTCAACTTTATAATCAGAACATTGATTTTGGACTCGAGAGCAATCTTGAACTTCCTCTAGGATTAATTCGCAAGATTCTCATGCCAGATGCAGGATCCTGCATTGTCTTAAAAAACTGGCAATCATTTAGTCCTCTTTACAATACCGGAAGTATGACTTGGAACACAAAATTTGGAATCAGGAAAGACCAGCCAAAGCAGGTTGGAATTCTAACATTCAGGGAGATATGACTCTAAATGCAAGAGAGATCAATTCGTTTGATAATAAAAACCCATCTGGATCACTAAGTCGAATGTGCCTCAGCTTCGTTAAACTATCATCTTTGCTCGATAGCAAGCCATTGCATTCATCAACTGTTAGAGCTCTCCTCTGCTCATCCAAGCAAACCACATGGCCACTTTTGATGTACGGTTCATAGGCCCTGCAAAGAGAATGTACAAGATGACCACCATAAGCTTTTCCAAAAGATTGTAAATCAAGGCCTGAGGAAGTTCTGAGAGAAAGCATCACAACATCCATAGCCATGTCCTTGATATCAACATCATTATCTCCATGGCAATCCACCAGTCCCTTCTCTAAGTTCTGTACATAACTGGTGTATTCCTTCAATTTCCGCGGCCTGGAGAACCTCAAACTACCGAGATAACTAGCCGCCCCTAAACCAAAACCATAGAAAGGATTGTTTTTCCAATAGGTAGAGTTGTGCTTACACTCAAACCCGCTCTTGCAGTAACTACTGATCTCATAATGGCTATAACCTCCCTCAGCGAGTGCCTTTGAAGCCATTCTGTAGAATCCAGCTGAATCTGTATCAGAAGGCAATGGAAATTCCCCTGGCTTGTACCTACACATGCCTATGTTCAAATTTCACATATAGCAACAATCTAAAATATTCATATCCAACACACTGCTACTTACAGTATCCCAAATTTTGTGTCTTCTTCAACTTGCAAATCATATACTGAAACATGAGTTGGTTGTGCTTCAATAGTGAGGCGTAAGCTCTCTTCCCACATCTCAGCTGTCTGGTGAGGTAGAGAAGATATGAGATCCATACTCCAGTTTTGAAGCCCACAAGACTTAATGATCTCAATAGCCTCATAAACTTCATCAACTCCATGAGCCCTTCCACAAGCCTTGAGTAACTCTTCCTGAAACGCCTGAACACCCAAAGACACTCTGTTCACATCCAACTTCATCAAGCCCTCCATTTTCTTCGCATCAAAAGTGCCGGGGTCCATTTCAATAGAGATTTCAGCATTCTTAGCCATCCCAAATTTCGACCCCAGCACATCTAAGATAACTGAGACGAGCCTCGGCGGCACAAGAGAAGGGGTGCCGCCTCCAAAGAAGACGGTTTCGAGGGGCCGGTCCGTTTGGAATTCTGATTTTGTGGCATTAATTTCTCGACAAAGCAACTCCACGTAGTCCCGAATTCGAGGGTCGTCATCGGTTTGGGAGGAGGAAGATCCCAAAGCGACGATTGGGAAGTCACAGTAGTGGCAGCGCTTTCGACAAAAAGGGAGGTGGATGTAGGCTGAAGTAGGGGGAATGGTAGTGGGGGTTTTGGTTGAGGCATTTTCTCGAACACTTGGCGTGATACGTGCATAAATATTGGGCATGGTGCACAGAGATGCTACTTTACGCATATTGGGCTTGCTTGATAAGGTCGAGAAAATGGGAAGAAAGCTTGATTTTAGAACGACCTGCTGAATGTTCATGGCAACGAGGAGAGGGAAGAGCGGAAGAGAGAGCGCTGCGGAGGAAGACAGTCGTGGATATTTGAACCAACCGGGGGAATTTATAGCGAACA

Coding sequence (CDS)

ATGAAGAAAGCAACAACAGGCAGCATCGTTCTTCTTTTAGCTTTGCTTTCTCTTCAGGACTTTGTTCATTTTGCATTGGCGCTACCTGTTTCAGAGAGTAATCAGGACGAGGAGCAATCTGCAACTACGAGACCTCTCGAGCAAGACAAAGAGCATTTTGAGGAAGTTCATTGTTCCAGAGAAAGAAGTAGGACAGCCTGGAATATTCTTGAGGAGCATTTGCTGCCGTTTATTGAGAAGGAGAATTATCAGGTTTCAACCAAGTGTAGGCTTCATCCAAACAATGATCTCTTCAGGGATCAGGAGCAGCACAAGATTCATTTTGATATAAATCATTGGCAGTGTGGATACTGTCGCAAAAGCTTTCGGGCGGAAAAATTTCTTGATAAGCATTTCGACAACAGACATTACGATCTTCTGAATGTTAGTCACGGGAAGTGCTTAGCTGATTTATGTGGGGCTTTACATTGTGACCTAAAGATGGATATCAAGTCACGTAAATCTAAATGCAAGCCTGCAGCTGCAGCTAGGAACAAGCATTTATGTGAGAGTCTTGCTGATACTTGTTTTCCAATTAATGAGGGACCATCAGCAAGCCGTCTTCATGAGCTATTTCTTCACCAATTCTGTGGAGCTCATTCTTGCACTGAGAAACAAAAGCCATTTTCTAGAGGAGCGGAGAGGCAACCAGGCATCTTTTACATGGCATCTTCAATACTGATTTTGATGTTGCTACCAATTTTCTATGTCATTGTCTATTTGCACCGCAGAGAATCGAGAAACGGAATCGAAGTGCTTAGAAGAATCTCAAAGGCTGGACGTAAAACCAAACCCTTGTAA

Protein sequence

MKKATTGSIVLLLALLSLQDFVHFALALPVSESNQDEEQSATTRPLEQDKEHFEEVHCSRERSRTAWNILEEHLLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRKSFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKHLCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
Homology
BLAST of Cp4.1LG18g08920 vs. NCBI nr
Match: XP_023515783.1 (uncharacterized protein LOC111779843 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 568 bits (1464), Expect = 1.91e-204
Identity = 279/279 (100.00%), Postives = 279/279 (100.00%), Query Frame = 0

Query: 1   MKKATTGSIVLLLALLSLQDFVHFALALPVSESNQDEEQSATTRPLEQDKEHFEEVHCSR 60
           MKKATTGSIVLLLALLSLQDFVHFALALPVSESNQDEEQSATTRPLEQDKEHFEEVHCSR
Sbjct: 1   MKKATTGSIVLLLALLSLQDFVHFALALPVSESNQDEEQSATTRPLEQDKEHFEEVHCSR 60

Query: 61  ERSRTAWNILEEHLLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK 120
           ERSRTAWNILEEHLLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK
Sbjct: 61  ERSRTAWNILEEHLLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK 120

Query: 121 SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH 180
           SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH
Sbjct: 121 SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH 180

Query: 181 LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL 240
           LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL
Sbjct: 181 LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL 240

Query: 241 ILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL 279
           ILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
Sbjct: 241 ILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL 279

BLAST of Cp4.1LG18g08920 vs. NCBI nr
Match: XP_022921975.1 (uncharacterized protein LOC111430069 [Cucurbita moschata])

HSP 1 Score: 563 bits (1450), Expect = 2.60e-202
Identity = 275/279 (98.57%), Postives = 277/279 (99.28%), Query Frame = 0

Query: 1   MKKATTGSIVLLLALLSLQDFVHFALALPVSESNQDEEQSATTRPLEQDKEHFEEVHCSR 60
           MKKATTGSIVLLL LLSLQDFVHFALALP SESNQDEEQSATTRPLEQDKEHFEEVHCSR
Sbjct: 1   MKKATTGSIVLLLVLLSLQDFVHFALALPPSESNQDEEQSATTRPLEQDKEHFEEVHCSR 60

Query: 61  ERSRTAWNILEEHLLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK 120
           ERSRTAWNILEEHLLPF+EKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK
Sbjct: 61  ERSRTAWNILEEHLLPFVEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK 120

Query: 121 SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH 180
           SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH
Sbjct: 121 SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH 180

Query: 181 LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL 240
           LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL
Sbjct: 181 LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL 240

Query: 241 ILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL 279
           ILMLLPIFYVIVYLHRRESRNGIEVLRRISK+GRKTKPL
Sbjct: 241 ILMLLPIFYVIVYLHRRESRNGIEVLRRISKSGRKTKPL 279

BLAST of Cp4.1LG18g08920 vs. NCBI nr
Match: KAG6589402.1 (hypothetical protein SDJN03_14825, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 557 bits (1436), Expect = 3.55e-200
Identity = 272/279 (97.49%), Postives = 277/279 (99.28%), Query Frame = 0

Query: 1   MKKATTGSIVLLLALLSLQDFVHFALALPVSESNQDEEQSATTRPLEQDKEHFEEVHCSR 60
           MKKATTGSIVLLLALLSLQDFV FALALP SESNQDEEQSATTRPLEQDKEHFEEVHCSR
Sbjct: 1   MKKATTGSIVLLLALLSLQDFVRFALALPPSESNQDEEQSATTRPLEQDKEHFEEVHCSR 60

Query: 61  ERSRTAWNILEEHLLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK 120
           ERSRTAWNILEEHLLPF+EKENYQVSTKCRLHPNNDL+RDQEQHKIHFDINHWQCGYCRK
Sbjct: 61  ERSRTAWNILEEHLLPFVEKENYQVSTKCRLHPNNDLYRDQEQHKIHFDINHWQCGYCRK 120

Query: 121 SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH 180
           SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH
Sbjct: 121 SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH 180

Query: 181 LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL 240
           LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL
Sbjct: 181 LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL 240

Query: 241 ILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL 279
           ILMLLPIFYVIVYLHRRESRNGIEVLRRI+K+GRK+KPL
Sbjct: 241 ILMLLPIFYVIVYLHRRESRNGIEVLRRITKSGRKSKPL 279

BLAST of Cp4.1LG18g08920 vs. NCBI nr
Match: KAG7023082.1 (hypothetical protein SDJN02_14106, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 557 bits (1436), Expect = 8.93e-200
Identity = 272/279 (97.49%), Postives = 277/279 (99.28%), Query Frame = 0

Query: 1   MKKATTGSIVLLLALLSLQDFVHFALALPVSESNQDEEQSATTRPLEQDKEHFEEVHCSR 60
           MKKATTGSIVLLLALLSLQDFV FALALP SESNQDEEQSATTRPLEQDKEHFEEVHCSR
Sbjct: 26  MKKATTGSIVLLLALLSLQDFVRFALALPPSESNQDEEQSATTRPLEQDKEHFEEVHCSR 85

Query: 61  ERSRTAWNILEEHLLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK 120
           ERSRTAWNILEEHLLPF+EKENYQVSTKCRLHPNNDL+RDQEQHKIHFDINHWQCGYCRK
Sbjct: 86  ERSRTAWNILEEHLLPFVEKENYQVSTKCRLHPNNDLYRDQEQHKIHFDINHWQCGYCRK 145

Query: 121 SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH 180
           SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH
Sbjct: 146 SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH 205

Query: 181 LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL 240
           LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL
Sbjct: 206 LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL 265

Query: 241 ILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL 279
           ILMLLPIFYVIVYLHRRESRNGIEVLRRI+K+GRK+KPL
Sbjct: 266 ILMLLPIFYVIVYLHRRESRNGIEVLRRITKSGRKSKPL 304

BLAST of Cp4.1LG18g08920 vs. NCBI nr
Match: XP_022987555.1 (uncharacterized protein LOC111485086 [Cucurbita maxima])

HSP 1 Score: 556 bits (1433), Expect = 1.02e-199
Identity = 271/279 (97.13%), Postives = 275/279 (98.57%), Query Frame = 0

Query: 1   MKKATTGSIVLLLALLSLQDFVHFALALPVSESNQDEEQSATTRPLEQDKEHFEEVHCSR 60
           MKKATTGSI+LLL L+SLQDFV FALALP SESNQDEEQSATTRPLEQDKEHFEEVHCSR
Sbjct: 1   MKKATTGSIILLLVLISLQDFVRFALALPPSESNQDEEQSATTRPLEQDKEHFEEVHCSR 60

Query: 61  ERSRTAWNILEEHLLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK 120
           ERSRTAWNILEEH LPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK
Sbjct: 61  ERSRTAWNILEEHFLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK 120

Query: 121 SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH 180
           SFRAEK+LDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH
Sbjct: 121 SFRAEKYLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH 180

Query: 181 LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL 240
           LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQ+PFSRGAERQPGIFYMASSIL
Sbjct: 181 LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQRPFSRGAERQPGIFYMASSIL 240

Query: 241 ILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL 279
           ILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
Sbjct: 241 ILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL 279

BLAST of Cp4.1LG18g08920 vs. ExPASy TrEMBL
Match: A0A6J1E7A2 (uncharacterized protein LOC111430069 OS=Cucurbita moschata OX=3662 GN=LOC111430069 PE=4 SV=1)

HSP 1 Score: 563 bits (1450), Expect = 1.26e-202
Identity = 275/279 (98.57%), Postives = 277/279 (99.28%), Query Frame = 0

Query: 1   MKKATTGSIVLLLALLSLQDFVHFALALPVSESNQDEEQSATTRPLEQDKEHFEEVHCSR 60
           MKKATTGSIVLLL LLSLQDFVHFALALP SESNQDEEQSATTRPLEQDKEHFEEVHCSR
Sbjct: 1   MKKATTGSIVLLLVLLSLQDFVHFALALPPSESNQDEEQSATTRPLEQDKEHFEEVHCSR 60

Query: 61  ERSRTAWNILEEHLLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK 120
           ERSRTAWNILEEHLLPF+EKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK
Sbjct: 61  ERSRTAWNILEEHLLPFVEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK 120

Query: 121 SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH 180
           SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH
Sbjct: 121 SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH 180

Query: 181 LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL 240
           LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL
Sbjct: 181 LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL 240

Query: 241 ILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL 279
           ILMLLPIFYVIVYLHRRESRNGIEVLRRISK+GRKTKPL
Sbjct: 241 ILMLLPIFYVIVYLHRRESRNGIEVLRRISKSGRKTKPL 279

BLAST of Cp4.1LG18g08920 vs. ExPASy TrEMBL
Match: A0A6J1JH74 (uncharacterized protein LOC111485086 OS=Cucurbita maxima OX=3661 GN=LOC111485086 PE=4 SV=1)

HSP 1 Score: 556 bits (1433), Expect = 4.92e-200
Identity = 271/279 (97.13%), Postives = 275/279 (98.57%), Query Frame = 0

Query: 1   MKKATTGSIVLLLALLSLQDFVHFALALPVSESNQDEEQSATTRPLEQDKEHFEEVHCSR 60
           MKKATTGSI+LLL L+SLQDFV FALALP SESNQDEEQSATTRPLEQDKEHFEEVHCSR
Sbjct: 1   MKKATTGSIILLLVLISLQDFVRFALALPPSESNQDEEQSATTRPLEQDKEHFEEVHCSR 60

Query: 61  ERSRTAWNILEEHLLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK 120
           ERSRTAWNILEEH LPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK
Sbjct: 61  ERSRTAWNILEEHFLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRK 120

Query: 121 SFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH 180
           SFRAEK+LDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH
Sbjct: 121 SFRAEKYLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKH 180

Query: 181 LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSIL 240
           LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQ+PFSRGAERQPGIFYMASSIL
Sbjct: 181 LCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQRPFSRGAERQPGIFYMASSIL 240

Query: 241 ILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL 279
           ILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL
Sbjct: 241 ILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL 279

BLAST of Cp4.1LG18g08920 vs. ExPASy TrEMBL
Match: A0A0A0LU52 (C2H2-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G002750 PE=4 SV=1)

HSP 1 Score: 510 bits (1313), Expect = 1.00e-181
Identity = 246/278 (88.49%), Postives = 262/278 (94.24%), Query Frame = 0

Query: 2   KKATTGSIVLLLALLSLQDFVHFALALPVSESNQDEEQSATTRPLEQDKEHFEEVHCSRE 61
           KK T  +I+LL  LLSLQ+ VHFA +LP S +NQDEEQSAT RPLEQ++EH +EVHCSRE
Sbjct: 3   KKVTASTIILLSLLLSLQEVVHFAFSLPPSHNNQDEEQSATLRPLEQNEEHVDEVHCSRE 62

Query: 62  RSRTAWNILEEHLLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRKS 121
           RSRTAWNI+EEHLLPF+EKENY+VST+CRLHPNNDLFRDQEQHKIH DINHWQCGYCRKS
Sbjct: 63  RSRTAWNIIEEHLLPFMEKENYEVSTQCRLHPNNDLFRDQEQHKIHLDINHWQCGYCRKS 122

Query: 122 FRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKHL 181
           FRAEKFLDKHFDNRH +LLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKHL
Sbjct: 123 FRAEKFLDKHFDNRHSNLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKHL 182

Query: 182 CESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSILI 241
           CESLAD+CFPINEGPSA+RLHELFLHQFCGAHSCT KQKPFSRGA RQPGIFYMASSILI
Sbjct: 183 CESLADSCFPINEGPSANRLHELFLHQFCGAHSCTGKQKPFSRGAARQPGIFYMASSILI 242

Query: 242 LMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL 279
           LMLLPIFYVIVYLHRRESRNGIEVL+RISKAGRK KPL
Sbjct: 243 LMLLPIFYVIVYLHRRESRNGIEVLKRISKAGRKNKPL 280

BLAST of Cp4.1LG18g08920 vs. ExPASy TrEMBL
Match: A0A1S3CLJ0 (uncharacterized protein LOC103502344 OS=Cucumis melo OX=3656 GN=LOC103502344 PE=4 SV=1)

HSP 1 Score: 505 bits (1300), Expect = 9.57e-180
Identity = 244/278 (87.77%), Postives = 261/278 (93.88%), Query Frame = 0

Query: 2   KKATTGSIVLLLALLSLQDFVHFALALPVSESNQDEEQSATTRPLEQDKEHFEEVHCSRE 61
           KK T  +I+LL  LLSLQ+ +HFA  LP S +NQDEEQSAT RPLEQ++EH +EVHCSRE
Sbjct: 3   KKETASTIILLSFLLSLQELLHFAFPLPPSHNNQDEEQSATLRPLEQNEEHVDEVHCSRE 62

Query: 62  RSRTAWNILEEHLLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRKS 121
           RSRTAWNI+EEHLLPF+E ENY+VST+CRLHPNNDLFRDQEQHKIH DINHWQCGYCRKS
Sbjct: 63  RSRTAWNIIEEHLLPFMEIENYEVSTQCRLHPNNDLFRDQEQHKIHLDINHWQCGYCRKS 122

Query: 122 FRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKHL 181
           FRAEKFLDKHFDNRH +LLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKHL
Sbjct: 123 FRAEKFLDKHFDNRHSNLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKHL 182

Query: 182 CESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSILI 241
           CESLAD+CFPINEGPSA+RLHELFLHQFCGAHSCT KQKPFSRGA RQPGIFYMASSILI
Sbjct: 183 CESLADSCFPINEGPSANRLHELFLHQFCGAHSCTGKQKPFSRGAARQPGIFYMASSILI 242

Query: 242 LMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL 279
           LMLLPIFYVIVYLHRRESRNGIEVL+RISKAGRK+KPL
Sbjct: 243 LMLLPIFYVIVYLHRRESRNGIEVLKRISKAGRKSKPL 280

BLAST of Cp4.1LG18g08920 vs. ExPASy TrEMBL
Match: A0A6J1C036 (uncharacterized protein LOC111007249 OS=Momordica charantia OX=3673 GN=LOC111007249 PE=4 SV=1)

HSP 1 Score: 485 bits (1248), Expect = 9.99e-172
Identity = 239/283 (84.45%), Postives = 262/283 (92.58%), Query Frame = 0

Query: 1   MKKATTG--SIVLLLALLSLQDFVHFALALPVSESNQD--EEQSATTRPLEQDKEHFEEV 60
           MKKAT G  SI+L+L   SLQ  VHFA ALP SE+ QD   EQSAT+RPL++ +EH +EV
Sbjct: 5   MKKATAGAGSIILILMSFSLQS-VHFASALPPSETPQDLEVEQSATSRPLKEAEEHVDEV 64

Query: 61  HCSRERSRTAWNILEEHLLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCG 120
           HCSRERS+TAWNI+EEHLLPF+EKENYQVST+CRLHPNNDLFRDQEQHKIH DINHWQCG
Sbjct: 65  HCSRERSKTAWNIIEEHLLPFLEKENYQVSTECRLHPNNDLFRDQEQHKIHLDINHWQCG 124

Query: 121 YCRKSFRAEKFLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAA 180
           YCRKSFRAEKFLDKHFDNRH++LLNVSHGKCLADLCGALHCD+KMD+KSRKSKC PAAAA
Sbjct: 125 YCRKSFRAEKFLDKHFDNRHHNLLNVSHGKCLADLCGALHCDMKMDMKSRKSKCSPAAAA 184

Query: 181 RNKHLCESLADTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMA 240
           RNKHLCESLAD+CFPINEGPSASRLH+LFLHQFCGAHSCT K KPFS+GAERQPGIFYMA
Sbjct: 185 RNKHLCESLADSCFPINEGPSASRLHDLFLHQFCGAHSCTGKLKPFSKGAERQPGIFYMA 244

Query: 241 SSILILMLLPIFYVIVYLHRRESRNGIEVLRRISKAGRKTKPL 279
           SSILILMLLP+FYVIVYLHRRES+N I+VL+RISKAGRKTKPL
Sbjct: 245 SSILILMLLPLFYVIVYLHRRESKNEIQVLKRISKAGRKTKPL 286

BLAST of Cp4.1LG18g08920 vs. TAIR 10
Match: AT5G63280.1 (C2H2-like zinc finger protein )

HSP 1 Score: 330.5 bits (846), Expect = 1.3e-90
Identity = 153/272 (56.25%), Postives = 204/272 (75.00%), Query Frame = 0

Query: 9   IVLLLALLSLQDFVHFALALPVSESNQDEEQSATTRPLEQDKE--HFEEVHCSRERSRTA 68
           ++LLL +L L   V           +Q  E+S +TR + ++ E  +  E+HCSRERSR A
Sbjct: 10  VILLLLVLLLHQSV-----------SQGFEESESTRLVNEEVEVSNAPEIHCSRERSRAA 69

Query: 69  WNILEEHLLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRKSFRAEK 128
           W I++++L PF+E+E Y++   CRLHP+NDL+RDQE HK+H D+  W+CGYC+KSF  EK
Sbjct: 70  WQIIQDYLTPFVERERYEIPKNCRLHPDNDLYRDQEHHKVHVDVFEWKCGYCKKSFNDEK 129

Query: 129 FLDKHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKHLCESLA 188
           FLDKHF  RHY+LLN +  KCLADLCGALHCD  +  K  KSKC P A A+N+HLCES+A
Sbjct: 130 FLDKHFSTRHYNLLNTTDTKCLADLCGALHCDFVLSSKKPKSKCNPPAVAKNRHLCESVA 189

Query: 189 DTCFPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSILILMLLP 248
           ++CFP+++GPSASRLHE FL QFC AH+CT   KPF RG +++ G+FY+A SIL LMLLP
Sbjct: 190 NSCFPVSQGPSASRLHEHFLRQFCDAHTCTGNDKPFPRGGKKKSGVFYLAISILTLMLLP 249

Query: 249 IFYVIVYLHRRESRNGIEVLRRISKAGRKTKP 279
           +FY++V+LH+RE R+G + LRRI K+G+KTKP
Sbjct: 250 LFYLLVFLHQREKRSGTQDLRRIIKSGKKTKP 270

BLAST of Cp4.1LG18g08920 vs. TAIR 10
Match: AT5G40710.1 (zinc finger (C2H2 type) family protein )

HSP 1 Score: 288.1 bits (736), Expect = 7.5e-78
Identity = 137/269 (50.93%), Postives = 194/269 (72.12%), Query Frame = 0

Query: 10  VLLLALLSLQDFVHFALALPVSESNQDEEQSATTRPLEQDKEHFEEVHCSRERSRTAWNI 69
           +LL   +S Q    +  +LP S   Q+ E        +  ++ F E+HCSRERSR AW I
Sbjct: 13  ILLCFFISSQ---FWGFSLPTSIIQQNLE------GFKDPEDGFHEIHCSRERSRVAWKI 72

Query: 70  LEEHLLPFIEKENYQVSTKCRLHPNNDLFRDQEQHKIHFDINHWQCGYCRKSFRAEKFLD 129
           ++E+L+P++EKE YQ+ + CR+H +ND++R+QE+HK+  DIN W+CG+C+K+F  EK+LD
Sbjct: 73  IQEYLMPYVEKERYQLPSTCRVHRDNDIYREQEEHKLRSDINEWRCGFCKKAFYEEKYLD 132

Query: 130 KHFDNRHYDLLNVSHGKCLADLCGALHCDLKMDIKSRKSKCKPAAAARNKHLCESLADTC 189
           KHFD+RHY+LLN SHGKCL+DLCGALHCDL +D    KSKC PAAAA+N+HLCESLA++C
Sbjct: 133 KHFDSRHYNLLNASHGKCLSDLCGALHCDLVVDTARLKSKCNPAAAAKNRHLCESLANSC 192

Query: 190 FPINEGPSASRLHELFLHQFCGAHSCTEKQKPFSRGAERQPGIFYMASSILILMLLPIFY 249
           FP+N+G SA+RLH+ FL QFC AH+C+   KP S+  +++  I Y+  SI++L++L ++Y
Sbjct: 193 FPVNKGSSANRLHDFFLRQFCDAHTCSGGSKPLSQKPKKR-SIVYIIFSIIVLVVLLLYY 252

Query: 250 VIVYLHRRESRNGIEVLRRISKAGRKTKP 279
             VYL RR  +   + L+RI   G K KP
Sbjct: 253 SFVYLFRRGLKRRSQDLKRIRHNGLKKKP 271

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023515783.11.91e-204100.00uncharacterized protein LOC111779843 [Cucurbita pepo subsp. pepo][more]
XP_022921975.12.60e-20298.57uncharacterized protein LOC111430069 [Cucurbita moschata][more]
KAG6589402.13.55e-20097.49hypothetical protein SDJN03_14825, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAG7023082.18.93e-20097.49hypothetical protein SDJN02_14106, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_022987555.11.02e-19997.13uncharacterized protein LOC111485086 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1E7A21.26e-20298.57uncharacterized protein LOC111430069 OS=Cucurbita moschata OX=3662 GN=LOC1114300... [more]
A0A6J1JH744.92e-20097.13uncharacterized protein LOC111485086 OS=Cucurbita maxima OX=3661 GN=LOC111485086... [more]
A0A0A0LU521.00e-18188.49C2H2-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_1G002750 P... [more]
A0A1S3CLJ09.57e-18087.77uncharacterized protein LOC103502344 OS=Cucumis melo OX=3656 GN=LOC103502344 PE=... [more]
A0A6J1C0369.99e-17284.45uncharacterized protein LOC111007249 OS=Momordica charantia OX=3673 GN=LOC111007... [more]
Match NameE-valueIdentityDescription
AT5G63280.11.3e-9056.25C2H2-like zinc finger protein [more]
AT5G40710.17.5e-7850.93zinc finger (C2H2 type) family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR21385ZINC FINGER PROTEIN-RELATEDcoord: 17..278
NoneNo IPR availablePANTHERPTHR21385:SF5TRANSCRIPTION FACTOR C2H2 FAMILY-RELATEDcoord: 17..278
IPR013087Zinc finger C2H2-typePROSITEPS00028ZINC_FINGER_C2H2_1coord: 115..136

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG18g08920.1Cp4.1LG18g08920.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0016021 integral component of membrane