Cp4.1LG01g10540 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG01g10540
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: TDBD domain-containing protein
Location: Cp4.1LG01: 6521592 .. 6527535 (-)
RNA-Seq Expression: Cp4.1LG01g10540
Synteny: Cp4.1LG01g10540
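The Location field packs the sequence ID, start, end, and strand into one string. A minimal Python sketch of how it could be split apart and the genomic span computed is given here. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location: str):
    """Split a 'seqid: start .. end (strand)' string into its four parts.

    Hypothetical helper for illustration; the pattern is modelled only on
    the Location value shown in the Overview above.
    """
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


seqid, start, end, strand = parse_location("Cp4.1LG01: 6521592 .. 6527535 (-)")

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end - start + 1
print(seqid, start, end, strand, span)
```

If the coordinates were instead 0-based and half-open, the span would be `end - start`, so the convention should be checked against the database before relying on the value.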
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

TGTCGCTATCCAGTAGTCCCTCGGAAGTCCCTTTCTATTTCTTCTCACCCAGAGGAACATCGCGTGTTCCAACCTCCCCGGCAATTTCTCCGGCGAAGGTCAGCTCACTGAGAATCTATGATAGTTGTGAAATTTCGGAGGTTTCAGTCTCTCATTGTTGGTTTCTTTGATTTTGATCGGTATTGGATTGCTTGTGCATTTCTGAATCTTCTTTTGTGTGTTTCGGAATTTATTTTGATCGTGATTAGCCCTGATCGGCGTTCGATTTTTGGTTGCTCTATGAGTTTTGATTTTTGGTTGTTTGTGTGCAGATTGATTTTCGAAATCTTTGAGGTTTGCGCTTTCCGTGGTTCTTTGTTGTGTTTTTCATCGGTAACTTGAGTTGTGTCGTCGTCTAAATCTGACACTGGGCGTGGAGTTGGGTTCGAGTGTGTGTGGCGTGGAAGGAGTTGGCGTTATCGCGCGGAGGTTGAGTTTCAGAGCTTGGAGAAGCGAATTTGAAGTTTTGGGGTTTTTGCTTGGAAAAAAATGGTACGTTTGAATCGGGAATTTTGCATACTCTTCCACCTCTGAATGTTGCTTTGTGTAAACTTATTATTGTAAAAGGCTAATTTTCCCTTTCTACTTGACAATCGGTGCTTGCTAATACTGATTTCTTGCTACCATTTCTTTTTTCTCAAAATGCGGGCATCAAAATTTGTTAATCTGTGCTTGCATACTCAGATCATCATATTTGCCTCTAGTTAAAGTATTTCCTTATGGTGTTTTTTTTATCCCAAACGTTGGTTAATTTACCCTGTGTCAATCATTGCGAGTACGAGGATGGAAGCCTAGGATTATGCGTTGGATGCTTTCAATTCTCATAACTTTAAGCTTTTGTTTCGACTAGTAGTGGAATTGAAGAACACCTGATTGGAAAATATGTTCTACCCTTCAGATTAATTCCCCTTGTTTCAAAGAATTCTGATTCATTCAATATTTAGCCGGTGTAAGAAGTTGTTTGATTGATTTTAGGCATTGGTTGATTGAAATCAAGGGCTCTCAAACAACTTTCCTGCAAGGCCAATCCTCGACAAACTTCAAAGCCCTTACTAAAAAACTTACCAGGAAGTTTTAATGGTTCCATTAGTTTGAAAGATGTATGAATGTCGGTTTCGTTTATTGATTTTGGATGAATCGATGGCTACAATATTTTATTCAAAAGTGATGATGACAACTTTCAATATGTTATATTGTTTGAAGTGTATTTAGATGGAGATGTTAGTTAACAGAATCTTGACGGGGACGTCGACTTGTTTAGTAGTTGTATGGATATGCAAGAGAGATTAAACAGTTATGCAAGGAAGCTTAAACTTTCATAATGCTCGATCTAATACTATTTCGCCAAGATATCAAGTAAGAGGTTTATTTGTGAATGCGATAGCTGGACAAGATTTTAAACTCGAGGAAAATGACATTTAGTGAGGGAAGTAAATGATTGTACACTTTATTGATCACAATTTTTTTAGCATGACATTGGATTTTGTAATCAGACCTGATAAGCTTGTAGCTTACCTTTAAATTTGTTTCTGATTATTACATTATTTTCTGCTTGTGTAAGATACAACTTTAAGCCAGAGTTTTTATTAGTTCTGATCAAATGTACTTTATGTTTTTCATTTAGTTTTCAGTTCTCATGATTTGGAATCTTTCTGAAAATGGGTTCATGAAAAGAAGCTCATGAGTGTTTATGTTCACATACCTCATTGTTCTGGAATATCATCTGGCTGATATAATAGAGTTATTTTTAAATGCATGAATCATTTATGTGGTTCTGTAAGGCCTTTCTTTAAGCATTTTTCTTGTATCTTCTTCTTTCTGTAATGCGAAGAGATTAAAGCCTAACTTAATTTATAAATATGGTTGTATGTTGCAGAGCCAGAACGTCAGGTGCCCGAAAAAGGAAAAAAGAAATAACAACTGCGTTTCTAGTGTTTAGTCTTTTCCCTTATTGTTTTGAA
GGCAAACTGTGAACTTACATATTATCTTTTGCACTTGTTTTTTTTAACTTCTCATTTGGGCTGTTTTGGGTTAAGTATTTGGGAAAGCCTCTCAAGTATTTGTTTTCTGGCAGTCTTTCCAGCATAAAAGCTTTTGGATACCAAGGGATGCTGGATGTTTAACTGATGGAGAGATGAACTATGATAGCTCCTCCAGAATCGAAGCCAAACGTGGCCATCAATGGTTCATGGATGGCAGTGCACCAGAGCTATTCAGTAGCAAGAAGCAAGCAATAGAAACCGTAAATACTAGACCTGTTCCAGGAGTTCCACACATGAATGTTTCTCCCTGGGAAAACACTTCAAGTTTTCAGTCAGTTCCTGGGCACTTCACTGATCGTCTTTTCGGCTCTGAGCCAGTACGAACTGTCAACTTGGTCGATAGAGGCATCACTGTTGGAAATGCAAATATGGACATGGGAAGAAAGGAGTATGAGAATCATTTTGCAAACAACCCATCCGTTGGCTTGTCCATGTCCCAATCCATTGAGGATTCCTCATCGTGCCTCAATTTTGGTGGAATTAGAAAAGTTAAAGTCAATCAAGTCAGGGATCCTGACATTGGCATGCCTGCCTCTTTGGGACATGCCTATAGTAGGGGTGATAATGGTACAATTTCAATGGGTACAACATTTAACAAAAACCACGAGCACGCCATTTCATTGGGCCATACATATAATAGTCGAGATGAGAATGCCATCTCGGTTGGCCCTGCATATCACAAGACAGACGACAGTTTCATTTCCATGGGTCATGCTTTCAGCAAAGGTGATGAGAATTTTATTACAATTGGTCCTAACTATAGCAAGGGAGATAATAGCATCCTATCGATGAGCCAGCCTTTTGACAAAGGGGATGACTCTTTTATTTCAATGGGTCAATCTTATGAGAAGGCAGAAGGTAATATCATTTCTTTTGGTGCCTCCTACAATAAAGGGCATGAAAATTTTATCTCAATGGGGCCAACCTATAGTAAGGCAGGTGATACTTTCATTTCAATGGCGCCCTCCTATAATAAGGGAAATGATGATAACTTGTCCATGGGTCCAACCTTTGACAAGGTAAACTCGGACGTTGTACATGTCGGTCCTAAATTTGACAAAGCCGATTCTGGTTCGGTGTCAATGACTCACAACTATCATAAAGGTGAGACTAATACCATATCTTTTGGAGGCTTTGATGATGAGAATGGGACAGGTAATCCTTCTGGTGGGATCATTAGCAGTTATGACTTGTTAATGGCGAATCAGGCTTCTGCCCAAGCATCAGAAGTATCAACCATGAGAGATTCAGTTCATCCCAATGTGGAAGTGAACATTAACAATGTCATAAAAGTTGATGCTAAAATCGACACAAGTTCCAAGATTAGAGAGCCGAGGACGACTAAGAAGGTGTCACCCAATAGCTTCCCTTCAAATGTTAAAAGCTTGCTCTCAACTGGTATGCTTGATGGGGTTCCTGTGAAATATGTTTCTTGGTCACGGGAGGTATAATTATTCTTTTCTTATACATAATTTTCATTTTAATTTGAAAATCTGGAGAATGCTATGAAACTAATTAGTCTTTTCTGCCACCTGTTTCAAATATTTATACTGGGAAGAAAAATCTGAAAGGAACAATAAAAGGAACTGGATACTTGTGCAGCTGTGACAACTGCAATCATTCAAAGGTAATTTGTGAAGTTCAAGATTCTATTTTTTTTTTTTGTAAAAGTCAACGAAGGAACATACAGCTCTAATTATATTATTAAGGTTGCTAAGTGAGTTTGAGTTGCATTGCACAGGCTCTCAATGCTTATGAGTTTGAGAGGCACGCTGGCTGCAAAACCAAACATCCAAATAACCACATTTACTTTGAGAATGGTAAAACTATCTATGCTGTTGTTCAAGAGCTAAAGAACACCCCTCAGGAGATGTTATTTGATGCAATTCACAATGTGACTGGCTCTCCCATTAATCAGA
AGAACTTTCGTGTTTGGAAAGGTGATTCGTATCATTCCATAAAATATTTACGCGAGTACATCAGTAAAAATATGAAGGATTTATGTTGAATTTTCTCACTTTTTCTTTCATTCATACTACACAGCATCTTATCAAGCAGCCACTCTTGAACTCCAGCGTATTTATGGAAAAGATGAAATAACTATGCCTTCTTGAGATGGGGATTTACCTTTTTAAATTTTGAGGTAGGTTGTGCTTAGCATTGTTCAGATTTGACCTAGGATAAATGATATCTAGTGCGAAACATCTCGTGGACTTGTCGCTCCAAGGTTTGATGTGTATGCTGGTGGTGTTCCATCTATTCATGATATGTATTCTCATACTTGTTTTATTTTGTTTGTTTATTAATTTATTTTGCTGATACGTGTATTATACGACTTGTAGACCTCAAAGCTATACCCACCGGGTATCTAATCACATGATGGATTTGTAAACAAGCCAGTAGTTGCAAGCAAAGATGATATGGATGAGTGGCGATTGGTGAAGGCACGATTCGGTCGAGTCAGCTACCTGCCGACTTATGAAGACATCATTTTGATCACAAGTGTGCAATGCAATGCTGAGAATGCCTTCTGCTTTTGCTTTCATGTCGCATTTTGTTCCTTATATTATCCCTTCTTGGCAGATTTGTGTATTGAAGCATTTTAGCACAATAGAATAGTATGCAACAACCATCTGGCCCTTTGAAGCTCAAAAACCAGTGTCAGAGCTTTGGTTCCGACAAAAACAGAAACTATTATTATATAGTTACAAATACATTTTGGGAAATGAAAATGCAGGGTACTTTAAACACATTCAATTGCTTCATATATTAACAGATACCAGGACTAGAAGTTGTCATCATTTTGTGTCCTCTTTGCCAACATCAAGCGCTGTGAACTCTTGACTCGATTGAACGGACTTCGAGCCCTCCTCACCCGTTGTTGCAGTCATTTGTTGCTTACCAGATGATATCTTATCAGATTCTGGTTTAACTAAAAGCTGATCCTTGCTTTTGCCCCACAAAACCAGGTACAACCCTGTGATTATCACCACTGCTCCAACAATCCTGCAAAAGAGTTGAAATGAATTCTCTTATATTGCATTGTTCTTTCAGTTTTGTTCAATGGAATTTTGGAATAATGAATTTTCTTCTTACCTTCCCAGGAACATGATCTCAGACAAGATGAAGGAGCTCAAGATTGCAACAAGAATCAAGCTCAGAGGATTGAATGCAGTGACAAAAACAGGCCCTTTTATTTTCATCACTACTCCTTGAATATAATAAGTTACCCCTGAACACATTACTCCCTGCATCAAAACATAACTTTTCATTCTCATTCCCATGGAATCAACAACAAACATAGATACTTACAGCATAAACCACTGCCAGAAGCTGTCTATCAAAGTGCACAGCCCAAGCAGAAGGGTTCCCCCTCTCCATGACCAAAGCCACCCCACAGCCACCAATGGTACCCACCAAGCAAATCAACGCTGTAAGAGACAGCTCGGCCGGGTACGATTTCAATGTAATCATCTAACACCATTCAACATCTGAGTTAGTAATTTCCATTACAAAATAAGAGTTTCTTTTTGAAGAACTCTAATTATTTACCTGAAGAATGATGAAAGCTGACCAGGAAATGTCGCCAATGGCAATCATAAGGGAGCCCTTGATTGGGCTTTGGTGATTTGCAGAACCAGCTGAAGAAGAAGATGAAGAAGCAGAGGGGTGGTAGGGCTTTGTCCATGGCAGATTCAACATGGGTCCTGTTATGAAGGTCATAATCATGGCTCCTCCTACTGTCACCATGGTTCCTATGATTTTTGCTTGGCTTCCCCTTTTCAAAATATTCACTTTCTCAAGCCTGCCATTATAAACATTCATATCTCATTCATTCAATCAATCTTATGCATCATTTGTTC

mRNA sequence

TGTCGCTATCCAGTAGTCCCTCGGAAGTCCCTTTCTATTTCTTCTCACCCAGAGGAACATCGCGTGTTCCAACCTCCCCGGCAATTTCTCCGGCGAAGGTCAGCTCACTGAGAATCTATGATAGTTGTGAAATTTCGGAGGTTTCAGTCTCTCATTGTTGGTTTCTTTGATTTTGATCGATTGATTTTCGAAATCTTTGAGGTTTGCGCTTTCCGTGGTTCTTTGTTGTGTTTTTCATCGTCTTTCCAGCATAAAAGCTTTTGGATACCAAGGGATGCTGGATGTTTAACTGATGGAGAGATGAACTATGATAGCTCCTCCAGAATCGAAGCCAAACGTGGCCATCAATGGTTCATGGATGGCAGTGCACCAGAGCTATTCAGTAGCAAGAAGCAAGCAATAGAAACCGTAAATACTAGACCTGTTCCAGGAGTTCCACACATGAATGTTTCTCCCTGGGAAAACACTTCAAGTTTTCAGTCAGTTCCTGGGCACTTCACTGATCGTCTTTTCGGCTCTGAGCCAGTACGAACTGTCAACTTGGTCGATAGAGGCATCACTGTTGGAAATGCAAATATGGACATGGGAAGAAAGGAGTATGAGAATCATTTTGCAAACAACCCATCCGTTGGCTTGTCCATGTCCCAATCCATTGAGGATTCCTCATCGTGCCTCAATTTTGGTGGAATTAGAAAAGTTAAAGTCAATCAAGTCAGGGATCCTGACATTGGCATGCCTGCCTCTTTGGGACATGCCTATAGTAGGGGTGATAATGGTACAATTTCAATGGGTACAACATTTAACAAAAACCACGAGCACGCCATTTCATTGGGCCATACATATAATAGTCGAGATGAGAATGCCATCTCGGTTGGCCCTGCATATCACAAGACAGACGACAGTTTCATTTCCATGGGTCATGCTTTCAGCAAAGGTGATGAGAATTTTATTACAATTGGTCCTAACTATAGCAAGGGAGATAATAGCATCCTATCGATGAGCCAGCCTTTTGACAAAGGGGATGACTCTTTTATTTCAATGGGTCAATCTTATGAGAAGGCAGAAGGTAATATCATTTCTTTTGGTGCCTCCTACAATAAAGGGCATGAAAATTTTATCTCAATGGGGCCAACCTATAGTAAGGCAGGTGATACTTTCATTTCAATGGCGCCCTCCTATAATAAGGGAAATGATGATAACTTGTCCATGGGTCCAACCTTTGACAAGGTAAACTCGGACGTTGTACATGTCGGTCCTAAATTTGACAAAGCCGATTCTGGTTCGGTGTCAATGACTCACAACTATCATAAAGGTGAGACTAATACCATATCTTTTGGAGGCTTTGATGATGAGAATGGGACAGGTAATCCTTCTGGTGGGATCATTAGCAGTTATGACTTGTTAATGGCGAATCAGGCTTCTGCCCAAGCATCAGAAGTATCAACCATGAGAGATTCAGTTCATCCCAATGTGGAAGTGAACATTAACAATGTCATAAAAGTTGATGCTAAAATCGACACAAGTTCCAAGATTAGAGAGCCGAGGACGACTAAGAAGGTGTCACCCAATAGCTTCCCTTCAAATGTTAAAAGCTTGCTCTCAACTGGTATGCTTGATGGGGTTCCTGTGAAATATGTTTCTTGGTCACGGGAGTCTTTTCTGCCACCTGTTTCAAATATTTATACTGGGAAGAAAAATCTGAAAGGAACAATAAAAGGAACTGGATACTTGTGCAGCTGTGACAACTGCAATCATTCAAAGGCTCTCAATGCTTATGAGTTTGAGAGGCACGCTGGCTGCAAAACCAAACATCCAAATAACCACATTTACTTTGAGAATGGTAAAACTATCTATGCTGTTGTTCAAGAGCTAAAGAACACCCCTCAGGAGATGTTATTTGATGCAATTCACAATGTGACTGGCTCTCCCATTAATCAGAAGAACTTTCGTGTTTGGAAAGCATCTTATCAAGCAGCCACTCTTGAACTCCAGCGTATTTA
TGGAAAAGATGAAATAACTATGCCTTCTTGAGATGGGGATTTACCTTTTTAAATTTTGAGACCTCAAAGCTATACCCACCGGGTATCTAATCACATGATGGATTTGTAAACAAGCCAGTAGTTGCAAGCAAAGATGATATGGATGAGTGGCGATTGGTGAAGGCACGATTCGGTCGAGTCAGCTACCTGCCGACTTATGAAGACATCATTTTGATCACAAGTGTGCAATGCAATGCTGAGAATGCCTTCTGCTTTTGCTTTCATGTCGCATTTTGTTCCTTATATTATCCCTTCTTGGCAGATTTGTGTATTGAAGCATTTTAGCACAATAGAATAGTATGCAACAACCATCTGGCCCTTTGAAGCTCAAAAACCAGTGTCAGAGCTTTGGTTCCGACAAAAACAGAAACTATTATTATATAGTTACAAATACATTTTGGGAAATGAAAATGCAGGGTACTTTAAACACATTCAATTGCTTCATATATTAACAGATACCAGGACTAGAAGTTGTCATCATTTTGTGTCCTCTTTGCCAACATCAAGCGCTGTGAACTCTTGACTCGATTGAACGGACTTCGAGCCCTCCTCACCCGTTGTTGCAGTCATTTGTTGCTTACCAGATGATATCTTATCAGATTCTGGTTTAACTAAAAGCTGATCCTTGCTTTTGCCCCACAAAACCAGGTACAACCCTGTGATTATCACCACTGCTCCAACAATCCTGCAAAAGAGTTGAAATGAATTCTCTTATATTGCATTGTTCTTTCAGTTTTGTTCAATGGAATTTTGGAATAATGAATTTTCTTCTTACCTTCCCAGGAACATGATCTCAGACAAGATGAAGGAGCTCAAGATTGCAACAAGAATCAAGCTCAGAGGATTGAATGCAGTGACAAAAACAGGCCCTTTTATTTTCATCACTACTCCTTGAATATAATAAGTTACCCCTGAACACATTACTCCCTGCATCAAAACATAACTTTTCATTCTCATTCCCATGGAATCAACAACAAACATAGATACTTACAGCATAAACCACTGCCAGAAGCTGTCTATCAAAGTGCACAGCCCAAGCAGAAGGGTTCCCCCTCTCCATGACCAAAGCCACCCCACAGCCACCAATGGTACCCACCAAGCAAATCAACGCTGTAAGAGACAGCTCGGCCGGGTACGATTTCAATGTAATCATCTAACACCATTCAACATCTGAGTTAGTAATTTCCATTACAAAATAAGAGTTTCTTTTTGAAGAACTCTAATTATTTACCTGAAGAATGATGAAAGCTGACCAGGAAATGTCGCCAATGGCAATCATAAGGGAGCCCTTGATTGGGCTTTGGTGATTTGCAGAACCAGCTGAAGAAGAAGATGAAGAAGCAGAGGGGTGGTAGGGCTTTGTCCATGGCAGATTCAACATGGGTCCTGTTATGAAGGTCATAATCATGGCTCCTCCTACTGTCACCATGGTTCCTATGATTTTTGCTTGGCTTCCCCTTTTCAAAATATTCACTTTCTCAAGCCTGCCATTATAAACATTCATATCTCATTCATTCAATCAATCTTATGCATCATTTGTTC

Coding sequence (CDS)

ATGATAGTTGTGAAATTTCGGAGGTTTCAGTCTCTCATTGTTGGTTTCTTTGATTTTGATCGATTGATTTTCGAAATCTTTGAGGTTTGCGCTTTCCGTGGTTCTTTGTTGTGTTTTTCATCGTCTTTCCAGCATAAAAGCTTTTGGATACCAAGGGATGCTGGATGTTTAACTGATGGAGAGATGAACTATGATAGCTCCTCCAGAATCGAAGCCAAACGTGGCCATCAATGGTTCATGGATGGCAGTGCACCAGAGCTATTCAGTAGCAAGAAGCAAGCAATAGAAACCGTAAATACTAGACCTGTTCCAGGAGTTCCACACATGAATGTTTCTCCCTGGGAAAACACTTCAAGTTTTCAGTCAGTTCCTGGGCACTTCACTGATCGTCTTTTCGGCTCTGAGCCAGTACGAACTGTCAACTTGGTCGATAGAGGCATCACTGTTGGAAATGCAAATATGGACATGGGAAGAAAGGAGTATGAGAATCATTTTGCAAACAACCCATCCGTTGGCTTGTCCATGTCCCAATCCATTGAGGATTCCTCATCGTGCCTCAATTTTGGTGGAATTAGAAAAGTTAAAGTCAATCAAGTCAGGGATCCTGACATTGGCATGCCTGCCTCTTTGGGACATGCCTATAGTAGGGGTGATAATGGTACAATTTCAATGGGTACAACATTTAACAAAAACCACGAGCACGCCATTTCATTGGGCCATACATATAATAGTCGAGATGAGAATGCCATCTCGGTTGGCCCTGCATATCACAAGACAGACGACAGTTTCATTTCCATGGGTCATGCTTTCAGCAAAGGTGATGAGAATTTTATTACAATTGGTCCTAACTATAGCAAGGGAGATAATAGCATCCTATCGATGAGCCAGCCTTTTGACAAAGGGGATGACTCTTTTATTTCAATGGGTCAATCTTATGAGAAGGCAGAAGGTAATATCATTTCTTTTGGTGCCTCCTACAATAAAGGGCATGAAAATTTTATCTCAATGGGGCCAACCTATAGTAAGGCAGGTGATACTTTCATTTCAATGGCGCCCTCCTATAATAAGGGAAATGATGATAACTTGTCCATGGGTCCAACCTTTGACAAGGTAAACTCGGACGTTGTACATGTCGGTCCTAAATTTGACAAAGCCGATTCTGGTTCGGTGTCAATGACTCACAACTATCATAAAGGTGAGACTAATACCATATCTTTTGGAGGCTTTGATGATGAGAATGGGACAGGTAATCCTTCTGGTGGGATCATTAGCAGTTATGACTTGTTAATGGCGAATCAGGCTTCTGCCCAAGCATCAGAAGTATCAACCATGAGAGATTCAGTTCATCCCAATGTGGAAGTGAACATTAACAATGTCATAAAAGTTGATGCTAAAATCGACACAAGTTCCAAGATTAGAGAGCCGAGGACGACTAAGAAGGTGTCACCCAATAGCTTCCCTTCAAATGTTAAAAGCTTGCTCTCAACTGGTATGCTTGATGGGGTTCCTGTGAAATATGTTTCTTGGTCACGGGAGTCTTTTCTGCCACCTGTTTCAAATATTTATACTGGGAAGAAAAATCTGAAAGGAACAATAAAAGGAACTGGATACTTGTGCAGCTGTGACAACTGCAATCATTCAAAGGCTCTCAATGCTTATGAGTTTGAGAGGCACGCTGGCTGCAAAACCAAACATCCAAATAACCACATTTACTTTGAGAATGGTAAAACTATCTATGCTGTTGTTCAAGAGCTAAAGAACACCCCTCAGGAGATGTTATTTGATGCAATTCACAATGTGACTGGCTCTCCCATTAATCAGAAGAACTTTCGTGTTTGGAAAGCATCTTATCAAGCAGCCACTCTTGAACTCCAGCGTATTTATGGAAAAGATGAAATAACTATGCCTTCTTGA

Protein sequence

MIVVKFRRFQSLIVGFFDFDRLIFEIFEVCAFRGSLLCFSSSFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTRPVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEYENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGDNGTISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIGPNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYSKAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGETNTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINNVIKVDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPVSNIYTGKKNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITMPS
Homology
BLAST of Cp4.1LG01g10540 vs. NCBI nr
Match: XP_023545327.1 (uncharacterized protein LOC111804766 [Cucurbita pepo subsp. pepo] >XP_023545335.1 uncharacterized protein LOC111804766 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1177 bits (3044), Expect = 0.0
Identity = 583/596 (97.82%), Positives = 583/596 (97.82%), Query Frame = 0

Query: 42  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR 101
           SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR
Sbjct: 2   SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR 61

Query: 102 PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY 161
           PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY
Sbjct: 62  PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY 121

Query: 162 ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGDNGT 221
           ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGDNGT
Sbjct: 122 ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGDNGT 181

Query: 222 ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG 281
           ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG
Sbjct: 182 ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG 241

Query: 282 PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS 341
           PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
Sbjct: 242 PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS 301

Query: 342 KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGET 401
           KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGET
Sbjct: 302 KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGET 361

Query: 402 NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINNVIK 461
           NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINNVIK
Sbjct: 362 NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINNVIK 421

Query: 462 VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPVSNI 521
           VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE         
Sbjct: 422 VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE--------- 481

Query: 522 YTGKKNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 581
               KNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV
Sbjct: 482 ----KNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 541

Query: 582 VQELKNTPQEMLFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITMPS 637
           VQELKNTPQEMLFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITMPS
Sbjct: 542 VQELKNTPQEMLFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITMPS 584
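The percentages on each HSP statistics line appear to be the listed count divided by the alignment length, rounded to two decimals. The Python sketch below recomputes them from the first hit's line as a sanity check. The helper name `recompute` and the regular expression are illustrative assumptions, not part of any BLAST tooling.

```python
import re

# Matches fragments such as "Identity = 583/596 (97.82%)".
# The label is captured generically, so any spelling of it is accepted.
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \((\d+\.\d+)%\)")


def recompute(stats_line: str):
    """Return (label, reported %, recomputed %) for each count/length pair."""
    rows = []
    for label, count, length, reported in STAT.findall(stats_line):
        # Assumption: the percentage is count / alignment length, to 2 decimals.
        recomputed = round(100.0 * int(count) / int(length), 2)
        rows.append((label, float(reported), recomputed))
    return rows


rows = recompute(
    "Identity = 583/596 (97.82%), Positives = 583/596 (97.82%), Query Frame = 0"
)
for label, reported, recomputed in rows:
    print(label, reported, recomputed)
```

The same function can be applied to the statistics line of any other hit in this report. A mismatch between the reported and recomputed values would suggest a transcription problem in the line.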

BLAST of Cp4.1LG01g10540 vs. NCBI nr
Match: KAG7031622.1 (hypothetical protein SDJN02_05663, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1165 bits (3014), Expect = 0.0
Identity = 576/598 (96.32%), Positives = 579/598 (96.82%), Query Frame = 0

Query: 40  SSSFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVN 99
           + SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVN
Sbjct: 94  AKSFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVN 153

Query: 100 TRPVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRK 159
           TRPVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRK
Sbjct: 154 TRPVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRK 213

Query: 160 EYENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGDN 219
           EYENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGM ASLGHAYSRGDN
Sbjct: 214 EYENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMSASLGHAYSRGDN 273

Query: 220 GTISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFIT 279
           GTISMG TFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFIT
Sbjct: 274 GTISMGATFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFIT 333

Query: 280 IGPNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPT 339
           IGPNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPT
Sbjct: 334 IGPNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPT 393

Query: 340 YSKAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKG 399
           YSKAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSM HNYHKG
Sbjct: 394 YSKAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMIHNYHKG 453

Query: 400 ETNTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINNV 459
           E NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINN 
Sbjct: 454 EINTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINNA 513

Query: 460 IKVDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPVS 519
           IKVDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE       
Sbjct: 514 IKVDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE------- 573

Query: 520 NIYTGKKNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIY 579
                 KNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIY
Sbjct: 574 ------KNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIY 633

Query: 580 AVVQELKNTPQEMLFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITMPS 637
           AVVQELKNTPQEMLFDAIHNVTGSPINQKNFR+WKASYQAATLELQRIYGKDE+TMPS
Sbjct: 634 AVVQELKNTPQEMLFDAIHNVTGSPINQKNFRIWKASYQAATLELQRIYGKDEVTMPS 678

BLAST of Cp4.1LG01g10540 vs. NCBI nr
Match: KAG6601008.1 (hypothetical protein SDJN03_06241, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1165 bits (3013), Expect = 0.0
Identity = 576/596 (96.64%), Positives = 578/596 (96.98%), Query Frame = 0

Query: 42  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR 101
           SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR
Sbjct: 2   SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR 61

Query: 102 PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY 161
           PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY
Sbjct: 62  PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY 121

Query: 162 ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGDNGT 221
           ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGM ASLGHAYSRGDNGT
Sbjct: 122 ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMSASLGHAYSRGDNGT 181

Query: 222 ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG 281
           ISMG TFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG
Sbjct: 182 ISMGATFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG 241

Query: 282 PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS 341
           PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
Sbjct: 242 PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS 301

Query: 342 KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGET 401
           KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSM HNYHKGE 
Sbjct: 302 KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMIHNYHKGEI 361

Query: 402 NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINNVIK 461
           NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINN IK
Sbjct: 362 NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINNAIK 421

Query: 462 VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPVSNI 521
           VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE         
Sbjct: 422 VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE--------- 481

Query: 522 YTGKKNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 581
               KNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV
Sbjct: 482 ----KNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 541

Query: 582 VQELKNTPQEMLFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITMPS 637
           VQELKNTPQEMLFDAIHNVTGSPINQKNFR+WKASYQAATLELQRIYGKDE+TMPS
Sbjct: 542 VQELKNTPQEMLFDAIHNVTGSPINQKNFRIWKASYQAATLELQRIYGKDEVTMPS 584

BLAST of Cp4.1LG01g10540 vs. NCBI nr
Match: XP_022956892.1 (uncharacterized protein LOC111458442 [Cucurbita moschata] >XP_022956893.1 uncharacterized protein LOC111458442 [Cucurbita moschata] >XP_022956894.1 uncharacterized protein LOC111458442 [Cucurbita moschata])

HSP 1 Score: 1160 bits (3001), Expect = 0.0
Identity = 575/596 (96.48%), Positives = 579/596 (97.15%), Query Frame = 0

Query: 42  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR 101
           SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR
Sbjct: 2   SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR 61

Query: 102 PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY 161
           PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY
Sbjct: 62  PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY 121

Query: 162 ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGDNGT 221
           ENHFANNPSVGLS SQSIEDSSSCLNFGGIRKVKVNQVRDPDIGM ASLGHAYSRGDNGT
Sbjct: 122 ENHFANNPSVGLSTSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMSASLGHAYSRGDNGT 181

Query: 222 ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG 281
           ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG
Sbjct: 182 ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG 241

Query: 282 PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS 341
           PNYSKGDN+ILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
Sbjct: 242 PNYSKGDNNILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS 301

Query: 342 KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGET 401
           KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGET
Sbjct: 302 KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGET 361

Query: 402 NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINNVIK 461
           NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNIN  +K
Sbjct: 362 NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNIN--VK 421

Query: 462 VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPVSNI 521
           VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE         
Sbjct: 422 VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE--------- 481

Query: 522 YTGKKNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 581
               KNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV
Sbjct: 482 ----KNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 541

Query: 582 VQELKNTPQEMLFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITMPS 637
           VQELKNTPQEMLFDAIHNVTGSPINQKNFR+WKASYQAATLELQRIYGKDE+TMPS
Sbjct: 542 VQELKNTPQEMLFDAIHNVTGSPINQKNFRIWKASYQAATLELQRIYGKDEVTMPS 582

BLAST of Cp4.1LG01g10540 vs. NCBI nr
Match: XP_022994977.1 (uncharacterized protein LOC111490611 [Cucurbita maxima] >XP_022995052.1 uncharacterized protein LOC111490611 [Cucurbita maxima])

HSP 1 Score: 1154 bits (2986), Expect = 0.0
Identity = 569/596 (95.47%), Positives = 576/596 (96.64%), Query Frame = 0

Query: 42  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR 101
           SFQHKSFWIPRDAGCLTDGEMNYDSSS+IEAKRGHQWFMDGSAPELFSSKKQAIE VNTR
Sbjct: 2   SFQHKSFWIPRDAGCLTDGEMNYDSSSKIEAKRGHQWFMDGSAPELFSSKKQAIEPVNTR 61

Query: 102 PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY 161
           PVPGVPHMNVSPWENTSSFQSVPGHFTDRLF SEPVRTVNLVDRGITVGNANMDMGRKEY
Sbjct: 62  PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFSSEPVRTVNLVDRGITVGNANMDMGRKEY 121

Query: 162 ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGDNGT 221
           ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGM +SLGHAYSRGDNGT
Sbjct: 122 ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMSSSLGHAYSRGDNGT 181

Query: 222 ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG 281
           ISMGTTFNKNHEHAISLGHTYNSRDENAISVGP YHKTDDSFISMGHAFSKGDENFITIG
Sbjct: 182 ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPVYHKTDDSFISMGHAFSKGDENFITIG 241

Query: 282 PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS 341
           PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
Sbjct: 242 PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS 301

Query: 342 KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGET 401
           KAGDTFISMAPSYNKGND+NLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGE+
Sbjct: 302 KAGDTFISMAPSYNKGNDENLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGES 361

Query: 402 NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINNVIK 461
           NTISFGGFD+ENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPN EVNINN IK
Sbjct: 362 NTISFGGFDNENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNAEVNINNAIK 421

Query: 462 VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPVSNI 521
           VDAKIDTSSKIREPRTTKKV PNSFPSNVKSLLSTGMLDGVPVKYVSWSRE         
Sbjct: 422 VDAKIDTSSKIREPRTTKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE--------- 481

Query: 522 YTGKKNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 581
               KNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV
Sbjct: 482 ----KNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 541

Query: 582 VQELKNTPQEMLFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITMPS 637
           VQELKNTPQEMLFDAIHNVTGSPINQKNFR+WKASYQAATLELQRIYGKDE+TMPS
Sbjct: 542 VQELKNTPQEMLFDAIHNVTGSPINQKNFRIWKASYQAATLELQRIYGKDEVTMPS 584

BLAST of Cp4.1LG01g10540 vs. ExPASy TrEMBL
Match: A0A6J1GYI5 (uncharacterized protein LOC111458442 OS=Cucurbita moschata OX=3662 GN=LOC111458442 PE=4 SV=1)

HSP 1 Score: 1160 bits (3001), Expect = 0.0
Identity = 575/596 (96.48%), Positives = 579/596 (97.15%), Query Frame = 0

Query: 42  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR 101
           SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR
Sbjct: 2   SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR 61

Query: 102 PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY 161
           PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY
Sbjct: 62  PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY 121

Query: 162 ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGDNGT 221
           ENHFANNPSVGLS SQSIEDSSSCLNFGGIRKVKVNQVRDPDIGM ASLGHAYSRGDNGT
Sbjct: 122 ENHFANNPSVGLSTSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMSASLGHAYSRGDNGT 181

Query: 222 ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG 281
           ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG
Sbjct: 182 ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG 241

Query: 282 PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS 341
           PNYSKGDN+ILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
Sbjct: 242 PNYSKGDNNILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS 301

Query: 342 KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGET 401
           KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGET
Sbjct: 302 KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGET 361

Query: 402 NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINNVIK 461
           NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNIN  +K
Sbjct: 362 NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNIN--VK 421

Query: 462 VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPVSNI 521
           VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE         
Sbjct: 422 VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE--------- 481

Query: 522 YTGKKNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 581
               KNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV
Sbjct: 482 ----KNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 541

Query: 582 VQELKNTPQEMLFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITMPS 637
           VQELKNTPQEMLFDAIHNVTGSPINQKNFR+WKASYQAATLELQRIYGKDE+TMPS
Sbjct: 542 VQELKNTPQEMLFDAIHNVTGSPINQKNFRIWKASYQAATLELQRIYGKDEVTMPS 582

BLAST of Cp4.1LG01g10540 vs. ExPASy TrEMBL
Match: A0A6J1JXF6 (uncharacterized protein LOC111490611 OS=Cucurbita maxima OX=3661 GN=LOC111490611 PE=4 SV=1)

HSP 1 Score: 1154 bits (2986), Expect = 0.0
Identity = 569/596 (95.47%), Positives = 576/596 (96.64%), Query Frame = 0

Query: 42  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR 101
           SFQHKSFWIPRDAGCLTDGEMNYDSSS+IEAKRGHQWFMDGSAPELFSSKKQAIE VNTR
Sbjct: 2   SFQHKSFWIPRDAGCLTDGEMNYDSSSKIEAKRGHQWFMDGSAPELFSSKKQAIEPVNTR 61

Query: 102 PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY 161
           PVPGVPHMNVSPWENTSSFQSVPGHFTDRLF SEPVRTVNLVDRGITVGNANMDMGRKEY
Sbjct: 62  PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFSSEPVRTVNLVDRGITVGNANMDMGRKEY 121

Query: 162 ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGDNGT 221
           ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGM +SLGHAYSRGDNGT
Sbjct: 122 ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMSSSLGHAYSRGDNGT 181

Query: 222 ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG 281
           ISMGTTFNKNHEHAISLGHTYNSRDENAISVGP YHKTDDSFISMGHAFSKGDENFITIG
Sbjct: 182 ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPVYHKTDDSFISMGHAFSKGDENFITIG 241

Query: 282 PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS 341
           PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS
Sbjct: 242 PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS 301

Query: 342 KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGET 401
           KAGDTFISMAPSYNKGND+NLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGE+
Sbjct: 302 KAGDTFISMAPSYNKGNDENLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGES 361

Query: 402 NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINNVIK 461
           NTISFGGFD+ENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPN EVNINN IK
Sbjct: 362 NTISFGGFDNENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNAEVNINNAIK 421

Query: 462 VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPVSNI 521
           VDAKIDTSSKIREPRTTKKV PNSFPSNVKSLLSTGMLDGVPVKYVSWSRE         
Sbjct: 422 VDAKIDTSSKIREPRTTKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE--------- 481

Query: 522 YTGKKNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 581
               KNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV
Sbjct: 482 ----KNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 541

Query: 582 VQELKNTPQEMLFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITMPS 637
           VQELKNTPQEMLFDAIHNVTGSPINQKNFR+WKASYQAATLELQRIYGKDE+TMPS
Sbjct: 542 VQELKNTPQEMLFDAIHNVTGSPINQKNFRIWKASYQAATLELQRIYGKDEVTMPS 584

BLAST of Cp4.1LG01g10540 vs. ExPASy TrEMBL
Match: A0A5A7U4X7 (TDBD domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold88G00130 PE=4 SV=1)

HSP 1 Score: 1082 bits (2797), Expect = 0.0
Identity = 534/596 (89.60%), Positives = 556/596 (93.29%), Query Frame = 0

Query: 42  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR 101
           SFQHKSFWIPRDAGCLTDGEMNYDSSSRIE KRGHQWFMDGSAPELFSSKKQAIE VN+R
Sbjct: 19  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRGHQWFMDGSAPELFSSKKQAIEAVNSR 78

Query: 102 PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY 161
           PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEP+RTVNLVDRGI+VGNANMDMGRKE+
Sbjct: 79  PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPIRTVNLVDRGISVGNANMDMGRKEF 138

Query: 162 ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGDNGT 221
           ENHF NNPSVGLSMSQSIED SSCLNFGGIRKVKVNQVRDPD+GMPASLGH YSRGDN T
Sbjct: 139 ENHFTNNPSVGLSMSQSIEDPSSCLNFGGIRKVKVNQVRDPDVGMPASLGHGYSRGDNCT 198

Query: 222 ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG 281
           ISMG+ FNKNHE+ ISLG TYNSRDENAISVGPAYHKTDD+FISMGHAFSKGD +FITIG
Sbjct: 199 ISMGSGFNKNHENTISLGQTYNSRDENAISVGPAYHKTDDNFISMGHAFSKGDGSFITIG 258

Query: 282 PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS 341
            NYSKGDNSILSM+QPFDKGDDSFISMGQSYEKAEGNIISF ASYNKG ENFISMGP YS
Sbjct: 259 HNYSKGDNSILSMNQPFDKGDDSFISMGQSYEKAEGNIISF-ASYNKGQENFISMGPAYS 318

Query: 342 KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGET 401
           KAGDTFISMA S+NKGNDDNLSM PT+DKVNSD+VHVGPKFDKADSG+VSM HNYHKGE+
Sbjct: 319 KAGDTFISMASSFNKGNDDNLSMAPTYDKVNSDIVHVGPKFDKADSGAVSMAHNYHKGES 378

Query: 402 NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINNVIK 461
           NTISFGGFDDENGT NPSGGIISSYDLLMANQASAQASEVST+RDSV PNVEVNINN IK
Sbjct: 379 NTISFGGFDDENGTDNPSGGIISSYDLLMANQASAQASEVSTLRDSVDPNVEVNINNAIK 438

Query: 462 VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPVSNI 521
           VD KIDT+SK +EPR +KKV PNSFPSNVKSLLSTGMLDGVPVKYVSWSRE         
Sbjct: 439 VDGKIDTNSKNKEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE--------- 498

Query: 522 YTGKKNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 581
               KNLKG IKGTGYLCSC+NCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV
Sbjct: 499 ----KNLKGIIKGTGYLCSCENCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 558

Query: 582 VQELKNTPQEMLFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITMPS 637
           VQELKNTPQEMLFDAI NVTGSPINQKNFR+WKASYQAATLELQRIYGKDE+ MPS
Sbjct: 559 VQELKNTPQEMLFDAIQNVTGSPINQKNFRIWKASYQAATLELQRIYGKDEVIMPS 600

BLAST of Cp4.1LG01g10540 vs. ExPASy TrEMBL
Match: A0A1S3CFN8 (uncharacterized protein LOC103500200 OS=Cucumis melo OX=3656 GN=LOC103500200 PE=4 SV=1)

HSP 1 Score: 1080 bits (2794), Expect = 0.0
Identity = 533/596 (89.43%), Positives = 556/596 (93.29%), Query Frame = 0

Query: 42  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR 101
           SFQHKSFWIPRDAGCLTDGEMNYDSSSRIE KRGHQWFMDGSAPELFSSKKQAIE VN+R
Sbjct: 2   SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRGHQWFMDGSAPELFSSKKQAIEAVNSR 61

Query: 102 PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY 161
           PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEP+RTVNLVDRGI+VGNANMDMGRKE+
Sbjct: 62  PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPIRTVNLVDRGISVGNANMDMGRKEF 121

Query: 162 ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGDNGT 221
           ENHF NNPSVGLSMSQSIED SSCLNFGGIRKVKVNQVRDPD+GMPASLGH YSRGDN T
Sbjct: 122 ENHFTNNPSVGLSMSQSIEDPSSCLNFGGIRKVKVNQVRDPDVGMPASLGHGYSRGDNCT 181

Query: 222 ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG 281
           ISMG+ FNKNHE+ ISLG TYNSRDENAISVGPAYHKTDD+FISMGHAFSKGD +FITIG
Sbjct: 182 ISMGSGFNKNHENTISLGQTYNSRDENAISVGPAYHKTDDNFISMGHAFSKGDGSFITIG 241

Query: 282 PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS 341
            NYSKGDNSILSM+QPFDKGDDSFISMGQSYEKAEGNIISF ASYNKG ENFISMGP YS
Sbjct: 242 HNYSKGDNSILSMNQPFDKGDDSFISMGQSYEKAEGNIISF-ASYNKGQENFISMGPAYS 301

Query: 342 KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGET 401
           KAGDTFISMA S+NKGNDDNLSM PT+DKVNSD+VHVGPKFDKADSG+VSM HNYHKGE+
Sbjct: 302 KAGDTFISMASSFNKGNDDNLSMAPTYDKVNSDIVHVGPKFDKADSGAVSMAHNYHKGES 361

Query: 402 NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINNVIK 461
           NTISFGGFDDENGT NPSGGIISSYDLLMANQASAQASEVST+RDSV PNVEVNINN IK
Sbjct: 362 NTISFGGFDDENGTDNPSGGIISSYDLLMANQASAQASEVSTLRDSVDPNVEVNINNAIK 421

Query: 462 VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPVSNI 521
           VD KIDT+SK +EPR +KKV PNSFPSNVKSLLSTGMLDGVPVKYVSWSRE         
Sbjct: 422 VDGKIDTNSKNKEPRMSKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE--------- 481

Query: 522 YTGKKNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 581
               KNLKG IKGTGYLCSC+NCNH+KALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV
Sbjct: 482 ----KNLKGIIKGTGYLCSCENCNHAKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 541

Query: 582 VQELKNTPQEMLFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITMPS 637
           VQELKNTPQEMLFDAI NVTGSPINQKNFR+WKASYQAATLELQRIYGKDE+ MPS
Sbjct: 542 VQELKNTPQEMLFDAIQNVTGSPINQKNFRIWKASYQAATLELQRIYGKDEVIMPS 583

BLAST of Cp4.1LG01g10540 vs. ExPASy TrEMBL
Match: A0A6J1CEN2 (uncharacterized protein LOC111010042 OS=Momordica charantia OX=3673 GN=LOC111010042 PE=4 SV=1)

HSP 1 Score: 1079 bits (2790), Expect = 0.0
Identity = 530/596 (88.93%), Positives = 557/596 (93.46%), Query Frame = 0

Query: 42  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR 101
           SFQHKSFWIPRDAGCLTDGEMNYDSSSRIE KRGHQWFMDG+A ELFSSKKQAIETVN+R
Sbjct: 2   SFQHKSFWIPRDAGCLTDGEMNYDSSSRIETKRGHQWFMDGNAQELFSSKKQAIETVNSR 61

Query: 102 PVPGVPHMNVSPWENTSSFQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEY 161
           PVPGVPHMNVSPW+NTSSFQSVPG FTDRLFGSEP+RTVNLVDRGITVGNANMDMGRKE+
Sbjct: 62  PVPGVPHMNVSPWDNTSSFQSVPGPFTDRLFGSEPIRTVNLVDRGITVGNANMDMGRKEF 121

Query: 162 ENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGDNGT 221
           ENHFANNPSVGLSMSQSIEDSSSCL+FGGIRKVKVNQVRDPDIGM ASLGHAY+RGDNGT
Sbjct: 122 ENHFANNPSVGLSMSQSIEDSSSCLSFGGIRKVKVNQVRDPDIGMSASLGHAYNRGDNGT 181

Query: 222 ISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIG 281
           ISMGTTFNKNHE+AISLG TYNSRD++ ISVGPAYHKTDD+FISMGH FSKGD NFITIG
Sbjct: 182 ISMGTTFNKNHENAISLGQTYNSRDDSTISVGPAYHKTDDNFISMGHTFSKGDGNFITIG 241

Query: 282 PNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYS 341
            NYSKGD+SILSMSQPFDKGDD+FISMGQSYEKA+GNIISFGASYNKGHENFISMGPTYS
Sbjct: 242 HNYSKGDSSILSMSQPFDKGDDTFISMGQSYEKADGNIISFGASYNKGHENFISMGPTYS 301

Query: 342 KAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGET 401
           K GDTFISMA SYNKGNDD LSMGPT+DKV+SD+VHVGPK+DKADSGS+SM HNYHKGE+
Sbjct: 302 KGGDTFISMASSYNKGNDDTLSMGPTYDKVDSDIVHVGPKYDKADSGSLSMAHNYHKGES 361

Query: 402 NTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINNVIK 461
           NTISFGGFDDEN T NPSGGIISSYDLLMANQASAQASEVST+RDSV PN E+N+NN  K
Sbjct: 362 NTISFGGFDDENATDNPSGGIISSYDLLMANQASAQASEVSTLRDSVDPNAELNVNNAPK 421

Query: 462 VDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPVSNI 521
           +DAKIDTSSK +EPRTTKKV PNSFPSNVKSLLSTGMLDGVPVKYVSWSRE         
Sbjct: 422 LDAKIDTSSKNKEPRTTKKVPPNSFPSNVKSLLSTGMLDGVPVKYVSWSRE--------- 481

Query: 522 YTGKKNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 581
               KNLKG IKGTGYLCSCDNC  SKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV
Sbjct: 482 ----KNLKGIIKGTGYLCSCDNCKQSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAV 541

Query: 582 VQELKNTPQEMLFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITMPS 637
           VQELKNTPQEMLFDAI NVTGSPINQKNFR+WKASYQAATLELQRIYGKDE+ MPS
Sbjct: 542 VQELKNTPQEMLFDAIQNVTGSPINQKNFRIWKASYQAATLELQRIYGKDEVIMPS 584

BLAST of Cp4.1LG01g10540 vs. TAIR 10
Match: AT5G13660.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G59830.2); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 362.1 bits (928), Expect = 9.3e-100
Identity = 232/585 (39.66%), Positives = 315/585 (53.85%), Query Frame = 0

Query: 61  EMNYDSSSRIEAKRG-HQWFMDGSAPELFSSKKQAIETVNTRPVPGVPHMNVSPWENTSS 120
           E+ Y  SSR+E KR  HQW  + S+ ELFS+K+Q +  ++        HMN+SPW+ +  
Sbjct: 14  EIPYSGSSRMELKRSHHQWLTEESSSELFSNKRQQVVEIDA-------HMNLSPWDTS-- 73

Query: 121 FQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEYENHFANNPSVGLSMSQSI 180
              VP HFTD LF    +   + + R           GR   E       S GL ++   
Sbjct: 74  --LVPSHFTDCLFDDPAIAHTSHLLRN----------GRNYTEEQCNPVSSFGLPLAH-- 133

Query: 181 EDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGDNGTISMGTTFNKNHEH-AISL 240
               S  N   I KV           +P  +   Y +G + +     +FN   E   +S 
Sbjct: 134 --HGSSFNLDTINKVS---------NVPEFMVQLYGQGISTSFETAPSFNSGQESTTLSF 193

Query: 241 GHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIGPNYSKGDNSILSMSQPF 300
           G T+++ D + I  G    KTD +FI     F+      + IG  Y KGD ++LS   P 
Sbjct: 194 GQTFSNTDRSFILPGQFASKTDGNFI---RNFNNEGVGVVPIGDYYDKGDENVLSTFHPL 253

Query: 301 DKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYSKAGDTFISMAPSYNKGN 360
           +KG ++F+SMGQS +KA+ NI S  +SYNKG ENF+ +          F++ + +Y+  N
Sbjct: 254 EKGVENFLSMGQSLQKADCNIFSVSSSYNKGQENFMPLLSCEQVPEYDFMTES-NYHNEN 313

Query: 361 DDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGETNTISFGGFDDENGTGNP 420
            + LS G +      ++  +    ++A   +  +     + E  T+SFG    E   G+ 
Sbjct: 314 ANALSAGQSSFTEGGEMTFMVSSQERAGQSNDQIRREDDRSE--TLSFGDCQKETAMGS- 373

Query: 421 SGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNIN----NVIKVDAKIDT--SSKI 480
                    + ++N     + + +  +D +H   E N++    N      ++DT    KI
Sbjct: 374 --------SVRVSNNYENFSHDPAITKDPLHIEAEENMSFECRNPPYASPRVDTLLVPKI 433

Query: 481 REPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPVSNIYTGKKNLKGTI 540
           ++ +T KK S N+FPSNVKSLLSTG+ DGV VKY SWSRE            ++NLKG I
Sbjct: 434 KDTKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSRE------------QRNLKGMI 493

Query: 541 KGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEM 600
           KGTGYLC C NC  +K LNAYEFE+HA CKTKHPNNHIYFENGKTIY VVQELKNTPQE 
Sbjct: 494 KGTGYLCGCGNCKLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEK 537

Query: 601 LFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITMPS 638
           LFDAI NVTGS IN KNF  WKASY  A LELQRIYGKD++T+ S
Sbjct: 554 LFDAIQNVTGSDINHKNFNTWKASYHVARLELQRIYGKDDVTLAS 537

BLAST of Cp4.1LG01g10540 vs. TAIR 10
Match: AT5G13660.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G59830.2); Has 135 Blast hits to 126 proteins in 13 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 135; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 361.3 bits (926), Expect = 1.6e-99
Identity = 232/585 (39.66%), Positives = 314/585 (53.68%), Query Frame = 0

Query: 61  EMNYDSSSRIEAKRG-HQWFMDGSAPELFSSKKQAIETVNTRPVPGVPHMNVSPWENTSS 120
           E+ Y  SSR+E KR  HQW  + S+ ELFS+K+Q +  ++        HMN+SPW+ +  
Sbjct: 14  EIPYSGSSRMELKRSHHQWLTEESSSELFSNKRQQVVEIDA-------HMNLSPWDTS-- 73

Query: 121 FQSVPGHFTDRLFGSEPVRTVNLVDRGITVGNANMDMGRKEYENHFANNPSVGLSMSQSI 180
              VP HFTD LF    +   + + R           GR   E       S GL ++   
Sbjct: 74  --LVPSHFTDCLFDDPAIAHTSHLLRN----------GRNYTEEQCNPVSSFGLPLAH-- 133

Query: 181 EDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGDNGTISMGTTFNKNHEH-AISL 240
               S  N   I KV           +P  +   Y +G + +     +FN   E   +S 
Sbjct: 134 --HGSSFNLDTINKVS---------NVPEFMVQLYGQGISTSFETAPSFNSGQESTTLSF 193

Query: 241 GHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFITIGPNYSKGDNSILSMSQPF 300
           G T+++ D + I  G    KTD +FI     F+      + IG  Y KGD ++LS   P 
Sbjct: 194 GQTFSNTDRSFILPGQFASKTDGNFI---RNFNNEGVGVVPIGDYYDKGDENVLSTFHPL 253

Query: 301 DKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGPTYSKAGDTFISMAPSYNKGN 360
           +KG ++F+SMGQS +KA+ NI S  +SYNKG ENF+ +          F++ + +Y+  N
Sbjct: 254 EKGVENFLSMGQSLQKADCNIFSVSSSYNKGQENFMPLLSCEQVPEYDFMTES-NYHNEN 313

Query: 361 DDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHKGETNTISFGGFDDENGTGNP 420
            + LS G +      ++  +    ++A   +  +     + E  T+SFG    E   G+ 
Sbjct: 314 ANALSAGQSSFTEGGEMTFMVSSQERAGQSNDQIRREDDRSE--TLSFGDCQKETAMGS- 373

Query: 421 SGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNIN----NVIKVDAKIDT--SSKI 480
                    + ++N     + + +  +D +H   E N++    N      ++DT    KI
Sbjct: 374 --------SVRVSNNYENFSHDPAITKDPLHIEAEENMSFECRNPPYASPRVDTLLVPKI 433

Query: 481 REPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPVSNIYTGKKNLKGTI 540
           ++ +T KK S N+FPSNVKSLLSTG+ DGV VKY SWSRE             +NLKG I
Sbjct: 434 KDTKTAKKGSTNTFPSNVKSLLSTGIFDGVTVKYYSWSRE-------------RNLKGMI 493

Query: 541 KGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEM 600
           KGTGYLC C NC  +K LNAYEFE+HA CKTKHPNNHIYFENGKTIY VVQELKNTPQE 
Sbjct: 494 KGTGYLCGCGNCKLNKVLNAYEFEQHANCKTKHPNNHIYFENGKTIYGVVQELKNTPQEK 536

Query: 601 LFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITMPS 638
           LFDAI NVTGS IN KNF  WKASY  A LELQRIYGKD++T+ S
Sbjct: 554 LFDAIQNVTGSDINHKNFNTWKASYHVARLELQRIYGKDDVTLAS 536

BLAST of Cp4.1LG01g10540 vs. TAIR 10
Match: AT5G59830.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G13660.2); Has 174 Blast hits to 139 proteins in 16 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 172; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink). )

HSP 1 Score: 270.8 bits (691), Expect = 2.8e-72
Identity = 201/594 (33.84%), Positives = 277/594 (46.63%), Query Frame = 0

Query: 42  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR 101
           S++ K FW+ ++    ++ +  YD S+R ++KR H WF+D S  E+F +KKQA++     
Sbjct: 2   SYESKGFWVMKNNEHTSEEDSVYDHSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQD---- 61

Query: 102 PVPGVPHMNVS--PWENTSSFQSVPGHFTDRLFGSE-PVRTVNLVDRGITVGNANMDMGR 161
           PV G+   NV    WE++S FQSV   F DRL G+E P R +   DR  T G ++    +
Sbjct: 62  PVVGLGKSNVGLPLWESSSVFQSVSNQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQNK 121

Query: 162 KEYENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGD 221
              E+ +  + SV LS+S  +E +  C    G RK+ V++V++                 
Sbjct: 122 SIAES-YMEDTSVELSISNGVEVAGGCFGGDGNRKLPVSRVKE----------------- 181

Query: 222 NGTISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFI 281
               +M T       H    GH+    + ++I                  A S+ +E+  
Sbjct: 182 ----TMST-------HVALEGHSQRKIESSSI-----------------QACSRENES-- 241

Query: 282 TIGPNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGP 341
                                    S+I                         NF   G 
Sbjct: 242 -------------------------SYI-------------------------NFALAGH 301

Query: 342 TYSKAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHK 401
            Y                GN+D  S G TF ++N D   VG     + S  V    +Y +
Sbjct: 302 PY----------------GNED--SQGITFGEIN-DEHGVG-----STSNVVGNYQSYVQ 361

Query: 402 GETNTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINN 461
               T+    +D E G+   S G++S   +   +  S   ++                  
Sbjct: 362 DPIGTLDI-VYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKA----------------- 421

Query: 462 VIKVDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPV 521
                          E +++KK +  SFPSNV+SL+STGMLDGVPVKYVS SRE      
Sbjct: 422 ---------------EAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSRE------ 422

Query: 522 SNIYTGKKNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTI 581
                    L+G IKG+GYLC C  C+ +K LNAY FERHAGCKTKHPNNHIYFENGKTI
Sbjct: 482 --------ELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTI 422

Query: 582 YAVVQELKNTPQEMLFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDE 633
           Y +VQEL+NTP+ +LFD I  V GSPINQK FR+WK S+QAAT ELQRIYGK+E
Sbjct: 542 YQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATRELQRIYGKEE 422

BLAST of Cp4.1LG01g10540 vs. TAIR 10
Match: AT5G59830.2 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G13660.2); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 270.8 bits (691), Expect = 2.8e-72
Identity = 201/594 (33.84%), Positives = 277/594 (46.63%), Query Frame = 0

Query: 42  SFQHKSFWIPRDAGCLTDGEMNYDSSSRIEAKRGHQWFMDGSAPELFSSKKQAIETVNTR 101
           S++ K FW+ ++    ++ +  YD S+R ++KR H WF+D S  E+F +KKQA++     
Sbjct: 2   SYESKGFWVMKNNEHTSEEDSVYDHSTRDDSKRPHPWFVDSSRSEMFPNKKQAVQD---- 61

Query: 102 PVPGVPHMNVS--PWENTSSFQSVPGHFTDRLFGSE-PVRTVNLVDRGITVGNANMDMGR 161
           PV G+   NV    WE++S FQSV   F DRL G+E P R +   DR  T G ++    +
Sbjct: 62  PVVGLGKSNVGLPLWESSSVFQSVSNQFMDRLLGAEMPPRPLLFGDRDRTEGCSHHHQNK 121

Query: 162 KEYENHFANNPSVGLSMSQSIEDSSSCLNFGGIRKVKVNQVRDPDIGMPASLGHAYSRGD 221
              E+ +  + SV LS+S  +E +  C    G RK+ V++V++                 
Sbjct: 122 SIAES-YMEDTSVELSISNGVEVAGGCFGGDGNRKLPVSRVKE----------------- 181

Query: 222 NGTISMGTTFNKNHEHAISLGHTYNSRDENAISVGPAYHKTDDSFISMGHAFSKGDENFI 281
               +M T       H    GH+    + ++I                  A S+ +E+  
Sbjct: 182 ----TMST-------HVALEGHSQRKIESSSI-----------------QACSRENES-- 241

Query: 282 TIGPNYSKGDNSILSMSQPFDKGDDSFISMGQSYEKAEGNIISFGASYNKGHENFISMGP 341
                                    S+I                         NF   G 
Sbjct: 242 -------------------------SYI-------------------------NFALAGH 301

Query: 342 TYSKAGDTFISMAPSYNKGNDDNLSMGPTFDKVNSDVVHVGPKFDKADSGSVSMTHNYHK 401
            Y                GN+D  S G TF ++N D   VG     + S  V    +Y +
Sbjct: 302 PY----------------GNED--SQGITFGEIN-DEHGVG-----STSNVVGNYQSYVQ 361

Query: 402 GETNTISFGGFDDENGTGNPSGGIISSYDLLMANQASAQASEVSTMRDSVHPNVEVNINN 461
               T+    +D E G+   S G++S   +   +  S   ++                  
Sbjct: 362 DPIGTLDI-VYDQETGSSQTSSGVVSEQQVAKPSLGSLPKTKA----------------- 421

Query: 462 VIKVDAKIDTSSKIREPRTTKKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPV 521
                          E +++KK +  SFPSNV+SL+STGMLDGVPVKYVS SRE      
Sbjct: 422 ---------------EAKSSKKEASTSFPSNVRSLISTGMLDGVPVKYVSVSRE------ 422

Query: 522 SNIYTGKKNLKGTIKGTGYLCSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTI 581
                    L+G IKG+GYLC C  C+ +K LNAY FERHAGCKTKHPNNHIYFENGKTI
Sbjct: 482 --------ELRGVIKGSGYLCGCQTCDFTKVLNAYAFERHAGCKTKHPNNHIYFENGKTI 422

Query: 582 YAVVQELKNTPQEMLFDAIHNVTGSPINQKNFRVWKASYQAATLELQRIYGKDE 633
           Y +VQEL+NTP+ +LFD I  V GSPINQK FR+WK S+QAAT ELQRIYGK+E
Sbjct: 542 YQIVQELRNTPESILFDVIQTVFGSPINQKAFRIWKESFQAATRELQRIYGKEE 422

BLAST of Cp4.1LG01g10540 vs. TAIR 10
Match: AT2G37520.1 (Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain )

HSP 1 Score: 151.8 bits (382), Expect = 1.9e-36
Identity = 76/157 (48.41%), Positives = 98/157 (62.42%), Query Frame = 0

Query: 479 KKVSPNSFPSNVKSLLSTGMLDGVPVKYVSWSRESFLPPVSNIYTGKKNLKGTIKGTGYL 538
           KK+   S+PSNVK LL TG+L+G  VKY+S       PPV       + L G I   GYL
Sbjct: 165 KKIVSLSYPSNVKKLLETGILEGARVKYIS------TPPV-------RQLLGIIHSGGYL 224

Query: 539 CSCDNCNHSKALNAYEFERHAGCKTKHPNNHIYFENGKTIYAVVQELKNTPQEMLFDAIH 598
           C C  CN SK L+AYEFE+HAG KT+HPNNHI+ EN + +Y +VQELK  P+ +L + I 
Sbjct: 225 CGCTTCNFSKVLSAYEFEQHAGAKTRHPNNHIFLENRRAVYNIVQELKTAPRVVLEEVIR 284

Query: 599 NVTGSPINQKNFRVWKASYQAATLELQRIYGKDEITM 636
           NV GS +N++  R WKAS+Q +     R Y  D  T+
Sbjct: 285 NVAGSALNEEGLRAWKASFQQSNSMSDRNYITDHSTV 308

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_023545327.1  0.0       97.82     uncharacterized protein LOC111804766 [Cucurbita pepo subsp. pepo] >XP_023545335....
KAG7031622.1    0.0       96.32     hypothetical protein SDJN02_05663, partial [Cucurbita argyrosperma subsp. argyro...
KAG6601008.1    0.0       96.64     hypothetical protein SDJN03_06241, partial [Cucurbita argyrosperma subsp. sorori...
XP_022956892.1  0.0       96.48     uncharacterized protein LOC111458442 [Cucurbita moschata] >XP_022956893.1 unchar...
XP_022994977.1  0.0       95.47     uncharacterized protein LOC111490611 [Cucurbita maxima] >XP_022995052.1 uncharac...

Match Name      E-value   Identity  Description
A0A6J1GYI5      0.0       96.48     uncharacterized protein LOC111458442 OS=Cucurbita moschata OX=3662 GN=LOC1114584...
A0A6J1JXF6      0.0       95.47     uncharacterized protein LOC111490611 OS=Cucurbita maxima OX=3661 GN=LOC111490611...
A0A5A7U4X7      0.0       89.60     TDBD domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_s...
A0A1S3CFN8      0.0       89.43     uncharacterized protein LOC103500200 OS=Cucumis melo OX=3656 GN=LOC103500200 PE=...
A0A6J1CEN2      0.0       88.93     uncharacterized protein LOC111010042 OS=Momordica charantia OX=3673 GN=LOC111010...

Match Name      E-value   Identity  Description
AT5G13660.2     9.3e-100  39.66     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G13660.1     1.6e-99   39.66     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G59830.1     2.8e-72   33.84     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G59830.2     2.8e-72   33.84     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G37520.1     1.9e-36   48.41     Acyl-CoA N-acyltransferase with RING/FYVE/PHD-type zinc finger domain

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description         Source   Source Term    Source Description          Alignment
IPR032308  Jas TPL-binding domain  PFAM     PF16135        TDBD                        coord: 530..585; e-value: 1.5E-14; score: 53.8
None       No IPR available        PANTHER  PTHR47025      AUTOIMMUNE REGULATOR        coord: 44..637
None       No IPR available        PANTHER  PTHR47025:SF6  N-LYSINE METHYLTRANSFERASE  coord: 44..637

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g10540.1  Cp4.1LG01g10540.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0045944 positive regulation of transcription by RNA polymerase II
cellular_component GO:0005634 nucleus
molecular_function GO:0003682 chromatin binding
molecular_function GO:0042393 histone binding
molecular_function GO:0000977 RNA polymerase II transcription regulatory region sequence-specific DNA binding
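
Each GO assignment line has three fields: the ontology category, the accession, and the free-text term name. A minimal sketch of splitting the lines and grouping them by category follows. The function name `group_go_terms` and the regular expression are assumptions made for illustration and do not come from the database's own tooling.

```python
import re
from collections import defaultdict

# Hypothetical pattern: category, GO accession (7 digits), then the term name.
GO_LINE = re.compile(
    r"^(biological_process|cellular_component|molecular_function) "
    r"(GO:\d{7}) (.+)$"
)


def group_go_terms(lines):
    """Group (accession, term name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in lines:
        match = GO_LINE.match(line.strip())
        if match:  # header and blank lines do not match and are skipped
            category, accession, name = match.groups()
            grouped[category].append((accession, name))
    return dict(grouped)


# The five assignments listed for this gene.
go_lines = [
    "Category Term Accession Term Name",
    "biological_process GO:0045944 positive regulation of transcription by RNA polymerase II",
    "cellular_component GO:0005634 nucleus",
    "molecular_function GO:0003682 chromatin binding",
    "molecular_function GO:0042393 histone binding",
    "molecular_function GO:0000977 RNA polymerase II transcription regulatory region sequence-specific DNA binding",
]
terms = group_go_terms(go_lines)
for category, entries in terms.items():
    print(category, [accession for accession, _ in entries])
```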