Cp4.1LG12g01700 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG12g01700
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Unknown protein
Location: Cp4.1LG12: 1114638 .. 1118430 (+)
RNA-Seq Expression: Cp4.1LG12g01700
Synteny: Cp4.1LG12g01700
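
The Location field in the overview above can be turned into a sequence ID, coordinates, and strand. The sketch below is a minimal illustration, not part of the database record. It assumes the coordinates are 1-based and inclusive (the GFF convention), which the page itself does not state. The function names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(text):
    """Split 'seqid: start .. end (strand)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG12: 1114638 .. 1118430 (+)")
print(seqid, strand, span_length(start, end))  # Cp4.1LG12 + 3793
```

Under that assumption the gene spans 3793 bp on the plus strand of Cp4.1LG12.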
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CATAAAAGAAGAACCCTCCGGCGTGACAGTAGGCTGAGGGTCCGAAATGTGACCAATAAAGATTAAAGAAGCATTTATTTGAAAGAAAGGTGACCGTGGGTAACGGTATCGTAGCCTTTTTTCCAATTCAAGAAACCTCTCTTCCATCTCCGGAGGGCTAGGAGTGTACTCTGGTGAAGAAAAATGGCGTTCACAGATCAAATCACTACCTGAAATTTACAATTCCATCACTTCAGTTCCGGATTAATGGCGTTCCGATTTGGAGTTTGTCCACGCGCCACCGCTATCTCCAACTTTCCCCAACTCAATGACAAGAACTTTCTCGTTCTCCCTCTGGAACGAGAGGTTGAGATTCGGCGGGCGGGAGGCACGAGGAGATTGAGAATTAGGGTTTCGGATGGAGAAGAATCGTATCTCGGGATGTGGAAGAACGCGGTGGAGCGCCAGAGGAAAGCTATTGAGTTTCGGAAAGTTGTGGAGAATACTGTGGGAAATGACGATCGCAATGACGGTGATCGAAGTGATGATCAGTTGGAGGAGAAGAGCGGGGAGTTCAGTAAGATTCTTCAGGTCCCGACGGAGGAAAGGGATAAGATTCAGCGGATGCAGGTCATACATCGAGCCACTGCCGCCATTGCAGCTGCTCGTGCACTTGTTGGTGAGACTAGAATTGTAGCTGATTCGAATACTTCTTTGAATTTGAATAGTAGGAATGACGGTGGACTGCTCGATCGTGAAGAAGGTATTATTAACATTATTGAATCTTGAAGAACTGATCTATTTGTTGATAATATGAGTTGAAGTCAAGTTACACATTTGTATAAATTCTAGGCTATTTTCTCTTCTGTATTTGACCTTCTTTCAAGTTGAACTTTGAATATTACAAGTAGTTGGTACATTTCCGTTTGGCTTGTTAGCGAATGAATAAACTTTCATGTTTGACCAGGCATATCTTCTCTGAGTTTCAAGCTCAGAAAATTGATTGGTTAACATAGATCTGCATTCAACATCCTAATACAGAAAATAATGGCAGTGTTTCTAGTGTTACAATCGCAATTATCTTGGTAGTGAACTTTTGTGACTATGCGGCACTCACTCCCAATACCGAGTGAGTCAAGCCAATTTGGATTCTTGTCGCCTACGGTCAGGCAATGTATGAGTCTCTAGCCCCGTTTTGAGAATAAGGTTTTAGAAACGGGGGCAGAGTGATTTTTGAAAGAGAGTTTGAAAGCAAAGTTGAAAGAAGGTTTGAAAGTCGTTATATGACTATATGCAAAGAAAGAAACACCACAACTTAAATATCATATAACTTGGGTGGGAAGAATTGACAATTAGTCACAAGCTCCTATAGTACATTTTGAGGTGATTCAGGTCTGCTAGGCCTAGACGTGTTTTCGCCAAACGATCATACAAAGTCTAGATGGGAGGAGTTGACGATTGGTCGCATCCCCTATAGTATATTTTGGGCGATTTAGGTCTAGCAAGCCTAGACATAACTTTACCAAATAGTCATACAAACAGAAAATACAATATTAAGACAATACAACATGAAATCCAATAAAGGGTTTGGCGTGACGTGAGAAACACGCCCTGGACGAACATGACCCTGACAACTAGCACGCCTAGAAAAGAATAGAGAAATAGTGTTATTGTGGTTCAGTAATTTGAAGGAGACAATTTAATGGTTTATGTATTTAGACTTGTTAACTTCTTATGTGGTTGTTTAGGCTCTCTTACTAGATTGTTTAGGCTCTCTTACTAATTTTTCTATACTGTTTTTGATTCATCTAGCATCATCTGAATTTCAAAGTGAGAACGCCCTGCTACCCGAGTCTGAAACTTCACCAACCGCGACCCCTGGTCCAGATTTCTGGTCTTGGACACCCCCTCCAGATAGTGATGGGAATGGTAATGCTTTCAGCGAGTTGCGACCAATTGGAAAATCCCAGGCATATCCAATGTTATCCAATTTTGTTAAGGAGAAAGAGCCTCCGGTGGGT
TTTCTCTCAATTCCTTTTCAGAGTGAACTTCCTGAAAGCGTCAAACCTCTTCTACCACCTTTTCAATCATTGATGGACATTGAAAAGTTAGAATCATTGGAAACCAGTACAGAGACACATTCCTTGGAAGAAGATGAAAATGTTGGGATGGAGTTTTCGGTACTTGCAGCAGAAGCATCTCAGGCGCTTAGTAGCATAGATAAAGAATCAACAAAAGGAATAGATTCAGATGGGTCACGATGGTGGAAGGAGATGGGAGTTGAGAAGAGACCGGATGGAGTGATTTGCAAGTGGACACTAACGAGGGGAGTTAGCGCAGACCTCGCCACTGAGTGGCAGAACAAGTATTGGGAAGCTGCTGATGAGTTTGGTTATAAGGAACTTGGTTCAGAGAAATCTGGCCGCGATGCCTATGGAAGTGTTTGGCGTGAATATTGGAGAGAATCTATGCGGCAGGTCAGTTTAAATTGTTAATGATTTGCTTGCTTCTGTTGCATTATCCTTGTAAAATATTTGCTTTATGCAATCAGTTAAAGTCTCAATAGTCTTTGTTTTATGCACTCACTTAAAGTGCTTAATGGTTAGTTCGAATCCATATGCTAGTTTGCCAAATTTGAAAACGAAGTTAAATCATTTCATTGGATATTGCGAGATGGTGGAAATGCCTTTATGAGTGTGTAATGTACAATCTGTCTGTTGATATCTAACAAATTCTTTTGTGTCTCTCTCTCTGATACTTGAGAAGAGAAGGATTCGCTGATGTTTCGATGTCGGGAAATGATCTCTATAGTGGATCCTGGAAATTGCTTCATGTGGTTCCTAATTTCAATCATGGCAACAAGCAGTCCTTATTATGACCATAACTGAACTTACTTTCTTTCAGGAGCAAGGCCTTGTTCACCTTGAAAAAACTGCAGACAAATGGGGGAAAAATGGAAGTGGCACCGAGTGGCAAGAGAAATGGTGGGAATATTATAATACCTCTGGTCAAGTTGAAAAGAATGCTCATAAATGGTGTAAAATTGACCCGAACACGTATGTCGATCCTGGTCATGCTCATATCTGGCATGAAAGGTATCAAAAATTTTCAAATGTTGGAGATGTTAATCACCTGTTGTTCTTGTTGTCCTCTTGACTTGTCTTTTGCGATGCATTCTCAATCCTCTGAATTCCTTTTTCGAGATTTCGTACGTTTAATCTGGTGACATGCTTCTTAAGAAATTGTCTATTTGAGTTTGCTTTAGCATGTAACTTTCTTTGCAGATGGGGTGAAAAGTATGATGGACAAGGTGGCAGCATCAAGTACACAGATAAATGGGCTGAAGGATGTGAAGGTGATGGTTGGACGAAGTGGGGCGACAAATGGGACGAAAACTTCGATTCAAATGGTCATGGCGTCAAACAGGGGGAAACATGGTGGGAAGGTAAACATGGAGAACGGTGGAACCGTACGTGGGGCGAGGGCCACAGTGGTTCAGGCTGGGTTCACAAGTATGGCAAGAGCAGCAGTGGGGAGCATTGGGACACACATGTTCAGCAGGAAACCTGGTATGAGAGATTCCCACACTTCGGCTTTTATCACTGCTTCAACAATTCAGTCCAGCTCCGGGAAGTTCAGAAGCCATCTGAGACGTCTTTGTAAGTCTTCTTTTCCTACTGCTAAGTGAATATAAAAACTTGCAGTATTATAGCTCATTTTGAAGAAAATTGAATGCTTTATCGGAGTGGATACCGTTTTTCTGGCATTGTACATAAAATAAAAGCTTTTAAGTTCATAAATTAGCTTACTTCTT

mRNA sequence

CATAAAAGAAGAACCCTCCGGCGTGACAGTAGGCTGAGGGTCCGAAATGTGACCAATAAAGATTAAAGAAGCATTTATTTGAAAGAAAGGTGACCGTGGGTAACGGTATCGTAGCCTTTTTTCCAATTCAAGAAACCTCTCTTCCATCTCCGGAGGGCTAGGAGTGTACTCTGGTGAAGAAAAATGGCGTTCACAGATCAAATCACTACCTGAAATTTACAATTCCATCACTTCAGTTCCGGATTAATGGCGTTCCGATTTGGAGTTTGTCCACGCGCCACCGCTATCTCCAACTTTCCCCAACTCAATGACAAGAACTTTCTCGTTCTCCCTCTGGAACGAGAGGTTGAGATTCGGCGGGCGGGAGGCACGAGGAGATTGAGAATTAGGGTTTCGGATGGAGAAGAATCGTATCTCGGGATGTGGAAGAACGCGGTGGAGCGCCAGAGGAAAGCTATTGAGTTTCGGAAAGTTGTGGAGAATACTGTGGGAAATGACGATCGCAATGACGGTGATCGAAGTGATGATCAGTTGGAGGAGAAGAGCGGGGAGTTCAGTAAGATTCTTCAGGTCCCGACGGAGGAAAGGGATAAGATTCAGCGGATGCAGGTCATACATCGAGCCACTGCCGCCATTGCAGCTGCTCGTGCACTTGTTGGTGAGACTAGAATTGTAGCTGATTCGAATACTTCTTTGAATTTGAATAGTAGGAATGACGGTGGACTGCTCGATCGTGAAGAAGCATCATCTGAATTTCAAAGTGAGAACGCCCTGCTACCCGAGTCTGAAACTTCACCAACCGCGACCCCTGGTCCAGATTTCTGGTCTTGGACACCCCCTCCAGATAGTGATGGGAATGGTAATGCTTTCAGCGAGTTGCGACCAATTGGAAAATCCCAGGCATATCCAATGTTATCCAATTTTGTTAAGGAGAAAGAGCCTCCGGTGGGTTTTCTCTCAATTCCTTTTCAGAGTGAACTTCCTGAAAGCGTCAAACCTCTTCTACCACCTTTTCAATCATTGATGGACATTGAAAAGTTAGAATCATTGGAAACCAGTACAGAGACACATTCCTTGGAAGAAGATGAAAATGTTGGGATGGAGTTTTCGGTACTTGCAGCAGAAGCATCTCAGGCGCTTAGTAGCATAGATAAAGAATCAACAAAAGGAATAGATTCAGATGGGTCACGATGGTGGAAGGAGATGGGAGTTGAGAAGAGACCGGATGGAGTGATTTGCAAGTGGACACTAACGAGGGGAGTTAGCGCAGACCTCGCCACTGAGTGGCAGAACAAGTATTGGGAAGCTGCTGATGAGTTTGGTTATAAGGAACTTGGTTCAGAGAAATCTGGCCGCGATGCCTATGGAAGTGTTTGGCGTGAATATTGGAGAGAATCTATGCGGCAGGAGCAAGGCCTTGTTCACCTTGAAAAAACTGCAGACAAATGGGGGAAAAATGGAAGTGGCACCGAGTGGCAAGAGAAATGGTGGGAATATTATAATACCTCTGGTCAAGTTGAAAAGAATGCTCATAAATGGTGTAAAATTGACCCGAACACGTATGTCGATCCTGGTCATGCTCATATCTGGCATGAAAGATGGGGTGAAAAGTATGATGGACAAGGTGGCAGCATCAAGTACACAGATAAATGGGCTGAAGGATGTGAAGGTGATGGTTGGACGAAGTGGGGCGACAAATGGGACGAAAACTTCGATTCAAATGGTCATGGCGTCAAACAGGGGGAAACATGGTGGGAAGGTAAACATGGAGAACGGTGGAACCGTACGTGGGGCGAGGGCCACAGTGGTTCAGGCTGGGTTCACAAGTATGGCAAGAGCAGCAGTGGGGAGCATTGGGACACACATGTTCAGCAGGAAACCTGGTATGAGAGATTCCCACACTTCGGCTTTTATCACTGCTTCAACAATTCAGTCCAGCTCCGGGAAGTTCAGAAGCCATCTGAGACGTCTTTGTAAGTCTTCTTTTCCTACTGCTAAGT
GAATATAAAAACTTGCAGTATTATAGCTCATTTTGAAGAAAATTGAATGCTTTATCGGAGTGGATACCGTTTTTCTGGCATTGTACATAAAATAAAAGCTTTTAAGTTCATAAATTAGCTTACTTCTT

Coding sequence (CDS)

ATGGCGTTCCGATTTGGAGTTTGTCCACGCGCCACCGCTATCTCCAACTTTCCCCAACTCAATGACAAGAACTTTCTCGTTCTCCCTCTGGAACGAGAGGTTGAGATTCGGCGGGCGGGAGGCACGAGGAGATTGAGAATTAGGGTTTCGGATGGAGAAGAATCGTATCTCGGGATGTGGAAGAACGCGGTGGAGCGCCAGAGGAAAGCTATTGAGTTTCGGAAAGTTGTGGAGAATACTGTGGGAAATGACGATCGCAATGACGGTGATCGAAGTGATGATCAGTTGGAGGAGAAGAGCGGGGAGTTCAGTAAGATTCTTCAGGTCCCGACGGAGGAAAGGGATAAGATTCAGCGGATGCAGGTCATACATCGAGCCACTGCCGCCATTGCAGCTGCTCGTGCACTTGTTGGTGAGACTAGAATTGTAGCTGATTCGAATACTTCTTTGAATTTGAATAGTAGGAATGACGGTGGACTGCTCGATCGTGAAGAAGCATCATCTGAATTTCAAAGTGAGAACGCCCTGCTACCCGAGTCTGAAACTTCACCAACCGCGACCCCTGGTCCAGATTTCTGGTCTTGGACACCCCCTCCAGATAGTGATGGGAATGGTAATGCTTTCAGCGAGTTGCGACCAATTGGAAAATCCCAGGCATATCCAATGTTATCCAATTTTGTTAAGGAGAAAGAGCCTCCGGTGGGTTTTCTCTCAATTCCTTTTCAGAGTGAACTTCCTGAAAGCGTCAAACCTCTTCTACCACCTTTTCAATCATTGATGGACATTGAAAAGTTAGAATCATTGGAAACCAGTACAGAGACACATTCCTTGGAAGAAGATGAAAATGTTGGGATGGAGTTTTCGGTACTTGCAGCAGAAGCATCTCAGGCGCTTAGTAGCATAGATAAAGAATCAACAAAAGGAATAGATTCAGATGGGTCACGATGGTGGAAGGAGATGGGAGTTGAGAAGAGACCGGATGGAGTGATTTGCAAGTGGACACTAACGAGGGGAGTTAGCGCAGACCTCGCCACTGAGTGGCAGAACAAGTATTGGGAAGCTGCTGATGAGTTTGGTTATAAGGAACTTGGTTCAGAGAAATCTGGCCGCGATGCCTATGGAAGTGTTTGGCGTGAATATTGGAGAGAATCTATGCGGCAGGAGCAAGGCCTTGTTCACCTTGAAAAAACTGCAGACAAATGGGGGAAAAATGGAAGTGGCACCGAGTGGCAAGAGAAATGGTGGGAATATTATAATACCTCTGGTCAAGTTGAAAAGAATGCTCATAAATGGTGTAAAATTGACCCGAACACGTATGTCGATCCTGGTCATGCTCATATCTGGCATGAAAGATGGGGTGAAAAGTATGATGGACAAGGTGGCAGCATCAAGTACACAGATAAATGGGCTGAAGGATGTGAAGGTGATGGTTGGACGAAGTGGGGCGACAAATGGGACGAAAACTTCGATTCAAATGGTCATGGCGTCAAACAGGGGGAAACATGGTGGGAAGGTAAACATGGAGAACGGTGGAACCGTACGTGGGGCGAGGGCCACAGTGGTTCAGGCTGGGTTCACAAGTATGGCAAGAGCAGCAGTGGGGAGCATTGGGACACACATGTTCAGCAGGAAACCTGGTATGAGAGATTCCCACACTTCGGCTTTTATCACTGCTTCAACAATTCAGTCCAGCTCCGGGAAGTTCAGAAGCCATCTGAGACGTCTTTGTAA

Protein sequence

MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRAGGTRRLRIRVSDGEESYLGMWKNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKSGEFSKILQVPTEERDKIQRMQVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENALLPESETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQAYPMLSNFVKEKEPPVGFLSIPFQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQALSSIDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGYKELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNTSGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTKWGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTHVQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL
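
The relationship between the coding sequence and the protein sequence above can be checked by translation. The sketch below is an illustration only: it assumes the standard nuclear genetic code, and the helper name `translate` is hypothetical. It is applied to the first nine codons and the last five codons of the CDS listed above rather than to the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCGTTCCGATTTGGAGTTTGTCCA"))  # MAFRFGVCP
print(translate("GAGACGTCTTTGTAA"))              # ETSL
```

Both fragments agree with the start (MAFRFGVCP) and end (ETSL) of the protein sequence. A complete CDS for a 576-residue protein would be expected to contain 576 × 3 + 3 = 1731 nucleotides, counting the stop codon.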
Homology
BLAST of Cp4.1LG12g01700 vs. NCBI nr
Match: XP_023548665.1 (uncharacterized protein LOC111807253 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1194 bits (3090), Expect = 0.0
Identity = 576/576 (100.00%), Positives = 576/576 (100.00%), Query Frame = 0

Query: 1   MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRAGGTRRLRIRVSDGEESYLGMW 60
           MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRAGGTRRLRIRVSDGEESYLGMW
Sbjct: 1   MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRAGGTRRLRIRVSDGEESYLGMW 60

Query: 61  KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKSGEFSKILQVPTEERDKIQRM 120
           KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKSGEFSKILQVPTEERDKIQRM
Sbjct: 61  KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKSGEFSKILQVPTEERDKIQRM 120

Query: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENALLPES 180
           QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENALLPES
Sbjct: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENALLPES 180

Query: 181 ETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQAYPMLSNFVKEKEPPVGFLSIP 240
           ETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQAYPMLSNFVKEKEPPVGFLSIP
Sbjct: 181 ETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQAYPMLSNFVKEKEPPVGFLSIP 240

Query: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQALSS 300
           FQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQALSS
Sbjct: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQALSS 300

Query: 301 IDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGY 360
           IDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGY
Sbjct: 301 IDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGY 360

Query: 361 KELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420
           KELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT
Sbjct: 361 KELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420

Query: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480
           SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK
Sbjct: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480

Query: 481 WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540
           WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH
Sbjct: 481 WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540

Query: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576
           VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL
Sbjct: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576
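
The Identity and Positives figures in each HSP header are counts over the aligned length, with the percentage in parentheses. In BLAST output, positives are aligned positions that score above zero in the substitution matrix, so they include the identical positions. A minimal sketch of extracting and recomputing these values follows. The function name `parse_hsp_stats` is hypothetical, and the pattern accepts both "Positives" and the "Postives" spelling that some exports of this page use.

```python
import re

# Matches e.g. "Identity = 564/576 (97.92%), Positives = 570/576 (98.96%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, positive, positive_length = map(int, m.groups())
    if length != positive_length:
        raise ValueError("identity and positives use different alignment lengths")
    return {
        "identical": identical,
        "positive": positive,
        "length": length,
        "pct_identity": 100.0 * identical / length,
        "pct_positive": 100.0 * positive / length,
    }


stats = parse_hsp_stats(
    "Identity = 564/576 (97.92%), Positives = 570/576 (98.96%), Query Frame = 0"
)
print(round(stats["pct_identity"], 2), round(stats["pct_positive"], 2))  # 97.92 98.96
```

The recomputed values match the percentages printed for the Cucurbita maxima hit (564 of 576 identical, 570 of 576 positive).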

BLAST of Cp4.1LG12g01700 vs. NCBI nr
Match: XP_023006712.1 (uncharacterized protein LOC111499367 [Cucurbita maxima])

HSP 1 Score: 1172 bits (3032), Expect = 0.0
Identity = 564/576 (97.92%), Positives = 570/576 (98.96%), Query Frame = 0

Query: 1   MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRAGGTRRLRIRVSDGEESYLGMW 60
           MAFRFGVCPRATAISNFPQLNDKNFL+LP EREVEIRRA GTRRLRIRVSDGEESYLGMW
Sbjct: 1   MAFRFGVCPRATAISNFPQLNDKNFLILPHEREVEIRRARGTRRLRIRVSDGEESYLGMW 60

Query: 61  KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKSGEFSKILQVPTEERDKIQRM 120
           KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQL EKS EFSKILQVPT+ERDKIQRM
Sbjct: 61  KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLAEKSEEFSKILQVPTKERDKIQRM 120

Query: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENALLPES 180
           QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSEN LLP+S
Sbjct: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENTLLPKS 180

Query: 181 ETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQAYPMLSNFVKEKEPPVGFLSIP 240
           ETSPTATPGPDFWSWTPPPD+DGNGNAFSELRPI KSQAYPMLSNFVKEKEPPVGFLSIP
Sbjct: 181 ETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQAYPMLSNFVKEKEPPVGFLSIP 240

Query: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQALSS 300
           FQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQALSS
Sbjct: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQALSS 300

Query: 301 IDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGY 360
           IDKESTKGIDSDGSRWWKEMGVE+RPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGY
Sbjct: 301 IDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGY 360

Query: 361 KELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420
           KELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT
Sbjct: 361 KELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420

Query: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480
           SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK
Sbjct: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480

Query: 481 WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540
           WGDKWDENFDSNGHG+KQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH
Sbjct: 481 WGDKWDENFDSNGHGIKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540

Query: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576
           VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL
Sbjct: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576

BLAST of Cp4.1LG12g01700 vs. NCBI nr
Match: XP_022958887.1 (uncharacterized protein LOC111460038 [Cucurbita moschata])

HSP 1 Score: 1166 bits (3017), Expect = 0.0
Identity = 561/576 (97.40%), Positives = 568/576 (98.61%), Query Frame = 0

Query: 1   MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRAGGTRRLRIRVSDGEESYLGMW 60
           M FRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRA GTRRLRIRVSDGEESYLGMW
Sbjct: 1   MTFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRARGTRRLRIRVSDGEESYLGMW 60

Query: 61  KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKSGEFSKILQVPTEERDKIQRM 120
           KNAVERQRKA EFRKVVENTVGND+RNDGDRSDDQLEEKS EFSKILQVPTEERDKIQRM
Sbjct: 61  KNAVERQRKANEFRKVVENTVGNDNRNDGDRSDDQLEEKSEEFSKILQVPTEERDKIQRM 120

Query: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENALLPES 180
           QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSEN LLP+S
Sbjct: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENTLLPKS 180

Query: 181 ETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQAYPMLSNFVKEKEPPVGFLSIP 240
           ETSPTATPGPDFWSWTPPPD+DGNGNAFSELRPI KSQ YPMLSNFVKEKEPPVGFLSIP
Sbjct: 181 ETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQVYPMLSNFVKEKEPPVGFLSIP 240

Query: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQALSS 300
           FQSELPESVKPLLPPFQSLMDIEKLESLET+TETHSLE+DENVGMEFSVLAAEASQALSS
Sbjct: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETNTETHSLEDDENVGMEFSVLAAEASQALSS 300

Query: 301 IDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGY 360
           IDKESTKGIDSDGSRWWKEMGVE+RPDGVICKWTLTRGVSAD ATEWQNKYWEAADEFGY
Sbjct: 301 IDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADFATEWQNKYWEAADEFGY 360

Query: 361 KELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420
           KELGSEKSGRDAYGSVWREYWRESM+QEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT
Sbjct: 361 KELGSEKSGRDAYGSVWREYWRESMQQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420

Query: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480
           SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK
Sbjct: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480

Query: 481 WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540
           WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH
Sbjct: 481 WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540

Query: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576
           VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL
Sbjct: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576

BLAST of Cp4.1LG12g01700 vs. NCBI nr
Match: KAG7013519.1 (hypothetical protein SDJN02_23685 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1163 bits (3008), Expect = 0.0
Identity = 560/576 (97.22%), Positives = 568/576 (98.61%), Query Frame = 0

Query: 1   MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRAGGTRRLRIRVSDGEESYLGMW 60
           MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRA GTRRLRIRVSDGEESYLGMW
Sbjct: 1   MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRARGTRRLRIRVSDGEESYLGMW 60

Query: 61  KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKSGEFSKILQVPTEERDKIQRM 120
           KNAVER+RKA EFRKVVENTVGND+RNDGDRSDDQLEEKS EFSKILQVPTEERDKIQRM
Sbjct: 61  KNAVERRRKANEFRKVVENTVGNDNRNDGDRSDDQLEEKSEEFSKILQVPTEERDKIQRM 120

Query: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENALLPES 180
           QVIHRATAAIAAARALV ETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSEN LLP+S
Sbjct: 121 QVIHRATAAIAAARALVCETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENTLLPKS 180

Query: 181 ETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQAYPMLSNFVKEKEPPVGFLSIP 240
           ETSPTATPGPDFWSWTPPPD+DGNGNAFSELRPI KSQ YPMLSNFVKEKEPPVGFLSIP
Sbjct: 181 ETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQVYPMLSNFVKEKEPPVGFLSIP 240

Query: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQALSS 300
           FQSELPESVKPLLPPFQSLMDIEKLESLET+TETHSLE+DENVGMEFSVLAAEASQALSS
Sbjct: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETNTETHSLEDDENVGMEFSVLAAEASQALSS 300

Query: 301 IDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGY 360
           IDKESTKGIDSDGSRWWKEMGVE+RPDGVICKWTLTRGVSAD ATEWQNKYWEAADEFGY
Sbjct: 301 IDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADFATEWQNKYWEAADEFGY 360

Query: 361 KELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420
           KELGSEKSGRDAYGSVWREYWRESM+QEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT
Sbjct: 361 KELGSEKSGRDAYGSVWREYWRESMQQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420

Query: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480
           SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK
Sbjct: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480

Query: 481 WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540
           WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH
Sbjct: 481 WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540

Query: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576
           VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL
Sbjct: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576

BLAST of Cp4.1LG12g01700 vs. NCBI nr
Match: KAG6574953.1 (hypothetical protein SDJN03_25592, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1161 bits (3004), Expect = 0.0
Identity = 559/576 (97.05%), Positives = 568/576 (98.61%), Query Frame = 0

Query: 1   MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRAGGTRRLRIRVSDGEESYLGMW 60
           MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRA GTRRLRIRVSDGEESYLGMW
Sbjct: 1   MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRARGTRRLRIRVSDGEESYLGMW 60

Query: 61  KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKSGEFSKILQVPTEERDKIQRM 120
           KNAVER+RKA EFRKVVENTVGND+RNDGDRSDDQLEEKS EFSKILQVPTEERDKIQRM
Sbjct: 61  KNAVERRRKANEFRKVVENTVGNDNRNDGDRSDDQLEEKSEEFSKILQVPTEERDKIQRM 120

Query: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENALLPES 180
           QVIHRATAAIAAARALV ETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSEN LLP+S
Sbjct: 121 QVIHRATAAIAAARALVCETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENTLLPKS 180

Query: 181 ETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQAYPMLSNFVKEKEPPVGFLSIP 240
           ETSPTATPGPDFWSWTPPPD+DGNGNAFSELRPI KSQ YPMLSNFVKEKEPPVGFLSIP
Sbjct: 181 ETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQVYPMLSNFVKEKEPPVGFLSIP 240

Query: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQALSS 300
           FQSELPESVKPLLPPFQSLMDIEKLESLET+TETHSLE+DENVGMEFSVLAAEASQALSS
Sbjct: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETNTETHSLEDDENVGMEFSVLAAEASQALSS 300

Query: 301 IDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGY 360
           IDKESTKGIDSDGSRWWKEMGVE+RPDGVICKWTLTRGVSAD ATEWQNKYWEAADEFGY
Sbjct: 301 IDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADFATEWQNKYWEAADEFGY 360

Query: 361 KELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420
           KELGSEKSGRDAYGSVWREYWRESM+QEQGLVHLEKTADKWG+NGSGTEWQEKWWEYYNT
Sbjct: 361 KELGSEKSGRDAYGSVWREYWRESMQQEQGLVHLEKTADKWGENGSGTEWQEKWWEYYNT 420

Query: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480
           SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK
Sbjct: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480

Query: 481 WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540
           WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH
Sbjct: 481 WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540

Query: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576
           VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL
Sbjct: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576

BLAST of Cp4.1LG12g01700 vs. ExPASy TrEMBL
Match: A0A6J1KWK6 (uncharacterized protein LOC111499367 OS=Cucurbita maxima OX=3661 GN=LOC111499367 PE=4 SV=1)

HSP 1 Score: 1172 bits (3032), Expect = 0.0
Identity = 564/576 (97.92%), Positives = 570/576 (98.96%), Query Frame = 0

Query: 1   MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRAGGTRRLRIRVSDGEESYLGMW 60
           MAFRFGVCPRATAISNFPQLNDKNFL+LP EREVEIRRA GTRRLRIRVSDGEESYLGMW
Sbjct: 1   MAFRFGVCPRATAISNFPQLNDKNFLILPHEREVEIRRARGTRRLRIRVSDGEESYLGMW 60

Query: 61  KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKSGEFSKILQVPTEERDKIQRM 120
           KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQL EKS EFSKILQVPT+ERDKIQRM
Sbjct: 61  KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLAEKSEEFSKILQVPTKERDKIQRM 120

Query: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENALLPES 180
           QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSEN LLP+S
Sbjct: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENTLLPKS 180

Query: 181 ETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQAYPMLSNFVKEKEPPVGFLSIP 240
           ETSPTATPGPDFWSWTPPPD+DGNGNAFSELRPI KSQAYPMLSNFVKEKEPPVGFLSIP
Sbjct: 181 ETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQAYPMLSNFVKEKEPPVGFLSIP 240

Query: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQALSS 300
           FQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQALSS
Sbjct: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQALSS 300

Query: 301 IDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGY 360
           IDKESTKGIDSDGSRWWKEMGVE+RPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGY
Sbjct: 301 IDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGY 360

Query: 361 KELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420
           KELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT
Sbjct: 361 KELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420

Query: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480
           SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK
Sbjct: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480

Query: 481 WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540
           WGDKWDENFDSNGHG+KQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH
Sbjct: 481 WGDKWDENFDSNGHGIKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540

Query: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576
           VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL
Sbjct: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576

BLAST of Cp4.1LG12g01700 vs. ExPASy TrEMBL
Match: A0A6J1H328 (uncharacterized protein LOC111460038 OS=Cucurbita moschata OX=3662 GN=LOC111460038 PE=4 SV=1)

HSP 1 Score: 1166 bits (3017), Expect = 0.0
Identity = 561/576 (97.40%), Positives = 568/576 (98.61%), Query Frame = 0

Query: 1   MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRAGGTRRLRIRVSDGEESYLGMW 60
           M FRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRA GTRRLRIRVSDGEESYLGMW
Sbjct: 1   MTFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRARGTRRLRIRVSDGEESYLGMW 60

Query: 61  KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKSGEFSKILQVPTEERDKIQRM 120
           KNAVERQRKA EFRKVVENTVGND+RNDGDRSDDQLEEKS EFSKILQVPTEERDKIQRM
Sbjct: 61  KNAVERQRKANEFRKVVENTVGNDNRNDGDRSDDQLEEKSEEFSKILQVPTEERDKIQRM 120

Query: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENALLPES 180
           QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSEN LLP+S
Sbjct: 121 QVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGLLDREEASSEFQSENTLLPKS 180

Query: 181 ETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQAYPMLSNFVKEKEPPVGFLSIP 240
           ETSPTATPGPDFWSWTPPPD+DGNGNAFSELRPI KSQ YPMLSNFVKEKEPPVGFLSIP
Sbjct: 181 ETSPTATPGPDFWSWTPPPDNDGNGNAFSELRPIEKSQVYPMLSNFVKEKEPPVGFLSIP 240

Query: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQALSS 300
           FQSELPESVKPLLPPFQSLMDIEKLESLET+TETHSLE+DENVGMEFSVLAAEASQALSS
Sbjct: 241 FQSELPESVKPLLPPFQSLMDIEKLESLETNTETHSLEDDENVGMEFSVLAAEASQALSS 300

Query: 301 IDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGY 360
           IDKESTKGIDSDGSRWWKEMGVE+RPDGVICKWTLTRGVSAD ATEWQNKYWEAADEFGY
Sbjct: 301 IDKESTKGIDSDGSRWWKEMGVEQRPDGVICKWTLTRGVSADFATEWQNKYWEAADEFGY 360

Query: 361 KELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420
           KELGSEKSGRDAYGSVWREYWRESM+QEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT
Sbjct: 361 KELGSEKSGRDAYGSVWREYWRESMQQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYYNT 420

Query: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480
           SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK
Sbjct: 421 SGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTK 480

Query: 481 WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540
           WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH
Sbjct: 481 WGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTH 540

Query: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576
           VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL
Sbjct: 541 VQQETWYERFPHFGFYHCFNNSVQLREVQKPSETSL 576

BLAST of Cp4.1LG12g01700 vs. ExPASy TrEMBL
Match: A0A5A7UQT1 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold216G001440 PE=4 SV=1)

HSP 1 Score: 1037 bits (2681), Expect = 0.0
Identity = 499/577 (86.48%), Positives = 528/577 (91.51%), Query Frame = 0

Query: 1   MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRAGGTRRLRIRVSDGEESYLGMW 60
           M FR GV PR T  ++FP L   NFL+LPLERE++IR A GTR LRIR SDG ESYLGMW
Sbjct: 1   MPFRLGVSPRPTLSNHFPHLYHHNFLLLPLEREIDIRHATGTRSLRIRASDGGESYLGMW 60

Query: 61  KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKSGEFSKILQVPTEERDKIQRM 120
           KNAVERQRKAIEF+KVVENTVGNDDRN GD S DQLE+KS EFSKILQVP EERD+IQRM
Sbjct: 61  KNAVERQRKAIEFQKVVENTVGNDDRNAGDPSSDQLEKKSEEFSKILQVPPEERDRIQRM 120

Query: 121 QVIHRATAAIAAARALVGETRIVA--DSNTSLNLNSRNDGGLLDREEASSEFQSENALLP 180
           QVIHRA AAIAAARALVGET  +A  DS+ S+NLNS NDGGLLDREEA  EFQSEN+LLP
Sbjct: 121 QVIHRAAAAIAAARALVGETGTLAVGDSDASVNLNSTNDGGLLDREEALPEFQSENSLLP 180

Query: 181 ESETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQAYPMLSNFVKEKEPPVGFLS 240
           ESETS + TPGPDFWSWTPPPD+D NGNAF EL+PIGKSQAYP LSNFV+EKE P+  LS
Sbjct: 181 ESETSRSWTPGPDFWSWTPPPDNDENGNAFGELQPIGKSQAYPKLSNFVEEKERPIDSLS 240

Query: 241 IPFQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQAL 300
           IPFQSE+ ESV PLLPPFQSL+ +EKLES ETSTETHSLEEDENVG+EFSV AAEASQAL
Sbjct: 241 IPFQSEISESVNPLLPPFQSLVGMEKLESSETSTETHSLEEDENVGIEFSVHAAEASQAL 300

Query: 301 SSIDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRGVSADLATEWQNKYWEAADEF 360
           SS+DKESTKGID DGSRWWKE G+E+RPDGVIC+WTLTRGVSADLATEWQNKYWEAADEF
Sbjct: 301 SSVDKESTKGIDPDGSRWWKETGIEQRPDGVICRWTLTRGVSADLATEWQNKYWEAADEF 360

Query: 361 GYKELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYY 420
           GYKELGSEKSGRDAYG+VWREYWRESMRQEQGLVHLEKTADKWG NGSGTEWQEKWWEYY
Sbjct: 361 GYKELGSEKSGRDAYGNVWREYWRESMRQEQGLVHLEKTADKWGINGSGTEWQEKWWEYY 420

Query: 421 NTSGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGW 480
           NTSGQ EKNAHKWCKIDPNTYVDPGHAHIW+ERWGEKYDGQGGSIKYTDKWAE CEGDGW
Sbjct: 421 NTSGQAEKNAHKWCKIDPNTYVDPGHAHIWNERWGEKYDGQGGSIKYTDKWAERCEGDGW 480

Query: 481 TKWGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWD 540
           TKWGDKWDENFD NGHGVKQGETWWEGKHGERWNRTWGEGH+GSGWVHKYGKSSSGEHWD
Sbjct: 481 TKWGDKWDENFDPNGHGVKQGETWWEGKHGERWNRTWGEGHNGSGWVHKYGKSSSGEHWD 540

Query: 541 THVQQETWYERFPHFGFYHCFNNSVQLREVQKPSETS 575
           TH QQETWYERFPHFGFYHCFNNSVQLREVQKPSET+
Sbjct: 541 THAQQETWYERFPHFGFYHCFNNSVQLREVQKPSETA 577

BLAST of Cp4.1LG12g01700 vs. ExPASy TrEMBL
Match: A0A1S3BMD1 (uncharacterized protein LOC103491223 OS=Cucumis melo OX=3656 GN=LOC103491223 PE=4 SV=1)

HSP 1 Score: 1037 bits (2681), Expect = 0.0
Identity = 499/577 (86.48%), Positives = 528/577 (91.51%), Query Frame = 0

Query: 1   MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRAGGTRRLRIRVSDGEESYLGMW 60
           M FR GV PR T  ++FP L   NFL+LPLERE++IR A GTR LRIR SDG ESYLGMW
Sbjct: 1   MPFRLGVSPRPTLSNHFPHLYHHNFLLLPLEREIDIRHATGTRSLRIRASDGGESYLGMW 60

Query: 61  KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKSGEFSKILQVPTEERDKIQRM 120
           KNAVERQRKAIEF+KVVENTVGNDDRN GD S DQLE+KS EFSKILQVP EERD+IQRM
Sbjct: 61  KNAVERQRKAIEFQKVVENTVGNDDRNAGDPSSDQLEKKSEEFSKILQVPPEERDRIQRM 120

Query: 121 QVIHRATAAIAAARALVGETRIVA--DSNTSLNLNSRNDGGLLDREEASSEFQSENALLP 180
           QVIHRA AAIAAARALVGET  +A  DS+ S+NLNS NDGGLLDREEA  EFQSEN+LLP
Sbjct: 121 QVIHRAAAAIAAARALVGETGTLAVGDSDASVNLNSTNDGGLLDREEALPEFQSENSLLP 180

Query: 181 ESETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQAYPMLSNFVKEKEPPVGFLS 240
           ESETS + TPGPDFWSWTPPPD+D NGNAF EL+PIGKSQAYP LSNFV+EKE P+  LS
Sbjct: 181 ESETSRSWTPGPDFWSWTPPPDNDENGNAFGELQPIGKSQAYPKLSNFVEEKERPIDSLS 240

Query: 241 IPFQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQAL 300
           IPFQSE+ ESV PLLPPFQSL+ +EKLES ETSTETHSLEEDENVG+EFSV AAEASQAL
Sbjct: 241 IPFQSEISESVNPLLPPFQSLVGMEKLESSETSTETHSLEEDENVGIEFSVHAAEASQAL 300

Query: 301 SSIDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRGVSADLATEWQNKYWEAADEF 360
           SS+DKESTKGID DGSRWWKE G+E+RPDGVIC+WTLTRGVSADLATEWQNKYWEAADEF
Sbjct: 301 SSVDKESTKGIDPDGSRWWKETGIEQRPDGVICRWTLTRGVSADLATEWQNKYWEAADEF 360

Query: 361 GYKELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYY 420
           GYKELGSEKSGRDAYG+VWREYWRESMRQEQGLVHLEKTADKWG NGSGTEWQEKWWEYY
Sbjct: 361 GYKELGSEKSGRDAYGNVWREYWRESMRQEQGLVHLEKTADKWGINGSGTEWQEKWWEYY 420

Query: 421 NTSGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGW 480
           NTSGQ EKNAHKWCKIDPNTYVDPGHAHIW+ERWGEKYDGQGGSIKYTDKWAE CEGDGW
Sbjct: 421 NTSGQAEKNAHKWCKIDPNTYVDPGHAHIWNERWGEKYDGQGGSIKYTDKWAERCEGDGW 480

Query: 481 TKWGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWD 540
           TKWGDKWDENFD NGHGVKQGETWWEGKHGERWNRTWGEGH+GSGWVHKYGKSSSGEHWD
Sbjct: 481 TKWGDKWDENFDPNGHGVKQGETWWEGKHGERWNRTWGEGHNGSGWVHKYGKSSSGEHWD 540

Query: 541 THVQQETWYERFPHFGFYHCFNNSVQLREVQKPSETS 575
           TH QQETWYERFPHFGFYHCFNNSVQLREVQKPSET+
Sbjct: 541 THAQQETWYERFPHFGFYHCFNNSVQLREVQKPSETA 577

BLAST of Cp4.1LG12g01700 vs. ExPASy TrEMBL
Match: A0A0A0LNS6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G215500 PE=4 SV=1)

HSP 1 Score: 1011 bits (2613), Expect = 0.0
Identity = 486/575 (84.52%), Positives = 517/575 (89.91%), Query Frame = 0

Query: 1   MAFRFGVCPRATAISNFPQLNDKNFLVLPLEREVEIRRAGGTRRLRIRVSDGEESYLGMW 60
           M  R  + PR T   +FP+L   NFL+LPL+  ++IR A   R LRIR SD  ESYLGMW
Sbjct: 2   MPLRLPLSPRPTLHHHFPRLYHHNFLLLPLQPHIQIRHATPARTLRIRASDEGESYLGMW 61

Query: 61  KNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKSGEFSKILQVPTEERDKIQRM 120
           KNAVERQRKA+EF+KVVENT GNDDRN GD S DQLE+KS EFSKILQVP EERD+IQRM
Sbjct: 62  KNAVERQRKAVEFQKVVENTEGNDDRNAGDPSSDQLEKKSEEFSKILQVPPEERDRIQRM 121

Query: 121 QVIHRATAAIAAARALVGETRIVA--DSNTSLNLNSRNDGGLLDREEASSEFQSENALLP 180
           QVIHRA AAIAAARALVGET  +A  DS+T +NLNS ND GLLDREEA SEFQSENALLP
Sbjct: 122 QVIHRAAAAIAAARALVGETGTLAVGDSDTCVNLNSTNDEGLLDREEALSEFQSENALLP 181

Query: 181 ESETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQAYPMLSNFVKEKEPPVGFLS 240
           E ETS + TPGPDFWSWTPPPD DGN NAF EL+P+GKSQAYP LSNFV+EKE P+ FLS
Sbjct: 182 EFETSQSWTPGPDFWSWTPPPDDDGNDNAFGELQPLGKSQAYPKLSNFVEEKERPIDFLS 241

Query: 241 IPFQSELPESVKPLLPPFQSLMDIEKLESLETSTETHSLEEDENVGMEFSVLAAEASQAL 300
           IPFQSE+ ESV PLLPPFQSL+ +EKLES ETSTETHSLEEDENVG+EFSV AAEASQAL
Sbjct: 242 IPFQSEISESVNPLLPPFQSLVGMEKLESSETSTETHSLEEDENVGIEFSVHAAEASQAL 301

Query: 301 SSIDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRGVSADLATEWQNKYWEAADEF 360
           SS+DKESTKGID DGSRWWKE G+E+RPDGVICKWTLTRGVSADLATEWQNKYWEAADEF
Sbjct: 302 SSVDKESTKGIDPDGSRWWKETGIEQRPDGVICKWTLTRGVSADLATEWQNKYWEAADEF 361

Query: 361 GYKELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTADKWGKNGSGTEWQEKWWEYY 420
           GYKELGSEKSGRDAYG+VWREYWRESMRQEQGLVHLEKTADKWG NGSGTEWQEKWWEYY
Sbjct: 362 GYKELGSEKSGRDAYGNVWREYWRESMRQEQGLVHLEKTADKWGINGSGTEWQEKWWEYY 421

Query: 421 NTSGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGW 480
           NTSGQ EKNAHKWCKIDPNTYVDPGHAHIW+ERWGEKYDGQGGSIKYTDKWAE CEGDGW
Sbjct: 422 NTSGQAEKNAHKWCKIDPNTYVDPGHAHIWNERWGEKYDGQGGSIKYTDKWAERCEGDGW 481

Query: 481 TKWGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWD 540
           TKWGDKWDENFD NGHG+KQGETWWEG+HGERWNRTWGEGH+GSGWVHKYGKSSSGEHWD
Sbjct: 482 TKWGDKWDENFDPNGHGIKQGETWWEGRHGERWNRTWGEGHNGSGWVHKYGKSSSGEHWD 541

Query: 541 THVQQETWYERFPHFGFYHCFNNSVQLREVQKPSE 573
           TH QQETWYERFPHFGFYHCFNNSVQLREVQKPSE
Sbjct: 542 THAQQETWYERFPHFGFYHCFNNSVQLREVQKPSE 576
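
The percentages in an HSP summary line follow directly from the residue counts next to them. A minimal Python sketch, assuming the summary keeps the `matches/length (percent%)` layout shown above; `percent` is a hypothetical helper name, not part of any BLAST tool:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the aligned length, rounded to 2 places."""
    return round(100.0 * matches / aligned_length, 2)

# Counts copied from the Cucumis sativus (A0A0A0LNS6) HSP summary.
identity = percent(486, 575)   # identical residues / alignment length
positives = percent(517, 575)  # identical + conservative residues / alignment length
print(identity, positives)     # 84.52 89.91
```

The same arithmetic reproduces the TAIR 10 figures reported on this page (314/537 and 120/265).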

BLAST of Cp4.1LG12g01700 vs. TAIR 10
Match: AT3G55760.1 (unknown protein; LOCATED IN: chloroplast stroma, chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 10 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G42430.2); Has 176 Blast hits to 125 proteins in 40 species: Archae - 0; Bacteria - 3; Metazoa - 19; Fungi - 9; Plants - 81; Viruses - 0; Other Eukaryotes - 64 (source: NCBI BLink). )

HSP 1 Score: 633.6 bits (1633), Expect = 1.5e-181
Identity = 314/537 (58.47%), Positives = 389/537 (72.44%), Query Frame = 0

Query: 41  GTRRLRIRVSDGEESYLGMWKNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKS 100
           G R LR+  ++G ESYL MWKNAV+R++K   F K+ EN V  D   +       LE+KS
Sbjct: 51  GVRILRVS-NEGRESYLDMWKNAVDREKKEKAFEKIAENVVAVDGEKE---KGGDLEKKS 110

Query: 101 GEFSKILQVPTEERDKIQRMQVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGL 160
            EF KIL+V  EERD+IQRMQV+ RA AAI+AARA++             N ++     +
Sbjct: 111 DEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAILASNNSGDGKEGFPNEDNTVTSEV 170

Query: 161 LDR-EEASSEFQSENALLPESETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQA 220
            +  + A     S    +P SETS T TPGPDFWSWTPP    G+  +  +L+ + K   
Sbjct: 171 TETPKNAKLGMWSRTVYVPRSETSGTETPGPDFWSWTPP---QGSEISSVDLQAVEKPAE 230

Query: 221 YPMLSNFVKEKEPPVGFLSIPFQSEL-PESVKPLLPPFQSLMDIEKLESLETSTETHSLE 280
           +P L N V EK+     LSIP++S L  E     +PPF+SL+++ K    + S+ET S E
Sbjct: 231 FPTLPNPVLEKDKSADSLSIPYESMLSSERHSFTIPPFESLIEVRKEAETKPSSETLSTE 290

Query: 281 EDENVGMEFSVLAAEASQALSSIDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRG 340
            D  + +  S  A E ++ L S+D+ ST G+  DG +WWK+ GVEKRPDGV+C+WT+ RG
Sbjct: 291 HD--LDLISSANAEEVARVLDSLDESSTHGVSEDGLKWWKQTGVEKRPDGVVCRWTMIRG 350

Query: 341 VSADLATEWQNKYWEAADEFGYKELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTA 400
           V+AD   EWQ+KYWEA+D+FG+KELGSEKSGRDA G+VWRE+WRESM QE G+VH+EKTA
Sbjct: 351 VTADGVVEWQDKYWEASDDFGFKELGSEKSGRDATGNVWREFWRESMSQENGVVHMEKTA 410

Query: 401 DKWGKNGSGTEWQEKWWEYYNTSGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDG 460
           DKWGK+G G EWQEKWWE+Y+ +G+ EK AHKWC ID NT +D GHAH+WHERWGEKYDG
Sbjct: 411 DKWGKSGQGDEWQEKWWEHYDATGKSEKWAHKWCSIDRNTPLDAGHAHVWHERWGEKYDG 470

Query: 461 QGGSIKYTDKWAEGCEGDGWTKWGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEG 520
           QGGS KYTDKWAE   GDGW KWGDKWDENF+ +  GVKQGETWWEGKHG+RWNR+WGEG
Sbjct: 471 QGGSTKYTDKWAERWVGDGWDKWGDKWDENFNPSAQGVKQGETWWEGKHGDRWNRSWGEG 530

Query: 521 HSGSGWVHKYGKSSSGEHWDTHVQQETWYERFPHFGFYHCFNNSVQLREVQKPSETS 576
           H+GSGWVHKYGKSSSGEHWDTHV QETWYE+FPHFGF+HCF+NSVQLR V+KPS+ S
Sbjct: 531 HNGSGWVHKYGKSSSGEHWDTHVPQETWYEKFPHFGFFHCFDNSVQLRAVKKPSDMS 578
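
Each HSP reports a bit score followed by the raw score in parentheses. The two are related by the Karlin-Altschul normalization S' = (λS - ln K) / ln 2. The sketch below assumes λ = 0.267 and K = 0.041, the values commonly reported for gapped BLOSUM62 searches with gap costs 11/1. The page does not state which parameters were used, so treat them as an assumption; `bit_score` is a hypothetical helper name.

```python
import math

# Assumed Karlin-Altschul parameters (gapped BLOSUM62, gap open 11, extend 1).
# These values are NOT given on this page.
LAMBDA = 0.267
K = 0.041

def bit_score(raw_score: int, lam: float = LAMBDA, k: float = K) -> float:
    """Convert a raw alignment score to a bit score: (lam * S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# (raw score, bit score as reported on this page)
for raw, reported in [(2613, 1011.0), (1633, 633.6), (602, 236.5)]:
    print(raw, round(bit_score(raw), 1), reported)
```

Under these assumed parameters the computed values land close to the reported bit scores, which suggests (but does not prove) that the searches used default protein scoring.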

BLAST of Cp4.1LG12g01700 vs. TAIR 10
Match: AT3G55760.2 (unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 10 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G42430.2); Has 176 Blast hits to 125 proteins in 40 species: Archae - 0; Bacteria - 3; Metazoa - 19; Fungi - 9; Plants - 81; Viruses - 0; Other Eukaryotes - 64 (source: NCBI BLink). )

HSP 1 Score: 633.6 bits (1633), Expect = 1.5e-181
Identity = 314/537 (58.47%), Positives = 389/537 (72.44%), Query Frame = 0

Query: 41  GTRRLRIRVSDGEESYLGMWKNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKS 100
           G R LR+  ++G ESYL MWKNAV+R++K   F K+ EN V  D   +       LE+KS
Sbjct: 51  GVRILRVS-NEGRESYLDMWKNAVDREKKEKAFEKIAENVVAVDGEKE---KGGDLEKKS 110

Query: 101 GEFSKILQVPTEERDKIQRMQVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGL 160
            EF KIL+V  EERD+IQRMQV+ RA AAI+AARA++             N ++     +
Sbjct: 111 DEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAILASNNSGDGKEGFPNEDNTVTSEV 170

Query: 161 LDR-EEASSEFQSENALLPESETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQA 220
            +  + A     S    +P SETS T TPGPDFWSWTPP    G+  +  +L+ + K   
Sbjct: 171 TETPKNAKLGMWSRTVYVPRSETSGTETPGPDFWSWTPP---QGSEISSVDLQAVEKPAE 230

Query: 221 YPMLSNFVKEKEPPVGFLSIPFQSEL-PESVKPLLPPFQSLMDIEKLESLETSTETHSLE 280
           +P L N V EK+     LSIP++S L  E     +PPF+SL+++ K    + S+ET S E
Sbjct: 231 FPTLPNPVLEKDKSADSLSIPYESMLSSERHSFTIPPFESLIEVRKEAETKPSSETLSTE 290

Query: 281 EDENVGMEFSVLAAEASQALSSIDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRG 340
            D  + +  S  A E ++ L S+D+ ST G+  DG +WWK+ GVEKRPDGV+C+WT+ RG
Sbjct: 291 HD--LDLISSANAEEVARVLDSLDESSTHGVSEDGLKWWKQTGVEKRPDGVVCRWTMIRG 350

Query: 341 VSADLATEWQNKYWEAADEFGYKELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTA 400
           V+AD   EWQ+KYWEA+D+FG+KELGSEKSGRDA G+VWRE+WRESM QE G+VH+EKTA
Sbjct: 351 VTADGVVEWQDKYWEASDDFGFKELGSEKSGRDATGNVWREFWRESMSQENGVVHMEKTA 410

Query: 401 DKWGKNGSGTEWQEKWWEYYNTSGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDG 460
           DKWGK+G G EWQEKWWE+Y+ +G+ EK AHKWC ID NT +D GHAH+WHERWGEKYDG
Sbjct: 411 DKWGKSGQGDEWQEKWWEHYDATGKSEKWAHKWCSIDRNTPLDAGHAHVWHERWGEKYDG 470

Query: 461 QGGSIKYTDKWAEGCEGDGWTKWGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEG 520
           QGGS KYTDKWAE   GDGW KWGDKWDENF+ +  GVKQGETWWEGKHG+RWNR+WGEG
Sbjct: 471 QGGSTKYTDKWAERWVGDGWDKWGDKWDENFNPSAQGVKQGETWWEGKHGDRWNRSWGEG 530

Query: 521 HSGSGWVHKYGKSSSGEHWDTHVQQETWYERFPHFGFYHCFNNSVQLREVQKPSETS 576
           H+GSGWVHKYGKSSSGEHWDTHV QETWYE+FPHFGF+HCF+NSVQLR V+KPS+ S
Sbjct: 531 HNGSGWVHKYGKSSSGEHWDTHVPQETWYEKFPHFGFFHCFDNSVQLRAVKKPSDMS 578

BLAST of Cp4.1LG12g01700 vs. TAIR 10
Match: AT3G55760.3 (unknown protein; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 10 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G42430.2). )

HSP 1 Score: 633.6 bits (1633), Expect = 1.5e-181
Identity = 314/537 (58.47%), Positives = 389/537 (72.44%), Query Frame = 0

Query: 41  GTRRLRIRVSDGEESYLGMWKNAVERQRKAIEFRKVVENTVGNDDRNDGDRSDDQLEEKS 100
           G R LR+  ++G ESYL MWKNAV+R++K   F K+ EN V  D   +       LE+KS
Sbjct: 51  GVRILRVS-NEGRESYLDMWKNAVDREKKEKAFEKIAENVVAVDGEKE---KGGDLEKKS 110

Query: 101 GEFSKILQVPTEERDKIQRMQVIHRATAAIAAARALVGETRIVADSNTSLNLNSRNDGGL 160
            EF KIL+V  EERD+IQRMQV+ RA AAI+AARA++             N ++     +
Sbjct: 111 DEFQKILEVSVEERDRIQRMQVVDRAAAAISAARAILASNNSGDGKEGFPNEDNTVTSEV 170

Query: 161 LDR-EEASSEFQSENALLPESETSPTATPGPDFWSWTPPPDSDGNGNAFSELRPIGKSQA 220
            +  + A     S    +P SETS T TPGPDFWSWTPP    G+  +  +L+ + K   
Sbjct: 171 TETPKNAKLGMWSRTVYVPRSETSGTETPGPDFWSWTPP---QGSEISSVDLQAVEKPAE 230

Query: 221 YPMLSNFVKEKEPPVGFLSIPFQSEL-PESVKPLLPPFQSLMDIEKLESLETSTETHSLE 280
           +P L N V EK+     LSIP++S L  E     +PPF+SL+++ K    + S+ET S E
Sbjct: 231 FPTLPNPVLEKDKSADSLSIPYESMLSSERHSFTIPPFESLIEVRKEAETKPSSETLSTE 290

Query: 281 EDENVGMEFSVLAAEASQALSSIDKESTKGIDSDGSRWWKEMGVEKRPDGVICKWTLTRG 340
            D  + +  S  A E ++ L S+D+ ST G+  DG +WWK+ GVEKRPDGV+C+WT+ RG
Sbjct: 291 HD--LDLISSANAEEVARVLDSLDESSTHGVSEDGLKWWKQTGVEKRPDGVVCRWTMIRG 350

Query: 341 VSADLATEWQNKYWEAADEFGYKELGSEKSGRDAYGSVWREYWRESMRQEQGLVHLEKTA 400
           V+AD   EWQ+KYWEA+D+FG+KELGSEKSGRDA G+VWRE+WRESM QE G+VH+EKTA
Sbjct: 351 VTADGVVEWQDKYWEASDDFGFKELGSEKSGRDATGNVWREFWRESMSQENGVVHMEKTA 410

Query: 401 DKWGKNGSGTEWQEKWWEYYNTSGQVEKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDG 460
           DKWGK+G G EWQEKWWE+Y+ +G+ EK AHKWC ID NT +D GHAH+WHERWGEKYDG
Sbjct: 411 DKWGKSGQGDEWQEKWWEHYDATGKSEKWAHKWCSIDRNTPLDAGHAHVWHERWGEKYDG 470

Query: 461 QGGSIKYTDKWAEGCEGDGWTKWGDKWDENFDSNGHGVKQGETWWEGKHGERWNRTWGEG 520
           QGGS KYTDKWAE   GDGW KWGDKWDENF+ +  GVKQGETWWEGKHG+RWNR+WGEG
Sbjct: 471 QGGSTKYTDKWAERWVGDGWDKWGDKWDENFNPSAQGVKQGETWWEGKHGDRWNRSWGEG 530

Query: 521 HSGSGWVHKYGKSSSGEHWDTHVQQETWYERFPHFGFYHCFNNSVQLREVQKPSETS 576
           H+GSGWVHKYGKSSSGEHWDTHV QETWYE+FPHFGF+HCF+NSVQLR V+KPS+ S
Sbjct: 531 HNGSGWVHKYGKSSSGEHWDTHVPQETWYEKFPHFGFFHCFDNSVQLRAVKKPSDMS 578

BLAST of Cp4.1LG12g01700 vs. TAIR 10
Match: AT1G42430.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G55760.3); Has 186 Blast hits to 143 proteins in 47 species: Archae - 0; Bacteria - 23; Metazoa - 14; Fungi - 6; Plants - 87; Viruses - 0; Other Eukaryotes - 56 (source: NCBI BLink). )

HSP 1 Score: 236.5 bits (602), Expect = 5.3e-62
Identity = 120/265 (45.28%), Positives = 167/265 (63.02%), Query Frame = 0

Query: 308 GIDSDGSRWWKEMGVEKRPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGYKELGSEK 367
           G + DGS W++E G +   +G  C+W+   G S D ++EW   +WE +D  GYKELG EK
Sbjct: 145 GTNEDGSSWFRESGHDLGDNGYRCRWSRMGGRSHDGSSEWTETWWEKSDWTGYKELGVEK 204

Query: 368 SGRDAYGSVWREYWRESMRQEQ--GLVHLEKTADKWGKNGS-GTEWQEKWWEYYNTSGQV 427
           SG+++ G  W E W+E + Q++   L  +E++A K  K+G+    W EKWWE Y+  G  
Sbjct: 205 SGKNSEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWT 264

Query: 428 EKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTKWGDK 487
           EK AHK+ +++  +         W E+WGE YDG+G  +K+TDKWAE   G   TKWGDK
Sbjct: 265 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 324

Query: 488 WDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTHVQQE 547
           W+E F S G G +QGETW    + +RW+RTWGE H G+G VHKYGKS++GE WD  V +E
Sbjct: 325 WEEKFFS-GIGSRQGETWHVSPNSDRWSRTWGEEHFGNGKVHKYGKSTTGESWDIVVDEE 384

Query: 548 TWYERFPHFGFYHCFNNSVQLREVQ 570
           T+YE  PH+G+     +S QL  +Q
Sbjct: 385 TYYEAEPHYGWADVVGDSTQLLSIQ 396

BLAST of Cp4.1LG12g01700 vs. TAIR 10
Match: AT1G42430.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G55760.3). )

HSP 1 Score: 236.5 bits (602), Expect = 5.3e-62
Identity = 120/265 (45.28%), Positives = 167/265 (63.02%), Query Frame = 0

Query: 308 GIDSDGSRWWKEMGVEKRPDGVICKWTLTRGVSADLATEWQNKYWEAADEFGYKELGSEK 367
           G + DGS W++E G +   +G  C+W+   G S D ++EW   +WE +D  GYKELG EK
Sbjct: 128 GTNEDGSSWFRESGHDLGDNGYRCRWSRMGGRSHDGSSEWTETWWEKSDWTGYKELGVEK 187

Query: 368 SGRDAYGSVWREYWRESMRQEQ--GLVHLEKTADKWGKNGS-GTEWQEKWWEYYNTSGQV 427
           SG+++ G  W E W+E + Q++   L  +E++A K  K+G+    W EKWWE Y+  G  
Sbjct: 188 SGKNSEGDSWWETWQEVLHQDEWSNLARIERSAQKQAKSGTENAGWYEKWWEKYDAKGWT 247

Query: 428 EKNAHKWCKIDPNTYVDPGHAHIWHERWGEKYDGQGGSIKYTDKWAEGCEGDGWTKWGDK 487
           EK AHK+ +++  +         W E+WGE YDG+G  +K+TDKWAE   G   TKWGDK
Sbjct: 248 EKGAHKYGRLNEQS---------WWEKWGEHYDGRGSVLKWTDKWAETELG---TKWGDK 307

Query: 488 WDENFDSNGHGVKQGETWWEGKHGERWNRTWGEGHSGSGWVHKYGKSSSGEHWDTHVQQE 547
           W+E F S G G +QGETW    + +RW+RTWGE H G+G VHKYGKS++GE WD  V +E
Sbjct: 308 WEEKFFS-GIGSRQGETWHVSPNSDRWSRTWGEEHFGNGKVHKYGKSTTGESWDIVVDEE 367

Query: 548 TWYERFPHFGFYHCFNNSVQLREVQ 570
           T+YE  PH+G+     +S QL  +Q
Sbjct: 368 TYYEAEPHYGWADVVGDSTQLLSIQ 379
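
In each alignment block the middle line shows a letter where query and subject residues are identical, `+` where the substitution scores positively, and a blank otherwise. The identity count can therefore be recovered from the two sequence rows alone. A minimal sketch, with `count_identities` as a hypothetical helper name:

```python
def count_identities(query: str, subject: str) -> int:
    """Count aligned columns where both rows carry the same residue (gaps excluded)."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    return sum(1 for q, s in zip(query, subject) if q == s and q != "-")

# Last row of the AT1G42430.2 alignment above.
query_row = "TWYERFPHFGFYHCFNNSVQLREVQ"
sbjct_row = "TYYEAEPHYGWADVVGDSTQLLSIQ"
print(count_identities(query_row, sbjct_row))  # 10
```

Summing this count over every row of an HSP should give the numerator of the reported Identity value.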

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_023548665.1  0.0       100.00    uncharacterized protein LOC111807253 [Cucurbita pepo subsp. pepo]
XP_023006712.1  0.0       97.92     uncharacterized protein LOC111499367 [Cucurbita maxima]
XP_022958887.1  0.0       97.40     uncharacterized protein LOC111460038 [Cucurbita moschata]
KAG7013519.1    0.0       97.22     hypothetical protein SDJN02_23685 [Cucurbita argyrosperma subsp. argyrosperma]
KAG6574953.1    0.0       97.05     hypothetical protein SDJN03_25592, partial [Cucurbita argyrosperma subsp. sorori...

Match Name      E-value   Identity  Description
A0A6J1KWK6      0.0       97.92     uncharacterized protein LOC111499367 OS=Cucurbita maxima OX=3661 GN=LOC111499367...
A0A6J1H328      0.0       97.40     uncharacterized protein LOC111460038 OS=Cucurbita moschata OX=3662 GN=LOC1114600...
A0A5A7UQT1      0.0       86.48     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BMD1      0.0       86.48     uncharacterized protein LOC103491223 OS=Cucumis melo OX=3656 GN=LOC103491223 PE=...
A0A0A0LNS6      0.0       84.52     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G215500 PE=4 SV=1

Match Name      E-value   Identity  Description
AT3G55760.1     1.5e-181  58.47     unknown protein; LOCATED IN: chloroplast stroma, chloroplast; EXPRESSED IN: 16 p...
AT3G55760.2     1.5e-181  58.47     unknown protein; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXP...
AT3G55760.3     1.5e-181  58.47     unknown protein; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 10 growth ...
AT1G42430.1     5.3e-62   45.28     unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT1G42430.2     5.3e-62   45.28     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term    Source Description                             Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                            coord: 81..101
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                            coord: 149..192
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                            coord: 170..190
None      No IPR available  MOBIDB_LITE  mobidb-lite    disorder_prediction                            coord: 81..103
None      No IPR available  PANTHER      PTHR34113      INACTIVE PURPLE ACID PHOSPHATASE-LIKE PROTEIN  coord: 34..574
None      No IPR available  PANTHER      PTHR34113:SF2  BNAA01G24310D PROTEIN                          coord: 34..574

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG12g01700.1  Cp4.1LG12g01700.1  mRNA