Cp4.1LG13g01820 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
NameCp4.1LG13g01820
Typegene
OrganismCucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
DescriptionUnknown protein
LocationCp4.1LG13: 1402043 .. 1404828 (+)
RNA-Seq ExpressionCp4.1LG13g01820
SyntenyCp4.1LG13g01820
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
TGAACTTCACTCCCACTAAGAACCATCCTCTCACCTTCTTCTTCTTCTTCTTCATCTTCTTCCTCAAATTAATGGAACTTCACTCTGCTTTTAATGCTCTCAGTACTGAATGTTTCTCTAATTAGTACTCTTCTGCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCATCGTTCTTCACAAATTCAAATCCTCTGTTTCTCCCTTTCTCTCTCTAAGATTTGATTTCTCTGTTCATGGCGTCCATTGATTCTTCCGCTTGGAGCATCATCTCCAAGCCTCCGCCGGAAAAACCTACCTCCTTCATGCTCAAGGATTATCTTCTCGACGATTTCAGTTCCTGCTCCTCCAATGGCTTCCGATCCTTTCCTCGCCGCCAATGCTGCGCCACAACCGTCCGATTTCTTCTCGAAATCGATCTCAAAGTTAAAGATTCTTCCCTAACTAAAAGATTCCTTCCTCGAACTGCCTCCCGAAAAATCGCGCTCTCCACGATCTCCACTTTGCAGAGAGCCTCCGATGCCGTTGTCAGAGCCTTCAAGAAATTCCCCTTGCCTTCTTACCGGAAGCCGTTTTGTTCGAGGAGTTTTTCACGGAAGGTGATTCTGCGAGCGTTCTGGAAGAAACAGGATTTTGTGGATGTGAACACCAGACGGTGTAAATCGTTTCAGGAATTTCTCGATGAGAAAGAACCGCCGTTGTCTCGCTCCGATTCCGCTGTGTGCACCGCCGTGACTGTCGTCGGAAGAAACTCGATTAGTAGCTGTAGTAATAGTATCAGTTGGACGGAGAGCGAATTTACATCGGAGATGATTCCGTCGTCTTCGAGCGGTAACTCCGAGAGTTGCAGCGAAAACGACGCCGTTAAAGTCGATAAGGATTCACCTGGTAATCTCATTGGCAAAAGAGATGGCGTAACATTCGGTAAAGATTCCATGGAGGAAACAACCACCGCCCCTTCCGCCGCCGCCCCCACCACCCCTACCACCACCGGTTTCCGAGAGGATATCGTTAAGGTCAGGATTTAGTTTTCAATCACATGCTTTGCGTGCATTTGATAACCGTCTCTGATTAATATCTCCATTCGATCTCCACCGTTGATTTTAAATTAAAATCGTTTTATTTTTTAATTAAATTTCCGGAGAAAATTGGTCTTTCAAAATAATTTTTAAAATATTCATAAAATTAAATTATAGAGTTCGCGTAGATAAAATAATACATAGAGAAAATACAGCTGGACTGCGGGAGTCTATTAATTTCACGTGTGGGGTATATCTTGGGACAAGTAAAAATACAACTCAGGAACACGCAAATTAAATAAATCGAGTTCCTCGAGGCTTCAGATCTCGCCATGTGTCCATACAGCGGGATCTACGGCTTTCAAACCTGCAACACCTTTCTATTTCATGGTACGCATGATACTAGATTTCTCCCCATGTCTTAACTTTTTGAAAATATTTACTATTATGCCATTCCATTGAAAACTAATCTTTAATATTATTGGTCTGTCATCTGCTGGCTTTTATGTGTCGGTAAGAAAATCATTAGGATTTTTTTCTTTTTTAAAAAAATAAAAAAACCAAATTATTATTAAAAAAAAAAAATGAGGGCATTCCTTTTTGTCTTTTATTGTATTATTATTATTTTACTAATATTATCTATTTTGTGACATTTTTTATTTATAAACATTATCTATTTTTTTTTTTAGAACTTTCCTAAGTTTAAAATAAGAAAATTAACAGGTTATCGATATTTATGGTGATTTCAGATGAGCTATGATTAAATTGGTTAATTTTTAAAAATAAAAAATAAGATTTATTTTATTTTTGTTTGTAAATTTGTAATATTTGTGTTTATTTCTGATCTTAACAGCAATGGCCAAATGAAGAAGAAAAGGAACAGTTGAGTCCAGTTTCAGTGTTGGATTTTCCTTTTGAAGATGAAGATCAAGACACCCCCTCGTCTTTCAACTGCAATCTTCACCTCGTCCAAGGTCTCTCTCTCCTCTTCTCTCTCTAGGATAATTATATATTTTCAATTTTGTCCCTAACCCCGAACGTTTAGGACCCATAATACCTTTCTCGAGTATTTGGTCGTATGAGATTATCATCCGACTATTTTCTTTTAATTCAAAAATATTTCGTGGGAAAATTCAAAGTTTGAAACTTTTAATTACGAGTATTAAAAAAATTTGATTAACCCAACTCGATCCATATAGTTTCAGTTGGGTTATGTACAATTGAGTTGAGTCTAACAAATGTCTTAATACGAGTTCATTGCTTTAACAGGTAAGAAGCAAAAACATGCACAACAGCCGAAGCGATTCGAGAATGGAGTTGAATTTGAACCTCTAGACTTAACGAAACGATTTGCAGACATTGTTGTTGGTCGTCAACATTTCAGCTCAATCTCGAGAAAAGAACACCAAAGGGAACAGAAAGCATTTGAGCTTCTAAAGCTTGTCAAATCAACTAAGACATCGATAGAGAATCTGCTTCTCGATTTCTTCCACGAGAAGCTTGAAGAAAACGACGCAACTGCAAGAACAGGAGCTGATTTTGATCAAGCACAGGTCTTGAAATTCACTGAAGATTGGATCAATGGGGATGTCGGAGAAGCCATGGCGACGGGATGGGAGTCGCCGGAGGGACGGGTTTTGTACATTAAGGACATGGAGATTGCCGGGAAATGGAGAAGTGTCGCCGGAGAAAAGGAAGAGTTGGCGGCGGAGTTTGAAGCTGAGGTTTGGATGTCTTTGTTTGATGAGCTATTAATCGACCTCTCC

mRNA sequence

TGAACTTCACTCCCACTAAGAACCATCCTCTCACCTTCTTCTTCTTCTTCTTCATCTTCTTCCTCAAATTAATGGAACTTCACTCTGCTTTTAATGCTCTCAGTACTGAATGTTTCTCTAATTAGTACTCTTCTGCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCATCGTTCTTCACAAATTCAAATCCTCTGTTTCTCCCTTTCTCTCTCTAAGATTTGATTTCTCTGTTCATGGCGTCCATTGATTCTTCCGCTTGGAGCATCATCTCCAAGCCTCCGCCGGAAAAACCTACCTCCTTCATGCTCAAGGATTATCTTCTCGACGATTTCAGTTCCTGCTCCTCCAATGGCTTCCGATCCTTTCCTCGCCGCCAATGCTGCGCCACAACCGTCCGATTTCTTCTCGAAATCGATCTCAAAGTTAAAGATTCTTCCCTAACTAAAAGATTCCTTCCTCGAACTGCCTCCCGAAAAATCGCGCTCTCCACGATCTCCACTTTGCAGAGAGCCTCCGATGCCGTTGTCAGAGCCTTCAAGAAATTCCCCTTGCCTTCTTACCGGAAGCCGTTTTGTTCGAGGAGTTTTTCACGGAAGGTGATTCTGCGAGCGTTCTGGAAGAAACAGGATTTTGTGGATGTGAACACCAGACGGTGTAAATCGTTTCAGGAATTTCTCGATGAGAAAGAACCGCCGTTGTCTCGCTCCGATTCCGCTGTGTGCACCGCCGTGACTGTCGTCGGAAGAAACTCGATTAGTAGCTGTAGTAATAGTATCAGTTGGACGGAGAGCGAATTTACATCGGAGATGATTCCGTCGTCTTCGAGCGGTAACTCCGAGAGTTGCAGCGAAAACGACGCCGTTAAAGTCGATAAGGATTCACCTGGTAATCTCATTGGCAAAAGAGATGGCGTAACATTCGGTAAAGATTCCATGGAGGAAACAACCACCGCCCCTTCCGCCGCCGCCCCCACCACCCCTACCACCACCGGTTTCCGAGAGGATATCGTTAAGCAATGGCCAAATGAAGAAGAAAAGGAACAGTTGAGTCCAGTTTCAGTGTTGGATTTTCCTTTTGAAGATGAAGATCAAGACACCCCCTCGTCTTTCAACTGCAATCTTCACCTCGTCCAAGACTTAACGAAACGATTTGCAGACATTGTTGTTGGTCGTCAACATTTCAGCTCAATCTCGAGAAAAGAACACCAAAGGGAACAGAAAGCATTTGAGCTTCTAAAGCTTGTCAAATCAACTAAGACATCGATAGAGAATCTGCTTCTCGATTTCTTCCACGAGAAGCTTGAAGAAAACGACGCAACTGCAAGAACAGGAGCTGATTTTGATCAAGCACAGGTCTTGAAATTCACTGAAGATTGGATCAATGGGGATGTCGGAGAAGCCATGGCGACGGGATGGGAGTCGCCGGAGGGACGGGTTTTGTACATTAAGGACATGGAGATTGCCGGGAAATGGAGAAGTGTCGCCGGAGAAAAGGAAGAGTTGGCGGCGGAGTTTGAAGCTGAGGTTTGGATGTCTTTGTTTGATGAGCTATTAATCGACCTCTCC

Coding sequence (CDS)

ATGGCGTCCATTGATTCTTCCGCTTGGAGCATCATCTCCAAGCCTCCGCCGGAAAAACCTACCTCCTTCATGCTCAAGGATTATCTTCTCGACGATTTCAGTTCCTGCTCCTCCAATGGCTTCCGATCCTTTCCTCGCCGCCAATGCTGCGCCACAACCGTCCGATTTCTTCTCGAAATCGATCTCAAAGTTAAAGATTCTTCCCTAACTAAAAGATTCCTTCCTCGAACTGCCTCCCGAAAAATCGCGCTCTCCACGATCTCCACTTTGCAGAGAGCCTCCGATGCCGTTGTCAGAGCCTTCAAGAAATTCCCCTTGCCTTCTTACCGGAAGCCGTTTTGTTCGAGGAGTTTTTCACGGAAGGTGATTCTGCGAGCGTTCTGGAAGAAACAGGATTTTGTGGATGTGAACACCAGACGGTGTAAATCGTTTCAGGAATTTCTCGATGAGAAAGAACCGCCGTTGTCTCGCTCCGATTCCGCTGTGTGCACCGCCGTGACTGTCGTCGGAAGAAACTCGATTAGTAGCTGTAGTAATAGTATCAGTTGGACGGAGAGCGAATTTACATCGGAGATGATTCCGTCGTCTTCGAGCGGTAACTCCGAGAGTTGCAGCGAAAACGACGCCGTTAAAGTCGATAAGGATTCACCTGGTAATCTCATTGGCAAAAGAGATGGCGTAACATTCGGTAAAGATTCCATGGAGGAAACAACCACCGCCCCTTCCGCCGCCGCCCCCACCACCCCTACCACCACCGGTTTCCGAGAGGATATCGTTAAGCAATGGCCAAATGAAGAAGAAAAGGAACAGTTGAGTCCAGTTTCAGTGTTGGATTTTCCTTTTGAAGATGAAGATCAAGACACCCCCTCGTCTTTCAACTGCAATCTTCACCTCGTCCAAGACTTAACGAAACGATTTGCAGACATTGTTGTTGGTCGTCAACATTTCAGCTCAATCTCGAGAAAAGAACACCAAAGGGAACAGAAAGCATTTGAGCTTCTAAAGCTTGTCAAATCAACTAAGACATCGATAGAGAATCTGCTTCTCGATTTCTTCCACGAGAAGCTTGAAGAAAACGACGCAACTGCAAGAACAGGAGCTGATTTTGATCAAGCACAGGTCTTGAAATTCACTGAAGATTGGATCAATGGGGATGTCGGAGAAGCCATGGCGACGGGATGGGAGTCGCCGGAGGGACGGGTTTTGTACATTAAGGACATGGAGATTGCCGGGAAATGGAGAAGTGTCGCCGGAGAAAAGGAAGAGTTGGCGGCGGAGTTTGAAGCTGAGGTTTGGATGTCTTTGTTTGATGAGCTATTAATCGACCTCTCC

Protein sequence

MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCSRSFSRKVILRAFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAAAPTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQDLTKRFADIVVGRQHFSSISRKEHQREQKAFELLKLVKSTKTSIENLLLDFFHEKLEENDATARTGADFDQAQVLKFTEDWINGDVGEAMATGWESPEGRVLYIKDMEIAGKWRSVAGEKEELAAEFEAEVWMSLFDELLIDLS
Homology
BLAST of Cp4.1LG13g01820 vs. NCBI nr
Match: XP_023551213.1 (uncharacterized protein LOC111809098 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 856 bits (2212), Expect = 1.64e-312
Identity = 444/466 (95.28%), Postives = 444/466 (95.28%), Query Frame = 0

Query: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60
           MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI
Sbjct: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60

Query: 61  DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCSRSFSR 120
           DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCSRSFSR
Sbjct: 61  DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCSRSFSR 120

Query: 121 KVILRAFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS 180
           KVILRAFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS
Sbjct: 121 KVILRAFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS 180

Query: 181 ISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA 240
           ISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA
Sbjct: 181 ISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA 240

Query: 241 PSAAAPTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ 300
           PSAAAPTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
Sbjct: 241 PSAAAPTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ 300

Query: 301 ----------------------DLTKRFADIVVGRQHFSSISRKEHQREQKAFELLKLVK 360
                                 DLTKRFADIVVGRQHFSSISRKEHQREQKAFELLKLVK
Sbjct: 301 GKKQKHAQQPKRFENGVEFEPLDLTKRFADIVVGRQHFSSISRKEHQREQKAFELLKLVK 360

Query: 361 STKTSIENLLLDFFHEKLEENDATARTGADFDQAQVLKFTEDWINGDVGEAMATGWESPE 420
           STKTSIENLLLDFFHEKLEENDATARTGADFDQAQVLKFTEDWINGDVGEAMATGWESPE
Sbjct: 361 STKTSIENLLLDFFHEKLEENDATARTGADFDQAQVLKFTEDWINGDVGEAMATGWESPE 420

Query: 421 GRVLYIKDMEIAGKWRSVAGEKEELAAEFEAEVWMSLFDELLIDLS 444
           GRVLYIKDMEIAGKWRSVAGEKEELAAEFEAEVWMSLFDELLIDLS
Sbjct: 421 GRVLYIKDMEIAGKWRSVAGEKEELAAEFEAEVWMSLFDELLIDLS 466

BLAST of Cp4.1LG13g01820 vs. NCBI nr
Match: KAG6579477.1 (hypothetical protein SDJN03_23925, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 833 bits (2153), Expect = 1.60e-303
Identity = 431/466 (92.49%), Postives = 434/466 (93.13%), Query Frame = 0

Query: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60
           MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI
Sbjct: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60

Query: 61  DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCSRSFSR 120
           DLKVKDS+LTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSY KPFCSRSFSR
Sbjct: 61  DLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYPKPFCSRSFSR 120

Query: 121 KVILRAFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS 180
           KVILR FWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS
Sbjct: 121 KVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS 180

Query: 181 ISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA 240
           ISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA
Sbjct: 181 ISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA 240

Query: 241 PSAAAPTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ 300
           PSAA PTTPTTTGFREDIVK WPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHL+Q
Sbjct: 241 PSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLIQ 300

Query: 301 ----------------------DLTKRFADIVVGRQHFSSISRKEHQREQKAFELLKLVK 360
                                 DL KRFADIVVGRQHF SISRKEHQREQKAFELLKLVK
Sbjct: 301 GKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKEHQREQKAFELLKLVK 360

Query: 361 STKTSIENLLLDFFHEKLEENDATARTGADFDQAQVLKFTEDWINGDVGEAMATGWESPE 420
           ST TS ENLLLDFFHEKLEENDA ARTGADFDQAQVLKFTEDWINGD GEAMATGWESPE
Sbjct: 361 STTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWINGDAGEAMATGWESPE 420

Query: 421 GRVLYIKDMEIAGKWRSVAGEKEELAAEFEAEVWMSLFDELLIDLS 444
           GRVLYIKDMEIAGKWRS+AGEKEELAAEFEAEVWMSLFDELLIDLS
Sbjct: 421 GRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS 466

BLAST of Cp4.1LG13g01820 vs. NCBI nr
Match: KAG7016946.1 (hypothetical protein SDJN02_22057, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 832 bits (2148), Expect = 9.61e-303
Identity = 433/467 (92.72%), Postives = 435/467 (93.15%), Query Frame = 0

Query: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60
           MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI
Sbjct: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60

Query: 61  DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCSRSFSR 120
           DLKVKDS+LTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCSRSFSR
Sbjct: 61  DLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCSRSFSR 120

Query: 121 KVILRAFWKKQDFVDVNTRR-CKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSN 180
           KVILRAFWKKQDFVDVNTRR CKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSN
Sbjct: 121 KVILRAFWKKQDFVDVNTRRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSN 180

Query: 181 SISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTT 240
           SISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTT
Sbjct: 181 SISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTT 240

Query: 241 APSAAAPTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV 300
           APSAA PTTPTTTGFREDIVK WPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV
Sbjct: 241 APSAATPTTPTTTGFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLV 300

Query: 301 Q----------------------DLTKRFADIVVGRQHFSSISRKEHQREQKAFELLKLV 360
           Q                      DL KRFADIVVG QHF SISRKEHQREQKAFELLKLV
Sbjct: 301 QGKKQKHAHRPKRFENGVEFEPLDLKKRFADIVVGHQHFGSISRKEHQREQKAFELLKLV 360

Query: 361 KSTKTSIENLLLDFFHEKLEENDATARTGADFDQAQVLKFTEDWINGDVGEAMATGWESP 420
           KST TS ENLLLDFFHEKLEENDA ARTGADFDQAQVLKFTEDWINGD GEAMATGWESP
Sbjct: 361 KSTTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWINGDAGEAMATGWESP 420

Query: 421 EGRVLYIKDMEIAGKWRSVAGEKEELAAEFEAEVWMSLFDELLIDLS 444
           EGRVLYIKDMEIAGKWRS+AGEKEELAAEFEAEVWMSLFDELLIDLS
Sbjct: 421 EGRVLYIKDMEIAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS 467

BLAST of Cp4.1LG13g01820 vs. NCBI nr
Match: XP_022969906.1 (uncharacterized protein LOC111468962 [Cucurbita maxima])

HSP 1 Score: 828 bits (2140), Expect = 1.48e-301
Identity = 429/465 (92.26%), Postives = 432/465 (92.90%), Query Frame = 0

Query: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60
           MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI
Sbjct: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60

Query: 61  DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCSRSFSR 120
           DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFC+RSFSR
Sbjct: 61  DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCTRSFSR 120

Query: 121 KVILRAFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS 180
           KVILRAFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS
Sbjct: 121 KVILRAFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS 180

Query: 181 ISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA 240
           ISWTESEFTSE IPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA
Sbjct: 181 ISWTESEFTSERIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA 240

Query: 241 PSAAAPTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ 300
           PSAA PTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
Sbjct: 241 PSAATPTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ 300

Query: 301 ---------------------DLTKRFADIVVGRQHFSSISRKEHQREQKAFELLKLVKS 360
                                DL KRFADIVVG QHFS ISRKEHQREQKAFELLKLVKS
Sbjct: 301 GKKLHAQKSKRFENGVEFEPLDLKKRFADIVVGSQHFSLISRKEHQREQKAFELLKLVKS 360

Query: 361 TKTSIENLLLDFFHEKLEENDATARTGADFDQAQVLKFTEDWINGDVGEAMATGWESPEG 420
           T TS ENLLLDFFHEKLEENDA ARTGAD DQAQVLKFTEDWINGD GEAM TGWE+PEG
Sbjct: 361 TTTSTENLLLDFFHEKLEENDAIARTGADIDQAQVLKFTEDWINGDAGEAMVTGWETPEG 420

Query: 421 RVLYIKDMEIAGKWRSVAGEKEELAAEFEAEVWMSLFDELLIDLS 444
           RVLYIKDMEIAGKWRSV GEKEELAAEFEAEVW+SLFDELLIDLS
Sbjct: 421 RVLYIKDMEIAGKWRSVGGEKEELAAEFEAEVWISLFDELLIDLS 465

BLAST of Cp4.1LG13g01820 vs. NCBI nr
Match: XP_022922340.1 (uncharacterized protein LOC111430353 [Cucurbita moschata])

HSP 1 Score: 826 bits (2133), Expect = 1.79e-300
Identity = 429/466 (92.06%), Postives = 432/466 (92.70%), Query Frame = 0

Query: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60
           MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI
Sbjct: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60

Query: 61  DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCSRSFSR 120
           DLKVKDS+LTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSY KPFCSRSFSR
Sbjct: 61  DLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYPKPFCSRSFSR 120

Query: 121 KVILRAFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS 180
           KVILR FWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS
Sbjct: 121 KVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS 180

Query: 181 ISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA 240
           ISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA
Sbjct: 181 ISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA 240

Query: 241 PSAAAPTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ 300
           PSAA PTTPTTT FREDIVK WPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
Sbjct: 241 PSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ 300

Query: 301 ----------------------DLTKRFADIVVGRQHFSSISRKEHQREQKAFELLKLVK 360
                                 DL KRFADIVVGRQHF SISRKE+QREQKAFELLKLVK
Sbjct: 301 GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVK 360

Query: 361 STKTSIENLLLDFFHEKLEENDATARTGADFDQAQVLKFTEDWINGDVGEAMATGWESPE 420
           ST TS ENLLLDFFHEKLEENDA ARTGADFDQAQVLKFTEDWINGD GEAMATGWESPE
Sbjct: 361 STTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWINGDAGEAMATGWESPE 420

Query: 421 GRVLYIKDMEIAGKWRSVAGEKEELAAEFEAEVWMSLFDELLIDLS 444
           GRVLYIKDME AGKWRS+AGEKEELAAEFEAEVWMSLFDELLIDLS
Sbjct: 421 GRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS 466

BLAST of Cp4.1LG13g01820 vs. ExPASy TrEMBL
Match: A0A6J1HZ34 (uncharacterized protein LOC111468962 OS=Cucurbita maxima OX=3661 GN=LOC111468962 PE=4 SV=1)

HSP 1 Score: 828 bits (2140), Expect = 7.14e-302
Identity = 429/465 (92.26%), Postives = 432/465 (92.90%), Query Frame = 0

Query: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60
           MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI
Sbjct: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60

Query: 61  DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCSRSFSR 120
           DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFC+RSFSR
Sbjct: 61  DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCTRSFSR 120

Query: 121 KVILRAFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS 180
           KVILRAFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS
Sbjct: 121 KVILRAFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS 180

Query: 181 ISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA 240
           ISWTESEFTSE IPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA
Sbjct: 181 ISWTESEFTSERIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA 240

Query: 241 PSAAAPTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ 300
           PSAA PTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
Sbjct: 241 PSAATPTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ 300

Query: 301 ---------------------DLTKRFADIVVGRQHFSSISRKEHQREQKAFELLKLVKS 360
                                DL KRFADIVVG QHFS ISRKEHQREQKAFELLKLVKS
Sbjct: 301 GKKLHAQKSKRFENGVEFEPLDLKKRFADIVVGSQHFSLISRKEHQREQKAFELLKLVKS 360

Query: 361 TKTSIENLLLDFFHEKLEENDATARTGADFDQAQVLKFTEDWINGDVGEAMATGWESPEG 420
           T TS ENLLLDFFHEKLEENDA ARTGAD DQAQVLKFTEDWINGD GEAM TGWE+PEG
Sbjct: 361 TTTSTENLLLDFFHEKLEENDAIARTGADIDQAQVLKFTEDWINGDAGEAMVTGWETPEG 420

Query: 421 RVLYIKDMEIAGKWRSVAGEKEELAAEFEAEVWMSLFDELLIDLS 444
           RVLYIKDMEIAGKWRSV GEKEELAAEFEAEVW+SLFDELLIDLS
Sbjct: 421 RVLYIKDMEIAGKWRSVGGEKEELAAEFEAEVWISLFDELLIDLS 465

BLAST of Cp4.1LG13g01820 vs. ExPASy TrEMBL
Match: A0A6J1E8H2 (uncharacterized protein LOC111430353 OS=Cucurbita moschata OX=3662 GN=LOC111430353 PE=4 SV=1)

HSP 1 Score: 826 bits (2133), Expect = 8.64e-301
Identity = 429/466 (92.06%), Postives = 432/466 (92.70%), Query Frame = 0

Query: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60
           MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI
Sbjct: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60

Query: 61  DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCSRSFSR 120
           DLKVKDS+LTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSY KPFCSRSFSR
Sbjct: 61  DLKVKDSALTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYPKPFCSRSFSR 120

Query: 121 KVILRAFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS 180
           KVILR FWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS
Sbjct: 121 KVILRTFWKKQDFVDVNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNS 180

Query: 181 ISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA 240
           ISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA
Sbjct: 181 ISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTA 240

Query: 241 PSAAAPTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ 300
           PSAA PTTPTTT FREDIVK WPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ
Sbjct: 241 PSAATPTTPTTTDFREDIVKPWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ 300

Query: 301 ----------------------DLTKRFADIVVGRQHFSSISRKEHQREQKAFELLKLVK 360
                                 DL KRFADIVVGRQHF SISRKE+QREQKAFELLKLVK
Sbjct: 301 GKKQKHAHKPKRFENGVEFEPLDLKKRFADIVVGRQHFGSISRKENQREQKAFELLKLVK 360

Query: 361 STKTSIENLLLDFFHEKLEENDATARTGADFDQAQVLKFTEDWINGDVGEAMATGWESPE 420
           ST TS ENLLLDFFHEKLEENDA ARTGADFDQAQVLKFTEDWINGD GEAMATGWESPE
Sbjct: 361 STTTSTENLLLDFFHEKLEENDAIARTGADFDQAQVLKFTEDWINGDAGEAMATGWESPE 420

Query: 421 GRVLYIKDMEIAGKWRSVAGEKEELAAEFEAEVWMSLFDELLIDLS 444
           GRVLYIKDME AGKWRS+AGEKEELAAEFEAEVWMSLFDELLIDLS
Sbjct: 421 GRVLYIKDMEKAGKWRSLAGEKEELAAEFEAEVWMSLFDELLIDLS 466

BLAST of Cp4.1LG13g01820 vs. ExPASy TrEMBL
Match: A0A0A0KP06 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G155580 PE=4 SV=1)

HSP 1 Score: 615 bits (1585), Expect = 3.25e-217
Identity = 334/477 (70.02%), Postives = 377/477 (79.04%), Query Frame = 0

Query: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60
           MAS DSS W++IS PP +KP S +LKDYLLDDFSSCSSNGFRSFPRRQCC+TTVRFLLEI
Sbjct: 1   MASTDSSNWTLISNPPLQKPKSLLLKDYLLDDFSSCSSNGFRSFPRRQCCSTTVRFLLEI 60

Query: 61  DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCSRSFSR 120
           DLKVKDSS+TKRFLPRT SRKIALSTISTLQRASDAV+RAFK+FPLPS RK F  RS SR
Sbjct: 61  DLKVKDSSVTKRFLPRTTSRKIALSTISTLQRASDAVLRAFKQFPLPSSRKSFFPRSISR 120

Query: 121 KVILRAFWKKQDFVDVN-TRRCKSFQEFLDEKEPPLS------RSDSAVCTAVTVVGRNS 180
           K+I +AF KK D VD N  +R KSF+EFLDEKEPP S       SDSAVCTA+ V GRNS
Sbjct: 121 KLISKAFRKKSDIVDPNINKRWKSFKEFLDEKEPPSSSSFEENHSDSAVCTAIAVAGRNS 180

Query: 181 ISSCSNSISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDS 240
           ISSCSNSISWTESEFTSE+IPSS SGNSESCSENDAVK DKDSPGNLIGKRDGVTFGKDS
Sbjct: 181 ISSCSNSISWTESEFTSEIIPSSCSGNSESCSENDAVKDDKDSPGNLIGKRDGVTFGKDS 240

Query: 241 MEETTTAPSAAAPTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFN 300
           MEETTTAP++ A  T +   +RED VKQW NEEEKEQ SPVSVLDFPFEDEDQD  SSFN
Sbjct: 241 MEETTTAPTSVAAAT-SADDYREDTVKQWQNEEEKEQFSPVSVLDFPFEDEDQDISSSFN 300

Query: 301 CNLHLVQ-----------------------DLTKRFADI-VVGRQ-HFSSISRKEHQREQ 360
           CN+HL++                       DL KRF +I V+G Q HF+ I++KEHQ E+
Sbjct: 301 CNVHLMEGKKQKQRDQKTKRLEKGTELEPVDLKKRFTNISVIGDQDHFTLITKKEHQMEE 360

Query: 361 KAFELLKLVKSTKTSIENLLLDFFHEKLEENDATARTGADFDQAQVLKFTEDWINGDVGE 420
           KA E LKL+KST  S ENLLLDFFH+KL+E++AT+ T +DFDQ Q+LKF +DWI+G+ GE
Sbjct: 361 KALEFLKLLKSTTESTENLLLDFFHQKLDEHEATS-TNSDFDQPQLLKFAQDWIDGNAGE 420

Query: 421 AMATG-WESPEGRVLYIKDMEIAGKWRSVAGEKEELAAEFEAEVWMSLFDELLIDLS 444
               G WE PE R  YIKDME+  KWRS  G+KEEL AEFE EVW+SL ++LLIDLS
Sbjct: 421 LTVMGRWELPEERNFYIKDMEVGDKWRSFGGDKEELVAEFEGEVWISLLNDLLIDLS 475

BLAST of Cp4.1LG13g01820 vs. ExPASy TrEMBL
Match: A0A5A7TN51 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold44G003020 PE=4 SV=1)

HSP 1 Score: 594 bits (1532), Expect = 3.56e-209
Identity = 327/479 (68.27%), Postives = 370/479 (77.24%), Query Frame = 0

Query: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60
           MAS DSS W++IS PP +KP S +LKDYLLDDFSSCSSNGFRSFPRRQCC+TTVRFLLEI
Sbjct: 1   MASTDSSNWTLISNPPLQKPKSLLLKDYLLDDFSSCSSNGFRSFPRRQCCSTTVRFLLEI 60

Query: 61  DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCSRSFSR 120
           DLKVKDSS TK+FLPRT+SRKIALSTISTLQRASDAV+RAFK+FPLPS RK F  RS SR
Sbjct: 61  DLKVKDSSFTKKFLPRTSSRKIALSTISTLQRASDAVLRAFKQFPLPSSRKSFFPRSISR 120

Query: 121 KVILRAFWKKQDFVDVNTRRCKSFQEFLDEKEPPLS------RSDSAVCTAVTVVGRNSI 180
           K+I +AF KK D VD N RR KSF+EFLDEKEPP S       SDSAVCTA+ V GRNSI
Sbjct: 121 KLISKAFRKKSDIVDPNNRRWKSFKEFLDEKEPPSSSSFEQNHSDSAVCTAIAVAGRNSI 180

Query: 181 SSCSNSISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSM 240
           SSCSNSISWTESEFTSE+IPSS SGNSESCSEN AVK DKDSP NLIGKRDGVTFGKDSM
Sbjct: 181 SSCSNSISWTESEFTSEIIPSSCSGNSESCSENGAVKDDKDSPVNLIGKRDGVTFGKDSM 240

Query: 241 EETTTAPSAAAPTTPTTTGFREDIVKQWPN-EEEKEQLSPVSVLDFPFEDEDQDTPSSFN 300
           EET T  + AA        +RED VK+W N EEEKEQ SPVSVLDFPFEDEDQD  SSFN
Sbjct: 241 EETGTPSAVAA----AAGEYREDTVKKWQNNEEEKEQFSPVSVLDFPFEDEDQDISSSFN 300

Query: 301 CNLHLVQ-----------------------DLTKRFADIVV---GRQHFSSISR-KEHQR 360
           CN+HL++                       DL KRF +I V    + HF+ I++ KEHQ 
Sbjct: 301 CNIHLMEGKKQKQRDQKTKRLEKGTELEPVDLKKRFTNISVIADHQDHFTLITKLKEHQM 360

Query: 361 EQKAFELLKLVKSTKTSIENLLLDFFHEKLEENDATARTGADFDQAQVLKFTEDWINGDV 420
           E+KA E LKL+KST  S ENLLLDFFH+KL+E++AT+ T +DFDQ Q+L+F +DW++G+ 
Sbjct: 361 EEKALEFLKLLKSTTKSTENLLLDFFHQKLDEHEATS-TNSDFDQPQLLEFAQDWVDGNA 420

Query: 421 GEAMATG-WESPEGRVLYIKDMEIAGKWRSVAGEKEELAAEFEAEVWMSLFDELLIDLS 444
           GE    G WE PE R  YIKDME+A KWRS  G+KEEL AEFEAEVW+SL D+LLIDLS
Sbjct: 421 GELTVMGRWELPEERNFYIKDMEVAEKWRSFGGDKEELVAEFEAEVWISLLDDLLIDLS 474

BLAST of Cp4.1LG13g01820 vs. ExPASy TrEMBL
Match: A0A1S3ATL0 (uncharacterized protein LOC103482706 OS=Cucumis melo OX=3656 GN=LOC103482706 PE=4 SV=1)

HSP 1 Score: 592 bits (1526), Expect = 2.91e-208
Identity = 326/479 (68.06%), Postives = 369/479 (77.04%), Query Frame = 0

Query: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60
           MAS DSS W++IS PP +KP S +LKDYLLDDFSSCSSNGFRSFPRRQCC+TTVRFLLEI
Sbjct: 1   MASTDSSNWTLISNPPLQKPKSLLLKDYLLDDFSSCSSNGFRSFPRRQCCSTTVRFLLEI 60

Query: 61  DLKVKDSSLTKRFLPRTASRKIALSTISTLQRASDAVVRAFKKFPLPSYRKPFCSRSFSR 120
           DLKVKDSS TK+FLPRT+SRKIALSTISTLQRASDAV+RAFK+FPLPS RK F  RS SR
Sbjct: 61  DLKVKDSSFTKKFLPRTSSRKIALSTISTLQRASDAVLRAFKQFPLPSSRKSFFPRSISR 120

Query: 121 KVILRAFWKKQDFVDVNTRRCKSFQEFLDEKEPPLS------RSDSAVCTAVTVVGRNSI 180
           K+I +AF KK D VD N RR KSF+EFLDEKEPP S       SDSAVCTA+ V GRNSI
Sbjct: 121 KLISKAFRKKSDIVDPNNRRWKSFKEFLDEKEPPSSSSFEQNHSDSAVCTAIAVAGRNSI 180

Query: 181 SSCSNSISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSM 240
           SSCSNSISWTESEFTSE+IPSS SGNSESCSEN AVK DKDSP NLIGKRDGVTFGKDSM
Sbjct: 181 SSCSNSISWTESEFTSEIIPSSCSGNSESCSENGAVKDDKDSPVNLIGKRDGVTFGKDSM 240

Query: 241 EETTTAPSAAAPTTPTTTGFREDIVKQWPN-EEEKEQLSPVSVLDFPFEDEDQDTPSSFN 300
           EET T  + AA        +RED VK+W N EEEKEQ SPVSVLDFPFEDEDQD  SS N
Sbjct: 241 EETGTPSAVAA----AAGEYREDTVKKWQNNEEEKEQFSPVSVLDFPFEDEDQDISSSLN 300

Query: 301 CNLHLVQ-----------------------DLTKRFADIVV---GRQHFSSISR-KEHQR 360
           CN+HL++                       DL KRF +I V    + HF+ I++ KEHQ 
Sbjct: 301 CNIHLMEGKKQKQRDQKTKRLEKGTELEPVDLKKRFTNISVIADHQDHFTLITKLKEHQM 360

Query: 361 EQKAFELLKLVKSTKTSIENLLLDFFHEKLEENDATARTGADFDQAQVLKFTEDWINGDV 420
           E+KA E LKL+KST  S ENLLLDFFH+KL+E++AT+ T +DFDQ Q+L+F +DW++G+ 
Sbjct: 361 EEKALEFLKLLKSTTKSTENLLLDFFHQKLDEHEATS-TNSDFDQPQLLEFAQDWVDGNA 420

Query: 421 GEAMATG-WESPEGRVLYIKDMEIAGKWRSVAGEKEELAAEFEAEVWMSLFDELLIDLS 444
           GE    G WE PE R  YIKDME+A KWRS  G+KEEL AEFEAEVW+SL D+LLIDLS
Sbjct: 421 GELTVMGRWELPEERNFYIKDMEVAEKWRSFGGDKEELVAEFEAEVWISLLDDLLIDLS 474

BLAST of Cp4.1LG13g01820 vs. TAIR 10
Match: AT4G11780.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G23020.2); Has 550 Blast hits to 387 proteins in 92 species: Archae - 0; Bacteria - 32; Metazoa - 132; Fungi - 122; Plants - 80; Viruses - 0; Other Eukaryotes - 184 (source: NCBI BLink). )

HSP 1 Score: 115.5 bits (288), Expect = 1.1e-25
Identity = 145/487 (29.77%), Postives = 228/487 (46.82%), Query Frame = 0

Query: 24  MLKDYLLDDFSSCSSNGFRSFPRRQ--CCATTVRFLLEIDLK----VKDSSLTKRFLPRT 83
           +L+DYLLDD SSCSSNGF+SFPRRQ    ++TVR LL+ ++K    +     TK+  PR 
Sbjct: 23  LLRDYLLDDLSSCSSNGFKSFPRRQTPSASSTVRRLLDAEIKRSGFIHHHHHTKQ--PRL 82

Query: 84  ASRKIALSTISTLQRASDAVVRAF-KKFPLPS---YRKPFCSRSFSRKVILRAFWKK--- 143
             R    +  + +  A      AF K  P PS    ++   SRSFS++++  +FW+K   
Sbjct: 83  TRRSSHTTCGTAISHAVHKASTAFLKLLPFPSSTVKKQGVFSRSFSKRLLSISFWRKPVV 142

Query: 144 ---------QDFVDVNTRRCKSFQEFLDEKEPPLSR--------SDSAVCTAVTVVGR-- 203
                        ++   R  +++E LD++    S+        + S    A+TVV    
Sbjct: 143 GQSRREVTGDGDGEIQWWRSVAYEESLDQQSDLFSQISTTDDKITFSTSAAAITVVEEFI 202

Query: 204 NSISSCSNSISWTES-----EFTSEMIPSSSSGNSES-CSENDAVKVDKDSPGNLIGKRD 263
           +  SS   S  +T S     + +S    SSSSG SE   SE DAV+  K+S G+ +   D
Sbjct: 203 SGDSSSYGSEFFTNSSSEVVQSSSSSFSSSSSGESEEVSSEIDAVEDGKES-GDSLKAHD 262

Query: 264 GVTFGKDSMEETTTAPSAAAPTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDED 323
           G                 ++     +   R++ V      EEKEQLSPVS+L+ PF+D+D
Sbjct: 263 G---------------DGSSVNRNNSLCNRKECV-----NEEKEQLSPVSILECPFKDDD 322

Query: 324 QDTPSSFNCNLH------------LVQ----DLTKRFADIVVGRQHFS--SISRKEHQRE 383
           +D   +   + +            LV+    DL KR    V  ++ +S  ++  +E + E
Sbjct: 323 EDDEITDQNDTYEKIARKSRRLNGLVRLEPLDLDKRIERYVERQEEYSYHTLETEEDESE 382

Query: 384 QKAFELLKLVK----------STKTSIENLLLDFFHEKLEENDATARTGADFDQAQVLKF 443
            +A  L  LVK          ++K + +NLLLD+    L+E++   +     ++  ++K 
Sbjct: 383 NQANRLFALVKLRIGETNDLLASKVA-DNLLLDY----LQEDNIGPK-----EETLMVKK 442

BLAST of Cp4.1LG13g01820 vs. TAIR 10
Match: AT4G23020.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G11780.1); Has 146 Blast hits to 146 proteins in 40 species: Archae - 0; Bacteria - 16; Metazoa - 17; Fungi - 4; Plants - 67; Viruses - 20; Other Eukaryotes - 22 (source: NCBI BLink). )

HSP 1 Score: 104.8 bits (260), Expect = 1.9e-22
Identity = 136/494 (27.53%), Postives = 205/494 (41.50%), Query Frame = 0

Query: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60
           M SI SS   +       KP   +L+D+LLDD SSCSSNGF+SFPR             +
Sbjct: 1   MTSISSSDHPLPVSKKRLKP--LILRDFLLDDLSSCSSNGFKSFPRL------------L 60

Query: 61  DLKVKDSSLTKRFLPRTASRKI--ALSTISTLQRASDAVVRAFKKFPLP-SYRKPFCSRS 120
           + +++ S +         +R+I   L+    + +AS A++ A K  P P S +     R 
Sbjct: 61  NAEIQRSGMFHH------NRRITCGLAFSHAVHKASTALLTAVKLLPFPSSVKSQSRDRD 120

Query: 121 FSRKVILRAFWKKQDFVDVNT----------------RRCKSFQEFLDEKEPPLSRSDSA 180
             + +  R+FWKK    ++N                 +RC+SF EFL E +  LS     
Sbjct: 121 NKKGLFSRSFWKKLSRRELNVDVGEKERRTDDREEEIQRCRSFAEFLQESQDQLSDQIYY 180

Query: 181 VCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLI 240
           +       G  ++S                     + G+S S S  D+ +V + S G ++
Sbjct: 181 ISPTDLFSGEATLS-------------------KDAVGDSSSFSSEDS-EVTQSSSGVIV 240

Query: 241 GKRDGVTFGKDSMEETTTAPSAAAPTTPTTTGFREDIVKQWPNEEEKEQLSPVSVLDFPF 300
               G   G    + ++                  D  ++  N EEKEQLSP+S+LD PF
Sbjct: 241 VMMSGDCVGSHVSDGSSL----------------NDNTEECEN-EEKEQLSPISILDCPF 300

Query: 301 EDEDQDTPSSFNCNLHLVQ----------------DLTKRFADIVVGRQHFSS--ISRKE 360
           +D+    PS         Q                DL KR  +    RQ + S  I  +E
Sbjct: 301 QDDAISPPSHHKETYEKKQMRKRRRLESLVRLEPVDLEKRI-EKYEERQDYKSHIIEIEE 360

Query: 361 HQREQKAFELLKLVKS----------TKTSIENLLLDFFHEKLEENDATARTGADFDQAQ 420
            Q E +A  L  LVKS              ++N+LLDFF E    N+ T       D+ +
Sbjct: 361 DQSEIRANRLFALVKSRIIEEQNQLLASHVVDNVLLDFFKE--NNNNETR------DEDK 420

Query: 421 VLKFTEDWI--NGDVGEAMATGWESPEGRVLYIKDMEIAGKWRSVAG-EKEELAAEFEAE 445
           +++  E+W+    D    M   W+  E R +Y+K+M    KW  + G EKE +  E    
Sbjct: 421 LVEIVEEWVMRRQDDEYNMFMSWKVSEKREIYVKEM----KWGCINGDEKEYVVEELGNG 424

BLAST of Cp4.1LG13g01820 vs. TAIR 10
Match: AT4G23020.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G11780.1). )

HSP 1 Score: 104.0 bits (258), Expect = 3.2e-22
Identity = 141/508 (27.76%), Postives = 207/508 (40.75%), Query Frame = 0

Query: 1   MASIDSSAWSIISKPPPEKPTSFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEI 60
           M SI SS   +       KP   +L+D+LLDD SSCSSNGF+SFPR             +
Sbjct: 1   MTSISSSDHPLPVSKKRLKP--LILRDFLLDDLSSCSSNGFKSFPRL------------L 60

Query: 61  DLKVKDSSLTKRFLPRTASRKI--ALSTISTLQRASDAVVRAFKKFPLP-SYRKPFCSRS 120
           + +++ S +         +R+I   L+    + +AS A++ A K  P P S +     R 
Sbjct: 61  NAEIQRSGMFHH------NRRITCGLAFSHAVHKASTALLTAVKLLPFPSSVKSQSRDRD 120

Query: 121 FSRKVILRAFWKKQDFVDVNT----------------RRCKSFQEFLDEKEPPLSRSDSA 180
             + +  R+FWKK    ++N                 +RC+SF EFL E +  LS     
Sbjct: 121 NKKGLFSRSFWKKLSRRELNVDVGEKERRTDDREEEIQRCRSFAEFLQESQDQLSDQIYY 180

Query: 181 VCTAVTVVGRNSISSCSNSISWTESEFTSEMIPSSSSGNSESCSENDAVKVDKDSPGNLI 240
           +       G  ++S                     + G+S S S  D+ +V + S G ++
Sbjct: 181 ISPTDLFSGEATLS-------------------KDAVGDSSSFSSEDS-EVTQSSSGVIV 240

Query: 241 GKRDGVTFGKDSMEETTTAPSAAAPTTPTTTGFREDIVKQW---PN-----------EEE 300
               G   G       +   S    T     GF      Q+   PN            EE
Sbjct: 241 VMMSGDCVG----SHVSDGSSLNDNTEIFLRGFESGFYGQYHSLPNVNLVHLKFECENEE 300

Query: 301 KEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQ----------------DLTKRFADIVV 360
           KEQLSP+S+LD PF+D+    PS         Q                DL KR  +   
Sbjct: 301 KEQLSPISILDCPFQDDAISPPSHHKETYEKKQMRKRRRLESLVRLEPVDLEKRI-EKYE 360

Query: 361 GRQHFSS--ISRKEHQREQKAFELLKLVKS----------TKTSIENLLLDFFHEKLEEN 420
            RQ + S  I  +E Q E +A  L  LVKS              ++N+LLDFF E    N
Sbjct: 361 ERQDYKSHIIEIEEDQSEIRANRLFALVKSRIIEEQNQLLASHVVDNVLLDFFKE--NNN 420

Query: 421 DATARTGADFDQAQVLKFTEDWI--NGDVGEAMATGWESPEGRVLYIKDMEIAGKWRSVA 445
           + T       D+ ++++  E+W+    D    M   W+  E R +Y+K+M    KW  + 
Sbjct: 421 NETR------DEDKLVEIVEEWVMRRQDDEYNMFMSWKVSEKREIYVKEM----KWGCIN 451

BLAST of Cp4.1LG13g01820 vs. TAIR 10
Match: AT4G00770.1 (unknown protein; Has 127 Blast hits to 120 proteins in 33 species: Archae - 0; Bacteria - 2; Metazoa - 6; Fungi - 8; Plants - 62; Viruses - 3; Other Eukaryotes - 46 (source: NCBI BLink). )

HSP 1 Score: 62.4 bits (150), Expect = 1.1e-09
Identity = 114/479 (23.80%), Postives = 178/479 (37.16%), Query Frame = 0

Query: 22  SFMLKDYLLDDFSSCSSNGFRSFPRRQCCATTVRFLLEIDLKVKDSSLTKRFLPRTASRK 81
           S MLKD LL+D +SCSSNGF+S PRR                           P    RK
Sbjct: 5   SRMLKDCLLEDSNSCSSNGFKSIPRRH---------------------PLNPFPMIPKRK 64

Query: 82  IALSTISTLQRASDAVVRAFKKF---PLPSYRKPFCSRSFSRKVILRAFWKKQDFVD--- 141
                      A  AV+ A K      + S       RS SR++  +   + Q  +    
Sbjct: 65  --------QSNALQAVINAIKNLHSNTIKSAPSGILPRSLSRRLATKNKAENQASITVIR 124

Query: 142 ----VNTRRCKSFQEFLDEKEPPLSRSDSAVCTAVTVVGRNSISSCSNSISWTESEFTSE 201
               V     K   E +   EP    + +   T  T     S +SCS   SW++ +FTSE
Sbjct: 125 VKDIVRWHSSKDLHEDISHFEPHQYTTKNT--TTTTGSSTTSGTSCS---SWSDLDFTSE 184

Query: 202 MIPSSSSGNSESCSENDAVKVDKDSPGNLIGKRDGVTFGKDSMEETTTAPSAAAPTTPTT 261
            +PSS   N E C E  +VK +    G                E++ TA   A     T 
Sbjct: 185 FLPSSWGSNVEECGEKQSVKNNLHCVG----------------EDSCTAVILA----DTE 244

Query: 262 TGFREDIVKQWPNEEEKEQLSPVSVLDFPFEDEDQDTPSSFNCNLHLVQDLTKRFADIVV 321
            G  E++      + EKE  SPVSV +   E+ D+ + SSF+  L  V+   ++    + 
Sbjct: 245 VGPEENL------QCEKEHNSPVSVFEIQHEEYDETSDSSFSQCLDNVERTKQKLMQTIQ 304

Query: 322 GRQHFSSIS-----------------------------------------RKEHQREQKA 381
             +  ++IS                                             + E+KA
Sbjct: 305 RFESLANISPFNLDEWGSMDEASCMEGGQETDTKYDDDENCDTVDRESEDEYNDEVEEKA 364

Query: 382 FELLKLVKSTKT---SIENLLLDFFHEKLEENDATARTGADFDQAQVLKFTEDWINGDVG 441
            +L   VK         E+L++D+F ++L +   +      F+  Q++   + W+ G   
Sbjct: 365 AQLWNRVKERHAIWIHEEHLIMDYFRDELMQRTNSFHETQHFEN-QLVCEAKGWLQGKRE 420

Query: 442 EAMATGWESPEGRVLYIKDMEIAGKW--RSVAGEKEELAAEFEAEVWMSLFDELLIDLS 445
             +  G  S + R    +++E    W  + +  E E +  + E E++  L DE L  LS
Sbjct: 425 SELERG-TSEQRRQACAREIE-RRDWNEKQIEEEHEVVVTQIEEELFSLLMDETLTTLS 420

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
XP_023551213.11.64e-31295.28uncharacterized protein LOC111809098 [Cucurbita pepo subsp. pepo][more]
KAG6579477.11.60e-30392.49hypothetical protein SDJN03_23925, partial [Cucurbita argyrosperma subsp. sorori... [more]
KAG7016946.19.61e-30392.72hypothetical protein SDJN02_22057, partial [Cucurbita argyrosperma subsp. argyro... [more]
XP_022969906.11.48e-30192.26uncharacterized protein LOC111468962 [Cucurbita maxima][more]
XP_022922340.11.79e-30092.06uncharacterized protein LOC111430353 [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A6J1HZ347.14e-30292.26uncharacterized protein LOC111468962 OS=Cucurbita maxima OX=3661 GN=LOC111468962... [more]
A0A6J1E8H28.64e-30192.06uncharacterized protein LOC111430353 OS=Cucurbita moschata OX=3662 GN=LOC1114303... [more]
A0A0A0KP063.25e-21770.02Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G155580 PE=4 SV=1[more]
A0A5A7TN513.56e-20968.27Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold... [more]
A0A1S3ATL02.91e-20868.06uncharacterized protein LOC103482706 OS=Cucumis melo OX=3656 GN=LOC103482706 PE=... [more]
Match NameE-valueIdentityDescription
AT4G11780.11.1e-2529.77unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G23020.11.9e-2227.53unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G23020.23.2e-2227.76unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT4G00770.11.1e-0923.80unknown protein; Has 127 Blast hits to 120 proteins in 33 species: Archae - 0; B... [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 191..253
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 191..206
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 232..253
NoneNo IPR availablePANTHERPTHR33623:SF4OS04G0572500 PROTEINcoord: 1..443
NoneNo IPR availablePANTHERPTHR33623OS04G0572500 PROTEINcoord: 1..443

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cp4.1LG13g01820.1Cp4.1LG13g01820.1mRNA