Cp4.1LG20g02800 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG20g02800
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: root hair specific 4
Location: Cp4.1LG20: 1583906 .. 1585054 (-)
RNA-Seq Expression: Cp4.1LG20g02800
Synteny: Cp4.1LG20g02800
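
The location given in the overview can be checked against the sequence lengths reported further down. The following is a minimal sketch, not part of the original record: the helper names `parse_location` and `feature_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive.

```python
import re


def parse_location(location):
    """Split a location string such as 'Cp4.1LG20: 1583906 .. 1585054 (-)'.

    Hypothetical helper; assumes the 'seqid: start .. end (strand)' layout.
    """
    match = re.fullmatch(
        r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", location
    )
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


def feature_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG20: 1583906 .. 1585054 (-)")
length = feature_length(start, end)
print(seqid, strand, length)  # Cp4.1LG20 - 1149
```

Under that assumption the feature spans 1149 bp, which equals 3 × (382 + 1): the 382 residues covered by the BLAST query coordinates plus one stop codon.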
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCTGGTTGTTGTGCAATTCCATTCACCTTCAAATTCTTCCTCTCTGTCACAGATTTCTCCGGTGATGGCACCTGCACTGCCTACGTCGGATAACTTCAACAATGAGAGGTTGATGTCCGGGAAGCTCGAGTTCGTTGGTTCAACGTATTGTTCCGATGATGACGAATGTGTGGAAAAGGAGAAACAGATATCGGTGGATCCAATATCATTGAGACAGTCATCGGCGAGAGAGGATATGATTTTCGATCCCATCACGGCTCCTGATGTCCCTGATCTTCATCTGCCGCCGCCGCTACCTCCGACGCAGTTCAAGTTCTTAAGCTACAGCCTGCCGAATTCCGTCAATTCATCTCCCCGATTCGGTTCAATGAAAAAGAAAGGAAAACTAGAAAATCAAGAATCGAAACTTAAAATCTCGAATTCGACGAAGCTCAAATCGTCGGTGCAGGATATACAAGTCGCTCTGCAAGAGGATACTCAATTTCGAAGGAGTAAATCATGTGGCGAAGGCAGAGCAAGTGCTCCCGCCGATGATTTGGATCTGTTGTTGAATAAAGCAAAATTTCCAGAAACGATGAGTTACGATGATTTCGTTAGAACTGAATCGAACAAAGATTATCGTAATGGTGCAGAGAACTTAGAGCCTACCGATGACGGATTTAAATGCGGGGCTCTCTGTTTATTCTTACCGGGATTCGGCAAAGCGAAGGCCGTTAGGTCAATCAGAAAGGAAGAAGAACCAGAGATAGGAAAAGTGAGGATGTCGAGGACCGAGATTGGAAGCGTGATATCGAGGACAGTTTCGATGGAGAAATTCGAATGTGGATCATGGGCTTCATCTGCCATGCCAAACGAAACTGGCGAAGACGACTCCAGTAGCAGCCTTTTCTACGATCTGCCAATGGAGTTAATAAGAAATAGTGTGGATGCAAATGCGCCAATCAGTGCAGCTTTCGTCTTCGATAAAGATCAGAAGGGAGTTACAAAAAACAGTTCGTCACAAAAATCTCACGAACCGTCGCATCATGTTCGATTCTCGGCATCGTCTCCTTCAGGACCTTCCTCGCCAGCGTCTTGCATCACACCTAGATTGCGCAAGGCAAGAGAGGAGTTTAATGCCTTTCTCGAAGCCCAAAGCAGTGCTTAA

mRNA sequence

ATGCTGGTTGTTGTGCAATTCCATTCACCTTCAAATTCTTCCTCTCTGTCACAGATTTCTCCGGTGATGGCACCTGCACTGCCTACGTCGGATAACTTCAACAATGAGAGGTTGATGTCCGGGAAGCTCGAGTTCGTTGGTTCAACGTATTGTTCCGATGATGACGAATGTGTGGAAAAGGAGAAACAGATATCGGTGGATCCAATATCATTGAGACAGTCATCGGCGAGAGAGGATATGATTTTCGATCCCATCACGGCTCCTGATGTCCCTGATCTTCATCTGCCGCCGCCGCTACCTCCGACGCAGTTCAAGTTCTTAAGCTACAGCCTGCCGAATTCCGTCAATTCATCTCCCCGATTCGGTTCAATGAAAAAGAAAGGAAAACTAGAAAATCAAGAATCGAAACTTAAAATCTCGAATTCGACGAAGCTCAAATCGTCGGTGCAGGATATACAAGTCGCTCTGCAAGAGGATACTCAATTTCGAAGGAGTAAATCATGTGGCGAAGGCAGAGCAAGTGCTCCCGCCGATGATTTGGATCTGTTGTTGAATAAAGCAAAATTTCCAGAAACGATGAGTTACGATGATTTCGTTAGAACTGAATCGAACAAAGATTATCGTAATGGTGCAGAGAACTTAGAGCCTACCGATGACGGATTTAAATGCGGGGCTCTCTGTTTATTCTTACCGGGATTCGGCAAAGCGAAGGCCGTTAGGTCAATCAGAAAGGAAGAAGAACCAGAGATAGGAAAAGTGAGGATGTCGAGGACCGAGATTGGAAGCGTGATATCGAGGACAGTTTCGATGGAGAAATTCGAATGTGGATCATGGGCTTCATCTGCCATGCCAAACGAAACTGGCGAAGACGACTCCAGTAGCAGCCTTTTCTACGATCTGCCAATGGAGTTAATAAGAAATAGTGTGGATGCAAATGCGCCAATCAGTGCAGCTTTCGTCTTCGATAAAGATCAGAAGGGAGTTACAAAAAACAGTTCGTCACAAAAATCTCACGAACCGTCGCATCATGTTCGATTCTCGGCATCGTCTCCTTCAGGACCTTCCTCGCCAGCGTCTTGCATCACACCTAGATTGCGCAAGGCAAGAGAGGAGTTTAATGCCTTTCTCGAAGCCCAAAGCAGTGCTTAA

Coding sequence (CDS)

ATGCTGGTTGTTGTGCAATTCCATTCACCTTCAAATTCTTCCTCTCTGTCACAGATTTCTCCGGTGATGGCACCTGCACTGCCTACGTCGGATAACTTCAACAATGAGAGGTTGATGTCCGGGAAGCTCGAGTTCGTTGGTTCAACGTATTGTTCCGATGATGACGAATGTGTGGAAAAGGAGAAACAGATATCGGTGGATCCAATATCATTGAGACAGTCATCGGCGAGAGAGGATATGATTTTCGATCCCATCACGGCTCCTGATGTCCCTGATCTTCATCTGCCGCCGCCGCTACCTCCGACGCAGTTCAAGTTCTTAAGCTACAGCCTGCCGAATTCCGTCAATTCATCTCCCCGATTCGGTTCAATGAAAAAGAAAGGAAAACTAGAAAATCAAGAATCGAAACTTAAAATCTCGAATTCGACGAAGCTCAAATCGTCGGTGCAGGATATACAAGTCGCTCTGCAAGAGGATACTCAATTTCGAAGGAGTAAATCATGTGGCGAAGGCAGAGCAAGTGCTCCCGCCGATGATTTGGATCTGTTGTTGAATAAAGCAAAATTTCCAGAAACGATGAGTTACGATGATTTCGTTAGAACTGAATCGAACAAAGATTATCGTAATGGTGCAGAGAACTTAGAGCCTACCGATGACGGATTTAAATGCGGGGCTCTCTGTTTATTCTTACCGGGATTCGGCAAAGCGAAGGCCGTTAGGTCAATCAGAAAGGAAGAAGAACCAGAGATAGGAAAAGTGAGGATGTCGAGGACCGAGATTGGAAGCGTGATATCGAGGACAGTTTCGATGGAGAAATTCGAATGTGGATCATGGGCTTCATCTGCCATGCCAAACGAAACTGGCGAAGACGACTCCAGTAGCAGCCTTTTCTACGATCTGCCAATGGAGTTAATAAGAAATAGTGTGGATGCAAATGCGCCAATCAGTGCAGCTTTCGTCTTCGATAAAGATCAGAAGGGAGTTACAAAAAACAGTTCGTCACAAAAATCTCACGAACCGTCGCATCATGTTCGATTCTCGGCATCGTCTCCTTCAGGACCTTCCTCGCCAGCGTCTTGCATCACACCTAGATTGCGCAAGGCAAGAGAGGAGTTTAATGCCTTTCTCGAAGCCCAAAGCAGTGCTTAA

Protein sequence

MLVVVQFHSPSNSSSLSQISPVMAPALPTSDNFNNERLMSGKLEFVGSTYCSDDDECVEKEKQISVDPISLRQSSAREDMIFDPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGSMKKKGKLENQESKLKISNSTKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDLDLLLNKAKFPETMSYDDFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDLPMELIRNSVDANAPISAAFVFDKDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASCITPRLRKAREEFNAFLEAQSSA
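
The protein sequence appears consistent with a translation of the CDS under the standard genetic code. A minimal sketch of that translation follows; the function name `translate` is illustrative, the standard code is an assumption, and only the first 30 and last 18 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        residues.append(aa)
    return "".join(residues)


cds_start = "ATGCTGGTTGTTGTGCAATTCCATTCACCT"  # first 30 nt of the CDS
cds_end = "GCCCAAAGCAGTGCTTAA"                # last 18 nt of the CDS

print(translate(cds_start))  # MLVVVQFHSP
print(translate(cds_end))    # AQSSA
```

Both fragments match the corresponding ends of the protein sequence listed above.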
Homology
BLAST of Cp4.1LG20g02800 vs. NCBI nr
Match: XP_022924159.1 (uncharacterized protein LOC111431688 isoform X1 [Cucurbita moschata])

HSP 1 Score: 724 bits (1868), Expect = 1.50e-262
Identity = 374/382 (97.91%), Positives = 377/382 (98.69%), Query Frame = 0

Query: 1   MLVVVQFHSPSNSSSLSQISPVMAPALPTSDNFNNERLMSGKLEFVGSTYCSDDDECVEK 60
           M  V+QFHSPSNSSSLSQISPVMAPALPTSDNFNNERLMSGKL FVGSTYCSDDDECVEK
Sbjct: 1   MPAVLQFHSPSNSSSLSQISPVMAPALPTSDNFNNERLMSGKLVFVGSTYCSDDDECVEK 60

Query: 61  EKQISVDPISLRQSSAREDMIFDPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPR 120
           EKQISVDPISLRQSSAREDMIFDPITAPD PDLHLPPPLPPTQFKFLSYSLPNSVNSSPR
Sbjct: 61  EKQISVDPISLRQSSAREDMIFDPITAPDAPDLHLPPPLPPTQFKFLSYSLPNSVNSSPR 120

Query: 121 FGSMKKKGKLENQESKLKISNSTKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDL 180
           FGSMKKKGKLENQESKLKISNSTKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDL
Sbjct: 121 FGSMKKKGKLENQESKLKISNSTKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDL 180

Query: 181 DLLLNKAKFPETMSYDDFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVR 240
           DLLLNKAKFPETMSY DFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVR
Sbjct: 181 DLLLNKAKFPETMSYGDFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVR 240

Query: 241 SIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDL 300
           SIRKEEEPEIGKVR+SRTEIGSVISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDL
Sbjct: 241 SIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDL 300

Query: 301 PMELIRNSVDANAPISAAFVFDKDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASC 360
           PMELIRNSVDANAPISAAFVFDKDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASC
Sbjct: 301 PMELIRNSVDANAPISAAFVFDKDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASC 360

Query: 361 ITPRLRKAREEFNAFLEAQSSA 382
           ITPRLRKAREEFNAFLEAQS+A
Sbjct: 361 ITPRLRKAREEFNAFLEAQSNA 382
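
The identity and positives figures in each HSP summary are simple ratios over the alignment length. The sketch below recomputes them from a summary line; the function name `hsp_percentages` is hypothetical, and the regular expression assumes the `label = matched/aligned` layout used in these reports.

```python
import re


def hsp_percentages(summary):
    """Recompute identity/positives percentages from an HSP summary line.

    Hypothetical helper; tolerates spelling variants of the positives label.
    """
    result = {}
    pattern = r"(Identity|Pos\w+)\s*=\s*(\d+)/(\d+)"
    for label, matched, aligned in re.findall(pattern, summary):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(aligned), 2)
    return result


line = "Identity = 374/382 (97.91%), Positives = 377/382 (98.69%), Query Frame = 0"
stats = hsp_percentages(line)
print(stats)  # {'identity': 97.91, 'positives': 98.69}
```

Here 374 identical residues over 382 aligned positions give 97.91%, and 377 conservative-or-identical positions give 98.69%, matching the reported values.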

BLAST of Cp4.1LG20g02800 vs. NCBI nr
Match: KAG6584176.1 (hypothetical protein SDJN03_20108, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 723 bits (1867), Expect = 2.14e-262
Identity = 374/382 (97.91%), Positives = 377/382 (98.69%), Query Frame = 0

Query: 1   MLVVVQFHSPSNSSSLSQISPVMAPALPTSDNFNNERLMSGKLEFVGSTYCSDDDECVEK 60
           M VV+QFHSPSNSSSLSQISPVMAPALPTSDNFNNERLM GKL FVGSTYCSDDDECVEK
Sbjct: 1   MPVVLQFHSPSNSSSLSQISPVMAPALPTSDNFNNERLMPGKLVFVGSTYCSDDDECVEK 60

Query: 61  EKQISVDPISLRQSSAREDMIFDPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPR 120
           EKQISVDPISLRQSSAREDMIFDPITAPD PDLHLPPPLPPTQFKFLSYSLPNSVNSSPR
Sbjct: 61  EKQISVDPISLRQSSAREDMIFDPITAPDAPDLHLPPPLPPTQFKFLSYSLPNSVNSSPR 120

Query: 121 FGSMKKKGKLENQESKLKISNSTKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDL 180
           FGSMKKKGKLENQESKLKISNSTKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDL
Sbjct: 121 FGSMKKKGKLENQESKLKISNSTKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDL 180

Query: 181 DLLLNKAKFPETMSYDDFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVR 240
           DLLLNKAKFPETMSY DFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVR
Sbjct: 181 DLLLNKAKFPETMSYGDFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVR 240

Query: 241 SIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDL 300
           SIRKEEEPEIGKVR+SRTEIGSVISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDL
Sbjct: 241 SIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDL 300

Query: 301 PMELIRNSVDANAPISAAFVFDKDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASC 360
           PMELIRNSVDANAPISAAFVFDKDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASC
Sbjct: 301 PMELIRNSVDANAPISAAFVFDKDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASC 360

Query: 361 ITPRLRKAREEFNAFLEAQSSA 382
           ITPRLRKAREEFNAFLEAQS+A
Sbjct: 361 ITPRLRKAREEFNAFLEAQSNA 382

BLAST of Cp4.1LG20g02800 vs. NCBI nr
Match: XP_023000753.1 (uncharacterized protein LOC111495110 isoform X1 [Cucurbita maxima])

HSP 1 Score: 715 bits (1845), Expect = 4.82e-259
Identity = 369/382 (96.60%), Positives = 377/382 (98.69%), Query Frame = 0

Query: 1   MLVVVQFHSPSNSSSLSQISPVMAPALPTSDNFNNERLMSGKLEFVGSTYCSDDDECVEK 60
           M VV+QFHSPSNSSSLSQISPVMAPALPTSDNFNNERLMSGKLEFVGSTYCSDD ECVEK
Sbjct: 1   MPVVLQFHSPSNSSSLSQISPVMAPALPTSDNFNNERLMSGKLEFVGSTYCSDDVECVEK 60

Query: 61  EKQISVDPISLRQSSAREDMIFDPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPR 120
           EKQISVDPISLRQSSAREDMIFDPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPR
Sbjct: 61  EKQISVDPISLRQSSAREDMIFDPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPR 120

Query: 121 FGSMKKKGKLENQESKLKISNSTKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDL 180
           FGSMKKKGKLENQESKLKISNSTKLKSS+QDIQVALQEDTQFRRSKSCGEGRASAPADDL
Sbjct: 121 FGSMKKKGKLENQESKLKISNSTKLKSSLQDIQVALQEDTQFRRSKSCGEGRASAPADDL 180

Query: 181 DLLLNKAKFPETMSYDDFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVR 240
           DLLLNKAKFPET SY DFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVR
Sbjct: 181 DLLLNKAKFPETTSYGDFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVR 240

Query: 241 SIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDL 300
           SIRKEEEPEIGK+R+SRTEIGSVISRTVSMEKFECGSWASSAMPN+TGEDDSSSSLF+DL
Sbjct: 241 SIRKEEEPEIGKMRISRTEIGSVISRTVSMEKFECGSWASSAMPNDTGEDDSSSSLFFDL 300

Query: 301 PMELIRNSVDANAPISAAFVFDKDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASC 360
           PMELIRNSVDANAPISAAFVFDKDQKGVTKN+SSQKSHE SHHVRFSASSPSGPSSPASC
Sbjct: 301 PMELIRNSVDANAPISAAFVFDKDQKGVTKNNSSQKSHESSHHVRFSASSPSGPSSPASC 360

Query: 361 ITPRLRKAREEFNAFLEAQSSA 382
           ITPRLRKAREEFNAF+EAQSSA
Sbjct: 361 ITPRLRKAREEFNAFIEAQSSA 382

BLAST of Cp4.1LG20g02800 vs. NCBI nr
Match: XP_023520328.1 (uncharacterized protein LOC111783644 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 699 bits (1804), Expect = 3.73e-253
Identity = 360/360 (100.00%), Positives = 360/360 (100.00%), Query Frame = 0

Query: 23  MAPALPTSDNFNNERLMSGKLEFVGSTYCSDDDECVEKEKQISVDPISLRQSSAREDMIF 82
           MAPALPTSDNFNNERLMSGKLEFVGSTYCSDDDECVEKEKQISVDPISLRQSSAREDMIF
Sbjct: 1   MAPALPTSDNFNNERLMSGKLEFVGSTYCSDDDECVEKEKQISVDPISLRQSSAREDMIF 60

Query: 83  DPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGSMKKKGKLENQESKLKISNS 142
           DPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGSMKKKGKLENQESKLKISNS
Sbjct: 61  DPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGSMKKKGKLENQESKLKISNS 120

Query: 143 TKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDLDLLLNKAKFPETMSYDDFVRTE 202
           TKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDLDLLLNKAKFPETMSYDDFVRTE
Sbjct: 121 TKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDLDLLLNKAKFPETMSYDDFVRTE 180

Query: 203 SNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEPEIGKVRMSRTEIGS 262
           SNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEPEIGKVRMSRTEIGS
Sbjct: 181 SNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEPEIGKVRMSRTEIGS 240

Query: 263 VISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDLPMELIRNSVDANAPISAAFVFD 322
           VISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDLPMELIRNSVDANAPISAAFVFD
Sbjct: 241 VISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDLPMELIRNSVDANAPISAAFVFD 300

Query: 323 KDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASCITPRLRKAREEFNAFLEAQSSA 382
           KDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASCITPRLRKAREEFNAFLEAQSSA
Sbjct: 301 KDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASCITPRLRKAREEFNAFLEAQSSA 360

BLAST of Cp4.1LG20g02800 vs. NCBI nr
Match: XP_022924160.1 (uncharacterized protein LOC111431688 isoform X2 [Cucurbita moschata])

HSP 1 Score: 689 bits (1779), Expect = 2.41e-249
Identity = 355/360 (98.61%), Positives = 357/360 (99.17%), Query Frame = 0

Query: 23  MAPALPTSDNFNNERLMSGKLEFVGSTYCSDDDECVEKEKQISVDPISLRQSSAREDMIF 82
           MAPALPTSDNFNNERLMSGKL FVGSTYCSDDDECVEKEKQISVDPISLRQSSAREDMIF
Sbjct: 1   MAPALPTSDNFNNERLMSGKLVFVGSTYCSDDDECVEKEKQISVDPISLRQSSAREDMIF 60

Query: 83  DPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGSMKKKGKLENQESKLKISNS 142
           DPITAPD PDLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGSMKKKGKLENQESKLKISNS
Sbjct: 61  DPITAPDAPDLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGSMKKKGKLENQESKLKISNS 120

Query: 143 TKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDLDLLLNKAKFPETMSYDDFVRTE 202
           TKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDLDLLLNKAKFPETMSY DFVRTE
Sbjct: 121 TKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDLDLLLNKAKFPETMSYGDFVRTE 180

Query: 203 SNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEPEIGKVRMSRTEIGS 262
           SNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEPEIGKVR+SRTEIGS
Sbjct: 181 SNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEPEIGKVRISRTEIGS 240

Query: 263 VISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDLPMELIRNSVDANAPISAAFVFD 322
           VISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDLPMELIRNSVDANAPISAAFVFD
Sbjct: 241 VISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDLPMELIRNSVDANAPISAAFVFD 300

Query: 323 KDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASCITPRLRKAREEFNAFLEAQSSA 382
           KDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASCITPRLRKAREEFNAFLEAQS+A
Sbjct: 301 KDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASCITPRLRKAREEFNAFLEAQSNA 360

BLAST of Cp4.1LG20g02800 vs. ExPASy TrEMBL
Match: A0A6J1E8C7 (uncharacterized protein LOC111431688 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111431688 PE=4 SV=1)

HSP 1 Score: 724 bits (1868), Expect = 7.29e-263
Identity = 374/382 (97.91%), Positives = 377/382 (98.69%), Query Frame = 0

Query: 1   MLVVVQFHSPSNSSSLSQISPVMAPALPTSDNFNNERLMSGKLEFVGSTYCSDDDECVEK 60
           M  V+QFHSPSNSSSLSQISPVMAPALPTSDNFNNERLMSGKL FVGSTYCSDDDECVEK
Sbjct: 1   MPAVLQFHSPSNSSSLSQISPVMAPALPTSDNFNNERLMSGKLVFVGSTYCSDDDECVEK 60

Query: 61  EKQISVDPISLRQSSAREDMIFDPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPR 120
           EKQISVDPISLRQSSAREDMIFDPITAPD PDLHLPPPLPPTQFKFLSYSLPNSVNSSPR
Sbjct: 61  EKQISVDPISLRQSSAREDMIFDPITAPDAPDLHLPPPLPPTQFKFLSYSLPNSVNSSPR 120

Query: 121 FGSMKKKGKLENQESKLKISNSTKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDL 180
           FGSMKKKGKLENQESKLKISNSTKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDL
Sbjct: 121 FGSMKKKGKLENQESKLKISNSTKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDL 180

Query: 181 DLLLNKAKFPETMSYDDFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVR 240
           DLLLNKAKFPETMSY DFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVR
Sbjct: 181 DLLLNKAKFPETMSYGDFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVR 240

Query: 241 SIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDL 300
           SIRKEEEPEIGKVR+SRTEIGSVISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDL
Sbjct: 241 SIRKEEEPEIGKVRISRTEIGSVISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDL 300

Query: 301 PMELIRNSVDANAPISAAFVFDKDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASC 360
           PMELIRNSVDANAPISAAFVFDKDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASC
Sbjct: 301 PMELIRNSVDANAPISAAFVFDKDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASC 360

Query: 361 ITPRLRKAREEFNAFLEAQSSA 382
           ITPRLRKAREEFNAFLEAQS+A
Sbjct: 361 ITPRLRKAREEFNAFLEAQSNA 382

BLAST of Cp4.1LG20g02800 vs. ExPASy TrEMBL
Match: A0A6J1KNI3 (uncharacterized protein LOC111495110 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111495110 PE=4 SV=1)

HSP 1 Score: 715 bits (1845), Expect = 2.33e-259
Identity = 369/382 (96.60%), Positives = 377/382 (98.69%), Query Frame = 0

Query: 1   MLVVVQFHSPSNSSSLSQISPVMAPALPTSDNFNNERLMSGKLEFVGSTYCSDDDECVEK 60
           M VV+QFHSPSNSSSLSQISPVMAPALPTSDNFNNERLMSGKLEFVGSTYCSDD ECVEK
Sbjct: 1   MPVVLQFHSPSNSSSLSQISPVMAPALPTSDNFNNERLMSGKLEFVGSTYCSDDVECVEK 60

Query: 61  EKQISVDPISLRQSSAREDMIFDPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPR 120
           EKQISVDPISLRQSSAREDMIFDPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPR
Sbjct: 61  EKQISVDPISLRQSSAREDMIFDPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPR 120

Query: 121 FGSMKKKGKLENQESKLKISNSTKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDL 180
           FGSMKKKGKLENQESKLKISNSTKLKSS+QDIQVALQEDTQFRRSKSCGEGRASAPADDL
Sbjct: 121 FGSMKKKGKLENQESKLKISNSTKLKSSLQDIQVALQEDTQFRRSKSCGEGRASAPADDL 180

Query: 181 DLLLNKAKFPETMSYDDFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVR 240
           DLLLNKAKFPET SY DFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVR
Sbjct: 181 DLLLNKAKFPETTSYGDFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVR 240

Query: 241 SIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDL 300
           SIRKEEEPEIGK+R+SRTEIGSVISRTVSMEKFECGSWASSAMPN+TGEDDSSSSLF+DL
Sbjct: 241 SIRKEEEPEIGKMRISRTEIGSVISRTVSMEKFECGSWASSAMPNDTGEDDSSSSLFFDL 300

Query: 301 PMELIRNSVDANAPISAAFVFDKDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASC 360
           PMELIRNSVDANAPISAAFVFDKDQKGVTKN+SSQKSHE SHHVRFSASSPSGPSSPASC
Sbjct: 301 PMELIRNSVDANAPISAAFVFDKDQKGVTKNNSSQKSHESSHHVRFSASSPSGPSSPASC 360

Query: 361 ITPRLRKAREEFNAFLEAQSSA 382
           ITPRLRKAREEFNAF+EAQSSA
Sbjct: 361 ITPRLRKAREEFNAFIEAQSSA 382

BLAST of Cp4.1LG20g02800 vs. ExPASy TrEMBL
Match: A0A6J1EE09 (uncharacterized protein LOC111431688 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111431688 PE=4 SV=1)

HSP 1 Score: 689 bits (1779), Expect = 1.17e-249
Identity = 355/360 (98.61%), Positives = 357/360 (99.17%), Query Frame = 0

Query: 23  MAPALPTSDNFNNERLMSGKLEFVGSTYCSDDDECVEKEKQISVDPISLRQSSAREDMIF 82
           MAPALPTSDNFNNERLMSGKL FVGSTYCSDDDECVEKEKQISVDPISLRQSSAREDMIF
Sbjct: 1   MAPALPTSDNFNNERLMSGKLVFVGSTYCSDDDECVEKEKQISVDPISLRQSSAREDMIF 60

Query: 83  DPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGSMKKKGKLENQESKLKISNS 142
           DPITAPD PDLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGSMKKKGKLENQESKLKISNS
Sbjct: 61  DPITAPDAPDLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGSMKKKGKLENQESKLKISNS 120

Query: 143 TKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDLDLLLNKAKFPETMSYDDFVRTE 202
           TKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDLDLLLNKAKFPETMSY DFVRTE
Sbjct: 121 TKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDLDLLLNKAKFPETMSYGDFVRTE 180

Query: 203 SNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEPEIGKVRMSRTEIGS 262
           SNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEPEIGKVR+SRTEIGS
Sbjct: 181 SNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEPEIGKVRISRTEIGS 240

Query: 263 VISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDLPMELIRNSVDANAPISAAFVFD 322
           VISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDLPMELIRNSVDANAPISAAFVFD
Sbjct: 241 VISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDLPMELIRNSVDANAPISAAFVFD 300

Query: 323 KDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASCITPRLRKAREEFNAFLEAQSSA 382
           KDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASCITPRLRKAREEFNAFLEAQS+A
Sbjct: 301 KDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASCITPRLRKAREEFNAFLEAQSNA 360

BLAST of Cp4.1LG20g02800 vs. ExPASy TrEMBL
Match: A0A6J1KKV0 (uncharacterized protein LOC111495110 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111495110 PE=4 SV=1)

HSP 1 Score: 679 bits (1752), Expect = 1.52e-245
Identity = 349/360 (96.94%), Positives = 356/360 (98.89%), Query Frame = 0

Query: 23  MAPALPTSDNFNNERLMSGKLEFVGSTYCSDDDECVEKEKQISVDPISLRQSSAREDMIF 82
           MAPALPTSDNFNNERLMSGKLEFVGSTYCSDD ECVEKEKQISVDPISLRQSSAREDMIF
Sbjct: 1   MAPALPTSDNFNNERLMSGKLEFVGSTYCSDDVECVEKEKQISVDPISLRQSSAREDMIF 60

Query: 83  DPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGSMKKKGKLENQESKLKISNS 142
           DPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGSMKKKGKLENQESKLKISNS
Sbjct: 61  DPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGSMKKKGKLENQESKLKISNS 120

Query: 143 TKLKSSVQDIQVALQEDTQFRRSKSCGEGRASAPADDLDLLLNKAKFPETMSYDDFVRTE 202
           TKLKSS+QDIQVALQEDTQFRRSKSCGEGRASAPADDLDLLLNKAKFPET SY DFVRTE
Sbjct: 121 TKLKSSLQDIQVALQEDTQFRRSKSCGEGRASAPADDLDLLLNKAKFPETTSYGDFVRTE 180

Query: 203 SNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEPEIGKVRMSRTEIGS 262
           SNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEPEIGK+R+SRTEIGS
Sbjct: 181 SNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEPEIGKMRISRTEIGS 240

Query: 263 VISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDLPMELIRNSVDANAPISAAFVFD 322
           VISRTVSMEKFECGSWASSAMPN+TGEDDSSSSLF+DLPMELIRNSVDANAPISAAFVFD
Sbjct: 241 VISRTVSMEKFECGSWASSAMPNDTGEDDSSSSLFFDLPMELIRNSVDANAPISAAFVFD 300

Query: 323 KDQKGVTKNSSSQKSHEPSHHVRFSASSPSGPSSPASCITPRLRKAREEFNAFLEAQSSA 382
           KDQKGVTKN+SSQKSHE SHHVRFSASSPSGPSSPASCITPRLRKAREEFNAF+EAQSSA
Sbjct: 301 KDQKGVTKNNSSQKSHESSHHVRFSASSPSGPSSPASCITPRLRKAREEFNAFIEAQSSA 360

BLAST of Cp4.1LG20g02800 vs. ExPASy TrEMBL
Match: A0A5D3BJI8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G005000 PE=4 SV=1)

HSP 1 Score: 529 bits (1362), Expect = 3.83e-186
Identity = 292/370 (78.92%), Positives = 318/370 (85.95%), Query Frame = 0

Query: 23  MAPALPTSDNFNNERLMSGKLEFVGSTYCSDDDECVE-KEKQISVDPISLRQSSAREDMI 82
           MAPALPTSDNFNN R +SGKLEF+ STY  D+ EC + KEKQISVDPISLR+SSARED+I
Sbjct: 1   MAPALPTSDNFNNSRSISGKLEFIVSTYSPDNAECADQKEKQISVDPISLRESSAREDII 60

Query: 83  FDPITAPDVPDLHLPPPLPPTQFKFLSYSLPNSVNSSPRFGSMKKKGKLENQESKLKISN 142
            DP+TAPDV DLHLPPPLPPTQFKFLSYSLPNS NSSP+F  MKKKGK ENQ S LK+SN
Sbjct: 61  VDPLTAPDVADLHLPPPLPPTQFKFLSYSLPNSANSSPKF--MKKKGKFENQASLLKVSN 120

Query: 143 STKLKSSVQDIQVAL-QEDTQFRRSKSCGEGRASAPADDLDLLLNKAKFPETMSYDD-FV 202
           STKL SSVQDIQ    QEDTQFRRSKSCGEGRASAPADDLDL LNKAKFPET SYDD F 
Sbjct: 121 STKLNSSVQDIQSTTPQEDTQFRRSKSCGEGRASAPADDLDLWLNKAKFPETKSYDDGFS 180

Query: 203 RTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEP-EIGKVRMSRT 262
           +TESNK       NLE  D+GF CGALCLFLPGFGK K+V+S+RKEEE  E+ KVR+S+T
Sbjct: 181 KTESNK-------NLEAPDEGFNCGALCLFLPGFGKGKSVKSMRKEEETTEMEKVRISKT 240

Query: 263 EIGSVISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDLPMELIRNSVDANAPISAA 322
           EIGSVISRTVS+EKFECGSWASS +PNETGED++ SSLFYDLP+EL+RNSVDANAP++AA
Sbjct: 241 EIGSVISRTVSLEKFECGSWASSVLPNETGEDEAGSSLFYDLPLELMRNSVDANAPVNAA 300

Query: 323 FVFDKDQKGVTKNSSS----QKSHEPS-HHVRFSASSPS-GPSSPASCITPRLRKAREEF 382
           FVFDKD KGV KN+SS    QKSHE S H  RFSASSPS GPSSPASCITPRLRKAREEF
Sbjct: 301 FVFDKDHKGVMKNNSSTKLVQKSHESSSHRARFSASSPSSGPSSPASCITPRLRKAREEF 360

BLAST of Cp4.1LG20g02800 vs. TAIR 10
Match: AT4G20190.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G44660.1); Has 271 Blast hits to 209 proteins in 52 species: Archae - 0; Bacteria - 15; Metazoa - 63; Fungi - 14; Plants - 48; Viruses - 3; Other Eukaryotes - 128 (source: NCBI BLink). )

HSP 1 Score: 186.8 bits (473), Expect = 3.2e-47
Identity = 145/370 (39.19%), Positives = 203/370 (54.86%), Query Frame = 0

Query: 61  EKQISVDPISLRQSSAREDMIFDPITAP-DVPDLHLPPPLPPTQFKFLSYSLPNSVNSSP 120
           E++ISVDP SL   +   DMI   ++ P D+ DL L   +   + KF+S SLPNS  +SP
Sbjct: 43  ERRISVDPQSLLSRNGSFDMI---VSRPRDIDDLPLDHQM---KTKFVSCSLPNSAATSP 102

Query: 121 RFGSMKKKGKLENQESKLKISNSTKLKSSVQDIQVALQED--TQFRRSKSCGEGRASAPA 180
           R  S+                ++ K +++ Q + + L +D  T FRRSKSCGEGRA  P+
Sbjct: 103 RNSSI----------------HNWKDRTTEQVLDLMLVQDAATAFRRSKSCGEGRACTPS 162

Query: 181 DDLDLLLNKAK---------------FPETMSYDD------FVRTESNKDYRN-----GA 240
            D D+LL+K++                 +++S+        F +TESNK  R+      +
Sbjct: 163 LDFDMLLHKSRNAHHNQNHHRGFSSSNSKSLSHKSSGNNSFFSKTESNKSNRSNSNTANS 222

Query: 241 ENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEPEIGKVRMSRTEI---------GS 300
           +++   +DGFKC ALCL+LPGF K K VRS RK +        M+ ++           +
Sbjct: 223 KSINSFEDGFKCSALCLYLPGFSKGKPVRSSRKGDSSFTRTTTMTSSQSMARTASIRDTA 282

Query: 301 VISRTVSMEKFECGSWASSAMPNETGEDDSSSSLFYDLPMELIRNSV---DANAPISAAF 360
           V+S   S+E+FECGSW SSAM  +   D      F+DLP ELI+      D + P+SAAF
Sbjct: 283 VLSARASLERFECGSWTSSAMIYDDNAD--LGGHFFDLPSELIKGGPGGNDQDDPVSAAF 342

Query: 361 VFDKDQ------KGV--TKNSSSQKSHEPSHHVRFSASSP-SGPSSPASCITPRLRKARE 381
           VFDK+       KGV  T  S S++S E   HVRFS SSP S P+SP   ITPRL +A E
Sbjct: 343 VFDKEPNLDKEIKGVLKTSGSKSRRSMESPRHVRFSTSSPVSYPTSPTHSITPRLLQATE 388

BLAST of Cp4.1LG20g02800 vs. TAIR 10
Match: AT5G44660.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G20190.1); Has 944 Blast hits to 462 proteins in 141 species: Archae - 2; Bacteria - 370; Metazoa - 161; Fungi - 102; Plants - 64; Viruses - 6; Other Eukaryotes - 239 (source: NCBI BLink). )

HSP 1 Score: 128.3 bits (321), Expect = 1.4e-29
Identity = 135/405 (33.33%), Positives = 185/405 (45.68%), Query Frame = 0

Query: 58  VEKEKQISVDPISLR----QSSAREDMIFDPITAPDVP---DLHLPPPLP---------- 117
           ++ E++IS+DP S+R      S R +  FD +  P +    DL  P PLP          
Sbjct: 41  MQSEREISMDPKSIRSLSMSGSLRRNDSFDMVRLPAMSPPRDLDSPMPLPLQPVQTTGSP 100

Query: 118 ---------------------PTQFKFL--------SYSLPNSVNSSP--RFGSMKKKGK 177
                                P Q   L          SLPNS   SP  R G M+    
Sbjct: 101 KQRSGLMRALRNREQDSLPNSPKQRSGLMRAFRNKDQDSLPNSTTGSPKQRSGLMRALRN 160

Query: 178 LENQESKLKISNSTKLKSSVQDIQVALQEDT---QFRRSKSCGEGRASAPADDLDLLLNK 237
            E        + S K +S +       ++D+    ++RSKSCG    +        L +K
Sbjct: 161 KEQDSLPNSTTGSPKQRSGLMRALRNKEQDSSSASYKRSKSCGSTSKT--------LSHK 220

Query: 238 AKFPETMSYDDFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEE 297
           +     +    F++T+SNK   N +      +D FKC ALCLFLPGF K K +RS +K++
Sbjct: 221 S---SGIRNSFFIKTDSNKSISNNS----TLEDRFKCNALCLFLPGFSKGKPIRSSQKDD 280

Query: 298 EPEIGK-----------VRMSR-------TEIGSVISRTVSMEKFECGSWASSAMPNETG 357
                +           + +SR       T   +VIS   SMEKF+CGS+ S +   E G
Sbjct: 281 SSSFTRTTTMTRSSSSTITVSRTVSVRESTTTTTVISARASMEKFDCGSYTSESCGEEGG 340

Query: 358 EDDSSSSLFYDLPMELIRNSV---DANAPISAAFVFDKDQ-----KGVTKNSSSQK---S 381
                   F+DLP ELI++     D + P+SAAFVFDK+      KGV K S S+     
Sbjct: 341 NH------FFDLPSELIKSGSGDNDHDEPVSAAFVFDKEPVEKEIKGVLKVSGSKNRKAM 400

BLAST of Cp4.1LG20g02800 vs. TAIR 10
Match: AT2G34910.1 (BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850.1); Has 43 Blast hits to 43 proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 43; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 116.3 bits (290), Expect = 5.3e-26
Identity = 77/186 (41.40%), Positives = 107/186 (57.53%), Query Frame = 0

Query: 212 ENLEPTDDGFKCGALCLFLPGFGKAKAVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSME 271
           +N    ++ FKC A CL LPGFGK + VRS + E+  +   ++ S     S +S + S+E
Sbjct: 111 KNFYQEEENFKCNAFCLSLPGFGK-RPVRSPKSEDSIKKKMIKASSFS-NSTVSLSASLE 170

Query: 272 KFECGSWAS-SAMPNETGEDDSSSSLFYDLPMELIR-NSVDANAPISAAFVFDKDQ---- 331
           KFECGSWAS +A+  E G       L+ DLP+E+I+    D   P+S+ F FDK+     
Sbjct: 171 KFECGSWASTTALTRENGR------LYIDLPVEMIKCGGGDVQEPVSSGFFFDKETGSLA 230

Query: 332 -KGVTKNSSSQKSHE--------PSHHVRFS-ASSPSGPSSPASCITPRLRKAREEFNAF 382
            + V K SSS    +        P   VRFS  +S S P+SP +CITPRL KAR++FN F
Sbjct: 231 LRSVLKKSSSLSGRQLRDLAETSPQRRVRFSTTTSDSCPASPRTCITPRLLKARDDFNTF 288

BLAST of Cp4.1LG20g02800 vs. TAIR 10
Match: AT1G30850.1 (root hair specific 4 )

HSP 1 Score: 113.2 bits (282), Expect = 4.5e-25
Identity = 89/291 (30.58%), Positives = 141/291 (48.45%), Query Frame = 0

Query: 114 SVNSSPRFGSMKKKGKLENQESKLKISNSTKLKSSVQD--IQVALQEDTQFRRSKSCGEG 173
           + N +P    + KK  L+N++S   + +    +SS +D   ++ L       R  +    
Sbjct: 32  NTNPNPNINFLVKKAILQNEKSITPLFS----RSSARDDSFRIVLPPALPPPRDSTV--- 91

Query: 174 RASAPADDLDLLLNKAKFPETMSYDDFVRTESNKDYRNGAENLEPTDDGFKCGALCLFLP 233
                   L +L    +  + +S+ + V   S   +   AE +   ++ FKC A CL LP
Sbjct: 92  -------PLPMLPEPMRVRKKLSHQESVIFMSKSRF---AEKILYKEEDFKCNAFCLSLP 151

Query: 234 GFGKAKAVRSIRKEEEPEIGKVRMSRTEIGSVISRTVSMEKFECGSWASSAMPNETGEDD 293
           GFGK K +RS  K +     K+  + +  GS +S   S+EKFECGSWAS+     T    
Sbjct: 152 GFGKNKLIRSSSKRQNSMEKKMIRASSFTGSTVSVRASLEKFECGSWAST-----TALIQ 211

Query: 294 SSSSLFYDLPMELIR-------NSVDANAPISAAFVFDKDQKGV----------TKNSSS 353
            +  LF+D P+E+ +          D   P+++ F+FD++ + +          T++   
Sbjct: 212 DNGRLFFDFPVEMTKCNSRGGNGGRDVQEPVTSGFLFDRETETLALRSVLKTRSTRDHRR 271

Query: 354 QKSHEPSHHVRFSASSPSG----PSSPASCITPRLRKAREEFNAFLEAQSS 382
                P   VRFS SS S     P+SP +CITPRLRKAR++FN FL AQ++
Sbjct: 272 SAESSPQRRVRFSTSSSSASVSCPTSPRTCITPRLRKARDDFNTFLTAQNA 300

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value    Identity (%)  Description
XP_022924159.1  1.50e-262  97.91         uncharacterized protein LOC111431688 isoform X1 [Cucurbita moschata]
KAG6584176.1    2.14e-262  97.91         hypothetical protein SDJN03_20108, partial [Cucurbita argyrosperma subsp. sorori...
XP_023000753.1  4.82e-259  96.60         uncharacterized protein LOC111495110 isoform X1 [Cucurbita maxima]
XP_023520328.1  3.73e-253  100.00        uncharacterized protein LOC111783644 [Cucurbita pepo subsp. pepo]
XP_022924160.1  2.41e-249  98.61         uncharacterized protein LOC111431688 isoform X2 [Cucurbita moschata]

ExPASy TrEMBL
Match Name      E-value    Identity (%)  Description
A0A6J1E8C7      7.29e-263  97.91         uncharacterized protein LOC111431688 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1KNI3      2.33e-259  96.60         uncharacterized protein LOC111495110 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1EE09      1.17e-249  98.61         uncharacterized protein LOC111431688 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1KKV0      1.52e-245  96.94         uncharacterized protein LOC111495110 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A5D3BJI8      3.83e-186  78.92         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...

TAIR 10
Match Name      E-value    Identity (%)  Description
AT4G20190.1     3.2e-47    39.19         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT5G44660.1     1.4e-29    33.33         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G34910.1     5.3e-26    41.40         BEST Arabidopsis thaliana protein match is: root hair specific 4 (TAIR:AT1G30850...
AT1G30850.1     4.5e-25    30.58         root hair specific 4

InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                 Source       Source Term    Source Description                         Alignment
None       No IPR available                                MOBIDB_LITE  mobidb-lite    disorder_prediction                        coord: 344..359
None       No IPR available                                MOBIDB_LITE  mobidb-lite    disorder_prediction                        coord: 326..361
None       No IPR available                                PANTHER      PTHR33672:SF2  OS07G0499850 PROTEIN                       coord: 56..380
IPR040340  Chloroplast enhancing stress tolerance protein  PANTHER      PTHR33672      YCF3-INTERACTING PROTEIN 1, CHLOROPLASTIC  coord: 56..380

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG20g02800.1  Cp4.1LG20g02800.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0048564      photosystem I assembly
biological_process  GO:0080183      response to photooxidative stress
cellular_component  GO:0009535      chloroplast thylakoid membrane