Cp4.1LG05g04090 (gene) Cucurbita pepo (MU‐CU‐16) v4.1

Overview
Name: Cp4.1LG05g04090
Type: gene
Organism: Cucurbita pepo (Cucurbita pepo (MU‐CU‐16) v4.1)
Description: Golgin family A protein
Location: Cp4.1LG05: 2267078 .. 2269057 (-)
RNA-Seq Expression: Cp4.1LG05g04090
Synteny: Cp4.1LG05g04090
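
The Location field packs the scaffold name, start, end, and strand into one string. A minimal sketch of how it could be parsed is given here, assuming the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page). The helper names `parse_location`, `span_length`, and `reverse_complement` are hypothetical and are not part of any database API.

```python
import re

def parse_location(text):
    """Split a string such as 'Cp4.1LG05: 2267078 .. 2269057 (-)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of the feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

def reverse_complement(seq):
    """Reverse-complement a DNA string; needed when reading a (-) strand
    feature out of the forward-strand reference sequence."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]

seqid, start, end, strand = parse_location("Cp4.1LG05: 2267078 .. 2269057 (-)")
print(seqid, span_length(start, end), strand)
```

Under that assumption the gene spans 1980 bp on the minus strand of Cp4.1LG05.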
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCATGGCCAGAGCAGAAAACAGAGGAGATATGTAAGATTAGAAAACGAAGGTGTTCTTCGTCGTCTTCTTCTTCTTCTTCTACTTTGGTCTTTAAATACAGATTCAAGAACACACCCACGTGGAAAATGAGCACAAAATCCCATTCATCCAACCGGTCGCCGTCGTGTTCTGTCGCCGGCGGCGGCAGTAAAGGGAAAGAAGCCTCTGTTTCAGTCTCTGTTTCAGTGTCAGCTCGGAATCATTCCCAGAAGTTGAAGAACAATATGGATATCATTGAAGATAAACAAGAGCTGATGAAAACCCAGGATTTTGTTTCCCAGATCTCGCATTCGTGCTTATCAGATCCGGATCCGTGTTTCAATGACTCAAATTCAAAGGTGAGTTTTTTATTTCGATTATGAATACTCTGTTTTGGGAATTGTGTTGTGAAATGGAATTGGATTTGGGTATTGAAATCTGAGCAAGTGGGGTTTGATTAAATGAATTAAAATCTTCCCAGAAAGTTGAAGGTGACAGAGTTCACAGAAGAAGAACATCGGCATCTTCTATGAGGCTTGGAACAGGGGAGGCCAATTTTCATGGCAATCACTGTTTAATAGAGGTACCCATCCAACTTGGGTTGCTATCATTTTCATATAGATACGATGAGACATAAAGTTATTCATTGAATGCAGATTGAAAATCCAAGTAATCAGGGAAGAACAGCTCGCCGGAAGACAAAATTCATGCTGAAAACACGTTTAAAGGAAGTGGGTAATTGTCTAACAACATCAAAAGAGCTTATAAGAGTATTAAACCATGTTCTAGCTCATGAAGACAACGATCAACACCGTCCATCATCAATTTCACCACTGATTACAGCTTTAAAATTAGAGATGGAGCGAGCCAAGGCTCGTGTTGATCATTTGATTAAAGATCAAAGCTTTCATGGCGATGAAATTGAGATTGTAATGAAGCGATTTACAGAGGAAAAGACAGCGTGGAAGAACAGAGAGAGAGCTCGAGTTCGTAGCTCCATTGCTTCAATGGCGGATGAGATTGAGATTGAGAAGAAGCTTAGAAGACAAGCTGAGAGATTGAACAAAACAATTGCTAAAGAACTCGCTGAAGCTAAAATTTCACTTTCTAAAGCGATGAAAGATCTTCAAAGGGAGCGGAGAGCTAAGGAGATATTCGAGCAAATATGCGATGAATTAGCTAAAGGGATTGGAGAAGACAGAGCTCAATTCGAGGAGTTTAAGAAGGAATCTGCTAAAGTACGAGAAGAAATCGAACAAGAACGTGAAATGCTTCAATTGGCTGATGTTTTACGAGAGGAAAGAGTTCAGATGAAGCTGTCGGAAGCGAAATATCAATTTGAAGAGAAGAACGCCGCCGTGGAACGGCTGAAAGACGAGCTGGAAGCGTTTCTGATAACCCAATTCCGTCATGAAAACAGGGAAGAAGAAGATTATTCCGGCAAGATCAAAGAATTGGAAGCGTATTTGAAGAAAATAAATTTTGGGTCTGTTCAAGAACACCTTGAAGGAGATGGGAAGATTGAAGAACAGGAATGTTCAGAGGAGGACGACAGCGATCTTCATTCGATTGAGCTGAACATGGATAACAACAACAAGAGTTACAGATGGAGCTTTGTTCATGGAGGATCTAAAAGGAACTCATTTGAGAAGGATCAAATCAATGGAAGAAAGTCTGTTTCAGAGAAGATTCAATGGGGAAGCATTTGTTTGAATCGAAAAGCTTCGAATGGCTCTAAAAATGGCGAATTTGTTGGTCGGAAGAGCCATGAAAGTACAGAGAGATTGGAGTGGGAGAGATTTACAGAGGTTTTTGAAAATGAGGGAGACAATGGAAGTGCAGAGAAGAAGAACACTAAATCTGGGAAGTGTCTTAGAGACATTCTGTTTCCAGGATTTGTGGAACCAAATGATGATGTTGGGATTGCTGGCAATGTGGAGATGGATGAAGCTTCTTGA

mRNA sequence

ATGTCATGGCCAGAGCAGAAAACAGAGGAGATATGTAAGATTAGAAAACGAAGGTGTTCTTCGTCGTCTTCTTCTTCTTCTTCTACTTTGGTCTTTAAATACAGATTCAAGAACACACCCACGTGGAAAATGAGCACAAAATCCCATTCATCCAACCGGTCGCCGTCGTGTTCTGTCGCCGGCGGCGGCAGTAAAGGGAAAGAAGCCTCTGTTTCAGTCTCTGTTTCAGTGTCAGCTCGGAATCATTCCCAGAAGTTGAAGAACAATATGGATATCATTGAAGATAAACAAGAGCTGATGAAAACCCAGGATTTTGTTTCCCAGATCTCGCATTCGTGCTTATCAGATCCGGATCCGTGTTTCAATGACTCAAATTCAAAGAAAGTTGAAGGTGACAGAGTTCACAGAAGAAGAACATCGGCATCTTCTATGAGGCTTGGAACAGGGGAGGCCAATTTTCATGGCAATCACTGTTTAATAGAGATTGAAAATCCAAGTAATCAGGGAAGAACAGCTCGCCGGAAGACAAAATTCATGCTGAAAACACGTTTAAAGGAAGTGGGTAATTGTCTAACAACATCAAAAGAGCTTATAAGAGTATTAAACCATGTTCTAGCTCATGAAGACAACGATCAACACCGTCCATCATCAATTTCACCACTGATTACAGCTTTAAAATTAGAGATGGAGCGAGCCAAGGCTCGTGTTGATCATTTGATTAAAGATCAAAGCTTTCATGGCGATGAAATTGAGATTGTAATGAAGCGATTTACAGAGGAAAAGACAGCGTGGAAGAACAGAGAGAGAGCTCGAGTTCGTAGCTCCATTGCTTCAATGGCGGATGAGATTGAGATTGAGAAGAAGCTTAGAAGACAAGCTGAGAGATTGAACAAAACAATTGCTAAAGAACTCGCTGAAGCTAAAATTTCACTTTCTAAAGCGATGAAAGATCTTCAAAGGGAGCGGAGAGCTAAGGAGATATTCGAGCAAATATGCGATGAATTAGCTAAAGGGATTGGAGAAGACAGAGCTCAATTCGAGGAGTTTAAGAAGGAATCTGCTAAAGTACGAGAAGAAATCGAACAAGAACGTGAAATGCTTCAATTGGCTGATGTTTTACGAGAGGAAAGAGTTCAGATGAAGCTGTCGGAAGCGAAATATCAATTTGAAGAGAAGAACGCCGCCGTGGAACGGCTGAAAGACGAGCTGGAAGCGTTTCTGATAACCCAATTCCGTCATGAAAACAGGGAAGAAGAAGATTATTCCGGCAAGATCAAAGAATTGGAAGCGTATTTGAAGAAAATAAATTTTGGGTCTGTTCAAGAACACCTTGAAGGAGATGGGAAGATTGAAGAACAGGAATGTTCAGAGGAGGACGACAGCGATCTTCATTCGATTGAGCTGAACATGGATAACAACAACAAGAGTTACAGATGGAGCTTTGTTCATGGAGGATCTAAAAGGAACTCATTTGAGAAGGATCAAATCAATGGAAGAAAGTCTGTTTCAGAGAAGATTCAATGGGGAAGCATTTGTTTGAATCGAAAAGCTTCGAATGGCTCTAAAAATGGCGAATTTGTTGGTCGGAAGAGCCATGAAAGTACAGAGAGATTGGAGTGGGAGAGATTTACAGAGGTTTTTGAAAATGAGGGAGACAATGGAAGTGCAGAGAAGAAGAACACTAAATCTGGGAAGTGTCTTAGAGACATTCTGTTTCCAGGATTTGTGGAACCAAATGATGATGTTGGGATTGCTGGCAATGTGGAGATGGATGAAGCTTCTTGA

Coding sequence (CDS)

ATGTCATGGCCAGAGCAGAAAACAGAGGAGATATGTAAGATTAGAAAACGAAGGTGTTCTTCGTCGTCTTCTTCTTCTTCTTCTACTTTGGTCTTTAAATACAGATTCAAGAACACACCCACGTGGAAAATGAGCACAAAATCCCATTCATCCAACCGGTCGCCGTCGTGTTCTGTCGCCGGCGGCGGCAGTAAAGGGAAAGAAGCCTCTGTTTCAGTCTCTGTTTCAGTGTCAGCTCGGAATCATTCCCAGAAGTTGAAGAACAATATGGATATCATTGAAGATAAACAAGAGCTGATGAAAACCCAGGATTTTGTTTCCCAGATCTCGCATTCGTGCTTATCAGATCCGGATCCGTGTTTCAATGACTCAAATTCAAAGAAAGTTGAAGGTGACAGAGTTCACAGAAGAAGAACATCGGCATCTTCTATGAGGCTTGGAACAGGGGAGGCCAATTTTCATGGCAATCACTGTTTAATAGAGATTGAAAATCCAAGTAATCAGGGAAGAACAGCTCGCCGGAAGACAAAATTCATGCTGAAAACACGTTTAAAGGAAGTGGGTAATTGTCTAACAACATCAAAAGAGCTTATAAGAGTATTAAACCATGTTCTAGCTCATGAAGACAACGATCAACACCGTCCATCATCAATTTCACCACTGATTACAGCTTTAAAATTAGAGATGGAGCGAGCCAAGGCTCGTGTTGATCATTTGATTAAAGATCAAAGCTTTCATGGCGATGAAATTGAGATTGTAATGAAGCGATTTACAGAGGAAAAGACAGCGTGGAAGAACAGAGAGAGAGCTCGAGTTCGTAGCTCCATTGCTTCAATGGCGGATGAGATTGAGATTGAGAAGAAGCTTAGAAGACAAGCTGAGAGATTGAACAAAACAATTGCTAAAGAACTCGCTGAAGCTAAAATTTCACTTTCTAAAGCGATGAAAGATCTTCAAAGGGAGCGGAGAGCTAAGGAGATATTCGAGCAAATATGCGATGAATTAGCTAAAGGGATTGGAGAAGACAGAGCTCAATTCGAGGAGTTTAAGAAGGAATCTGCTAAAGTACGAGAAGAAATCGAACAAGAACGTGAAATGCTTCAATTGGCTGATGTTTTACGAGAGGAAAGAGTTCAGATGAAGCTGTCGGAAGCGAAATATCAATTTGAAGAGAAGAACGCCGCCGTGGAACGGCTGAAAGACGAGCTGGAAGCGTTTCTGATAACCCAATTCCGTCATGAAAACAGGGAAGAAGAAGATTATTCCGGCAAGATCAAAGAATTGGAAGCGTATTTGAAGAAAATAAATTTTGGGTCTGTTCAAGAACACCTTGAAGGAGATGGGAAGATTGAAGAACAGGAATGTTCAGAGGAGGACGACAGCGATCTTCATTCGATTGAGCTGAACATGGATAACAACAACAAGAGTTACAGATGGAGCTTTGTTCATGGAGGATCTAAAAGGAACTCATTTGAGAAGGATCAAATCAATGGAAGAAAGTCTGTTTCAGAGAAGATTCAATGGGGAAGCATTTGTTTGAATCGAAAAGCTTCGAATGGCTCTAAAAATGGCGAATTTGTTGGTCGGAAGAGCCATGAAAGTACAGAGAGATTGGAGTGGGAGAGATTTACAGAGGTTTTTGAAAATGAGGGAGACAATGGAAGTGCAGAGAAGAAGAACACTAAATCTGGGAAGTGTCTTAGAGACATTCTGTTTCCAGGATTTGTGGAACCAAATGATGATGTTGGGATTGCTGGCAATGTGGAGATGGATGAAGCTTCTTGA

Protein sequence

MSWPEQKTEEICKIRKRRCSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCSVAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGNHCLIEIENPSNQGRTARRKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKLEMERAKARVDHLIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREEEDYSGKIKELEAYLKKINFGSVQEHLEGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYRWSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESTERLEWERFTEVFENEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS
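
The protein sequence above is the conceptual translation of the CDS. A minimal sketch of that translation follows, assuming the standard nuclear genetic code (the page does not state which table was used). The names `CODON_TABLE` and `translate` are illustrative, and only the first and last few codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTCATGGCCAGAGCAG"))  # first six codons of the CDS
print(translate("GAAGCTTCTTGA"))        # last three codons plus the stop
```

The two fragments translate to MSWPEQ and EAS, matching the start and end of the protein sequence listed above.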
Homology
BLAST of Cp4.1LG05g04090 vs. ExPASy Swiss-Prot
Match: Q66GQ2 (Uncharacterized protein At5g41620 OS=Arabidopsis thaliana OX=3702 GN=At5g41620 PE=2 SV=2)

HSP 1 Score: 100.9 bits (250), Expect = 5.1e-20
Identity = 75/219 (34.25%), Positives = 132/219 (60.27%), Query Frame = 0

Query: 191 LTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKLEMERAKARVDHLIKDQSFHGDEI 250
           L TS EL++VLN + + E  +QH  S+IS LI ALK E+  ++ R+  L++ Q     E+
Sbjct: 193 LKTSTELLKVLNRIWSLE--EQH-VSNIS-LIKALKTEVAHSRVRIKELLRYQQADRHEL 252

Query: 251 EIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNKTIAKELAEAKIS 310
           + V+K+  EEK   KN+E  R+ S++ S+   +E E+KLR+++E L++ +A+EL+E K S
Sbjct: 253 DSVVKQLAEEKLLSKNKEVERMSSAVQSVRKALEDERKLRKRSESLHRKMARELSEVKSS 312

Query: 311 LSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKV--REEIEQEREMLQ 370
           LS  +K+L+R  ++ ++ E +CDE AKGI     +    KK++           ++ +L 
Sbjct: 313 LSNCVKELERGSKSNKMMELLCDEFAKGIKSYEEEIHGLKKKNLDKDWAGRGGGDQLVLH 372

Query: 371 LADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFL 408
           +A+   +ER+QM+L        +  + +++L+ E+E FL
Sbjct: 373 IAESWLDERMQMRLEGGDTLNGKNRSVLDKLEVEIETFL 407
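
The percentages in each HSP summary line are the identity and positive counts divided by the alignment length. As a rough sketch, the summary line can be parsed and the percentages recomputed as below. The function name `parse_hsp_stats` is hypothetical, and the pattern accepts both the "Positives" spelling and the "Postives" variant that appears in some exported reports.

```python
import re

# Matches e.g. "Identity = 75/219 (34.25%), Positives = 132/219 (60.27%)"
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Return counts and recomputed percentages from an HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, length, pos, pos_length = (int(g) for g in m.groups())
    return {
        "identities": ident,
        "positives": pos,
        "length": length,
        "identity_pct": round(100 * ident / length, 2),
        "positive_pct": round(100 * pos / pos_length, 2),
    }

stats = parse_hsp_stats(
    "Identity = 75/219 (34.25%), Positives = 132/219 (60.27%), Query Frame = 0"
)
print(stats)
```

For the At5g41620 hit this reproduces the reported 34.25% identity and 60.27% positives over 219 aligned columns.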

BLAST of Cp4.1LG05g04090 vs. ExPASy Swiss-Prot
Match: F4I878 (Protein BRANCHLESS TRICHOME OS=Arabidopsis thaliana OX=3702 GN=BLT PE=1 SV=1)

HSP 1 Score: 55.1 bits (131), Expect = 3.2e-06
Identity = 41/130 (31.54%), Positives = 75/130 (57.69%), Query Frame = 0

Query: 268 ERARVRSSIASMADEIEIEKKLRRQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEI 327
           E  + +  I  +  E++ E+K RR+AE + K +AK++ E +  +++  +++Q +R  KE+
Sbjct: 76  ELGKAQDEIKELKAELDYERKARRRAELMIKKLAKDVEEER--MAREAEEMQNKRLFKEL 135

Query: 328 FEQICDELAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKY 387
             +                   K E  +++ ++E+ER+M +LA+VLREERVQMKL +A+ 
Sbjct: 136 SSE-------------------KSEMVRMKRDLEEERQMHRLAEVLREERVQMKLMDARL 184

Query: 388 QFEEKNAAVE 398
             EEK + +E
Sbjct: 196 FLEEKLSELE 184
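
In the alignment rows, the middle line marks each column: a letter for an identical residue, "+" for a conservative substitution, and a space for a mismatch or gap. A small sketch of how the identity and positive counts could be tallied for one row is shown here. The function `hsp_counts` is an illustrative helper, and the input is the final row of the BLT alignment with the coordinate columns removed.

```python
def hsp_counts(query, midline, sbjct):
    """Count identities and positives for one alignment row.

    Identities are columns where query and subject carry the same residue;
    positives are all columns the midline marks (letters plus '+').
    """
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("alignment rows must have equal length")
    identities = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    positives = sum(m != " " for m in midline)
    return identities, positives

print(hsp_counts("QFEEKNAAVE", "  EEK + +E", "FLEEKLSELE"))
```

Summing these per-row counts over all rows of an HSP should give the numerators in its Identity and Positives summary, provided the rows were extracted without loss.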

BLAST of Cp4.1LG05g04090 vs. ExPASy Swiss-Prot
Match: Q8IIG7 (Uncharacterized protein PF11_0207 OS=Plasmodium falciparum (isolate 3D7) OX=36329 GN=PF11_0207 PE=1 SV=2)

HSP 1 Score: 47.8 bits (112), Expect = 5.1e-04
Identity = 64/261 (24.52%), Positives = 139/261 (53.26%), Query Frame = 0

Query: 208 EDNDQHRPSSISPLITALKLEMERAKARVDHLIKDQSF---HGDEIEIVMKRFTEEKTAW 267
           ++ND ++  S  PL    +LE  R K +++    D +      ++I+ + +R  E K  +
Sbjct: 440 KENDTNK--SEKPLYLR-RLEEYRKKKKLESQANDTAMKMHEKEQIDDIQERKEEIKEEF 499

Query: 268 K---NRERARVRSSIASMADEI--EIEKKLRRQAERLNKTIAKELAEAKISLSKAMKDLQ 327
           K     E   ++  I  + +EI  EI+++++   E + + I +E+ E K  + +  +++ 
Sbjct: 500 KEEVKEEIKEIKEEIKEVKEEIKEEIKEEIKEVKEEIKEEIKEEIKEVKEEIKEVKEEI- 559

Query: 328 RERRAKEIFEQICDELAKGIGEDRAQF-EEFKKESAKVREEIEQE--REMLQLADVLREE 387
             +  KE  +++ +E+ + I E + +  EE K+E  +V+EEI++E   E+ ++ + ++E 
Sbjct: 560 --KEVKEEIKEVKEEIKEEIKEVKEEIKEEIKEEIKEVKEEIKEEVKEEIKEVKEEIKEV 619

Query: 388 RVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREE--EDYSGKIKELEAYLKK 447
           + ++K  E K + +E    ++ +K+E++   I + + E +EE  E+   +IKE++  LK 
Sbjct: 620 KEEIK-EEVKEEIKEVKEEIKEVKEEIKE-EIKEVKEEIKEEVKEEIKEEIKEIKEELKN 679

Query: 448 -INFGSVQEHLEGDGKIEEQE 455
            I+  + +E    + K EE E
Sbjct: 680 DISSETTKEEKNTEHKKEETE 692

BLAST of Cp4.1LG05g04090 vs. NCBI nr
Match: XP_023534122.1 (uncharacterized protein LOC111795778 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023534127.1 uncharacterized protein LOC111795779 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1132 bits (2928), Expect = 0.0
Identity = 594/594 (100.00%), Positives = 594/594 (100.00%), Query Frame = 0

Query: 1   MSWPEQKTEEICKIRKRRCSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCSVA 60
           MSWPEQKTEEICKIRKRRCSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCSVA
Sbjct: 1   MSWPEQKTEEICKIRKRRCSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCSVA 60

Query: 61  GGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPDPC 120
           GGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPDPC
Sbjct: 61  GGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPDPC 120

Query: 121 FNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGNHCLIEIENPSNQGRTARRKTKFML 180
           FNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGNHCLIEIENPSNQGRTARRKTKFML
Sbjct: 121 FNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGNHCLIEIENPSNQGRTARRKTKFML 180

Query: 181 KTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKLEMERAKARVDHLI 240
           KTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKLEMERAKARVDHLI
Sbjct: 181 KTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKLEMERAKARVDHLI 240

Query: 241 KDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNKTI 300
           KDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNKTI
Sbjct: 241 KDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNKTI 300

Query: 301 AKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEI 360
           AKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEI
Sbjct: 301 AKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEI 360

Query: 361 EQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREEED 420
           EQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREEED
Sbjct: 361 EQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREEED 420

Query: 421 YSGKIKELEAYLKKINFGSVQEHLEGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYRWS 480
           YSGKIKELEAYLKKINFGSVQEHLEGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYRWS
Sbjct: 421 YSGKIKELEAYLKKINFGSVQEHLEGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYRWS 480

Query: 481 FVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESTERLEW 540
           FVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESTERLEW
Sbjct: 481 FVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESTERLEW 540

Query: 541 ERFTEVFENEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS 594
           ERFTEVFENEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS
Sbjct: 541 ERFTEVFENEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS 594

BLAST of Cp4.1LG05g04090 vs. NCBI nr
Match: KAG7035923.1 (hypothetical protein SDJN02_02723 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1115 bits (2885), Expect = 0.0
Identity = 586/594 (98.65%), Positives = 588/594 (98.99%), Query Frame = 0

Query: 1   MSWPEQKTEEICKIRKRRCSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCSVA 60
           MSWPEQKTEEICKIRKRRCSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCS+A
Sbjct: 1   MSWPEQKTEEICKIRKRRCSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCSIA 60

Query: 61  GGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPDPC 120
           GGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPDPC
Sbjct: 61  GGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPDPC 120

Query: 121 FNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGNHCLIEIENPSNQGRTARRKTKFML 180
           FNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGNHCLIEIENPSNQGRT RRKTKFML
Sbjct: 121 FNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGNHCLIEIENPSNQGRTVRRKTKFML 180

Query: 181 KTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKLEMERAKARVDHLI 240
           KTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALK EMERAKARVDHLI
Sbjct: 181 KTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKSEMERAKARVDHLI 240

Query: 241 KDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNKTI 300
           KDQS HGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNKTI
Sbjct: 241 KDQSLHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNKTI 300

Query: 301 AKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEI 360
           AKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEI
Sbjct: 301 AKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEI 360

Query: 361 EQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREEED 420
           EQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREEED
Sbjct: 361 EQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREEED 420

Query: 421 YSGKIKELEAYLKKINFGSVQEHLEGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYRWS 480
           YSGKIKELEAYLKKINFGSVQEHLEGD KIEEQECSEEDDSDLHSIELNMDNNNKSYRWS
Sbjct: 421 YSGKIKELEAYLKKINFGSVQEHLEGDEKIEEQECSEEDDSDLHSIELNMDNNNKSYRWS 480

Query: 481 FVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESTERLEW 540
           FVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHES+ERLEW
Sbjct: 481 FVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESSERLEW 540

Query: 541 ERFTEVFENEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS 594
           ERFTEVFE EGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIA NVEMDEAS
Sbjct: 541 ERFTEVFEKEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAENVEMDEAS 594

BLAST of Cp4.1LG05g04090 vs. NCBI nr
Match: XP_022957750.1 (uncharacterized protein LOC111459204 isoform X1 [Cucurbita moschata] >XP_022957751.1 uncharacterized protein LOC111459204 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1112 bits (2877), Expect = 0.0
Identity = 583/594 (98.15%), Positives = 589/594 (99.16%), Query Frame = 0

Query: 1   MSWPEQKTEEICKIRKRRCSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCSVA 60
           MSWPEQKTEEICKIRKRRCSS SSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCS+A
Sbjct: 1   MSWPEQKTEEICKIRKRRCSSLSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCSIA 60

Query: 61  GGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPDPC 120
           GGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPDPC
Sbjct: 61  GGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPDPC 120

Query: 121 FNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGNHCLIEIENPSNQGRTARRKTKFML 180
           FNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHG+HCLIEIENPSNQG+TARRKTKFML
Sbjct: 121 FNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGDHCLIEIENPSNQGKTARRKTKFML 180

Query: 181 KTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKLEMERAKARVDHLI 240
           KTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALK EMERAKARVDHLI
Sbjct: 181 KTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKSEMERAKARVDHLI 240

Query: 241 KDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNKTI 300
           KDQS HGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLR+QAERLNKTI
Sbjct: 241 KDQSLHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRKQAERLNKTI 300

Query: 301 AKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEI 360
           AKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEI
Sbjct: 301 AKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEI 360

Query: 361 EQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREEED 420
           EQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREEED
Sbjct: 361 EQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREEED 420

Query: 421 YSGKIKELEAYLKKINFGSVQEHLEGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYRWS 480
           YSGKIKELEAYLKKINFGSVQEHLEGD KIEEQECSEEDDSDLHSIELNMDNNNKSYRWS
Sbjct: 421 YSGKIKELEAYLKKINFGSVQEHLEGDEKIEEQECSEEDDSDLHSIELNMDNNNKSYRWS 480

Query: 481 FVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESTERLEW 540
           FVHGGSKRNSFEKD+INGRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHES+ERLEW
Sbjct: 481 FVHGGSKRNSFEKDEINGRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESSERLEW 540

Query: 541 ERFTEVFENEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS 594
           ERFTEVFE EGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS
Sbjct: 541 ERFTEVFEKEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS 594

BLAST of Cp4.1LG05g04090 vs. NCBI nr
Match: XP_022995046.1 (uncharacterized protein LOC111490718 isoform X1 [Cucurbita maxima] >XP_022995048.1 uncharacterized protein LOC111490718 isoform X1 [Cucurbita maxima])

HSP 1 Score: 1110 bits (2871), Expect = 0.0
Identity = 583/596 (97.82%), Positives = 590/596 (98.99%), Query Frame = 0

Query: 1   MSWPEQKTEEICKIRKRRCSSSSSSSSS--TLVFKYRFKNTPTWKMSTKSHSSNRSPSCS 60
           MSWPEQKTEEICKIRKRRCSSSSSSSSS  TLVFKYRFKNTPTWKMSTKSHSSNRSPSCS
Sbjct: 1   MSWPEQKTEEICKIRKRRCSSSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCS 60

Query: 61  VAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPD 120
           VAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPD
Sbjct: 61  VAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPD 120

Query: 121 PCFNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGNHCLIEIENPSNQGRTARRKTKF 180
           PCFNDSNSKKVEGDRVHRRRTSASS+R+GTGEANFHGNHCLIEIENPSNQGRTARRKTKF
Sbjct: 121 PCFNDSNSKKVEGDRVHRRRTSASSLRIGTGEANFHGNHCLIEIENPSNQGRTARRKTKF 180

Query: 181 MLKTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKLEMERAKARVDH 240
           MLKTRLKEV NCLTTSKEL+RVLNHVLAHEDNDQHRPSSISPLITALK EMERAKARVDH
Sbjct: 181 MLKTRLKEVSNCLTTSKELVRVLNHVLAHEDNDQHRPSSISPLITALKSEMERAKARVDH 240

Query: 241 LIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNK 300
           LIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLR+QAERLNK
Sbjct: 241 LIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRKQAERLNK 300

Query: 301 TIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVRE 360
           TIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVRE
Sbjct: 301 TIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVRE 360

Query: 361 EIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREE 420
           EIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREE
Sbjct: 361 EIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREE 420

Query: 421 EDYSGKIKELEAYLKKINFGSVQEHLEGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYR 480
           EDYSGKIKELEAYLKKINFGSVQEH +GDGKIEEQECSEEDDSDLHSIELNMDNNNKSYR
Sbjct: 421 EDYSGKIKELEAYLKKINFGSVQEHPDGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYR 480

Query: 481 WSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESTERL 540
           WSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNG+FVGRKSHES+ERL
Sbjct: 481 WSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGDFVGRKSHESSERL 540

Query: 541 EWERFTEVFENEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS 594
           EWERFTEVFE EGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS
Sbjct: 541 EWERFTEVFEKEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS 596

BLAST of Cp4.1LG05g04090 vs. NCBI nr
Match: XP_023534124.1 (uncharacterized protein LOC111795778 isoform X2 [Cucurbita pepo subsp. pepo] >XP_023534128.1 uncharacterized protein LOC111795779 isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1069 bits (2764), Expect = 0.0
Identity = 560/560 (100.00%), Positives = 560/560 (100.00%), Query Frame = 0

Query: 35  RFKNTPTWKMSTKSHSSNRSPSCSVAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIE 94
           RFKNTPTWKMSTKSHSSNRSPSCSVAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIE
Sbjct: 31  RFKNTPTWKMSTKSHSSNRSPSCSVAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIE 90

Query: 95  DKQELMKTQDFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFH 154
           DKQELMKTQDFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFH
Sbjct: 91  DKQELMKTQDFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFH 150

Query: 155 GNHCLIEIENPSNQGRTARRKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHR 214
           GNHCLIEIENPSNQGRTARRKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHR
Sbjct: 151 GNHCLIEIENPSNQGRTARRKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHR 210

Query: 215 PSSISPLITALKLEMERAKARVDHLIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRS 274
           PSSISPLITALKLEMERAKARVDHLIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRS
Sbjct: 211 PSSISPLITALKLEMERAKARVDHLIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRS 270

Query: 275 SIASMADEIEIEKKLRRQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDE 334
           SIASMADEIEIEKKLRRQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDE
Sbjct: 271 SIASMADEIEIEKKLRRQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDE 330

Query: 335 LAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNA 394
           LAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNA
Sbjct: 331 LAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNA 390

Query: 395 AVERLKDELEAFLITQFRHENREEEDYSGKIKELEAYLKKINFGSVQEHLEGDGKIEEQE 454
           AVERLKDELEAFLITQFRHENREEEDYSGKIKELEAYLKKINFGSVQEHLEGDGKIEEQE
Sbjct: 391 AVERLKDELEAFLITQFRHENREEEDYSGKIKELEAYLKKINFGSVQEHLEGDGKIEEQE 450

Query: 455 CSEEDDSDLHSIELNMDNNNKSYRWSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLN 514
           CSEEDDSDLHSIELNMDNNNKSYRWSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLN
Sbjct: 451 CSEEDDSDLHSIELNMDNNNKSYRWSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLN 510

Query: 515 RKASNGSKNGEFVGRKSHESTERLEWERFTEVFENEGDNGSAEKKNTKSGKCLRDILFPG 574
           RKASNGSKNGEFVGRKSHESTERLEWERFTEVFENEGDNGSAEKKNTKSGKCLRDILFPG
Sbjct: 511 RKASNGSKNGEFVGRKSHESTERLEWERFTEVFENEGDNGSAEKKNTKSGKCLRDILFPG 570

Query: 575 FVEPNDDVGIAGNVEMDEAS 594
           FVEPNDDVGIAGNVEMDEAS
Sbjct: 571 FVEPNDDVGIAGNVEMDEAS 590

BLAST of Cp4.1LG05g04090 vs. ExPASy TrEMBL
Match: A0A6J1H037 (uncharacterized protein LOC111459204 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111459204 PE=4 SV=1)

HSP 1 Score: 1112 bits (2877), Expect = 0.0
Identity = 583/594 (98.15%), Positives = 589/594 (99.16%), Query Frame = 0

Query: 1   MSWPEQKTEEICKIRKRRCSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCSVA 60
           MSWPEQKTEEICKIRKRRCSS SSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCS+A
Sbjct: 1   MSWPEQKTEEICKIRKRRCSSLSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCSIA 60

Query: 61  GGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPDPC 120
           GGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPDPC
Sbjct: 61  GGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPDPC 120

Query: 121 FNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGNHCLIEIENPSNQGRTARRKTKFML 180
           FNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHG+HCLIEIENPSNQG+TARRKTKFML
Sbjct: 121 FNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGDHCLIEIENPSNQGKTARRKTKFML 180

Query: 181 KTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKLEMERAKARVDHLI 240
           KTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALK EMERAKARVDHLI
Sbjct: 181 KTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKSEMERAKARVDHLI 240

Query: 241 KDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNKTI 300
           KDQS HGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLR+QAERLNKTI
Sbjct: 241 KDQSLHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRKQAERLNKTI 300

Query: 301 AKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEI 360
           AKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEI
Sbjct: 301 AKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEI 360

Query: 361 EQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREEED 420
           EQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREEED
Sbjct: 361 EQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREEED 420

Query: 421 YSGKIKELEAYLKKINFGSVQEHLEGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYRWS 480
           YSGKIKELEAYLKKINFGSVQEHLEGD KIEEQECSEEDDSDLHSIELNMDNNNKSYRWS
Sbjct: 421 YSGKIKELEAYLKKINFGSVQEHLEGDEKIEEQECSEEDDSDLHSIELNMDNNNKSYRWS 480

Query: 481 FVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESTERLEW 540
           FVHGGSKRNSFEKD+INGRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHES+ERLEW
Sbjct: 481 FVHGGSKRNSFEKDEINGRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESSERLEW 540

Query: 541 ERFTEVFENEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS 594
           ERFTEVFE EGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS
Sbjct: 541 ERFTEVFEKEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS 594

BLAST of Cp4.1LG05g04090 vs. ExPASy TrEMBL
Match: A0A6J1K0X3 (uncharacterized protein LOC111490718 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111490718 PE=4 SV=1)

HSP 1 Score: 1110 bits (2871), Expect = 0.0
Identity = 583/596 (97.82%), Positives = 590/596 (98.99%), Query Frame = 0

Query: 1   MSWPEQKTEEICKIRKRRCSSSSSSSSS--TLVFKYRFKNTPTWKMSTKSHSSNRSPSCS 60
           MSWPEQKTEEICKIRKRRCSSSSSSSSS  TLVFKYRFKNTPTWKMSTKSHSSNRSPSCS
Sbjct: 1   MSWPEQKTEEICKIRKRRCSSSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCS 60

Query: 61  VAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPD 120
           VAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPD
Sbjct: 61  VAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQDFVSQISHSCLSDPD 120

Query: 121 PCFNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGNHCLIEIENPSNQGRTARRKTKF 180
           PCFNDSNSKKVEGDRVHRRRTSASS+R+GTGEANFHGNHCLIEIENPSNQGRTARRKTKF
Sbjct: 121 PCFNDSNSKKVEGDRVHRRRTSASSLRIGTGEANFHGNHCLIEIENPSNQGRTARRKTKF 180

Query: 181 MLKTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKLEMERAKARVDH 240
           MLKTRLKEV NCLTTSKEL+RVLNHVLAHEDNDQHRPSSISPLITALK EMERAKARVDH
Sbjct: 181 MLKTRLKEVSNCLTTSKELVRVLNHVLAHEDNDQHRPSSISPLITALKSEMERAKARVDH 240

Query: 241 LIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNK 300
           LIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLR+QAERLNK
Sbjct: 241 LIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRKQAERLNK 300

Query: 301 TIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVRE 360
           TIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVRE
Sbjct: 301 TIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVRE 360

Query: 361 EIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREE 420
           EIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREE
Sbjct: 361 EIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREE 420

Query: 421 EDYSGKIKELEAYLKKINFGSVQEHLEGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYR 480
           EDYSGKIKELEAYLKKINFGSVQEH +GDGKIEEQECSEEDDSDLHSIELNMDNNNKSYR
Sbjct: 421 EDYSGKIKELEAYLKKINFGSVQEHPDGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYR 480

Query: 481 WSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGEFVGRKSHESTERL 540
           WSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNG+FVGRKSHES+ERL
Sbjct: 481 WSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGDFVGRKSHESSERL 540

Query: 541 EWERFTEVFENEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS 594
           EWERFTEVFE EGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS
Sbjct: 541 EWERFTEVFEKEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVGIAGNVEMDEAS 596

BLAST of Cp4.1LG05g04090 vs. ExPASy TrEMBL
Match: A0A6J1K315 (uncharacterized protein LOC111490718 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111490718 PE=4 SV=1)

HSP 1 Score: 1033 bits (2670), Expect = 0.0
Identity = 540/551 (98.00%), Positives = 547/551 (99.27%), Query Frame = 0

Query: 44  MSTKSHSSNRSPSCSVAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQ 103
           MSTKSHSSNRSPSCSVAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQ
Sbjct: 1   MSTKSHSSNRSPSCSVAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQ 60

Query: 104 DFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGNHCLIEIE 163
           DFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASS+R+GTGEANFHGNHCLIEIE
Sbjct: 61  DFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSLRIGTGEANFHGNHCLIEIE 120

Query: 164 NPSNQGRTARRKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLIT 223
           NPSNQGRTARRKTKFMLKTRLKEV NCLTTSKEL+RVLNHVLAHEDNDQHRPSSISPLIT
Sbjct: 121 NPSNQGRTARRKTKFMLKTRLKEVSNCLTTSKELVRVLNHVLAHEDNDQHRPSSISPLIT 180

Query: 224 ALKLEMERAKARVDHLIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEI 283
           ALK EMERAKARVDHLIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEI
Sbjct: 181 ALKSEMERAKARVDHLIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEI 240

Query: 284 EIEKKLRRQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDR 343
           EIEKKLR+QAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDR
Sbjct: 241 EIEKKLRKQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDR 300

Query: 344 AQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDEL 403
           AQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDEL
Sbjct: 301 AQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDEL 360

Query: 404 EAFLITQFRHENREEEDYSGKIKELEAYLKKINFGSVQEHLEGDGKIEEQECSEEDDSDL 463
           EAFLITQFRHENREEEDYSGKIKELEAYLKKINFGSVQEH +GDGKIEEQECSEEDDSDL
Sbjct: 361 EAFLITQFRHENREEEDYSGKIKELEAYLKKINFGSVQEHPDGDGKIEEQECSEEDDSDL 420

Query: 464 HSIELNMDNNNKSYRWSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKN 523
           HSIELNMDNNNKSYRWSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKN
Sbjct: 421 HSIELNMDNNNKSYRWSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKN 480

Query: 524 GEFVGRKSHESTERLEWERFTEVFENEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVG 583
           G+FVGRKSHES+ERLEWERFTEVFE EGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVG
Sbjct: 481 GDFVGRKSHESSERLEWERFTEVFEKEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVG 540

Query: 584 IAGNVEMDEAS 594
           IAGNVEMDEAS
Sbjct: 541 IAGNVEMDEAS 551

BLAST of Cp4.1LG05g04090 vs. ExPASy TrEMBL
Match: A0A6J1H001 (uncharacterized protein LOC111459204 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111459204 PE=4 SV=1)

HSP 1 Score: 1032 bits (2669), Expect = 0.0
Identity = 541/551 (98.19%), Positives = 547/551 (99.27%), Query Frame = 0

Query: 44  MSTKSHSSNRSPSCSVAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQ 103
           MSTKSHSSNRSPSCS+AGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQ
Sbjct: 1   MSTKSHSSNRSPSCSIAGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQ 60

Query: 104 DFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGNHCLIEIE 163
           DFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHG+HCLIEIE
Sbjct: 61  DFVSQISHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGDHCLIEIE 120

Query: 164 NPSNQGRTARRKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLIT 223
           NPSNQG+TARRKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLIT
Sbjct: 121 NPSNQGKTARRKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLIT 180

Query: 224 ALKLEMERAKARVDHLIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEI 283
           ALK EMERAKARVDHLIKDQS HGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEI
Sbjct: 181 ALKSEMERAKARVDHLIKDQSLHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEI 240

Query: 284 EIEKKLRRQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDR 343
           EIEKKLR+QAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDR
Sbjct: 241 EIEKKLRKQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDR 300

Query: 344 AQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDEL 403
           AQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDEL
Sbjct: 301 AQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDEL 360

Query: 404 EAFLITQFRHENREEEDYSGKIKELEAYLKKINFGSVQEHLEGDGKIEEQECSEEDDSDL 463
           EAFLITQFRHENREEEDYSGKIKELEAYLKKINFGSVQEHLEGD KIEEQECSEEDDSDL
Sbjct: 361 EAFLITQFRHENREEEDYSGKIKELEAYLKKINFGSVQEHLEGDEKIEEQECSEEDDSDL 420

Query: 464 HSIELNMDNNNKSYRWSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKN 523
           HSIELNMDNNNKSYRWSFVHGGSKRNSFEKD+INGRKSVSEKIQWGSICLNRKASNGSKN
Sbjct: 421 HSIELNMDNNNKSYRWSFVHGGSKRNSFEKDEINGRKSVSEKIQWGSICLNRKASNGSKN 480

Query: 524 GEFVGRKSHESTERLEWERFTEVFENEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVG 583
           GEFVGRKSHES+ERLEWERFTEVFE EGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVG
Sbjct: 481 GEFVGRKSHESSERLEWERFTEVFEKEGDNGSAEKKNTKSGKCLRDILFPGFVEPNDDVG 540

Query: 584 IAGNVEMDEAS 594
           IAGNVEMDEAS
Sbjct: 541 IAGNVEMDEAS 551
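The percentages in the HSP summary line follow directly from the match counts, and the counts themselves can be tallied from the alignment midline, where a letter marks an identical residue and `+` marks a conservative substitution. The following is a minimal sketch of both steps; the function names `parse_hsp_stats`, `percent`, and `tally_midline` are illustrative and not part of any BLAST tool.

```python
import re

def parse_hsp_stats(line):
    """Return (matches, length, reported_pct) for each 'a/b (p%)' field in a line."""
    fields = re.findall(r"(\d+)/(\d+) \((\d+(?:\.\d+)?)%\)", line)
    return [(int(a), int(b), float(p)) for a, b, p in fields]

def percent(matches, length):
    """Percentage of aligned positions, rounded to two decimals as in the report."""
    return round(100.0 * matches / length, 2)

def tally_midline(midline):
    """Count identities (letters) and positives (letters plus '+') in a midline."""
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives

# Summary line of the A0A6J1H001 hit (labels are ignored by the parser).
summary = "Identity = 541/551 (98.19%), Positives = 547/551 (99.27%), Query Frame = 0"
identity, positives = parse_hsp_stats(summary)

# Midline of the first 60-residue row of the same alignment.
first_row = "MSTKSHSSNRSPSCS+AGGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIEDKQELMKTQ"

print(percent(identity[0], identity[1]))    # recomputed identity percentage
print(percent(positives[0], positives[1]))  # recomputed positives percentage
print(tally_midline(first_row))             # (identities, positives) for one row
```

Summing the per-row tallies over every row of an HSP should reproduce the counts in its summary line.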

BLAST of Cp4.1LG05g04090 vs. ExPASy TrEMBL
Match: A0A6J1DHY9 (uncharacterized protein LOC111020667 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111020667 PE=4 SV=1)

HSP 1 Score: 583 bits (1504), Expect = 3.05e-201
Identity = 352/587 (59.97%), Positives = 422/587 (71.89%), Query Frame = 0

Query: 1   MSWPEQKTEEICKIRKRRCSSSSSSSSSTLVFKYRFKNTPTWKMSTKSHSSNRSPSCSVA 60
           MSW EQ+TE+ CKIRKR+   SSSSS S+LV KYRFK +PTWKMSTKSHSS        A
Sbjct: 1   MSWAEQQTEKRCKIRKRKGPWSSSSSYSSLVRKYRFKKSPTWKMSTKSHSS--------A 60

Query: 61  GGGSKGKEASVSVSVSVSARNHSQKLKNNMDIIE--DKQELMKTQDFVSQISHSCLSDPD 120
           G G             V  +N+     NN D +E  DK+EL+KT + VSQISHSCLSDPD
Sbjct: 61  GKG-------------VKLKNN-----NNSDAVELEDKKELIKTGEMVSQISHSCLSDPD 120

Query: 121 PCFNDSNSKKVEGD--RVHRRRTSASSMRLGTGE-----ANFHGNHCLIEIENPSNQGRT 180
           P FN++ S+KVEG   RV RRR SA S+R+G GE     +NF GN CL+EIEN S + +T
Sbjct: 121 PNFNNTKSEKVEGGSGRVQRRRRSACSLRIGIGEIEVGGSNFRGNDCLMEIENRS-EVKT 180

Query: 181 ARRKTKFMLKTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKLEMER 240
            RRK KF +KTRLKEV NCLTTSKEL+RVL HV    D  Q  PSS S L+ ALK E++R
Sbjct: 181 TRRKKKFTVKTRLKEVSNCLTTSKELVRVLTHVWGSHDEKQ--PSSASSLMAALKSELDR 240

Query: 241 AKARVDHLIKDQS--FHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKL 300
           AK RV+HL++D+   FHGDEIE + KRF EEK AWK +ERARV S+I+SMA+E+ +E+KL
Sbjct: 241 AKTRVEHLMRDEQRLFHGDEIEALRKRFAEEKAAWKYKERARVGSAISSMAEEVAVERKL 300

Query: 301 RRQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEF 360
           RRQAERLNK I KEL EA+++++KAMKD+ RE+RAKEI E+IC+ELAKGIGEDRA+FEE 
Sbjct: 301 RRQAERLNKRIGKELGEARVAVAKAMKDVDREKRAKEILEEICEELAKGIGEDRAEFEEL 360

Query: 361 KKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLIT 420
           +KES KVREE+E+EREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKD+LEA+ + 
Sbjct: 361 RKESEKVREEVEKEREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDQLEAYFVE 420

Query: 421 QFRHENREEEDYSGKIKELEAYLKKINFGSVQEHLEGDGKIEEQECSEEDDSDLHSIELN 480
              H      +   KIKELEAYLKKINFGS  ++ E      E++C+ +++SDLHSIELN
Sbjct: 421 NRDHSQDSFNNKLDKIKELEAYLKKINFGSYNKNKE------EEDCNWDEESDLHSIELN 480

Query: 481 MDNNNKSYRWSFVHGG---SKRNSFEKDQINGRKSVSEKIQWGSICLNRKASNGSKNGEF 540
           MDNNNKSYRWSFVHG    SKRNSFEK+    RKS+SEKIQWGSIC N    + SKNGEF
Sbjct: 481 MDNNNKSYRWSFVHGSHNASKRNSFEKE----RKSLSEKIQWGSICFN----SSSKNGEF 527

Query: 541 VGRKSHESTERLEWERFTEVFENEGDNGSAEKKNTKSGKCLRDILFP 573
            G                   E +G+     K+ +   +CLRDILFP
Sbjct: 541 EGDG-----------------ERDGEIQITHKQKSGGVRCLRDILFP 527

BLAST of Cp4.1LG05g04090 vs. TAIR 10
Match: AT3G11590.1 (unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G22310.1); Has 22320 Blast hits to 15179 proteins in 1213 species: Archae - 372; Bacteria - 2307; Metazoa - 10906; Fungi - 1700; Plants - 1146; Viruses - 65; Other Eukaryotes - 5824 (source: NCBI BLink). )

HSP 1 Score: 271.2 bits (692), Expect = 2.0e-72
Identity = 218/593 (36.76%), Positives = 323/593 (54.47%), Query Frame = 0

Query: 13  KIRKRRCSSSSSSSSSTLVFKYRFKN-------------TPTWKMSTKSHSSNRSPSCSV 72
           KIRKR CSS +SS+SS L   YRFK               PTW++  +S S   S +   
Sbjct: 16  KIRKRGCSSPTSSTSSILREGYRFKRAIVVGKRGGSTTPVPTWRLMGRSPSPRASGALHA 75

Query: 73  AGGGSK---GKEASVSVSVSVSARNHSQKL--KNNMD---IIEDKQELMK--TQDFVSQI 132
           A   S     K   VS    VSAR  +  L   N M    ++E+   +++   ++ ++ +
Sbjct: 76  AASPSSHCGSKTGKVSAPAPVSARKLAATLWEMNEMPSPRVVEEAAPMIRKSRKERIAPL 135

Query: 133 ---SHSCLSDPDPCFNDSNSKKVEGDRVHRRRTSASSMRLGTGEANFHGNHCLIEIENPS 192
                S  S   P      S     +R+ R  T +   R  +         C +   +P 
Sbjct: 136 PPPRSSVHSGSLPPHLSDPSHSPVSERMERSGTGSRQRRASSTVQKLRLGDCNVGARDPI 195

Query: 193 NQGRTARRKTKFM----------LKTRLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPS 252
           N G     +T+            +KTRLK+  N LTTSKEL++++N +   +D    RPS
Sbjct: 196 NSGSFMDIETRSRVETPTGSTVGVKTRLKDCSNALTTSKELLKIINRMWGQDD----RPS 255

Query: 253 SISPLITALKLEMERAKARVDHLIKDQSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSI 312
           S   L++AL  E+ERA+ +V+ LI +     ++I  +MKRF EEK  WK+ E+  V ++I
Sbjct: 256 SSMSLVSALHSELERARLQVNQLIHEHKPENNDISYLMKRFAEEKAVWKSNEQEVVEAAI 315

Query: 313 ASMADEIEIEKKLRRQAERLNKTIAKELAEAKISLSKAMKDLQRERRAKEIFEQICDELA 372
            S+A E+E+E+KLRR+ E LNK + KELAE K +L KA+K+++ E+RA+ + E++CDELA
Sbjct: 316 ESVAGELEVERKLRRRFESLNKKLGKELAETKSALMKAVKEIENEKRARVMVEKVCDELA 375

Query: 373 KGIGEDRAQFEEFKKESAKVREEIEQEREMLQLADVLREERVQMKLSEAKYQFEEKNAAV 432
           + I ED+A+ EE K+ES KV+EE+E+EREMLQLAD LREERVQMKLSEAK+Q EEKNAAV
Sbjct: 376 RDISEDKAEVEELKRESFKVKEEVEKEREMLQLADALREERVQMKLSEAKHQLEEKNAAV 435

Query: 433 ERLKDELEAFL-ITQFRHENREEEDYSGKIKELEAYLK-KINFGSVQEHLEGDGKIEEQE 492
           ++L+++L+ +L   + + + RE        +E   YL   I+FGS   ++E DG++E   
Sbjct: 436 DKLRNQLQTYLKAKRCKEKTREPPQTQLHNEEAGDYLNHHISFGSY--NIE-DGEVENGN 495

Query: 493 CSEEDDSDLHSIELNMDNNNKSYRWSFVHGGSKRNSFEKDQINGRKSVSEKIQWGSICLN 552
                +SDLHSIELN+D  NKSY+W +      R S  +  ++ ++S+S+ + W  +  +
Sbjct: 496 EEGSGESDLHSIELNID--NKSYKWPYGEENRGRKSTPRKSLSLQRSISDCVDW--VVQS 555

Query: 553 RKASNGSKNGEFVGRKSHESTERLEWERFTEVFENEG--DNGSAEKKNTKSGK 566
            K       G             L+W R  +V E +G  D   A K N  S K
Sbjct: 556 EKLQKSGDGG-------------LDWGRSIDV-EPKGYIDETQAYKPNKASSK 583

BLAST of Cp4.1LG05g04090 vs. TAIR 10
Match: AT1G50660.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G20350.1); Has 21445 Blast hits to 15134 proteins in 1325 species: Archae - 461; Bacteria - 2309; Metazoa - 11052; Fungi - 1737; Plants - 1035; Viruses - 42; Other Eukaryotes - 4809 (source: NCBI BLink). )

HSP 1 Score: 150.6 bits (379), Expect = 4.0e-36
Identity = 110/343 (32.07%), Positives = 200/343 (58.31%), Query Frame = 0

Query: 190 CLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKLEMERAKARVDHLIKDQSFHGDE 249
           CL T +E    ++ + ++      + +++S L+++L+ E+E A AR++ L  ++  H  +
Sbjct: 212 CLDTMEE----VHQIYSNMKRIDQQVNAVS-LVSSLEAELEEAHARIEDLESEKRSHKKK 271

Query: 250 IEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNKTIAKELAEAKI 309
           +E  +++ +EE+ AW++RE  +VR+ I  M  ++  EKK R++ E +N  +  ELA++K+
Sbjct: 272 LEQFLRKVSEERAAWRSREHEKVRAIIDDMKTDMNREKKTRQRLEIVNHKLVNELADSKL 331

Query: 310 SLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQL 369
           ++ + M+D ++ER+A+E+ E++CDELAK IGED+A+ E  K+ES  +REE++ ER MLQ+
Sbjct: 332 AVKRYMQDYEKERKARELIEEVCDELAKEIGEDKAEIEALKRESMSLREEVDDERRMLQM 391

Query: 370 ADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFL-----ITQFRH----ENREEED 429
           A+V REERVQMKL +AK   EE+ + + +L  +LE+FL     +T  +     E   E  
Sbjct: 392 AEVWREERVQMKLIDAKVALEERYSQMNKLVGDLESFLRSRDIVTDVKEVREAELLRETA 451

Query: 430 YSGKIKELE-------------AYLKKINFGSVQEHLEGDGKIEEQECSEEDDSDLHSIE 489
            S  I+E++             A  +++N G   +  E +  +     S   DS +H++ 
Sbjct: 452 ASVNIQEIKEFTYVPANPDDIYAVFEEMNLGEAHDR-EMEKSVAYSPISH--DSKVHTVS 511

Query: 490 LNMDNNNKSYRWSFVHGGSKRNSFEKDQINGRKSVSEKIQWGS 511
           L+ +  NK  R S  +  + +N   ++  +G ++VS   + GS
Sbjct: 512 LDANMMNKKGRHSDAY--THQNGDIEEDDSGWETVSHLEEQGS 544

BLAST of Cp4.1LG05g04090 vs. TAIR 10
Match: AT5G22310.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT3G11590.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 150.6 bits (379), Expect = 4.0e-36
Identity = 160/490 (32.65%), Positives = 241/490 (49.18%), Query Frame = 0

Query: 12  CKIRKRRCSSSSSSSSSTLVFKYRFKNTP-TWKMSTKSHSSNRSPSCSVAGGGSKGKEAS 71
           CKIRKR     SSSSSS+L  + RFK      K + +    + +P  S+         A 
Sbjct: 8   CKIRKR---GGSSSSSSSLARRNRFKRAIFAGKRAAQDDGGSGTPVKSITA-------AK 67

Query: 72  VSVSVSVSARN----HSQKLKNNMDIIEDKQELMKTQDFVSQISHS---CLSDPDPCFND 131
             V +S S  N    H Q  K+ +   +    L +  D      +S   CL    P    
Sbjct: 68  TPVLLSFSPENLPIDHHQLQKSCVSARKLAATLWEINDDADPPVNSDKDCLRSKKPSRYR 127

Query: 132 SNSKKVEGDRVHRRRTSASSMRLGTGEANFHGNHCLIEIENPSNQGRTARRKT-KFMLKT 191
           +             R+S    RL +   +   +       NP        +      +KT
Sbjct: 128 AKKSTEFSSIDFPPRSSDPISRLSSERIDLCDDMIRRRSTNPQKLNPIEYKIIGANSVKT 187

Query: 192 RLKEVGNCLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKLEMERAKARVDHLIKD 251
           R K V + LTTSKEL++VL  +   E  D H+ +S + LI+AL  E++RA++ + HL+ +
Sbjct: 188 RFKNVSDGLTTSKELVKVLKRI--GELGDDHKTAS-NRLISALLCELDRARSSLKHLMSE 247

Query: 252 QSFHGDEIEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNKTIAK 311
                DE E       EEK           R  I S+ +E  +E+KLRR+ E++N+ + +
Sbjct: 248 L----DEEE-------EEK-----------RRLIESLQEEAMVERKLRRRTEKMNRRLGR 307

Query: 312 ELAEAKISLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEIEQ 371
           EL EAK +  K  ++++RE+RAK++ E++CDEL KGIG+D              ++E+E+
Sbjct: 308 ELTEAKETERKMKEEMKREKRAKDVLEEVCDELTKGIGDD--------------KKEMEK 367

Query: 372 EREMLQLADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFLITQFRHENREEEDYS 431
           EREM+ +ADVLREERVQMKL+EAK++FE+K AAVERLK EL   L       + EE   S
Sbjct: 368 EREMMHIADVLREERVQMKLTEAKFEFEDKYAAVERLKKELRRVL-------DGEEGKGS 420

Query: 432 GKIKELEAYLKKINFGSVQEHLEGDGKIEEQECSEEDDSDLHSIELNMDNNNKSYRWSFV 491
            +I+             + E ++G G        ++++SDL SIELNM++ +K   W +V
Sbjct: 428 SEIRR------------ILEVIDGSGS------DDDEESDLKSIELNMESGSK---WGYV 420

Query: 492 HGGSKRNSFE 493
                R  F+
Sbjct: 488 DSLKDRRRFD 420

BLAST of Cp4.1LG05g04090 vs. TAIR 10
Match: AT3G20350.1 (unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: cotyledon; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G50660.1); Has 15095 Blast hits to 11224 proteins in 1051 species: Archae - 223; Bacteria - 1586; Metazoa - 7000; Fungi - 1255; Plants - 746; Viruses - 40; Other Eukaryotes - 4245 (source: NCBI BLink). )

HSP 1 Score: 135.2 bits (339), Expect = 1.7e-31
Identity = 107/343 (31.20%), Positives = 188/343 (54.81%), Query Frame = 0

Query: 190 CLTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKLEMERAKARVDHLIKDQSFHGDE 249
           CL T  ++ ++  +V    +N Q    S   L ++++L+++ A+A +  L  ++     +
Sbjct: 189 CLDTRDDVHQIYTNV--KWNNQQVNDVS---LASSIELKLQEARACIKDLESEKRSQKKK 248

Query: 250 IEIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNKTIAKELAEAKI 309
           +E  +K+ +EE+ AW++RE  +VR+ I  M  ++  EKK R++ E +N  +  ELA++K+
Sbjct: 249 LEQFLKKVSEERAAWRSREHEKVRAIIDDMKADMNQEKKTRQRLEIVNSKLVNELADSKL 308

Query: 310 SLSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKVREEIEQEREMLQL 369
           ++ + M D Q+ER+A+E+ E++CDELAK I ED+A+ E  K ES  +REE++ ER MLQ+
Sbjct: 309 AVKRYMHDYQQERKARELIEEVCDELAKEIEEDKAEIEALKSESMNLREEVDDERRMLQM 368

Query: 370 ADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFL------------ITQFRHENRE 429
           A+V REERVQMKL +AK   EEK + + +L  ++EAFL            + +   E   
Sbjct: 369 AEVWREERVQMKLIDAKVTLEEKYSQMNKLVGDMEAFLSSRNTTGVKEVRVAELLRETAA 428

Query: 430 EEDYSGKIKE----------LEAYLKKINFGSVQEHLEGDGKIEEQECSEEDDSDLHSIE 489
             D   +IKE          +    +++N G  Q+  E +  +     S    +   S +
Sbjct: 429 SVDNIQEIKEFTYEPAKPDDILMLFEQMNMGENQDR-ESEQYVAYSPVSHASKAHTVSPD 488

Query: 490 LNMDNNNKSYRWSFVHGGSKRNSFEKDQINGRKSVSEKIQWGS 511
           +N+ N  + +  +F     +   FE+D  +G ++VS   + GS
Sbjct: 489 VNLINKGR-HSNAFT---DQNGEFEEDD-SGWETVSHSEEHGS 520

BLAST of Cp4.1LG05g04090 vs. TAIR 10
Match: AT5G41620.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast, plasma membrane; EXPRESSED IN: 9 plant structures; EXPRESSED DURING: 6 growth stages; BEST Arabidopsis thaliana protein match is: intracellular protein transport protein USO1-related (TAIR:AT1G64180.1); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink). )

HSP 1 Score: 100.9 bits (250), Expect = 3.6e-21
Identity = 75/219 (34.25%), Positives = 132/219 (60.27%), Query Frame = 0

Query: 191 LTTSKELIRVLNHVLAHEDNDQHRPSSISPLITALKLEMERAKARVDHLIKDQSFHGDEI 250
           L TS EL++VLN + + E  +QH  S+IS LI ALK E+  ++ R+  L++ Q     E+
Sbjct: 193 LKTSTELLKVLNRIWSLE--EQH-VSNIS-LIKALKTEVAHSRVRIKELLRYQQADRHEL 252

Query: 251 EIVMKRFTEEKTAWKNRERARVRSSIASMADEIEIEKKLRRQAERLNKTIAKELAEAKIS 310
           + V+K+  EEK   KN+E  R+ S++ S+   +E E+KLR+++E L++ +A+EL+E K S
Sbjct: 253 DSVVKQLAEEKLLSKNKEVERMSSAVQSVRKALEDERKLRKRSESLHRKMARELSEVKSS 312

Query: 311 LSKAMKDLQRERRAKEIFEQICDELAKGIGEDRAQFEEFKKESAKV--REEIEQEREMLQ 370
           LS  +K+L+R  ++ ++ E +CDE AKGI     +    KK++           ++ +L 
Sbjct: 313 LSNCVKELERGSKSNKMMELLCDEFAKGIKSYEEEIHGLKKKNLDKDWAGRGGGDQLVLH 372

Query: 371 LADVLREERVQMKLSEAKYQFEEKNAAVERLKDELEAFL 408
           +A+   +ER+QM+L        +  + +++L+ E+E FL
Sbjct: 373 IAESWLDERMQMRLEGGDTLNGKNRSVLDKLEVEIETFL 407

The following BLAST results are available for this feature:
Match Name      E-value    Identity (%)  Description
Q66GQ2          5.1e-20    34.25         Uncharacterized protein At5g41620 OS=Arabidopsis thaliana OX=3702 GN=At5g41620 P...
F4I878          3.2e-06    31.54         Protein BRANCHLESS TRICHOME OS=Arabidopsis thaliana OX=3702 GN=BLT PE=1 SV=1
Q8IIG7          5.1e-04    24.52         Uncharacterized protein PF11_0207 OS=Plasmodium falciparum (isolate 3D7) OX=3632...

Match Name      E-value    Identity (%)  Description
XP_023534122.1  0.0        100.00        uncharacterized protein LOC111795778 isoform X1 [Cucurbita pepo subsp. pepo] >XP...
KAG7035923.1    0.0        98.65         hypothetical protein SDJN02_02723 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022957750.1  0.0        98.15         uncharacterized protein LOC111459204 isoform X1 [Cucurbita moschata] >XP_0229577...
XP_022995046.1  0.0        97.82         uncharacterized protein LOC111490718 isoform X1 [Cucurbita maxima] >XP_022995048...
XP_023534124.1  0.0        100.00        uncharacterized protein LOC111795778 isoform X2 [Cucurbita pepo subsp. pepo] >XP...

Match Name      E-value    Identity (%)  Description
A0A6J1H037      0.0        98.15         uncharacterized protein LOC111459204 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1K0X3      0.0        97.82         uncharacterized protein LOC111490718 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1K315      0.0        98.00         uncharacterized protein LOC111490718 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1H001      0.0        98.19         uncharacterized protein LOC111459204 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1DHY9      3.05e-201  59.97         uncharacterized protein LOC111020667 isoform X1 OS=Momordica charantia OX=3673 G...

Match Name      E-value    Identity (%)  Description
AT3G11590.1     2.0e-72    36.76         unknown protein; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures;...
AT1G50660.1     4.0e-36    32.07         unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplas...
AT5G22310.1     4.0e-36    32.65         unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT3G20350.1     1.7e-31    31.20         unknown protein; INVOLVED IN: biological_process unknown; LOCATED IN: plasma mem...
AT5G41620.1     3.6e-21    34.25         FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
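The E-values reported for the TAIR 10 hits span many orders of magnitude, so a common follow-up is to keep only hits below a chosen cutoff. The following is a minimal sketch of that filter; the helper name `hits_below` and the cutoff of 1e-30 are assumptions chosen for illustration, not values taken from this page.

```python
def hits_below(hits, max_evalue):
    """Return (name, evalue) pairs with E-value <= max_evalue, most significant first."""
    kept = [(name, float(ev)) for name, ev in hits if float(ev) <= max_evalue]
    # sorted() is stable, so hits with equal E-values keep their input order.
    return sorted(kept, key=lambda item: item[1])

# Match names and E-values as listed for the TAIR 10 hits.
tair_hits = [
    ("AT3G11590.1", "2.0e-72"),
    ("AT1G50660.1", "4.0e-36"),
    ("AT5G22310.1", "4.0e-36"),
    ("AT3G20350.1", "1.7e-31"),
    ("AT5G41620.1", "3.6e-21"),
]

# Hypothetical cutoff; pick one appropriate to the analysis at hand.
selected = hits_below(tair_hits, 1e-30)
for name, evalue in selected:
    print(name, evalue)
```

With this cutoff only AT5G41620.1 is dropped, since 3.6e-21 is larger than 1e-30.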
InterPro
Analysis Name: InterPro Annotations of Cucurbita pepo (Zucchini) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                   Source       Source Term     Source Description    Alignment
None       No IPR available                  COILS        Coil            Coil                  coord: 222..242
None       No IPR available                  COILS        Coil            Coil                  coord: 279..324
None       No IPR available                  COILS        Coil            Coil                  coord: 382..409
None       No IPR available                  COILS        Coil            Coil                  coord: 346..366
None       No IPR available                  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 42..59
None       No IPR available                  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 124..148
None       No IPR available                  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 42..66
None       No IPR available                  MOBIDB_LITE  mobidb-lite     disorder_prediction   coord: 124..139
None       No IPR available                  PANTHER      PTHR31071:SF16  OS04G0382800 PROTEIN  coord: 12..570
IPR043424  Protein BRANCHLESS TRICHOME-like  PANTHER      PTHR31071       GB|AAF24581.1         coord: 12..570
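Several of the predicted regions above overlap (for example the disorder predictions at 42..59 and 42..66), so summarising coverage requires merging them first. The following is a minimal sketch, assuming the `coord` values are 1-based, inclusive residue positions; the helper names `parse_coord`, `merge_ranges`, and `covered` are illustrative.

```python
import re

def parse_coord(text):
    """Parse a 'coord: start..end' string into a (start, end) tuple of ints."""
    match = re.search(r"(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(match.group(1)), int(match.group(2))

def merge_ranges(ranges):
    """Merge overlapping or adjacent inclusive ranges into a sorted list."""
    merged = []
    for start, end in sorted(ranges):
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

def covered(ranges):
    """Total number of residues covered by inclusive, non-overlapping ranges."""
    return sum(end - start + 1 for start, end in ranges)

disorder = merge_ranges(parse_coord(c) for c in (
    "coord: 42..59", "coord: 124..148", "coord: 42..66", "coord: 124..139"))
coils = merge_ranges(parse_coord(c) for c in (
    "coord: 222..242", "coord: 279..324", "coord: 382..409", "coord: 346..366"))

print(disorder, covered(disorder))  # merged disorder regions and residues covered
print(coils, covered(coils))        # coiled-coil regions and residues covered
```

Under the inclusive-coordinate assumption, the four disorder predictions collapse to two regions, while the four coiled-coil segments remain distinct.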

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG05g04090.1  Cp4.1LG05g04090.1  mRNA