Cp4.1LG20g02350 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG20g02350
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Tyrosine kinase family protein
Location: Cp4.1LG20 : 1347646 .. 1349619 (+)
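
The location field can be split into sequence ID, coordinates, and strand. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome annotation, not stated on this page); the helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split 'SEQID : START .. END (STRAND)' into its parts.

    Assumes 1-based, inclusive coordinates, so the span length is
    END - START + 1. This convention is an assumption, not something
    the record itself states.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG20 : 1347646 .. 1349619 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under that assumption the feature spans 1974 bases on the plus strand of Cp4.1LG20.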
The following sequences are available for this feature:

Gene sequence (with intron)

AAACTTCCCATCTTCTTCTTTTTTCCATTTCTCTCTGTTTCACCTCCCAATTTGGTGTTCTTGAGTTTGAAAGAAATGACCCTTTTGTTGTTTTCTTCAGCTGTGTTGTAGATTTAGATGATTATATGTATATGGGTGTGTCTAATTTCAACGAAATCTGGCTCTGTCTGTATCTGATGTGTGAGTTTGAAGGGAGTTTTCTTTTTGTTTTGCTGGGATGACTTTCTTTGTCTTGTTTCTTTCTTGCATGAGCATTGGAAAGGGAAAATGTAGTTAGTTTCTGTCTTTTTTCAGCTATAGATCCATTGATTTACACAAATTTCGACTCGATTTAGAGTGTACATTGATGAGATTGAGGATATTGATGAAATTTGGGGACACCCCATGAGGAATTTGAGCTGGTTCAAGCCAATTTCCATCAATGGGAAGCCTGGGAGGAGGCTTTCCCTTGGAGAGTACGAGCGGGCTGTGTCGTGGTCTAAGTACTTGGTGTCTTCAGGAGCAGAGATAAAGGGAGAGGGAGAAGAGGAATGGAGTGCAGACATGTCTCAGTTGTTCATTGGCTTCAAACTTGCTACTGGAAGACACAGCAGGATTTACAGAGGCGTTTATAAGCAAAGGGATGTTGCAATTAAGCTGATAAGCCAGCCTGAGGAGGATGAACACTTGGCTAACTTTCTTGAAAATCAGTTCATTTCAGAGGTGGCATTGCTATTTCGATTGAGACATCCCAATATCATCACTGTATGTTTCATTACTTGATACCCATTTCTTGTGGGTCTGTGTTTTTTTTTTTTTTTGTTGTTGTCTGATGCACTGAATTTGGGTTCTGATAACTTCATTTGTTCTATAAACTTGATGCAATGCCTTAGCTTTTAGCATCTCATCACTGGAAGATCAGAGAAAAATGGCTTCTGTTTATGTTCATTTCTTAGGACTTTATGTAATTTTGATTCTTGATTGAATAAGATTGATGTTCTTGTGCTGTTTGTGACCTGCATATTACAAAGAACACTTGATGAGATTTCTTTACTCGAAACAGTTCATTGCAGCTTGCAAGAAACCTCCAGTTTTTTGCATAATCACGGAGTATATGGCGGGGGGCTCGTTAAGAAAGTATCTGCATCAACAGGAGCCACATTCGGTTCCGCTGAACTTGGTTCTGAAACTAGCACTCGAGATCTCGCGCGGGATGCAGTATCTTCATTCTCAGGGAATACTCCACAGAGATCTCAAATCAGAAAACCTTTTACTTGGTGAAGATATGTGCATTAAGGTAGCCGATTTCGGTATCTCATGCTTGGAATCTCAATGCGGAAGCGCAAAGGGATTCACCGGAACATATCGATGGATGGCACCGGAAATGATCAAAGAAAAACACCATACTAAGAAAGTTGATGTTTATAGCTTTGGCATTGTCTTGTGGGAGCTCTTAACTGCATTGACACCATTTGATAACATGACTCCTGAACAGGCAGCATTTGCAGTCTGCCAGAAGGTAACAATCACTTCTCTAGTTTTAGCTCGGTTGATACCTATTTCGTTCGTGTTGATATCTATTTTGTTCGTGTGTTCGTCGTGTAACTGATTTGGATCATGTCTGTGCTCTACAGAATGCAAGACCACCTCTGCCTTCAGCTTGCCCGGAAGCGTTTCGCCATCTGATCAAGAGATGCTGGTCGAAAAATCCCAACAAGCGACCGCATTTCGACGAGATTGTTTCGATTTTGGAAGCTTATATGGAGTCTTACAATGAAGATCGAGAGTTTTTCTGTCATTACCTCCCTTCATCTAGAAATACTCCATTACAATGCTTACCAAAGTGTATTACAGAACAATTGTGTGCTTCGTGGAAGCCTAGGAATTCATCATCTTGAATCTTGTATGATTGTCAAGTACAGCTCCTTGTCTTGCATGAAATTGTTAGGGTTTTAATTAAATCAAAGTTTTATGGATTTTTGTATTTTAAAAAAA

mRNA sequence

AAACTTCCCATCTTCTTCTTTTTTCCATTTCTCTCTGTTTCACCTCCCAATTTGGTGTTCTTGAGTTTTTTTCTTCAGCTGTGTTGTAGATTTAGATGATTATATGTATATGGGTGTGTCTAATTTCAACGAAATCTGGCTCTGTCTGTATCTGATGTGTGAGTTTGAAGGGAGTTTTCTTTTTGTTTTGCTGGGATGACTTTCTTTGTCTTGTTTCTTTCTTGCATGAGCATTGGAAAGGGAAAATGTACTATAGATCCATTGATTTACACAAATTTCGACTCGATTTAGAGTGTACATTGATGAGATTGAGGATATTGATGAAATTTGGGGACACCCCATGAGGAATTTGAGCTGGTTCAAGCCAATTTCCATCAATGGGAAGCCTGGGAGGAGGCTTTCCCTTGGAGAGTACGAGCGGGCTGTGTCGTGGTCTAAGTACTTGGTGTCTTCAGGAGCAGAGATAAAGGGAGAGGGAGAAGAGGAATGGAGTGCAGACATGTCTCAGTTGTTCATTGGCTTCAAACTTGCTACTGGAAGACACAGCAGGATTTACAGAGGCGTTTATAAGCAAAGGGATGTTGCAATTAAGCTGATAAGCCAGCCTGAGGAGGATGAACACTTGGCTAACTTTCTTGAAAATCAGTTCATTTCAGAGGTGGCATTGCTATTTCGATTGAGACATCCCAATATCATCACTTTCATTGCAGCTTGCAAGAAACCTCCAGTTTTTTGCATAATCACGGAGTATATGGCGGGGGGCTCGTTAAGAAAGTATCTGCATCAACAGGAGCCACATTCGGTTCCGCTGAACTTGGTTCTGAAACTAGCACTCGAGATCTCGCGCGGGATGCAGTATCTTCATTCTCAGGGAATACTCCACAGAGATCTCAAATCAGAAAACCTTTTACTTGGTGAAGATATGTGCATTAAGGTAGCCGATTTCGGTATCTCATGCTTGGAATCTCAATGCGGAAGCGCAAAGGGATTCACCGGAACATATCGATGGATGGCACCGGAAATGATCAAAGAAAAACACCATACTAAGAAAGTTGATGTTTATAGCTTTGGCATTGTCTTGTGGGAGCTCTTAACTGCATTGACACCATTTGATAACATGACTCCTGAACAGGCAGCATTTGCAGTCTGCCAGAAGAATGCAAGACCACCTCTGCCTTCAGCTTGCCCGGAAGCGTTTCGCCATCTGATCAAGAGATGCTGGTCGAAAAATCCCAACAAGCGACCGCATTTCGACGAGATTGTTTCGATTTTGGAAGCTTATATGGAGTCTTACAATGAAGATCGAGAGTTTTTCTGTCATTACCTCCCTTCATCTAGAAATACTCCATTACAATGCTTACCAAAGTGTATTACAGAACAATTGTGTGCTTCGTGGAAGCCTAGGAATTCATCATCTTGAATCTTGTATGATTGTCAAGTACAGCTCCTTGTCTTGCATGAAATTGTTAGGGTTTTAATTAAATCAAAGTTTTATGGATTTTTGTATTTTAAAAAAA

Coding sequence (CDS)

ATGAGGAATTTGAGCTGGTTCAAGCCAATTTCCATCAATGGGAAGCCTGGGAGGAGGCTTTCCCTTGGAGAGTACGAGCGGGCTGTGTCGTGGTCTAAGTACTTGGTGTCTTCAGGAGCAGAGATAAAGGGAGAGGGAGAAGAGGAATGGAGTGCAGACATGTCTCAGTTGTTCATTGGCTTCAAACTTGCTACTGGAAGACACAGCAGGATTTACAGAGGCGTTTATAAGCAAAGGGATGTTGCAATTAAGCTGATAAGCCAGCCTGAGGAGGATGAACACTTGGCTAACTTTCTTGAAAATCAGTTCATTTCAGAGGTGGCATTGCTATTTCGATTGAGACATCCCAATATCATCACTTTCATTGCAGCTTGCAAGAAACCTCCAGTTTTTTGCATAATCACGGAGTATATGGCGGGGGGCTCGTTAAGAAAGTATCTGCATCAACAGGAGCCACATTCGGTTCCGCTGAACTTGGTTCTGAAACTAGCACTCGAGATCTCGCGCGGGATGCAGTATCTTCATTCTCAGGGAATACTCCACAGAGATCTCAAATCAGAAAACCTTTTACTTGGTGAAGATATGTGCATTAAGGTAGCCGATTTCGGTATCTCATGCTTGGAATCTCAATGCGGAAGCGCAAAGGGATTCACCGGAACATATCGATGGATGGCACCGGAAATGATCAAAGAAAAACACCATACTAAGAAAGTTGATGTTTATAGCTTTGGCATTGTCTTGTGGGAGCTCTTAACTGCATTGACACCATTTGATAACATGACTCCTGAACAGGCAGCATTTGCAGTCTGCCAGAAGAATGCAAGACCACCTCTGCCTTCAGCTTGCCCGGAAGCGTTTCGCCATCTGATCAAGAGATGCTGGTCGAAAAATCCCAACAAGCGACCGCATTTCGACGAGATTGTTTCGATTTTGGAAGCTTATATGGAGTCTTACAATGAAGATCGAGAGTTTTTCTGTCATTACCTCCCTTCATCTAGAAATACTCCATTACAATGCTTACCAAAGTGTATTACAGAACAATTGTGTGCTTCGTGGAAGCCTAGGAATTCATCATCTTGA

Protein sequence

MRNLSWFKPISINGKPGRRLSLGEYERAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIGFKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEHLANFLENQFISEVALLFRLRHPNIITFIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHSVPLNLVLKLALEISRGMQYLHSQGILHRDLKSENLLLGEDMCIKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDVYSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSACPEAFRHLIKRCWSKNPNKRPHFDEIVSILEAYMESYNEDREFFCHYLPSSRNTPLQCLPKCITEQLCASWKPRNSSS
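
The relationship between the CDS and the protein sequence above can be checked by translating codons with the standard genetic code. The sketch below is illustrative only: it builds the standard codon table from its compact string form and translates just the first 24 bases of the CDS, which should give the first eight residues of the protein. The function name `translate` is hypothetical, and the use of the standard (nuclear) code for this gene is an assumption.

```python
from itertools import product

# Standard genetic code in the usual compact form, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 24 bases of the CDS listed above.
cds_start = "ATGAGGAATTTGAGCTGGTTCAAG"
print(translate(cds_start))
```

Passing the full CDS string to the same function gives a sequence that can be compared with the full protein listed on this page.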
BLAST of Cp4.1LG20g02350 vs. Swiss-Prot
Match: HT1_ARATH (Serine/threonine-protein kinase HT1 OS=Arabidopsis thaliana GN=HT1 PE=1 SV=1)

HSP 1 Score: 405.6 bits (1041), Expect = 5.6e-112
Identity = 201/326 (61.66%), Positives = 241/326 (73.93%), Query Frame = 1

Query: 25  YERAVSWSKYLVSSGAEI----KGEGEEEWSADMSQLFIGFKLATGRHSRIYRGVYKQRD 84
           ++   SWS  L S   E     KGE  EEW+AD+SQLFIG K A+G HSRIYRG+YKQR 
Sbjct: 51  FDSMESWSMILESENVETWEASKGE-REEWTADLSQLFIGNKFASGAHSRIYRGIYKQRA 110

Query: 85  VAIKLISQPEEDEHLANFLENQFISEVALLFRLRHPNIITFIAACKKPPVFCIITEYMAG 144
           VA+K++  P   E     LE QF SEVALL RL HPNI+ FIAACKKPPV+CIITEYM+ 
Sbjct: 111 VAVKMVRIPTHKEETRAKLEQQFKSEVALLSRLFHPNIVQFIAACKKPPVYCIITEYMSQ 170

Query: 145 GSLRKYLHQQEPHSVPLNLVLKLALEISRGMQYLHSQGILHRDLKSENLLLGEDMCIKVA 204
           G+LR YL+++EP+S+ +  VL+LAL+ISRGM+YLHSQG++HRDLKS NLLL ++M +KVA
Sbjct: 171 GNLRMYLNKKEPYSLSIETVLRLALDISRGMEYLHSQGVIHRDLKSNNLLLNDEMRVKVA 230

Query: 205 DFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDVYSFGIVLWELLTALTPFDNM 264
           DFG SCLE+QC  AKG  GTYRWMAPEMIKEK +T+KVDVYSFGIVLWEL TAL PF  M
Sbjct: 231 DFGTSCLETQCREAKGNMGTYRWMAPEMIKEKPYTRKVDVYSFGIVLWELTTALLPFQGM 290

Query: 265 TPEQAAFAVCQKNARPPLPSACPEAFRHLIKRCWSKNPNKRPHFDEIVSILEAYMESYNE 324
           TP QAAFAV +KN RPPLP++C  A  HLIKRCWS+NP+KRP F  IV++LE Y E   E
Sbjct: 291 TPVQAAFAVAEKNERPPLPASCQPALAHLIKRCWSENPSKRPDFSNIVAVLEKYDECVKE 350

Query: 325 DREFFCH-YLPSSRNTPLQCLPKCIT 346
                 H  L  ++   L  L  C+T
Sbjct: 351 GLPLTSHASLTKTKKAILDHLKGCVT 375
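
The percentages on each HSP statistics line follow directly from the printed fractions (matching positions divided by alignment length). A minimal sketch of recomputing them is shown below; the helper name `parse_hsp_fractions` is hypothetical, and the regular expression assumes the "label = n/m (p%)" layout used in the reports on this page.

```python
import re

def parse_hsp_fractions(line):
    """Return (label, matches, length, reported_percent) for each
    'Label = n/m (p%)' field found on an HSP statistics line."""
    fields = re.findall(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)", line)
    return [(label, int(n), int(m), float(p)) for label, n, m, p in fields]

stats_line = (
    "Identity = 201/326 (61.66%), Positives = 241/326 (73.93%), "
    "Query Frame = 1"
)
hsp_fields = parse_hsp_fractions(stats_line)
for label, matches, length, reported in hsp_fields:
    recomputed = 100.0 * matches / length
    print(f"{label}: {matches}/{length} = {recomputed:.2f}% (reported {reported}%)")
```

The same check applies to every hit in the report, since each statistics line uses the same layout.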

BLAST of Cp4.1LG20g02350 vs. Swiss-Prot
Match: STY8_ARATH (Serine/threonine-protein kinase STY8 OS=Arabidopsis thaliana GN=STY8 PE=1 SV=2)

HSP 1 Score: 255.0 bits (650), Expect = 1.2e-66
Identity = 122/306 (39.87%), Positives = 196/306 (64.05%), Query Frame = 1

Query: 14  GKPGRRLSLGEYERAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIGFKLATGRHSRIYR 73
           G   + +S  E++++   S  L+ +  EI  +G +EW  D++QL I  K+A+G +  ++R
Sbjct: 246 GSKQKSISFFEHDKS---SNELIPACIEIPTDGTDEWEIDVTQLKIEKKVASGSYGDLHR 305

Query: 74  GVYKQRDVAIKLISQPEEDEHLANFLENQFISEVALLFRLRHPNIITFIAACKKPPVFCI 133
           G Y  ++VAIK +    + + + N +  +F  EV ++ ++RH N++ F+ AC + P  CI
Sbjct: 306 GTYCSQEVAIKFL----KPDRVNNEMLREFSQEVFIMRKVRHKNVVQFLGACTRSPTLCI 365

Query: 134 ITEYMAGGSLRKYLHQQEPHSVPLNLVLKLALEISRGMQYLHSQGILHRDLKSENLLLGE 193
           +TE+MA GS+  +LH+Q+  +  L  +LK+AL++++GM YLH   I+HRDLK+ NLL+ E
Sbjct: 366 VTEFMARGSIYDFLHKQKC-AFKLQTLLKVALDVAKGMSYLHQNNIIHRDLKTANLLMDE 425

Query: 194 DMCIKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDVYSFGIVLWELLTA 253
              +KVADFG++ ++ + G     TGTYRWMAPE+I+ K +  K DV+S+ IVLWELLT 
Sbjct: 426 HGLVKVADFGVARVQIESGVMTAETGTYRWMAPEVIEHKPYNHKADVFSYAIVLWELLTG 485

Query: 254 LTPFDNMTPEQAAFAVCQKNARPPLPSACPEAFRHLIKRCWSKNPNKRPHFDEIVSILEA 313
             P+  +TP QAA  V QK  RP +P       + L++RCW ++P +RP F+EI+ +L+ 
Sbjct: 486 DIPYAFLTPLQAAVGVVQKGLRPKIPKKTHPKVKGLLERCWHQDPEQRPLFEEIIEMLQQ 543

Query: 314 YMESYN 320
            M+  N
Sbjct: 546 IMKEVN 543

BLAST of Cp4.1LG20g02350 vs. Swiss-Prot
Match: STY46_ARATH (Serine/threonine-protein kinase STY46 OS=Arabidopsis thaliana GN=STY46 PE=1 SV=1)

HSP 1 Score: 252.3 bits (643), Expect = 8.0e-66
Identity = 120/282 (42.55%), Positives = 178/282 (63.12%), Query Frame = 1

Query: 42  IKGEGEEEWSADMSQLFIGFKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEHLANFLEN 101
           I  +G + W  ++  L  G K+A+G +  +Y+G Y  ++VAIK++    + E L + LE 
Sbjct: 275 IPNDGTDVWEINLKHLKFGHKIASGSYGDLYKGTYCSQEVAIKVL----KPERLDSDLEK 334

Query: 102 QFISEVALLFRLRHPNIITFIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHSVPLNLVL 161
           +F  EV ++ ++RH N++ FI AC KPP  CI+TE+M GGS+  YLH+Q+     L  + 
Sbjct: 335 EFAQEVFIMRKVRHKNVVQFIGACTKPPHLCIVTEFMPGGSVYDYLHKQKG-VFKLPTLF 394

Query: 162 KLALEISRGMQYLHSQGILHRDLKSENLLLGEDMCIKVADFGISCLESQCGSAKGFTGTY 221
           K+A++I +GM YLH   I+HRDLK+ NLL+ E+  +KVADFG++ +++Q G     TGTY
Sbjct: 395 KVAIDICKGMSYLHQNNIIHRDLKAANLLMDENEVVKVADFGVARVKAQTGVMTAETGTY 454

Query: 222 RWMAPEMIKEKHHTKKVDVYSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSA 281
           RWMAPE+I+ K +  K DV+S+GIVLWELLT   P++ MTP QAA  V QK  RP +P  
Sbjct: 455 RWMAPEVIEHKPYDHKADVFSYGIVLWELLTGKLPYEYMTPLQAAVGVVQKGLRPTIPKN 514

Query: 282 CPEAFRHLIKRCWSKNPNKRPHFDEIVSILEAYMESYNEDRE 324
                  L++R W  +  +RP F EI+  L+   +   E+ E
Sbjct: 515 THPKLAELLERLWEHDSTQRPDFSEIIEQLQEIAKEVGEEGE 551

BLAST of Cp4.1LG20g02350 vs. Swiss-Prot
Match: STY17_ARATH (Serine/threonine-protein kinase STY17 OS=Arabidopsis thaliana GN=STY17 PE=1 SV=1)

HSP 1 Score: 244.2 bits (622), Expect = 2.2e-63
Identity = 115/287 (40.07%), Positives = 179/287 (62.37%), Query Frame = 1

Query: 35  LVSSGAEIKGEGEEEWSADMSQLFIGFKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEH 94
           L+ +  EI  +G +EW  DM QL I  K+A G +  ++RG Y  ++VAIK++    + E 
Sbjct: 270 LLPACVEIPTDGTDEWEIDMKQLKIEKKVACGSYGELFRGTYCSQEVAIKIL----KPER 329

Query: 95  LANFLENQFISEVALLFRLRHPNIITFIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHS 154
           +   +  +F  EV ++ ++RH N++ FI AC + P  CI+TE+M  GS+  +LH+ +   
Sbjct: 330 VNAEMLREFSQEVYIMRKVRHKNVVQFIGACTRSPNLCIVTEFMTRGSIYDFLHKHKG-V 389

Query: 155 VPLNLVLKLALEISRGMQYLHSQGILHRDLKSENLLLGEDMCIKVADFGISCLESQCGSA 214
             +  +LK+AL++S+GM YLH   I+HRDLK+ NLL+ E   +KVADFG++ ++++ G  
Sbjct: 390 FKIQSLLKVALDVSKGMNYLHQNNIIHRDLKTANLLMDEHEVVKVADFGVARVQTESGVM 449

Query: 215 KGFTGTYRWMAPEMIKEKHHTKKVDVYSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNA 274
              TGTYRWMAPE+I+ K +  + DV+S+ IVLWELLT   P+  +TP QAA  V QK  
Sbjct: 450 TAETGTYRWMAPEVIEHKPYDHRADVFSYAIVLWELLTGELPYSYLTPLQAAVGVVQKGL 509

Query: 275 RPPLPSACPEAFRHLIKRCWSKNPNKRPHFDEIVSILEAYMESYNED 322
           RP +P         L+++CW ++P  RP+F EI+ +L   +    +D
Sbjct: 510 RPKIPKETHPKLTELLEKCWQQDPALRPNFAEIIEMLNQLIREVGDD 551

BLAST of Cp4.1LG20g02350 vs. Swiss-Prot
Match: Y9955_DICDI (Probable serine/threonine-protein kinase DDB_G0267514 OS=Dictyostelium discoideum GN=DDB_G0267514 PE=3 SV=1)

HSP 1 Score: 229.2 bits (583), Expect = 7.2e-59
Identity = 114/257 (44.36%), Positives = 166/257 (64.59%), Query Frame = 1

Query: 55  SQLFIGFKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEHLANFLENQFISEVALLFRLR 114
           S+L I  KL  G    +Y+G+++   VAIK I   + +E + N +  +F  E+ +L RLR
Sbjct: 660 SELKISSKLGEGTFGVVYKGLWRGSSVAIKQI---KINEDVNNQVLEEFRKELTILSRLR 719

Query: 115 HPNIITFIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHSVPLNLVLKLALEISRGMQYL 174
           HPNI+  +AAC  PP  C ITEY+ GGSL   LH ++   + + L  KLA++I++GM YL
Sbjct: 720 HPNIVLLMAACTAPPNLCFITEYLPGGSLYDALHSKKI-KMNMQLYKKLAIQIAQGMNYL 779

Query: 175 HSQGILHRDLKSENLLLGEDMCIKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHH 234
           H  G++HRD+KS NLLL E M +K+ DFG+S L+S+        G+  WM+PE++  + +
Sbjct: 780 HLSGVIHRDIKSLNLLLDEHMNVKICDFGLSKLKSKSTEMTKSIGSPIWMSPELLMGEDY 839

Query: 235 TKKVDVYSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSACPEAFRHLIKRCW 294
           T+KVDVY+FGI+LWEL T   P+  +   Q A AV  K+ RPP+P+A P    HLI+ CW
Sbjct: 840 TEKVDVYAFGIILWELGTGELPYSGLDSVQLALAVTTKSLRPPIPNAWPYQLSHLIQACW 899

Query: 295 SKNPNKRPHFDEIVSIL 312
            ++P KRP F EI+++L
Sbjct: 900 HQDPLKRPSFTEILNLL 912

BLAST of Cp4.1LG20g02350 vs. TrEMBL
Match: A0A0A0LW99_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G046040 PE=4 SV=1)

HSP 1 Score: 699.1 bits (1803), Expect = 2.8e-198
Identity = 335/360 (93.06%), Positives = 347/360 (96.39%), Query Frame = 1

Query: 1   MRNLSWFKPISINGKPGRRLSLGEYERAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60
           MRNL+WFKPISINGKPGRRLSLGEY+RAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG
Sbjct: 1   MRNLNWFKPISINGKPGRRLSLGEYQRAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60

Query: 61  FKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEHLANFLENQFISEVALLFRLRHPNIIT 120
           FK ATGRHSRIYRGVYKQRDVAIKLISQPEEDE+LANFLENQFISEVALLFRLRHPNIIT
Sbjct: 61  FKFATGRHSRIYRGVYKQRDVAIKLISQPEEDENLANFLENQFISEVALLFRLRHPNIIT 120

Query: 121 FIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHSVPLNLVLKLALEISRGMQYLHSQGIL 180
           FIAACKKPPVFCIITEYM GGSLRKYLHQQEPHSVPLNLVLKLAL+ISRGMQYLHSQGIL
Sbjct: 121 FIAACKKPPVFCIITEYMTGGSLRKYLHQQEPHSVPLNLVLKLALDISRGMQYLHSQGIL 180

Query: 181 HRDLKSENLLLGEDMCIKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240
           HRDLKSENLLLGEDMC+KVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV
Sbjct: 181 HRDLKSENLLLGEDMCVKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240

Query: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSACPEAFRHLIKRCWSKNPNK 300
           YSFGIVLWELLTALTPFDN+TPEQAAFAVCQKNARPPLPSACP+AFRHLIKRCWSK P+K
Sbjct: 241 YSFGIVLWELLTALTPFDNLTPEQAAFAVCQKNARPPLPSACPQAFRHLIKRCWSKKPDK 300

Query: 301 RPHFDEIVSILEAYMESYNEDREFFCHYLP-SSRNTPLQCLPKCITEQLCASWKPRNSSS 360
           RPHFDEIVSILE Y+ESYNED EFFCHY+P SSR    +CLPKCIT+Q  AS KPRNSSS
Sbjct: 301 RPHFDEIVSILETYVESYNEDPEFFCHYVPSSSRYIAWKCLPKCITKQSSASLKPRNSSS 360

BLAST of Cp4.1LG20g02350 vs. TrEMBL
Match: A0A061FTL2_THECC (Serine/threonine-protein kinase HT1 OS=Theobroma cacao GN=TCM_012052 PE=4 SV=1)

HSP 1 Score: 623.6 bits (1607), Expect = 1.5e-175
Identity = 299/344 (86.92%), Positives = 315/344 (91.57%), Query Frame = 1

Query: 1   MRNLSWFKPISINGKPGRRLSLGEYERAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60
           M+NL WFK IS N +  RRLSLGEY+RA+SWSKYLVSSGAEIKGEGEEEWSADMSQLFIG
Sbjct: 1   MKNLYWFKQISNNVRSERRLSLGEYKRAISWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60

Query: 61  FKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEHLANFLENQFISEVALLFRLRHPNIIT 120
            K A+GRHSRIYRG+YKQRDVAIKLISQPEED +LANFLE QFISEVALLF LRHPNIIT
Sbjct: 61  NKFASGRHSRIYRGIYKQRDVAIKLISQPEEDANLANFLEKQFISEVALLFHLRHPNIIT 120

Query: 121 FIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHSVPLNLVLKLALEISRGMQYLHSQGIL 180
           F+AACKKPPVFCIITEY+AGGSLRKYLHQQEP+SVPLNLVLKLAL+I+RGMQYLHS+GIL
Sbjct: 121 FVAACKKPPVFCIITEYLAGGSLRKYLHQQEPYSVPLNLVLKLALDIARGMQYLHSEGIL 180

Query: 181 HRDLKSENLLLGEDMCIKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240
           HRDLKSENLLLGEDMC+KVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV
Sbjct: 181 HRDLKSENLLLGEDMCVKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240

Query: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSACPEAFRHLIKRCWSKNPNK 300
           YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPS CP AF HLI RCWS NP K
Sbjct: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSTCPLAFGHLINRCWSSNPQK 300

Query: 301 RPHFDEIVSILEAYMESYNEDREFFCHYLPSSRNTPLQCLPKCI 345
           RPHFDEIVSILE Y ES  ED EFF  Y PS  +  L+CLPKCI
Sbjct: 301 RPHFDEIVSILEHYAESLEEDPEFFSTYKPSPDHGILRCLPKCI 344

BLAST of Cp4.1LG20g02350 vs. TrEMBL
Match: A0A067K4N8_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11483 PE=4 SV=1)

HSP 1 Score: 620.5 bits (1599), Expect = 1.2e-174
Identity = 293/342 (85.67%), Positives = 315/342 (92.11%), Query Frame = 1

Query: 1   MRNLSWFKPISINGKPGRRLSLGEYERAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60
           M+NL WFK IS NG+ GRRLSLGEY+RAVSWSKYLVSSGAEIKGEGE EWSADMSQLFIG
Sbjct: 1   MKNLYWFKQISNNGRSGRRLSLGEYKRAVSWSKYLVSSGAEIKGEGEVEWSADMSQLFIG 60

Query: 61  FKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEHLANFLENQFISEVALLFRLRHPNIIT 120
            K A+GRHSRIYRG+YKQRDVAIK++SQPEEDE LA+ LE QF SEVALLFRLRHPNIIT
Sbjct: 61  NKFASGRHSRIYRGIYKQRDVAIKIVSQPEEDEDLASMLEKQFTSEVALLFRLRHPNIIT 120

Query: 121 FIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHSVPLNLVLKLALEISRGMQYLHSQGIL 180
           F+AACKKPPVFCIITEY+AGGSLR+YLHQQEPHSVPLNLVLKLAL+I+RGMQYLHS+GIL
Sbjct: 121 FVAACKKPPVFCIITEYLAGGSLRRYLHQQEPHSVPLNLVLKLALDIARGMQYLHSRGIL 180

Query: 181 HRDLKSENLLLGEDMCIKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240
           HRDLKSENLLLGEDMC+KVADFGISCLE+QCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV
Sbjct: 181 HRDLKSENLLLGEDMCVKVADFGISCLETQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240

Query: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSACPEAFRHLIKRCWSKNPNK 300
           YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLP ACP AF HLI RCWS NP+K
Sbjct: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPPACPLAFSHLINRCWSSNPDK 300

Query: 301 RPHFDEIVSILEAYMESYNEDREFFCHYLPSSRNTPLQCLPK 343
           RPHFDEIV+ILE Y ES+ +D EFF +Y P S  T L+C PK
Sbjct: 301 RPHFDEIVAILEGYTESFEQDPEFFKNYKPYSEQTILRCFPK 342

BLAST of Cp4.1LG20g02350 vs. TrEMBL
Match: V4SYN7_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10020764mg PE=4 SV=1)

HSP 1 Score: 614.0 bits (1582), Expect = 1.2e-172
Identity = 290/355 (81.69%), Positives = 315/355 (88.73%), Query Frame = 1

Query: 1   MRNLSWFKPISINGKPGRRLSLGEYERAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60
           M+N  WFK ISIN KP R LSL EY RAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG
Sbjct: 1   MKNFHWFKQISINAKPERMLSLREYRRAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60

Query: 61  FKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEHLANFLENQFISEVALLFRLRHPNIIT 120
            K A+GRHSRIYRG+YKQRDVAIKL+SQPEED  LA+ LE QF SEVALLFRL HP+IIT
Sbjct: 61  CKFASGRHSRIYRGIYKQRDVAIKLVSQPEEDASLASMLEKQFTSEVALLFRLNHPHIIT 120

Query: 121 FIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHSVPLNLVLKLALEISRGMQYLHSQGIL 180
           F+AACKKPPVFCIITEY+AGGSLRKYLHQQEP+SVPLNLVLKLAL+I+RGMQYLHSQGIL
Sbjct: 121 FVAACKKPPVFCIITEYLAGGSLRKYLHQQEPYSVPLNLVLKLALDIARGMQYLHSQGIL 180

Query: 181 HRDLKSENLLLGEDMCIKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240
           HRDLKSENLLLGEDMC+KVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEK HTKKVDV
Sbjct: 181 HRDLKSENLLLGEDMCVKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKRHTKKVDV 240

Query: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSACPEAFRHLIKRCWSKNPNK 300
           YSFG+VLWELLTALTPFDNMTPEQAAFAVCQKNARPPLP  CP+AF +LI RCWS +P++
Sbjct: 241 YSFGVVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPPTCPKAFNYLISRCWSSSPDR 300

Query: 301 RPHFDEIVSILEAYMESYNEDREFFCHYLPSSRNTPLQCLPKCITEQLCASWKPR 356
           RPHFD+IVSILE Y ES  +D EFF  ++PS  +T L+CLP CI    CA  K +
Sbjct: 301 RPHFDQIVSILEGYSESLEQDPEFFSSFIPSPDHTILRCLPTCIARHCCAHSKAK 355

BLAST of Cp4.1LG20g02350 vs. TrEMBL
Match: B9RB33_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1510770 PE=4 SV=1)

HSP 1 Score: 613.6 bits (1581), Expect = 1.5e-172
Identity = 291/343 (84.84%), Positives = 312/343 (90.96%), Query Frame = 1

Query: 1   MRNLSWFKPISINGKPGRRLSLGEYERAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60
           M+NL WFK IS NG+ GRRLSLGEY+RAVSWSKYLVSSGAEIKGEGE EWSADMSQLFIG
Sbjct: 1   MKNLYWFKQISNNGRSGRRLSLGEYKRAVSWSKYLVSSGAEIKGEGEIEWSADMSQLFIG 60

Query: 61  FKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEHLANFLENQFISEVALLFRLRHPNIIT 120
            K A+GRHSRIYRG+YKQRDVAIK++SQPEEDE LA  LE QF SEVALLFRL HPNIIT
Sbjct: 61  NKFASGRHSRIYRGIYKQRDVAIKIVSQPEEDEDLAAMLEKQFTSEVALLFRLSHPNIIT 120

Query: 121 FIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHSVPLNLVLKLALEISRGMQYLHSQGIL 180
           F+AACKK PV+CIITEY+AGGSLRKYLHQQEPHSVPLNLVLKLA++I+RGMQYLHSQGIL
Sbjct: 121 FVAACKKTPVYCIITEYLAGGSLRKYLHQQEPHSVPLNLVLKLAIDIARGMQYLHSQGIL 180

Query: 181 HRDLKSENLLLGEDMCIKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240
           HRDLKSENLLLGEDMC+KVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV
Sbjct: 181 HRDLKSENLLLGEDMCVKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240

Query: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSACPEAFRHLIKRCWSKNPNK 300
           YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLP ACP AF HLI RCWS NP+K
Sbjct: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPPACPPAFSHLINRCWSSNPDK 300

Query: 301 RPHFDEIVSILEAYMESYNEDREFFCHYLPSSRNTPLQCLPKC 344
           RPHFDEIV+ILE Y ES  +D EFF +Y P S ++ L+C P C
Sbjct: 301 RPHFDEIVAILEIYTESLEQDPEFFSNYKPHSGHSILRCFPIC 343

BLAST of Cp4.1LG20g02350 vs. TAIR10
Match: AT1G62400.1 (AT1G62400.1 Protein kinase superfamily protein)

HSP 1 Score: 405.6 bits (1041), Expect = 3.2e-113
Identity = 201/326 (61.66%), Positives = 241/326 (73.93%), Query Frame = 1

Query: 25  YERAVSWSKYLVSSGAEI----KGEGEEEWSADMSQLFIGFKLATGRHSRIYRGVYKQRD 84
           ++   SWS  L S   E     KGE  EEW+AD+SQLFIG K A+G HSRIYRG+YKQR 
Sbjct: 6   FDSMESWSMILESENVETWEASKGE-REEWTADLSQLFIGNKFASGAHSRIYRGIYKQRA 65

Query: 85  VAIKLISQPEEDEHLANFLENQFISEVALLFRLRHPNIITFIAACKKPPVFCIITEYMAG 144
           VA+K++  P   E     LE QF SEVALL RL HPNI+ FIAACKKPPV+CIITEYM+ 
Sbjct: 66  VAVKMVRIPTHKEETRAKLEQQFKSEVALLSRLFHPNIVQFIAACKKPPVYCIITEYMSQ 125

Query: 145 GSLRKYLHQQEPHSVPLNLVLKLALEISRGMQYLHSQGILHRDLKSENLLLGEDMCIKVA 204
           G+LR YL+++EP+S+ +  VL+LAL+ISRGM+YLHSQG++HRDLKS NLLL ++M +KVA
Sbjct: 126 GNLRMYLNKKEPYSLSIETVLRLALDISRGMEYLHSQGVIHRDLKSNNLLLNDEMRVKVA 185

Query: 205 DFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDVYSFGIVLWELLTALTPFDNM 264
           DFG SCLE+QC  AKG  GTYRWMAPEMIKEK +T+KVDVYSFGIVLWEL TAL PF  M
Sbjct: 186 DFGTSCLETQCREAKGNMGTYRWMAPEMIKEKPYTRKVDVYSFGIVLWELTTALLPFQGM 245

Query: 265 TPEQAAFAVCQKNARPPLPSACPEAFRHLIKRCWSKNPNKRPHFDEIVSILEAYMESYNE 324
           TP QAAFAV +KN RPPLP++C  A  HLIKRCWS+NP+KRP F  IV++LE Y E   E
Sbjct: 246 TPVQAAFAVAEKNERPPLPASCQPALAHLIKRCWSENPSKRPDFSNIVAVLEKYDECVKE 305

Query: 325 DREFFCH-YLPSSRNTPLQCLPKCIT 346
                 H  L  ++   L  L  C+T
Sbjct: 306 GLPLTSHASLTKTKKAILDHLKGCVT 330

BLAST of Cp4.1LG20g02350 vs. TAIR10
Match: AT5G58950.1 (AT5G58950.1 Protein kinase superfamily protein)

HSP 1 Score: 317.0 bits (811), Expect = 1.5e-86
Identity = 146/299 (48.83%), Positives = 205/299 (68.56%), Query Frame = 1

Query: 26  ERAVSWSKYLVSSGAEIKG-EGEEEWSADMSQLFIGFKLATGRHSRIYRGVYKQRDVAIK 85
           ++   WSK   ++G  +   E  EE+  DMS+LF G K A G +SR+Y G Y+ + VA+K
Sbjct: 175 KKDTGWSKLFDNTGRRVSAVEASEEFRVDMSKLFFGLKFAHGLYSRLYHGKYEDKAVAVK 234

Query: 86  LISQPEEDEH--LANFLENQFISEVALLFRLRHPNIITFIAACKKPPVFCIITEYMAGGS 145
           LI+ P++D++  L   LE QF  EV LL RL HPN+I F+ A K PPV+C++T+Y+  GS
Sbjct: 235 LITVPDDDDNGCLGARLEKQFTKEVTLLSRLTHPNVIKFVGAYKDPPVYCVLTQYLPEGS 294

Query: 146 LRKYLHQQEPHSVPLNLVLKLALEISRGMQYLHSQGILHRDLKSENLLLGEDMCIKVADF 205
           LR +LH+ E  S+PL  +++ A++I+RGM+Y+HS+ I+HRDLK EN+L+ E+  +K+ADF
Sbjct: 295 LRSFLHKPENRSLPLKKLIEFAIDIARGMEYIHSRRIIHRDLKPENVLIDEEFHLKIADF 354

Query: 206 GISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDVYSFGIVLWELLTALTPFDNMTP 265
           GI+C E  C       GTYRWMAPEMIK K H +K DVYSFG+VLWE++    P+++M P
Sbjct: 355 GIACEEEYCDMLADDPGTYRWMAPEMIKRKPHGRKADVYSFGLVLWEMVAGAIPYEDMNP 414

Query: 266 EQAAFAVCQKNARPPLPSACPEAFRHLIKRCWSKNPNKRPHFDEIVSILEAYMESYNED 322
            QAAFAV  KN RP +P  CP A + LI++CWS  P+KRP F +IV +LE +  S   +
Sbjct: 415 IQAAFAVVHKNIRPAIPGDCPVAMKALIEQCWSVAPDKRPEFWQIVKVLEQFAISLERE 473

BLAST of Cp4.1LG20g02350 vs. TAIR10
Match: AT2G24360.1 (AT2G24360.1 Protein kinase superfamily protein)

HSP 1 Score: 295.8 bits (756), Expect = 3.6e-80
Identity = 133/266 (50.00%), Positives = 184/266 (69.17%), Query Frame = 1

Query: 48  EEWSADMSQLFIGFKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEHLANFLENQFISEV 107
           +EW+ D+ +L +G   A G   ++Y+G Y   DVAIK++ +PE     A F+E QF  EV
Sbjct: 121 DEWTIDLRKLNMGPAFAQGAFGKLYKGTYNGEDVAIKILERPENSPEKAQFMEQQFQQEV 180

Query: 108 ALLFRLRHPNIITFIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHSVPLNLVLKLALEI 167
           ++L  L+HPNI+ FI AC+KP V+CI+TEY  GGS+R++L +++  +VPL L +K AL++
Sbjct: 181 SMLANLKHPNIVRFIGACRKPMVWCIVTEYAKGGSVRQFLTRRQNRAVPLKLAVKQALDV 240

Query: 168 SRGMQYLHSQGILHRDLKSENLLLGEDMCIKVADFGISCLESQCGSAKGFTGTYRWMAPE 227
           +RGM Y+H +  +HRDLKS+NLL+  D  IK+ADFG++ +E Q       TGTYRWMAPE
Sbjct: 241 ARGMAYVHGRNFIHRDLKSDNLLISADKSIKIADFGVARIEVQTEGMTPETGTYRWMAPE 300

Query: 228 MIKEKHHTKKVDVYSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSACPEAFR 287
           MI+ + + +KVDVYSFGIVLWEL+T L PF NMT  QAAFAV  +  RP +P+ C     
Sbjct: 301 MIQHRAYNQKVDVYSFGIVLWELITGLLPFQNMTAVQAAFAVVNRGVRPTVPNDCLPVLS 360

Query: 288 HLIKRCWSKNPNKRPHFDEIVSILEA 314
            ++ RCW  NP  RP F E+V +LEA
Sbjct: 361 DIMTRCWDANPEVRPCFVEVVKLLEA 386

BLAST of Cp4.1LG20g02350 vs. TAIR10
Match: AT4G31170.1 (AT4G31170.1 Protein kinase superfamily protein)

HSP 1 Score: 289.3 bits (739), Expect = 3.3e-78
Identity = 134/266 (50.38%), Positives = 183/266 (68.80%), Query Frame = 1

Query: 48  EEWSADMSQLFIGFKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEHLANFLENQFISEV 107
           EEW+ D+ +L +G   A G   ++YRG Y   DVAIKL+ + + +   A  LE QF  EV
Sbjct: 122 EEWTIDLRKLHMGPAFAQGAFGKLYRGTYNGEDVAIKLLERSDSNPEKAQALEQQFQQEV 181

Query: 108 ALLFRLRHPNIITFIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHSVPLNLVLKLALEI 167
           ++L  L+HPNI+ FI AC KP V+CI+TEY  GGS+R++L +++  +VPL L +  AL++
Sbjct: 182 SMLAFLKHPNIVRFIGACIKPMVWCIVTEYAKGGSVRQFLTKRQNRAVPLKLAVMQALDV 241

Query: 168 SRGMQYLHSQGILHRDLKSENLLLGEDMCIKVADFGISCLESQCGSAKGFTGTYRWMAPE 227
           +RGM Y+H +  +HRDLKS+NLL+  D  IK+ADFG++ +E Q       TGTYRWMAPE
Sbjct: 242 ARGMAYVHERNFIHRDLKSDNLLISADRSIKIADFGVARIEVQTEGMTPETGTYRWMAPE 301

Query: 228 MIKEKHHTKKVDVYSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSACPEAFR 287
           MI+ + +T+KVDVYSFGIVLWEL+T L PF NMT  QAAFAV  +  RP +P+ C     
Sbjct: 302 MIQHRPYTQKVDVYSFGIVLWELITGLLPFQNMTAVQAAFAVVNRGVRPTVPADCLPVLG 361

Query: 288 HLIKRCWSKNPNKRPHFDEIVSILEA 314
            ++ RCW  +P  RP F EIV++LEA
Sbjct: 362 EIMTRCWDADPEVRPCFAEIVNLLEA 387

BLAST of Cp4.1LG20g02350 vs. TAIR10
Match: AT3G46930.1 (AT3G46930.1 Protein kinase superfamily protein)

HSP 1 Score: 262.7 bits (670), Expect = 3.3e-70
Identity = 136/315 (43.17%), Positives = 197/315 (62.54%), Query Frame = 1

Query: 32  SKYLVSSGAEIKGEGE-EEWSADMSQLFIGFKLATGRHSRIYRGVYKQRDVAIKLISQPE 91
           SK +   G+++   G  EE   D+S+L  G + A G++S+IY G Y+ + VA+K+I+ PE
Sbjct: 135 SKSVDYRGSKVSSAGVLEECLIDVSKLSYGDRFAHGKYSQIYHGEYEGKAVALKIITAPE 194

Query: 92  E--DEHLANFLENQFISEVALLFRLRHPNIITFIAACKKPPVFCIITEYMAGGSLRKYLH 151
           +  D  L   LE +FI E  LL RL HPN++ F+         CIITEY+  GSLR YLH
Sbjct: 195 DSDDIFLGARLEKEFIVEATLLSRLSHPNVVKFVGVNTGN---CIITEYVPRGSLRSYLH 254

Query: 152 QQEPHSVPLNLVLKLALEISRGMQYLHSQGILHRDLKSENLLLGEDMCIKVADFGISCLE 211
           + E  S+PL  ++   L+I++GM+Y+HS+ I+H+DLK EN+L+  D  +K+ADFGI+C E
Sbjct: 255 KLEQKSLPLEQLIDFGLDIAKGMEYIHSREIVHQDLKPENVLIDNDFHLKIADFGIACEE 314

Query: 212 SQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDVYSFGIVLWELLTALTPFDNMT-PEQAAF 271
             C       GTYRWMAPE++K   H +K DVYSFG++LWE++    P++ M   EQ A+
Sbjct: 315 EYCDVLGDNIGTYRWMAPEVLKRIPHGRKCDVYSFGLLLWEMVAGALPYEEMKFAEQIAY 374

Query: 272 AVCQKNARPPLPSACPEAFRHLIKRCWSKNPNKRPHFDEIVSILEAYMESYNEDREFFCH 331
           AV  K  RP +P+ CP A + LI+RCWS   +KRP F +IV +LE + +S   + +   +
Sbjct: 375 AVIYKKIRPVIPTDCPAAMKELIERCWSSQTDKRPEFWQIVKVLEHFKKSLTSEGKL--N 434

Query: 332 YLPSSRNTPLQCLPK 343
            LPS     L+  PK
Sbjct: 435 LLPSQICPELKKCPK 444

BLAST of Cp4.1LG20g02350 vs. NCBI nr
Match: gi|449469533|ref|XP_004152474.1| (PREDICTED: serine/threonine-protein kinase HT1-like [Cucumis sativus])

HSP 1 Score: 699.1 bits (1803), Expect = 3.9e-198
Identity = 335/360 (93.06%), Positives = 347/360 (96.39%), Query Frame = 1

Query: 1   MRNLSWFKPISINGKPGRRLSLGEYERAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60
           MRNL+WFKPISINGKPGRRLSLGEY+RAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG
Sbjct: 1   MRNLNWFKPISINGKPGRRLSLGEYQRAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60

Query: 61  FKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEHLANFLENQFISEVALLFRLRHPNIIT 120
           FK ATGRHSRIYRGVYKQRDVAIKLISQPEEDE+LANFLENQFISEVALLFRLRHPNIIT
Sbjct: 61  FKFATGRHSRIYRGVYKQRDVAIKLISQPEEDENLANFLENQFISEVALLFRLRHPNIIT 120

Query: 121 FIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHSVPLNLVLKLALEISRGMQYLHSQGIL 180
           FIAACKKPPVFCIITEYM GGSLRKYLHQQEPHSVPLNLVLKLAL+ISRGMQYLHSQGIL
Sbjct: 121 FIAACKKPPVFCIITEYMTGGSLRKYLHQQEPHSVPLNLVLKLALDISRGMQYLHSQGIL 180

Query: 181 HRDLKSENLLLGEDMCIKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240
           HRDLKSENLLLGEDMC+KVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV
Sbjct: 181 HRDLKSENLLLGEDMCVKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240

Query: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSACPEAFRHLIKRCWSKNPNK 300
           YSFGIVLWELLTALTPFDN+TPEQAAFAVCQKNARPPLPSACP+AFRHLIKRCWSK P+K
Sbjct: 241 YSFGIVLWELLTALTPFDNLTPEQAAFAVCQKNARPPLPSACPQAFRHLIKRCWSKKPDK 300

Query: 301 RPHFDEIVSILEAYMESYNEDREFFCHYLP-SSRNTPLQCLPKCITEQLCASWKPRNSSS 360
           RPHFDEIVSILE Y+ESYNED EFFCHY+P SSR    +CLPKCIT+Q  AS KPRNSSS
Sbjct: 301 RPHFDEIVSILETYVESYNEDPEFFCHYVPSSSRYIAWKCLPKCITKQSSASLKPRNSSS 360

BLAST of Cp4.1LG20g02350 vs. NCBI nr
Match: gi|590663374|ref|XP_007036199.1| (Serine/threonine-protein kinase HT1 [Theobroma cacao])

HSP 1 Score: 623.6 bits (1607), Expect = 2.1e-175
Identity = 299/344 (86.92%), Positives = 315/344 (91.57%), Query Frame = 1

Query: 1   MRNLSWFKPISINGKPGRRLSLGEYERAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60
           M+NL WFK IS N +  RRLSLGEY+RA+SWSKYLVSSGAEIKGEGEEEWSADMSQLFIG
Sbjct: 1   MKNLYWFKQISNNVRSERRLSLGEYKRAISWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60

Query: 61  FKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEHLANFLENQFISEVALLFRLRHPNIIT 120
            K A+GRHSRIYRG+YKQRDVAIKLISQPEED +LANFLE QFISEVALLF LRHPNIIT
Sbjct: 61  NKFASGRHSRIYRGIYKQRDVAIKLISQPEEDANLANFLEKQFISEVALLFHLRHPNIIT 120

Query: 121 FIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHSVPLNLVLKLALEISRGMQYLHSQGIL 180
           F+AACKKPPVFCIITEY+AGGSLRKYLHQQEP+SVPLNLVLKLAL+I+RGMQYLHS+GIL
Sbjct: 121 FVAACKKPPVFCIITEYLAGGSLRKYLHQQEPYSVPLNLVLKLALDIARGMQYLHSEGIL 180

Query: 181 HRDLKSENLLLGEDMCIKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240
           HRDLKSENLLLGEDMC+KVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV
Sbjct: 181 HRDLKSENLLLGEDMCVKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240

Query: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSACPEAFRHLIKRCWSKNPNK 300
           YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPS CP AF HLI RCWS NP K
Sbjct: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSTCPLAFGHLINRCWSSNPQK 300

Query: 301 RPHFDEIVSILEAYMESYNEDREFFCHYLPSSRNTPLQCLPKCI 345
           RPHFDEIVSILE Y ES  ED EFF  Y PS  +  L+CLPKCI
Sbjct: 301 RPHFDEIVSILEHYAESLEEDPEFFSTYKPSPDHGILRCLPKCI 344

BLAST of Cp4.1LG20g02350 vs. NCBI nr
Match: gi|802652351|ref|XP_012080077.1| (PREDICTED: serine/threonine-protein kinase HT1-like [Jatropha curcas])

HSP 1 Score: 620.5 bits (1599), Expect = 1.8e-174
Identity = 293/342 (85.67%), Positives = 315/342 (92.11%), Query Frame = 1

Query: 1   MRNLSWFKPISINGKPGRRLSLGEYERAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60
           M+NL WFK IS NG+ GRRLSLGEY+RAVSWSKYLVSSGAEIKGEGE EWSADMSQLFIG
Sbjct: 1   MKNLYWFKQISNNGRSGRRLSLGEYKRAVSWSKYLVSSGAEIKGEGEVEWSADMSQLFIG 60

Query: 61  FKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEHLANFLENQFISEVALLFRLRHPNIIT 120
            K A+GRHSRIYRG+YKQRDVAIK++SQPEEDE LA+ LE QF SEVALLFRLRHPNIIT
Sbjct: 61  NKFASGRHSRIYRGIYKQRDVAIKIVSQPEEDEDLASMLEKQFTSEVALLFRLRHPNIIT 120

Query: 121 FIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHSVPLNLVLKLALEISRGMQYLHSQGIL 180
           F+AACKKPPVFCIITEY+AGGSLR+YLHQQEPHSVPLNLVLKLAL+I+RGMQYLHS+GIL
Sbjct: 121 FVAACKKPPVFCIITEYLAGGSLRRYLHQQEPHSVPLNLVLKLALDIARGMQYLHSRGIL 180

Query: 181 HRDLKSENLLLGEDMCIKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240
           HRDLKSENLLLGEDMC+KVADFGISCLE+QCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV
Sbjct: 181 HRDLKSENLLLGEDMCVKVADFGISCLETQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240

Query: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSACPEAFRHLIKRCWSKNPNK 300
           YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLP ACP AF HLI RCWS NP+K
Sbjct: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPPACPLAFSHLINRCWSSNPDK 300

Query: 301 RPHFDEIVSILEAYMESYNEDREFFCHYLPSSRNTPLQCLPK 343
           RPHFDEIV+ILE Y ES+ +D EFF +Y P S  T L+C PK
Sbjct: 301 RPHFDEIVAILEGYTESFEQDPEFFKNYKPYSEQTILRCFPK 342

BLAST of Cp4.1LG20g02350 vs. NCBI nr
Match: gi|567894640|ref|XP_006439808.1| (hypothetical protein CICLE_v10020764mg [Citrus clementina])

HSP 1 Score: 614.0 bits (1582), Expect = 1.7e-172
Identity = 290/355 (81.69%), Positives = 315/355 (88.73%), Query Frame = 1

Query: 1   MRNLSWFKPISINGKPGRRLSLGEYERAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60
           M+N  WFK ISIN KP R LSL EY RAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG
Sbjct: 1   MKNFHWFKQISINAKPERMLSLREYRRAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60

Query: 61  FKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEHLANFLENQFISEVALLFRLRHPNIIT 120
            K A+GRHSRIYRG+YKQRDVAIKL+SQPEED  LA+ LE QF SEVALLFRL HP+IIT
Sbjct: 61  CKFASGRHSRIYRGIYKQRDVAIKLVSQPEEDASLASMLEKQFTSEVALLFRLNHPHIIT 120

Query: 121 FIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHSVPLNLVLKLALEISRGMQYLHSQGIL 180
           F+AACKKPPVFCIITEY+AGGSLRKYLHQQEP+SVPLNLVLKLAL+I+RGMQYLHSQGIL
Sbjct: 121 FVAACKKPPVFCIITEYLAGGSLRKYLHQQEPYSVPLNLVLKLALDIARGMQYLHSQGIL 180

Query: 181 HRDLKSENLLLGEDMCIKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240
           HRDLKSENLLLGEDMC+KVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEK HTKKVDV
Sbjct: 181 HRDLKSENLLLGEDMCVKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKRHTKKVDV 240

Query: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSACPEAFRHLIKRCWSKNPNK 300
           YSFG+VLWELLTALTPFDNMTPEQAAFAVCQKNARPPLP  CP+AF +LI RCWS +P++
Sbjct: 241 YSFGVVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPPTCPKAFNYLISRCWSSSPDR 300

Query: 301 RPHFDEIVSILEAYMESYNEDREFFCHYLPSSRNTPLQCLPKCITEQLCASWKPR 356
           RPHFD+IVSILE Y ES  +D EFF  ++PS  +T L+CLP CI    CA  K +
Sbjct: 301 RPHFDQIVSILEGYSESLEQDPEFFSSFIPSPDHTILRCLPTCIARHCCAHSKAK 355

BLAST of Cp4.1LG20g02350 vs. NCBI nr
Match: gi|255540687|ref|XP_002511408.1| (PREDICTED: serine/threonine-protein kinase HT1 [Ricinus communis])

HSP 1 Score: 613.6 bits (1581), Expect = 2.2e-172
Identity = 291/343 (84.84%), Positives = 312/343 (90.96%), Query Frame = 1

Query: 1   MRNLSWFKPISINGKPGRRLSLGEYERAVSWSKYLVSSGAEIKGEGEEEWSADMSQLFIG 60
           M+NL WFK IS NG+ GRRLSLGEY+RAVSWSKYLVSSGAEIKGEGE EWSADMSQLFIG
Sbjct: 1   MKNLYWFKQISNNGRSGRRLSLGEYKRAVSWSKYLVSSGAEIKGEGEIEWSADMSQLFIG 60

Query: 61  FKLATGRHSRIYRGVYKQRDVAIKLISQPEEDEHLANFLENQFISEVALLFRLRHPNIIT 120
            K A+GRHSRIYRG+YKQRDVAIK++SQPEEDE LA  LE QF SEVALLFRL HPNIIT
Sbjct: 61  NKFASGRHSRIYRGIYKQRDVAIKIVSQPEEDEDLAAMLEKQFTSEVALLFRLSHPNIIT 120

Query: 121 FIAACKKPPVFCIITEYMAGGSLRKYLHQQEPHSVPLNLVLKLALEISRGMQYLHSQGIL 180
           F+AACKK PV+CIITEY+AGGSLRKYLHQQEPHSVPLNLVLKLA++I+RGMQYLHSQGIL
Sbjct: 121 FVAACKKTPVYCIITEYLAGGSLRKYLHQQEPHSVPLNLVLKLAIDIARGMQYLHSQGIL 180

Query: 181 HRDLKSENLLLGEDMCIKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240
           HRDLKSENLLLGEDMC+KVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV
Sbjct: 181 HRDLKSENLLLGEDMCVKVADFGISCLESQCGSAKGFTGTYRWMAPEMIKEKHHTKKVDV 240

Query: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPSACPEAFRHLIKRCWSKNPNK 300
           YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLP ACP AF HLI RCWS NP+K
Sbjct: 241 YSFGIVLWELLTALTPFDNMTPEQAAFAVCQKNARPPLPPACPPAFSHLINRCWSSNPDK 300

Query: 301 RPHFDEIVSILEAYMESYNEDREFFCHYLPSSRNTPLQCLPKC 344
           RPHFDEIV+ILE Y ES  +D EFF +Y P S ++ L+C P C
Sbjct: 301 RPHFDEIVAILEIYTESLEQDPEFFSNYKPHSGHSILRCFPIC 343

The following BLAST results are available for this feature:
Match Name   E-value   Identity  Description
HT1_ARATH    5.6e-112  61.66     Serine/threonine-protein kinase HT1 OS=Arabidopsis thaliana GN=HT1 PE=1 SV=1
STY8_ARATH   1.2e-66   39.87     Serine/threonine-protein kinase STY8 OS=Arabidopsis thaliana GN=STY8 PE=1 SV=2
STY46_ARATH  8.0e-66   42.55     Serine/threonine-protein kinase STY46 OS=Arabidopsis thaliana GN=STY46 PE=1 SV=1
STY17_ARATH  2.2e-63   40.07     Serine/threonine-protein kinase STY17 OS=Arabidopsis thaliana GN=STY17 PE=1 SV=1
Y9955_DICDI  7.2e-59   44.36     Probable serine/threonine-protein kinase DDB_G0267514 OS=Dictyostelium discoideu...
Match Name        E-value   Identity  Description
A0A0A0LW99_CUCSA  2.8e-198  93.06     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G046040 PE=4 SV=1
A0A061FTL2_THECC  1.5e-175  86.92     Serine/threonine-protein kinase HT1 OS=Theobroma cacao GN=TCM_012052 PE=4 SV=1
A0A067K4N8_JATCU  1.2e-174  85.67     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_11483 PE=4 SV=1
V4SYN7_9ROSI      1.2e-172  81.69     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10020764mg PE=4 SV=1
B9RB33_RICCO      1.5e-172  84.84     Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1510770 PE=4 SV=1
Match Name   E-value   Identity  Description
AT1G62400.1  3.2e-113  61.66     Protein kinase superfamily protein
AT5G58950.1  1.5e-86   48.83     Protein kinase superfamily protein
AT2G24360.1  3.6e-80   50.00     Protein kinase superfamily protein
AT4G31170.1  3.3e-78   50.38     Protein kinase superfamily protein
AT3G46930.1  3.3e-70   43.17     Protein kinase superfamily protein
Match Name                        E-value   Identity  Description
gi|449469533|ref|XP_004152474.1|  3.9e-198  93.06     PREDICTED: serine/threonine-protein kinase HT1-like [Cucumis sativus]
gi|590663374|ref|XP_007036199.1|  2.1e-175  86.92     Serine/threonine-protein kinase HT1 [Theobroma cacao]
gi|802652351|ref|XP_012080077.1|  1.8e-174  85.67     PREDICTED: serine/threonine-protein kinase HT1-like [Jatropha curcas]
gi|567894640|ref|XP_006439808.1|  1.7e-172  81.69     hypothetical protein CICLE_v10020764mg [Citrus clementina]
gi|255540687|ref|XP_002511408.1|  2.2e-172  84.84     PREDICTED: serine/threonine-protein kinase HT1 [Ricinus communis]
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0006468  protein phosphorylation
Vocabulary: Molecular Function
Term        Definition
GO:0005524  ATP binding
GO:0004672  protein kinase activity
GO:0004674  protein serine/threonine kinase activity
Vocabulary: INTERPRO
Term        Definition
IPR011009   Kinase-like_dom_sf
IPR008271   Ser/Thr_kinase_AS
IPR001245   Ser-Thr/Tyr_kinase_cat_dom
IPR000719   Prot_kinase_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006468      protein phosphorylation
biological_process  GO:0009069      serine family amino acid metabolic process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0005886      plasma membrane
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0004674      protein serine/threonine kinase activity
molecular_function  GO:0004672      protein kinase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG20g02350.1  Cp4.1LG20g02350.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                                            Source   Source Term        Source Description               Alignment
IPR000719  Protein kinase domain                                      SMART    SM00220            serkin_6                         coord: 57..315, score: 4.2
IPR000719  Protein kinase domain                                      PROFILE  PS50011            PROTEIN_KINASE_DOM               coord: 57..315, score: 4
IPR001245  Serine-threonine/tyrosine-protein kinase catalytic domain  PRINTS   PR00109            TYRKINASE                        coord: 282..304, score: 3.4E-20
IPR001245  Serine-threonine/tyrosine-protein kinase catalytic domain  PRINTS   PR00109            TYRKINASE                        coord: 173..191, score: 3.4E-20
IPR001245  Serine-threonine/tyrosine-protein kinase catalytic domain  PRINTS   PR00109            TYRKINASE                        coord: 135..148, score: 3.4E-20
IPR001245  Serine-threonine/tyrosine-protein kinase catalytic domain  PRINTS   PR00109            TYRKINASE                        coord: 238..260, score: 3.4E-20
IPR001245  Serine-threonine/tyrosine-protein kinase catalytic domain  PRINTS   PR00109            TYRKINASE                        coord: 219..229, score: 3.4
IPR001245  Serine-threonine/tyrosine-protein kinase catalytic domain  PFAM     PF07714            Pkinase_Tyr                      coord: 61..311, score: 2.4
IPR008271  Serine/threonine-protein kinase, active site               PROSITE  PS00108            PROTEIN_KINASE_ST                coord: 179..191, scor
IPR011009  Protein kinase-like domain                                 unknown  SSF56112           Protein kinase-like (PK-like)    coord: 46..311, score: 5.91
None       No IPR available                                           GENE3D   G3DSA:1.10.510.10                                   coord: 125..310, score: 6.6
None       No IPR available                                           GENE3D   G3DSA:3.30.200.20                                   coord: 40..124, score: 6.9
None       No IPR available                                           PANTHER  PTHR23257          SERINE-THREONINE PROTEIN KINASE  coord: 30..316, score: 6.9E
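
Each InterPro match above is located by a `coord: start..end` entry. The following minimal sketch (not part of the database output; the helper name `span_lengths` is hypothetical) extracts those spans and computes their lengths. It assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

# Matches entries such as "coord: 57..315".
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def span_lengths(text):
    """Return (start, end, length) for every 'coord: start..end' in text.

    Assumption: coordinates are 1-based and inclusive, so the length
    is end - start + 1.
    """
    return [
        (int(start), int(end), int(end) - int(start) + 1)
        for start, end in COORD_RE.findall(text)
    ]
```

Applied to the protein kinase domain entry (`coord: 57..315`), this gives a span of 259 residues under the stated assumption.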

The following gene(s) are paralogous to this gene:
Gene             Paralogue        Organism                   Block
Cp4.1LG20g02350  Cp4.1LG09g10400  Cucurbita pepo (Zucchini)  cpecpeB049