Cla97C02G047760 (gene) Watermelon (97103) v2

Name: Cla97C02G047760
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Retrovirus-related Pol polyprotein from transposon 17.6
Location: Cla97Chr02 : 35380273 .. 35381484 (+)
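
The location field can be turned into a feature length with a little arithmetic. The sketch below is a minimal example, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on the page); `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Split 'seqid : start .. end (strand)' into its parts.

    Assumption: coordinates are 1-based and inclusive.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("Cla97Chr02 : 35380273 .. 35381484 (+)")
length = end - start + 1  # inclusive span on the chromosome
print(seqid, strand, length)  # Cla97Chr02 + 1212
```

Under that assumption the feature spans 1212 nt, which is 404 codons.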
   



The following sequences are available for this feature:

Gene sequence (with intron)

ATGATGAGTTCCAAAACTAAAAGACCTCGATCCGTATCTTCCTCGGAAGGGGGTTTCAATCGCCATAAATTTATTAACAAGGATGCTGCCGATCGGTATCAAAAATATGTTGCTAAGAGTAGTGTTATACCGGAAAGGGGCTTGGCTCCGTGCGAAGTTCACCAACCCCAATTATTTGAGAATATTATGCAACGGAGTTGGTCCGACTTTGTGAAACAGCCTGAAGCGGCTGTGGTTCCCATTGTTCGCGAATTCTATGCTAATATGGTTGAGGGCAATTCTCGATCATTTGTACGAGGTCGTCGGGTTCCCTTTGATGCCCTCACAATCAATCAGTACTATCACTTACCTAACTTTGAGCGTGATGAATATGATATCTATGCCAATGAACAGGTGGATGTTCACCAGATCATTCGTCAGCTCTGCCAACCTGGAGCTGAATGGATTGTTAATCCGGGCGAGCCGATTAGATTTAAATCTTCAAACTTGACTGTTTCTAACCAAGTGTGGCACAATTTTATCTGTGCTAAGTTACTTCCTGTTGCTCACACAAATAGTGTCACGAAAGAGAGGGCGATTCTCCTTTATGCTATTGCGACTAAGAGGTCTGTGGATGTCGGTAAAGTTATCCACAAGTCCATATGCCACATCCGAAAGAGTGGCACTGTGGGAGGACTTGGTCATTCATCTCTAATTACAGCCTTGTGTAAGAATGCAGGTGTCTTGTGGAACGAAAATGAAGAGTTGGCCAACCCTAAGGCCATCATGGACAAAAATTTGATTATGGGACTTCGTGGTTGGGGTCTTGAGACCACTGGTGCAGGCCACCGTGACGAGACTACTGGTTCAGGTCACTGTGATGGGACGACTGATGCAGGCCACCGTGATGAGACGACTAATGCAGGCCACCATGATGAAAGAACTGATTCAGGCCATCATGACGAGCCAACTGACCTGGAGGAGGCAGAAGCAGAACCCATACGGGAGGAGCAGCCAAACATGGCCATAGACCTTCCTAGGCAGACACGGAGGCCTCTATCCCTTGATGAGCGAGTTCGACGGATGGAACTTCGTGTTCGGCGCTACCATAGGCGCTCAGAGGAAAGATTTGATCATCTCTACAAGTGTTTGGTTGCTCTGCACGATCGTGGAGCCAGGCATGTGTTTCCTTCTCCCATGCAACCATATATGTCCTCAGACGAGGATCCCTGA

mRNA sequence

ATGATGAGTTCCAAAACTAAAAGACCTCGATCCGTATCTTCCTCGGAAGGGGGTTTCAATCGCCATAAATTTATTAACAAGGATGCTGCCGATCGGTATCAAAAATATGTTGCTAAGAGTAGTGTTATACCGGAAAGGGGCTTGGCTCCGTGCGAAGTTCACCAACCCCAATTATTTGAGAATATTATGCAACGGAGTTGGTCCGACTTTGTGAAACAGCCTGAAGCGGCTGTGGTTCCCATTGTTCGCGAATTCTATGCTAATATGGTTGAGGGCAATTCTCGATCATTTGTACGAGGTCGTCGGGTTCCCTTTGATGCCCTCACAATCAATCAGTACTATCACTTACCTAACTTTGAGCGTGATGAATATGATATCTATGCCAATGAACAGGTGGATGTTCACCAGATCATTCGTCAGCTCTGCCAACCTGGAGCTGAATGGATTGTTAATCCGGGCGAGCCGATTAGATTTAAATCTTCAAACTTGACTGTTTCTAACCAAGTGTGGCACAATTTTATCTGTGCTAAGTTACTTCCTGTTGCTCACACAAATAGTGTCACGAAAGAGAGGGCGATTCTCCTTTATGCTATTGCGACTAAGAGGTCTGTGGATGTCGGTAAAGTTATCCACAAGTCCATATGCCACATCCGAAAGAGTGGCACTGTGGGAGGACTTGGTCATTCATCTCTAATTACAGCCTTGTGTAAGAATGCAGGTGTCTTGTGGAACGAAAATGAAGAGTTGGCCAACCCTAAGGCCATCATGGACAAAAATTTGATTATGGGACTTCGTGGTTGGGGTCTTGAGACCACTGGTGCAGGCCACCGTGACGAGACTACTGGTTCAGGTCACTGTGATGGGACGACTGATGCAGGCCACCGTGATGAGACGACTAATGCAGGCCACCATGATGAAAGAACTGATTCAGGCCATCATGACGAGCCAACTGACCTGGAGGAGGCAGAAGCAGAACCCATACGGGAGGAGCAGCCAAACATGGCCATAGACCTTCCTAGGCAGACACGGAGGCCTCTATCCCTTGATGAGCGAGTTCGACGGATGGAACTTCGTGTTCGGCGCTACCATAGGCGCTCAGAGGAAAGATTTGATCATCTCTACAAGTGTTTGGTTGCTCTGCACGATCGTGGAGCCAGGCATGTGTTTCCTTCTCCCATGCAACCATATATGTCCTCAGACGAGGATCCCTGA

Coding sequence (CDS)

ATGATGAGTTCCAAAACTAAAAGACCTCGATCCGTATCTTCCTCGGAAGGGGGTTTCAATCGCCATAAATTTATTAACAAGGATGCTGCCGATCGGTATCAAAAATATGTTGCTAAGAGTAGTGTTATACCGGAAAGGGGCTTGGCTCCGTGCGAAGTTCACCAACCCCAATTATTTGAGAATATTATGCAACGGAGTTGGTCCGACTTTGTGAAACAGCCTGAAGCGGCTGTGGTTCCCATTGTTCGCGAATTCTATGCTAATATGGTTGAGGGCAATTCTCGATCATTTGTACGAGGTCGTCGGGTTCCCTTTGATGCCCTCACAATCAATCAGTACTATCACTTACCTAACTTTGAGCGTGATGAATATGATATCTATGCCAATGAACAGGTGGATGTTCACCAGATCATTCGTCAGCTCTGCCAACCTGGAGCTGAATGGATTGTTAATCCGGGCGAGCCGATTAGATTTAAATCTTCAAACTTGACTGTTTCTAACCAAGTGTGGCACAATTTTATCTGTGCTAAGTTACTTCCTGTTGCTCACACAAATAGTGTCACGAAAGAGAGGGCGATTCTCCTTTATGCTATTGCGACTAAGAGGTCTGTGGATGTCGGTAAAGTTATCCACAAGTCCATATGCCACATCCGAAAGAGTGGCACTGTGGGAGGACTTGGTCATTCATCTCTAATTACAGCCTTGTGTAAGAATGCAGGTGTCTTGTGGAACGAAAATGAAGAGTTGGCCAACCCTAAGGCCATCATGGACAAAAATTTGATTATGGGACTTCGTGGTTGGGGTCTTGAGACCACTGGTGCAGGCCACCGTGACGAGACTACTGGTTCAGGTCACTGTGATGGGACGACTGATGCAGGCCACCGTGATGAGACGACTAATGCAGGCCACCATGATGAAAGAACTGATTCAGGCCATCATGACGAGCCAACTGACCTGGAGGAGGCAGAAGCAGAACCCATACGGGAGGAGCAGCCAAACATGGCCATAGACCTTCCTAGGCAGACACGGAGGCCTCTATCCCTTGATGAGCGAGTTCGACGGATGGAACTTCGTGTTCGGCGCTACCATAGGCGCTCAGAGGAAAGATTTGATCATCTCTACAAGTGTTTGGTTGCTCTGCACGATCGTGGAGCCAGGCATGTGTTTCCTTCTCCCATGCAACCATATATGTCCTCAGACGAGGATCCCTGA

Protein sequence

MMSSKTKRPRSVSSSEGGFNRHKFINKDAADRYQKYVAKSSVIPERGLAPCEVHQPQLFENIMQRSWSDFVKQPEAAVVPIVREFYANMVEGNSRSFVRGRRVPFDALTINQYYHLPNFERDEYDIYANEQVDVHQIIRQLCQPGAEWIVNPGEPIRFKSSNLTVSNQVWHNFICAKLLPVAHTNSVTKERAILLYAIATKRSVDVGKVIHKSICHIRKSGTVGGLGHSSLITALCKNAGVLWNENEELANPKAIMDKNLIMGLRGWGLETTGAGHRDETTGSGHCDGTTDAGHRDETTNAGHHDERTDSGHHDEPTDLEEAEAEPIREEQPNMAIDLPRQTRRPLSLDERVRRMELRVRRYHRRSEERFDHLYKCLVALHDRGARHVFPSPMQPYMSSDEDP
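
The coding sequence and the protein sequence above can be checked against each other by translating codons. The following is a minimal sketch using the standard genetic code, assuming that is the appropriate table for this nuclear gene; `translate` is a hypothetical helper, and only the first nine and last five codons of the CDS are used here to keep the example short.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a nucleotide string codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons with ambiguous bases are reported as 'X'.
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))

print(translate("ATGATGAGTTCCAAAACTAAAAGACCT"))  # first 9 codons: MMSSKTKRP
print(translate("GACGAGGATCCCTGA"))              # last 5 codons:  DEDP*
```

Passing the full CDS string to `translate` and stripping the trailing `*` gives a sequence that can be compared directly with the protein record.
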
BLAST of Cla97C02G047760 vs. NCBI nr
Match: KGN46897.1 (hypothetical protein Csa_6G149380 [Cucumis sativus])

HSP 1 Score: 576.6 bits (1485), Expect = 6.4e-161
Identity = 315/402 (78.36%), Positives = 342/402 (85.07%), Query Frame = 0

Query: 1   MMSSKTKRPRSVSSSEGGFNRHKFINKDAADRYQKYVAKSSVIPERGLAPCEVHQPQLFE 60
           MMSSKTKR RS  SSEG FNRHKFI+KDAADRY+K V KSSVIPERGLAPCEVHQPQLF+
Sbjct: 1   MMSSKTKRARSALSSEGAFNRHKFISKDAADRYRKLVVKSSVIPERGLAPCEVHQPQLFQ 60

Query: 61  NIMQRSWSDFVKQPEAAVVPIVREFYANMVEGNSRSFVRGRRVPFDALTINQYYHLPNFE 120
           NIMQR WSDFVKQPE AV+ IVREFYANMVEG+SRSFVRGR+V FD  TIN+YYHLPNFE
Sbjct: 61  NIMQRGWSDFVKQPEPAVLSIVREFYANMVEGSSRSFVRGRQVSFDYGTINRYYHLPNFE 120

Query: 121 RDEYDIYANEQVDVHQIIRQLCQPGAEWIVNPGEPIRFKSSNLTVSNQVWHNFICAKLLP 180
           RDEYDIYA+E VDVHQIIR+LCQPGAEW++NPGEPIRFKSSNLTVSNQVWH FICAKLLP
Sbjct: 121 RDEYDIYASEHVDVHQIIRELCQPGAEWVINPGEPIRFKSSNLTVSNQVWHKFICAKLLP 180

Query: 181 VAHTNSVTKERAILLYAIATKRSVDVGKVIHKSICHIRKSGTVGGLGHSSLITALCKNAG 240
           VAHT+SVTKERAILLYAIATKRSVDVGKVI KS+C+IRKSG  GGLGHSSLITALC+N G
Sbjct: 181 VAHTSSVTKERAILLYAIATKRSVDVGKVIQKSLCNIRKSGMTGGLGHSSLITALCRNEG 240

Query: 241 VLWNENEELANPKAIMDKNLIMGLRGWGLETTGAGHRXXXXXXXXXXXXXXXXXXXXXXX 300
           V+WNE EEL +PK IMDK+ IM + GW  E  GAGH                    XXXX
Sbjct: 241 VVWNEKEELVDPKPIMDKSFIMEIPGWSFEPMGAGH--------------------XXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXAEAEPIREEQPNMAIDLPRQTRRPLSLDERVRRMELRVR 360
           XXXXXXXXXXXXXXXXXXXXX EAEPIRE +  + IDLPRQT+RPLSLDE++RR+E RVR
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXEAEPIREVRQTLTIDLPRQTQRPLSLDEQIRRLERRVR 360

Query: 361 RYHRRSEERFDHLYKCLVALHDRGARHVFPSPMQPYMSSDED 403
            YHRRSEERFDHLYKCL ALHDRG  HVFP  MQPY+SSD+D
Sbjct: 361 SYHRRSEERFDHLYKCLFALHDRGVMHVFPPRMQPYVSSDDD 382

BLAST of Cla97C02G047760 vs. NCBI nr
Match: XP_008458668.1 (PREDICTED: uncharacterized protein LOC103497996 [Cucumis melo])

HSP 1 Score: 476.5 bits (1225), Expect = 9.1e-131
Identity = 266/340 (78.24%), Positives = 288/340 (84.71%), Query Frame = 0

Query: 63  MQRSWSDFVKQPEAAVVPIVREFYANMVEGNSRSFVRGRRVPFDALTINQYYHLPNFERD 122
           MQR WSDFVKQPE AVV IVREFYANMVEG+SRSFVRGR+V FD  TIN+YYHLPNFERD
Sbjct: 1   MQRGWSDFVKQPEPAVVSIVREFYANMVEGSSRSFVRGRQVSFDYGTINRYYHLPNFERD 60

Query: 123 EYDIYANEQVDVHQIIRQLCQPGAEWIVNPGEPIRFKSSNLTVSNQVWHNFICAKLLPVA 182
           EY IYA+E VDVHQIIR+LCQPGAEWI+NPGEPIRFKSSNLTVSNQVWH FICAKLLPVA
Sbjct: 61  EYAIYASEHVDVHQIIRELCQPGAEWIINPGEPIRFKSSNLTVSNQVWHKFICAKLLPVA 120

Query: 183 HTNSVTKERAILLYAIATKRSVDVGKVIHKSICHIRKSGTVGGLGHSSLITALCKNAGVL 242
           HT+SVTKERAILLYAIATKRSVDVGKVIHKS+C+IRKSG  GGLGHSSLITALC+N GV+
Sbjct: 121 HTSSVTKERAILLYAIATKRSVDVGKVIHKSLCNIRKSGMTGGLGHSSLITALCRNEGVV 180

Query: 243 WNENEELANPKAIMDKNLIMGLRGWGLETTGAGHRXXXXXXXXXXXXXXXXXXXXXXXXX 302
           WNE EEL +PK IMDKN IMG+ GW  ET GAG                    XXXXXXX
Sbjct: 181 WNEKEELVDPKPIMDKNFIMGIPGWSFETMGAG--------------------XXXXXXX 240

Query: 303 XXXXXXXXXXXXXXXXXXXAEAEPIREEQPNMAIDLPRQTRRPLSLDERVRRMELRVRRY 362
           XXXXXXXXXXXXXXXXXXX EAEPIRE +  + IDLPRQT+RPLSLDE+++R+E RVR Y
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXEAEPIREVRQTLTIDLPRQTQRPLSLDEQIQRLERRVRSY 300

Query: 363 HRRSEERFDHLYKCLVALHDRGARHVFPSPMQPYMSSDED 403
           HRRSEERFDHLYKCL ALHDRG  HVFP  MQPY+SSD+D
Sbjct: 301 HRRSEERFDHLYKCLFALHDRGVMHVFPPRMQPYVSSDDD 320

BLAST of Cla97C02G047760 vs. NCBI nr
Match: KGN51153.1 (hypothetical protein Csa_5G468460 [Cucumis sativus])

HSP 1 Score: 396.4 bits (1017), Expect = 1.2e-106
Identity = 200/276 (72.46%), Positives = 218/276 (78.99%), Query Frame = 0

Query: 1   MMSSKTKRPRSVSSSEGGFNRHKFINKDAADRYQKYVAKSSVIPERGLAPCEVHQPQLFE 60
           MMSSKTKR RS  SSEG FNRHKFI+KDAADRY+K V KSS  PERGLAPCEVHQPQLF+
Sbjct: 1   MMSSKTKRARSALSSEGAFNRHKFISKDAADRYRKLVVKSSTKPERGLAPCEVHQPQLFQ 60

Query: 61  NIMQRSWSDFVKQPEAAVVPIVREFYANMVEGNSRSFVRGRRVPFDALTINQYYHLPNFE 120
           NIMQR WSDFVKQPE AV+ IVREFYANMVEG+SRSFVRGR+V FD  TIN+YYHLPNFE
Sbjct: 61  NIMQRGWSDFVKQPEPAVLSIVREFYANMVEGSSRSFVRGRQVSFDYGTINRYYHLPNFE 120

Query: 121 RDEYDIYANEQVDVHQIIRQLCQPGAEWIVNPGEPIRFKSSNLTVSNQVWHNFICAKLLP 180
           RDEYDIYA+E VDVHQIIR+LCQPGAEW                             LLP
Sbjct: 121 RDEYDIYASEHVDVHQIIRELCQPGAEW-----------------------------LLP 180

Query: 181 VAHTNSVTKERAILLYAIATKRSVDVGKVIHKSICHIRKSGTVGGLGHSSLITALCKNAG 240
           +AHT+SVTKERAILLYAIATKRSVDVGKVI KS+C+IRKSG  GGLGHSSLITALC+N G
Sbjct: 181 MAHTSSVTKERAILLYAIATKRSVDVGKVIQKSLCNIRKSGMTGGLGHSSLITALCRNEG 240

Query: 241 VLWNENEELANPKAIMDKNLIMGLRGWGLETTGAGH 277
           V+WNE EEL +PK IMDK+ IM + GW  E  GAGH
Sbjct: 241 VVWNEKEELVDPKPIMDKSFIMEIPGWSFEPMGAGH 247

BLAST of Cla97C02G047760 vs. NCBI nr
Match: PIN01433.1 (hypothetical protein CDL12_26059 [Handroanthus impetiginosus])

HSP 1 Score: 196.4 bits (498), Expect = 1.8e-46
Identity = 105/274 (38.32%), Positives = 156/274 (56.93%), Query Frame = 0

Query: 2   MSSKTKRPRSVSSSEGGFNRHKFINKDAADRYQKYVAKSSVIPERGL-APCEVHQPQLFE 61
           M+ K KR R  SS     ++ +F++K A +RY   +     I ERG     E +   ++ 
Sbjct: 1   MAPKNKRARKDSSDSR--DKGRFVSKSAEERYHSGLVGKVAIAERGFETKGEAYYEHIYH 60

Query: 62  NIMQRSWSDFVKQPEAAVVPIVREFYANMVE-GNSRSFVRGRRVPFDALTINQYYHLPNF 121
            + +R W  F+  PE+ V+P+VREFYAN  E  N +  VRGR VPFD++TIN+ Y++P  
Sbjct: 61  TVRERKWKTFIASPESGVLPLVREFYANAAEHKNLKCLVRGREVPFDSVTINELYNIPPI 120

Query: 122 ERDEYDIYANEQVDVHQIIRQLCQPGAEWIVNPGEPIRFKSSNLTVSNQVWHNFICAKLL 181
           E D ++ +    +D  ++ R LC  GA+W +  GE + FKS+ L  + ++W  FI A++L
Sbjct: 121 ELDAFENFCENGIDYEELTRTLCPHGAQWKMTKGEYVSFKSNCLDKAAKIWLWFIFARML 180

Query: 182 PVAHTNSVTKERAILLYAIATKRSVDVGKVIHKSICHIRKSGTVGGLGHSSLITALCKNA 241
           P  H+  VT +RA+LLY I T ++ DVGK+I  SI     S    GL   SLIT LC  A
Sbjct: 181 PTRHSGEVTADRALLLYCIMTGKAFDVGKIISDSIIQSANSSR-DGLWFPSLITKLCTRA 240

Query: 242 GVLWNENEELANPKAIMDKNLIMGLRGWGLETTG 274
           GV W+E EEL  P+  +D   ++ +   G +  G
Sbjct: 241 GVKWDEKEELIFPRHPIDNTTMLRILNAGHDEAG 271

BLAST of Cla97C02G047760 vs. NCBI nr
Match: EOY12720.1 (S-locus lectin protein kinase family protein, putative [Theobroma cacao])

HSP 1 Score: 183.7 bits (465), Expect = 1.2e-42
Identity = 95/245 (38.78%), Positives = 149/245 (60.82%), Query Frame = 0

Query: 14   SSEGGFNRHKFINKDAADRYQKYVAKSSVIPERGLAPCEVHQPQLFENIMQRSWSDFVKQ 73
            +S+ G++R KF++ +A  R+ + + K S + ERG     V        I+ R W +F   
Sbjct: 877  TSDNGYDRSKFVSIEAFTRHIQSLNKKSSVLERGFDLPNVRYGDSLSVIIARHWKNFSAH 936

Query: 74   PEAAVVPIVREFYANMVEGNSR-SFVRGRRVPFDALTINQYYHLPNFERDEYDIYANEQV 133
             EAAV+P+VR+FY N  E  +R +F RG++VPFD+ TINQ+ ++P  E DEY  Y +  V
Sbjct: 937  LEAAVMPVVRKFYTNAYEHENRVTFCRGKKVPFDSFTINQFSNIPKIENDEYAHYTDGNV 996

Query: 134  DVHQIIRQLCQPGAEWIVNPGEPIRFKSSNLTVSNQVWHNFICAKLLPVAHTNSVTKERA 193
            ++ ++I  L  PG +W ++ G  + FK++ L    ++W++ + AK+ P+   + VTK+RA
Sbjct: 997  NLDEVITFLYDPGTQWKISKGISVSFKANTLDKFFKIWYHILTAKMFPIKDLSDVTKDRA 1056

Query: 194  ILLYAIATKRSVDVGKVIHKSICHIRKSGTVGGLGHSSLITALCKNAGVLWNENEELANP 253
            ILLYA+ T +S++VGK I  SI H   S     + + SLI ALCK A V W+  EEL + 
Sbjct: 1057 ILLYAMVTGKSINVGKQIFNSIVHCAISAR-DNIWYLSLIIALCKQARVQWSSEEELLHL 1116

Query: 254  KAIMD 258
            +A +D
Sbjct: 1117 RAPLD 1120

BLAST of Cla97C02G047760 vs. TrEMBL
Match: tr|A0A0A0KER1|A0A0A0KER1_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G149380 PE=4 SV=1)

HSP 1 Score: 576.6 bits (1485), Expect = 4.3e-161
Identity = 315/402 (78.36%), Positives = 342/402 (85.07%), Query Frame = 0

Query: 1   MMSSKTKRPRSVSSSEGGFNRHKFINKDAADRYQKYVAKSSVIPERGLAPCEVHQPQLFE 60
           MMSSKTKR RS  SSEG FNRHKFI+KDAADRY+K V KSSVIPERGLAPCEVHQPQLF+
Sbjct: 1   MMSSKTKRARSALSSEGAFNRHKFISKDAADRYRKLVVKSSVIPERGLAPCEVHQPQLFQ 60

Query: 61  NIMQRSWSDFVKQPEAAVVPIVREFYANMVEGNSRSFVRGRRVPFDALTINQYYHLPNFE 120
           NIMQR WSDFVKQPE AV+ IVREFYANMVEG+SRSFVRGR+V FD  TIN+YYHLPNFE
Sbjct: 61  NIMQRGWSDFVKQPEPAVLSIVREFYANMVEGSSRSFVRGRQVSFDYGTINRYYHLPNFE 120

Query: 121 RDEYDIYANEQVDVHQIIRQLCQPGAEWIVNPGEPIRFKSSNLTVSNQVWHNFICAKLLP 180
           RDEYDIYA+E VDVHQIIR+LCQPGAEW++NPGEPIRFKSSNLTVSNQVWH FICAKLLP
Sbjct: 121 RDEYDIYASEHVDVHQIIRELCQPGAEWVINPGEPIRFKSSNLTVSNQVWHKFICAKLLP 180

Query: 181 VAHTNSVTKERAILLYAIATKRSVDVGKVIHKSICHIRKSGTVGGLGHSSLITALCKNAG 240
           VAHT+SVTKERAILLYAIATKRSVDVGKVI KS+C+IRKSG  GGLGHSSLITALC+N G
Sbjct: 181 VAHTSSVTKERAILLYAIATKRSVDVGKVIQKSLCNIRKSGMTGGLGHSSLITALCRNEG 240

Query: 241 VLWNENEELANPKAIMDKNLIMGLRGWGLETTGAGHRXXXXXXXXXXXXXXXXXXXXXXX 300
           V+WNE EEL +PK IMDK+ IM + GW  E  GAGH                    XXXX
Sbjct: 241 VVWNEKEELVDPKPIMDKSFIMEIPGWSFEPMGAGH--------------------XXXX 300

Query: 301 XXXXXXXXXXXXXXXXXXXXXAEAEPIREEQPNMAIDLPRQTRRPLSLDERVRRMELRVR 360
           XXXXXXXXXXXXXXXXXXXXX EAEPIRE +  + IDLPRQT+RPLSLDE++RR+E RVR
Sbjct: 301 XXXXXXXXXXXXXXXXXXXXXXEAEPIREVRQTLTIDLPRQTQRPLSLDEQIRRLERRVR 360

Query: 361 RYHRRSEERFDHLYKCLVALHDRGARHVFPSPMQPYMSSDED 403
            YHRRSEERFDHLYKCL ALHDRG  HVFP  MQPY+SSD+D
Sbjct: 361 SYHRRSEERFDHLYKCLFALHDRGVMHVFPPRMQPYVSSDDD 382

BLAST of Cla97C02G047760 vs. TrEMBL
Match: tr|A0A1S3C7Y0|A0A1S3C7Y0_CUCME (uncharacterized protein LOC103497996 OS=Cucumis melo OX=3656 GN=LOC103497996 PE=4 SV=1)

HSP 1 Score: 476.5 bits (1225), Expect = 6.0e-131
Identity = 266/340 (78.24%), Positives = 288/340 (84.71%), Query Frame = 0

Query: 63  MQRSWSDFVKQPEAAVVPIVREFYANMVEGNSRSFVRGRRVPFDALTINQYYHLPNFERD 122
           MQR WSDFVKQPE AVV IVREFYANMVEG+SRSFVRGR+V FD  TIN+YYHLPNFERD
Sbjct: 1   MQRGWSDFVKQPEPAVVSIVREFYANMVEGSSRSFVRGRQVSFDYGTINRYYHLPNFERD 60

Query: 123 EYDIYANEQVDVHQIIRQLCQPGAEWIVNPGEPIRFKSSNLTVSNQVWHNFICAKLLPVA 182
           EY IYA+E VDVHQIIR+LCQPGAEWI+NPGEPIRFKSSNLTVSNQVWH FICAKLLPVA
Sbjct: 61  EYAIYASEHVDVHQIIRELCQPGAEWIINPGEPIRFKSSNLTVSNQVWHKFICAKLLPVA 120

Query: 183 HTNSVTKERAILLYAIATKRSVDVGKVIHKSICHIRKSGTVGGLGHSSLITALCKNAGVL 242
           HT+SVTKERAILLYAIATKRSVDVGKVIHKS+C+IRKSG  GGLGHSSLITALC+N GV+
Sbjct: 121 HTSSVTKERAILLYAIATKRSVDVGKVIHKSLCNIRKSGMTGGLGHSSLITALCRNEGVV 180

Query: 243 WNENEELANPKAIMDKNLIMGLRGWGLETTGAGHRXXXXXXXXXXXXXXXXXXXXXXXXX 302
           WNE EEL +PK IMDKN IMG+ GW  ET GAG                    XXXXXXX
Sbjct: 181 WNEKEELVDPKPIMDKNFIMGIPGWSFETMGAG--------------------XXXXXXX 240

Query: 303 XXXXXXXXXXXXXXXXXXXAEAEPIREEQPNMAIDLPRQTRRPLSLDERVRRMELRVRRY 362
           XXXXXXXXXXXXXXXXXXX EAEPIRE +  + IDLPRQT+RPLSLDE+++R+E RVR Y
Sbjct: 241 XXXXXXXXXXXXXXXXXXXXEAEPIREVRQTLTIDLPRQTQRPLSLDEQIQRLERRVRSY 300

Query: 363 HRRSEERFDHLYKCLVALHDRGARHVFPSPMQPYMSSDED 403
           HRRSEERFDHLYKCL ALHDRG  HVFP  MQPY+SSD+D
Sbjct: 301 HRRSEERFDHLYKCLFALHDRGVMHVFPPRMQPYVSSDDD 320

BLAST of Cla97C02G047760 vs. TrEMBL
Match: tr|A0A0A0KNI1|A0A0A0KNI1_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G468460 PE=4 SV=1)

HSP 1 Score: 396.4 bits (1017), Expect = 7.9e-107
Identity = 200/276 (72.46%), Positives = 218/276 (78.99%), Query Frame = 0

Query: 1   MMSSKTKRPRSVSSSEGGFNRHKFINKDAADRYQKYVAKSSVIPERGLAPCEVHQPQLFE 60
           MMSSKTKR RS  SSEG FNRHKFI+KDAADRY+K V KSS  PERGLAPCEVHQPQLF+
Sbjct: 1   MMSSKTKRARSALSSEGAFNRHKFISKDAADRYRKLVVKSSTKPERGLAPCEVHQPQLFQ 60

Query: 61  NIMQRSWSDFVKQPEAAVVPIVREFYANMVEGNSRSFVRGRRVPFDALTINQYYHLPNFE 120
           NIMQR WSDFVKQPE AV+ IVREFYANMVEG+SRSFVRGR+V FD  TIN+YYHLPNFE
Sbjct: 61  NIMQRGWSDFVKQPEPAVLSIVREFYANMVEGSSRSFVRGRQVSFDYGTINRYYHLPNFE 120

Query: 121 RDEYDIYANEQVDVHQIIRQLCQPGAEWIVNPGEPIRFKSSNLTVSNQVWHNFICAKLLP 180
           RDEYDIYA+E VDVHQIIR+LCQPGAEW                             LLP
Sbjct: 121 RDEYDIYASEHVDVHQIIRELCQPGAEW-----------------------------LLP 180

Query: 181 VAHTNSVTKERAILLYAIATKRSVDVGKVIHKSICHIRKSGTVGGLGHSSLITALCKNAG 240
           +AHT+SVTKERAILLYAIATKRSVDVGKVI KS+C+IRKSG  GGLGHSSLITALC+N G
Sbjct: 181 MAHTSSVTKERAILLYAIATKRSVDVGKVIQKSLCNIRKSGMTGGLGHSSLITALCRNEG 240

Query: 241 VLWNENEELANPKAIMDKNLIMGLRGWGLETTGAGH 277
           V+WNE EEL +PK IMDK+ IM + GW  E  GAGH
Sbjct: 241 VVWNEKEELVDPKPIMDKSFIMEIPGWSFEPMGAGH 247

BLAST of Cla97C02G047760 vs. TrEMBL
Match: tr|A0A2G9G807|A0A2G9G807_9LAMI (Uncharacterized protein OS=Handroanthus impetiginosus OX=429701 GN=CDL12_26059 PE=4 SV=1)

HSP 1 Score: 196.4 bits (498), Expect = 1.2e-46
Identity = 105/274 (38.32%), Positives = 156/274 (56.93%), Query Frame = 0

Query: 2   MSSKTKRPRSVSSSEGGFNRHKFINKDAADRYQKYVAKSSVIPERGL-APCEVHQPQLFE 61
           M+ K KR R  SS     ++ +F++K A +RY   +     I ERG     E +   ++ 
Sbjct: 1   MAPKNKRARKDSSDSR--DKGRFVSKSAEERYHSGLVGKVAIAERGFETKGEAYYEHIYH 60

Query: 62  NIMQRSWSDFVKQPEAAVVPIVREFYANMVE-GNSRSFVRGRRVPFDALTINQYYHLPNF 121
            + +R W  F+  PE+ V+P+VREFYAN  E  N +  VRGR VPFD++TIN+ Y++P  
Sbjct: 61  TVRERKWKTFIASPESGVLPLVREFYANAAEHKNLKCLVRGREVPFDSVTINELYNIPPI 120

Query: 122 ERDEYDIYANEQVDVHQIIRQLCQPGAEWIVNPGEPIRFKSSNLTVSNQVWHNFICAKLL 181
           E D ++ +    +D  ++ R LC  GA+W +  GE + FKS+ L  + ++W  FI A++L
Sbjct: 121 ELDAFENFCENGIDYEELTRTLCPHGAQWKMTKGEYVSFKSNCLDKAAKIWLWFIFARML 180

Query: 182 PVAHTNSVTKERAILLYAIATKRSVDVGKVIHKSICHIRKSGTVGGLGHSSLITALCKNA 241
           P  H+  VT +RA+LLY I T ++ DVGK+I  SI     S    GL   SLIT LC  A
Sbjct: 181 PTRHSGEVTADRALLLYCIMTGKAFDVGKIISDSIIQSANSSR-DGLWFPSLITKLCTRA 240

Query: 242 GVLWNENEELANPKAIMDKNLIMGLRGWGLETTG 274
           GV W+E EEL  P+  +D   ++ +   G +  G
Sbjct: 241 GVKWDEKEELIFPRHPIDNTTMLRILNAGHDEAG 271

BLAST of Cla97C02G047760 vs. TrEMBL
Match: tr|A0A061F5W4|A0A061F5W4_THECC (S-locus lectin protein kinase family protein, putative OS=Theobroma cacao OX=3641 GN=TCM_046867 PE=4 SV=1)

HSP 1 Score: 183.7 bits (465), Expect = 8.0e-43
Identity = 95/245 (38.78%), Positives = 149/245 (60.82%), Query Frame = 0

Query: 14   SSEGGFNRHKFINKDAADRYQKYVAKSSVIPERGLAPCEVHQPQLFENIMQRSWSDFVKQ 73
            +S+ G++R KF++ +A  R+ + + K S + ERG     V        I+ R W +F   
Sbjct: 877  TSDNGYDRSKFVSIEAFTRHIQSLNKKSSVLERGFDLPNVRYGDSLSVIIARHWKNFSAH 936

Query: 74   PEAAVVPIVREFYANMVEGNSR-SFVRGRRVPFDALTINQYYHLPNFERDEYDIYANEQV 133
             EAAV+P+VR+FY N  E  +R +F RG++VPFD+ TINQ+ ++P  E DEY  Y +  V
Sbjct: 937  LEAAVMPVVRKFYTNAYEHENRVTFCRGKKVPFDSFTINQFSNIPKIENDEYAHYTDGNV 996

Query: 134  DVHQIIRQLCQPGAEWIVNPGEPIRFKSSNLTVSNQVWHNFICAKLLPVAHTNSVTKERA 193
            ++ ++I  L  PG +W ++ G  + FK++ L    ++W++ + AK+ P+   + VTK+RA
Sbjct: 997  NLDEVITFLYDPGTQWKISKGISVSFKANTLDKFFKIWYHILTAKMFPIKDLSDVTKDRA 1056

Query: 194  ILLYAIATKRSVDVGKVIHKSICHIRKSGTVGGLGHSSLITALCKNAGVLWNENEELANP 253
            ILLYA+ T +S++VGK I  SI H   S     + + SLI ALCK A V W+  EEL + 
Sbjct: 1057 ILLYAMVTGKSINVGKQIFNSIVHCAISAR-DNIWYLSLIIALCKQARVQWSSEEELLHL 1116

Query: 254  KAIMD 258
            +A +D
Sbjct: 1117 RAPLD 1120

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
KGN46897.1      6.4e-161  78.36     hypothetical protein Csa_6G149380 [Cucumis sativus]
XP_008458668.1  9.1e-131  78.24     PREDICTED: uncharacterized protein LOC103497996 [Cucumis melo]
KGN51153.1      1.2e-106  72.46     hypothetical protein Csa_5G468460 [Cucumis sativus]
PIN01433.1      1.8e-46   38.32     hypothetical protein CDL12_26059 [Handroanthus impetiginosus]
EOY12720.1      1.2e-42   38.78     S-locus lectin protein kinase family protein, putative [Theobroma cacao]
Match Name                      E-value   Identity  Description
tr|A0A0A0KER1|A0A0A0KER1_CUCSA  4.3e-161  78.36     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G149380 PE=4 SV=1
tr|A0A1S3C7Y0|A0A1S3C7Y0_CUCME  6.0e-131  78.24     uncharacterized protein LOC103497996 OS=Cucumis melo OX=3656 GN=LOC103497996 PE=...
tr|A0A0A0KNI1|A0A0A0KNI1_CUCSA  7.9e-107  72.46     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G468460 PE=4 SV=1
tr|A0A2G9G807|A0A2G9G807_9LAMI  1.2e-46   38.32     Uncharacterized protein OS=Handroanthus impetiginosus OX=429701 GN=CDL12_26059 P...
tr|A0A061F5W4|A0A061F5W4_THECC  8.0e-43   38.78     S-locus lectin protein kinase family protein, putative OS=Theobroma cacao OX=364...
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0016310      phosphorylation
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0016301      kinase activity
molecular_function  GO:0016740      transferase activity
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C02G047760.1  Cla97C02G047760.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term  IPR Description   Source       Source Term  Source Description   Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 272..327
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 280..319

The following gene(s) are paralogous to this gene:

None

The following block(s) cover this gene:
Gene             Organism                      Block
Cla97C02G047760  Watermelon (97103) v2         wmbwmbB012
Cla97C02G047760  Watermelon (97103) v2         wmbwmbB047
Cla97C02G047760  Watermelon (97103) v2         wmbwmbB098
Cla97C02G047760  Silver-seed gourd             carwmbB0164
Cla97C02G047760  Silver-seed gourd             carwmbB0255
Cla97C02G047760  Silver-seed gourd             carwmbB0346
Cla97C02G047760  Silver-seed gourd             carwmbB0966
Cla97C02G047760  Cucumber (Gy14) v2            cgybwmbB026
Cla97C02G047760  Cucumber (Gy14) v2            cgybwmbB187
Cla97C02G047760  Cucumber (Gy14) v2            cgybwmbB414
Cla97C02G047760  Cucumber (Gy14) v1            cgywmbB194
Cla97C02G047760  Cucurbita maxima (Rimu)       cmawmbB255
Cla97C02G047760  Cucurbita maxima (Rimu)       cmawmbB568
Cla97C02G047760  Cucurbita maxima (Rimu)       cmawmbB629
Cla97C02G047760  Cucurbita maxima (Rimu)       cmawmbB850
Cla97C02G047760  Cucurbita maxima (Rimu)       cmawmbB884
Cla97C02G047760  Cucurbita moschata (Rifu)     cmowmbB235
Cla97C02G047760  Cucurbita moschata (Rifu)     cmowmbB545
Cla97C02G047760  Cucurbita moschata (Rifu)     cmowmbB602
Cla97C02G047760  Cucurbita moschata (Rifu)     cmowmbB826
Cla97C02G047760  Cucurbita moschata (Rifu)     cmowmbB854
Cla97C02G047760  Wild cucumber (PI 183967)     cpiwmbB026
Cla97C02G047760  Wild cucumber (PI 183967)     cpiwmbB200
Cla97C02G047760  Wild cucumber (PI 183967)     cpiwmbB458
Cla97C02G047760  Cucumber (Chinese Long) v3    cucwmbB025
Cla97C02G047760  Cucumber (Chinese Long) v3    cucwmbB197
Cla97C02G047760  Cucumber (Chinese Long) v3    cucwmbB451
Cla97C02G047760  Cucumber (Chinese Long) v2    cuwmbB026
Cla97C02G047760  Cucumber (Chinese Long) v2    cuwmbB195
Cla97C02G047760  Cucumber (Chinese Long) v2    cuwmbB434
Cla97C02G047760  Bottle gourd (USVL1VR-Ls)     lsiwmbB124
Cla97C02G047760  Bottle gourd (USVL1VR-Ls)     lsiwmbB228
Cla97C02G047760  Melon (DHL92) v3.6.1          medwmbB314
Cla97C02G047760  Melon (DHL92) v3.6.1          medwmbB520
Cla97C02G047760  Melon (DHL92) v3.5.1          mewmbB326
Cla97C02G047760  Melon (DHL92) v3.5.1          mewmbB528
Cla97C02G047760  Watermelon (Charleston Gray)  wcgwmbB043
Cla97C02G047760  Watermelon (Charleston Gray)  wcgwmbB197
Cla97C02G047760  Watermelon (97103) v1         wmwmbB062
Cla97C02G047760  Watermelon (97103) v1         wmwmbB287
Cla97C02G047760  Wax gourd                     wgowmbB369
Cla97C02G047760  Wax gourd                     wgowmbB418