Cla97C09G164760 (gene) Watermelon (97103) v2.5

Overview
Name: Cla97C09G164760
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: ENT domain-containing protein
Location: Cla97Chr09: 2229749 .. 2232765 (-)
RNA-Seq Expression: Cla97C09G164760
Synteny: Cla97C09G164760
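
As a small illustration of reading the Location field above, the sketch below parses the string and computes the length of the genomic span. It assumes the coordinates are 1-based and inclusive, which is a common convention for genome annotation but is not stated on this page; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'Cla97Chr09: 2229749 .. 2232765 (-)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    # Assumption: both coordinates are inclusive, hence the +1.
    return end - start + 1

seqid, start, end, strand = parse_location("Cla97Chr09: 2229749 .. 2232765 (-)")
print(seqid, strand, span_length(start, end))  # Cla97Chr09 - 3017
```

Under the inclusive-coordinate assumption the gene spans 3017 bp on the minus strand of Cla97Chr09.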
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACTTTCGCGCCCAAAAGCGTTGAGAAATTTTGTGCTCTTTTGCTAACGTGCGCAGGTTTGGCAAAACGACACCGTCTTGTTATGGGTGGAGATGGGAGTGGGACTAAAATGCGCGGGAGGGGTAGGGTAAATAAGACGACACGTCGTATCATTTTGAGATGGACGGTTTTATGGTTATAAATTTTTTTATGTTAGAGGACAAGCGACTGCCACGTCGCAACTTTTCCGGACACTTTATTTTTCAAATAAATTTTGTTTCTTTTTTCAATAAATTTACTATCCGTTTGGATTCAAGAATGTTTGATAAACAATTTTAAATTCTGATTTAAAATTTAAAAAATAATTTTATATTTATTATTTTGTTTGGAACGACAAAAATTTAAAATGAATGTATTTAAGATTTTGTGTTTGGATGATAATCGAAAATGAAAAGAAAGAAGAATTTACTAATATAAGTCATTAACTATGATGTATTAATTGAAAATTAAAATATGACACAGTTAAATAAGTAATTTATCTTTAAATCAATTTTGCGAAATCCGACCATTTTAGTTATGAAAGTTTTTTTAAGTAAAAATAAGATTTAAAATCACTTCAAACTTCAAAATCATTCTTTTCTTTTTTTAATATCAAGTAGCATAGTTCATTTGAAATTCATTATAAATTAAAATCATGAATTTCAAGTCAACCTACACCTAACTTAATTGATTAAACATTATATAATCAATATAAGAGCCAAATATTTGAACTCATCTCCCATGTGGTTAAACTACAAGAGGTTGATTAGGTCAGAAATACCAACTCATGTCCTATATCACATAAACACCCATGCATTTCCTATCTCACATGTTTAACATGATAACTCTTTGTTAAGACTTGGACCTTACAAAAACAAAAAGATTAGGGAACTTAGAACATTGAAAATAAAGCCAACTTTTTCTCGAAATAATAATAAATAATAAAAAATAAAAATAAAAAAACCAACTTGAACCAGTTATATCGGTAATATAATTAACCTAAAATGTGGTAATTAAGCTCGGGTTGACAGTTGGGCCGAAGCCCATGGGCCACATTTAAGGCTAACCGCAGAGGCCCAAGTGGTTTCGCCGCCAAGCGACCAAGGAAATTACATTGCGAATTATTCGGGACAGTCCCTACCACGGCACCACCGTATTACTCCCAATTGGTTTAAGAATTTAAAATTTTTTATAATTACTCTTTGCTCATAAAATCTCTCAGAAACTTTTTTTATTTTCACTCAATTTGAAGAATTTCTTAGCAAGATTCAGAGCATTGTTGATACTCCCCGCTTAATTCTTCTCTCCGGGTATCGATCGCTCTCTCTACCTCTAAATTTGTTGTTTCTTAATCTTTTATTTTCCTTTCATTTTACCATTCAATCAATCGATTTCAGCTTTCTCTTTACTTCCTTCCGCATTTGTCTTACTTTGATTCCGTTAGTTTTCTGCGTTGCTGTCCTTTCTCTGGTTGTTTAAAGCTTTGTTCTTTTTATTGAAATTGGAAATTTCTTTTTGTCGTTTACACGTTGTATAGGAAAATTATAGTTATTGTGCCATTCTATCTGGTTCCATCTGTACTTTTTTGTCTTGCTAAATGACGTTAGAGAGAACTGACCTCCACATATCTCCAAAAAAATTTCCATTAATGAAACATGTAAATCCATTGGAAAATAAAAAGAAGCCCTTTCATTTTGATTTTGATTGGAGCCGGTTAGAGATATTGTCCCTCACATATATACCAATGAACCAATGATCTTCCATTCGAACTTTGGTTCAGCTTCCTGAAAGATATTCTTGATTTTTTATTTTTGATCTTTACAGGTTTCCAGTTCGTTTTAGGATTAGGATCGTAGGAAGTATGAGACTAAGGAAGGGGGATCAAGTTGAGGTGTTGAGCAAGAAGGAGGCTTCGAGTGGCTCATGGAGTTGTGCTGAAATACTTTCTGGTAATGGCCATTTTTGGTATAGCGTAAGATA
TTTGTCCGTGGAGGAAACAGTGGAGAGGGTACCGAGGACGGCGATACGCCCATGTCCTCCCCCGGTTGAACGCCCAAATGTTTGGTTTGTTGGTGATCTTGCTGAAGCATTCCACGATTCCTCATGGAAACAAGCAAAAATAATGAAGATTGTTGGGGTTGATTGCTATATAGTAAGAATACTTGGATCACCTAATTTAGACATATCGGTTGGCCAATCTAATCTGAGAATGCGACAGGCTTGGCATGATGGCAAATGGTTTCTCTTACAGAAGGTACATTTCTTAACAAGTTCAGACTAACTTCGCTTAACCCTTTTCCATATTCTTTACAATCACATTAATTTTGAACCATATGATCTCATATGTTATTTCTTTGGTAGGGCTTTGAAGATTCGGGTTCGTCTAGAAATAGGCAGACAGAGCCAAACATAGTGAAAAGCAAGGATCGACAACTAGTGGTGACGCTTCCTGCTGGACCTCGTAAGAGGCCATTGCCTAGTGAATCCATAAATCAAAATCCATCTGTCCAGAGAAGGAAAGTCACTGAAAAAGATGTGTGGTGTTCGCCTTCATTAACACAAGAGTTGAACAAATCAAGCAATGCTGGTTTGAGAGAAGGCAACCTTAATTCAGGAACCTCAACCCACATTCAGACAGATAGATGTGCATCTTCTGTTGGCAGTAATAGTTTCACTAATGAATTTTTAAAGGCGCCACTTATTTCTATAGCTCGTCGTGGTAAAAAAGTCGAAGATACGGACTATTGCAGCGATGCTGAATCATCTACGGGAAGGGAACATGAGGAGGAAGATTCATGCTCCTATGAGGAAATATTAGCTAAATTCCACAGGTCAGAGCTAAGTGTATTTCGTTCTTTTATTAGGGCATTGTATGCTTCTGGACCTTTAAGTTGGGAAGATGAAGGAGAAGTTTCAAATATTCGTGCTTTACTTCACATATCTAATGATGAATATTTGCTAGAGCTGAGAGATCTAATCTCTACCAATAAAGATTGA

mRNA sequence

ATGACTTTCGCGCCCAAAAGCGTTGAGAAATTTTGTGCTCTTTTGCTAACGTGCGCAGGTTTGGCAAAACGACACCGTCTTGTTATGGGTGGAGATGGGAGTGGGACTAAAATGCGCGGGAGGGAATTTCTTAGCAAGATTCAGAGCATTGTTGATACTCCCCGCTTAATTCTTCTCTCCGGGTTTCCAGTTCGTTTTAGGATTAGGATCGTAGGAAGTATGAGACTAAGGAAGGGGGATCAAGTTGAGGTGTTGAGCAAGAAGGAGGCTTCGAGTGGCTCATGGAGTTGTGCTGAAATACTTTCTGGTAATGGCCATTTTTGGTATAGCGTAAGATATTTGTCCGTGGAGGAAACAGTGGAGAGGGTACCGAGGACGGCGATACGCCCATGTCCTCCCCCGGTTGAACGCCCAAATGTTTGGTTTGTTGGTGATCTTGCTGAAGCATTCCACGATTCCTCATGGAAACAAGCAAAAATAATGAAGATTGTTGGGGTTGATTGCTATATAGTAAGAATACTTGGATCACCTAATTTAGACATATCGGTTGGCCAATCTAATCTGAGAATGCGACAGGCTTGGCATGATGGCAAATGGTTTCTCTTACAGAAGGGCTTTGAAGATTCGGGTTCGTCTAGAAATAGGCAGACAGAGCCAAACATAGTGAAAAGCAAGGATCGACAACTAGTGGTGACGCTTCCTGCTGGACCTCGTAAGAGGCCATTGCCTAGTGAATCCATAAATCAAAATCCATCTGTCCAGAGAAGGAAAGTCACTGAAAAAGATGTGTGGTGTTCGCCTTCATTAACACAAGAGTTGAACAAATCAAGCAATGCTGGTTTGAGAGAAGGCAACCTTAATTCAGGAACCTCAACCCACATTCAGACAGATAGATGTGCATCTTCTGTTGGCAGTAATAGTTTCACTAATGAATTTTTAAAGGCGCCACTTATTTCTATAGCTCGTCGTGGTAAAAAAGTCGAAGATACGGACTATTGCAGCGATGCTGAATCATCTACGGGAAGGGAACATGAGGAGGAAGATTCATGCTCCTATGAGGAAATATTAGCTAAATTCCACAGGTCAGAGCTAAGTGTATTTCGTTCTTTTATTAGGGCATTGTATGCTTCTGGACCTTTAAGTTGGGAAGATGAAGGAGAAGTTTCAAATATTCGTGCTTTACTTCACATATCTAATGATGAATATTTGCTAGAGCTGAGAGATCTAATCTCTACCAATAAAGATTGA

Coding sequence (CDS)

ATGACTTTCGCGCCCAAAAGCGTTGAGAAATTTTGTGCTCTTTTGCTAACGTGCGCAGGTTTGGCAAAACGACACCGTCTTGTTATGGGTGGAGATGGGAGTGGGACTAAAATGCGCGGGAGGGAATTTCTTAGCAAGATTCAGAGCATTGTTGATACTCCCCGCTTAATTCTTCTCTCCGGGTTTCCAGTTCGTTTTAGGATTAGGATCGTAGGAAGTATGAGACTAAGGAAGGGGGATCAAGTTGAGGTGTTGAGCAAGAAGGAGGCTTCGAGTGGCTCATGGAGTTGTGCTGAAATACTTTCTGGTAATGGCCATTTTTGGTATAGCGTAAGATATTTGTCCGTGGAGGAAACAGTGGAGAGGGTACCGAGGACGGCGATACGCCCATGTCCTCCCCCGGTTGAACGCCCAAATGTTTGGTTTGTTGGTGATCTTGCTGAAGCATTCCACGATTCCTCATGGAAACAAGCAAAAATAATGAAGATTGTTGGGGTTGATTGCTATATAGTAAGAATACTTGGATCACCTAATTTAGACATATCGGTTGGCCAATCTAATCTGAGAATGCGACAGGCTTGGCATGATGGCAAATGGTTTCTCTTACAGAAGGGCTTTGAAGATTCGGGTTCGTCTAGAAATAGGCAGACAGAGCCAAACATAGTGAAAAGCAAGGATCGACAACTAGTGGTGACGCTTCCTGCTGGACCTCGTAAGAGGCCATTGCCTAGTGAATCCATAAATCAAAATCCATCTGTCCAGAGAAGGAAAGTCACTGAAAAAGATGTGTGGTGTTCGCCTTCATTAACACAAGAGTTGAACAAATCAAGCAATGCTGGTTTGAGAGAAGGCAACCTTAATTCAGGAACCTCAACCCACATTCAGACAGATAGATGTGCATCTTCTGTTGGCAGTAATAGTTTCACTAATGAATTTTTAAAGGCGCCACTTATTTCTATAGCTCGTCGTGGTAAAAAAGTCGAAGATACGGACTATTGCAGCGATGCTGAATCATCTACGGGAAGGGAACATGAGGAGGAAGATTCATGCTCCTATGAGGAAATATTAGCTAAATTCCACAGGTCAGAGCTAAGTGTATTTCGTTCTTTTATTAGGGCATTGTATGCTTCTGGACCTTTAAGTTGGGAAGATGAAGGAGAAGTTTCAAATATTCGTGCTTTACTTCACATATCTAATGATGAATATTTGCTAGAGCTGAGAGATCTAATCTCTACCAATAAAGATTGA

Protein sequence

MTFAPKSVEKFCALLLTCAGLAKRHRLVMGGDGSGTKMRGREFLSKIQSIVDTPRLILLSGFPVRFRIRIVGSMRLRKGDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRYLSVEETVERVPRTAIRPCPPPVERPNVWFVGDLAEAFHDSSWKQAKIMKIVGVDCYIVRILGSPNLDISVGQSNLRMRQAWHDGKWFLLQKGFEDSGSSRNRQTEPNIVKSKDRQLVVTLPAGPRKRPLPSESINQNPSVQRRKVTEKDVWCSPSLTQELNKSSNAGLREGNLNSGTSTHIQTDRCASSVGSNSFTNEFLKAPLISIARRGKKVEDTDYCSDAESSTGREHEEEDSCSYEEILAKFHRSELSVFRSFIRALYASGPLSWEDEGEVSNIRALLHISNDEYLLELRDLISTNKD
Homology
BLAST of Cla97C09G164760 vs. NCBI nr
Match: XP_038899293.1 (uncharacterized protein LOC120086627 [Benincasa hispida] >XP_038899294.1 uncharacterized protein LOC120086627 [Benincasa hispida])

HSP 1 Score: 565.5 bits (1456), Expect = 3.9e-157
Identity = 290/352 (82.39%), Positives = 305/352 (86.65%), Query Frame = 0

Query: 74  MRLRKGDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRYLSVEETVERVPRTAIRPCPP 133
           MRLRKGDQVEVLSK+ ASSGSWSCAEILS N HFWY+VRYLSVEETVERVPR AIRPCPP
Sbjct: 1   MRLRKGDQVEVLSKEAASSGSWSCAEILSDNDHFWYNVRYLSVEETVERVPRIAIRPCPP 60

Query: 134 PVERPNVWFVGDLAEAFHDSSWKQAKIMKIVGVDCYIVRILGSPNLDISVGQSNLRMRQA 193
           PVER N+WFVGDLAEAFH+SSWKQAKIMKIVGVDCYIVR+LGSPNLDISV QSNLRMRQA
Sbjct: 61  PVERTNIWFVGDLAEAFHNSSWKQAKIMKIVGVDCYIVRLLGSPNLDISVSQSNLRMRQA 120

Query: 194 WHDGKWFLLQKGFEDSGSSRNRQTEPNIVKSKDRQLVVTLPAGPRKRPLPSESINQNPSV 253
           WHDGKWFLLQKG ED GSSRNRQTEPN++K+KD+Q  VTLP G RKRPLP + INQN S+
Sbjct: 121 WHDGKWFLLQKGIEDLGSSRNRQTEPNLMKNKDQQPAVTLPTGSRKRPLPCQPINQNLSI 180

Query: 254 QRRKVTEKDVWCSPSLTQELNKS-----------SNAGLREGNLNSGTSTHIQTDRCASS 313
            RRKV       SPSLTQEL KS           SNAGLREGNL S TSTHI TD C SS
Sbjct: 181 WRRKVG------SPSLTQELKKSNSPIENTNVTTSNAGLREGNLVSRTSTHIHTDSCVSS 240

Query: 314 VGSNSFTNEFLKAPLISIARRGKKVEDTDYCSDAESSTGREHEEEDSCSYEEILAKFHRS 373
           VGSNSFTN+  K P IS+ARRGKK EDTDY SDAESSTGREHEEEDSCSYEE+LAKFHRS
Sbjct: 241 VGSNSFTNDSFKKPFISMARRGKKDEDTDYYSDAESSTGREHEEEDSCSYEEVLAKFHRS 300

Query: 374 ELSVFRSFIRALYASGPLSWEDEGEVSNIRALLHISNDEYLLELRDLISTNK 415
           ELSVFRSFIRALYASGPLSWEDE E+SNIRALLHISNDEYLLELRDLISTNK
Sbjct: 301 ELSVFRSFIRALYASGPLSWEDEVEISNIRALLHISNDEYLLELRDLISTNK 346
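
The percentages in an HSP header are ratios over the alignment length: matching positions divided by aligned columns. The following minimal sketch re-derives them from the counts reported for the first hit. The regular expression and the helper name `parse_ratios` are illustrative assumptions, not part of any BLAST tooling.

```python
import re

# Header line of the first HSP, copied from the report above.
HSP_LINE = "Identity = 290/352 (82.39%), Positives = 305/352 (86.65%), Query Frame = 0"

def parse_ratios(line):
    """Return {label: (count, aligned_length, reported_percent)} for each 'X = a/b (p%)' field."""
    fields = {}
    # The pattern is label-agnostic, so it ignores 'Query Frame = 0' (no a/b ratio).
    for label, count, length, pct in re.findall(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)", line):
        fields[label] = (int(count), int(length), float(pct))
    return fields

ratios = parse_ratios(HSP_LINE)
for label, (count, length, reported) in ratios.items():
    recomputed = round(100.0 * count / length, 2)
    print(label, recomputed, reported)
```

For this hit the recomputed values agree with the reported ones: 290/352 gives 82.39% identity and 305/352 gives 86.65% positives.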

BLAST of Cla97C09G164760 vs. NCBI nr
Match: KAG7014119.1 (DUF724 domain-containing protein 3, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 486.5 bits (1251), Expect = 2.3e-133
Identity = 267/399 (66.92%), Positives = 303/399 (75.94%), Query Frame = 0

Query: 42  EFLSKIQSIVDTPRLILLSGFPV--RFRIRIVGSMRLRKGDQVEVLSKKEASSGSWSCAE 101
           EF+ KIQSIV+TP   L S   +   FRIRI+GSMRLRKGD+VEVLS+KE SSGSWSCAE
Sbjct: 37  EFIGKIQSIVNTP---LNSSLEICNSFRIRIIGSMRLRKGDRVEVLSQKEVSSGSWSCAE 96

Query: 102 ILSGNGHFWYSVRYLSVEETVERVPRTAIRPCPPPVERPNVWFVGDLAEAFHDSSWKQAK 161
           I+SGNG   YSVR+LS  E VERVPR AIRPCPPPV   NVW  GDLAEAFH+ SWKQAK
Sbjct: 97  IISGNGRS-YSVRFLSSVEAVERVPRRAIRPCPPPVAGSNVWAAGDLAEAFHNFSWKQAK 156

Query: 162 IMKIVGVDCYIVRILGSPNLDISVGQSNLRMRQAWHDGKWFLLQKGFEDSGS-SRNRQTE 221
           IMKIV V CYIVR+LGSP LD+ V QSNLRMRQAWHDG+WFLL K    SGS SRN+QT+
Sbjct: 157 IMKIVSVHCYIVRLLGSP-LDVLVRQSNLRMRQAWHDGRWFLLGKAIGGSGSLSRNKQTK 216

Query: 222 PNIVKSKDRQLVVTLPAGPRKRPLPSESINQNPSVQRRKVTEKDVWCSPS--------LT 281
           PN+VKSKD+QLVVT P G RKRPLPS+S++ N SVQ +KVTEKDV C PS        LT
Sbjct: 217 PNMVKSKDQQLVVTFP-GRRKRPLPSQSLSHNLSVQTKKVTEKDVRCLPSSWMKDDMNLT 276

Query: 282 QELN---------------KSSNAGLREGNLNSGTSTHIQTDRCASSVGSNSFTNEFLKA 341
           ++LN                + +AGLREGN    TSTHIQ D CASSVGSN  T++F K 
Sbjct: 277 KDLNTIRLSSTFRTENVEVTTGSAGLREGNPIPVTSTHIQADSCASSVGSNGATDDFFKD 336

Query: 342 PLISIARRGKKVEDTDYCSDAESSTGREHEEEDSCSYEEILAKFHRSELSVFRSFIRALY 401
           P +S+ARR K +ED D  SDAES+TGR H EEDSCSY+E+LA+FHRSELS F SFIR LY
Sbjct: 337 PFVSVARRSKDIEDADCHSDAESATGRGHGEEDSCSYKEVLARFHRSELSAFHSFIRTLY 396

Query: 402 ASGPLSWEDEGEVSNIRALLHISNDEYLLELRDLISTNK 415
           ASGPLSWEDE  VSNI   LHISNDEYL+ELR+L+S NK
Sbjct: 397 ASGPLSWEDEAHVSNICDSLHISNDEYLMELRNLMSANK 429

BLAST of Cla97C09G164760 vs. NCBI nr
Match: XP_038899468.1 (uncharacterized protein LOC120086754 isoform X1 [Benincasa hispida])

HSP 1 Score: 485.0 bits (1247), Expect = 6.7e-133
Identity = 258/364 (70.88%), Positives = 293/364 (80.49%), Query Frame = 0

Query: 74  MRLRKGDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRYLSVEETVERVPRTAIRPCPP 133
           MRLRKGDQVEVLSKKEAS+GSWSCAEI+SGNG   YSV++ S +E +E+VPR AIRPCPP
Sbjct: 1   MRLRKGDQVEVLSKKEASNGSWSCAEIISGNGRL-YSVKFFSSQEAMEKVPRKAIRPCPP 60

Query: 134 PVERPNVWFVGDLAEAFHDSSWKQAKIMKIVGVDCYIVRILGSPNLDISVGQSNLRMRQA 193
           PVE  NVW VGDLAEAFH+SSWKQAKI+KIVGV+CYIVR+LGSP LD+ V +SNLRMRQA
Sbjct: 61  PVEGSNVWDVGDLAEAFHNSSWKQAKILKIVGVNCYIVRLLGSP-LDVMVRKSNLRMRQA 120

Query: 194 WHDGKWFLLQKGFEDSGS-SRNRQTEPNIVKSKDRQLVVTLPAGPRKRPLPSESINQNPS 253
           WHDG+W LL K  E+SGS SRNRQ EPN+++SKD+QLVV LP GPRKRPLPS+ I+   S
Sbjct: 121 WHDGQWILLGKAMENSGSLSRNRQIEPNMMRSKDQQLVVMLPTGPRKRPLPSQFIDHKVS 180

Query: 254 VQRRKVTEKDVWCSPSL-----TQELNK-----SSN------------AGLREGNLNSGT 313
           VQ+RKVTEKDV  S        TQELN      SSN            AGLREGNL  GT
Sbjct: 181 VQKRKVTEKDVRSSAVTTNMYSTQELNTIRLRLSSNFPTENTEVTTGDAGLREGNLIPGT 240

Query: 314 STHIQTDRCASSVGSNSFTNEFLKAPLISIARRGKKVEDTDYCSDAESSTGREHEEEDSC 373
           STHI TD C SSVGSNS T++F   P +S+ARR KKVE+ DY SDAESSTGR H EE+SC
Sbjct: 241 STHIYTDNCTSSVGSNSSTDDFFNVPFVSVARRSKKVENMDYYSDAESSTGRGHGEENSC 300

Query: 374 SYEEILAKFHRSELSVFRSFIRALYASGPLSWEDEGEVSNIRALLHISNDEYLLELRDLI 415
           S+EE+LA+ HRSELSVFRSFIRALYASGPLSWEDEGEVSNIRA LH+SNDEYL+ELR+L+
Sbjct: 301 SHEEVLARPHRSELSVFRSFIRALYASGPLSWEDEGEVSNIRASLHVSNDEYLMELRNLM 360

BLAST of Cla97C09G164760 vs. NCBI nr
Match: XP_008461067.1 (PREDICTED: uncharacterized protein LOC103499769 isoform X1 [Cucumis melo])

HSP 1 Score: 477.2 bits (1227), Expect = 1.4e-130
Identity = 257/365 (70.41%), Positives = 284/365 (77.81%), Query Frame = 0

Query: 74  MRLRKGDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRYLSVEETVERVPRTAIRPCPP 133
           MRLRKGDQVEVL+KKE SSGSWSCAEILSGNG   YSV++LS +E VE+VPR AIRPCPP
Sbjct: 1   MRLRKGDQVEVLNKKEVSSGSWSCAEILSGNGRS-YSVKFLSSDEAVEKVPRKAIRPCPP 60

Query: 134 PVERPNVWFVGDLAEAFHDSSWKQAKIMKIVGVDCYIVRILGSPNLDISVGQSNLRMRQA 193
           P +  N W  GDLAEAFH+SSWK AKIMKIVGV+ YIVRILGSP LDI VG SNLRMRQA
Sbjct: 61  PFQGSNDWDAGDLAEAFHNSSWKHAKIMKIVGVNRYIVRILGSP-LDIMVGSSNLRMRQA 120

Query: 194 WHDGKWFLLQKGFEDSGS-SRNRQTEPNIVKSKDRQLVVTLPAGPRKRPLPSESINQNPS 253
           WHDG+W LL K  E+SGS SRNRQ EPN+V+SKD+QL V LPAG RKR LPSE IN   S
Sbjct: 121 WHDGRWILLGKSMEESGSLSRNRQIEPNMVRSKDQQL-VALPAGSRKRLLPSEFINHKVS 180

Query: 254 VQRRKVTEKDVWCSPSL--------TQELN---------------KSSNAGLREGNLNSG 313
           VQ+RKVTE DV C PSL        TQE N                + +AGLREG L  G
Sbjct: 181 VQKRKVTENDVRCLPSLAITTNMYSTQEFNTIRLSSNLPTENTGVSTGDAGLREGTLIPG 240

Query: 314 TSTHIQTDRCASSVGSNSFTNEFLKAPLISIARRGKKVEDTDYCSDAESSTGREHEEEDS 373
           TSTHIQ D C SSVGSNSFT++F   P + +AR  KKVEDTDYCSDAESSTGR  EEE+ 
Sbjct: 241 TSTHIQADSCTSSVGSNSFTDDFFNVPFVPVARCVKKVEDTDYCSDAESSTGRGDEEEEP 300

Query: 374 CSYEEILAKFHRSELSVFRSFIRALYASGPLSWEDEGEVSNIRALLHISNDEYLLELRDL 415
           CSYEE+L + HRSELSVFRSFIRALYASGPLSWEDEG+VSNIRA LHISNDEYL+ELR+L
Sbjct: 301 CSYEEVLVRSHRSELSVFRSFIRALYASGPLSWEDEGQVSNIRASLHISNDEYLMELRNL 360

BLAST of Cla97C09G164760 vs. NCBI nr
Match: XP_011659542.1 (uncharacterized protein LOC101203701 isoform X1 [Cucumis sativus])

HSP 1 Score: 465.7 bits (1197), Expect = 4.2e-127
Identity = 254/371 (68.46%), Positives = 283/371 (76.28%), Query Frame = 0

Query: 67  RIRIVGSMRLRKGDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRYLSVEETVERVPRT 126
           RIRI+ SMRLRKGDQVEVL+KKE SSGSWSCAEILSGNG   YSV++LS +E VE+VPR 
Sbjct: 20  RIRIIASMRLRKGDQVEVLNKKEVSSGSWSCAEILSGNGRS-YSVKFLSSDEAVEKVPRK 79

Query: 127 AIRPCPPPVERPNVWFVGDLAEAFHDSSWKQAKIMKIVGVDCYIVRILGSPNLDISVGQS 186
           AIRPCPPP +  N W  GDLAE FH+S WK AKI+ IVGV+ YIVRILGSP LDI VG S
Sbjct: 80  AIRPCPPPFQGSNDWDAGDLAEVFHNSLWKHAKIITIVGVNSYIVRILGSP-LDIMVGSS 139

Query: 187 NLRMRQAWHDGKWFLLQKGFEDSGS-SRNRQTEPNIVKSKDRQLVVTLPAGPRKRPLPSE 246
           NLRMRQAWHDG+W LL K  E+SGS SRNRQ EPN+V+ K+RQLV   PAG RKR LPSE
Sbjct: 140 NLRMRQAWHDGRWILLGKSMEESGSLSRNRQIEPNMVRRKNRQLVAG-PAGSRKRLLPSE 199

Query: 247 SINQNPSVQRRKVTEKDVWCSPSL--------TQELNK---SSN------------AGLR 306
            IN    VQ+RKV E  V C PS+        TQELN    SSN            AGLR
Sbjct: 200 FINHEVFVQKRKVAENVVRCLPSIAITTNMYSTQELNTVRLSSNLPTENTGVTTGDAGLR 259

Query: 307 EGNLNSGTSTHIQTDRCASSVGSNSFTNEFLKAPLISIARRGKKVEDTDYCSDAESSTGR 366
           EG L  GTSTHI  D C SSVGSN FT++F   P +S+ARR KKVEDTDYCSDAES+TGR
Sbjct: 260 EGTLVPGTSTHIHADSCTSSVGSNIFTDDFFNVPFVSVARRVKKVEDTDYCSDAESTTGR 319

Query: 367 EHEEEDSCSYEEILAKFHRSELSVFRSFIRALYASGPLSWEDEGEVSNIRALLHISNDEY 414
             EEE+ CSYEE+L + HRSELSVFRSFIRALYASGPLSWEDEG+VSNIRA LHISNDEY
Sbjct: 320 GDEEEEPCSYEEVLVRSHRSELSVFRSFIRALYASGPLSWEDEGQVSNIRASLHISNDEY 379

BLAST of Cla97C09G164760 vs. ExPASy Swiss-Prot
Match: Q500V5 (Protein AGENET DOMAIN (AGD)-CONTAINING P1 OS=Arabidopsis thaliana OX=3702 GN=AGDP1 PE=1 SV=1)

HSP 1 Score: 49.3 bits (116), Expect = 1.2e-04
Identity = 34/127 (26.77%), Positives = 53/127 (41.73%), Query Frame = 0

Query: 79  GDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRYLSV------EETVERVPRTAIRPCP 138
           G  +EV  ++E    SW  A+++   G     V Y ++      E   E V  + IRP P
Sbjct: 383 GTPIEVSPEEEGFEDSWFLAKLIEYRGKDKCLVEYDNLKAEDGKEPLREEVNVSRIRPLP 442

Query: 139 PPVERPNVWFVGDLAEAFHDSSWKQAKIMKIVGVDCYIVRILGSPNLDISVGQSNLRMRQ 198
                 + +   D   A ++  W    I K++    Y+V    +  L +    S LR+ Q
Sbjct: 443 LESVMVSPFERHDKVNALYNDGWWVGVIRKVLAKSSYLVLFKNTQEL-LKFHHSQLRLHQ 502

Query: 199 AWHDGKW 200
            W DGKW
Sbjct: 503 EWIDGKW 508

BLAST of Cla97C09G164760 vs. ExPASy TrEMBL
Match: A0A1S3CDW0 (uncharacterized protein LOC103499769 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103499769 PE=4 SV=1)

HSP 1 Score: 477.2 bits (1227), Expect = 6.7e-131
Identity = 257/365 (70.41%), Positives = 284/365 (77.81%), Query Frame = 0

Query: 74  MRLRKGDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRYLSVEETVERVPRTAIRPCPP 133
           MRLRKGDQVEVL+KKE SSGSWSCAEILSGNG   YSV++LS +E VE+VPR AIRPCPP
Sbjct: 1   MRLRKGDQVEVLNKKEVSSGSWSCAEILSGNGRS-YSVKFLSSDEAVEKVPRKAIRPCPP 60

Query: 134 PVERPNVWFVGDLAEAFHDSSWKQAKIMKIVGVDCYIVRILGSPNLDISVGQSNLRMRQA 193
           P +  N W  GDLAEAFH+SSWK AKIMKIVGV+ YIVRILGSP LDI VG SNLRMRQA
Sbjct: 61  PFQGSNDWDAGDLAEAFHNSSWKHAKIMKIVGVNRYIVRILGSP-LDIMVGSSNLRMRQA 120

Query: 194 WHDGKWFLLQKGFEDSGS-SRNRQTEPNIVKSKDRQLVVTLPAGPRKRPLPSESINQNPS 253
           WHDG+W LL K  E+SGS SRNRQ EPN+V+SKD+QL V LPAG RKR LPSE IN   S
Sbjct: 121 WHDGRWILLGKSMEESGSLSRNRQIEPNMVRSKDQQL-VALPAGSRKRLLPSEFINHKVS 180

Query: 254 VQRRKVTEKDVWCSPSL--------TQELN---------------KSSNAGLREGNLNSG 313
           VQ+RKVTE DV C PSL        TQE N                + +AGLREG L  G
Sbjct: 181 VQKRKVTENDVRCLPSLAITTNMYSTQEFNTIRLSSNLPTENTGVSTGDAGLREGTLIPG 240

Query: 314 TSTHIQTDRCASSVGSNSFTNEFLKAPLISIARRGKKVEDTDYCSDAESSTGREHEEEDS 373
           TSTHIQ D C SSVGSNSFT++F   P + +AR  KKVEDTDYCSDAESSTGR  EEE+ 
Sbjct: 241 TSTHIQADSCTSSVGSNSFTDDFFNVPFVPVARCVKKVEDTDYCSDAESSTGRGDEEEEP 300

Query: 374 CSYEEILAKFHRSELSVFRSFIRALYASGPLSWEDEGEVSNIRALLHISNDEYLLELRDL 415
           CSYEE+L + HRSELSVFRSFIRALYASGPLSWEDEG+VSNIRA LHISNDEYL+ELR+L
Sbjct: 301 CSYEEVLVRSHRSELSVFRSFIRALYASGPLSWEDEGQVSNIRASLHISNDEYLMELRNL 360

BLAST of Cla97C09G164760 vs. ExPASy TrEMBL
Match: A0A6J1JXT9 (uncharacterized protein LOC111488477 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111488477 PE=4 SV=1)

HSP 1 Score: 465.3 bits (1196), Expect = 2.7e-127
Identity = 254/365 (69.59%), Positives = 281/365 (76.99%), Query Frame = 0

Query: 74  MRLRKGDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRYLSVEETVERVPRTAIRPCPP 133
           MRLRKGD+VEVLS+KE SSGSWSCAEI+SGNG   YSVR+LS  E VERVPR AIRPCPP
Sbjct: 1   MRLRKGDRVEVLSQKEVSSGSWSCAEIISGNGRS-YSVRFLSSVEAVERVPRRAIRPCPP 60

Query: 134 PVERPNVWFVGDLAEAFHDSSWKQAKIMKIVGVDCYIVRILGSPNLDISVGQSNLRMRQA 193
           PV   NVW  GDLAEAFH+SSWKQAKIMKIV V CYIVR+LGSP LD+ VGQSNLRMRQA
Sbjct: 61  PVAGSNVWAAGDLAEAFHNSSWKQAKIMKIVSVHCYIVRLLGSP-LDVLVGQSNLRMRQA 120

Query: 194 WHDGKWFLLQKGFEDSGS-SRNRQTEPNIVKSKDRQLVVTLPAGPRKRPLPSESINQNPS 253
           WHDG+WFLL K    SGS SRN+QT+ N+VKSKD+QLVVT P G RKRPLPS+SI+ N S
Sbjct: 121 WHDGRWFLLGKAIGGSGSLSRNKQTKTNMVKSKDQQLVVTFP-GRRKRPLPSQSISHNLS 180

Query: 254 VQRRKVTEKDVWCSPS--------LTQELN---------------KSSNAGLREGNLNSG 313
           VQ +KVTEKDV C PS        LTQELN                + +AGLREGNL   
Sbjct: 181 VQTKKVTEKDVRCLPSSWIKDDMNLTQELNTIRLSSTFRTENVEVTTGSAGLREGNLIPV 240

Query: 314 TSTHIQTDRCASSVGSNSFTNEFLKAPLISIARRGKKVEDTDYCSDAESSTGREHEEEDS 373
           TSTH Q D CASSVGSN  T+EF K P +S+AR  K VED D  SDAES+TGR H EEDS
Sbjct: 241 TSTHSQADSCASSVGSNRSTDEFFKDPFVSVARCSKDVEDVDCHSDAESATGRGHGEEDS 300

Query: 374 CSYEEILAKFHRSELSVFRSFIRALYASGPLSWEDEGEVSNIRALLHISNDEYLLELRDL 415
           CSY+E+LA+FHRSELS F SFIRALYASGPLSWEDE  VSNI   LHISNDEYL+ELR+L
Sbjct: 301 CSYKEVLARFHRSELSAFHSFIRALYASGPLSWEDEAHVSNICDSLHISNDEYLMELRNL 360

BLAST of Cla97C09G164760 vs. ExPASy TrEMBL
Match: A0A0A0KBX9 (ENT domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G433350 PE=4 SV=1)

HSP 1 Score: 456.4 bits (1173), Expect = 1.2e-124
Identity = 249/364 (68.41%), Positives = 277/364 (76.10%), Query Frame = 0

Query: 74  MRLRKGDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRYLSVEETVERVPRTAIRPCPP 133
           MRLRKGDQVEVL+KKE SSGSWSCAEILSGNG   YSV++LS +E VE+VPR AIRPCPP
Sbjct: 1   MRLRKGDQVEVLNKKEVSSGSWSCAEILSGNGRS-YSVKFLSSDEAVEKVPRKAIRPCPP 60

Query: 134 PVERPNVWFVGDLAEAFHDSSWKQAKIMKIVGVDCYIVRILGSPNLDISVGQSNLRMRQA 193
           P +  N W  GDLAE FH+S WK AKI+ IVGV+ YIVRILGSP LDI VG SNLRMRQA
Sbjct: 61  PFQGSNDWDAGDLAEVFHNSLWKHAKIITIVGVNSYIVRILGSP-LDIMVGSSNLRMRQA 120

Query: 194 WHDGKWFLLQKGFEDSGS-SRNRQTEPNIVKSKDRQLVVTLPAGPRKRPLPSESINQNPS 253
           WHDG+W LL K  E+SGS SRNRQ EPN+V+ K+RQLV   PAG RKR LPSE IN    
Sbjct: 121 WHDGRWILLGKSMEESGSLSRNRQIEPNMVRRKNRQLVAG-PAGSRKRLLPSEFINHEVF 180

Query: 254 VQRRKVTEKDVWCSPSL--------TQELNK---SSN------------AGLREGNLNSG 313
           VQ+RKV E  V C PS+        TQELN    SSN            AGLREG L  G
Sbjct: 181 VQKRKVAENVVRCLPSIAITTNMYSTQELNTVRLSSNLPTENTGVTTGDAGLREGTLVPG 240

Query: 314 TSTHIQTDRCASSVGSNSFTNEFLKAPLISIARRGKKVEDTDYCSDAESSTGREHEEEDS 373
           TSTHI  D C SSVGSN FT++F   P +S+ARR KKVEDTDYCSDAES+TGR  EEE+ 
Sbjct: 241 TSTHIHADSCTSSVGSNIFTDDFFNVPFVSVARRVKKVEDTDYCSDAESTTGRGDEEEEP 300

Query: 374 CSYEEILAKFHRSELSVFRSFIRALYASGPLSWEDEGEVSNIRALLHISNDEYLLELRDL 414
           CSYEE+L + HRSELSVFRSFIRALYASGPLSWEDEG+VSNIRA LHISNDEYL+ELR+L
Sbjct: 301 CSYEEVLVRSHRSELSVFRSFIRALYASGPLSWEDEGQVSNIRASLHISNDEYLMELRNL 360

BLAST of Cla97C09G164760 vs. ExPASy TrEMBL
Match: A0A6J1GP59 (uncharacterized protein LOC111455870 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111455870 PE=4 SV=1)

HSP 1 Score: 456.4 bits (1173), Expect = 1.2e-124
Identity = 250/365 (68.49%), Positives = 279/365 (76.44%), Query Frame = 0

Query: 74  MRLRKGDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRYLSVEETVERVPRTAIRPCPP 133
           MRLRKGD+VEVLS+KE SSGSWSCAEI+SGNG   YSVR+LS  E VERVPR AIRPCPP
Sbjct: 1   MRLRKGDRVEVLSQKEVSSGSWSCAEIISGNGRS-YSVRFLSSVEAVERVPRRAIRPCPP 60

Query: 134 PVERPNVWFVGDLAEAFHDSSWKQAKIMKIVGVDCYIVRILGSPNLDISVGQSNLRMRQA 193
           PV   NVW  GDLAEAFH+ SWKQAKIMKIV V CYIVR+LGSP LD+ V QSNLRMRQA
Sbjct: 61  PVAGSNVWAAGDLAEAFHNFSWKQAKIMKIVSVHCYIVRLLGSP-LDVLVRQSNLRMRQA 120

Query: 194 WHDGKWFLLQKGFEDSGS-SRNRQTEPNIVKSKDRQLVVTLPAGPRKRPLPSESINQNPS 253
           WHDG+WFLL K    SGS SRN+QT+PN+VKSKD+QLVVT P G RKRPLPS+SI+ N S
Sbjct: 121 WHDGRWFLLGKAIGGSGSLSRNKQTKPNMVKSKDQQLVVTFP-GRRKRPLPSQSISHNLS 180

Query: 254 VQRRKVTEKDVWCSPS--------LTQELN---------------KSSNAGLREGNLNSG 313
           VQ +KVTEKDV C PS        LTQELN                + +AGLREGN+   
Sbjct: 181 VQTKKVTEKDVRCLPSSWIKDDMNLTQELNTIRLSSTFRTENVEVTTGSAGLREGNVIPV 240

Query: 314 TSTHIQTDRCASSVGSNSFTNEFLKAPLISIARRGKKVEDTDYCSDAESSTGREHEEEDS 373
           TSTHIQ D CASSVGSN  T++F K P +S+ARR K   D D  SDAES+TGR H EEDS
Sbjct: 241 TSTHIQADSCASSVGSNRSTDDFFKDPFVSVARRSK---DVDCHSDAESATGRGHGEEDS 300

Query: 374 CSYEEILAKFHRSELSVFRSFIRALYASGPLSWEDEGEVSNIRALLHISNDEYLLELRDL 415
           CSY+E+LA+FHRSELS F SFIR LYASGPLSWEDE  VSNI   LHISNDEYL+ELR+L
Sbjct: 301 CSYKEVLARFHRSELSAFHSFIRTLYASGPLSWEDEAHVSNICDSLHISNDEYLMELRNL 359

BLAST of Cla97C09G164760 vs. ExPASy TrEMBL
Match: A0A6J1DFY9 (uncharacterized protein LOC111020085 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111020085 PE=4 SV=1)

HSP 1 Score: 438.7 bits (1127), Expect = 2.7e-119
Identity = 244/374 (65.24%), Positives = 276/374 (73.80%), Query Frame = 0

Query: 74  MRLRKGDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRY----LSVEETVERVPRTAIR 133
           MRL+KGDQVEVLSKK+ S GSWSCAEI+SGNGH  YSVRY    ++ EE VERVPR+AIR
Sbjct: 1   MRLKKGDQVEVLSKKQVSGGSWSCAEIISGNGH-TYSVRYRSFPMTPEEAVERVPRSAIR 60

Query: 134 PCPPPVERPNVWFVGDLAEAFHDSSWKQAKIMKIVGVDCYIVRILGSPNLDISVGQSNLR 193
           PCPPPVE PNVW  GDLAE FH+ SWKQAKI+KIVGVD YI R+LGSP LD+ V QS+LR
Sbjct: 61  PCPPPVEGPNVWAAGDLAEVFHNFSWKQAKIIKIVGVDSYIARLLGSP-LDVMVCQSHLR 120

Query: 194 MRQAWHDGKWFLLQKGFEDSG-SSRNRQT----EPNIVKSKDRQLVVTLPAGPRKRPLPS 253
            RQAWH GKWF+L K  E SG  SR RQT    EPNI+KSKDRQLVV LP  P+KR LP 
Sbjct: 121 TRQAWHGGKWFVLGKAPELSGLLSRKRQTSVGNEPNILKSKDRQLVVVLPTRPQKRQLPR 180

Query: 254 ESINQNPSVQRRKVTEKDVWCSPSLT---------QELNK---------------SSNAG 313
            S +Q  S+++RKV EKDV   P L          QEL++               + + G
Sbjct: 181 NSEDQRVSIKKRKVAEKDVRYLPLLARTTDDMYLPQELHRIRSNSPFPTENIEVSTGDVG 240

Query: 314 LREGNLNSGTSTHIQTDRCASSVGSNSFTNEFLKAPLISIARRGKKVEDTDYCSDAESST 373
           LREGNL  GTSTH  TD CASSVGSNS T++F K P IS+A    KVED DY SDAESST
Sbjct: 241 LREGNLIPGTSTHSYTDSCASSVGSNSSTDDFFKVPFISVAHHSNKVEDEDYYSDAESST 300

Query: 374 GREHEEEDSCSYEEILAKFHRSELSVFRSFIRALYASGPLSWEDEGEVSNIRALLHISND 415
           G  H E DS S EE+++K HRSELSVFRSFIRALYASGPLSWEDEG+VSNIRA LHISND
Sbjct: 301 GWGHGEGDSSSNEEVVSKSHRSELSVFRSFIRALYASGPLSWEDEGQVSNIRASLHISND 360

BLAST of Cla97C09G164760 vs. TAIR 10
Match: AT4G32440.1 (Plant Tudor-like RNA-binding protein )

HSP 1 Score: 163.7 bits (413), Expect = 3.2e-40
Identity = 131/377 (34.75%), Positives = 191/377 (50.66%), Query Frame = 0

Query: 74  MRLRKGDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRYLSV-----EETVERVPRTAI 133
           MR+RKG +VEV S KEA  G+W CAEI+SGNGH  Y+VR+ S      E  +E+VPR  I
Sbjct: 1   MRIRKGSRVEVFSNKEAPYGAWRCAEIVSGNGH-TYNVRFYSFQIEHEEAVMEKVPRKII 60

Query: 134 RPCPPPVERPNVWFVGDLAEAFHDSSWKQAKIMKIVGVDCYIVRILGSPNLDISVGQSNL 193
           RPCPP V+    W  G+L E   + SWK A + + +    Y+VR+LG+P  +++  + NL
Sbjct: 61  RPCPPLVDVER-WDTGELVEVLDNFSWKAATVREELSGHYYVVRLLGTPE-ELTFHKVNL 120

Query: 194 RMRQAWHDGKWFLLQKGFEDSGSSRNRQTEPNIVKSKDRQLVVTLP--------AGPRKR 253
           R R++W D +W  + K    SGS ++     + V  K +    ++P        A   KR
Sbjct: 121 RARKSWQDERWVAIGK---ISGSLKSSTLTGSDVHQKLQPHRNSMPLHEPSVVSARLLKR 180

Query: 254 PLP------SESINQNPSVQRRKVTEKD------VWCSPS-------LTQELNKSSNAGL 313
           P P      +ES   NP   R    E        + C P        +   LN       
Sbjct: 181 PSPYNWSECAESCTGNPKKMRSLEKEGQQQKVDAISCRPENRGGKSHVQASLNNHKTGYC 240

Query: 314 REGNLNS-GTSTHIQTDRCAS----SVGSNSFTNEFLKAPLISIARRGKKVEDTDYCSDA 373
           +   + S G S  ++ D C+     SVGS S T+ + ++ +      G   +     SDA
Sbjct: 241 QIVRVRSKGFSESVRADDCSDSDVCSVGSCSATS-YDESNMPPCMLDGSTQQADSCSSDA 300

Query: 374 ESSTGREHEEE-DSCSYEEILAKFHRSELSVFRSFIRALYASGPLSWEDEGEVSNIRALL 413
           ESS G   E      S  +      RSEL  +RS +  L++SGPLSWE E  ++++R  L
Sbjct: 301 ESSCGLGEEPRWKHSSVGDGARNSCRSELYSYRSTLGELFSSGPLSWEQEASLTDLRLSL 360

BLAST of Cla97C09G164760 vs. TAIR 10
Match: AT5G20030.1 (Plant Tudor-like RNA-binding protein )

HSP 1 Score: 163.3 bits (412), Expect = 4.1e-40
Identity = 121/358 (33.80%), Positives = 179/358 (50.00%), Query Frame = 0

Query: 74  MRLRKGDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRYLSVEETVERVPRTAIRPCPP 133
           MR  KG +VEVLSK    SG+W  AEI+SGNGH+ Y+V Y    +  ERVPR ++RP PP
Sbjct: 1   MRFNKGTKVEVLSKSSVPSGAWRSAEIISGNGHY-YTVMY-DHNDGTERVPRKSMRPEPP 60

Query: 134 PVERPNVWFVGDLAEAFHDSSWKQAKIMKIVGVDCYIVRILGSPNLDISVGQSNLRMRQA 193
            ++  + W  GD+ E F   SWK A + K++G  C++VR+LGS +L   V +S++R+RQ+
Sbjct: 61  RLQVLDAWCPGDILEVFQSCSWKMAIVSKVLGNGCFLVRLLGS-SLKFKVTKSDIRVRQS 120

Query: 194 WHDGKWFLLQKGFEDSGSSRNRQTEPNIVKSKDRQLVVTLPAGPRKRPLPSESINQNPSV 253
           W D +W ++ +G     S  + QT    ++ K           P+   + SES       
Sbjct: 121 WQDNEWIMIGQG----TSRLSAQTSTGELRRK---------VNPKGDYISSES------- 180

Query: 254 QRRKVTEKDVWCSP-------SLTQELNKSSNAGLREGNLNSGTSTHIQ-TDRCASSVGS 313
            + K+ E DV  S        SL +  N++                  +  +  ASSVGS
Sbjct: 181 -KDKLDESDVPLSVGLKKRTYSLVEPHNQTRALAAYPPRFREEVKEEEEDRESVASSVGS 240

Query: 314 NSFTNEFLKAPLISIARRGKKVEDTDYCSDAESSTGREHEEEDSCSYEEI---------- 373
                + L A   +    G         SD ESS         SC Y ++          
Sbjct: 241 CCMDTDGLSAVSFNPIETGNS-------SDTESS---------SCGYGKVKKLVVPRKGS 300

Query: 374 -LAKFHRSELSVFRSFIRALYASGP-LSWEDEGEVSNIRALLHISNDEYLLELRDLIS 412
             A  HR EL  +RS I  L+ASGP ++WE E  ++N+R  L+ISN+E+L+++R+LIS
Sbjct: 301 EAADVHRLELDAYRSSIERLHASGPIITWEQETWITNLRLKLNISNEEHLMQIRNLIS 318

BLAST of Cla97C09G164760 vs. TAIR 10
Match: AT4G32440.2 (Plant Tudor-like RNA-binding protein )

HSP 1 Score: 154.5 bits (389), Expect = 1.9e-37
Identity = 126/370 (34.05%), Positives = 184/370 (49.73%), Query Frame = 0

Query: 74  MRLRKGDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRYLSV-----EETVERVPRTAI 133
           MR+RKG +VEV S KEA  G+W CAEI+SGNGH  Y+VR+ S      E  +E+VPR  I
Sbjct: 1   MRIRKGSRVEVFSNKEAPYGAWRCAEIVSGNGH-TYNVRFYSFQIEHEEAVMEKVPRKII 60

Query: 134 RPCPPPVERPNVWFVGDLAEAFHDSSWKQAKIMKIVGVDCYIVRILGSPNLDISVGQSNL 193
           RPCPP V+    W  G+L E   + SWK A + + +    Y+VR+LG+P  +++  + NL
Sbjct: 61  RPCPPLVDVER-WDTGELVEVLDNFSWKAATVREELSGHYYVVRLLGTPE-ELTFHKVNL 120

Query: 194 RMRQAWHDGKWFLLQKGFEDSGSSRNRQTEPNIVKSKDRQLVVTLP--------AGPRKR 253
           R R++W D +W  + K    SGS ++     + V  K +    ++P        A   KR
Sbjct: 121 RARKSWQDERWVAIGK---ISGSLKSSTLTGSDVHQKLQPHRNSMPLHEPSVVSARLLKR 180

Query: 254 PLP------SESINQNPSVQRRKVTEKD------VWCSPS-------LTQELNKSSNAGL 313
           P P      +ES   NP   R    E        + C P        +   LN       
Sbjct: 181 PSPYNWSECAESCTGNPKKMRSLEKEGQQQKVDAISCRPENRGGKSHVQASLNNHKTGYC 240

Query: 314 REGNLNS-GTSTHIQTDRCAS----SVGSNSFTNEFLKAPLISIARRGKKVEDTDYCSDA 373
           +   + S G S  ++ D C+     SVGS S T+ + ++ +      G   +     SDA
Sbjct: 241 QIVRVRSKGFSESVRADDCSDSDVCSVGSCSATS-YDESNMPPCMLDGSTQQADSCSSDA 300

Query: 374 ESSTGREHEEE-DSCSYEEILAKFHRSELSVFRSFIRALYASGPLSWEDEGEVSNIRALL 406
           ESS G   E      S  +      RSEL  +RS +  L++SGPLSWE E  ++++R  L
Sbjct: 301 ESSCGLGEEPRWKHSSVGDGARNSCRSELYSYRSTLGELFSSGPLSWEQEASLTDLRLSL 360

BLAST of Cla97C09G164760 vs. TAIR 10
Match: AT4G32440.3 (Plant Tudor-like RNA-binding protein )

HSP 1 Score: 118.6 bits (296), Expect = 1.2e-26
Identity = 77/201 (38.31%), Positives = 109/201 (54.23%), Query Frame = 0

Query: 74  MRLRKGDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRYLSV-----EETVERVPRTAI 133
           MR+RKG +VEV S KEA  G+W CAEI+SGNGH  Y+VR+ S      E  +E+VPR  I
Sbjct: 1   MRIRKGSRVEVFSNKEAPYGAWRCAEIVSGNGH-TYNVRFYSFQIEHEEAVMEKVPRKII 60

Query: 134 RPCPPPVERPNVWFVGDLAEAFHDSSWKQAKIMKIVGVDCYIVRILGSPNLDISVGQSNL 193
           RPCPP V+    W  G+L E   + SWK A + + +    Y+VR+LG+P  +++  + NL
Sbjct: 61  RPCPPLVDVER-WDTGELVEVLDNFSWKAATVREELSGHYYVVRLLGTPE-ELTFHKVNL 120

Query: 194 RMRQAWHDGKWFLLQKGFEDSGSSRNRQTEPNIVKSKDRQLVVTLP--------AGPRKR 253
           R R++W D +W  + K    SGS ++     + V  K +    ++P        A   KR
Sbjct: 121 RARKSWQDERWVAIGK---ISGSLKSSTLTGSDVHQKLQPHRNSMPLHEPSVVSARLLKR 180

Query: 254 PLP------SESINQNPSVQR 256
           P P      +ES   NP   R
Sbjct: 181 PSPYNWSECAESCTGNPKKMR 195

BLAST of Cla97C09G164760 vs. TAIR 10
Match: AT2G25590.1 (Plant Tudor-like protein )

HSP 1 Score: 114.8 bits (286), Expect = 1.7e-25
Identity = 64/137 (46.72%), Positives = 89/137 (64.96%), Query Frame = 0

Query: 74  MRLRKGDQVEVLSKKEASSGSWSCAEILSGNGHFWYSVRYLSVE----ETVE-RVPRTAI 133
           MR R+G +VEV S KEAS G W  AEI+SGNGH  Y+VRY S E    E VE RVPR  I
Sbjct: 1   MRFRRGSRVEVFSIKEASYGVWRSAEIISGNGH-TYNVRYYSFEIANNEVVEDRVPRKII 60

Query: 134 RPCPPPVERPNVWFVGDLAEAFHDS-SWKQAKIMKIVGVDCYIVRILGSPNLDISVGQSN 193
           RPCPP V+  + W  G+L E   ++ SWK A +++++    Y+VR+LG+   +++V +  
Sbjct: 61  RPCPPQVD-VDRWEAGELVEVLDNNISWKTATVLEVLSGRYYVVRLLGA-KAELTVHKVY 120

Query: 194 LRMRQAWHDGKWFLLQK 205
           LR RQ+W D +W ++ K
Sbjct: 121 LRARQSWQDERWVMIGK 134

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity  Description
XP_038899293.1  3.9e-157  82.39     uncharacterized protein LOC120086627 [Benincasa hispida] >XP_038899294.1 unchara...
KAG7014119.1    2.3e-133  66.92     DUF724 domain-containing protein 3, partial [Cucurbita argyrosperma subsp. argyr...
XP_038899468.1  6.7e-133  70.88     uncharacterized protein LOC120086754 isoform X1 [Benincasa hispida]
XP_008461067.1  1.4e-130  70.41     PREDICTED: uncharacterized protein LOC103499769 isoform X1 [Cucumis melo]
XP_011659542.1  4.2e-127  68.46     uncharacterized protein LOC101203701 isoform X1 [Cucumis sativus]

ExPASy Swiss-Prot
Match Name      E-value   Identity  Description
Q500V5          1.2e-04   26.77     Protein AGENET DOMAIN (AGD)-CONTAINING P1 OS=Arabidopsis thaliana OX=3702 GN=AGD...

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A1S3CDW0      6.7e-131  70.41     uncharacterized protein LOC103499769 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A6J1JXT9      2.7e-127  69.59     uncharacterized protein LOC111488477 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A0A0KBX9      1.2e-124  68.41     ENT domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_7G433350 PE=4 SV...
A0A6J1GP59      1.2e-124  68.49     uncharacterized protein LOC111455870 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1DFY9      2.7e-119  65.24     uncharacterized protein LOC111020085 isoform X1 OS=Momordica charantia OX=3673 G...

TAIR 10
Match Name      E-value   Identity  Description
AT4G32440.1     3.2e-40   34.75     Plant Tudor-like RNA-binding protein
AT5G20030.1     4.1e-40   33.80     Plant Tudor-like RNA-binding protein
AT4G32440.2     1.9e-37   34.05     Plant Tudor-like RNA-binding protein
AT4G32440.3     1.2e-26   38.31     Plant Tudor-like RNA-binding protein
AT2G25590.1     1.7e-25   46.72     Plant Tudor-like protein

InterPro
Analysis Name: InterPro Annotations of Watermelon (97103) v2.5
Date Performed: 2022-01-31
IPR Term   IPR Description              Source       Source Term     Source Description                        Alignment
IPR014002  Agenet domain, plant type    SMART        SM00743         agenet_At_2                               coord: 74..136; e-value: 5.8E-11; score: 52.4
IPR014002  Agenet domain, plant type    SMART        SM00743         agenet_At_2                               coord: 139..196; e-value: 6.5E-6; score: 35.6
IPR005491  ENT domain                   SMART        SM01191         ENT_2                                     coord: 355..414; e-value: 4.4E-6; score: 36.2
IPR005491  ENT domain                   PROSITE      PS51138         ENT                                       coord: 355..415; score: 11.379974
IPR036142  ENT domain-like superfamily  GENE3D       1.10.1240.40    ENT domain                                coord: 349..414; e-value: 8.5E-11; score: 44.0
IPR036142  ENT domain-like superfamily  SUPERFAMILY  158639          ENT-like                                  coord: 353..411
IPR008395  Agenet-like domain           PFAM         PF05641         Agenet                                    coord: 79..133; e-value: 2.2E-11; score: 44.0
None       No IPR available             MOBIDB_LITE  mobidb-lite     disorder_prediction                       coord: 234..255
None       No IPR available             PANTHER      PTHR31917       AGENET DOMAIN-CONTAINING PROTEIN-RELATED  coord: 74..312
None       No IPR available             PANTHER      PTHR31917:SF63  JHL25H03.10-LIKE PROTEIN                  coord: 74..312
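
To make the domain layout easier to read, the sketch below records the three SMART hits listed above and derives their lengths and the spacing between them. It assumes the coordinates are inclusive residue positions on the protein; the `Hit` type and `overlaps` helper are hypothetical names introduced only for this illustration.

```python
from typing import NamedTuple

class Hit(NamedTuple):
    source: str
    name: str
    start: int
    end: int

    @property
    def length(self):
        # Assumption: start and end are both inclusive residue positions.
        return self.end - self.start + 1

def overlaps(a, b):
    """True if two hits share at least one residue position."""
    return a.start <= b.end and b.start <= a.end

# SMART hits copied from the InterPro annotations above.
hits = [
    Hit("SMART", "agenet_At_2", 74, 136),
    Hit("SMART", "agenet_At_2", 139, 196),
    Hit("SMART", "ENT_2", 355, 414),
]

for hit in hits:
    print(hit.name, f"{hit.start}..{hit.end}", hit.length)

# Residues between the end of the second Agenet repeat and the start of the ENT domain.
linker = hits[2].start - hits[1].end - 1
print("linker residues:", linker)
```

Read this way, the protein carries two adjacent, non-overlapping Agenet repeats near the N-terminal half and a C-terminal ENT domain, separated by a longer stretch that includes the predicted disordered region at 234..255.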

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C09G164760.1  Cla97C09G164760.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016020      membrane