Cla97C06G110940 (gene) Watermelon (97103) v2

Name: Cla97C06G110940
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Plant protein of unknown function (DUF247)
Location: Cla97Chr06 : 1603601 .. 1606166 (-)
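
The location above gives a start and end coordinate on chromosome Cla97Chr06. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for GFF-style annotations, not stated on this page), the genomic span of the gene can be computed as follows. The function name `feature_length` is hypothetical.

```python
def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Coordinates taken from the Location field of this record.
gene_start = 1603601
gene_end = 1606166

gene_length = feature_length(gene_start, gene_end)
print(f"Gene span: {gene_length} bp")
```

Under that assumption the span covers both exons and the intron; the strand marker (-) does not affect the length.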
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACGCAGCCACAGCTGCCCATCAGCATTAGTAGAAACGATAGTGCTCAGTTGGTGATTGAGGTGCAGGAAAATCTCAATAAACTGGGGAAATCAGTACTTGCTACGGACGAGAATACCTTCAATCGTTCGATCTACAGAATACCAACATTCATGAGAGAAGTTCATCCGAAAGCGTTCGAGCCACAGTTGGTGTCGTTTGGGCCATACCACCATGGGAAACCACAATATGCTTCAATGGAACTGGAAAAACAGAAAGCGTTTCGCCGTTTGAAAACCAACCCAGAGATGTTAGAGTCGATTGTGCAGACTGTGACTGACAATTTGCAAAACCTGTTGGGAGCATACGATAAGCTAATTGAGGGTGAATGGGCGAAACGTCCTGCCAAGTTCTTGGAGGTGTTGATTGTGGATGGCTGTTTCATGCTAAGTTTCCTCAAGAAATGTCCCCCCTCCCTAAGTTCTATGAGTTGGGATATAAAGCGGGACATGCTTCTGCTTGAGAATCAGCTACCCATGCTGCTTCTTGACGATTTGTATTCCATTTTACCAAACATGGTACCTCCTAAACTTTTTAAATGTTTATTTCTTTTCAAAATCAGCTTTCAATTTTGCTTCTAATATGTTTCTTCTTTAATTTTAAATTTTATGTCAATTAAGTTTGACATTTTTTTTTCTCAAAAAAGTCAGGATCTACTACCTATAGACTCAAGATTGAAACTTCTTTATACCCCTATTACACATAAAATTCAAATTTGAATTTAATAGATCATCAATTAATTATTTGAAAAAAAAAATATAAAATTTGAATATTTTAGAATTTTATTAAACATAAAATTGAAAGTTTATAAATGTATTAGACTCATATCGTTGTAGTTTTAAAAATAATTGTTGTGCTTTTAAAAAATTTAGTTGCCATACAAAATAGATTCGTGTTTGATACAATTTAAATTTTAAAAATTAAAAAAAAAAAAAAGAAAGAAAAAAAAAAAGCAATTAACGACATATGACTTAGCTATAAAGATTCAATGAAACATGGTATATAATATGTATAAGTTGATATTTAGTATATAAATTTCATAGGTTAATTATTTATAATTTTTATAGTTTAAATTACAAATTATATATTTGTATTTCTTTAATTATATGAAAAAGGTATAGAAAAGAAACCATTGTTATATAAAATTCAATTTAATAATGTTAATATAAGTTCATTTTTTTAAAGAAAGTTACATGATGTAAATTGTAAAATATTTACATATGCATATGCATGCAAAATTAATTAAAGATGTCAACGAAAATTTGAATTTGTGTGAGCATAGTTTAGCCAGGAAAAACTACTTGGGAAAACCTCTCACTTTTACTATAGGAATAAGGCAAAAAAATAATTGACACAAGAATTTAGGTGAAAAACTCCAAACCCGAAAAAAAACCACGACCCACTAGAAATAAAGAAATTCACTACGTGTAAAATTGTTACAATCACACAAAATAATTCTCTCTTTTGAACCTTTTGACTACAAACACTTTTTTTTTTTCACTCTCAAACTAAAGAATACAAAAAGAATTTAATTAGAGTTAGCAGACTAATTAAGTTAAAATGTTTCTAATTAGGACAATTGAAAAACAAAGGCACACACTCCTCTTCTATAGTGCTTAGAATCCATGACTATTGTAATTAGCCAAAAAAAAAAAAAATGAAAGAAAAACATGCATATTTTAAAATCGACTTCTAACACGTGGGTGCACGTGCAGGGATTGAACAAGTTATTATTAAATAAGAATTTGCATGGTTCTTTATTTGCTTCCGGTGATTATATTGCACATGCAGGATGGAAGACTGGCATGGTTTATATGCAAGTCAATGTGCTTCGCATCAGGGGAAGCAGTATCAATGGGAGGAAACTTGCACATTTTGGATATGTATAGAATGTCACTATTGGGTACTACGATATTAGAGAATAAGGATGGTAGTATGGAGAGAAAAACGAAGAAATCCG
GGCCAGAATATCAAGTGATTCGACATGCAACACAGCTTCGTGACGCGGGGATAGATTTCCAGGAAAGCGGCACAAAGAGCCTTACTGACGTGAGTTTTGACTCCAAAAAAGGGGTGCTGAAAATTCCACAATTAGTGGTGGACGATGATAGTGAAGCCAGTTTATTAAATGTGATGGCATTTGAGAAACTACATGAGGAGGCTGGAAGCGAAGTCACATCTTTCGTAGTATTGATGAACAATTTGATAGACGTGGATGAAGATGTGGCTTTGCTGTCTTCTAAAAATATATTAGCTAACGCGCTTGGGGATGATCAAAGTGCGGCGGAGTTGTTTGGTTCGCTTGGGAAAGGGGCGGCTATGGATTTGGAAAGCCACATTACAGAGGTGCATCACCAGGTGAATCTGCATTGCCGCCGGCCATGGAATGAATGGTGTGCAAGTCTCAAACATAATTACTTTCATAACCCATGGGCCATTATCTCACTCGTTGCCGCTATTCTCGGTTTCGCCATCCTTATTGTTCAAGCCGTCTACCAAATTGTTGATTATTACCGAGGGAATTAA

mRNA sequence

ATGACGCAGCCACAGCTGCCCATCAGCATTAGTAGAAACGATAGTGCTCAGTTGGTGATTGAGGTGCAGGAAAATCTCAATAAACTGGGGAAATCAGTACTTGCTACGGACGAGAATACCTTCAATCGTTCGATCTACAGAATACCAACATTCATGAGAGAAGTTCATCCGAAAGCGTTCGAGCCACAGTTGGTGTCGTTTGGGCCATACCACCATGGGAAACCACAATATGCTTCAATGGAACTGGAAAAACAGAAAGCGTTTCGCCGTTTGAAAACCAACCCAGAGATGTTAGAGTCGATTGTGCAGACTGTGACTGACAATTTGCAAAACCTGTTGGGAGCATACGATAAGCTAATTGAGGGTGAATGGGCGAAACGTCCTGCCAAGTTCTTGGAGGTGTTGATTGTGGATGGCTGTTTCATGCTAAGTTTCCTCAAGAAATGTCCCCCCTCCCTAAGTTCTATGAGTTGGGATATAAAGCGGGACATGCTTCTGCTTGAGAATCAGCTACCCATGCTGCTTCTTGACGATTTGTATTCCATTTTACCAAACATGGATGGAAGACTGGCATGGTTTATATGCAAGTCAATGTGCTTCGCATCAGGGGAAGCAGTATCAATGGGAGGAAACTTGCACATTTTGGATATGTATAGAATGTCACTATTGGGTACTACGATATTAGAGAATAAGGATGGTAGTATGGAGAGAAAAACGAAGAAATCCGGGCCAGAATATCAAGTGATTCGACATGCAACACAGCTTCGTGACGCGGGGATAGATTTCCAGGAAAGCGGCACAAAGAGCCTTACTGACGTGAGTTTTGACTCCAAAAAAGGGGTGCTGAAAATTCCACAATTAGTGGTGGACGATGATAGTGAAGCCAGTTTATTAAATGTGATGGCATTTGAGAAACTACATGAGGAGGCTGGAAGCGAAGTCACATCTTTCGTAGTATTGATGAACAATTTGATAGACGTGGATGAAGATGTGGCTTTGCTGTCTTCTAAAAATATATTAGCTAACGCGCTTGGGGATGATCAAAGTGCGGCGGAGTTGTTTGGTTCGCTTGGGAAAGGGGCGGCTATGGATTTGGAAAGCCACATTACAGAGGTGCATCACCAGGTGAATCTGCATTGCCGCCGGCCATGGAATGAATGGTGTGCAAGTCTCAAACATAATTACTTTCATAACCCATGGGCCATTATCTCACTCGTTGCCGCTATTCTCGGTTTCGCCATCCTTATTGTTCAAGCCGTCTACCAAATTGTTGATTATTACCGAGGGAATTAA

Coding sequence (CDS)

ATGACGCAGCCACAGCTGCCCATCAGCATTAGTAGAAACGATAGTGCTCAGTTGGTGATTGAGGTGCAGGAAAATCTCAATAAACTGGGGAAATCAGTACTTGCTACGGACGAGAATACCTTCAATCGTTCGATCTACAGAATACCAACATTCATGAGAGAAGTTCATCCGAAAGCGTTCGAGCCACAGTTGGTGTCGTTTGGGCCATACCACCATGGGAAACCACAATATGCTTCAATGGAACTGGAAAAACAGAAAGCGTTTCGCCGTTTGAAAACCAACCCAGAGATGTTAGAGTCGATTGTGCAGACTGTGACTGACAATTTGCAAAACCTGTTGGGAGCATACGATAAGCTAATTGAGGGTGAATGGGCGAAACGTCCTGCCAAGTTCTTGGAGGTGTTGATTGTGGATGGCTGTTTCATGCTAAGTTTCCTCAAGAAATGTCCCCCCTCCCTAAGTTCTATGAGTTGGGATATAAAGCGGGACATGCTTCTGCTTGAGAATCAGCTACCCATGCTGCTTCTTGACGATTTGTATTCCATTTTACCAAACATGGATGGAAGACTGGCATGGTTTATATGCAAGTCAATGTGCTTCGCATCAGGGGAAGCAGTATCAATGGGAGGAAACTTGCACATTTTGGATATGTATAGAATGTCACTATTGGGTACTACGATATTAGAGAATAAGGATGGTAGTATGGAGAGAAAAACGAAGAAATCCGGGCCAGAATATCAAGTGATTCGACATGCAACACAGCTTCGTGACGCGGGGATAGATTTCCAGGAAAGCGGCACAAAGAGCCTTACTGACGTGAGTTTTGACTCCAAAAAAGGGGTGCTGAAAATTCCACAATTAGTGGTGGACGATGATAGTGAAGCCAGTTTATTAAATGTGATGGCATTTGAGAAACTACATGAGGAGGCTGGAAGCGAAGTCACATCTTTCGTAGTATTGATGAACAATTTGATAGACGTGGATGAAGATGTGGCTTTGCTGTCTTCTAAAAATATATTAGCTAACGCGCTTGGGGATGATCAAAGTGCGGCGGAGTTGTTTGGTTCGCTTGGGAAAGGGGCGGCTATGGATTTGGAAAGCCACATTACAGAGGTGCATCACCAGGTGAATCTGCATTGCCGCCGGCCATGGAATGAATGGTGTGCAAGTCTCAAACATAATTACTTTCATAACCCATGGGCCATTATCTCACTCGTTGCCGCTATTCTCGGTTTCGCCATCCTTATTGTTCAAGCCGTCTACCAAATTGTTGATTATTACCGAGGGAATTAA

Protein sequence

MTQPQLPISISRNDSAQLVIEVQENLNKLGKSVLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRLKTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWDIKRDMLLLENQLPMLLLDDLYSILPNMDGRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQESGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDYYRGN
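
The protein sequence above appears consistent with a translation of the CDS under the standard genetic code. The sketch below illustrates this on the first and last few codons of the CDS only; the function name `translate` and the choice of the standard codon table are assumptions for illustration, not something this page states.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 18 nt and last 12 nt of the CDS shown in this record.
cds_start = "ATGACGCAGCCACAGCTG"
cds_end = "CGAGGGAATTAA"

n_terminus = translate(cds_start)
c_terminus = translate(cds_end)
print(n_terminus, c_terminus)
```

Passing the full CDS string to `translate` in the same way would give a sequence to compare against the protein listed above.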
BLAST of Cla97C06G110940 vs. NCBI nr
Match: XP_023004238.1 (UPF0481 protein At3g47200-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 411.4 bits (1056), Expect = 3.8e-111
Identity = 233/440 (52.95%), Positives = 307/440 (69.77%), Query Frame = 0

Query: 3   QPQLPISISRNDS----AQLVIEVQENLNKLGKSV----LATDENTFNRSIYRIPTFMRE 62
           QP L + + R+D+    A +V+ V+  L++L  S     +  ++++   SIY+IP FM +
Sbjct: 4   QPPLQVMVGRDDNIGDKADMVVRVKGTLDQLLDSPAILNMEAEQSSELLSIYKIPFFMTQ 63

Query: 63  VHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRLKTNPEM-LESIVQTVTDNLQNLL 122
            HPKA+EPQ+VS GPY+HGK   + MELEK K F   KT   + +ESIV+ V+  L  L+
Sbjct: 64  THPKAYEPQVVSLGPYNHGKQHLSPMELEKLKLFHSFKTRCLLDVESIVRGVSTILDELM 123

Query: 123 GAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWDIKRDMLLLENQLPM 182
            +YD+L E EW + P KFL+++IVDGCFML FL  CP SL ++S DIK+DMLLLENQLPM
Sbjct: 124 ESYDRL-EEEWTQDPGKFLQLMIVDGCFMLGFLISCPNSLINVSPDIKQDMLLLENQLPM 183

Query: 183 LLLDDLYSI------LPNMDGRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTI 242
           LLL+ LYSI      LP  D +    +CK +     E   M   LHIL+MY+ SLL   I
Sbjct: 184 LLLEKLYSIAGRNVQLPQQDPKK--LVCKWLSIPQNEV--MKDCLHILEMYKESLLYPPI 243

Query: 243 LENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQESGTKSLTDVSFDSKKGVLKIPQL 302
            + +D S E     S PE QVI  AT+L +AGI F+ S T+SL DV FD+K+GVL +PQL
Sbjct: 244 -DRRDWSAE--LDHSDPECQVIPPATKLHEAGIKFKRSKTESLRDVWFDTKRGVLWLPQL 303

Query: 303 VVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDD 362
           +VDDD+E+++LNVMAFEKLH +AG +VTSFV+LM+NLID + DVA+L+ + ILANA+G+D
Sbjct: 304 MVDDDTESTMLNVMAFEKLHMKAGRKVTSFVILMSNLIDDERDVAVLAGEKILANAVGND 363

Query: 363 QSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVA 422
           + AA LF  LG GAAM L+SH+  VH  VN HC +PWNE CA+LKH+YF +PW IISL A
Sbjct: 364 KEAAGLFSRLGSGAAMGLDSHMAGVHKMVNAHCNQPWNERCATLKHDYFQSPWTIISLCA 423

Query: 423 AILGFAILIVQAVYQIVDYY 428
           AI GF ILI+QA+YQ +DYY
Sbjct: 424 AIFGFIILILQAIYQFLDYY 435
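
The percentages in the HSP header above follow directly from the reported counts: matches divided by alignment length. A minimal sketch, with the hypothetical helper name `percent`, reproduces the two figures for this first hit.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of alignment columns that match, as a percentage rounded to 2 decimals."""
    return round(100 * matches / alignment_length, 2)


# Counts from the HSP header of the XP_023004238.1 hit.
alignment_length = 440
identity = percent(233, alignment_length)   # identical residues
positives = percent(307, alignment_length)  # identical or conservatively substituted residues

print(f"Identity {identity}%, positives {positives}%")
```

The same calculation applies to the other hits in this report by substituting their counts.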

BLAST of Cla97C06G110940 vs. NCBI nr
Match: XP_023004239.1 (UPF0481 protein At3g47200-like isoform X2 [Cucurbita maxima])

HSP 1 Score: 409.5 bits (1051), Expect = 1.5e-110
Identity = 233/440 (52.95%), Positives = 307/440 (69.77%), Query Frame = 0

Query: 3   QPQLPISISRNDS----AQLVIEVQENLNKLGKSV----LATDENTFNRSIYRIPTFMRE 62
           QP L + + R+D+    A +V+ V+  L++L  S     +  ++++   SIY+IP FM +
Sbjct: 4   QPPLQVMVGRDDNIGDKADMVVRVKGTLDQLLDSPAILNMEAEQSSELLSIYKIPFFMTQ 63

Query: 63  VHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRLKTNPEM-LESIVQTVTDNLQNLL 122
            HPKA+EPQ+VS GPY+HGK   + MELEK K F   KT   + +ESIV+ V+  L  L+
Sbjct: 64  THPKAYEPQVVSLGPYNHGKQHLSPMELEKLKLFHSFKTRCLLDVESIVRGVSTILDELM 123

Query: 123 GAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWDIKRDMLLLENQLPM 182
            +YD+L E EW + P KFL+++IVDGCFML FL  CP SL ++S DIK+DMLLLENQLPM
Sbjct: 124 ESYDRL-EEEWTQDPGKFLQLMIVDGCFMLGFLISCPNSLINVSPDIKQDMLLLENQLPM 183

Query: 183 LLLDDLYSI------LPNMDGRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTI 242
           LLL+ LYSI      LP    +L   +CK +     E   M   LHIL+MY+ SLL   I
Sbjct: 184 LLLEKLYSIAGRNVQLPQDPKKL---VCKWLSIPQNEV--MKDCLHILEMYKESLLYPPI 243

Query: 243 LENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQESGTKSLTDVSFDSKKGVLKIPQL 302
            + +D S E     S PE QVI  AT+L +AGI F+ S T+SL DV FD+K+GVL +PQL
Sbjct: 244 -DRRDWSAE--LDHSDPECQVIPPATKLHEAGIKFKRSKTESLRDVWFDTKRGVLWLPQL 303

Query: 303 VVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDD 362
           +VDDD+E+++LNVMAFEKLH +AG +VTSFV+LM+NLID + DVA+L+ + ILANA+G+D
Sbjct: 304 MVDDDTESTMLNVMAFEKLHMKAGRKVTSFVILMSNLIDDERDVAVLAGEKILANAVGND 363

Query: 363 QSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVA 422
           + AA LF  LG GAAM L+SH+  VH  VN HC +PWNE CA+LKH+YF +PW IISL A
Sbjct: 364 KEAAGLFSRLGSGAAMGLDSHMAGVHKMVNAHCNQPWNERCATLKHDYFQSPWTIISLCA 423

Query: 423 AILGFAILIVQAVYQIVDYY 428
           AI GF ILI+QA+YQ +DYY
Sbjct: 424 AIFGFIILILQAIYQFLDYY 434

BLAST of Cla97C06G110940 vs. NCBI nr
Match: XP_023513986.1 (UPF0481 protein At3g47200-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 403.7 bits (1036), Expect = 8.0e-109
Identity = 232/439 (52.85%), Positives = 305/439 (69.48%), Query Frame = 0

Query: 4   PQLPISISRNDS----AQLVIEVQENLNKLGKSV----LATDENTFNRSIYRIPTFMREV 63
           P L + + R+D+    A +V++V+  L++L  S     +  ++++   SIY+IP FM + 
Sbjct: 5   PSLQVMLGRDDNIGDKADMVVQVKGTLDQLLDSPAILNMEAEQSSELLSIYKIPFFMTQT 64

Query: 64  HPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRLKTNPEM-LESIVQTVTDNLQNLLG 123
           HPKA+EP++VS GPY+HGK   + MELEK K F   KT   + +ESIV+ V+  L  L+ 
Sbjct: 65  HPKAYEPRVVSLGPYNHGKQHLSPMELEKLKLFYSFKTRCLLDVESIVKGVSTILDELME 124

Query: 124 AYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWDIKRDMLLLENQLPML 183
           +YDKL E EW + P KFL+++IVDGCFML FL  CP SL ++S DIK+DMLLLENQLPML
Sbjct: 125 SYDKL-EEEWTEDPGKFLQLMIVDGCFMLGFLINCPDSLINVSPDIKQDMLLLENQLPML 184

Query: 184 LLDDLYSI------LPNMDGRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTIL 243
           LL+ LYSI      LP  D  L   +   +     E   M   LHIL+MY+ SLL   I 
Sbjct: 185 LLEKLYSIAARNVQLPQQD--LQKLVSNWLNIPRNEV--MKDCLHILEMYKESLLHPPI- 244

Query: 244 ENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQESGTKSLTDVSFDSKKGVLKIPQLV 303
           +  D S+E     S PE QVI  AT+LR+AGI F+ S T SLTDV FD+K GVL +PQL+
Sbjct: 245 DRTDWSVE--LDHSDPESQVIPPATKLREAGIKFKRSKTDSLTDVFFDAKGGVLWLPQLM 304

Query: 304 VDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQ 363
           VDDD+E++LLNVMAFEKLH +AG  VTSFV+LM+NLID + DVA+L+ + +LANA+G+D+
Sbjct: 305 VDDDTESTLLNVMAFEKLHMKAGRRVTSFVILMSNLIDDERDVAVLAGEKVLANAVGNDK 364

Query: 364 SAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAA 423
            AA LF  LG GAAM L++H+  VH +VN HC +PWNE CA+LKH+YF +PW IISL AA
Sbjct: 365 EAAGLFNRLGSGAAMGLDTHMAGVHKKVNEHCNQPWNERCATLKHDYFQSPWTIISLCAA 424

Query: 424 ILGFAILIVQAVYQIVDYY 428
           I GF ILI+QA+YQ +DYY
Sbjct: 425 IFGFIILILQAIYQFLDYY 435

BLAST of Cla97C06G110940 vs. NCBI nr
Match: XP_023513987.1 (UPF0481 protein At3g47200-like isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 401.7 bits (1031), Expect = 3.0e-108
Identity = 229/436 (52.52%), Positives = 303/436 (69.50%), Query Frame = 0

Query: 4   PQLPISISRNDS----AQLVIEVQENLNKLGKSV----LATDENTFNRSIYRIPTFMREV 63
           P L + + R+D+    A +V++V+  L++L  S     +  ++++   SIY+IP FM + 
Sbjct: 5   PSLQVMLGRDDNIGDKADMVVQVKGTLDQLLDSPAILNMEAEQSSELLSIYKIPFFMTQT 64

Query: 64  HPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRLKTNPEM-LESIVQTVTDNLQNLLG 123
           HPKA+EP++VS GPY+HGK   + MELEK K F   KT   + +ESIV+ V+  L  L+ 
Sbjct: 65  HPKAYEPRVVSLGPYNHGKQHLSPMELEKLKLFYSFKTRCLLDVESIVKGVSTILDELME 124

Query: 124 AYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWDIKRDMLLLENQLPML 183
           +YDKL E EW + P KFL+++IVDGCFML FL  CP SL ++S DIK+DMLLLENQLPML
Sbjct: 125 SYDKL-EEEWTEDPGKFLQLMIVDGCFMLGFLINCPDSLINVSPDIKQDMLLLENQLPML 184

Query: 184 LLDDLYSILP---NMDGRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENK 243
           LL+ LYSI      +   L   +   +     E   M   LHIL+MY+ SLL   I +  
Sbjct: 185 LLEKLYSIAARNVQLPQDLQKLVSNWLNIPRNEV--MKDCLHILEMYKESLLHPPI-DRT 244

Query: 244 DGSMERKTKKSGPEYQVIRHATQLRDAGIDFQESGTKSLTDVSFDSKKGVLKIPQLVVDD 303
           D S+E     S PE QVI  AT+LR+AGI F+ S T SLTDV FD+K GVL +PQL+VDD
Sbjct: 245 DWSVE--LDHSDPESQVIPPATKLREAGIKFKRSKTDSLTDVFFDAKGGVLWLPQLMVDD 304

Query: 304 DSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAA 363
           D+E++LLNVMAFEKLH +AG  VTSFV+LM+NLID + DVA+L+ + +LANA+G+D+ AA
Sbjct: 305 DTESTLLNVMAFEKLHMKAGRRVTSFVILMSNLIDDERDVAVLAGEKVLANAVGNDKEAA 364

Query: 364 ELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILG 423
            LF  LG GAAM L++H+  VH +VN HC +PWNE CA+LKH+YF +PW IISL AAI G
Sbjct: 365 GLFNRLGSGAAMGLDTHMAGVHKKVNEHCNQPWNERCATLKHDYFQSPWTIISLCAAIFG 424

Query: 424 FAILIVQAVYQIVDYY 428
           F ILI+QA+YQ +DYY
Sbjct: 425 FIILILQAIYQFLDYY 434

BLAST of Cla97C06G110940 vs. NCBI nr
Match: XP_022960454.1 (UPF0481 protein At3g47200-like [Cucurbita moschata])

HSP 1 Score: 397.1 bits (1019), Expect = 7.5e-107
Identity = 229/439 (52.16%), Positives = 303/439 (69.02%), Query Frame = 0

Query: 4   PQLPISISRN----DSAQLVIEVQENLNKLGKSV----LATDENTFNRSIYRIPTFMREV 63
           P L + + R+    D+A +V+ V+  L++L  S     +  ++++   SIY+IP FM + 
Sbjct: 5   PSLQVMLGRDDNIGDNADMVVRVKGTLDQLLDSPAILNMEAEQSSELLSIYKIPFFMTQT 64

Query: 64  HPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRLKTNPEM-LESIVQTVTDNLQNLLG 123
           HPKA+EPQ+VS GPY+HGK   + MELEK K F   K    + +ESIV+ V+  L  L+ 
Sbjct: 65  HPKAYEPQVVSLGPYNHGKQHLSPMELEKLKLFHSFKARCLLDVESIVRGVSTILDELME 124

Query: 124 AYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWDIKRDMLLLENQLPML 183
           +YDKL E +W + P KFL+++IVDGCFML FL  CP SL ++S DIK+DMLLLENQLPML
Sbjct: 125 SYDKL-EEDWKEDPGKFLQLMIVDGCFMLGFLINCPDSLINVSPDIKQDMLLLENQLPML 184

Query: 184 LLDDLYSI------LPNMDGRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTIL 243
           LL+ LYSI      LP  D  L   +   +     E   M   LHIL+MY+ SLL   I 
Sbjct: 185 LLEKLYSIADRNVQLPQQD--LKKLVSNWLNIPRNEV--MKDCLHILEMYKESLLHPPI- 244

Query: 244 ENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQESGTKSLTDVSFDSKKGVLKIPQLV 303
           +  D S+E     S PE QVI  AT+LR+AGI F+ S T SLTDV FD+K GVL +P+L+
Sbjct: 245 DRTDWSVE--LGHSDPECQVIPPATKLREAGIKFKRSKTGSLTDVFFDAKGGVLWLPRLM 304

Query: 304 VDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQ 363
           VDD++E++LLNVMAFEKLH +AG +VTSFV+LM+NLID + DVA+L+ + +LANA+G+D+
Sbjct: 305 VDDNTESTLLNVMAFEKLHMKAGRKVTSFVILMSNLIDDERDVAVLAGEQVLANAVGNDK 364

Query: 364 SAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAA 423
            AA LF  LG GAAM L++H+  VH +VN HC +PWNE CA+LKH YF +PW IISL AA
Sbjct: 365 EAAGLFNRLGSGAAMGLDTHMAWVHKKVNEHCNQPWNERCATLKHEYFQSPWTIISLCAA 424

Query: 424 ILGFAILIVQAVYQIVDYY 428
           I GF ILI+QA+YQ +DYY
Sbjct: 425 IFGFIILILQAIYQFLDYY 435

BLAST of Cla97C06G110940 vs. TrEMBL
Match: tr|A0A1S4E2L7|A0A1S4E2L7_CUCME (uncharacterized protein LOC103499077 OS=Cucumis melo OX=3656 GN=LOC103499077 PE=4 SV=1)

HSP 1 Score: 371.7 bits (953), Expect = 2.2e-99
Identity = 225/441 (51.02%), Positives = 280/441 (63.49%), Query Frame = 0

Query: 14  DSAQLVIEVQENLNKL------GKSVLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSF 73
           D   +VI V++ L +L        +   + E T   SIY+IP FM+++  KA+EP LVSF
Sbjct: 502 DKNDVVINVKDCLKELLMKRPVPVNESPSSEKTIKPSIYKIPKFMKDIQLKAYEPYLVSF 561

Query: 74  GPYHHGKPQYASMELEKQKAFRRL---KTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEW 133
           GPYHHG    A ME EKQK F+ L     N    ESI   V++ L++L  AYD L E +W
Sbjct: 562 GPYHHGVEHLAPMEKEKQKVFQHLVKGDNNAATYESIASEVSNILEDLYAAYDNLDE-KW 621

Query: 134 AK---RPAKFLEVLIVDGCFMLSFLK--KCPPSLSSMSWDIKRDMLLLENQLPMLLLDDL 193
            K     AKF+E++I+D CF+L F    K   SL ++  DIKRD+LLLENQLP  LL  L
Sbjct: 622 RKDVVASAKFMEMMIIDACFILVFFSKDKSYKSLMTLRSDIKRDILLLENQLPFQLLQLL 681

Query: 194 YSILPNMDGR--LAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLG--TTILENKDGSM 253
           Y ILP  D    L   ICK + F   + +++ G  HIL+MYRM LL     +L  +D S 
Sbjct: 682 YKILPIKDQNKSLTSLICK-LWFVKKDELTVKGGKHILEMYRMLLLDPIPIVLSERDESQ 741

Query: 254 ERK-----TKKSGPE----YQVIRHATQLRDAGIDFQESGTKSLTDVSFDSKKGVLKIPQ 313
           ++       KK   E     Q+I  AT L DAGI F+ S T+SL D+ F  K GVL++P 
Sbjct: 742 KKAEGTKGNKKEKDESVLNSQIIPQATLLHDAGIKFRRSETESLIDIGF--KNGVLELPH 801

Query: 314 LVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGD 373
           L VDDD+E  LLNVMAFEKLH    S VTSFVVLMNNLID+D+DV LLS   I+ NALG+
Sbjct: 802 LTVDDDTETKLLNVMAFEKLHGNVRSYVTSFVVLMNNLIDIDKDVELLSENKIIDNALGN 861

Query: 374 DQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLV 428
           D+ AA+LF  LGKG A+DLES+I +VH  VN HC    N WCA+LKHNYF NPWAIISL+
Sbjct: 862 DEDAAKLFTVLGKGVALDLESNIAKVHRLVNKHCDGRCNRWCANLKHNYFQNPWAIISLI 921

BLAST of Cla97C06G110940 vs. TrEMBL
Match: tr|E5GB49|E5GB49_CUCME (Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1)

HSP 1 Score: 371.7 bits (953), Expect = 2.2e-99
Identity = 225/441 (51.02%), Positives = 280/441 (63.49%), Query Frame = 0

Query: 14  DSAQLVIEVQENLNKL------GKSVLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSF 73
           D   +VI V++ L +L        +   + E T   SIY+IP FM+++  KA+EP LVSF
Sbjct: 26  DKNDVVINVKDCLKELLMKRPVPVNESPSSEKTIKPSIYKIPKFMKDIQLKAYEPYLVSF 85

Query: 74  GPYHHGKPQYASMELEKQKAFRRL---KTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEW 133
           GPYHHG    A ME EKQK F+ L     N    ESI   V++ L++L  AYD L E +W
Sbjct: 86  GPYHHGVEHLAPMEKEKQKVFQHLVKGDNNAATYESIASEVSNILEDLYAAYDNLDE-KW 145

Query: 134 AK---RPAKFLEVLIVDGCFMLSFLK--KCPPSLSSMSWDIKRDMLLLENQLPMLLLDDL 193
            K     AKF+E++I+D CF+L F    K   SL ++  DIKRD+LLLENQLP  LL  L
Sbjct: 146 RKDVVASAKFMEMMIIDACFILVFFSKDKSYKSLMTLRSDIKRDILLLENQLPFQLLQLL 205

Query: 194 YSILPNMDGR--LAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLG--TTILENKDGSM 253
           Y ILP  D    L   ICK + F   + +++ G  HIL+MYRM LL     +L  +D S 
Sbjct: 206 YKILPIKDQNKSLTSLICK-LWFVKKDELTVKGGKHILEMYRMLLLDPIPIVLSERDESQ 265

Query: 254 ERK-----TKKSGPE----YQVIRHATQLRDAGIDFQESGTKSLTDVSFDSKKGVLKIPQ 313
           ++       KK   E     Q+I  AT L DAGI F+ S T+SL D+ F  K GVL++P 
Sbjct: 266 KKAEGTKGNKKEKDESVLNSQIIPQATLLHDAGIKFRRSETESLIDIGF--KNGVLELPH 325

Query: 314 LVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGD 373
           L VDDD+E  LLNVMAFEKLH    S VTSFVVLMNNLID+D+DV LLS   I+ NALG+
Sbjct: 326 LTVDDDTETKLLNVMAFEKLHGNVRSYVTSFVVLMNNLIDIDKDVELLSENKIIDNALGN 385

Query: 374 DQSAAELFGSLGKGAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLV 428
           D+ AA+LF  LGKG A+DLES+I +VH  VN HC    N WCA+LKHNYF NPWAIISL+
Sbjct: 386 DEDAAKLFTVLGKGVALDLESNIAKVHRLVNKHCDGRCNRWCANLKHNYFQNPWAIISLI 445

BLAST of Cla97C06G110940 vs. TrEMBL
Match: tr|A0A067JUP9|A0A067JUP9_JATCU (Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_23062 PE=4 SV=1)

HSP 1 Score: 310.1 bits (793), Expect = 7.9e-81
Identity = 180/431 (41.76%), Positives = 262/431 (60.79%), Query Frame = 0

Query: 19  VIEVQENLNKLGKSVLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYA 78
           VIEV E L  + KS+   +E    RSIY+IP  + +++  A+ PQ VSFGPYHHG+    
Sbjct: 9   VIEVNEKLENIDKSM--EEERWKKRSIYKIPACVTDLNKNAYRPQAVSFGPYHHGEAHLK 68

Query: 79  SMELEKQKAFRR-LKTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIV 138
            ME  KQ+A    LK   + L+  V ++T  +Q L   YD L +  W +    F++++I+
Sbjct: 69  PMEEHKQRALLHFLKRANKPLQVFVDSLTQVVQVLKDCYDPL-DIIWQEDTCGFVQLMIM 128

Query: 139 DGCFMLSFLKKCPPSLSSMSWD---------------IKRDMLLLENQLPMLLLDDLYSI 198
           DGCFML  L+    ++   + +               +KRDML+LENQLPM+LLD L ++
Sbjct: 129 DGCFMLEILRVATRTVEDYAPNDPIFSDHGKLYIMPYVKRDMLMLENQLPMILLDKLLAV 188

Query: 199 L--PNMDGRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTI-LENKDGSMERKT 258
                 D      +    CF      S+G  LH LD+YR SLL   I + +K  S  R  
Sbjct: 189 ECGEEKDEEFVNRLILKFCFPDVPVSSLGKCLHALDVYRKSLLQRHIGMNDKRRSRSRSR 248

Query: 259 KKSGPEYQVIRHATQLRDAGIDFQESGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLN 318
           +++G +  +IR AT+L +AGI F++S T+SL D+SF  + GVL++P +VVDD +E+S LN
Sbjct: 249 RRNGGD-NIIRSATELNEAGIRFKKSKTRSLKDISF--RGGVLRLPVIVVDDATESSFLN 308

Query: 319 VMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGK 378
           ++AFE+ H  AG+E+TSF+  M+N+ID + DVALL S+ I+ NA+G D++ A+LF S+ K
Sbjct: 309 LIAFERFHVGAGNEITSFIFFMDNIIDSERDVALLHSRGIIQNAIGSDKAVAKLFNSMSK 368

Query: 379 GAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQA 431
              +D  S +  VH +VN +C++ WNEW A+L H YF NPWAI+S +AA+L FA+ IVQ 
Sbjct: 369 DITLDPNSSLDFVHKKVNAYCKKAWNEWRANLIHTYFRNPWAIVSFLAAVLLFALTIVQT 428

BLAST of Cla97C06G110940 vs. TrEMBL
Match: tr|A0A0A0K9F5|A0A0A0K9F5_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G089270 PE=4 SV=1)

HSP 1 Score: 308.5 bits (789), Expect = 2.3e-80
Identity = 200/435 (45.98%), Positives = 264/435 (60.69%), Query Frame = 0

Query: 5   QLPISISRNDSAQLVIEVQENLNK-LGKSV----LATDENTFNRSIYRIPTFMREVHPKA 64
           QLPI     D+A +V+EV++NL K L KSV    L +   T   SIY+IP F+++VH +A
Sbjct: 23  QLPIM----DNA-VVVEVKDNLKKLLMKSVAVEKLGSSGKTIKPSIYKIPNFIKDVHKEA 82

Query: 65  FEPQLVSFGPYHHGKPQYASMELEKQKAFRRL-KTNPEMLESIVQTVTDNLQNLLGAYDK 124
           + P +VSFGPYHHG+   A ME EK K FR L        ESIV  V++ L++L GAYD 
Sbjct: 83  YMPHMVSFGPYHHGEKNLAPMEQEKLKVFRHLVDVKGVDYESIVSDVSNILEDLYGAYDD 142

Query: 125 LIEGEWAKR--PAKFLEVLIVDGCFMLSFLKKCPPSLSSMSWDIKRDMLLLENQLPMLLL 184
           L E  W      AKF++++I+     + + K    +L+S                     
Sbjct: 143 LDEDWWKDNAGSAKFMKMMILH----IFYSKDQNTTLTS--------------------- 202

Query: 185 DDLYSILPNMDGRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLL--GTTILENKDG- 244
                            +  ++ F   + +++    HIL MYR SLL   T    N D  
Sbjct: 203 -----------------LISNLLFVEKDELAIVEKKHILHMYRASLLYPSTLSYPNMDEI 262

Query: 245 SMERKTKKSGPEYQVIRHATQLRDAGIDFQESGTKSLTDVSFDSKKGVLKIPQLVVDDDS 304
               K  K G + Q+I  AT LR+AGI F++S  KSL +VSF+  KGVL +P L+VDD++
Sbjct: 263 KKNNKDDKFGLKCQLIPQATLLREAGIRFRKSENKSLENVSFE--KGVLTLPSLIVDDNT 322

Query: 305 EASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAEL 364
           + +LLNVMAFEKLH + GS+VTSFVVLMNNLID+D+DV LLS+ NI+ANALG+++ AA L
Sbjct: 323 KTNLLNVMAFEKLH-DVGSQVTSFVVLMNNLIDIDKDVELLSNDNIIANALGNNEEAANL 382

Query: 365 FGSLGKGAAMDLES-HITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGF 424
           F  LGKG ++DL S ++TEVH  VN+HC   WN W A+LKH YF NPWAIIS   AI GF
Sbjct: 383 FSVLGKGVSLDLGSNNLTEVHQLVNIHCDDSWNRWWANLKHTYFQNPWAIISFFGAIFGF 407

Query: 425 AILIVQAVYQIVDYY 428
           AILIVQAVYQIVD++
Sbjct: 443 AILIVQAVYQIVDFH 407

BLAST of Cla97C06G110940 vs. TrEMBL
Match: tr|A0A2C9W374|A0A2C9W374_MANES (Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_04G166100 PE=4 SV=1)

HSP 1 Score: 300.1 bits (767), Expect = 8.2e-78
Identity = 174/431 (40.37%), Positives = 259/431 (60.09%), Query Frame = 0

Query: 19  VIEVQENLNKLGKSVLATDENTF-NRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQY 78
           VIEV E L K+  +  + +E  +  R+ Y+IP F+ +++ KA+ PQ VSFGPYHHG+   
Sbjct: 7   VIEVNEKLEKMIDADNSMEEERWKKRAAYKIPAFVTDLNKKAYRPQAVSFGPYHHGEDHL 66

Query: 79  ASMELEKQKAFRR-LKTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLI 138
             ME  KQ+A    LK   + L+   +++T  +Q L  +YD L +  W     +FL+++I
Sbjct: 67  KPMEEHKQRALLHFLKRAKKPLQVFFESLTGLVQLLKESYDPL-DISWQDN-TRFLQLMI 126

Query: 139 VDGCFMLSFLKKCPPSLSSMSWD---------------IKRDMLLLENQLPMLLLDDLYS 198
           +DGCFML  L+    +L   + +               I RDML+LENQLP+L+LD L +
Sbjct: 127 LDGCFMLEILRVATRTLDDYARNDPIFSNHGNLYVMPYIMRDMLMLENQLPLLVLDKLVA 186

Query: 199 ILPN--MDGRLAWFICKSMCFASGEAVSMGGNLHILDMYRMSLLGTTILENKDGSMERKT 258
           +     MD      +    CF       +G  LH LD+YR +LL   +   K      + 
Sbjct: 187 VESGKPMDEEFINNLILKFCFPDTPLSCLGNCLHPLDVYRKNLLQNHVGGEKPHRSRSRG 246

Query: 259 KKSGPEYQVIRHATQLRDAGIDFQESGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLN 318
           K+      +IR AT+L +AGI F++S T+SL D+SF  + GVL++P +VVDD +E+  LN
Sbjct: 247 KRRKGGDNIIRSATELNEAGIRFKKSKTRSLKDISF--RGGVLRLPVIVVDDATESIFLN 306

Query: 319 VMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGK 378
           +MAFE+ H  AG+EVTS++  M+N+ID + DVALL S+ I+ NA+G D++ A+LF SL K
Sbjct: 307 LMAFERFHVGAGNEVTSYIFFMDNIIDSERDVALLHSRGIIQNAIGSDKAVAKLFNSLSK 366

Query: 379 GAAMDLESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQA 431
             ++D  S +  VH ++N++C++  NEW A+L H YF NPWAI+SL+AA+  FA+ I Q 
Sbjct: 367 DISLDPNSSLNFVHQKINVYCKKACNEWRANLIHTYFRNPWAILSLIAAVFLFALTIAQT 426

BLAST of Cla97C06G110940 vs. Swiss-Prot
Match: sp|Q9SD53|Y3720_ARATH (UPF0481 protein At3g47200 OS=Arabidopsis thaliana OX=3702 GN=At3g47200 PE=2 SV=1)

HSP 1 Score: 139.8 bits (351), Expect = 7.0e-32
Identity = 111/419 (26.49%), Positives = 201/419 (47.97%), Query Frame = 0

Query: 45  IYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFR----RLKTNPEMLES 104
           I+R+P     ++PKA++P++VS GPYH+G+     ++  K +  +      K        
Sbjct: 48  IFRVPESFVALNPKAYKPKVVSIGPYHYGEKHLQMIQQHKPRLLQLFLDEAKKKDVEENV 107

Query: 105 IVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSF-------LKKCPPSL 164
           +V+ V D    +  +Y      E  K     + ++++DGCF+L         ++     +
Sbjct: 108 LVKAVVDLEDKIRKSY-----SEELKTGHDLMFMMVLDGCFILMVFLIMSGNIELSEDPI 167

Query: 165 SSMSW---DIKRDMLLLENQLPMLLLDDLY---SILPNMD-GRLAWFICKSMCFASGEAV 224
            S+ W    I+ D+LLLENQ+P  +L  LY    I  + D  R+A+   K+     G   
Sbjct: 168 FSIPWLLSSIQSDLLLLENQVPFFVLQTLYVGSKIGVSSDLNRIAFHFFKNPIDKEGSYW 227

Query: 225 SMGGNL---HILDMYRMSLLGTT--------------ILENKDGSMERKTKKSGPEYQVI 284
               N    H+LD+ R + L  T              + E K G++     K+ P   +I
Sbjct: 228 EKHRNYKAKHLLDLIRETFLPNTSESDKASSPHVQVQLHEGKSGNVPSVDSKAVP---LI 287

Query: 285 RHATQLRDAGIDFQESGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEE 344
             A +LR  GI F+   +K  + ++   KK  L+IPQL  D    +  LN +AFE+ + +
Sbjct: 288 LSAKRLRLQGIKFRLRRSKEDSILNVRLKKNKLQIPQLRFDGFISSFFLNCVAFEQFYTD 347

Query: 345 AGSEVTSFVVLMNNLIDVDEDVALL-SSKNILANALGDDQSAAELFGSLGKGAAMDLE-S 404
           + +E+T+++V M  L++ +EDV  L + K I+ N  G +   +E F ++ K    +++ S
Sbjct: 348 SSNEITTYIVFMGCLLNNEEDVTFLRNDKLIIENHFGSNNEVSEFFKTISKDVVFEVDTS 407

Query: 405 HITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQAVYQIVDY 427
           ++  V   VN + ++ +N   A  +H +F +PW  +S  A +    + ++Q+   I+ Y
Sbjct: 408 YLNNVFKGVNEYTKKWYNGLWAGFRHTHFESPWTFLSSCAVLFVILLTMLQSTVAILSY 458

BLAST of Cla97C06G110940 vs. Swiss-Prot
Match: sp|P0C897|Y3264_ARATH (Putative UPF0481 protein At3g02645 OS=Arabidopsis thaliana OX=3702 GN=At3g02645 PE=3 SV=1)

HSP 1 Score: 79.7 bits (195), Expect = 8.6e-14
Identity = 47/174 (27.01%), Positives = 86/174 (49.43%), Query Frame = 0

Query: 245 EYQVIRHATQLRDAGIDFQESGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLNVMAFE 304
           E   I   + L  AG+ F+ +   +++ V+FDS  G   +P + +D ++E  L N++A+E
Sbjct: 341 EELTIPSVSDLHKAGVRFKPTAHGNISTVTFDSNSGQFYLPVINLDINTETVLRNLVAYE 400

Query: 305 KLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGKGAAMD 364
             +       T +  L+N +ID +EDV LL  + +L + L  DQ AAE++  + K   + 
Sbjct: 401 ATNTSGPLVFTRYTELINGIIDSEEDVRLLREQGVLVSRLKSDQEAAEMWNGMSKSVRLT 460

Query: 365 LESHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQ 419
               + +    VN +    W      L   Y +  W I++ +AA+L   ++ +Q
Sbjct: 461 KVGFLDKTIEDVNRYYTGRWKVKIGRLVEVYVYGSWQILAFLAAVLLLMLVSLQ 514

BLAST of Cla97C06G110940 vs. TAIR10
Match: AT3G50170.1 (Plant protein of unknown function (DUF247))

HSP 1 Score: 208.4 bits (529), Expect = 8.9e-54
Identity = 137/457 (29.98%), Positives = 234/457 (51.20%), Query Frame = 0

Query: 19  VIEVQENLNKLGKSVLATDENTF--NRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQ 78
           VI +++ L +  +     D+ T      IYR+P +++E   K++ PQ VS GPYHHGK +
Sbjct: 91  VISIRDKLEQADRD----DDTTIWGKLCIYRVPHYLQENDKKSYFPQTVSLGPYHHGKKR 150

Query: 79  YASMELEKQKAFRRLKTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLI 138
              ME  K +A  ++    + L+  ++  T+ ++ L        EG  +    +F E+L+
Sbjct: 151 LRPMERHKWRALNKVL---KRLKQRIEMYTNAMRELEEKARACYEGPISLSRNEFTEMLV 210

Query: 139 VDGCFMLSFLKKCPPSLSSMSW--------------DIKRDMLLLENQLPMLLLDDLYSI 198
           +DGCF+L   +      + + +               I+RDM++LENQLP+ +LD L  +
Sbjct: 211 LDGCFVLELFRGTVEGFTEIGYARNDPVFAMRGLMHSIQRDMIMLENQLPLFVLDRLLEL 270

Query: 199 ---LPNMDGRLAWFICK--SMCFASGEAVSM-------------------GGNLHILDMY 258
                N  G +A    K       +GEA++                     G LH LD++
Sbjct: 271 QLGTQNQTGIVAHVAVKFFDPLMPTGEALTKPDQSKLMNWLEKSLDTLGDKGELHCLDVF 330

Query: 259 RMSLLGTTILENKDGSMERKTKKS----GPEYQVIRHATQLRDAGIDFQESGTKSLTDVS 318
           R SLL ++   N    ++R T+ +      + Q++   T+LR+AG+ F++  T    D+ 
Sbjct: 331 RRSLLQSSPTPNTRSLLKRLTRNTRVVDKRQQQLVHCVTELREAGVKFRKRKTDRFWDIE 390

Query: 319 FDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALL 378
           F  K G L+IP+L++ D +++   N++AFE+ H E+ + +TS+++ M+NLI+  EDV+ L
Sbjct: 391 F--KNGYLEIPKLLIHDGTKSLFSNLIAFEQCHIESSNHITSYIIFMDNLINSSEDVSYL 450

Query: 379 SSKNILANALGDDQSAAELFGSLGKGAAMD-LESHITEVHHQVNLHCRRPWNEWCASLKH 431
               I+ + LG D   A+LF  L +    D  +SH++ +   VN +  R WN   A+L H
Sbjct: 451 HYCGIIEHWLGSDSEVADLFNRLCQEVVFDPKDSHLSRLSGDVNRYYNRKWNVLKATLTH 510

BLAST of Cla97C06G110940 vs. TAIR10
Match: AT3G50160.1 (Plant protein of unknown function (DUF247))

HSP 1 Score: 205.7 bits (522), Expect = 5.8e-53
Identity = 133/451 (29.49%), Positives = 232/451 (51.44%), Query Frame = 0

Query: 2   TQPQLPISISRNDSAQL----VIEVQENLNKLGKSVLATDENTFNRSIYRIPTFMREVHP 61
           TQ +  +SI   +  +L    VI + + +  LG +   + +N     IYR+P +++E   
Sbjct: 62  TQVESVVSIEDKNEQKLREIWVISLNDKMKTLGDNATTSWDNL---CIYRVPPYLQENDT 121

Query: 62  KAFEPQLVSFGPYHHGKPQYASMELEKQKAFRRLKTNPEMLESIVQTVTDNLQNLLGAYD 121
           K++ PQ+VS GPYHHG      ME  K +A   +       +  ++   D ++ L     
Sbjct: 122 KSYMPQIVSIGPYHHGHKHLMPMERHKWRAVNMVMAR---AKHDIEMYIDAMKELEEKAR 181

Query: 122 KLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSW--------------DIKRD 181
              +G       +F+E+L++DG F++   K        + +               I+RD
Sbjct: 182 ACYQGPINMNRNEFIEMLVLDGVFIIEIFKGTSEGFQEIGYAPNDPVFGMRGLMQSIRRD 241

Query: 182 MLLLENQLPMLLLDDLY-----SILPNMDGRLAWFICKSMCFASGEAVSMGGNLHILDMY 241
           M++LENQLP  +L  L       +L  ++ +L     + +   + E ++  G LH LD+ 
Sbjct: 242 MVMLENQLPWSVLKGLLQLQRPDVLDKVNVQLFQPFFQPL-LPTREVLTEEGGLHCLDVL 301

Query: 242 RMSLLGTTILENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQESGTKSLTDVSFDSK 301
           R  LL ++   ++D SM  K  +     Q+I   T+LR+AG++F    T    D+ F  K
Sbjct: 302 RRGLLQSSGTSDEDMSMVNKQPQ-----QLIHCVTELRNAGVEFMRKETGHFWDIEF--K 361

Query: 302 KGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKN 361
            G LKIP+L++ D +++  LN++AFE+ H ++  ++TS+++ M+NLI+  EDV+ L    
Sbjct: 362 NGYLKIPKLLIHDGTKSLFLNLIAFEQCHIKSSKKITSYIIFMDNLINSSEDVSYLHHYG 421

Query: 362 ILANALGDDQSAAELFGSLGKGAAMD-LESHITEVHHQVNLHCRRPWNEWCASLKHNYFH 421
           I+ N LG D   ++LF  LGK    D  + +++ +  +VN++ RR WN   A+L+H YF+
Sbjct: 422 IIENWLGSDSEVSDLFNGLGKEVIFDPNDGYLSALTGEVNIYYRRKWNYLKATLRHKYFN 481

Query: 422 NPWAIISLVAAILGFAILIVQAVYQIVDYYR 429
           NPWA  S +AA+        Q+ + +  Y++
Sbjct: 482 NPWAYFSFIAAVTLLIFTFCQSFFAVFAYFK 498

BLAST of Cla97C06G110940 vs. TAIR10
Match: AT3G50120.1 (Plant protein of unknown function (DUF247))

HSP 1 Score: 203.4 bits (516), Expect = 2.9e-52
Identity = 131/430 (30.47%), Positives = 224/430 (52.09%), Query Frame = 0

Query: 45  IYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYASMELEKQKAFRR-LKTNPEMLESIVQ 104
           IYR+P +++E   K++ PQ VS GPYHHGK +  SM+  K +A  R LK   + ++  + 
Sbjct: 105 IYRVPYYLQENDNKSYFPQTVSLGPYHHGKKRLRSMDRHKWRAVNRVLKRTNQGIKMYID 164

Query: 105 TVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIVDGCFMLSFLKKCPPSLSSMSW----- 164
            + +  +     Y    EG  +    +F+E+L++DGCF+L   +      + + +     
Sbjct: 165 AMRELEEKARACY----EGPLSLSSNEFIEMLVLDGCFVLELFRGAVEGFTELGYARNDP 224

Query: 165 ---------DIKRDMLLLENQLPMLLLDDLYSI---LPNMDGRLAWFICK---------S 224
                     I+RDM++LENQLP+ +L+ L  +     N  G +A    +          
Sbjct: 225 VFAMRGSMHSIQRDMVMLENQLPLFVLNRLLELQLGTRNQTGLVAQLAIRFFDPLMPTDE 284

Query: 225 MCFASGEA--------------VSMGGNLHILDMYRMSLLGTTILENKDGSMERKTKKS- 284
               SG++               +  G LH LD++R SLL ++       + +R ++ + 
Sbjct: 285 PLTKSGQSKLENSLARDKSFDPFADMGELHCLDVFRRSLLRSSPKPEPRLTRKRWSRNTR 344

Query: 285 ---GPEYQVIRHATQLRDAGIDFQESGTKSLTDVSFDSKKGVLKIPQLVVDDDSEASLLN 344
                  Q+I   T+L++AGI F+   T    D+ F  K G L+IP+L++ D +++  LN
Sbjct: 345 VADKRRQQLIHCVTELKEAGIKFRRRKTDRFWDMQF--KNGYLEIPRLLIHDGTKSLFLN 404

Query: 345 VMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLSSKNILANALGDDQSAAELFGSLGK 404
           ++AFE+ H ++ +++TS+++ M+NLID  EDV+ L    I+ + LG D   A+LF  L +
Sbjct: 405 LIAFEQCHIDSSNDITSYIIFMDNLIDSHEDVSYLHYCGIIEHWLGSDSEVADLFNRLCQ 464

Query: 405 GAAMDLE-SHITEVHHQVNLHCRRPWNEWCASLKHNYFHNPWAIISLVAAILGFAILIVQ 429
               D E S+++ +  +VN +    WN W A+LKH YF+NPWAI+S  AA++   +   Q
Sbjct: 465 EVVFDTEDSYLSRLSIEVNRYYDHKWNAWRATLKHKYFNNPWAIVSFCAAVILLVLTFSQ 524

BLAST of Cla97C06G110940 vs. TAIR10
Match: AT3G50150.1 (Plant protein of unknown function (DUF247))

HSP 1 Score: 196.8 bits (499), Expect = 2.7e-50
Identity = 131/455 (28.79%), Positives = 229/455 (50.33%), Query Frame = 0

Query: 12  RNDSAQLVIEVQENLNKLGKSVLATDENTFNR-SIYRIPTFMREVHPKAFEPQLVSFGPY 71
           R    + VI +++   K+ K++     N++++  IYR+P +++E   K++ PQ VS GPY
Sbjct: 61  RETREEWVISIKD---KMEKALSYDATNSWDKLCIYRVPFYLQENDKKSYLPQTVSIGPY 120

Query: 72  HHGKPQYASMELEKQKAFRRLKTNPE-MLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPA 131
           HHGK     ME  K +A   +    +  +E  +  + +  +     Y   I+    K   
Sbjct: 121 HHGKVHLRPMERHKWRAVNMIMARTKHNIEMYIDAMKELEEEARACYQGPID---MKNSN 180

Query: 132 KFLEVLIVDGCFMLSFLKKCPPSLSSMSW--------------DIKRDMLLLENQLPMLL 191
           +F E+L++DGCF+L   K        + +               I+RDM++LENQLP+ +
Sbjct: 181 EFTEMLVLDGCFVLELFKGTIQGFQKIGYARNDPVFAKRGLMHSIQRDMIMLENQLPLFV 240

Query: 192 LDDLYSI---LPNMDG---RLAWFICKSMCFAS---------------GEAVSMGGNLHI 251
           LD L  +    PN  G    +A    K++   S                + +   G LH 
Sbjct: 241 LDRLLGLQTGTPNQTGIVAEVAVRFFKTLMPTSEVLTKSERSLDSQEKSDELGDNGGLHC 300

Query: 252 LDMYRMSLLGTTILENKDGSMERKTKKSGPEYQVIRHATQLRDAGIDFQESGTKSLTDVS 311
           LD++  SL+ ++   N+ G+          + Q+I   T+LR AG++F    T  L D+ 
Sbjct: 301 LDVFHRSLIQSSETTNQ-GTPYEDMSMVEKQQQLIHCVTELRGAGVNFMRKETGQLWDIE 360

Query: 312 FDSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALL 371
           F  K G LKIP+L++ D +++   N++AFE+ H ++ + +TS+++ M+NLI+  +DV+ L
Sbjct: 361 F--KNGYLKIPKLLIHDGTKSLFSNLIAFEQCHTQSSNNITSYIIFMDNLINSSQDVSYL 420

Query: 372 SSKNILANALGDDQSAAELFGSLGKGAAMD-LESHITEVHHQVNLHCRRPWNEWCASLKH 429
               I+ + LG D   A+LF  L K    D  + +++++  +VN +  R WN   A+L+ 
Sbjct: 421 HHDGIIEHWLGSDSEVADLFNRLCKEVIFDPKDGYLSQLSREVNRYYSRKWNSLKATLRQ 480

BLAST of Cla97C06G110940 vs. TAIR10
Match: AT3G50140.1 (Plant protein of unknown function (DUF247))

HSP 1 Score: 184.9 bits (468), Expect = 1.1e-46
Identity = 125/454 (27.53%), Positives = 227/454 (50.00%), Query Frame = 0

Query: 19  VIEVQENLNKLGKSVLATDENTFNRSIYRIPTFMREVHPKAFEPQLVSFGPYHHGKPQYA 78
           VI +++ + ++ +    T  +     IYR+P  +++    ++ PQ VS GPYHHG     
Sbjct: 91  VIWIKDKMEQVMRDAATTSWDKI--CIYRVPLSLKKSDKNSYFPQAVSLGPYHHGDEHLR 150

Query: 79  SMELEKQKAFRR-LKTNPEMLESIVQTVTDNLQNLLGAYDKLIEGEWAKRPAKFLEVLIV 138
            M+  K +A    +K   + +E  +  + +  +     Y    EG       KF ++L++
Sbjct: 151 PMDYHKWRAVNMVMKRTKQGIEMYIDAMKELEERARACY----EGPIGLSSNKFTQMLVL 210

Query: 139 DGCFMLSFLKKCPPSLSSMSWD--------------IKRDMLLLENQLPMLLLDDLYSI- 198
           DGCF+L   +      S + +D              I+RDML+LENQLP+ +L+ L  + 
Sbjct: 211 DGCFVLDLFRGAYEGFSKLGYDRNDPVFAMRGSMHSIRRDMLMLENQLPLFVLNRLLELQ 270

Query: 199 --LPNMDGRLAWFICK---------------------SMCFASGEAVSMGGNLHILDMYR 258
                  G +A    +                     +  F +  A      LH LD++R
Sbjct: 271 LGTQYQTGLVAQLAVRFFNPLMPTYMSSTKIENSQENNNKFFNPIADKEKEELHCLDVFR 330

Query: 259 MSLLGTTILENKDGSMERKTKK----SGPEYQVIRHATQLRDAGIDFQESGTKSLTDVSF 318
            SLL  ++  +   S  R ++K       + Q++   T+LR+AGI F+   +    D+ F
Sbjct: 331 RSLLQPSLKPDPRLSRSRWSRKPLVADKRQQQLLHCVTELREAGIKFKRRKSDRFWDIQF 390

Query: 319 DSKKGVLKIPQLVVDDDSEASLLNVMAFEKLHEEAGSEVTSFVVLMNNLIDVDEDVALLS 378
             K G L+IP+L++ D +++   N++A+E+ H ++ +++TS+++ M+NLID  ED+  L 
Sbjct: 391 --KNGCLEIPKLLIHDGTKSLFSNLIAYEQCHIDSTNDITSYIIFMDNLIDSAEDIRYLH 450

Query: 379 SKNILANALGDDQSAAELFGSLGKGAAMDLE-SHITEVHHQVNLHCRRPWNEWCASLKHN 429
             +I+ + LG+D   A++F  L +  A DLE ++++E+ ++V+ +  R WN   A+LKH 
Sbjct: 451 YYDIIEHWLGNDSEVADVFNRLCQEVAFDLENTYLSELSNKVDRYYNRKWNVLKATLKHK 510
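The summary lines that head each HSP above (bit score, E-value, identity and positives counts) can be read programmatically. The following is a minimal sketch, assuming the two header lines keep the layout shown in this record; the function name `parse_hsp_header` and the returned dictionary keys are hypothetical and not part of any BLAST tool. The percentages are recomputed from the counts so they can be compared with the printed values.

```python
import re

# Patterns for the two HSP header lines as they appear in this record.
# "Posi?tives" tolerates the "Postives" spelling seen in some BLAST output.
HSP_SCORE = re.compile(
    r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)"
)
HSP_IDENT = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)


def parse_hsp_header(score_line, identity_line):
    """Return the HSP statistics as a dict (hypothetical helper)."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognized HSP header layout")
    identical, aligned = int(i.group(1)), int(i.group(2))
    positives = int(i.group(3))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        # Recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * identical / aligned, 2),
        "positives_pct": round(100 * positives / aligned, 2),
    }


hsp = parse_hsp_header(
    "HSP 1 Score: 203.4 bits (516), Expect = 2.9e-52",
    "Identity = 131/430 (30.47%), Positives = 224/430 (52.09%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Recomputing the percentages rather than trusting the printed ones gives a cheap consistency check when many records are processed in bulk.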

The following BLAST results are available for this feature:
Match Name                        E-value    Identity (%)  Description
XP_023004238.1                    3.8e-111   52.95         UPF0481 protein At3g47200-like isoform X1 [Cucurbita maxima]
XP_023004239.1                    1.5e-110   52.95         UPF0481 protein At3g47200-like isoform X2 [Cucurbita maxima]
XP_023513986.1                    8.0e-109   52.85         UPF0481 protein At3g47200-like isoform X1 [Cucurbita pepo subsp. pepo]
XP_023513987.1                    3.0e-108   52.52         UPF0481 protein At3g47200-like isoform X2 [Cucurbita pepo subsp. pepo]
XP_022960454.1                    7.5e-107   52.16         UPF0481 protein At3g47200-like [Cucurbita moschata]

Match Name                        E-value    Identity (%)  Description
tr|A0A1S4E2L7|A0A1S4E2L7_CUCME    2.2e-99    51.02         uncharacterized protein LOC103499077 OS=Cucumis melo OX=3656 GN=LOC103499077 PE=...
tr|E5GB49|E5GB49_CUCME            2.2e-99    51.02         Uncharacterized protein OS=Cucumis melo subsp. melo OX=412675 PE=4 SV=1
tr|A0A067JUP9|A0A067JUP9_JATCU    7.9e-81    41.76         Uncharacterized protein OS=Jatropha curcas OX=180498 GN=JCGZ_23062 PE=4 SV=1
tr|A0A0A0K9F5|A0A0A0K9F5_CUCSA    2.3e-80    45.98         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G089270 PE=4 SV=1
tr|A0A2C9W374|A0A2C9W374_MANES    8.2e-78    40.37         Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_04G166100 PE=4 SV=...

Match Name                        E-value    Identity (%)  Description
sp|Q9SD53|Y3720_ARATH             7.0e-32    26.49         UPF0481 protein At3g47200 OS=Arabidopsis thaliana OX=3702 GN=At3g47200 PE=2 SV=1
sp|P0C897|Y3264_ARATH             8.6e-14    27.01         Putative UPF0481 protein At3g02645 OS=Arabidopsis thaliana OX=3702 GN=At3g02645 ...

Match Name                        E-value    Identity (%)  Description
AT3G50170.1                       8.9e-54    29.98         Plant protein of unknown function (DUF247)
AT3G50160.1                       5.8e-53    29.49         Plant protein of unknown function (DUF247)
AT3G50120.1                       2.9e-52    30.47         Plant protein of unknown function (DUF247)
AT3G50150.1                       2.7e-50    28.79         Plant protein of unknown function (DUF247)
AT3G50140.1                       1.1e-46    27.53         Plant protein of unknown function (DUF247)
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR004158    DUF247_pln
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
cellular_component    GO:0005575        cellular_component
cellular_component    GO:0016021        integral component of membrane
cellular_component    GO:0016020        membrane
molecular_function    GO:0003674        molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cla97C06G110940.1    Cla97C06G110940.1    mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term     IPR Description                            Source     Source Term       Source Description     Alignment
IPR004158    Protein of unknown function DUF247, plant  PFAM       PF03140           DUF247                 coord: 45..413; e-value: 1.9E-94; score: 317.1
None         No IPR available                           PANTHER    PTHR31549:SF29    SUBFAMILY NOT NAMED    coord: 14..427
None         No IPR available                           PANTHER    PTHR31549         FAMILY NOT NAMED       coord: 14..427
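
The alignment coordinates in the InterPro table above can be turned into a domain length. This is a small sketch, assuming the `coord: start..end` values are 1-based and inclusive on the protein sequence (the page itself does not state this); the helper name `domain_span` is hypothetical.

```python
import re

# Matches the "coord: start..end" fragment of the Alignment column.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_span(alignment_field):
    """Return (start, end, length) for one Alignment entry.

    Length assumes 1-based inclusive coordinates, which is an
    assumption and is not stated in the record itself.
    """
    m = COORD.search(alignment_field)
    if m is None:
        raise ValueError("no 'coord: start..end' found")
    start, end = int(m.group(1)), int(m.group(2))
    return start, end, end - start + 1


# PFAM PF03140 (DUF247) and the PANTHER family match from the table.
print(domain_span("coord: 45..413"))
print(domain_span("coord: 14..427"))
```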