Cla97C01G004740 (gene) Watermelon (97103) v2

Name: Cla97C01G004740
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: UPF0496 protein 3
Location: Cla97Chr01 : 4537563 .. 4539806 (+)
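
The Location line can be turned into a chromosome, a coordinate pair, a strand, and a span length. The following is a minimal Python sketch, not part of the database record. It assumes the coordinates are 1-based and inclusive (the GFF3 convention), and the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a string such as 'Cla97Chr01 : 4537563 .. 4539806 (+)'."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return chrom, start, end, strand, end - start + 1

chrom, start, end, strand, length = parse_location(
    "Cla97Chr01 : 4537563 .. 4539806 (+)"
)
print(chrom, strand, length)
```

Under that assumption the gene spans 2244 bp on the plus strand, intron included.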
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAAATCAAAGATTGTTTCATGTCTCAGAAAATTCCTTTCATGCTCTGGTAATTTCTAATTACCAATATTATCCTCATTTTCGATTTAAATACTTCTTTTTTGGTTTAATTGCTAAATATATCTTTTAGTATTTTTCAATTTGGTTAAAGGAAGATTCAATAAATCTTGTTTATTATTGTTAAATATTTTTTCAATTTTGTTTGTTATTATTAACATTTTGTCTTTTGTTTTTTCTTTCAATTATGTGATAATGTAAATTAAAATTTTAGGATAAAATAAACAATTTTTGTTATCATAATTTTAACTTATCTTACCAGAAGCTTAAATTCTTATTCTTATTCTTATTCAAATTCTAAAAATAAAAGTAAATTTTTGTAGATTATTTTTTTAATTAAAATTTTTTTTTTGAAAACATAAATAGAAAAAGTAGATAACAAAACATGATTGTCAAGTAGGTAGAGGTAGTGTTTATAAGTTTAATTCTTAAAATAAAAAATTTAAAATTTTAATAGTTAACAAATGCAGTGTGAAGTAATTTAATTTCTATTTTGTTCTTATTGTAAAAGTTTGATTAAAAAGGATATTGCTAGCGAAGGGATTTCTTTCTAATAAAGTAAAAAACGAATAGTGTAATTTTCAAAAATTCAAAACATAAAACTTAAAAGAAAGGAATGGAACAACTATAGCTAGACATTTGTATAATTTAATCTATTTTTTTAATTTTAAAAAAAAAACATAATTACTTTATTTAAAAGTTGTTTCCTACGAAGCTAAAATCATGCCATTCAAATTACAGGTTTTCAAATTTTTACTTAAATTAATATTTAACTTTATAATTCATATTTTATAAAAAATTAAACTTACTAAAACATGTATATTAAAGCATGTTTGAGTAAAAAATAGGTCCTAATAATTAAGGTGGTAATCACACTTAATCTTTAATTTTGTTGGCCAAAACTCCATAATCCATAAAATATGACATTACACATTTTTCTTTGTTTTTTTTTTTATTTTTTTTATTTTACAATATGTGGATGGAGGAACTGAACTTCTGACTTTAAGATTGATAGTACATATACTATGTCGGTTGAGTTATGCTCATTTTCGTCTATGATATTACACTTTATTTTGTACAATACATTTTTATTTATTCTTTACATGTTGAATAGACGGTGGGCCATACATTACAAGTCATCCAGTAGACATAGATGTGCGGGAGGAGTACGCGAATGCTTTCCGCACCGAATCATACATCGACTTTTGGACACGTGTCCTTGCCTTAAACAATGGAGACGACCTCATAACCCAACTTTCAGTAGAGTCCACAACTTCAACTCGCCTCTCATCTTATAGGCTATTTGTTGAACATCTACTCGACCCACCTCAATCCACAATTAAAAGAATCCTAACTTCGCCCCATATTGGACTCGATTCCTACTCCCTTCTCTTAGATTACTTCTCTCATACCGCCAATGCCTCTCTCTTATGTAGTCGTTTACTAAAACACATTGACCACTTACGTCTCAAACTCCATCCCTTAGAAATCAACCTTCAGTCCTTAGAAAACAAAGAAGAATTTCATCATGAGTCTCACTTCAAACAACTCTTAATTCGTTTGGTAGAATTCTCCAACACCCACAACTCATTTATACCATCTACAGAACAAGTTCAAATCATCCAAAATGGATGCTCAAAATTGTTAAAGCGACTCGAGTTTAGCCGTGACAAGGCTCAAGCAAAACTCAAGAGAGTTAGATACTTTCAACACAGTTCAGCTGGCTTCTTGGTGGCTATAACCGCATCACTTACTATAATAATTGTGACTCATGGAATTGCATTATTCGTTGCTGCACCTGGCTTCCTTGTGGGTGCTATAAAGTTGGCTAATAGGTCGAGGAAGCTAGCTAAGAAAGTTGCTCGACTCAACGTTACTGCCAAAGGGACTTACACTTTGAATAGGGATTTCGATACAATTGGTAGGCTCGTGGCTCGATT
GAGTCATGAGCTCGAACATATGAGAGTGATGGCAAGATTTTGGCTTGACAAAGGAGAAGATAGACACAGAGCCATTGATGAACTGATGCGTCAGTTAAATCAAACTCATGTGAACTTTAGCCAACAATTAGATGAGCTTGAGGAGCATTTGTATTTGTGTTTTATGACCATAAATCGAGCTAGAAATCTTGTAGTGAAAGAGATTTTGGATTCGGGTCAACCTATAAAGATTTCGTATTTATGA

mRNA sequence

ATGAAGAAATCAAAGATTGTTTCATGTCTCAGAAAATTCCTTTCATGCTCTGACGGTGGGCCATACATTACAAGTCATCCAGTAGACATAGATGTGCGGGAGGAGTACGCGAATGCTTTCCGCACCGAATCATACATCGACTTTTGGACACGTGTCCTTGCCTTAAACAATGGAGACGACCTCATAACCCAACTTTCAGTAGAGTCCACAACTTCAACTCGCCTCTCATCTTATAGGCTATTTGTTGAACATCTACTCGACCCACCTCAATCCACAATTAAAAGAATCCTAACTTCGCCCCATATTGGACTCGATTCCTACTCCCTTCTCTTAGATTACTTCTCTCATACCGCCAATGCCTCTCTCTTATGTAGTCGTTTACTAAAACACATTGACCACTTACGTCTCAAACTCCATCCCTTAGAAATCAACCTTCAGTCCTTAGAAAACAAAGAAGAATTTCATCATGAGTCTCACTTCAAACAACTCTTAATTCGTTTGGTAGAATTCTCCAACACCCACAACTCATTTATACCATCTACAGAACAAGTTCAAATCATCCAAAATGGATGCTCAAAATTGTTAAAGCGACTCGAGTTTAGCCGTGACAAGGCTCAAGCAAAACTCAAGAGAGTTAGATACTTTCAACACAGTTCAGCTGGCTTCTTGGTGGCTATAACCGCATCACTTACTATAATAATTGTGACTCATGGAATTGCATTATTCGTTGCTGCACCTGGCTTCCTTGTGGGTGCTATAAAGTTGGCTAATAGGTCGAGGAAGCTAGCTAAGAAAGTTGCTCGACTCAACGTTACTGCCAAAGGGACTTACACTTTGAATAGGGATTTCGATACAATTGGTAGGCTCGTGGCTCGATTGAGTCATGAGCTCGAACATATGAGAGTGATGGCAAGATTTTGGCTTGACAAAGGAGAAGATAGACACAGAGCCATTGATGAACTGATGCGTCAGTTAAATCAAACTCATGTGAACTTTAGCCAACAATTAGATGAGCTTGAGGAGCATTTGTATTTGTGTTTTATGACCATAAATCGAGCTAGAAATCTTGTAGTGAAAGAGATTTTGGATTCGGGTCAACCTATAAAGATTTCGTATTTATGA

Coding sequence (CDS)

ATGAAGAAATCAAAGATTGTTTCATGTCTCAGAAAATTCCTTTCATGCTCTGACGGTGGGCCATACATTACAAGTCATCCAGTAGACATAGATGTGCGGGAGGAGTACGCGAATGCTTTCCGCACCGAATCATACATCGACTTTTGGACACGTGTCCTTGCCTTAAACAATGGAGACGACCTCATAACCCAACTTTCAGTAGAGTCCACAACTTCAACTCGCCTCTCATCTTATAGGCTATTTGTTGAACATCTACTCGACCCACCTCAATCCACAATTAAAAGAATCCTAACTTCGCCCCATATTGGACTCGATTCCTACTCCCTTCTCTTAGATTACTTCTCTCATACCGCCAATGCCTCTCTCTTATGTAGTCGTTTACTAAAACACATTGACCACTTACGTCTCAAACTCCATCCCTTAGAAATCAACCTTCAGTCCTTAGAAAACAAAGAAGAATTTCATCATGAGTCTCACTTCAAACAACTCTTAATTCGTTTGGTAGAATTCTCCAACACCCACAACTCATTTATACCATCTACAGAACAAGTTCAAATCATCCAAAATGGATGCTCAAAATTGTTAAAGCGACTCGAGTTTAGCCGTGACAAGGCTCAAGCAAAACTCAAGAGAGTTAGATACTTTCAACACAGTTCAGCTGGCTTCTTGGTGGCTATAACCGCATCACTTACTATAATAATTGTGACTCATGGAATTGCATTATTCGTTGCTGCACCTGGCTTCCTTGTGGGTGCTATAAAGTTGGCTAATAGGTCGAGGAAGCTAGCTAAGAAAGTTGCTCGACTCAACGTTACTGCCAAAGGGACTTACACTTTGAATAGGGATTTCGATACAATTGGTAGGCTCGTGGCTCGATTGAGTCATGAGCTCGAACATATGAGAGTGATGGCAAGATTTTGGCTTGACAAAGGAGAAGATAGACACAGAGCCATTGATGAACTGATGCGTCAGTTAAATCAAACTCATGTGAACTTTAGCCAACAATTAGATGAGCTTGAGGAGCATTTGTATTTGTGTTTTATGACCATAAATCGAGCTAGAAATCTTGTAGTGAAAGAGATTTTGGATTCGGGTCAACCTATAAAGATTTCGTATTTATGA

Protein sequence

MKKSKIVSCLRKFLSCSDGGPYITSHPVDIDVREEYANAFRTESYIDFWTRVLALNNGDDLITQLSVESTTSTRLSSYRLFVEHLLDPPQSTIKRILTSPHIGLDSYSLLLDYFSHTANASLLCSRLLKHIDHLRLKLHPLEINLQSLENKEEFHHESHFKQLLIRLVEFSNTHNSFIPSTEQVQIIQNGCSKLLKRLEFSRDKAQAKLKRVRYFQHSSAGFLVAITASLTIIIVTHGIALFVAAPGFLVGAIKLANRSRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELEHMRVMARFWLDKGEDRHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVVKEILDSGQPIKISYL
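
The protein sequence follows from the coding sequence by codon-wise translation. The sketch below is an illustration only, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical, and only the first 60 and last 15 nucleotides of the CDS above are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAAGAAATCAAAGATTGTTTCATGTCTCAGAAAATTCCTTTCATGCTCTGACGGTGGG"
cds_end = "ATTTCGTATTTATGA"
print(translate(cds_start))  # MKKSKIVSCLRKFLSCSDGG
print(translate(cds_end))    # ISYL
```

The two outputs match the first 20 and last 4 residues of the protein sequence listed above; the final TGA codon is the stop.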
BLAST of Cla97C01G004740 vs. NCBI nr
Match: XP_008436996.1 (PREDICTED: UPF0496 protein At3g49070 isoform X1 [Cucumis melo])

HSP 1 Score: 569.3 bits (1466), Expect = 9.5e-159
Identity = 303/378 (80.16%), Positives = 337/378 (89.15%), Query Frame = 0

Query: 1   MKKSKIVSCLRKFLSCSDGGPYITSHPVDIDVREEYANAFRTESYIDFWTRVLALNNGDD 60
           MKK KI+SCLRKFLSCSDG  ++T++PVD DV EEYANAFRTESYIDFWTRV+ALNNGD+
Sbjct: 1   MKKLKIISCLRKFLSCSDGEQHVTNYPVDTDVGEEYANAFRTESYIDFWTRVVALNNGDN 60

Query: 61  LITQLSVESTTSTRLSSYRLFVEHLLDPPQSTIKRILTSPHIGLDSYSLLLD-YFSHTAN 120
           L  Q+S+ESTT+TRLSSYRLFVEHLLDPPQ TIKR+LT+ H+G +S SLLLD YFSHT+N
Sbjct: 61  LTAQVSLESTTATRLSSYRLFVEHLLDPPQPTIKRMLTA-HLGPNSCSLLLDHYFSHTSN 120

Query: 121 ASLLCSRLLKHIDHLRLKLHPLEINLQSLENKEEFHH-ESHFKQLLIRLVEFSN--THNS 180
           ASLLCSR+LKHI HLRLKL  L+      +NK+EF++ +SHFKQLL+ LVEFSN  T NS
Sbjct: 121 ASLLCSRILKHIVHLRLKLRSLD------QNKQEFNYDDSHFKQLLVCLVEFSNNSTPNS 180

Query: 181 FIP-STEQVQIIQNGCSKLLKRLEFSRDKAQAKLKRVRYFQHSSAGFLVAITASLTIIIV 240
           F+P   EQVQIIQNGCSKLLKRLE+SRDKAQ KLKRVRYFQHSSAGFLVAITAS TII+V
Sbjct: 181 FVPYCMEQVQIIQNGCSKLLKRLEYSRDKAQDKLKRVRYFQHSSAGFLVAITASFTIIVV 240

Query: 241 THGIALFVAAPGFLVGAIKLANRSRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSH 300
           THGIALFVAAPGFLVGAIKL  +SRKLAKKVA+LNV AKGTYTLNRDFDTIGRLVARLSH
Sbjct: 241 THGIALFVAAPGFLVGAIKLVKKSRKLAKKVAQLNVAAKGTYTLNRDFDTIGRLVARLSH 300

Query: 301 ELEHMRVMARFWLDKGEDRHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARN 360
           ELEHMRVM +FWLD+GED+  AI EL+RQLNQ+H NF+QQLDELEEHLYLCFMTINRARN
Sbjct: 301 ELEHMRVMTKFWLDRGEDKRWAIGELLRQLNQSHENFNQQLDELEEHLYLCFMTINRARN 360

Query: 361 LVVKEILDSGQPIKISYL 374
           LVVKEILDS  PIKISYL
Sbjct: 361 LVVKEILDSSAPIKISYL 371
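
The percentages on each HSP summary line are the identical (or positive-scoring) positions divided by the alignment length, gaps included. The sketch below recomputes them from the counts as a consistency check. It is an illustration under assumptions: the function name `parse_hsp_summary` is hypothetical, and the pattern accepts the positives label with or without its second "i" so that it works on the page text in either spelling.

```python
import re

HSP_PATTERN = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_summary(line):
    """Extract counts from an HSP summary line and recompute the percentages."""
    m = HSP_PATTERN.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, length = int(m.group(1)), int(m.group(2))
    positives = int(m.group(4))
    return {
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
        "reported_identity_pct": float(m.group(3)),
        "reported_positives_pct": float(m.group(6)),
    }

hsp = parse_hsp_summary(
    "Identity = 303/378 (80.16%), Postives = 337/378 (89.15%), Query Frame = 0"
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

For the Cucumis melo hit above, 303/378 and 337/378 reproduce the reported 80.16% and 89.15%.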

BLAST of Cla97C01G004740 vs. NCBI nr
Match: XP_004143745.2 (PREDICTED: UPF0496 protein At3g49070, partial [Cucumis sativus])

HSP 1 Score: 535.8 bits (1379), Expect = 1.2e-148
Identity = 286/361 (79.22%), Positives = 320/361 (88.64%), Query Frame = 0

Query: 18  DGGPYITSHPVDIDVREEYANAFRTESYIDFWTRVLALNNGDDLITQLSVESTTSTRLSS 77
           DGG ++ ++PVD DV EEYANAFRTESYIDFWTRV+ALNNGD+L  Q+S+ESTT+TRLSS
Sbjct: 2   DGGQHVINYPVDTDVGEEYANAFRTESYIDFWTRVVALNNGDNLTAQVSLESTTATRLSS 61

Query: 78  YRLFVEHLLDPPQSTIKRILTSPHIGLDSYSLLLD-YFSHTANASLLCSRLLKHIDHLRL 137
           YRLFVEHLLDPPQ TIK ILT+ H+G +S SLLLD YFSHTANASLLCSR+LK I HLRL
Sbjct: 62  YRLFVEHLLDPPQPTIKTILTA-HLGPNSCSLLLDHYFSHTANASLLCSRILKQIVHLRL 121

Query: 138 KLHPLEINLQSLENKEEFHH-ESHFKQLLIRLVEFSN--THNSFIP-STEQVQIIQNGCS 197
           KLH L+      +NK+EF+H +SHFKQLL+RL EFSN  T NSF+P   EQVQIIQNGCS
Sbjct: 122 KLHSLD------QNKQEFNHDDSHFKQLLVRLFEFSNDSTPNSFVPYCMEQVQIIQNGCS 181

Query: 198 KLLKRLEFSRDKAQAKLKRVRYFQHSSAGFLVAITASLTIIIVTHGIALFVAAPGFLVGA 257
           KLLKRLE+SRDK + KLKRVRYFQHSSAGFLVAITAS T+I+VTHGIALFVAAPGFLVGA
Sbjct: 182 KLLKRLEYSRDKTRDKLKRVRYFQHSSAGFLVAITASFTVIVVTHGIALFVAAPGFLVGA 241

Query: 258 IKLANRSRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELEHMRVMARFWLDKGE 317
           IKLA +SRKLAK+VA+LNV AKGTYTLNRDFDTIGRLVARLSHELEHM+VMA+FWLDKG 
Sbjct: 242 IKLAKKSRKLAKEVAQLNVAAKGTYTLNRDFDTIGRLVARLSHELEHMKVMAKFWLDKGG 301

Query: 318 DRHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVVKEILDSGQPIKISY 374
           D+  AIDEL RQLNQ+H NF+QQLDELEEHLYLCFMTINRARNLVVKEIL+  +PIKISY
Sbjct: 302 DKRWAIDELARQLNQSHENFNQQLDELEEHLYLCFMTINRARNLVVKEILNLSKPIKISY 355

BLAST of Cla97C01G004740 vs. NCBI nr
Match: KGN50304.1 (hypothetical protein Csa_5G166480 [Cucumis sativus])

HSP 1 Score: 533.5 bits (1373), Expect = 5.8e-148
Identity = 285/360 (79.17%), Positives = 319/360 (88.61%), Query Frame = 0

Query: 19  GGPYITSHPVDIDVREEYANAFRTESYIDFWTRVLALNNGDDLITQLSVESTTSTRLSSY 78
           GG ++ ++PVD DV EEYANAFRTESYIDFWTRV+ALNNGD+L  Q+S+ESTT+TRLSSY
Sbjct: 4   GGQHVINYPVDTDVGEEYANAFRTESYIDFWTRVVALNNGDNLTAQVSLESTTATRLSSY 63

Query: 79  RLFVEHLLDPPQSTIKRILTSPHIGLDSYSLLLD-YFSHTANASLLCSRLLKHIDHLRLK 138
           RLFVEHLLDPPQ TIK ILT+ H+G +S SLLLD YFSHTANASLLCSR+LK I HLRLK
Sbjct: 64  RLFVEHLLDPPQPTIKTILTA-HLGPNSCSLLLDHYFSHTANASLLCSRILKQIVHLRLK 123

Query: 139 LHPLEINLQSLENKEEFHH-ESHFKQLLIRLVEFSN--THNSFIP-STEQVQIIQNGCSK 198
           LH L+      +NK+EF+H +SHFKQLL+RL EFSN  T NSF+P   EQVQIIQNGCSK
Sbjct: 124 LHSLD------QNKQEFNHDDSHFKQLLVRLFEFSNDSTPNSFVPYCMEQVQIIQNGCSK 183

Query: 199 LLKRLEFSRDKAQAKLKRVRYFQHSSAGFLVAITASLTIIIVTHGIALFVAAPGFLVGAI 258
           LLKRLE+SRDK + KLKRVRYFQHSSAGFLVAITAS T+I+VTHGIALFVAAPGFLVGAI
Sbjct: 184 LLKRLEYSRDKTRDKLKRVRYFQHSSAGFLVAITASFTVIVVTHGIALFVAAPGFLVGAI 243

Query: 259 KLANRSRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELEHMRVMARFWLDKGED 318
           KLA +SRKLAK+VA+LNV AKGTYTLNRDFDTIGRLVARLSHELEHM+VMA+FWLDKG D
Sbjct: 244 KLAKKSRKLAKEVAQLNVAAKGTYTLNRDFDTIGRLVARLSHELEHMKVMAKFWLDKGGD 303

Query: 319 RHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVVKEILDSGQPIKISYL 374
           +  AIDEL RQLNQ+H NF+QQLDELEEHLYLCFMTINRARNLVVKEIL+  +PIKISYL
Sbjct: 304 KRWAIDELARQLNQSHENFNQQLDELEEHLYLCFMTINRARNLVVKEILNLSKPIKISYL 356

BLAST of Cla97C01G004740 vs. NCBI nr
Match: XP_022159588.1 (UPF0496 protein At3g49070 [Momordica charantia])

HSP 1 Score: 526.9 bits (1356), Expect = 5.4e-146
Identity = 275/376 (73.14%), Positives = 320/376 (85.11%), Query Frame = 0

Query: 1   MKKSKIVSCLRKFLSCSDGGPYITSHPVDIDVREEYANAFRTESYIDFWTRVLALNNGDD 60
           M+K KIVS L KFLSCSDG P+ITS+P+ +DVREEYANAFRTESYIDFWTRVL LN+G D
Sbjct: 1   MRKFKIVSRLIKFLSCSDGEPHITSYPIGVDVREEYANAFRTESYIDFWTRVLTLNDG-D 60

Query: 61  LITQLSVESTTSTRLSSYRLFVEHLLDPPQSTIKRILTSPHIGLDSYSLLLDYFSHTANA 120
           L TQL VESTT+TRLSSYRLF EHLLDP Q T+KR+L S ++  +SYSLL DYFSHTANA
Sbjct: 61  LTTQLPVESTTATRLSSYRLFAEHLLDPTQPTVKRLLASTYLRPNSYSLLTDYFSHTANA 120

Query: 121 SLLCSRLLKHIDHLRLKLHPLEINLQSLENKEEFHHESHFKQLLIRLVEFSNTHNSFI-- 180
           SLLC RLLK I +LRLKL PL+I L S+EN   F+H  +F+ +L  LVEFSN HN F+  
Sbjct: 121 SLLCGRLLKDIHNLRLKLRPLKITLHSIEN-TRFYHNYNFEPILTCLVEFSNAHNPFVSS 180

Query: 181 -PSTEQVQIIQNGCSKLLKRLEFSRDKAQAKLKRVRYFQHSSAGFLVAITASLTIIIVTH 240
            PST +++IIQ+GCS LLK+LEF+RDKA+AKL+RVRYFQH SAGFLVA+T SLT+I++TH
Sbjct: 181 APSTRRIRIIQSGCSNLLKQLEFNRDKARAKLRRVRYFQHGSAGFLVALTTSLTVIVMTH 240

Query: 241 GIALFVAAPGFLVGAIKLANRSRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHEL 300
           GIALFVAAPG LV +I+LA +SRKLAK+VA+LNV AKGTYTLNRDFDT+GRLVARLSHEL
Sbjct: 241 GIALFVAAPGLLVASIELA-KSRKLAKQVAQLNVAAKGTYTLNRDFDTVGRLVARLSHEL 300

Query: 301 EHMRVMARFWLDKGEDRHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLV 360
           EHMRVM RFWL++ EDR +AI E+ RQL Q H +FSQQLDELEEHLYLCFMTINRARNLV
Sbjct: 301 EHMRVMTRFWLERREDRRQAISEVARQLKQNHASFSQQLDELEEHLYLCFMTINRARNLV 360

Query: 361 VKEILDSGQPIKISYL 374
           VKEILD GQP K  ++
Sbjct: 361 VKEILDPGQPAKAPHV 373

BLAST of Cla97C01G004740 vs. NCBI nr
Match: XP_023534108.1 (UPF0496 protein At3g49070-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 391.3 bits (1004), Expect = 3.6e-105
Identity = 232/370 (62.70%), Positives = 273/370 (73.78%), Query Frame = 0

Query: 1   MKKSKIVSCLRKFLSCSDGGPYITSHPVDIDVREEYANAFRTESYIDFWTRVLALNNGDD 60
           MKKSK+VS LRKFL  SDGG +I S P+ +DVREEYANAFRTESY+DFWTRV+A      
Sbjct: 1   MKKSKLVSRLRKFL--SDGGVHIRSRPIGVDVREEYANAFRTESYLDFWTRVVA------ 60

Query: 61  LITQLSVESTTSTRLSSYRLFVEHLLDPPQSTIKRILTSPHIGLDSYSLLLDYFSHTANA 120
            I  + VESTT++RLSSYRLF EHLLDP + T+KRIL   H+  +S SLL DYFSHTANA
Sbjct: 61  -IKDVVVESTTASRLSSYRLFAEHLLDPTEPTVKRILNWAHLRPNSKSLLSDYFSHTANA 120

Query: 121 SLLCSRLLKHIDHLRLKLHPLEINLQSLENKEEFHHESHFKQLLIRLVEFSNTHNSFIPS 180
           SLLCSRLLK I+HLR      EI +QSL+N      E +FK L I        HN F   
Sbjct: 121 SLLCSRLLKDINHLR-----PEIAIQSLQNP-----EFNFKPLSI--------HNPF--- 180

Query: 181 TEQVQIIQNGCSKLLKRLEFSRDKAQAKLKRVRYFQHSSAGFLVAITASLTIIIVTHGIA 240
               +IIQNGC+KLLK+LEF+RDKA+ K+KRVRYFQHSSAG LVA+TASLT+I+VTHGI 
Sbjct: 181 ---PEIIQNGCAKLLKQLEFNRDKARTKVKRVRYFQHSSAGLLVAVTASLTVILVTHGIG 240

Query: 241 LFV--AAPGF--LVGAIKLANRSRKLAKKVAR-LNVTAKGTYTLNRDFDTIGRLVARLSH 300
           L V  AAPG   LVGA+KLA       K++AR L+V AK TYTLNRDFDT+GRLVARL+ 
Sbjct: 241 LVVVAAAPGLAGLVGAVKLAR-----VKEMARVLDVAAKATYTLNRDFDTVGRLVARLNQ 300

Query: 301 ELEHMRVMARFWLDKGE---DRHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINR 360
           E+EHMR + RFW++ GE    RH  + E+ RQL Q  +N  Q LDELEEHLYLCFMTINR
Sbjct: 301 EVEHMRGLMRFWVELGEGRDGRHGGVGEVARQLQQCRLNLGQHLDELEEHLYLCFMTINR 332

Query: 361 ARNLVVKEIL 363
           ARNLV+K+IL
Sbjct: 361 ARNLVLKQIL 332

BLAST of Cla97C01G004740 vs. TrEMBL
Match: tr|A0A1S3ATJ2|A0A1S3ATJ2_CUCME (UPF0496 protein At3g49070 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103482559 PE=4 SV=1)

HSP 1 Score: 569.3 bits (1466), Expect = 6.3e-159
Identity = 303/378 (80.16%), Positives = 337/378 (89.15%), Query Frame = 0

Query: 1   MKKSKIVSCLRKFLSCSDGGPYITSHPVDIDVREEYANAFRTESYIDFWTRVLALNNGDD 60
           MKK KI+SCLRKFLSCSDG  ++T++PVD DV EEYANAFRTESYIDFWTRV+ALNNGD+
Sbjct: 1   MKKLKIISCLRKFLSCSDGEQHVTNYPVDTDVGEEYANAFRTESYIDFWTRVVALNNGDN 60

Query: 61  LITQLSVESTTSTRLSSYRLFVEHLLDPPQSTIKRILTSPHIGLDSYSLLLD-YFSHTAN 120
           L  Q+S+ESTT+TRLSSYRLFVEHLLDPPQ TIKR+LT+ H+G +S SLLLD YFSHT+N
Sbjct: 61  LTAQVSLESTTATRLSSYRLFVEHLLDPPQPTIKRMLTA-HLGPNSCSLLLDHYFSHTSN 120

Query: 121 ASLLCSRLLKHIDHLRLKLHPLEINLQSLENKEEFHH-ESHFKQLLIRLVEFSN--THNS 180
           ASLLCSR+LKHI HLRLKL  L+      +NK+EF++ +SHFKQLL+ LVEFSN  T NS
Sbjct: 121 ASLLCSRILKHIVHLRLKLRSLD------QNKQEFNYDDSHFKQLLVCLVEFSNNSTPNS 180

Query: 181 FIP-STEQVQIIQNGCSKLLKRLEFSRDKAQAKLKRVRYFQHSSAGFLVAITASLTIIIV 240
           F+P   EQVQIIQNGCSKLLKRLE+SRDKAQ KLKRVRYFQHSSAGFLVAITAS TII+V
Sbjct: 181 FVPYCMEQVQIIQNGCSKLLKRLEYSRDKAQDKLKRVRYFQHSSAGFLVAITASFTIIVV 240

Query: 241 THGIALFVAAPGFLVGAIKLANRSRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSH 300
           THGIALFVAAPGFLVGAIKL  +SRKLAKKVA+LNV AKGTYTLNRDFDTIGRLVARLSH
Sbjct: 241 THGIALFVAAPGFLVGAIKLVKKSRKLAKKVAQLNVAAKGTYTLNRDFDTIGRLVARLSH 300

Query: 301 ELEHMRVMARFWLDKGEDRHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARN 360
           ELEHMRVM +FWLD+GED+  AI EL+RQLNQ+H NF+QQLDELEEHLYLCFMTINRARN
Sbjct: 301 ELEHMRVMTKFWLDRGEDKRWAIGELLRQLNQSHENFNQQLDELEEHLYLCFMTINRARN 360

Query: 361 LVVKEILDSGQPIKISYL 374
           LVVKEILDS  PIKISYL
Sbjct: 361 LVVKEILDSSAPIKISYL 371

BLAST of Cla97C01G004740 vs. TrEMBL
Match: tr|A0A0A0KN20|A0A0A0KN20_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G166480 PE=4 SV=1)

HSP 1 Score: 533.5 bits (1373), Expect = 3.8e-148
Identity = 285/360 (79.17%), Positives = 319/360 (88.61%), Query Frame = 0

Query: 19  GGPYITSHPVDIDVREEYANAFRTESYIDFWTRVLALNNGDDLITQLSVESTTSTRLSSY 78
           GG ++ ++PVD DV EEYANAFRTESYIDFWTRV+ALNNGD+L  Q+S+ESTT+TRLSSY
Sbjct: 4   GGQHVINYPVDTDVGEEYANAFRTESYIDFWTRVVALNNGDNLTAQVSLESTTATRLSSY 63

Query: 79  RLFVEHLLDPPQSTIKRILTSPHIGLDSYSLLLD-YFSHTANASLLCSRLLKHIDHLRLK 138
           RLFVEHLLDPPQ TIK ILT+ H+G +S SLLLD YFSHTANASLLCSR+LK I HLRLK
Sbjct: 64  RLFVEHLLDPPQPTIKTILTA-HLGPNSCSLLLDHYFSHTANASLLCSRILKQIVHLRLK 123

Query: 139 LHPLEINLQSLENKEEFHH-ESHFKQLLIRLVEFSN--THNSFIP-STEQVQIIQNGCSK 198
           LH L+      +NK+EF+H +SHFKQLL+RL EFSN  T NSF+P   EQVQIIQNGCSK
Sbjct: 124 LHSLD------QNKQEFNHDDSHFKQLLVRLFEFSNDSTPNSFVPYCMEQVQIIQNGCSK 183

Query: 199 LLKRLEFSRDKAQAKLKRVRYFQHSSAGFLVAITASLTIIIVTHGIALFVAAPGFLVGAI 258
           LLKRLE+SRDK + KLKRVRYFQHSSAGFLVAITAS T+I+VTHGIALFVAAPGFLVGAI
Sbjct: 184 LLKRLEYSRDKTRDKLKRVRYFQHSSAGFLVAITASFTVIVVTHGIALFVAAPGFLVGAI 243

Query: 259 KLANRSRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELEHMRVMARFWLDKGED 318
           KLA +SRKLAK+VA+LNV AKGTYTLNRDFDTIGRLVARLSHELEHM+VMA+FWLDKG D
Sbjct: 244 KLAKKSRKLAKEVAQLNVAAKGTYTLNRDFDTIGRLVARLSHELEHMKVMAKFWLDKGGD 303

Query: 319 RHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVVKEILDSGQPIKISYL 374
           +  AIDEL RQLNQ+H NF+QQLDELEEHLYLCFMTINRARNLVVKEIL+  +PIKISYL
Sbjct: 304 KRWAIDELARQLNQSHENFNQQLDELEEHLYLCFMTINRARNLVVKEILNLSKPIKISYL 356

BLAST of Cla97C01G004740 vs. TrEMBL
Match: tr|A0A061G259|A0A061G259_THECC (Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_015222 PE=4 SV=1)

HSP 1 Score: 380.2 bits (975), Expect = 5.4e-102
Identity = 212/367 (57.77%), Positives = 265/367 (72.21%), Query Frame = 0

Query: 3   KSKIVSCLRKFLSCSD-GGPYITSHPVDIDVREEYANAFRTESYIDFWTRVLALNNGDDL 62
           K +I +  RKFLSC+   G   +  P  IDVREEYANAFRTESY DFWTRVLA+++ D  
Sbjct: 141 KKRIGARFRKFLSCTGASGANSSVIPRSIDVREEYANAFRTESYNDFWTRVLAISHIDSA 200

Query: 63  ITQLSVESTTSTRLSSYRLFVEHLLDPPQSTIKRILTSPHIGLDSYSLLLDYFSHTANAS 122
                ++STT+ RLSSYRLF EHLLDP Q T+ RILT       ++SLLLDYFS TANAS
Sbjct: 201 TCISPIDSTTAARLSSYRLFAEHLLDPDQPTVSRILTLIQNRPTTHSLLLDYFSQTANAS 260

Query: 123 LLCSRLLKHIDHLRLKLHPLEINLQSLENKEEFHHESHFKQLLIRLVEFSNTHNSFI--- 182
           LLC  LLK IDH R+K      + Q+LE   +F + + F  ++ RL+EFSN+ N F+   
Sbjct: 261 LLCGLLLKDIDHTRVKYRSFRTSFQALE-IAQFSNGNQFLGIVTRLIEFSNSPNPFLSTA 320

Query: 183 PSTEQVQIIQNGCSKLLKRLEFSRDKAQAKLKRVRYFQHSSAGFLVAITASLTIIIVTHG 242
           PS+ +V++IQ GC  LL+RLE SRDKA+AKL  +   QH S  FLVA+TASLTII+ +H 
Sbjct: 321 PSSSRVRVIQAGCCDLLERLESSRDKARAKLHLINSLQHGSGVFLVALTASLTIIVASHA 380

Query: 243 IALFVAAPGFLVGAIKLANRSRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELE 302
           +AL VAAPG +  +++LA+  R+LA++ A+L+  AKGTY LNRD DTI RLVARL+ ELE
Sbjct: 381 LALVVAAPGLIAASLELAS-MRRLARESAQLDAAAKGTYILNRDLDTISRLVARLNDELE 440

Query: 303 HMRVMARFWLDKGEDRHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVV 362
            M  M +FWLD+GEDR +A  E+ RQL +   NF+QQLDELEEHLYLCFMTINRARNLVV
Sbjct: 441 DMSAMVKFWLDRGEDRLQASGEVARQLKKNDANFTQQLDELEEHLYLCFMTINRARNLVV 500

Query: 363 KEILDSG 366
           +EILD G
Sbjct: 501 REILDPG 505

BLAST of Cla97C01G004740 vs. TrEMBL
Match: tr|A0A1R3HN72|A0A1R3HN72_9ROSI (Uncharacterized protein OS=Corchorus olitorius OX=93759 GN=COLO4_28024 PE=4 SV=1)

HSP 1 Score: 375.9 bits (964), Expect = 1.0e-100
Identity = 210/369 (56.91%), Positives = 262/369 (71.00%), Query Frame = 0

Query: 3   KSKIVSCLRKFL-SCSDGGPYITSHPVDIDVREEYANAFRTESYIDFWTRVLALNNGDDL 62
           K KI + LRKFL SC+     +   P  +DVREEYANAFRTESY+DFWTRVLA++N +  
Sbjct: 2   KKKIRATLRKFLSSCTAPAANLAVIPNSVDVREEYANAFRTESYMDFWTRVLAISNTEFA 61

Query: 63  ITQLSVESTTSTRLSSYRLFVEHLLDPPQSTIKRILTSPHIGLDSYSLLLDYFSHTANAS 122
                ++STT+ RL SYRLF EHLLDP QST+ RIL        ++SLLLDYFS TANAS
Sbjct: 62  TCISPMDSTTAARLPSYRLFAEHLLDPDQSTVTRILNLAQNRPTTHSLLLDYFSQTANAS 121

Query: 123 LLCSRLLKHIDHLRLKLHPLEINLQSLENKEEFHHESHFKQLLIRLVEFSNTHNSFI--- 182
           LLC  LLK IDH RLK      + Q+L    +F +E+ F  ++  L+EFSN  N F+   
Sbjct: 122 LLCGLLLKVIDHTRLKYRSFRTSFQAL-GVVQFSNENQFSGIVTCLIEFSNCPNPFVSSS 181

Query: 183 PSTEQVQIIQNGCSKLLKRLEFSRDKAQAKLKRVRYFQHSSAGFLVAITASLTIIIVTHG 242
           PS+ +V+IIQ GC +LLKRLE SRDKA+AKL  V   QH S  FLVA+T SLTII+ +H 
Sbjct: 182 PSSNRVRIIQAGCCELLKRLETSRDKARAKLHLVNSLQHGSGIFLVALTTSLTIIVASHA 241

Query: 243 IALFVAAPGFLVGAIKLANRSRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELE 302
           +AL V APG +  + +LA+ SR+L ++  +L+  AKGTY LNRD DTI RLVARL+ E+E
Sbjct: 242 LALLVTAPGLIAASFELAS-SRRLFRESTQLDAAAKGTYILNRDLDTISRLVARLNDEVE 301

Query: 303 HMRVMARFWLDKGEDRHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVV 362
            MR M +FWL++GEDR +A  E+ R+L +   NF+QQLD+LEEHLYLCFMTINRARNLVV
Sbjct: 302 DMRAMVKFWLERGEDRLQASCEVARRLKKNDANFTQQLDDLEEHLYLCFMTINRARNLVV 361

Query: 363 KEILDSGQP 368
           KEILD G P
Sbjct: 362 KEILDPGPP 368

BLAST of Cla97C01G004740 vs. TrEMBL
Match: tr|V4S4G2|V4S4G2_9ROSI (Uncharacterized protein OS=Citrus clementina OX=85681 GN=CICLE_v10006929mg PE=4 SV=1)

HSP 1 Score: 352.8 bits (904), Expect = 9.3e-94
Identity = 198/367 (53.95%), Positives = 260/367 (70.84%), Query Frame = 0

Query: 3   KSKIVSCLRKFLSCSDGGPYITSHPVDIDVREEYANAFRTESYIDFWTRVLALNNGDDLI 62
           K +I + +RK LS +     I+S   D+DVREEYA+AFRTESY +FWTRVLAL+N D   
Sbjct: 2   KKRISARIRKCLSHAASSADISSQRTDVDVREEYAHAFRTESYNEFWTRVLALSNKDS-A 61

Query: 63  TQLSVESTTSTRLSSYRLFVEHLLDPPQSTIKRILTSPHIGLDSYSLLLDYFSHTANASL 122
             + VESTT+ RLSSYRLF E LLDP QST+ RIL      + + SL+ DYF+ TANASL
Sbjct: 62  KSIQVESTTAARLSSYRLFAEQLLDPDQSTVIRILDL----VKTPSLIFDYFTQTANASL 121

Query: 123 LCSRLLKHIDHLRLKLHPLEINLQSLENKEEFHHESHFKQLLIRLVEFSNTHNSF---IP 182
               LLK ID++R+K    ++ + SL+N       ++   ++IRL EFSN  N F    P
Sbjct: 122 FFGPLLKDIDNVRIKYRSFKVIVNSLQNAHAL-PINYVSSVVIRLTEFSNLSNPFASAAP 181

Query: 183 STEQVQIIQNGCSKLLKRLEFSRDKAQAKLKRVRYFQHSSAGFLVAITASLTIIIVTHGI 242
           +T + +++Q GC KLLK+LE SRDKA+A+L+ +   +H SA FL+AIT SLT+I+ +H +
Sbjct: 182 TTRRFRVVQAGCGKLLKQLESSRDKARARLQLINITKHGSATFLLAITISLTVIVASHAL 241

Query: 243 ALFVAAPGFLVGAIKLANRSRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELEH 302
           AL VAAPG +  +++LA+ +R+L +   +L+  AKGTY LNRD DTI RLVARL+ ELEH
Sbjct: 242 ALLVAAPGLIAASLELAS-TRRLVRVSTQLDAAAKGTYILNRDLDTISRLVARLNDELEH 301

Query: 303 MRVMARFWLDKGEDRHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVVK 362
           MR   +FWLD+GE R +A  E+ RQL +   +FSQQLDELEEHLYLCFMT+NRARNLV+K
Sbjct: 302 MRSTVKFWLDRGEARLQASGEVARQLLKNDASFSQQLDELEEHLYLCFMTVNRARNLVMK 361

Query: 363 EILDSGQ 367
           EILD GQ
Sbjct: 362 EILDPGQ 361

BLAST of Cla97C01G004740 vs. Swiss-Prot
Match: sp|Q9SMU4|U496N_ARATH (UPF0496 protein At3g49070 OS=Arabidopsis thaliana OX=3702 GN=At3g49070 PE=2 SV=1)

HSP 1 Score: 298.9 bits (764), Expect = 7.9e-80
Identity = 172/350 (49.14%), Positives = 219/350 (62.57%), Query Frame = 0

Query: 29  DIDVREEYANAFRTESYIDFWTRVLALNN-----GDDLITQLSVESTTSTRLSSYRLFVE 88
           D+DVREEYANAFRTESY  FWTRV+ L+                      RL SYRLF  
Sbjct: 68  DVDVREEYANAFRTESYNHFWTRVVQLSRXXXXXXXXXXXXXXXXXXXXXRLMSYRLFAH 127

Query: 89  HLLDPPQSTIKRILTSPHIGLDSYSLLLDYFSHTANASLLCSRLLKHIDHLRLKLHPLEI 148
           +LLDP  +TI RIL    +G  + +LL DYF  TANA LLC++LLK+I HLR K   L  
Sbjct: 128 NLLDPDLNTITRILDVSRVGRHTRTLLSDYFLETANAFLLCTQLLKNIHHLRSKYESL-- 187

Query: 149 NLQSLENKEEFHHESHFKQLLI-RLVEFSNTHNSFIPSTEQVQIIQNGCSKLLKRLEFSR 208
                  K +FH E+H    LI +  E S   + FI S  ++Q+I++GC  LLKRLE  R
Sbjct: 188 -------KPKFHSENHNSLALIDQFTEISKWFDPFISSGSRIQLIRSGCLYLLKRLESRR 247

Query: 209 DKAQAKLKRVRYFQHSSAGFLVAITASLTIIIVTHGIALFVAAPGFLVGAIKLANRSRKL 268
           DK +AKLK +    HSS   ++A+T +L + I +H  ALF+AAP  L    K A    KL
Sbjct: 248 DKTRAKLKLINGLTHSSGLLVLALTTTLIVTIASHAFALFLAAPTLLASQFKPAGLRNKL 307

Query: 269 AKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELEHMRVMARFWLDKGEDRHRAIDELM 328
            K  ARL+V AKGTY L+RD DTI RLV R++ E+ H+R MA FW+ +G  R R  +E+ 
Sbjct: 308 TKTAARLDVAAKGTYILSRDLDTISRLVTRINDEVNHVRAMAEFWVGRGSGRVRGSEEVA 367

Query: 329 RQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVVKEILDSGQPIKISY 373
           R+L +   +FS++LDELEEH+YLCFMTINRARNL+VKEILDS  P   S+
Sbjct: 368 RELKRCEESFSEELDELEEHIYLCFMTINRARNLLVKEILDSDDPPNCSF 408

BLAST of Cla97C01G004740 vs. Swiss-Prot
Match: sp|A2XCJ1|U496C_ORYSI (UPF0496 protein 3 OS=Oryza sativa subsp. indica OX=39946 GN=OsI_009784 PE=3 SV=2)

HSP 1 Score: 187.6 bits (475), Expect = 2.6e-46
Identity = 134/346 (38.73%), Positives = 188/346 (54.34%), Query Frame = 0

Query: 31  DVREEYANAFRTESYIDFWTRVL--ALNNGDDLITQLSVES--TTSTRLSSYRLFVEHLL 90
           D REEY +AFRTESY DFW RVL   L +G  L+ +         S RL SYRLF EHLL
Sbjct: 33  DFREEYTSAFRTESYNDFWARVLDITLAHGAALVPRHGGGGGCAASKRLPSYRLFAEHLL 92

Query: 91  DPPQSTIKRILTSP---HIGLDSYSLLLDYFSHTANASLLCSRLLKHIDHLRLKLHPLEI 150
           +P Q  +   L SP    +  D   LL  Y++ TANAS LCS LLK I+H+RL+  PL+ 
Sbjct: 93  EPDQRAVAAALASPRGSRLRPDVRGLLAAYYAETANASFLCSHLLKDIEHIRLRYRPLKH 152

Query: 151 NLQSLENKEEFHHESHFKQLLIRLVEFSNTHNSFIPSTEQVQIIQNGCSKLLKRLEFSRD 210
            L+ L +      +     L            +   S  +++ +Q G   LL+ L+  R 
Sbjct: 153 TLRKLAS------DVGVSGLADVSAALGQPFTALAASQGRLREVQAGSGDLLRGLDAGRK 212

Query: 211 KAQAKLKRVRYFQHS-SAGFL--VAITASLTIIIVTHGIALFVAAPGFLVGAIKLANR-- 270
           KA+ +++ V   + + S  F+  VA+ A +   I  H +A F A P  ++    L  R  
Sbjct: 213 KARHRIRSVARLRRALSVSFVTAVAVVAVVGACIGVHILAAFAAFP--MMSPAWLGERFF 272

Query: 271 -SRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELEHMRVMARFWLDK-----GE 330
             R   + + +L   AKGTY LNRD +TI RLVAR+  E EHM  + R  ++        
Sbjct: 273 SGRAARRALVQLEAAAKGTYILNRDMETISRLVARVRDEGEHMVALLRLCVEHRPAAGAG 332

Query: 331 DRHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVV 359
            + R + E++RQL++   +F QQLDELEEHL+LCFMTIN+AR +V+
Sbjct: 333 GKGRLVQEVLRQLSKNEESFRQQLDELEEHLFLCFMTINKARIMVM 370

BLAST of Cla97C01G004740 vs. Swiss-Prot
Match: sp|Q10RR9|U496C_ORYSJ (UPF0496 protein 3 OS=Oryza sativa subsp. japonica OX=39947 GN=Os03g0148000 PE=2 SV=1)

HSP 1 Score: 185.7 bits (470), Expect = 9.7e-46
Identity = 133/346 (38.44%), Positives = 187/346 (54.05%), Query Frame = 0

Query: 31  DVREEYANAFRTESYIDFWTRVL--ALNNGDDLITQLSVES--TTSTRLSSYRLFVEHLL 90
           D REEY +AFRTESY DFW RVL   L +G  L+ +         S RL SYRLF EHLL
Sbjct: 33  DFREEYTSAFRTESYNDFWARVLDITLAHGAALVPRHGGGGGCAASKRLPSYRLFAEHLL 92

Query: 91  DPPQSTIKRILTSP---HIGLDSYSLLLDYFSHTANASLLCSRLLKHIDHLRLKLHPLEI 150
           +P Q  +   L SP    +  D   LL  Y++ TANAS LCS LLK I+H+RL+  PL+ 
Sbjct: 93  EPDQRAVAAALASPRGSRLRPDVRGLLAAYYAETANASFLCSHLLKDIEHIRLRYRPLKH 152

Query: 151 NLQSLENKEEFHHESHFKQLLIRLVEFSNTHNSFIPSTEQVQIIQNGCSKLLKRLEFSRD 210
            L+ L +      +     L            +   S  +++ +Q G   LL+ L+  R 
Sbjct: 153 TLRKLAS------DVGVSGLADVSAALGQPFTALAASQGRLREVQAGSGDLLRGLDAGRK 212

Query: 211 KAQAKLKRVRYFQHS-SAGFL--VAITASLTIIIVTHGIALFVAAPGFLVGAIKLANR-- 270
           KA+ +++ V   + + S  F+  VA+ A +   I  H +A F A P  ++    L  R  
Sbjct: 213 KARHRIRSVARLRRALSVSFVTAVAVVAVVGACIGVHILAAFAAFP--MMSPAWLGERFF 272

Query: 271 -SRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELEHMRVMARFWLDK-----GE 330
             R   + + +L   AKGTY LNRD +TI RLVAR+  E EHM  + R  ++        
Sbjct: 273 SGRAARRALVQLEAAAKGTYILNRDMETISRLVARVRDEGEHMVALRRLCVEHRPAAGAG 332

Query: 331 DRHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVV 359
            + R + E++RQL++   +F QQLDELEEHL+LCFMT N+AR +V+
Sbjct: 333 GKGRLVQEVLRQLSKNEESFRQQLDELEEHLFLCFMTTNKARIMVM 370

BLAST of Cla97C01G004740 vs. Swiss-Prot
Match: sp|Q9LJK4|U496L_ARATH (UPF0496 protein At3g19250 OS=Arabidopsis thaliana OX=3702 GN=At3g19250 PE=2 SV=1)

HSP 1 Score: 112.8 bits (281), Expect = 8.0e-24
Identity = 96/349 (27.51%), Positives = 176/349 (50.43%), Query Frame = 0

Query: 31  DVREEYANAFRTESYIDFWTRVLALNNGDDLITQLSVESTTSTRLSSYRLFVEHLLDPPQ 90
           ++  E A+AF+T SY D  +R+L ++      TQ ++E           LF+   L P  
Sbjct: 35  NLSHELAHAFQTPSYHDIRSRLLVIDP-----TQENLE-----------LFLSQELRPNN 94

Query: 91  STIKRILTSPHIGLDSY-SLLLDYFSHTANASLLCSRLLKHIDHLRLKLHPLEINLQSLE 150
            +++  L+  H    +  +L+  YF H+ +A+  C  L +++   R  L+   ++L ++ 
Sbjct: 95  ESVQEALSLRHAKQTTLTNLVSTYFQHSEDATRFCLNLYQNVHSARCHLYTPLLDLFNI- 154

Query: 151 NKEEFHHESHFK----------QLLIRLVEFSNTHNSFIPSTEQVQIIQNGCSKLLKRLE 210
               F  +SH             + ++L  F N   S  P +   Q  Q    +L  +L+
Sbjct: 155 ----FPRDSHSAIDESFCNLAFDVFLKLDTFENPFAS--PESHSFQDTQLCFYQLADKLD 214

Query: 211 FSRDKAQAKLKRVRYFQHSSAGFLVAITASLTIIIVTHGIALFVAAPGFLVGAIKLA--- 270
               K+++   RVR   H++AG  + +  ++ ++  +     + A P  LV A  L    
Sbjct: 215 TRIRKSKS---RVRLLHHATAGSALCLVTAVVVVAASAAFIAYHALPTILVVAGPLCTPY 274

Query: 271 ---NRSRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELEHMRVMARFWLDKGED 330
              +  +K    + +LNV AKGT+ LN+D DTI RLV+RL   +++ +++ R  L++G D
Sbjct: 275 LPHSFKKKELSNIFQLNVAAKGTFALNKDLDTIDRLVSRLHTGVKNDKLLIRLGLERGRD 334

Query: 331 RHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVVKEIL 363
            +  I E ++QL ++HVN + QL+ L +H+   F  +N++R+L++KEIL
Sbjct: 335 VY-TIPEFVKQLRKSHVNHTHQLEVLVDHICRWFTNVNKSRSLLLKEIL 356

BLAST of Cla97C01G004740 vs. Swiss-Prot
Match: sp|Q6DYE5|U496K_ARATH (UPF0496 protein At1g20180 OS=Arabidopsis thaliana OX=3702 GN=At1g20180 PE=2 SV=2)

HSP 1 Score: 111.3 bits (277), Expect = 2.3e-23
Identity = 103/361 (28.53%), Positives = 167/361 (46.26%), Query Frame = 0

Query: 30  IDVREEYANAFRTESYIDFWTRV---LALNNGDDLITQLSVESTTSTRLSSYRLFVEHLL 89
           + V EEY  AFRT SY++  T+    L + +   L +           LS +  F ++LL
Sbjct: 34  LSVNEEYKEAFRTNSYLETRTKAEDQLGITSCSKL-SSXXXXXXXXXDLSFHSHFTDYLL 93

Query: 90  DPPQSTIKRILTSPHIGLDSYSLLLDYFSHTANASLLCSRLLK-----HIDHLRLKLHPL 149
           DPPQ T+  ++    +     +L++ +F  ++ A  +C  LL+      I+H ++K   +
Sbjct: 94  DPPQETLDALMQDSSLD----NLIVTFFDLSSEACDVCETLLQCLQQIKINHNKIK-RVM 153

Query: 150 EINLQSLENKEEFHHESHFKQLLI--RLVEFSNTHNSF--IPSTEQVQIIQNGCSKLLKR 209
           +I  +     +           LI   L  F+   N    I +  Q +I+ +  S LL +
Sbjct: 154 KIGKRVCNGAKTLECSPEMLCALIFQELSRFAALKNPLCRIVNEAQFRIVHDANSDLLTK 213

Query: 210 LEFSRDKAQAKLKRVRYFQHSSAGFLVAITASLTI----IIVTHGIALFVAAPGFL---- 269
           L   + + + K++  + F     G+ + IT S  +    II  H I    AAP  L    
Sbjct: 214 LTSKKRRIRRKIRFFK-FCKKLGGYSLVITHSAIVITLLIIALHSILGVFAAPALLGLCS 273

Query: 270 ---------VGAIKLANRSRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELEHM 329
                     G +  +N+   L K   ++++ AKG + L  D DT+ RL  RL  E+EH 
Sbjct: 274 FCLLRKKKAKGRMHKSNKDTTLEKLGTQIDIAAKGMFILINDLDTLSRLAGRLCDEIEHR 333

Query: 330 RVMARFWLDKGEDRHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVVKE 362
           + +A   +     +   + E +R+ N     FS QL ELEEHLYLCF TINR+R LV+ +
Sbjct: 334 KTVAA--MCAKSRKIEVLKEALREFNGHEEKFSDQLQELEEHLYLCFHTINRSRRLVLAQ 385

BLAST of Cla97C01G004740 vs. TAIR10
Match: AT3G49070.1 (Protein of unknown function (DUF677))

HSP 1 Score: 298.9 bits (764), Expect = 4.4e-81
Identity = 172/350 (49.14%), Positives = 219/350 (62.57%), Query Frame = 0

Query: 29  DIDVREEYANAFRTESYIDFWTRVLALNN-----GDDLITQLSVESTTSTRLSSYRLFVE 88
           D+DVREEYANAFRTESY  FWTRV+ L+                      RL SYRLF  
Sbjct: 68  DVDVREEYANAFRTESYNHFWTRVVQLSRXXXXXXXXXXXXXXXXXXXXXRLMSYRLFAH 127

Query: 89  HLLDPPQSTIKRILTSPHIGLDSYSLLLDYFSHTANASLLCSRLLKHIDHLRLKLHPLEI 148
           +LLDP  +TI RIL    +G  + +LL DYF  TANA LLC++LLK+I HLR K   L  
Sbjct: 128 NLLDPDLNTITRILDVSRVGRHTRTLLSDYFLETANAFLLCTQLLKNIHHLRSKYESL-- 187

Query: 149 NLQSLENKEEFHHESHFKQLLI-RLVEFSNTHNSFIPSTEQVQIIQNGCSKLLKRLEFSR 208
                  K +FH E+H    LI +  E S   + FI S  ++Q+I++GC  LLKRLE  R
Sbjct: 188 -------KPKFHSENHNSLALIDQFTEISKWFDPFISSGSRIQLIRSGCLYLLKRLESRR 247

Query: 209 DKAQAKLKRVRYFQHSSAGFLVAITASLTIIIVTHGIALFVAAPGFLVGAIKLANRSRKL 268
           DK +AKLK +    HSS   ++A+T +L + I +H  ALF+AAP  L    K A    KL
Sbjct: 248 DKTRAKLKLINGLTHSSGLLVLALTTTLIVTIASHAFALFLAAPTLLASQFKPAGLRNKL 307

Query: 269 AKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELEHMRVMARFWLDKGEDRHRAIDELM 328
            K  ARL+V AKGTY L+RD DTI RLV R++ E+ H+R MA FW+ +G  R R  +E+ 
Sbjct: 308 TKTAARLDVAAKGTYILSRDLDTISRLVTRINDEVNHVRAMAEFWVGRGSGRVRGSEEVA 367

Query: 329 RQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVVKEILDSGQPIKISY 373
           R+L +   +FS++LDELEEH+YLCFMTINRARNL+VKEILDS  P   S+
Sbjct: 368 RELKRCEESFSEELDELEEHIYLCFMTINRARNLLVKEILDSDDPPNCSF 408

BLAST of Cla97C01G004740 vs. TAIR10
Match: AT3G19250.1 (Protein of unknown function (DUF677))

HSP 1 Score: 112.8 bits (281), Expect = 4.4e-25
Identity = 96/349 (27.51%), Positives = 176/349 (50.43%), Query Frame = 0

Query: 31  DVREEYANAFRTESYIDFWTRVLALNNGDDLITQLSVESTTSTRLSSYRLFVEHLLDPPQ 90
           ++  E A+AF+T SY D  +R+L ++      TQ ++E           LF+   L P  
Sbjct: 35  NLSHELAHAFQTPSYHDIRSRLLVIDP-----TQENLE-----------LFLSQELRPNN 94

Query: 91  STIKRILTSPHIGLDSY-SLLLDYFSHTANASLLCSRLLKHIDHLRLKLHPLEINLQSLE 150
            +++  L+  H    +  +L+  YF H+ +A+  C  L +++   R  L+   ++L ++ 
Sbjct: 95  ESVQEALSLRHAKQTTLTNLVSTYFQHSEDATRFCLNLYQNVHSARCHLYTPLLDLFNI- 154

Query: 151 NKEEFHHESHFK----------QLLIRLVEFSNTHNSFIPSTEQVQIIQNGCSKLLKRLE 210
               F  +SH             + ++L  F N   S  P +   Q  Q    +L  +L+
Sbjct: 155 ----FPRDSHSAIDESFCNLAFDVFLKLDTFENPFAS--PESHSFQDTQLCFYQLADKLD 214

Query: 211 FSRDKAQAKLKRVRYFQHSSAGFLVAITASLTIIIVTHGIALFVAAPGFLVGAIKLA--- 270
               K+++   RVR   H++AG  + +  ++ ++  +     + A P  LV A  L    
Sbjct: 215 TRIRKSKS---RVRLLHHATAGSALCLVTAVVVVAASAAFIAYHALPTILVVAGPLCTPY 274

Query: 271 ---NRSRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELEHMRVMARFWLDKGED 330
              +  +K    + +LNV AKGT+ LN+D DTI RLV+RL   +++ +++ R  L++G D
Sbjct: 275 LPHSFKKKELSNIFQLNVAAKGTFALNKDLDTIDRLVSRLHTGVKNDKLLIRLGLERGRD 334

Query: 331 RHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVVKEIL 363
            +  I E ++QL ++HVN + QL+ L +H+   F  +N++R+L++KEIL
Sbjct: 335 VY-TIPEFVKQLRKSHVNHTHQLEVLVDHICRWFTNVNKSRSLLLKEIL 356

BLAST of Cla97C01G004740 vs. TAIR10
Match: AT1G20180.1 (Protein of unknown function (DUF677))

HSP 1 Score: 111.3 bits (277), Expect = 1.3e-24
Identity = 103/361 (28.53%), Positives = 167/361 (46.26%), Query Frame = 0

Query: 30  IDVREEYANAFRTESYIDFWTRV---LALNNGDDLITQLSVESTTSTRLSSYRLFVEHLL 89
           + V EEY  AFRT SY++  T+    L + +   L +           LS +  F ++LL
Sbjct: 34  LSVNEEYKEAFRTNSYLETRTKAEDQLGITSCSKL-SSXXXXXXXXXDLSFHSHFTDYLL 93

Query: 90  DPPQSTIKRILTSPHIGLDSYSLLLDYFSHTANASLLCSRLLK-----HIDHLRLKLHPL 149
           DPPQ T+  ++    +     +L++ +F  ++ A  +C  LL+      I+H ++K   +
Sbjct: 94  DPPQETLDALMQDSSLD----NLIVTFFDLSSEACDVCETLLQCLQQIKINHNKIK-RVM 153

Query: 150 EINLQSLENKEEFHHESHFKQLLI--RLVEFSNTHNSF--IPSTEQVQIIQNGCSKLLKR 209
           +I  +     +           LI   L  F+   N    I +  Q +I+ +  S LL +
Sbjct: 154 KIGKRVCNGAKTLECSPEMLCALIFQELSRFAALKNPLCRIVNEAQFRIVHDANSDLLTK 213

Query: 210 LEFSRDKAQAKLKRVRYFQHSSAGFLVAITASLTI----IIVTHGIALFVAAPGFL---- 269
           L   + + + K++  + F     G+ + IT S  +    II  H I    AAP  L    
Sbjct: 214 LTSKKRRIRRKIRFFK-FCKKLGGYSLVITHSAIVITLLIIALHSILGVFAAPALLGLCS 273

Query: 270 ---------VGAIKLANRSRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELEHM 329
                     G +  +N+   L K   ++++ AKG + L  D DT+ RL  RL  E+EH 
Sbjct: 274 FCLLRKKKAKGRMHKSNKDTTLEKLGTQIDIAAKGMFILINDLDTLSRLAGRLCDEIEHR 333

Query: 330 RVMARFWLDKGEDRHRAIDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVVKE 362
           + +A   +     +   + E +R+ N     FS QL ELEEHLYLCF TINR+R LV+ +
Sbjct: 334 KTVAA--MCAKSRKIEVLKEALREFNGHEEKFSDQLQELEEHLYLCFHTINRSRRLVLAQ 385

BLAST of Cla97C01G004740 vs. TAIR10
Match: AT3G19330.1 (Protein of unknown function (DUF677))

HSP 1 Score: 103.6 bits (257), Expect = 2.7e-22
Identity = 93/344 (27.03%), Positives = 173/344 (50.29%), Query Frame = 0

Query: 31  DVREEYANAFRTESYIDFWTRVLALNNGDDLITQLSVESTTSTRLSSYRLFVEHLLDPPQ 90
           ++  E A+AF+T SY D  +RV  + +    +TQ+              L +  +L P +
Sbjct: 46  NLSRELAHAFQTPSYHDVRSRVHVVVD----LTQIHHRLIQ----PDIELLLSQVLQPNK 105

Query: 91  STIKRILTSPHIGLDSY-SLLLDYFSHTANASLLCSRLLKHIDHLRLKLHPLEINL---- 150
             ++  +   H+   +  +L+  YF H+ +A+ LC  L +++   R  L+   ++L    
Sbjct: 106 ECVQEAIR--HVKQTTLTNLVSTYFQHSEDATRLCLNLYQNVHSARHHLYTPLLDLFNIF 165

Query: 151 --QSLENKEEFHHESHFKQLLIRLVEFSNTHNSFIPSTEQVQIIQNGCSKLLKRLEFSRD 210
              SL   +E   +  F  + ++L  F N  +S  P +   +  Q   S+L   L+    
Sbjct: 166 PGDSLPAIDESLCDLAF-DVFLKLDTFENPFSS--PESYSFRDTQLCFSQLKHNLDRRLR 225

Query: 211 KAQAKLKRVRYFQHSSAGFLVAITASLT------IIIVTHGIALFVAAPGFLVGAIKLAN 270
           K+++   RVR   H++AG  + + A++         I +H + + +   G L       +
Sbjct: 226 KSRS---RVRLIHHATAGSSLCLVAAVVXXXXXXXXIASHALPILLVVAGPLCSPYLPHS 285

Query: 271 RSRKLAKKVARLNVTAKGTYTLNRDFDTIGRLVARLSHELEHMRVMARFWLDKGEDRHRA 330
             RK    + +LN  +KGT+ LN+D DTI RLV+RL   +E+ + + R  L++G D H +
Sbjct: 286 FKRKELTNICQLNAASKGTFVLNKDLDTIDRLVSRLHTGIEYDKFLIRLGLERGRDVH-S 345

Query: 331 IDELMRQLNQTHVNFSQQLDELEEHLYLCFMTINRARNLVVKEI 362
           I E+++ L ++H+  + QL +LE+H+ L F  +N+AR+L++ EI
Sbjct: 346 IQEILKLLRKSHLPLTHQLKDLEDHICLWFTNVNKARSLLLTEI 372

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_008436996.1	9.5e-159	80.16	PREDICTED: UPF0496 protein At3g49070 isoform X1 [Cucumis melo]
XP_004143745.2	1.2e-148	79.22	PREDICTED: UPF0496 protein At3g49070, partial [Cucumis sativus]
KGN50304.1	5.8e-148	79.17	hypothetical protein Csa_5G166480 [Cucumis sativus]
XP_022159588.1	5.4e-146	73.14	UPF0496 protein At3g49070 [Momordica charantia]
XP_023534108.1	3.6e-105	62.70	UPF0496 protein At3g49070-like [Cucurbita pepo subsp. pepo]
Match Name	E-value	Identity	Description
tr|A0A1S3ATJ2|A0A1S3ATJ2_CUCME	6.3e-159	80.16	UPF0496 protein At3g49070 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103482559 PE=...
tr|A0A0A0KN20|A0A0A0KN20_CUCSA	3.8e-148	79.17	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G166480 PE=4 SV=1
tr|A0A061G259|A0A061G259_THECC	5.4e-102	57.77	Uncharacterized protein OS=Theobroma cacao OX=3641 GN=TCM_015222 PE=4 SV=1
tr|A0A1R3HN72|A0A1R3HN72_9ROSI	1.0e-100	56.91	Uncharacterized protein OS=Corchorus olitorius OX=93759 GN=COLO4_28024 PE=4 SV=1
tr|V4S4G2|V4S4G2_9ROSI	9.3e-94	53.95	Uncharacterized protein OS=Citrus clementina OX=85681 GN=CICLE_v10006929mg PE=4 ...
Match Name	E-value	Identity	Description
sp|Q9SMU4|U496N_ARATH	7.9e-80	49.14	UPF0496 protein At3g49070 OS=Arabidopsis thaliana OX=3702 GN=At3g49070 PE=2 SV=1
sp|A2XCJ1|U496C_ORYSI	2.6e-46	38.73	UPF0496 protein 3 OS=Oryza sativa subsp. indica OX=39946 GN=OsI_009784 PE=3 SV=2
sp|Q10RR9|U496C_ORYSJ	9.7e-46	38.44	UPF0496 protein 3 OS=Oryza sativa subsp. japonica OX=39947 GN=Os03g0148000 PE=2 ...
sp|Q9LJK4|U496L_ARATH	8.0e-24	27.51	UPF0496 protein At3g19250 OS=Arabidopsis thaliana OX=3702 GN=At3g19250 PE=2 SV=1
sp|Q6DYE5|U496K_ARATH	2.3e-23	28.53	UPF0496 protein At1g20180 OS=Arabidopsis thaliana OX=3702 GN=At1g20180 PE=2 SV=2
Match Name	E-value	Identity	Description
AT3G49070.1	4.4e-81	49.14	Protein of unknown function (DUF677)
AT3G19250.1	4.4e-25	27.51	Protein of unknown function (DUF677)
AT1G20180.1	1.3e-24	28.53	Protein of unknown function (DUF677)
AT3G19330.1	2.7e-22	27.03	Protein of unknown function (DUF677)
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR007749	DUF677
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
cellular_component	GO:0016021	integral component of membrane
cellular_component	GO:0016020	membrane
cellular_component	GO:0005575	cellular_component
molecular_function	GO:0003674	molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Cla97C01G004740.1	Cla97C01G004740.1	mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 322..342
None	No IPR available	COILS	Coil	Coil	coord: 131..151
None	No IPR available	PANTHER	PTHR31113:SF6	SUBFAMILY NOT NAMED	coord: 4..368
IPR007749	Protein of unknown function DUF677	PFAM	PF05055	DUF677	coord: 61..361; e-value: 2.5E-17; score: 62.7
IPR007749	Protein of unknown function DUF677	PANTHER	PTHR31113	FAMILY NOT NAMED	coord: 4..368

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
Gene	Organism	Block
Cla97C01G004740	Wax gourd	wgowmbB059