Cla97C01G005550 (gene) Watermelon (97103) v2

Name: Cla97C01G005550
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: DUF4378 domain protein
Location: Cla97Chr01 : 5324943 .. 5329459 (+)
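
As a small illustration of how the location field can be read programmatically, the sketch below parses a location value of the form shown above and computes the length of the genomic span. The function names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive; the record itself does not state the convention.

```python
import re

def parse_location(text):
    """Parse 'SeqID : start .. end (strand)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

seqid, start, end, strand = parse_location("Cla97Chr01 : 5324943 .. 5329459 (+)")
print(seqid, strand, span_length(start, end))  # Cla97Chr01 + 4517
```

Under that assumption the gene spans 4517 bp on the forward strand of Cla97Chr01.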
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAAAGGTAGAATGGCACTTTGGAGGAAGATCATCTTCTCGTCGAGCCACCACCGCCGATCACCCCCGGCAACGCTACCGTCCTTCTCTTCCGAGCTGTATGAGCACCCTTTTTCACTTCTTTGATTTTCGTTCCTCTCGTTTTACTCACATTGTCTTCGATAATCACCACCCGTCCTCCTTCAACCTCCCGCATCATCGCCCTCCCCTACCCAAAGCTTCCCATCATGGTATATACACTTTCAAAATTTTAATTTTCATTTTTTTAAAGAAAGATATTTGGTCAATTTGATCCTCTATGAATGCTTGTTATGCGTGCATGAGTAGATTAGAATATGATTCCATGTGATATATTAGATAAAGTTTTGATAATGTAATAATGTTTGATATTAAAAAAGGGGTCATATTATTACTTAATATTTTTATGGTTAATTGATCTTTAATTATATATATATAAAAGGGCCTATTTTCAAACTATAAAGAAATAAGAAAAAAAGGGAGATTTTTATTTATTTAATTTTTAATTTTTACTTACATTCTCTTTTGGGAAATTCTCAGAAAACTTTGGTACCATCTTTTTTTGTTTTTCAGTTGTTGAAGCACCAAGGAATAGCTTGGAATTAGAGGGAGCTTCAATTTCTTGCTTAAGAAATAAAGAAAAAAATTTGCAACTTCAAGTAAACTTTTCCTTCTCCTTTTTTCATCAAGTCTATTCAAAATGATAGTGTAATTATGCTCAAGAAATCATTTTTTTTTAATGTAAATTTAAAAATTTACTAACATATAATATATATCTTAAAACAGATGGGACTTCAAATCAAAACTAGAAATGGTAGCACAAAATCAAAAGCAAGTGAGCAACAACTTCCAAATAATGATCACATTATTGCATTGGAATCTCCAAGTGGAAAGACACCAAATCTCTTGGCAAGATTAATGGGTCTTGATATTCTCCCTCAAAACCCCTCTTCTTCTTTCAATTGTCGCGGGTCAAATTTCGGAACCCGTTCTCTCCCAGAGAGCCCGAGAGTATCGTCAGCAAGACTATCAGACGTCGATTGTCATCATCGTCGCCTCTCACTCCAAATTATTTCAGACAAAGAAAATATAAATTTTTTTGAAGAGGCTAAGCGAGAAAAGGAAAAAGTGAGCAAGAAAGTTGCCCTCGTTGATATCACCAATAATAACAGAAAAATAGAGTTCGGAAAACAAGAAGTTGGTTTTAGTCAAATTAAAGTCGAGATCAAGTCCTCTAAGAAACTTAAGAAAACAGCCGTTGACGAATCAAGACGTAGTTCGAAAGTCGTGCGTAAAAATCAAGAGGTAATGATTTCAAAGAAGCAAAAGCTAATATCGATGTCGATGCAAAAACCAAAGCGGAGGGCTAGAGAAGGTGAAGCATTTGATTGTCCAACAAGTAATAACCTTGTTAACAACCTCCATCATTCAACCATTTTTCCAGCAAAGAAAGAGCCTTCTCCTCCGGCGATCCAAGCCCCTCGTGAACAGGTACTCAACATATTACTTTTCATGTCCTTTCAATAAATATATATTAAAACAATTATTAAAAAATAAATAAACTTTCCTTTACTTTTTTACCTGTGATTTGTTTCTAATGACCATAAAATATTATCGAAATAAACATGTATGAAACTAGATAATGTAAATAATTAAATACAAATAAACTTTACCCAAGTCAGTCATGTGAATCAATTCGAAAAAATGGACATCCTCCCAGAGAGAACTCAGGTAAGAAATTCAATAGTACAATAAATAAAACAATATATATTTAAATATATCATCATTTTTATTATTCTTATAATTCTCTTAAAAAATAACATAATCTTCCTCAAAGTTTTTATTTTATTTTATTTTTTTGGGTTCAAAATAAGTAGCAAGAATTTCAACCTGTCAAATTTAATAAAAAATATTAATATTAAATATAAGTTATTTTTAAATATAGTAAAATGAATTGACTTATTTACAAATATAGTAAAA
TATTATTGTCTATCATTGATAGACACTAATATACAATGTAACTTGATTTTACTCGCTACTTAGTAACATTCGTTTTGGCTACAAAAAGTTTCTAGTTAAATTATATTATTCTATTAAATAAAAATTTATAGTTATTTAATAAATTTTGTATTTTTTTCTTTTCAAATATTAGTTGTATGATGATGATAGATGCAATAACATGGTTTGGTAAATTGTCACGACATTACTTTTAAAAATGTCGTAGGTTGAAGTCTATGATCTTCATATTTGTGATGTAATTAGAAAGTATTAGAATTTTTTTTTTTTAAATATCTTTTCAAGAATAAAGTTTATTATTATTTTCTTGTCAACTTTTCTTAAAAACATTTTTGAAATATATAGCCCGATTTTCAGAACTACATTTTATAATTTTTCTTTTAAAATCACATATAGCTAAAGTACTTTTGACATGTTTCTATAAGTAGATTATAATATAAATTATTGCTAATAAACAATTGTTTTTTTTAATTGAAGACCAAACAATAATTATGACAGTTTTTATTTTTCATTTTTATTTATTTTTGTCCTTTAAGTTTACTTTTCTTTTAATATTAGTAGTAATTTCTACTTGAAAGATCACACAAATAAATCTTGAAAATTCTAACCATCAAATATTTTAAAACATAACTGGAAAAAAAAAAAAGACACAAGTACTACTGATACAAAAAGTTGAAACATTGTTTTAATTTATAATAGTGAACAGGACATGTAAGATTTTCTGCAAAAGAATAAATTGATTATTTATGTTGGTCAAGAGGTTCAAATTTTTCTTGTGAAAAATAGGTCAATAACTTTATTGAAGAATTAAATGCAACATTTTATTTTTTCTGCATAAGATTTTTAAGAGAAAAATAAATTTAAGCCTAAAAGTTATTTATCAAGGTTTTTTTTTTTTGAATAATTGGTTTACTTCAAATTAAATCTCATGTTTATTCAATTATCATTTGTCTCGCTTTGTTAAATTATTAAATAAATTCAAAATGAAGAGTAATTTAAATAAATTATTTGAATTTAGGCAATTAAATCGAGTCAAATATTTCACCTAAAATTATTTTTCATTAGATAACATGATTTTATTATTATTGTTTTTATTATCATTTATGTGATGATAATTTTAACTATATTCTTGTTTCATGCTAAATAATTGCTAGTGAATGCTAATATTAAACCAAAGAGTTGGTAATAAGTTTGCTTCAACTTTTGGTATTCAAGTCCATGCACTTTAAGAACCATTTTAAAATATAAAGAACTCTATCAAAGATTTAAAACTTGGCTTCAATGGTATTTAACCAGGAACAAAACACCTATATAACAGACTGCTTTTGTTTTTGTTTTTTTTCTTTTTCCTTACAATTTCCACAGTTTTCATATTTCTTAACTAAACATTTAAAACATTATCAAACTTTTTTTTTTTTTATAAGCTCAATTTCAAAAACTAAAAATCAAAAATTGCCATAATTTGATAATCATTTATTATTATTATTTTTTTAAAATTATGTTTGTTTTCTCACATTTTTTTCTCATGGTTTTCATATTTAGACATATTAGAATTCTTAGCTAATTTCAAAAAACAAAAACTTCTTTTTGAAAACTATTTCTTTATACATATACAAAATTTAGCTTTGATTTTGAAAACTCCTAAAACGTAGTCAACAAAACATAGAAAACAATAGATGGAAGTGATGTTTATAGGTTTAATTTTTAAAACATACATTTTAGTTCTACATTTCTTACTTTGTTATTTACTTTCTACTCATGTTTTCAAAATCTAAGCCAATTTTAAAAACTGAAAAGTAATTTTAAAAACTTATTTTTGTTTTTAAAATTTAACTAAGAATTATTGAACTAAGAAACAAACGTAAAACCCATGATAAAAAATTCCAAAAACCAAAAACAAAAAATGAAATCACTATCAAACCAACCCATTAAAATTCTTTATTCCAACAATCTCACAAATTGTT
TCGAAATCAGCCGTGTAGGTACTCAAAGGGCAAGGCGAAGCCGGCGGGCAGAGACGGCGGAGAAAGAAACGCCGTTGACAAGACCACCACCACAGACGGCGGATCAACGGAGTTCGAATACATCAAAAGAATACTAACCAACCACGGCAATTCAAACTCGATCATCTCACCCCCCAATAACCCGACGAACCCCTCAATCTTCCACCACCCAGAAGCGGCGGAGGACCAGCAATGGGGCAGACGACTACTAAACTGTTGGCACGTGCGAAGAGGAATGAAGGGATGGGAATTGGGTGAAGAAGCTGTGAGGGAGAGATCGGTGAAGAAAGAGTACTTCCCACGTGCGAAATATGAAGTAGTGGAAGATATGGATGCTTTAATAATCAACAAGAGAGTGGTGGAGGAAACAGAAAGGATTGTGAAGGTGGTTGAGCTTCACATTTTAGACTCCCTTTTACGAGAAACTGTTGCCCTAATTTCCTCCCTACCAAAATGCTCTCATTTTCCTAATTTCTAA

mRNA sequence

ATGGGAAAGGTAGAATGGCACTTTGGAGGAAGATCATCTTCTCGTCGAGCCACCACCGCCGATCACCCCCGGCAACGCTACCGTCCTTCTCTTCCGAGCTGTATGAGCACCCTTTTTCACTTCTTTGATTTTCGTTCCTCTCGTTTTACTCACATTGTCTTCGATAATCACCACCCGTCCTCCTTCAACCTCCCGCATCATCGCCCTCCCCTACCCAAAGCTTCCCATCATGTTGTTGAAGCACCAAGGAATAGCTTGGAATTAGAGGGAGCTTCAATTTCTTGCTTAAGAAATAAAGAAAAAAATTTGCAACTTCAAATGGGACTTCAAATCAAAACTAGAAATGGTAGCACAAAATCAAAAGCAAGTGAGCAACAACTTCCAAATAATGATCACATTATTGCATTGGAATCTCCAAGTGGAAAGACACCAAATCTCTTGGCAAGATTAATGGGTCTTGATATTCTCCCTCAAAACCCCTCTTCTTCTTTCAATTGTCGCGGGTCAAATTTCGGAACCCGTTCTCTCCCAGAGAGCCCGAGAGTATCGTCAGCAAGACTATCAGACGTCGATTGTCATCATCGTCGCCTCTCACTCCAAATTATTTCAGACAAAGAAAATATAAATTTTTTTGAAGAGGCTAAGCGAGAAAAGGAAAAAGTGAGCAAGAAAGTTGCCCTCGTTGATATCACCAATAATAACAGAAAAATAGAGTTCGGAAAACAAGAAGTTGGTTTTAGTCAAATTAAAGTCGAGATCAAGTCCTCTAAGAAACTTAAGAAAACAGCCGTTGACGAATCAAGACGTAGTTCGAAAGTCGTGCGTAAAAATCAAGAGGTAATGATTTCAAAGAAGCAAAAGCTAATATCGATGTCGATGCAAAAACCAAAGCGGAGGGCTAGAGAAGGTGAAGCATTTGATTGTCCAACAAGTAATAACCTTGTTAACAACCTCCATCATTCAACCATTTTTCCAGCAAAGAAAGAGCCTTCTCCTCCGGCGATCCAAGCCCCTCGTGAACAGCCGTGTAGGTACTCAAAGGGCAAGGCGAAGCCGGCGGGCAGAGACGGCGGAGAAAGAAACGCCGTTGACAAGACCACCACCACAGACGGCGGATCAACGGAGTTCGAATACATCAAAAGAATACTAACCAACCACGGCAATTCAAACTCGATCATCTCACCCCCCAATAACCCGACGAACCCCTCAATCTTCCACCACCCAGAAGCGGCGGAGGACCAGCAATGGGGCAGACGACTACTAAACTGTTGGCACGTGCGAAGAGGAATGAAGGGATGGGAATTGGGTGAAGAAGCTGTGAGGGAGAGATCGGTGAAGAAAGAGTACTTCCCACGTGCGAAATATGAAGTAGTGGAAGATATGGATGCTTTAATAATCAACAAGAGAGTGGTGGAGGAAACAGAAAGGATTGTGAAGGTGGTTGAGCTTCACATTTTAGACTCCCTTTTACGAGAAACTGTTGCCCTAATTTCCTCCCTACCAAAATGCTCTCATTTTCCTAATTTCTAA

Coding sequence (CDS)

ATGGGAAAGGTAGAATGGCACTTTGGAGGAAGATCATCTTCTCGTCGAGCCACCACCGCCGATCACCCCCGGCAACGCTACCGTCCTTCTCTTCCGAGCTGTATGAGCACCCTTTTTCACTTCTTTGATTTTCGTTCCTCTCGTTTTACTCACATTGTCTTCGATAATCACCACCCGTCCTCCTTCAACCTCCCGCATCATCGCCCTCCCCTACCCAAAGCTTCCCATCATGTTGTTGAAGCACCAAGGAATAGCTTGGAATTAGAGGGAGCTTCAATTTCTTGCTTAAGAAATAAAGAAAAAAATTTGCAACTTCAAATGGGACTTCAAATCAAAACTAGAAATGGTAGCACAAAATCAAAAGCAAGTGAGCAACAACTTCCAAATAATGATCACATTATTGCATTGGAATCTCCAAGTGGAAAGACACCAAATCTCTTGGCAAGATTAATGGGTCTTGATATTCTCCCTCAAAACCCCTCTTCTTCTTTCAATTGTCGCGGGTCAAATTTCGGAACCCGTTCTCTCCCAGAGAGCCCGAGAGTATCGTCAGCAAGACTATCAGACGTCGATTGTCATCATCGTCGCCTCTCACTCCAAATTATTTCAGACAAAGAAAATATAAATTTTTTTGAAGAGGCTAAGCGAGAAAAGGAAAAAGTGAGCAAGAAAGTTGCCCTCGTTGATATCACCAATAATAACAGAAAAATAGAGTTCGGAAAACAAGAAGTTGGTTTTAGTCAAATTAAAGTCGAGATCAAGTCCTCTAAGAAACTTAAGAAAACAGCCGTTGACGAATCAAGACGTAGTTCGAAAGTCGTGCGTAAAAATCAAGAGGTAATGATTTCAAAGAAGCAAAAGCTAATATCGATGTCGATGCAAAAACCAAAGCGGAGGGCTAGAGAAGGTGAAGCATTTGATTGTCCAACAAGTAATAACCTTGTTAACAACCTCCATCATTCAACCATTTTTCCAGCAAAGAAAGAGCCTTCTCCTCCGGCGATCCAAGCCCCTCGTGAACAGCCGTGTAGGTACTCAAAGGGCAAGGCGAAGCCGGCGGGCAGAGACGGCGGAGAAAGAAACGCCGTTGACAAGACCACCACCACAGACGGCGGATCAACGGAGTTCGAATACATCAAAAGAATACTAACCAACCACGGCAATTCAAACTCGATCATCTCACCCCCCAATAACCCGACGAACCCCTCAATCTTCCACCACCCAGAAGCGGCGGAGGACCAGCAATGGGGCAGACGACTACTAAACTGTTGGCACGTGCGAAGAGGAATGAAGGGATGGGAATTGGGTGAAGAAGCTGTGAGGGAGAGATCGGTGAAGAAAGAGTACTTCCCACGTGCGAAATATGAAGTAGTGGAAGATATGGATGCTTTAATAATCAACAAGAGAGTGGTGGAGGAAACAGAAAGGATTGTGAAGGTGGTTGAGCTTCACATTTTAGACTCCCTTTTACGAGAAACTGTTGCCCTAATTTCCTCCCTACCAAAATGCTCTCATTTTCCTAATTTCTAA

Protein sequence

MGKVEWHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQEVGFSQIKVEIKSSKKLKKTAVDESRRSSKVVRKNQEVMISKKQKLISMSMQKPKRRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQPCRYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRILTNHGNSNSIISPPNNPTNPSIFHHPEAAEDQQWGRRLLNCWHVRRGMKGWELGEEAVRERSVKKEYFPRAKYEVVEDMDALIINKRVVEETERIVKVVELHILDSLLRETVALISSLPKCSHFPNF
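
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch that assumes the standard genetic code and uses only the first and last few codons of the CDS as input; the `translate` helper is a hypothetical name, not part of any database tooling.

```python
# Standard genetic code, laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases are reported as 'X'.
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 33 nucleotides of the CDS above.
print(translate("ATGGGAAAGGTAGAATGGCACTTTGGAGGAAGA"))  # MGKVEWHFGGR
# Last 12 nucleotides of the CDS above, ending in the TAA stop codon.
print(translate("CCTAATTTCTAA"))  # PNF
```

Both fragments agree with the start (MGKVEWHFGGR) and end (PNF) of the listed protein sequence.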
BLAST of Cla97C01G005550 vs. NCBI nr
Match: XP_011656164.1 (PREDICTED: uncharacterized protein LOC105435648 [Cucumis sativus] >KGN50399.1 hypothetical protein Csa_5G172790 [Cucumis sativus])

HSP 1 Score: 421.0 bits (1081), Expect = 5.7e-114
Identity = 302/539 (56.03%), Positives = 341/539 (63.27%), Query Frame = 0

Query: 1   MGKVEW--HFGGRSSSRRATTADHPR-QRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNH 60
           MGK EW             TT D    QR   SLPSCMSTLFH FDFRSS FTHIVFDNH
Sbjct: 1   MGKSEWXXXXXXXXXXXXVTTVDIDHFQRRDHSLPSCMSTLFHLFDFRSSHFTHIVFDNH 60

Query: 61  HPSSFNLPHHRPPL--PKASHHVVEAPRNSLELE-GASISCLRNKEKNLQLQMGLQIKTR 120
             SSF+L HH P L   KASHH VEAPRNSLEL+ G SISCLRNKE+NLQLQMGLQIKTR
Sbjct: 61  RSSSFDLSHHHPTLRPTKASHHGVEAPRNSLELDNGDSISCLRNKEENLQLQMGLQIKTR 120

Query: 121 NGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNP-SSSFNCRGSNFGT 180
           NGSTKSKA+EQQLPNND+IIALESPS  TPNLLARLMGLD  PQ   SSS+N    N GT
Sbjct: 121 NGSTKSKATEQQLPNNDNIIALESPSTNTPNLLARLMGLDNFPQTTFSSSYNHCMPNLGT 180

Query: 181 RSLPESPRVSSARLSDVDCHHRRLSLQI-ISDKEN--INFFEE-AKREKEKVSK-KVALV 240
           RSL ESPR S +RLSDVD HHRRLSLQI I +KEN  I   EE +KREK+KV + KVAL+
Sbjct: 181 RSLSESPRNSLSRLSDVDYHHRRLSLQINIQEKENNKIKICEEISKREKKKVERPKVALI 240

Query: 241 DITNNNRKIEFGKQEVGFSQI-KVEIKSSKKLKKTAVDESRRSSKVVRKNQE-VMISKKQ 300
           DITN+  K+    QE+G SQ  KVE+KS KKLK            V R NQ+ V++S KQ
Sbjct: 241 DITNSYNKVRSKIQEIGSSQSRKVEMKSLKKLKXXXXXXXXXXXVVCRSNQKNVIVSNKQ 300

Query: 301 KLISMSMQKPK-RRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQPCRY 360
           K ISMSMQ PK RRAREGEA DCP SN L + L HSTIF                QPC Y
Sbjct: 301 KSISMSMQIPKERRAREGEALDCPRSNKL-DLLDHSTIF----------------QPCSY 360

Query: 361 SKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRI-LTNHGNSNSIISPPNNPTNPSI 420
            KGKAK A   GGE NAVD  TTTDGGS EF+YIK I +++  NSN ++ P       S 
Sbjct: 361 PKGKAKAA---GGETNAVDTATTTDGGSAEFKYIKTIQISSKENSNWVVVP------ASR 420

Query: 421 FHHPEAAEDQQWGRRL---------------LNCWHVRRGMK-GWELGEEAVRERSVKKE 480
           F+H  A E+++W +R+                  W  +RG K GWE              
Sbjct: 421 FYHSVAGEERRWKKRVELQQAVVGGDQIPNNKGWWQKQRGRKRGWE-------------- 480

Query: 481 YFPRAKYEVVEDMDALIINKR--------VVEETERIVKVVELHILDSLLRE-TVALIS 499
            FP  K+E+VE     +INK         + EE E IVK+VELHILDSLLRE T +LIS
Sbjct: 481 -FPHVKFELVE---YALINKDLEKSKFIIMAEEREGIVKLVELHILDSLLRELTHSLIS 495
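
The percentages in each HSP header are the listed fractions expressed over the alignment length. A minimal sketch of that arithmetic, assuming ordinary rounding to two decimals (the `percent` helper is a hypothetical name):

```python
def percent(matches, aligned_length):
    """Return matches / aligned_length as a percentage rounded to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Fractions from the Cucumis sativus HSP header above.
print(percent(302, 539))  # 56.03 (identical residues)
print(percent(341, 539))  # 63.27 (identical or conservatively substituted residues)
```

The denominator is the alignment length including gaps (539 columns here), not the length of the 508-residue query protein.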

BLAST of Cla97C01G005550 vs. NCBI nr
Match: XP_022958521.1 (uncharacterized protein LOC111459727 [Cucurbita moschata])

HSP 1 Score: 191.8 bits (486), Expect = 5.6e-45
Identity = 172/460 (37.39%), Positives = 209/460 (45.43%), Query Frame = 0

Query: 67  HRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQ 126
           H P LP      V APRNSLE  G        +E+N Q+QMGL+I T             
Sbjct: 25  HHPSLPSC----VVAPRNSLEQLG--------QEQNEQIQMGLEINT------------- 84

Query: 127 LPNNDHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCRGSNFGTRSLPESPRVSSAR 186
             N DH  AL+SPS KTPNLLARLMGLDILPQ  +S          TRSLP SPRVSS R
Sbjct: 85  --NFDH-NALDSPSVKTPNLLARLMGLDILPQTTTSP-------SATRSLPNSPRVSSLR 144

Query: 187 LSDVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQEVGF 246
           LSDVD HH R SL I  D EN    +E K+E+E+V +KVALVDITNNN K+ +GK     
Sbjct: 145 LSDVDRHHHRHSLDINLDIENSQICKEMKQEEEQVRRKVALVDITNNNNKLVYGK----- 204

Query: 247 SQIKVEIKSSKKLKKTAVDESRRSSKVVRKNQEVMISKKQKLISMSMQKPKRRAREGEAF 306
                                        KNQ+V + +K   I                 
Sbjct: 205 ----------------------------LKNQDVTMFRKHNSI----------------X 264

Query: 307 DCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQPCRYSKGKAKPAGRDGGERNAVDKT 366
                                        +   EQ CR+  GK +PA  + G R      
Sbjct: 265 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXKVRHEQRCRFPNGKQRPAAEEVGRR------ 324

Query: 367 TTTDGGSTEFEYIKRILTNHGNSNSIISPPNNPTNPSIFHHPEAA--------------- 426
            T DGG+ E +YIKRILT+    +     P NP NPSIFHH E +               
Sbjct: 325 ATADGGAGELKYIKRILTSPNWFS-----PTNPLNPSIFHHLETSNAAVGEPRLERWNKD 374

Query: 427 -EDQQWGRRLLNCWHVRRGMKGWELGEEAVRERSVKKEYFPRAKYEVVEDMDALI---IN 486
            +D+  G  ++NC      MKGWEL                RAK  V++D+D+LI   + 
Sbjct: 385 DDDEVLGEMVMNCRTRMMMMKGWELA---------------RAKCHVLKDIDSLIDKDLG 374

Query: 487 K-RVVEETERIVKVVELHILDSLLRETVALISSLPKCSHF 507
           K + V E E +V+  E HILDSLLRET A I SL K   F
Sbjct: 445 KWKKVLELEGVVRTFEFHILDSLLRETTATIMSLHKRCRF 374

BLAST of Cla97C01G005550 vs. NCBI nr
Match: XP_015878027.1 (uncharacterized protein LOC107414431 [Ziziphus jujuba])

HSP 1 Score: 124.0 bits (310), Expect = 1.4e-24
Identity = 100/244 (40.98%), Positives = 130/244 (53.28%), Query Frame = 0

Query: 1   MGKVEWHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRF-THIVFDNHHP 60
           MGK EWH+GGRSS R         +    S   CMS +F FFDF   +F  H    + +P
Sbjct: 1   MGK-EWHWGGRSSKRGGVGGGAADRDATSS--GCMSAVFQFFDFHQFQFPLHHQQPSFNP 60

Query: 61  SSFNLPHHRPPLPKASHHVVEAPRNSLELEGASISCLRNKEKNLQLQMGLQIKTRNGSTK 120
           +SF  P   P  PK     VEAPRNSL+ + +S+S +  +++NL + MG+QIKT    TK
Sbjct: 61  TSF--PLEDPTSPKG----VEAPRNSLKSDDSSLSSIIEEKENLNIPMGIQIKTY--GTK 120

Query: 121 SKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNP-SSSFNCR----------- 180
            +AS     +        SP  KTPNL+ARLMGLD+LP N  S +F+             
Sbjct: 121 PRASNDLCSDTS-----SSPGAKTPNLVARLMGLDLLPDNSHSPTFHATTPNPLSKXXXX 180

Query: 181 ------------------GSNFGTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKENIN 214
                              SN G+RSLPE+PR+SSAR SDVD  H RLSLQI  +KEN++
Sbjct: 181 XXXXRQPLHSKSRQSMEGDSNAGSRSLPETPRISSARRSDVD--HHRLSLQI--NKENVS 224

BLAST of Cla97C01G005550 vs. NCBI nr
Match: PQM34399.1 (uncharacterized protein Pyn_07556 [Prunus yedoensis var. nudiflora])

HSP 1 Score: 122.5 bits (306), Expect = 4.2e-24
Identity = 115/321 (35.83%), Positives = 153/321 (47.66%), Query Frame = 0

Query: 1   MGKVEWHFG-GRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHP 60
           MG  EW++G GR S R       P ++   +   CM  +F  FDF   +  ++    HHP
Sbjct: 1   MGMREWYWGSGRISKRGRGXXXXPAEKDMTTSSGCMCAVFQLFDFHQLQLANL----HHP 60

Query: 61  S--SFNLPHHRP-PLPKASHHVVEAPRNSLE-LEGASISCLRNKEKN-----LQLQMGLQ 120
              SFN  H     +PK     VEAPRNSL+  EG S+S    +E+N     L+ QMG+Q
Sbjct: 61  QQPSFNTFHEDDLTVPKG----VEAPRNSLDSSEGTSLSSTTKEEENLNSPILKFQMGMQ 120

Query: 121 IKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILP---QNPSSSFNCR 180
           IKT  G     +S       D    + SP  KTPNL+ARLMGLD+LP   ++PSS+ +C 
Sbjct: 121 IKTSGGGGGRTSSA------DFSSDISSPGTKTPNLVARLMGLDLLPDQIRSPSSTSSCS 180

Query: 181 GSN----------------------------------FGTRSLPESPRVSSARLSDVDCH 240
            +                                    GTRSLPE+PR+SSAR SDVD H
Sbjct: 181 STTTHATSKSKVRTRKALQSRPRRHVVDMSDATNTAAAGTRSLPETPRISSARRSDVDLH 240

Query: 241 HRRLSLQ--------------IISDKENIN-----FFEEAKREKEKVSKKVALVDITNNN 256
           H RLSLQ              I +D EN+        +  K+ +E VS+KV L DITN  
Sbjct: 241 H-RLSLQINKENVGVGAVRNWIFADHENVKSPSHYARQIVKQVRESVSRKVGL-DITNTT 300

BLAST of Cla97C01G005550 vs. NCBI nr
Match: XP_023892686.1 (uncharacterized protein LOC112004681 [Quercus suber] >POF21508.1 protein longifolia 2 [Quercus suber])

HSP 1 Score: 120.2 bits (300), Expect = 2.1e-23
Identity = 133/383 (34.73%), Positives = 182/383 (47.52%), Query Frame = 0

Query: 1   MGKVEWHFGGRSSSRRATT-ADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHP 60
           MG+ E H+GGRSS+ ++ T      Q+   S   CMS +F FFDF   +F      +H  
Sbjct: 1   MGR-EKHWGGRSSNSKSNTGVAGGDQKDTSSFSGCMSAVFQFFDFHQFQFGL----HHQQ 60

Query: 61  SSFNLPHHRPPLPKASHHVVEAPRNSLELEGASISCL---RNKEKNLQLQMGLQIKTRNG 120
            SFN     P +PK     +EAPRNSLE    S+S +   R +EKNL ++MG+QIKT +G
Sbjct: 61  PSFN-STFVPTIPKG----IEAPRNSLESHEVSLSYITREREEEKNLDIKMGIQIKT-SG 120

Query: 121 STKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILP--QNPSS------------ 180
            T+ KA     PN+       SP  KTPNL+ARLMGLD+LP  Q+PSS            
Sbjct: 121 HTEPKAG---APNDFSSEISNSPGTKTPNLVARLMGLDLLPETQSPSSFSSTHGTPKPLS 180

Query: 181 -----------SFNCRGSN-------FGTRSLPESPRVSSARLSDVDCHHRRLSLQIISD 240
                         CR  N        GT SLPE+PR+SSAR SDVD HH RLSLQI  +
Sbjct: 181 KSHSHPLHPRQPLQCRPRNSLEFNDITGTCSLPETPRISSARRSDVD-HHHRLSLQI--N 240

Query: 241 KENINFFE--EAKREKEKVSKKVALVDITNNNRKIEFGKQEVGFSQIKVEIKSSKKLKKT 300
           KENI+  +  E  R      +++ + +  NN+   ++ +Q V   + KV  K    +  T
Sbjct: 241 KENISASDDLEFSRFSYLRRRELRVYEDENNSSPSQYARQIVKQMKEKVSRKVGLDITNT 300

Query: 301 AVD-------------ESRRSSKVVRKNQEVMISKKQKLISMSMQKPKRRAREGEAFDCP 333
             +             +S+++SK   K  E     KQ   S     P+ R  E ++    
Sbjct: 301 IANRDLQGRNELVSQFKSKKASKGFNKVVEESSPGKQYYSSTPSCSPRVRFFEPKSKPST 360

BLAST of Cla97C01G005550 vs. TrEMBL
Match: tr|A0A0A0KNC2|A0A0A0KNC2_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G172790 PE=4 SV=1)

HSP 1 Score: 421.0 bits (1081), Expect = 3.8e-114
Identity = 302/539 (56.03%), Positives = 341/539 (63.27%), Query Frame = 0

Query: 1   MGKVEW--HFGGRSSSRRATTADHPR-QRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNH 60
           MGK EW             TT D    QR   SLPSCMSTLFH FDFRSS FTHIVFDNH
Sbjct: 1   MGKSEWXXXXXXXXXXXXVTTVDIDHFQRRDHSLPSCMSTLFHLFDFRSSHFTHIVFDNH 60

Query: 61  HPSSFNLPHHRPPL--PKASHHVVEAPRNSLELE-GASISCLRNKEKNLQLQMGLQIKTR 120
             SSF+L HH P L   KASHH VEAPRNSLEL+ G SISCLRNKE+NLQLQMGLQIKTR
Sbjct: 61  RSSSFDLSHHHPTLRPTKASHHGVEAPRNSLELDNGDSISCLRNKEENLQLQMGLQIKTR 120

Query: 121 NGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILPQNP-SSSFNCRGSNFGT 180
           NGSTKSKA+EQQLPNND+IIALESPS  TPNLLARLMGLD  PQ   SSS+N    N GT
Sbjct: 121 NGSTKSKATEQQLPNNDNIIALESPSTNTPNLLARLMGLDNFPQTTFSSSYNHCMPNLGT 180

Query: 181 RSLPESPRVSSARLSDVDCHHRRLSLQI-ISDKEN--INFFEE-AKREKEKVSK-KVALV 240
           RSL ESPR S +RLSDVD HHRRLSLQI I +KEN  I   EE +KREK+KV + KVAL+
Sbjct: 181 RSLSESPRNSLSRLSDVDYHHRRLSLQINIQEKENNKIKICEEISKREKKKVERPKVALI 240

Query: 241 DITNNNRKIEFGKQEVGFSQI-KVEIKSSKKLKKTAVDESRRSSKVVRKNQE-VMISKKQ 300
           DITN+  K+    QE+G SQ  KVE+KS KKLK            V R NQ+ V++S KQ
Sbjct: 241 DITNSYNKVRSKIQEIGSSQSRKVEMKSLKKLKXXXXXXXXXXXVVCRSNQKNVIVSNKQ 300

Query: 301 KLISMSMQKPK-RRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQAPREQPCRY 360
           K ISMSMQ PK RRAREGEA DCP SN L + L HSTIF                QPC Y
Sbjct: 301 KSISMSMQIPKERRAREGEALDCPRSNKL-DLLDHSTIF----------------QPCSY 360

Query: 361 SKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRI-LTNHGNSNSIISPPNNPTNPSI 420
            KGKAK A   GGE NAVD  TTTDGGS EF+YIK I +++  NSN ++ P       S 
Sbjct: 361 PKGKAKAA---GGETNAVDTATTTDGGSAEFKYIKTIQISSKENSNWVVVP------ASR 420

Query: 421 FHHPEAAEDQQWGRRL---------------LNCWHVRRGMK-GWELGEEAVRERSVKKE 480
           F+H  A E+++W +R+                  W  +RG K GWE              
Sbjct: 421 FYHSVAGEERRWKKRVELQQAVVGGDQIPNNKGWWQKQRGRKRGWE-------------- 480

Query: 481 YFPRAKYEVVEDMDALIINKR--------VVEETERIVKVVELHILDSLLRE-TVALIS 499
            FP  K+E+VE     +INK         + EE E IVK+VELHILDSLLRE T +LIS
Sbjct: 481 -FPHVKFELVE---YALINKDLEKSKFIIMAEEREGIVKLVELHILDSLLRELTHSLIS 495

BLAST of Cla97C01G005550 vs. TrEMBL
Match: tr|A0A2N9FV46|A0A2N9FV46_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS18927 PE=4 SV=1)

HSP 1 Score: 129.0 bits (323), Expect = 3.0e-26
Identity = 196/696 (28.16%), Positives = 288/696 (41.38%), Query Frame = 0

Query: 1   MGKVEWHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPS 60
           MG+ EW +GG   S  + +     ++   S   C+ST+FHFFDF    F    F     +
Sbjct: 1   MGR-EWPWGGTGRSSHSKSGGVGGEK-DTSFSGCISTVFHFFDFHQFHFQQPSF-----N 60

Query: 61  SFNLPHHRPPLPKASHHVVEAPRNSLELEGASISCLRNK----EKNLQLQMGLQIKTRNG 120
           S ++    P +PK     VEAPRNSLE +  S+S +  K    E+NL +QMG+QIKT +G
Sbjct: 61  STSILPQEPTIPKG----VEAPRNSLESQEVSLSYITRKKEEEEENLDIQMGIQIKT-SG 120

Query: 121 STKSKASE--QQLPNNDHIIALESPSGKTPNLLARLMGLDILPQ---------------- 180
           S    A++   ++ N        SP+ KTP L+ARLMGLD LP+                
Sbjct: 121 SKAGAANDFSSEISN--------SPATKTPTLVARLMGLDHLPETHSPSSFSSTHSTPKP 180

Query: 181 ---------NPSSSFNCRGSN-------FGTRSLPESPRVSSARLSDVDCHHRRLSLQII 240
                    +P     CR  N        GT SLPE+PR+SSAR SDVD HH RLSLQI 
Sbjct: 181 LSKSHSHPFHPRQPLQCRPRNSLEYNDITGTCSLPETPRISSARRSDVD-HHHRLSLQI- 240

Query: 241 SDKENINFFEE--------------------------------AKREKEKVSKKVALVDI 300
            +KEN++  E+                                  + KEKVS+KV L DI
Sbjct: 241 -NKENLSHSEDLEFSRFSYLRRKELRVDQDENNRSPSQYARQIVNQMKEKVSRKVGL-DI 300

Query: 301 TN--NNRKIEF--GKQEVGFSQIKVEIKSSKKLKKTAV---------------------- 360
           TN  NNR  +F   K   GF++I  E   SK+   ++                       
Sbjct: 301 TNTINNRDSQFKSKKSSKGFNKIVEESSPSKQYYSSSTPSCSPRLRIFEPKTKPSTTPPP 360

Query: 361 ---DESRRSSKV---------VRKNQEVMISKKQKLI-----------SMSMQKPKR--- 420
              D++  SSK          V+   + +  + QK I              ++KP +   
Sbjct: 361 STKDKALHSSKPLSLQSSPLNVKPKPQPLQHQGQKSIQKCKKSSTERFGQRLKKPPQTSD 420

Query: 421 --RAREGEAF------------------DCPTSNNLVN-NLHHSTIFPAKKEPSPPAIQA 480
             R ++ E F                    P SN+L++ N++  T  P KK+PSPPA + 
Sbjct: 421 IIRNKQEEPFVHPSKTATRANTPDKKCKKTPLSNDLLHINVNVPTRLPVKKDPSPPATKI 480

Query: 481 PREQ--------------PCRYSKGKAKPAGRDGGERNAVDKTTTTDGGSTEFEYIKRIL 493
           P++Q               C   + K +       + +  +   TT     E+EY+  +L
Sbjct: 481 PQKQVFDAQESKCNSQLSSCSSHEYKQEATCMLEAQESRSNGAFTTGASVDEYEYVTNLL 540

BLAST of Cla97C01G005550 vs. TrEMBL
Match: tr|A0A2P4MVD3|A0A2P4MVD3_QUESU (Protein longifolia 2 OS=Quercus suber OX=58331 GN=CFP56_51640 PE=4 SV=1)

HSP 1 Score: 120.2 bits (300), Expect = 1.4e-23
Identity = 133/383 (34.73%), Positives = 182/383 (47.52%), Query Frame = 0

Query: 1   MGKVEWHFGGRSSSRRATT-ADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHP 60
           MG+ E H+GGRSS+ ++ T      Q+   S   CMS +F FFDF   +F      +H  
Sbjct: 1   MGR-EKHWGGRSSNSKSNTGVAGGDQKDTSSFSGCMSAVFQFFDFHQFQFGL----HHQQ 60

Query: 61  SSFNLPHHRPPLPKASHHVVEAPRNSLELEGASISCL---RNKEKNLQLQMGLQIKTRNG 120
            SFN     P +PK     +EAPRNSLE    S+S +   R +EKNL ++MG+QIKT +G
Sbjct: 61  PSFN-STFVPTIPKG----IEAPRNSLESHEVSLSYITREREEEKNLDIKMGIQIKT-SG 120

Query: 121 STKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGLDILP--QNPSS------------ 180
            T+ KA     PN+       SP  KTPNL+ARLMGLD+LP  Q+PSS            
Sbjct: 121 HTEPKAG---APNDFSSEISNSPGTKTPNLVARLMGLDLLPETQSPSSFSSTHGTPKPLS 180

Query: 181 -----------SFNCRGSN-------FGTRSLPESPRVSSARLSDVDCHHRRLSLQIISD 240
                         CR  N        GT SLPE+PR+SSAR SDVD HH RLSLQI  +
Sbjct: 181 KSHSHPLHPRQPLQCRPRNSLEFNDITGTCSLPETPRISSARRSDVD-HHHRLSLQI--N 240

Query: 241 KENINFFE--EAKREKEKVSKKVALVDITNNNRKIEFGKQEVGFSQIKVEIKSSKKLKKT 300
           KENI+  +  E  R      +++ + +  NN+   ++ +Q V   + KV  K    +  T
Sbjct: 241 KENISASDDLEFSRFSYLRRRELRVYEDENNSSPSQYARQIVKQMKEKVSRKVGLDITNT 300

Query: 301 AVD-------------ESRRSSKVVRKNQEVMISKKQKLISMSMQKPKRRAREGEAFDCP 333
             +             +S+++SK   K  E     KQ   S     P+ R  E ++    
Sbjct: 301 IANRDLQGRNELVSQFKSKKASKGFNKVVEESSPGKQYYSSTPSCSPRVRFFEPKSKPST 360

BLAST of Cla97C01G005550 vs. TrEMBL
Match: tr|A0A151RFJ7|A0A151RFJ7_CAJCA (Uncharacterized protein OS=Cajanus cajan OX=3821 GN=KK1_037345 PE=4 SV=1)

HSP 1 Score: 113.2 bits (282), Expect = 1.7e-21
Identity = 179/647 (27.67%), Positives = 264/647 (40.80%), Query Frame = 0

Query: 1   MGKVEWHFGGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPS 60
           MG+ EWHF GR S R    +    +   PS   CM  +F FFDF    F H +   H  S
Sbjct: 1   MGR-EWHFAGRFSKRSGVGSAEESETQAPS--GCMCAVFQFFDFHPFHF-HNINHQHQQS 60

Query: 61  SFNLPHHRPPLPKASHHVVEAPRNSLELE--GASISCLRNKEKNLQLQMGLQIKTRNGST 120
           SF  P   P          EAPRNSLE E    +IS L +KE     +  +QIKT  G+ 
Sbjct: 61  SFKPPSCTPEDHTTVSKGAEAPRNSLESEDGDGTISSLSSKEDFKIPKNIIQIKTSGGTR 120

Query: 121 KSKASEQQLPNNDHIIALESPSGKTPNLLARLMG-------LDILPQNPSSSFNCRGS-- 180
            S  +   L +        SP  KTP L+ARLM        L             R S  
Sbjct: 121 TSGGNLNDLSSE----ISSSPGTKTPTLVARLMDTQGNVPHLXXXXXXXXXXXKHRNSID 180

Query: 181 ---NFGTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKKVA 240
                 TRSLP++PR+S AR SDVD HH RLSLQII +KEN+N  E+ +  +  +SK+  
Sbjct: 181 SSDIAATRSLPDTPRISLARRSDVD-HHHRLSLQII-NKENMNLGEDFELPRLSLSKRKC 240

Query: 241 LVDITNNNRKIEFGKQEVGFSQIK------------------------------------ 300
             D  +N R     K+    S +K                                    
Sbjct: 241 --DENHNGRSQIRSKKSHKTSSLKSIDETXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 300

Query: 301 ------------------------------------VEIKSSKKLKK-TAVDESRRSSKV 360
                                                E +S+K + K   V   + +S++
Sbjct: 301 LTPKDQNILKLPPPTPPQVNTQPQLPRVLTKPKPEQKEFQSNKSVPKCKRVTNEKFNSRL 360

Query: 361 VRKNQ--EVMISKKQKLISMSMQKPKRRAREGEAFDCPTSNNLVNNLHH-STIFPAKKEP 420
            R  Q  +++ +K+++   +    P  RA + ++     ++ L +NL++   + P K +P
Sbjct: 361 KRPPQTSDIIRNKQEEPFIIRPTSPSSRASDIKSTKTKKTHPLSSNLNNVPNVLPVKTDP 420

Query: 421 SPPAIQAPREQPCRY---------SKGKAKPAGRDGGERNAVDKTTTT---------DGG 480
           SPPA + P +Q  +          S   ++ +      +   ++T TT         +G 
Sbjct: 421 SPPATKIPPKQSQQVFDTEREESKSSWSSQLSSCSRHHKYKQEQTITTTLATRDNNLNGV 480

Query: 481 STEFEYIKRILTNHGNSNSIISPPNNPTNPSIFHHPEAAE---------DQQWGRRLL-- 502
           S     +++ L  H  ++ +   P+ P +PSIFHH E  +           +W RRLL  
Sbjct: 481 SASSTAVEQEL--HYITSILAVNPHPPLDPSIFHHLEHKDRNFAPNHHLGHRWNRRLLFD 540

BLAST of Cla97C01G005550 vs. TrEMBL
Match: tr|A0A022PZ67|A0A022PZ67_ERYGU (Uncharacterized protein OS=Erythranthe guttata OX=4155 GN=MIMGU_mgv1a002774mg PE=4 SV=1)

HSP 1 Score: 110.9 bits (276), Expect = 8.4e-21
Identity = 110/365 (30.14%), Positives = 166/365 (45.48%), Query Frame = 0

Query: 34  CMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPLPKASHHVVEAPRNSLELEGASI 93
           CM  +FH FD           ++HHP SF+  +H       +   VEAPRNSLE E    
Sbjct: 50  CMCAVFHLFD----------LNHHHPFSFHHSNHFFQEESITSKGVEAPRNSLEKELEPA 109

Query: 94  SCLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLMGL 153
           +    KE NL   +G+QIKTR  +  SK+  +   +  +  + +SPS KTP+L+ARLMGL
Sbjct: 110 AIKEEKEDNLTPPVGIQIKTRISNVASKSRTEDTISTGY-CSSDSPSAKTPSLVARLMGL 169

Query: 154 DILPQNPSSS---------------------------FNCRGSNFGTRSLPESP--RVSS 213
           D+LP +P+SS                           F+    + G RSLPE+P  ++SS
Sbjct: 170 DLLPDHPTSSSPSLIIPKQTKQHQHTNVVGSRNNKRCFSDDDISVGARSLPETPHHQISS 229

Query: 214 ARLSDVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKKVALVDITNNNRKIEFGKQE- 273
           AR SD + H+  +  +    +      +  K+ KE V +++ L DITN +      +++ 
Sbjct: 230 ARRSDSEYHY--IHKENTGSRPGHYAKQIVKQVKENVGRRIGLHDITNRDETSIVRRRDQ 289

Query: 274 ------------VGFSQIKVE---------------IKSSKKLKKTAVDESRRSSKVVRK 333
                       VGFS +K +                 SS KLKK           VV +
Sbjct: 290 NLVLLKPTKNIRVGFSDVKKKKHIPKXXXXXXXXXXXXSSMKLKKNI-------QSVVHE 349

Query: 334 NQEVMISKKQKLISMSMQKPKRRAREGEAFDCPTSNNLVNNLHHSTIFPAKKEPSPPAIQ 342
           ++ V   K  K ++M   K K++       D    +N   N+   T+ P KK+PSPPA +
Sbjct: 350 HKRVQQIKYGKKVNMMEGKKKKKK------DPLLMSNEKLNISGPTLLPVKKDPSPPATK 388

BLAST of Cla97C01G005550 vs. TAIR10
Match: AT5G51850.1 (unknown protein)

HSP 1 Score: 72.0 bits (175), Expect = 1.2e-12
Identity = 80/265 (30.19%), Positives = 119/265 (44.91%), Query Frame = 0

Query: 9   GGRSSSRRATTADHPRQRYRPSLPSCMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHR 68
           G  SSSR   TA+            CM+  +H            +FD+HH          
Sbjct: 21  GAFSSSRSKKTAN-----------GCMAAFYH------------LFDSHH---------- 80

Query: 69  PPLPKASHHVVEAPRNS--LELEGASISCLRNKEKNL-QLQMGLQIKTRNGSTKSKASEQ 128
                  H  +++P  S  L+L   S+     K+K +  + +G+++KT  G+  S+    
Sbjct: 81  -------HLTIDSPSRSKGLKLMEESLPSTTYKDKEISNIPVGMRVKTDTGTKSSRLRAL 140

Query: 129 QLPNNDHIIAL-ESPSGKTPNLLARLMGLDILPQNPSSSFNC--------------RGSN 188
              ++     +  SP  KTPNL+ARLMGLD+LP     + +               R S 
Sbjct: 141 VTDSSTSSSEICNSPGSKTPNLVARLMGLDLLPDKTDLNHSLSDLHTMSSHHITSHRLSK 200

Query: 189 FGTRSLPESPRVSSARLSDVDCHHRRLSLQIISDKE----NINFFEE------------A 239
            GTRSLP SPR+SSAR SD D H  RLSLQ+  +KE     +   +E             
Sbjct: 201 KGTRSLPVSPRISSARKSDFDIH--RLSLQLNREKEFGRSRLKEDQEESHSPRDYARQIV 243

BLAST of Cla97C01G005550 vs. TAIR10
Match: AT4G25430.1 (unknown protein)

HSP 1 Score: 57.0 bits (136), Expect = 3.9e-08
Identity = 60/163 (36.81%), Positives = 81/163 (49.69%), Query Frame = 0

Query: 72  PKASHHVVEAPRNSLEL-EGASISCLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNN 131
           P  +   + APRNSL+L E + +S     E+      GL I    G  KS      +   
Sbjct: 57  PSRTRKGLVAPRNSLDLSEESPLSTNYKLERE-----GLNISV--GGKKSTLRGLLVDTP 116

Query: 132 DHIIALESPSGKTPNLLARLMGLDILPQNPSSSFNCR---------GSNFGTRSLPESPR 191
            H   L  P  KTPN++ARLMGLD+LP N   + + R         G+  GTRSLP SPR
Sbjct: 117 SHNCNL--PRTKTPNVVARLMGLDLLPDNLELTRSPRNGVRGHRLSGNGSGTRSLPASPR 176

Query: 192 VSSARLSDVDCHHRRLSLQIISDKENINFFEEAKREKEKVSKK 225
           +SS      D  + RLSL++  ++EN N  EE  R + K  K+
Sbjct: 177 ISS------DSENHRLSLEL--NREN-NKHEEFVRTRLKELKQ 201

BLAST of Cla97C01G005550 vs. TAIR10
Match: AT5G62170.1 (unknown protein)

HSP 1 Score: 57.0 bits (136), Expect = 3.9e-08
Identity = 98/356 (27.53%), Positives = 145/356 (40.73%), Query Frame = 0

Query: 34  CMSTLFHFFDFRSSRFTHIVFDNHHPSSFNLPHHRPPLPKASHHVVEAPRNSLEL--EGA 93
           CMS +F+ FDF+     H+         F + HH   LPK     V+APRNSLE   E  
Sbjct: 44  CMSAVFNIFDFQ-----HL--------QFPINHHHLHLPKG----VDAPRNSLESTEEET 103

Query: 94  SISCLRNKEKNLQLQMGLQIKTRNGSTKSKASEQQLPNNDHIIALESPSGKTPNLLARLM 153
           S S  R K+ NL                                  SPS KTP L+ARLM
Sbjct: 104 SFSPTR-KDGNL-------XXXXXXXXXXXXXXXXXXXXXXXXXXXSPSIKTPTLVARLM 163

Query: 154 GLDILPQN------PSSSFN---------CRGSNF---------------GTRSLPESPR 213
           GLD++P N      PSSS +          R S+                GTRSLPE+PR
Sbjct: 164 GLDLVPDNYRSSPTPSSSSSSTLIDLKTPTRSSHAKKHRHYSLQRNSVDGGTRSLPETPR 223

Query: 214 VSSARLS-DVDCH-HRRLSLQIISDKENINFFEEAKREKEKVSKKVALVDITNNNRKIEF 273
           +S  R S DV+C+ H+R SL +  +  NIN F     E+E     V L  +   +   E 
Sbjct: 224 ISLGRRSVDVNCYEHQRSSLHLRDN--NINVFP----ERESGINNVRLTRVKEIHEDKEN 283

Query: 274 GKQEVGFSQIKVEIKSS-KKLKKTAVDESRRSSKVVRKNQEVMISKKQKLISMSMQKPKR 333
                   QI +++K +  + ++   D + + ++    ++    S K  +I+  +    R
Sbjct: 284 RSPREYARQIVMQLKENVSRRRRMGTDITNKETQPREVHESKKASSKTTIITHDVSSSPR 343

Query: 334 RAREGEAFDCPTS---NNLVNNLHHSTIFPAKKEPSPPAIQAPREQPCRYSKGKAK 352
                     PTS   NN+ + +  +T    + +   P +    E+P    K K +
Sbjct: 344 LGLTEVPKTKPTSLQTNNVASKILETTAMKVQDKTRLPTV---HEEPQGTEKEKQR 365

The following BLAST results are available for this feature:
Match Name                       E-value    Identity (%)  Description
XP_011656164.1                   5.7e-114   56.03         PREDICTED: uncharacterized protein LOC105435648 [Cucumis sativus] >KGN50399.1 hy...
XP_022958521.1                   5.6e-45    37.39         uncharacterized protein LOC111459727 [Cucurbita moschata]
XP_015878027.1                   1.4e-24    40.98         uncharacterized protein LOC107414431 [Ziziphus jujuba]
PQM34399.1                       4.2e-24    35.83         uncharacterized protein Pyn_07556 [Prunus yedoensis var. nudiflora]
XP_023892686.1                   2.1e-23    34.73         uncharacterized protein LOC112004681 [Quercus suber] >POF21508.1 protein longifo...

Match Name                       E-value    Identity (%)  Description
tr|A0A0A0KNC2|A0A0A0KNC2_CUCSA   3.8e-114   56.03         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_5G172790 PE=4 SV=1
tr|A0A2N9FV46|A0A2N9FV46_FAGSY   3.0e-26    28.16         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS18927 PE=4 SV=1
tr|A0A2P4MVD3|A0A2P4MVD3_QUESU   1.4e-23    34.73         Protein longifolia 2 OS=Quercus suber OX=58331 GN=CFP56_51640 PE=4 SV=1
tr|A0A151RFJ7|A0A151RFJ7_CAJCA   1.7e-21    27.67         Uncharacterized protein OS=Cajanus cajan OX=3821 GN=KK1_037345 PE=4 SV=1
tr|A0A022PZ67|A0A022PZ67_ERYGU   8.4e-21    30.14         Uncharacterized protein OS=Erythranthe guttata OX=4155 GN=MIMGU_mgv1a002774mg PE...

Match Name                       E-value    Identity (%)  Description

Match Name                       E-value    Identity (%)  Description
AT5G51850.1                      1.2e-12    30.19         unknown protein
AT4G25430.1                      3.9e-08    36.81         unknown protein
AT5G62170.1                      3.9e-08    27.53         unknown protein
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR032795   DUF3741-assoc
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
biological_process   GO:0016310       phosphorylation
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function
molecular_function   GO:0016301       kinase activity

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cla97C01G005550.1   Cla97C01G005550.1   mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term    IPR Description                     Source        Source Term   Source Description    Alignment
IPR032795   DUF3741-associated sequence motif   PFAM          PF14383       VARLMGL               coord: 137..161; e-value: 1.0E-7; score: 31.2
None        No IPR available                    MOBIDB_LITE   mobidb-lite   disorder_prediction   coord: 325..371
None        No IPR available                    MOBIDB_LITE   mobidb-lite   disorder_prediction   coord: 344..363
None        No IPR available                    PANTHER       PTHR37751     FAMILY NOT NAMED      coord: 169..290; coord: 269..496; coord: 13..163