Cla97C08G156670.1 (mRNA) Watermelon (97103) v2

Name: Cla97C08G156670.1
Type: mRNA
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: UPF0481 protein At3g47200-like
Location: Cla97Chr08 : 24409541 .. 24410914 (+)
Sequence length: 1374
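
The reported sequence length can be checked against the location coordinates. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the stated length of 1374), and the regular expression is only a guess at this page's "chromosome : start .. end (strand)" layout, not an official parser.

```python
import re

# Location string as shown in the record. The pattern below is an assumption
# about this particular "chrom : start .. end (strand)" layout.
location = "Cla97Chr08 : 24409541 .. 24410914 (+)"

match = re.fullmatch(r"(\S+) : (\d+) \.\. (\d+) \(([+-])\)", location)
chrom = match.group(1)
start = int(match.group(2))
end = int(match.group(3))
strand = match.group(4)

# With 1-based inclusive coordinates, both endpoints count toward the length.
span_length = end - start + 1
print(chrom, strand, span_length)
```

Because the gene, mRNA, and CDS sequences listed for this feature are all the same length, the genomic span and the coding sequence length coincide here.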
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAGGCAGCAGTAGTGACGAATCAAATAAAAATACTACTAAGTCAAGAGATGAAGAAATCGAAGCGATTTGTAATCGAGTGGTGGAATGCGTTCACCATTCAATGTTTCGAGAACTTGACAAAAGTTTGTCATTTTGTAAGGAACGTAGCATATACAAGGTTCCTAAGCCTTTGCGTAATGTGAACCCAAAAGCTTATAGTCCTCAAGTCATTTCGATCGGGCCTCTTCATCATTATCGCACTCGAAATGACCCGACCATTATGGAGAAAAAGGGGAGCTATGTTCTAAACTTTCTTACTGTTGCAAAGCTCAATTGGAAGGAGATGATAGAAAAAGTTATAGATTGGGAAGGAAGAGCTCGCAAATATTACGTGGAGACCATCGAAATGGAAAGAGATGAGTTCGTTCAACTTCTAATTTTTGACGGTTGTTTTGTGGTTATGTATATCATCGGTTCTATGGTGGAGGAGTTTCGGGACCTCGACACGACGTTTTTATGGAGATTCAGCAATGGAATATTTAAAGATCTTCTTATGCTTGAAAACCAACTTCCTTTCTTCCTTCTCCAAGCTCTTTACGACCTATGCGCCTCTTCACAACCTTCACTCAAAGAAATCTCTTTCATTGAACTACTTGGTGGATATTTCAAAAAAGCTCGTGAAGGGATGAGTTATGTTGAAAAAGGCTATTTTGACATGGACGCTAGTGAAGTAAACCATCTTGTTGATTTCATTAGAATACATTTGACTAAACCATCCACTTCTCCAAGATACTTGGGAGTTTCGTACGATGACTTATTCTCCATTTGGCCCCTCACTGCCACTGAGCTTCATGATTGTGGCATTTCTTTCCAAAAGATGAATAAATCATTCTACCATGGTCGAAGGAGGAAATGCAATACGGACGTGTATTTTTCAGAACGTGGGGGCGTTCTAAAAATGCCGGAAATCATAATAGACGATAGTTTCGAAATCATTTTCAGAAACATGATAGCTTATGAGTATTGTCACTTGAAGAGTAAGGATGTGAGCAACTTTGGTATGTTCATGCATTTCTTGATAAACACAAACAAGGATGTGAGTTTGCTGGTTGAGGACGGGATTATACAAAACCATTTGGGAAGCTCCAAGGAAATCGTGGCATTATTCAACGACCTTTGTAAGAACGTTATGGTTGAAAGGAATTTGTACAACTTGGAGTGTTGGAAAATGAAACAATATTGCAAGCACCGTCGACATCGGTGGATGACTTCGTTGAAACGCGACTATTTTGGGACGCCGTGGGCCTTTATCTCCTTTGTTGCTGCTCTCCTCCTCCTTTTACTCACTCTCTTGCAAACACTGTTAGCTTTCATTGCGTTATATAAGTGA

mRNA sequence

ATGGAAGGCAGCAGTAGTGACGAATCAAATAAAAATACTACTAAGTCAAGAGATGAAGAAATCGAAGCGATTTGTAATCGAGTGGTGGAATGCGTTCACCATTCAATGTTTCGAGAACTTGACAAAAGTTTGTCATTTTGTAAGGAACGTAGCATATACAAGGTTCCTAAGCCTTTGCGTAATGTGAACCCAAAAGCTTATAGTCCTCAAGTCATTTCGATCGGGCCTCTTCATCATTATCGCACTCGAAATGACCCGACCATTATGGAGAAAAAGGGGAGCTATGTTCTAAACTTTCTTACTGTTGCAAAGCTCAATTGGAAGGAGATGATAGAAAAAGTTATAGATTGGGAAGGAAGAGCTCGCAAATATTACGTGGAGACCATCGAAATGGAAAGAGATGAGTTCGTTCAACTTCTAATTTTTGACGGTTGTTTTGTGGTTATGTATATCATCGGTTCTATGGTGGAGGAGTTTCGGGACCTCGACACGACGTTTTTATGGAGATTCAGCAATGGAATATTTAAAGATCTTCTTATGCTTGAAAACCAACTTCCTTTCTTCCTTCTCCAAGCTCTTTACGACCTATGCGCCTCTTCACAACCTTCACTCAAAGAAATCTCTTTCATTGAACTACTTGGTGGATATTTCAAAAAAGCTCGTGAAGGGATGAGTTATGTTGAAAAAGGCTATTTTGACATGGACGCTAGTGAAGTAAACCATCTTGTTGATTTCATTAGAATACATTTGACTAAACCATCCACTTCTCCAAGATACTTGGGAGTTTCGTACGATGACTTATTCTCCATTTGGCCCCTCACTGCCACTGAGCTTCATGATTGTGGCATTTCTTTCCAAAAGATGAATAAATCATTCTACCATGGTCGAAGGAGGAAATGCAATACGGACGTGTATTTTTCAGAACGTGGGGGCGTTCTAAAAATGCCGGAAATCATAATAGACGATAGTTTCGAAATCATTTTCAGAAACATGATAGCTTATGAGTATTGTCACTTGAAGAGTAAGGATGTGAGCAACTTTGGTATGTTCATGCATTTCTTGATAAACACAAACAAGGATGTGAGTTTGCTGGTTGAGGACGGGATTATACAAAACCATTTGGGAAGCTCCAAGGAAATCGTGGCATTATTCAACGACCTTTGTAAGAACGTTATGGTTGAAAGGAATTTGTACAACTTGGAGTGTTGGAAAATGAAACAATATTGCAAGCACCGTCGACATCGGTGGATGACTTCGTTGAAACGCGACTATTTTGGGACGCCGTGGGCCTTTATCTCCTTTGTTGCTGCTCTCCTCCTCCTTTTACTCACTCTCTTGCAAACACTGTTAGCTTTCATTGCGTTATATAAGTGA

Coding sequence (CDS)

ATGGAAGGCAGCAGTAGTGACGAATCAAATAAAAATACTACTAAGTCAAGAGATGAAGAAATCGAAGCGATTTGTAATCGAGTGGTGGAATGCGTTCACCATTCAATGTTTCGAGAACTTGACAAAAGTTTGTCATTTTGTAAGGAACGTAGCATATACAAGGTTCCTAAGCCTTTGCGTAATGTGAACCCAAAAGCTTATAGTCCTCAAGTCATTTCGATCGGGCCTCTTCATCATTATCGCACTCGAAATGACCCGACCATTATGGAGAAAAAGGGGAGCTATGTTCTAAACTTTCTTACTGTTGCAAAGCTCAATTGGAAGGAGATGATAGAAAAAGTTATAGATTGGGAAGGAAGAGCTCGCAAATATTACGTGGAGACCATCGAAATGGAAAGAGATGAGTTCGTTCAACTTCTAATTTTTGACGGTTGTTTTGTGGTTATGTATATCATCGGTTCTATGGTGGAGGAGTTTCGGGACCTCGACACGACGTTTTTATGGAGATTCAGCAATGGAATATTTAAAGATCTTCTTATGCTTGAAAACCAACTTCCTTTCTTCCTTCTCCAAGCTCTTTACGACCTATGCGCCTCTTCACAACCTTCACTCAAAGAAATCTCTTTCATTGAACTACTTGGTGGATATTTCAAAAAAGCTCGTGAAGGGATGAGTTATGTTGAAAAAGGCTATTTTGACATGGACGCTAGTGAAGTAAACCATCTTGTTGATTTCATTAGAATACATTTGACTAAACCATCCACTTCTCCAAGATACTTGGGAGTTTCGTACGATGACTTATTCTCCATTTGGCCCCTCACTGCCACTGAGCTTCATGATTGTGGCATTTCTTTCCAAAAGATGAATAAATCATTCTACCATGGTCGAAGGAGGAAATGCAATACGGACGTGTATTTTTCAGAACGTGGGGGCGTTCTAAAAATGCCGGAAATCATAATAGACGATAGTTTCGAAATCATTTTCAGAAACATGATAGCTTATGAGTATTGTCACTTGAAGAGTAAGGATGTGAGCAACTTTGGTATGTTCATGCATTTCTTGATAAACACAAACAAGGATGTGAGTTTGCTGGTTGAGGACGGGATTATACAAAACCATTTGGGAAGCTCCAAGGAAATCGTGGCATTATTCAACGACCTTTGTAAGAACGTTATGGTTGAAAGGAATTTGTACAACTTGGAGTGTTGGAAAATGAAACAATATTGCAAGCACCGTCGACATCGGTGGATGACTTCGTTGAAACGCGACTATTTTGGGACGCCGTGGGCCTTTATCTCCTTTGTTGCTGCTCTCCTCCTCCTTTTACTCACTCTCTTGCAAACACTGTTAGCTTTCATTGCGTTATATAAGTGA

Protein sequence

MEGSSSDESNKNTTKSRDEEIEAICNRVVECVHHSMFRELDKSLSFCKERSIYKVPKPLRNVNPKAYSPQVISIGPLHHYRTRNDPTIMEKKGSYVLNFLTVAKLNWKEMIEKVIDWEGRARKYYVETIEMERDEFVQLLIFDGCFVVMYIIGSMVEEFRDLDTTFLWRFSNGIFKDLLMLENQLPFFLLQALYDLCASSQPSLKEISFIELLGGYFKKAREGMSYVEKGYFDMDASEVNHLVDFIRIHLTKPSTSPRYLGVSYDDLFSIWPLTATELHDCGISFQKMNKSFYHGRRRKCNTDVYFSERGGVLKMPEIIIDDSFEIIFRNMIAYEYCHLKSKDVSNFGMFMHFLINTNKDVSLLVEDGIIQNHLGSSKEIVALFNDLCKNVMVERNLYNLECWKMKQYCKHRRHRWMTSLKRDYFGTPWAFISFVAALLLLLLTLLQTLLAFIALYK
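
The protein sequence follows from the CDS by codon-wise translation. As a minimal sketch, assuming the standard nuclear genetic code, the snippet below translates only the first eight and last four codons of the CDS above; `translate` and the table construction are illustrative helpers, not part of the database's tooling.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 24 and last 12 nucleotides of the CDS shown in this record.
first_codons = "ATGGAAGGCAGCAGTAGTGACGAA"
last_codons = "TTATATAAGTGA"

print(translate(first_codons))  # start of the protein
print(translate(last_codons))   # end of the protein plus the stop codon
```

The two fragments translate to the start (`MEGSSSDE`) and end (`LYK`, followed by the TGA stop) of the protein sequence listed above.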
BLAST of Cla97C08G156670.1 vs. NCBI nr
Match: XP_008445209.1 (PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo])

HSP 1 Score: 692.6 bits (1786), Expect = 9.1e-196
Identity = 344/457 (75.27%), Positives = 386/457 (84.46%), Query Frame = 0

Query: 1   MEGSSSDESNKNTTKSRDEEIEAICNRVVECVHHSMFRELDKSLSFCKERSIYKVPKPLR 60
           ME ++ DESN N TKSRDEEI+ I +R+VE V+HS+ RE+  S SF +ERSIY VPK LR
Sbjct: 1   MEINNKDESNNNATKSRDEEIKEIYDRMVESVNHSISREISISTSFSRERSIYMVPKLLR 60

Query: 61  NVNPKAYSPQVISIGPLHHYRTRNDPTIMEKKGSYVLNFLTVAKLNWKEMIEKVIDWEGR 120
           N NPKAYSPQVISIGPLH+YRT+ND TI EKKGSYVLNFLTVAKL W EMI K + WE R
Sbjct: 61  NGNPKAYSPQVISIGPLHYYRTQNDLTIKEKKGSYVLNFLTVAKLGWNEMINKFLCWEER 120

Query: 121 ARKYYVETIEMERDEFVQLLIFDGCFVVMYIIGSMVEEFRDLDTTFLWRFSNGIFKDLLM 180
           AR YYVETI+MERDEF+QLLI+D CFVVMYIIGSMV EFRDLDT+FLWRFSNGIFKDLL+
Sbjct: 121 ARNYYVETIKMERDEFIQLLIYDSCFVVMYIIGSMVAEFRDLDTSFLWRFSNGIFKDLLL 180

Query: 181 LENQLPFFLLQALYDLCASSQPSLKEISFIELLGGYFKKAREGMSYVEKGYFDMDASEVN 240
           LENQLPFFLL  LY+LCA +QPSLK+ISFIELL GYF + REGMSYV +GY D+DA+EVN
Sbjct: 181 LENQLPFFLLHHLYNLCAFAQPSLKDISFIELLRGYFTEVREGMSYVNEGYLDIDANEVN 240

Query: 241 HLVDFIRIHLTKPSTSPRYLGVSYDDLFSIWPLTATELHDCGISFQKMNKSFYHGRRRKC 300
           HLVDF+RIHLT+P   P +L  S DD  S WPLTATELHDCGISF        HG++R C
Sbjct: 241 HLVDFLRIHLTQPRHIPHFLDFSVDDFLSSWPLTATELHDCGISF--------HGQKR-C 300

Query: 301 NTDVYFSERGGVLKMPEIIIDDSFEIIFRNMIAYEYCHLKSKDVSNFGMFMHFLINTNKD 360
             +V F ER GVLKMP+IIIDDSFEI+FRNMIAYEYCHLKSKDVSNFGMFMHFLINTNKD
Sbjct: 301 MMNVNFRERNGVLKMPKIIIDDSFEILFRNMIAYEYCHLKSKDVSNFGMFMHFLINTNKD 360

Query: 361 VSLLVEDGIIQNHLGSSKEIVALFNDLCKNVMVERNLYNLECWKMKQYCKHRRHRWMTSL 420
           VSLLV+DGIIQNHLGS++EIV LFNDLCKN+MVERNLY++EC KMK+YCKHRRHRWMTSL
Sbjct: 361 VSLLVDDGIIQNHLGSTREIVVLFNDLCKNIMVERNLYSIECRKMKEYCKHRRHRWMTSL 420

Query: 421 KRDYFGTPWAFISFVAALXXXXXXXXXXXLAFIALYK 458
           KRDYFGTPWAFISFVAA+           +AF+ALYK
Sbjct: 421 KRDYFGTPWAFISFVAAVLLLLLTLLQTVVAFVALYK 448
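
The percentages in each HSP summary line are simply the matched-column counts divided by the alignment length (gap columns included). A small sketch that reproduces the figures for the first two hits; `percent` is a hypothetical helper name.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Share of alignment columns that match, as a percentage with 2 decimals."""
    return round(100 * matches / alignment_length, 2)

# Counts taken from the HSP summary lines of the first two NCBI nr hits.
melo_identity = percent(344, 457)
melo_positives = percent(386, 457)
sativus_identity = percent(340, 458)
sativus_positives = percent(386, 458)

print(melo_identity, melo_positives)
print(sativus_identity, sativus_positives)
```

"Positives" counts identical residues plus conservative substitutions (the columns marked `+` in the midline), so it is always at least as large as the identity count.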

BLAST of Cla97C08G156670.1 vs. NCBI nr
Match: XP_004138858.1 (PREDICTED: UPF0481 protein At3g47200-like [Cucumis sativus] >KGN62944.1 hypothetical protein Csa_2G381700 [Cucumis sativus])

HSP 1 Score: 682.9 bits (1761), Expect = 7.2e-193
Identity = 340/458 (74.24%), Positives = 386/458 (84.28%), Query Frame = 0

Query: 1   MEGSSSDESNKNTTKSRDEEIEAICNRVVECVHHSMFRELDKSL-SFCKERSIYKVPKPL 60
           ME +  DESN NTTKSRDEEI+ I +R+V  V+ SMFRE+ +S  SF KERSIY VPK L
Sbjct: 1   MEINVYDESNNNTTKSRDEEIKVIYDRMVGSVNQSMFREISRSASSFSKERSIYMVPKLL 60

Query: 61  RNVNPKAYSPQVISIGPLHHYRTRNDPTIMEKKGSYVLNFLTVAKLNWKEMIEKVIDWEG 120
           R  NPKAYSPQVISIGPLH+YRT+ND  I EKKGSYVLNFLTVAKL+W EMI+K + WE 
Sbjct: 61  RKGNPKAYSPQVISIGPLHYYRTQND-LIKEKKGSYVLNFLTVAKLDWNEMIKKFLSWEE 120

Query: 121 RARKYYVETIEMERDEFVQLLIFDGCFVVMYIIGSMVEEFRDLDTTFLWRFSNGIFKDLL 180
           RAR YYVETIEM+RDEF+QLLI+D CFVVMY+IGSMV EFRDLDT+FLWRFSNGIFKDLL
Sbjct: 121 RARNYYVETIEMKRDEFIQLLIYDSCFVVMYVIGSMVAEFRDLDTSFLWRFSNGIFKDLL 180

Query: 181 MLENQLPFFLLQALYDLCASSQPSLKEISFIELLGGYFKKAREGMSYVEKGYFDMDASEV 240
           +LENQLPFFLL  LY+LCAS+QPSLK+ISFIELL GYF K REGMSYV++GYFD+DAS V
Sbjct: 181 LLENQLPFFLLNHLYNLCASAQPSLKDISFIELLRGYFSKVREGMSYVKEGYFDIDASAV 240

Query: 241 NHLVDFIRIHLTKPSTSPRYLGVSYDDLFSIWPLTATELHDCGISFQKMNKSFYHGRRRK 300
           NHLVDF+RIHLT+P   P + G+S DD  S WPLTATELH+CGISF        HG ++K
Sbjct: 241 NHLVDFLRIHLTQPRHIPHFFGLSVDDFLSSWPLTATELHECGISF--------HG-QKK 300

Query: 301 CNTDVYFSERGGVLKMPEIIIDDSFEIIFRNMIAYEYCHLKSKDVSNFGMFMHFLINTNK 360
           C  +V F ER GVLKMP+IIIDDSFEI+FRNMIAYEYCHLKSKD SNFGMFMHFLINTN+
Sbjct: 301 CMMNVSFKERRGVLKMPKIIIDDSFEILFRNMIAYEYCHLKSKDASNFGMFMHFLINTNE 360

Query: 361 DVSLLVEDGIIQNHLGSSKEIVALFNDLCKNVMVERNLYNLECWKMKQYCKHRRHRWMTS 420
           DVSLLV+DGIIQN LGS+KEIV LF+DLCKN+M+ERN Y++ CW+MK+YCKHRRHRWMTS
Sbjct: 361 DVSLLVDDGIIQNQLGSTKEIVVLFSDLCKNIMIERNFYSIACWRMKEYCKHRRHRWMTS 420

Query: 421 LKRDYFGTPWAFISFVAALXXXXXXXXXXXLAFIALYK 458
           LKRDYFGTPWAFISFVAA+           +AFIALYK
Sbjct: 421 LKRDYFGTPWAFISFVAAVLLLLLTLLQTVVAFIALYK 448

BLAST of Cla97C08G156670.1 vs. NCBI nr
Match: XP_022131636.1 (UPF0481 protein At3g47200-like [Momordica charantia])

HSP 1 Score: 353.6 bits (906), Expect = 1.0e-93
Identity = 194/397 (48.87%), Positives = 263/397 (66.25%), Query Frame = 0

Query: 52  IYKVPKPLRNVNPKAYSPQVISIG----PLHHYRTRNDPTIMEKKGSYVLNFLTVAKLNW 111
           I  VP  L  + P+AY PQ I IG     + H+ + N+  IM  KG +V  F +VAK+  
Sbjct: 6   IGNVPAALFEMKPEAYIPQFIFIGHPDPTVEHFVSGNNDKIM--KGCFVGKFSSVAKVEL 65

Query: 112 KEMIEKVIDWEGRARKYY--VETIEMERDEFVQLLIFDGCFVVMYIIGSMVEEFRDLDT- 171
            E+I +VI WE  AR  Y  +   +++  +FV+ L+ DGCFVVMY++ S+  EF+D+DT 
Sbjct: 66  NEIIGRVIGWEQTARDCYLFIGRRKLDTVQFVKFLVMDGCFVVMYMLVSVFPEFQDIDTS 125

Query: 172 TFLWRFSNGIFKDLLMLENQLPFFLLQALYDLCAS-SQPSLKEISFIELLGGYFKKAREG 231
           +F WRF++ +F+DLL+ +NQLPFFLL++LYDLC S +Q  L   SFI+L   +F   REG
Sbjct: 126 SFFWRFNDAVFRDLLLFQNQLPFFLLRSLYDLCVSNNQTILGNTSFIQLTHQFFID-REG 185

Query: 232 MSYVEKGY-FDMDASEVNHLVDFIRIHLTKPSTSPRYLGVSYDDLF-SIWPLTATELHDC 291
           + Y+ K +  + D  EV HL+ F+  ++  P        +    L  S+WP TATEL+D 
Sbjct: 186 IGYLGKDFRVEEDKLEVKHLIHFLSCYMNFPHQDDSSKSLDGTPLMVSVWPPTATELYDY 245

Query: 292 GISFQKMNKSFYHGRRRKCNTDVYFSERGGVLKMPEIIIDDSFEIIFRNMIAYEYCHLKS 351
           GISF+K  KS Y        +   F ER G+L++P III+++FE   RN+IAYEY   KS
Sbjct: 246 GISFEK--KSHY--------SQKMFDERTGILRVPHIIINETFESTMRNIIAYEYTIRKS 305

Query: 352 KDVSNFGMFMHFLINTNKDVSLLVEDGIIQNHLGSSKEIVALFNDLCKNVMVERNLYNLE 411
             VSNF MFM FL+N++ DV+LL+++GII NHL S+KE+  LF DLCKNV+ ERNLYN E
Sbjct: 306 PGVSNFLMFMRFLLNSDNDVNLLIKEGIIHNHLESAKEVTKLFQDLCKNVVAERNLYNYE 365

Query: 412 CWKMKQYCKHRRHRWMTSLKRDYFGTPWAFISFVAAL 439
           C KM++YCKHRRHRWM SLK DYF TPWA ISF+AA+
Sbjct: 366 CQKMRKYCKHRRHRWMASLKHDYFNTPWALISFIAAV 389

BLAST of Cla97C08G156670.1 vs. NCBI nr
Match: XP_022961893.1 (UPF0481 protein At3g47200-like isoform X1 [Cucurbita moschata] >XP_022961894.1 UPF0481 protein At3g47200-like isoform X1 [Cucurbita moschata] >XP_022961895.1 UPF0481 protein At3g47200-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 347.4 bits (890), Expect = 7.2e-92
Identity = 197/443 (44.47%), Positives = 270/443 (60.95%), Query Frame = 0

Query: 1   MEGSSSDESNKNTTKSRDEEIEAICNRVVECVHHSMFRELDKSLSFCKERSIYKVPKPLR 60
           ME      +N+N + +  E      + VV  ++  + + +    S   + +IYKVP+PLR
Sbjct: 127 MELCGRPNTNENNSLAEIENETGTFDPVVLSINRILQQAVSSGSS---DGTIYKVPEPLR 186

Query: 61  NVNPKAYSPQVISIGPLHHYRTRNDPTIMEKKGSYVLNFLTVAKLNWKEMIEKVIDWEGR 120
           ++ P+AY+P VISIGPLH    R D T    K  Y+ NFL + KL    ++E V  WE R
Sbjct: 187 SIKPEAYTPTVISIGPLH--SGRKDLTANSLKPMYLQNFLNLTKLPTNTIVETVKTWEKR 246

Query: 121 ARKYYVETIEMERDEFVQLLIFDGCFVVMYIIGSMVEEFRDLDTTFLWRFSNGIFKDLLM 180
           AR  Y E+IEM RDEFV+LL+FDGCFVVM++IG    E R  D + LW+F   +F DL++
Sbjct: 247 ARYCYAESIEMNRDEFVELLVFDGCFVVMHLIGYSFFELRASDMSNLWKFWYELFCDLIL 306

Query: 181 LENQLPFFLLQALYDLCASSQPSLKEISFIELLGGYFKKAREG--MSYVEKGYFDMDASE 240
           LENQLPFFLLQ+LYDLCASSQP LK + FIEL+  YF ++ +G   S  E         +
Sbjct: 307 LENQLPFFLLQSLYDLCASSQPLLKGVHFIELVHQYFIESHKGGLFSLKEHVLLAGIGVQ 366

Query: 241 VNHLVDFIRIHLTKPSTSPRYLGVSYDDLFSIWPLTATELHDCGISFQKMNKSFYHGRRR 300
           VNH VD +R+H T   +       S+   F  WP  AT+LH+CG+ F KM K        
Sbjct: 367 VNHFVDLLRLHFTHTRSDE----TSFQHTF--WPPNATKLHECGVIF-KMGKG------- 426

Query: 301 KCNTDVYFSERGGVLKMPEIIIDDSFEIIFRNMIAYEYCHLKSK---DVSNFGMFMHFLI 360
                + F ++GG L++P+I I D FE   RN+IAYE CH+ S+   +VSNF +FM  L+
Sbjct: 427 -----IAFKDQGGCLQLPQINIYDDFEKRVRNLIAYEQCHIGSELRNEVSNFAVFMQCLV 486

Query: 361 NTNKDVSLLVEDGIIQNHLGSSKEIVALFNDLCKNVMVERNLYNLECWKMKQYCKHRRHR 420
            T++DV LL+E GII N+ GS  E+  LFN+L K++    N YN +C +MK YCK  RHR
Sbjct: 487 QTDQDVKLLIEGGIIHNNFGSINEVTQLFNNLGKHICPGINSYNFDCKRMKDYCKRPRHR 545

Query: 421 WMTSLKRDYFGTPWAFISFVAAL 439
           W++ L+R+YF TPW   S +AA+
Sbjct: 547 WISLLRRNYFSTPWLCASSIAAI 545

BLAST of Cla97C08G156670.1 vs. NCBI nr
Match: XP_022961897.1 (UPF0481 protein At3g47200-like isoform X3 [Cucurbita moschata] >XP_022961898.1 UPF0481 protein At3g47200-like isoform X3 [Cucurbita moschata] >XP_022961899.1 UPF0481 protein At3g47200-like isoform X3 [Cucurbita moschata])

HSP 1 Score: 347.4 bits (890), Expect = 7.2e-92
Identity = 197/443 (44.47%), Positives = 270/443 (60.95%), Query Frame = 0

Query: 1   MEGSSSDESNKNTTKSRDEEIEAICNRVVECVHHSMFRELDKSLSFCKERSIYKVPKPLR 60
           ME      +N+N + +  E      + VV  ++  + + +    S   + +IYKVP+PLR
Sbjct: 28  MELCGRPNTNENNSLAEIENETGTFDPVVLSINRILQQAVSSGSS---DGTIYKVPEPLR 87

Query: 61  NVNPKAYSPQVISIGPLHHYRTRNDPTIMEKKGSYVLNFLTVAKLNWKEMIEKVIDWEGR 120
           ++ P+AY+P VISIGPLH    R D T    K  Y+ NFL + KL    ++E V  WE R
Sbjct: 88  SIKPEAYTPTVISIGPLH--SGRKDLTANSLKPMYLQNFLNLTKLPTNTIVETVKTWEKR 147

Query: 121 ARKYYVETIEMERDEFVQLLIFDGCFVVMYIIGSMVEEFRDLDTTFLWRFSNGIFKDLLM 180
           AR  Y E+IEM RDEFV+LL+FDGCFVVM++IG    E R  D + LW+F   +F DL++
Sbjct: 148 ARYCYAESIEMNRDEFVELLVFDGCFVVMHLIGYSFFELRASDMSNLWKFWYELFCDLIL 207

Query: 181 LENQLPFFLLQALYDLCASSQPSLKEISFIELLGGYFKKAREG--MSYVEKGYFDMDASE 240
           LENQLPFFLLQ+LYDLCASSQP LK + FIEL+  YF ++ +G   S  E         +
Sbjct: 208 LENQLPFFLLQSLYDLCASSQPLLKGVHFIELVHQYFIESHKGGLFSLKEHVLLAGIGVQ 267

Query: 241 VNHLVDFIRIHLTKPSTSPRYLGVSYDDLFSIWPLTATELHDCGISFQKMNKSFYHGRRR 300
           VNH VD +R+H T   +       S+   F  WP  AT+LH+CG+ F KM K        
Sbjct: 268 VNHFVDLLRLHFTHTRSDE----TSFQHTF--WPPNATKLHECGVIF-KMGKG------- 327

Query: 301 KCNTDVYFSERGGVLKMPEIIIDDSFEIIFRNMIAYEYCHLKSK---DVSNFGMFMHFLI 360
                + F ++GG L++P+I I D FE   RN+IAYE CH+ S+   +VSNF +FM  L+
Sbjct: 328 -----IAFKDQGGCLQLPQINIYDDFEKRVRNLIAYEQCHIGSELRNEVSNFAVFMQCLV 387

Query: 361 NTNKDVSLLVEDGIIQNHLGSSKEIVALFNDLCKNVMVERNLYNLECWKMKQYCKHRRHR 420
            T++DV LL+E GII N+ GS  E+  LFN+L K++    N YN +C +MK YCK  RHR
Sbjct: 388 QTDQDVKLLIEGGIIHNNFGSINEVTQLFNNLGKHICPGINSYNFDCKRMKDYCKRPRHR 446

Query: 421 WMTSLKRDYFGTPWAFISFVAAL 439
           W++ L+R+YF TPW   S +AA+
Sbjct: 448 WISLLRRNYFSTPWLCASSIAAI 446

BLAST of Cla97C08G156670.1 vs. TrEMBL
Match: tr|A0A1S3BD00|A0A1S3BD00_CUCME (UPF0481 protein At3g47200-like OS=Cucumis melo OX=3656 GN=LOC103488308 PE=4 SV=1)

HSP 1 Score: 692.6 bits (1786), Expect = 6.0e-196
Identity = 344/457 (75.27%), Positives = 386/457 (84.46%), Query Frame = 0

Query: 1   MEGSSSDESNKNTTKSRDEEIEAICNRVVECVHHSMFRELDKSLSFCKERSIYKVPKPLR 60
           ME ++ DESN N TKSRDEEI+ I +R+VE V+HS+ RE+  S SF +ERSIY VPK LR
Sbjct: 1   MEINNKDESNNNATKSRDEEIKEIYDRMVESVNHSISREISISTSFSRERSIYMVPKLLR 60

Query: 61  NVNPKAYSPQVISIGPLHHYRTRNDPTIMEKKGSYVLNFLTVAKLNWKEMIEKVIDWEGR 120
           N NPKAYSPQVISIGPLH+YRT+ND TI EKKGSYVLNFLTVAKL W EMI K + WE R
Sbjct: 61  NGNPKAYSPQVISIGPLHYYRTQNDLTIKEKKGSYVLNFLTVAKLGWNEMINKFLCWEER 120

Query: 121 ARKYYVETIEMERDEFVQLLIFDGCFVVMYIIGSMVEEFRDLDTTFLWRFSNGIFKDLLM 180
           AR YYVETI+MERDEF+QLLI+D CFVVMYIIGSMV EFRDLDT+FLWRFSNGIFKDLL+
Sbjct: 121 ARNYYVETIKMERDEFIQLLIYDSCFVVMYIIGSMVAEFRDLDTSFLWRFSNGIFKDLLL 180

Query: 181 LENQLPFFLLQALYDLCASSQPSLKEISFIELLGGYFKKAREGMSYVEKGYFDMDASEVN 240
           LENQLPFFLL  LY+LCA +QPSLK+ISFIELL GYF + REGMSYV +GY D+DA+EVN
Sbjct: 181 LENQLPFFLLHHLYNLCAFAQPSLKDISFIELLRGYFTEVREGMSYVNEGYLDIDANEVN 240

Query: 241 HLVDFIRIHLTKPSTSPRYLGVSYDDLFSIWPLTATELHDCGISFQKMNKSFYHGRRRKC 300
           HLVDF+RIHLT+P   P +L  S DD  S WPLTATELHDCGISF        HG++R C
Sbjct: 241 HLVDFLRIHLTQPRHIPHFLDFSVDDFLSSWPLTATELHDCGISF--------HGQKR-C 300

Query: 301 NTDVYFSERGGVLKMPEIIIDDSFEIIFRNMIAYEYCHLKSKDVSNFGMFMHFLINTNKD 360
             +V F ER GVLKMP+IIIDDSFEI+FRNMIAYEYCHLKSKDVSNFGMFMHFLINTNKD
Sbjct: 301 MMNVNFRERNGVLKMPKIIIDDSFEILFRNMIAYEYCHLKSKDVSNFGMFMHFLINTNKD 360

Query: 361 VSLLVEDGIIQNHLGSSKEIVALFNDLCKNVMVERNLYNLECWKMKQYCKHRRHRWMTSL 420
           VSLLV+DGIIQNHLGS++EIV LFNDLCKN+MVERNLY++EC KMK+YCKHRRHRWMTSL
Sbjct: 361 VSLLVDDGIIQNHLGSTREIVVLFNDLCKNIMVERNLYSIECRKMKEYCKHRRHRWMTSL 420

Query: 421 KRDYFGTPWAFISFVAALXXXXXXXXXXXLAFIALYK 458
           KRDYFGTPWAFISFVAA+           +AF+ALYK
Sbjct: 421 KRDYFGTPWAFISFVAAVLLLLLTLLQTVVAFVALYK 448

BLAST of Cla97C08G156670.1 vs. TrEMBL
Match: tr|A0A0A0LPK8|A0A0A0LPK8_CUCSA (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G381700 PE=4 SV=1)

HSP 1 Score: 682.9 bits (1761), Expect = 4.8e-193
Identity = 340/458 (74.24%), Positives = 386/458 (84.28%), Query Frame = 0

Query: 1   MEGSSSDESNKNTTKSRDEEIEAICNRVVECVHHSMFRELDKSL-SFCKERSIYKVPKPL 60
           ME +  DESN NTTKSRDEEI+ I +R+V  V+ SMFRE+ +S  SF KERSIY VPK L
Sbjct: 1   MEINVYDESNNNTTKSRDEEIKVIYDRMVGSVNQSMFREISRSASSFSKERSIYMVPKLL 60

Query: 61  RNVNPKAYSPQVISIGPLHHYRTRNDPTIMEKKGSYVLNFLTVAKLNWKEMIEKVIDWEG 120
           R  NPKAYSPQVISIGPLH+YRT+ND  I EKKGSYVLNFLTVAKL+W EMI+K + WE 
Sbjct: 61  RKGNPKAYSPQVISIGPLHYYRTQND-LIKEKKGSYVLNFLTVAKLDWNEMIKKFLSWEE 120

Query: 121 RARKYYVETIEMERDEFVQLLIFDGCFVVMYIIGSMVEEFRDLDTTFLWRFSNGIFKDLL 180
           RAR YYVETIEM+RDEF+QLLI+D CFVVMY+IGSMV EFRDLDT+FLWRFSNGIFKDLL
Sbjct: 121 RARNYYVETIEMKRDEFIQLLIYDSCFVVMYVIGSMVAEFRDLDTSFLWRFSNGIFKDLL 180

Query: 181 MLENQLPFFLLQALYDLCASSQPSLKEISFIELLGGYFKKAREGMSYVEKGYFDMDASEV 240
           +LENQLPFFLL  LY+LCAS+QPSLK+ISFIELL GYF K REGMSYV++GYFD+DAS V
Sbjct: 181 LLENQLPFFLLNHLYNLCASAQPSLKDISFIELLRGYFSKVREGMSYVKEGYFDIDASAV 240

Query: 241 NHLVDFIRIHLTKPSTSPRYLGVSYDDLFSIWPLTATELHDCGISFQKMNKSFYHGRRRK 300
           NHLVDF+RIHLT+P   P + G+S DD  S WPLTATELH+CGISF        HG ++K
Sbjct: 241 NHLVDFLRIHLTQPRHIPHFFGLSVDDFLSSWPLTATELHECGISF--------HG-QKK 300

Query: 301 CNTDVYFSERGGVLKMPEIIIDDSFEIIFRNMIAYEYCHLKSKDVSNFGMFMHFLINTNK 360
           C  +V F ER GVLKMP+IIIDDSFEI+FRNMIAYEYCHLKSKD SNFGMFMHFLINTN+
Sbjct: 301 CMMNVSFKERRGVLKMPKIIIDDSFEILFRNMIAYEYCHLKSKDASNFGMFMHFLINTNE 360

Query: 361 DVSLLVEDGIIQNHLGSSKEIVALFNDLCKNVMVERNLYNLECWKMKQYCKHRRHRWMTS 420
           DVSLLV+DGIIQN LGS+KEIV LF+DLCKN+M+ERN Y++ CW+MK+YCKHRRHRWMTS
Sbjct: 361 DVSLLVDDGIIQNQLGSTKEIVVLFSDLCKNIMIERNFYSIACWRMKEYCKHRRHRWMTS 420

Query: 421 LKRDYFGTPWAFISFVAALXXXXXXXXXXXLAFIALYK 458
           LKRDYFGTPWAFISFVAA+           +AFIALYK
Sbjct: 421 LKRDYFGTPWAFISFVAAVLLLLLTLLQTVVAFIALYK 448

BLAST of Cla97C08G156670.1 vs. TrEMBL
Match: tr|A0A2C9UB51|A0A2C9UB51_MANES (Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_16G120200 PE=4 SV=1)

HSP 1 Score: 239.2 bits (609), Expect = 1.8e-59
Identity = 148/411 (36.01%), Positives = 225/411 (54.74%), Query Frame = 0

Query: 35  SMFRELDKSLSFCKERSIYKVPKPLRNVNPKAYSPQVISIGPLHHYRTRNDPTIMEKKGS 94
           SM   L        ER IY+VPK LR+VN KAY+P+++SIGPLHH R      + E K  
Sbjct: 52  SMKNRLRSLAPVSSERCIYRVPKRLRDVNEKAYTPRLVSIGPLHHGRP-GLGAMEEHKWR 111

Query: 95  YVLNFLTVAKLNWKEMIEKVIDWEGRARKYYVETIEMERDEFVQLLIFDGCFVVMYIIGS 154
           ++ NFL   K+   ++++ + D E RAR  Y ETIE+  DEFVQ++  D  F +  ++G 
Sbjct: 112 HLQNFLQQTKVKLDDLVKFIKDREKRARNCYAETIELTSDEFVQIITVDAAFTIDILLGR 171

Query: 155 MVEEFRDLDTTFLWR--FSNGIFKDLLMLENQLPFFLLQALYDLCAS--SQPSLKEISFI 214
           +           + R      I++D+L++ENQLP+F+L  + DL  S  +  S +  S +
Sbjct: 172 VFPHLTCEIECVIDRSGLVFDIYRDMLLIENQLPYFILGDVLDLAKSRAASGSSQWPSLL 231

Query: 215 ELLGGYFKKAREGMSYVEKGYFDMDASEVNHLVDFIRI-HLTKPSTSPRYLGVSYDDLFS 274
            ++  YF       + ++  +  M +SEV H VDF+R+ H       P    + +D   S
Sbjct: 232 NIIHAYF----NSFAQLDHDFNTMKSSEVRHFVDFLRLCHRPFRQKQPLRRRLVFDGTKS 291

Query: 275 IWPLTATELHDCGISFQKMNKSFYHGRRRKCNTDVYFSERGGVLKMPEIIIDDSFEIIFR 334
           +     TELH+ G+ F+  +         K   D+ F +  G+L++P I + +  E  FR
Sbjct: 292 L-----TELHEAGVKFKVAS--------TKHLLDLQFID--GILEIPHIRVSEMTEAFFR 351

Query: 335 NMIAYEYCHLKSKDVSNFGMFMHFLINTNKDVSLLVEDGIIQNHLGSSKEIVALFNDLCK 394
           N+IA+E CH K  D+S++ + M  LINT  DV LLV+ GI++N L ++ E   LFN+L K
Sbjct: 352 NLIAFEQCHCKVSDISDYIVIMDILINTAHDVELLVKCGIMKNMLANNLEAAMLFNNLAK 411

Query: 395 NVMVERNL--YNLECWKMKQYCKHRRHRWMTSLKRDYFGTPWAFISFVAAL 439
            V+ + N+  Y+L C  +  YCK R H+W   LK +YF  PWA IS  AA+
Sbjct: 412 EVLFDSNVFSYSLLCEDLNDYCKVRWHKWQAILKHNYFNNPWAVISVTAAV 442

BLAST of Cla97C08G156670.1 vs. TrEMBL
Match: tr|B9RP75|B9RP75_RICCO (Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_0924610 PE=4 SV=1)

HSP 1 Score: 222.6 bits (566), Expect = 1.8e-54
Identity = 142/414 (34.30%), Positives = 227/414 (54.83%), Query Frame = 0

Query: 35  SMFRELDKSLSFCKERSIYKVPKPLRNVNPKAYSPQVISIGPLHHYRTRNDPTIMEKKGS 94
           SM   L+       ER IY+VPK +R+VN  AY+P+++SIGP HH +      + E K  
Sbjct: 26  SMKSRLENLSPVSSERCIYRVPKRIRDVNHNAYTPRLVSIGPFHHGKP-GLKAMEEHKWR 85

Query: 95  YVLNFLTVAKLNWKEMIEKVIDWEGRARKYYVETIEMERDEFVQLLIFDGCFVVMYIIGS 154
           ++ NFL   ++   ++++ + D E RAR  Y ETI +  DEFVQ+LI D  F +  ++G 
Sbjct: 86  HLQNFLRQTRVKLDDLVKFIKDREERARNCYAETIALTSDEFVQILIVDATFTIDILLGK 145

Query: 155 MVEEFRDLDTTFLWRFS--NGIFKDLLMLENQLPFFLLQALYDLCAS--SQPSLKEISFI 214
           ++ +          R S    I++D+L++ENQLP+F+L  + D   S  +  S +  S +
Sbjct: 146 VIPQLTCAIECVYDRSSLMFDIYRDMLLIENQLPYFILGDILDFAKSIAASGSSQWPSIL 205

Query: 215 ELLGGYFKKAREGMSYVEKG--YFDMDASEVNHLVDFIRI--HLTKPSTSPRYLGVSYDD 274
           EL   YF       SY++ G     M  SEVNH VDF+R+     KP  +PR      ++
Sbjct: 206 ELTRVYFN------SYMQLGRASHPMRRSEVNHFVDFLRLCHQPIKPRQTPR------EN 265

Query: 275 LFSIWPLTATELHDCGISFQKMNKSFYHGRRRKCNTDVYFSERGGVLKMPEIIIDDSFEI 334
                  ++TEL + G+ F+  + +           D+ F++  GVL++P I + +  E 
Sbjct: 266 RKFEMTRSSTELREAGVKFKVASTTHL--------LDIQFND--GVLEIPYIRVSEITEA 325

Query: 335 IFRNMIAYEYCHLKSKDVSNFGMFMHFLINTNKDVSLLVEDGIIQNHLGSSKEIVALFND 394
            FRN+IA+E CH  +  +S++ + M  LINT  DV +LV+ GI++  L ++ E   LFN+
Sbjct: 326 FFRNLIAFEQCHCHTSYISDYIVIMDSLINTPHDVEVLVKYGIMKVMLANNVEASTLFNN 385

Query: 395 LCKNVMVERNL--YNLECWKMKQYCKHRRHRWMTSLKRDYFGTPWAFISFVAAL 439
           L K ++ + ++  Y+L C  +  +CK R HRW  +LK +YF TPW  IS +A +
Sbjct: 386 LAKEILYDSHVFYYSLLCEDLNTFCKVRWHRWKATLKHNYFNTPWTAISVIAGV 416

BLAST of Cla97C08G156670.1 vs. TrEMBL
Match: tr|A0A1U8ATL2|A0A1U8ATL2_NELNU (UPF0481 protein At3g47200-like OS=Nelumbo nucifera OX=4432 GN=LOC104607450 PE=4 SV=1)

HSP 1 Score: 221.5 bits (563), Expect = 4.0e-54
Identity = 140/416 (33.65%), Positives = 236/416 (56.73%), Query Frame = 0

Query: 34  HSMFRELDKSLSFCKERSIYKVPKPLRNVNPKAYSPQVISIGPLHHYRTRNDPTIMEKKG 93
           +S+  +LD       E  I++VP+ +R +N +AY P+++S+GP H+ + R    + E K 
Sbjct: 25  NSIQEKLDDVSPLSSECCIFRVPEQMRKINEEAYRPRMVSLGPFHYGKQRL-AEMEEHKK 84

Query: 94  SYVLNFL-TVAKLNWKEMIEKVIDWEGRARKYYVETIEMERDEFVQLLIFDGCFVVMYII 153
            Y+ + L     +  +  +  + + E RARK Y ETI +  D+FV++++ DGCF++ Y  
Sbjct: 85  RYLKSLLRRNPSIKLEAYVNTMRELENRARKCYAETISLSSDQFVEMMLLDGCFIIAYFF 144

Query: 154 GSMVEEFR-DLDTTFLWRFSN-GIFKDLLMLENQLPFFLLQALYDLCASS---QPSLKEI 213
                  R + D  F   +++ G++ D+++LENQ+PFF+L+ L+ L  ++   + SL E+
Sbjct: 145 MCENPSLRSEFDPVFHAAWTDAGLYHDMVLLENQIPFFVLEYLFKLVNTTLNYKNSLVEL 204

Query: 214 SFIELLGGYFKKAREGMSYVEKGYFDMDASEVNHLVDFIRIHLTKPSTSPRYLGVSYDDL 273
           SF       F++  +    V     + ++S+V HL+D +R +    S   +  G   D +
Sbjct: 205 SF-----SVFEEFLQKEDRVPNQ--NPNSSQVKHLLDLLRNYHLPSSERKKPKG---DPM 264

Query: 274 FSIWPLTATELHDCGISFQKMNKSFYHGRRRKCNTDVYFSERGGVLKMPEIIIDDSFEII 333
           F++ P TATEL + G+ F+ +  + +         D+ F+E  GVL++P I I+D+ E+ 
Sbjct: 265 FNL-PPTATELQEAGVKFKMITSNSW--------LDIRFNE--GVLEIPNIKIEDATEVF 324

Query: 334 FRNMIAYEYCHLKSKDV--SNFGMFMHFLINTNKDVSLLVEDGIIQNHLGSSKEIVALFN 393
            RN++A+E C++K+  +   ++ +FM  LINT+KDV LL   GII N LG  +E+  LFN
Sbjct: 325 LRNLLAFEQCYMKNSTLYFVDYLLFMDSLINTSKDVGLLRGCGIIDNLLGDDEEVAHLFN 384

Query: 394 DLCKNVMVERN---LYNLECWKMKQYCKHRRHRWMTSLKRDYFGTPWAFISFVAAL 439
            L K V++  N    Y   C  +++Y K+  HRW   L  DYF TPWA ISF+AA+
Sbjct: 385 KLGKGVVMSENDNFYYLRLCQDVQEYYKNPLHRWRAKLVHDYFNTPWAIISFIAAV 418

BLAST of Cla97C08G156670.1 vs. Swiss-Prot
Match: sp|Q9SD53|Y3720_ARATH (UPF0481 protein At3g47200 OS=Arabidopsis thaliana OX=3702 GN=At3g47200 PE=2 SV=1)

HSP 1 Score: 136.3 bits (342), Expect = 8.3e-31
Identity = 122/417 (29.26%), Positives = 201/417 (48.20%), Query Frame = 0

Query: 52  IYKVPKPLRNVNPKAYSPQVISIGPLHHYRTRNDPTIMEKKGSYVLNFLTVAKLNWKE-- 111
           I++VP+    +NPKAY P+V+SIGP +HY  ++   I + K   +  FL  AK    E  
Sbjct: 48  IFRVPESFVALNPKAYKPKVVSIGP-YHYGEKHLQMIQQHKPRLLQLFLDEAKKKDVEEN 107

Query: 112 -MIEKVIDWEGRARKYYVETIEMERDEFVQLLIFDGCFVVM--YIIGSMVEEFRDLDTTF 171
            +++ V+D E + RK Y E ++   D  + +++ DGCF++M   I+   +E   D   + 
Sbjct: 108 VLVKAVVDLEDKIRKSYSEELKTGHD-LMFMMVLDGCFILMVFLIMSGNIELSEDPIFSI 167

Query: 172 LWRFSNGIFKDLLMLENQLPFFLLQALY---DLCASSQPSLKEISF------IELLGGYF 231
            W  S+ I  DLL+LENQ+PFF+LQ LY    +  SS   L  I+F      I+  G Y+
Sbjct: 168 PWLLSS-IQSDLLLLENQVPFFVLQTLYVGSKIGVSS--DLNRIAFHFFKNPIDKEGSYW 227

Query: 232 KKAREGMSYVEKGYFDM------------DASEVNHLVDFIRIHLTKPSTSPRYLGVSYD 291
           +K R   +Y  K   D+            D +   H+   +++H  K    P     S D
Sbjct: 228 EKHR---NYKAKHLLDLIRETFLPNTSESDKASSPHVQ--VQLHEGKSGNVP-----SVD 287

Query: 292 DLFSIWPLTATELHDCGISFQKMNKSFYHGRRRKCNTDVYFSERGGVLKMPEIIIDDSFE 351
                  L+A  L   GI F+         RR K ++ +    +   L++P++  D    
Sbjct: 288 SKAVPLILSAKRLRLQGIKFRL--------RRSKEDSILNVRLKKNKLQIPQLRFDGFIS 347

Query: 352 IIFRNMIAYEYCHL-KSKDVSNFGMFMHFLINTNKDVSLLVEDG-IIQNHLGSSKEIVAL 411
             F N +A+E  +   S +++ + +FM  L+N  +DV+ L  D  II+NH GS+ E+   
Sbjct: 348 SFFLNCVAFEQFYTDSSNEITTYIVFMGCLLNNEEDVTFLRNDKLIIENHFGSNNEVSEF 407

Query: 412 FNDLCKNVM--VERNLYNLECWKMKQYCKHRRHRWMTSLKRDYFGTPWAFISFVAAL 439
           F  + K+V+  V+ +  N     + +Y K   +      +  +F +PW F+S  A L
Sbjct: 408 FKTISKDVVFEVDTSYLNNVFKGVNEYTKKWYNGLWAGFRHTHFESPWTFLSSCAVL 441

BLAST of Cla97C08G156670.1 vs. Swiss-Prot
Match: sp|P0C897|Y3264_ARATH (Putative UPF0481 protein At3g02645 OS=Arabidopsis thaliana OX=3702 GN=At3g02645 PE=3 SV=1)

HSP 1 Score: 84.7 bits (208), Expect = 2.9e-15
Identity = 104/489 (21.27%), Positives = 186/489 (38.04%), Query Frame = 0

Query: 51  SIYKVPKPLRNVNPKAYSPQVISIGPLH------HYRTRNDPTIMEKKGSYVLNFLTVAK 110
           SI+ VPK L   +P +Y+P  +SIGP H      H   R    I  K  +   +F     
Sbjct: 44  SIFNVPKALMCSHPDSYTPHRVSIGPYHCLKPELHEMERYKLMIARKIRNQYNSF----- 103

Query: 111 LNWKEMIEKVIDWEGRARKYYVETIEMERDEFVQLLIFDGCFVVMYIIGSMVEEFRDLDT 170
             + +++EK+   E + R  Y + I    +  + ++  D  F++ ++    +  FR ++T
Sbjct: 104 -RFHDLVEKLQSMEIKIRACYHKYIGFNGETLLWIMAVDSSFLIEFL---KIYSFRKVET 163

Query: 171 TFLWRFSNGIFKDLLMLENQLPFFLLQALYDLCASSQPSLKEISFIELLGGYFKKAREGM 230
                  N I +D++M+ENQ+P F+L+   +    S  S  ++    L G     +   +
Sbjct: 164 LINRVGHNEILRDIMMIENQIPLFVLRKTLEFQLESTESADDLLLSVLTGLCKDLSPLVI 223

Query: 231 SYVEKGYFDMDASEVNHLVDFI-------------------------------------- 290
            + +         E NH++DF+                                      
Sbjct: 224 KFDDDQILKAQFQECNHILDFLYQMIVPRIEXXXXXXXXXXXXXXXXGGNRAIRFMDEIK 283

Query: 291 ----RIHLTKP------------STSPRY--LGVSYDDLFSIWPLTAT------------ 350
               R+  ++P            S  P +  L +S D LF+     AT            
Sbjct: 284 HQFKRVFASRPADLILRFPWRIISNLPGFMALKLSADYLFTRQENEATTTRQESVSILDI 343

Query: 351 ---------------ELHDCGISFQKMNKSFYHGRRRKCNTDVYFSERGGVLKMPEIIID 410
                          +LH  G+ F    K   HG      + V F    G   +P I +D
Sbjct: 344 EKPPLVEELTIPSVSDLHKAGVRF----KPTAHGN----ISTVTFDSNSGQFYLPVINLD 403

Query: 411 DSFEIIFRNMIAYEYCHLKSKDV-SNFGMFMHFLINTNKDVSLLVEDGIIQNHLGSSKEI 439
            + E + RN++AYE  +     V + +   ++ +I++ +DV LL E G++ + L S +E 
Sbjct: 404 INTETVLRNLVAYEATNTSGPLVFTRYTELINGIIDSEEDVRLLREQGVLVSRLKSDQEA 463

BLAST of Cla97C08G156670.1 vs. TAIR10
Match: AT4G31980.1 (unknown protein)

HSP 1 Score: 198.0 bits (502), Expect = 1.3e-50
Identity = 123/394 (31.22%), Positives = 204/394 (51.78%), Query Frame = 0

Query: 52  IYKVPKPLRNVNPKAYSPQVISIGPLHHYRTRNDPTIME-KKGSYVLNFLTVAKLNWKEM 111
           IYKVP  LR +NP AY+P+++S GPLH  R + +   ME +K  Y+L+F+     + +++
Sbjct: 295 IYKVPNKLRRLNPDAYTPRLVSFGPLH--RGKEELQAMEDQKYRYLLSFIPRTNSSLEDL 354

Query: 112 IEKVIDWEGRARKYYVETIEMERDEFVQLLIFDGCFVVMYIIGSMVEEFRDLDTTFLWR- 171
           +     WE  AR  Y E +++  DEFV++L+ DG F+V  ++ S     R  +       
Sbjct: 355 VRLARTWEQNARSCYAEDVKLHSDEFVEMLVVDGSFLVELLLRSHYPRLRGENDRIFGNS 414

Query: 172 -FSNGIFKDLLMLENQLPFFLLQALYDLCAS--SQPSLKEISFIELLGGYFKKAREGMSY 231
                + +D++++ENQLPFF+++ ++ L  +   Q +   I   +    YF    +   +
Sbjct: 415 MMITDVCRDMILIENQLPFFVVKEIFLLLLNYYQQGTPSIIQLAQRHFSYFLSRIDDEKF 474

Query: 232 VEKGYFDMDASEVNHLVDFIR-IHLTKPSTSPRYLGVSYDDLFSIWPLTATELHDCGISF 291
           +         +E  H VD +R  +L +      Y  V  D+        ATELH  G+ F
Sbjct: 475 I---------TEPEHFVDLLRSCYLPQFPIKLEYTTVKVDN-----APEATELHTAGVRF 534

Query: 292 QKMNKSFYHGRRRKCNTDVYFSERGGVLKMPEIIIDDSFEIIFRNMIAYEYCHLKSKDVS 351
           +    S        C  D+ F++  GVLK+P I++DD  E +++N+I +E C   +K+  
Sbjct: 535 KPAETS-------SCLLDISFAD--GVLKIPTIVVDDLTESLYKNIIGFEQCRCSNKNFL 594

Query: 352 NFGMFMHFLINTNKDVSLLVEDGIIQNHLGSSKEIVALFNDLCKNVMVERNLY-NLECWK 411
           ++ M +   I +  D  LL+  GII N+LG+S ++  LFN + K V+ +R  Y ++    
Sbjct: 595 DYIMLLGCFIKSPTDADLLIHSGIIVNYLGNSVDVSNLFNSISKEVIYDRRFYFSMLSEN 654

Query: 412 MKQYCKHRRHRWMTSLKRDYFGTPWAFISFVAAL 439
           ++ YC    +RW   L+RDYF  PWA  S  AAL
Sbjct: 655 LQAYCNTPWNRWKAILRRDYFHNPWAVASVFAAL 663

BLAST of Cla97C08G156670.1 vs. TAIR10
Match: AT3G50120.1 (Plant protein of unknown function (DUF247))

HSP 1 Score: 169.1 bits (427), Expect = 6.4e-42
Identity = 138/466 (29.61%), Positives = 225/466 (48.28%), Query Frame = 0

Query: 16  SRDEEIEAICNRVVECVHHSMFRELDKSLSFCKERSIYKVPKPLRNVNPKAYSPQVISIG 75
           SRD+ + +I ++ +E  H       D   +   +  IY+VP  L+  + K+Y PQ +S+G
Sbjct: 76  SRDDWVISITDK-LEQAHR------DDDTTLWGKLCIYRVPYYLQENDNKSYFPQTVSLG 135

Query: 76  PLHHYRTRNDPTIMEKKGSYVLNFLTVAKLNWKEMIEKVIDWEGRARKYYVETIEMERDE 135
           P HH + R   ++   K   V   L       K  I+ + + E +AR  Y   + +  +E
Sbjct: 136 PYHHGKKRL-RSMDRHKWRAVNRVLKRTNQGIKMYIDAMRELEEKARACYEGPLSLSSNE 195

Query: 136 FVQLLIFDGCFVVMYIIGSMVEEFRDL-----DTTFLWRFS-NGIFKDLLMLENQLPFFL 195
           F+++L+ DGCFV+    G+ VE F +L     D  F  R S + I +D++MLENQLP F+
Sbjct: 196 FIEMLVLDGCFVLELFRGA-VEGFTELGYARNDPVFAMRGSMHSIQRDMVMLENQLPLFV 255

Query: 196 LQALYDLCASSQPSLKEIS------FIELLGGYFKKAREGMSYVEKGY-----FD--MDA 255
           L  L +L   ++     ++      F  L+       + G S +E        FD   D 
Sbjct: 256 LNRLLELQLGTRNQTGLVAQLAIRFFDPLMPTDEPLTKSGQSKLENSLARDKSFDPFADM 315

Query: 256 SEVNHLVDFIRIHL--TKPSTSPRYLGVSYDDLFSIWPLTATELHDCGISFQKMNKSFYH 315
            E+ H +D  R  L  + P   PR     +     +      +L  C    ++    F  
Sbjct: 316 GEL-HCLDVFRRSLLRSSPKPEPRLTRKRWSRNTRVADKRRQQLIHCVTELKEAGIKF-- 375

Query: 316 GRRRKCNTDVYFSERGGVLKMPEIIIDDSFEIIFRNMIAYEYCHL-KSKDVSNFGMFMHF 375
            RRRK +       + G L++P ++I D  + +F N+IA+E CH+  S D++++ +FM  
Sbjct: 376 -RRRKTDRFWDMQFKNGYLEIPRLLIHDGTKSLFLNLIAFEQCHIDSSNDITSYIIFMDN 435

Query: 376 LINTNKDVSLLVEDGIIQNHLGSSKEIVALFNDLCKNVM--VERNLYNLECWKMKQYCKH 435
           LI++++DVS L   GII++ LGS  E+  LFN LC+ V+   E +  +    ++ +Y  H
Sbjct: 436 LIDSHEDVSYLHYCGIIEHWLGSDSEVADLFNRLCQEVVFDTEDSYLSRLSIEVNRYYDH 495

Query: 436 RRHRWMTSLKRDYFGTPWAFISFVAALXXXXXXXXXXXLAFIALYK 458
           + + W  +LK  YF  PWA +SF AA+            A  A YK
Sbjct: 496 KWNAWRATLKHKYFNNPWAIVSFCAAVILLVLTFSQSFYAVYAYYK 528

BLAST of Cla97C08G156670.1 vs. TAIR10
Match: AT3G50160.1 (Plant protein of unknown function (DUF247))

HSP 1 Score: 165.2 bits (417), Expect = 9.2e-41
Identity = 137/475 (28.84%), Positives = 234/475 (49.26%), Query Frame = 0

Query: 15  KSRDEEIEAICNRVVECVHHSMFREL-------------DKSLSFCKERSIYKVPKPLRN 74
           K R+ ++E++ +  +E  +    RE+             D + +      IY+VP  L+ 
Sbjct: 58  KPRETQVESVVS--IEDKNEQKLREIWVISLNDKMKTLGDNATTSWDNLCIYRVPPYLQE 117

Query: 75  VNPKAYSPQVISIGPLHHYRTRNDPTIMEKKGSYVLNFLTV-AKLNWKEMIEKVIDWEGR 134
            + K+Y PQ++SIGP HH      P  ME+     +N +   AK + +  I+ + + E +
Sbjct: 118 NDTKSYMPQIVSIGPYHHGHKHLMP--MERHKWRAVNMVMARAKHDIEMYIDAMKELEEK 177

Query: 135 ARKYYVETIEMERDEFVQLLIFDGCFVVMYIIGSMVEEFRDL-----DTTFLWR-FSNGI 194
           AR  Y   I M R+EF+++L+ DG F++    G+  E F+++     D  F  R     I
Sbjct: 178 ARACYQGPINMNRNEFIEMLVLDGVFIIEIFKGTS-EGFQEIGYAPNDPVFGMRGLMQSI 237

Query: 195 FKDLLMLENQLPFFLLQALYDLCASSQPSLKEISFIELLGGYFK---KAREGMSYVEKGY 254
            +D++MLENQLP+ +L+ L  L    +P + +   ++L   +F+     RE ++  E+G 
Sbjct: 238 RRDMVMLENQLPWSVLKGLLQL---QRPDVLDKVNVQLFQPFFQPLLPTREVLT--EEGG 297

Query: 255 FDMDASEVNHLVDFIRIHLTKPSTSPRYLGVSYDDLFSIWPL------TATELHDCGISF 314
                    H +D +R  L + S      G S +D+  +           TEL + G+ F
Sbjct: 298 L--------HCLDVLRRGLLQSS------GTSDEDMSMVNKQPQQLIHCVTELRNAGVEF 357

Query: 315 QKMNKSFYHGRRRKCNTDVYFSERGGVLKMPEIIIDDSFEIIFRNMIAYEYCHLK-SKDV 374
            +     +         D+ F  + G LK+P+++I D  + +F N+IA+E CH+K SK +
Sbjct: 358 MRKETGHF--------WDIEF--KNGYLKIPKLLIHDGTKSLFLNLIAFEQCHIKSSKKI 417

Query: 375 SNFGMFMHFLINTNKDVSLLVEDGIIQNHLGSSKEIVALFNDLCKNVMVERNLYNLEC-- 434
           +++ +FM  LIN+++DVS L   GII+N LGS  E+  LFN L K V+ + N   L    
Sbjct: 418 TSYIIFMDNLINSSEDVSYLHHYGIIENWLGSDSEVSDLFNGLGKEVIFDPNDGYLSALT 477

Query: 435 WKMKQYCKHRRHRWMTSLKRDYFGTPWAFISFVAALXXXXXXXXXXXLAFIALYK 458
            ++  Y + + +    +L+  YF  PWA+ SF+AA+            A  A +K
Sbjct: 478 GEVNIYYRRKWNYLKATLRHKYFNNPWAYFSFIAAVTLLIFTFCQSFFAVFAYFK 498

BLAST of Cla97C08G156670.1 vs. TAIR10
Match: AT3G50150.1 (Plant protein of unknown function (DUF247))

HSP 1 Score: 165.2 bits (417), Expect = 9.2e-41
Identity = 138/482 (28.63%), Positives = 234/482 (48.55%), Query Frame = 0

Query: 1   MEGSSSDESNKNTTKSRDEEIEAICNRVVECVHHSMFRELDKSLSFCKERSIYKVPKPLR 60
           +E S  +   +   ++R+E + +I +++ + + +      DK    C    IY+VP  L+
Sbjct: 48  VEPSKIEVKEEKPRETREEWVISIKDKMEKALSYDATNSWDK---LC----IYRVPFYLQ 107

Query: 61  NVNPKAYSPQVISIGPLHHYRTRNDPTIMEKKGSYVLNFLTV-AKLNWKEMIEKVIDWEG 120
             + K+Y PQ +SIGP HH +    P  ME+     +N +    K N +  I+ + + E 
Sbjct: 108 ENDKKSYLPQTVSIGPYHHGKVHLRP--MERHKWRAVNMIMARTKHNIEMYIDAMKELEE 167

Query: 121 RARKYYVETIEMER-DEFVQLLIFDGCFVVMYIIGSMVEEFRDL-----DTTFLWR-FSN 180
            AR  Y   I+M+  +EF ++L+ DGCFV+    G+ ++ F+ +     D  F  R   +
Sbjct: 168 EARACYQGPIDMKNSNEFTEMLVLDGCFVLELFKGT-IQGFQKIGYARNDPVFAKRGLMH 227

Query: 181 GIFKDLLMLENQLPFFLLQALYDLCASSQPSLKEISFIELLGGYFKKAREGMSYVEKGYF 240
            I +D++MLENQLP F+L  L  L  +  P+   I   E+   +FK        + K   
Sbjct: 228 SIQRDMIMLENQLPLFVLDRLLGL-QTGTPNQTGI-VAEVAVRFFKTLMPTSEVLTKSER 287

Query: 241 DMDASEVN---------HLVDFIRIHLTKPSTSPRYLGVSYDDLFSIWPL-----TATEL 300
            +D+ E +         H +D     L + S +    G  Y+D+  +          TEL
Sbjct: 288 SLDSQEKSDELGDNGGLHCLDVFHRSLIQSSETTNQ-GTPYEDMSMVEKQQQLIHCVTEL 347

Query: 301 HDCGISFQKMNKSFYHGRRRKCNTDVYFSERGGVLKMPEIIIDDSFEIIFRNMIAYEYCH 360
              G++F +               D+ F  + G LK+P+++I D  + +F N+IA+E CH
Sbjct: 348 RGAGVNFMRKETGQL--------WDIEF--KNGYLKIPKLLIHDGTKSLFSNLIAFEQCH 407

Query: 361 LK-SKDVSNFGMFMHFLINTNKDVSLLVEDGIIQNHLGSSKEIVALFNDLCKNVMVERNL 420
            + S +++++ +FM  LIN+++DVS L  DGII++ LGS  E+  LFN LCK V+ +   
Sbjct: 408 TQSSNNITSYIIFMDNLINSSQDVSYLHHDGIIEHWLGSDSEVADLFNRLCKEVIFDPKD 467

Query: 421 YNLE--CWKMKQYCKHRRHRWMTSLKRDYFGTPWAFISFVAALXXXXXXXXXXXLAFIAL 458
             L     ++ +Y   + +    +L++ YF  PWA+ SF AA+            A  A 
Sbjct: 468 GYLSQLSREVNRYYSRKWNSLKATLRQKYFNNPWAYFSFSAAVILLFLTFFQSFFAVYAY 506

BLAST of Cla97C08G156670.1 vs. TAIR10
Match: AT3G50170.1 (Plant protein of unknown function (DUF247))

HSP 1 Score: 164.1 bits (414), Expect = 2.1e-40
Identity = 129/435 (29.66%), Positives = 210/435 (48.28%), Query Frame = 0

Query: 52  IYKVPKPLRNVNPKAYSPQVISIGPLHHYRTRNDPTIMEKKGSYVLNFLTVAKLNWKEMI 111
           IY+VP  L+  + K+Y PQ +S+GP HH + R  P  ME+     LN +        EM 
Sbjct: 115 IYRVPHYLQENDKKSYFPQTVSLGPYHHGKKRLRP--MERHKWRALNKVLKRLKQRIEMY 174

Query: 112 EKVI-DWEGRARKYYVETIEMERDEFVQLLIFDGCFVVMYIIGSMVEEFRDL-----DTT 171
              + + E +AR  Y   I + R+EF ++L+ DGCFV+    G+ VE F ++     D  
Sbjct: 175 TNAMRELEEKARACYEGPISLSRNEFTEMLVLDGCFVLELFRGT-VEGFTEIGYARNDPV 234

Query: 172 FLWR-FSNGIFKDLLMLENQLPFFLLQALYDLCASSQPSLKEISFIEL--------LGGY 231
           F  R   + I +D++MLENQLP F+L  L +L   +Q     ++ + +         G  
Sbjct: 235 FAMRGLMHSIQRDMIMLENQLPLFVLDRLLELQLGTQNQTGIVAHVAVKFFDPLMPTGEA 294

Query: 232 FKKAREG--MSYVEKGYFDMDASEVNHLVDFIRIHLTKPSTSPRYLGV---------SYD 291
             K  +   M+++EK    +      H +D  R  L + S +P    +           D
Sbjct: 295 LTKPDQSKLMNWLEKSLDTLGDKGELHCLDVFRRSLLQSSPTPNTRSLLKRLTRNTRVVD 354

Query: 292 DLFSIWPLTATELHDCGISFQKMNKSFYHGRRRKCNTDVYFSERGGVLKMPEIIIDDSFE 351
                     TEL + G+ F+K        R+     D+ F  + G L++P+++I D  +
Sbjct: 355 KRQQQLVHCVTELREAGVKFRK--------RKTDRFWDIEF--KNGYLEIPKLLIHDGTK 414

Query: 352 IIFRNMIAYEYCHLKSKD-VSNFGMFMHFLINTNKDVSLLVEDGIIQNHLGSSKEIVALF 411
            +F N+IA+E CH++S + ++++ +FM  LIN+++DVS L   GII++ LGS  E+  LF
Sbjct: 415 SLFSNLIAFEQCHIESSNHITSYIIFMDNLINSSEDVSYLHYCGIIEHWLGSDSEVADLF 474

Query: 412 NDLCKNVMVERNLYNLE--CWKMKQYCKHRRHRWMTSLKRDYFGTPWAFISFVAALXXXX 458
           N LC+ V+ +    +L      + +Y   + +    +L   YF  PWA+ SF AA+    
Sbjct: 475 NRLCQEVVFDPKDSHLSRLSGDVNRYYNRKWNVLKATLTHKYFNNPWAYFSFSAAVILLL 534
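
The percentages in each HSP header above follow directly from the reported counts (for example, 138 identical positions out of 466 aligned columns gives 29.61%). The Python sketch below is one possible way to parse such header lines and recompute those values. The function name `parse_hsp` and the regular expressions are illustrative assumptions and are not part of the database or of BLAST itself; the pattern accepts both "Positives" and the "Postives" spelling that some listings contain.

```python
import re

# Patterns for the two HSP header lines as they appear in this listing.
# These are assumptions based on the text above, not an official format spec.
HSP_SCORE = re.compile(
    r"HSP\s+(?P<hsp>\d+)\s+Score:\s+(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),"
    r"\s+Expect\s+=\s+(?P<evalue>[\d.eE+-]+)"
)
HSP_IDENT = re.compile(
    r"Identity\s+=\s+(?P<ident>\d+)/(?P<ident_len>\d+)\s+\([\d.]+%\),"
    r"\s+Posi?tives\s+=\s+(?P<pos>\d+)/(?P<pos_len>\d+)\s+\([\d.]+%\)"
)


def parse_hsp(score_line, identity_line):
    """Parse one HSP header pair and recompute the percentages (hypothetical helper)."""
    s = HSP_SCORE.search(score_line)
    i = HSP_IDENT.search(identity_line)
    if s is None or i is None:
        raise ValueError("lines do not look like an HSP header")
    ident, ident_len = int(i["ident"]), int(i["ident_len"])
    pos, pos_len = int(i["pos"]), int(i["pos_len"])
    return {
        "hsp": int(s["hsp"]),
        "bits": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": float(s["evalue"]),
        # Percentages are recomputed from the counts, rounded to two decimals.
        "identity_pct": round(100 * ident / ident_len, 2),
        "positives_pct": round(100 * pos / pos_len, 2),
    }


if __name__ == "__main__":
    result = parse_hsp(
        "HSP 1 Score: 169.1 bits (427), Expect = 6.4e-42",
        "Identity = 138/466 (29.61%), Postives = 225/466 (48.28%), Query Frame = 0",
    )
    print(result)
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a quick consistency check when the listing has been copied or reformatted.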

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_008445209.1  9.1e-196  75.27     PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo]
XP_004138858.1  7.2e-193  74.24     PREDICTED: UPF0481 protein At3g47200-like [Cucumis sativus] >KGN62944.1 hypothet...
XP_022131636.1  1.0e-93   48.87     UPF0481 protein At3g47200-like [Momordica charantia]
XP_022961893.1  7.2e-92   44.47     UPF0481 protein At3g47200-like isoform X1 [Cucurbita moschata] >XP_022961894.1 U...
XP_022961897.1  7.2e-92   44.47     UPF0481 protein At3g47200-like isoform X3 [Cucurbita moschata] >XP_022961898.1 U...

Match Name                      E-value   Identity  Description
tr|A0A1S3BD00|A0A1S3BD00_CUCME  6.0e-196  75.27     UPF0481 protein At3g47200-like OS=Cucumis melo OX=3656 GN=LOC103488308 PE=4 SV=1
tr|A0A0A0LPK8|A0A0A0LPK8_CUCSA  4.8e-193  74.24     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G381700 PE=4 SV=1
tr|A0A2C9UB51|A0A2C9UB51_MANES  1.8e-59   36.01     Uncharacterized protein OS=Manihot esculenta OX=3983 GN=MANES_16G120200 PE=4 SV=...
tr|B9RP75|B9RP75_RICCO          1.8e-54   34.30     Uncharacterized protein OS=Ricinus communis OX=3988 GN=RCOM_0924610 PE=4 SV=1
tr|A0A1U8ATL2|A0A1U8ATL2_NELNU  4.0e-54   33.65     UPF0481 protein At3g47200-like OS=Nelumbo nucifera OX=4432 GN=LOC104607450 PE=4 ...

Match Name             E-value  Identity  Description
sp|Q9SD53|Y3720_ARATH  8.3e-31  29.26     UPF0481 protein At3g47200 OS=Arabidopsis thaliana OX=3702 GN=At3g47200 PE=2 SV=1
sp|P0C897|Y3264_ARATH  2.9e-15  21.27     Putative UPF0481 protein At3g02645 OS=Arabidopsis thaliana OX=3702 GN=At3g02645 ...

Match Name   E-value  Identity  Description
AT4G31980.1  1.3e-50  31.22     unknown protein
AT3G50120.1  6.4e-42  29.61     Plant protein of unknown function (DUF247)
AT3G50160.1  9.2e-41  28.84     Plant protein of unknown function (DUF247)
AT3G50150.1  9.2e-41  28.63     Plant protein of unknown function (DUF247)
AT3G50170.1  2.1e-40  29.66     Plant protein of unknown function (DUF247)

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR004158  DUF247_pln
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0016020      membrane

This mRNA is a part of the following gene feature(s):

Feature Name     Unique Name      Type
Cla97C08G156670  Cla97C08G156670  gene


The following exon feature(s) are a part of this mRNA:

Feature Name              Unique Name               Type
Cla97C08G156670.1.exon.1  Cla97C08G156670.1.exon.1  exon


The following CDS feature(s) are a part of this mRNA:

Feature Name             Unique Name              Type
Cla97C08G156670.1.CDS.1  Cla97C08G156670.1.CDS.1  CDS


The following polypeptide feature(s) derive from this mRNA:

Feature Name       Unique Name                Type
Cla97C08G156670.1  Cla97C08G156670.1-protein  polypeptide


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description                            Source   Source Term  Source Description  Alignment
IPR004158  Protein of unknown function DUF247, plant  PFAM     PF03140      DUF247              coord: 52..443; e-value: 7.6E-99; score: 331.5
None       No IPR available                           PANTHER  PTHR31549    FAMILY NOT NAMED    coord: 44..451