Tan0009390 (gene) Snake gourd v1

Overview
NameTan0009390
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPlant protein of unknown function (DUF247)
LocationLG11: 47637240 .. 47638610 (+)
RNA-Seq ExpressionTan0009390
SyntenyTan0009390
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGAACGGCCCTGTTGAAGCATACGAACAAAACAACGAACGATATGATATCATAGATATTGATGAGACTGAACAAATTCGAGATGATGTAACAATATTCATTGAAGAAAAGCTTGAGAAAATGCCTCCGATTATTCCAGAATGTAGCATCTATCGAGTTCCGAAGCTGCTAATGGAGATGAATGAAATGGCGTATGTGCCGCAAGTCATTTCAATTGGTCCATTTCACCATGATCAAACTGTTTTGAAAGCCACAGAAGAGCTGAAGCTTCGACTTTTTAACAGTTATCGATGCCGCGTAGATATGGATATTCAGGGCATTGTTGAAATGGTTCGAAAATGGGAGAAAAGAGCTCGTCGATACTACTCTGAATTCATAGACATGAGCAGTGACGAGTTTGTTAAAATGATGGTTTTAGATGGTTGTTTCATAGTGGAGCTCTTGATAACTGATTATGGAAATTTTCCTAAAACTGAAAACATGGTAATTTCTTCCATCTATGACGCTATATACTTTTCTATATGTGGTGACTTGATGAAGCTTCAAAATCAACTTCCTTTCTTCGTTCTTGAAGGTCTATTTGACCAAGTTTCACAGAGTCCTGAGAATAGTGTGTCCTTTGAACGTGTTGTACGCGTATTTCTCAGTACGCGTGTGTTAACATGTTCATATGTCCAGCAACCTTCTAATCTGTGGAATATAAAGCCACGACACTTGTTGGATTTCTTAAGCTTTTACTTTGTCCCATTGGAGGGTGAAAACTATGTACACCAAAGGACTTACCTTCCCCCAACTGCAACAGAGCTTAGTGAGGCTGGTGTTGTCTTGAAAAAAGCCGAAGAAAAGCACATTATGGACATAAGTTTCGAAAATGGGGTTTTGAAAATCCCACCTTTTGAAATTAACGATTGCTTTGAAACAAACGTGCGGAACTTGTTGGCATTTGAAGAGTTTACGATGGAGAATTTTGCGGAGCAAAGTGACGAGGTGCCTCACATGGCAAGTAATAAAAAGTATGCAATACATTATTTTTTGTTCCTAGATGAGTTGATAACCACAAAGCAAGATGTGCGTTTACTTGTGAAGGAAGGAATTATAATTAATAGCATTGGCGGCAGTGACAAGGAAATTTCAAAATTATTTAATGATCTTTGTAAAAATGCCACTCTACATGGATATAACTTCTTCAGCCATATTAGCAAAGATTTAAGAGATCACTGTAAAAGACGACGAAACAGGTGGATGGCTTCATTGAAACACAACTATTTCAATACCCCATGGGCTTTTATCTCTTTCTTGGCAGCAACCTTCCTTATTCTGCTCACTCTCTTACAAACCATATCATCACTTGTAACAATTTTCAAATAA

mRNA sequence

ATGGAGAACGGCCCTGTTGAAGCATACGAACAAAACAACGAACGATATGATATCATAGATATTGATGAGACTGAACAAATTCGAGATGATGTAACAATATTCATTGAAGAAAAGCTTGAGAAAATGCCTCCGATTATTCCAGAATGTAGCATCTATCGAGTTCCGAAGCTGCTAATGGAGATGAATGAAATGGCGTATGTGCCGCAAGTCATTTCAATTGGTCCATTTCACCATGATCAAACTGTTTTGAAAGCCACAGAAGAGCTGAAGCTTCGACTTTTTAACAGTTATCGATGCCGCGTAGATATGGATATTCAGGGCATTGTTGAAATGGTTCGAAAATGGGAGAAAAGAGCTCGTCGATACTACTCTGAATTCATAGACATGAGCAGTGACGAGTTTGTTAAAATGATGGTTTTAGATGGTTGTTTCATAGTGGAGCTCTTGATAACTGATTATGGAAATTTTCCTAAAACTGAAAACATGGTAATTTCTTCCATCTATGACGCTATATACTTTTCTATATGTGGTGACTTGATGAAGCTTCAAAATCAACTTCCTTTCTTCGTTCTTGAAGGTCTATTTGACCAAGTTTCACAGAGTCCTGAGAATAGTGTGTCCTTTGAACGTGTTGTACGCGTATTTCTCAGTACGCGTGTGTTAACATGTTCATATGTCCAGCAACCTTCTAATCTGTGGAATATAAAGCCACGACACTTGTTGGATTTCTTAAGCTTTTACTTTGTCCCATTGGAGGGTGAAAACTATGTACACCAAAGGACTTACCTTCCCCCAACTGCAACAGAGCTTAGTGAGGCTGGTGTTGTCTTGAAAAAAGCCGAAGAAAAGCACATTATGGACATAAGTTTCGAAAATGGGGTTTTGAAAATCCCACCTTTTGAAATTAACGATTGCTTTGAAACAAACGTGCGGAACTTGTTGGCATTTGAAGAGTTTACGATGGAGAATTTTGCGGAGCAAAGTGACGAGGTGCCTCACATGGCAAGTAATAAAAAGTATGCAATACATTATTTTTTGTTCCTAGATGAGTTGATAACCACAAAGCAAGATGTGCGTTTACTTGTGAAGGAAGGAATTATAATTAATAGCATTGGCGGCAGTGACAAGGAAATTTCAAAATTATTTAATGATCTTTGTAAAAATGCCACTCTACATGGATATAACTTCTTCAGCCATATTAGCAAAGATTTAAGAGATCACTGTAAAAGACGACGAAACAGGTGGATGGCTTCATTGAAACACAACTATTTCAATACCCCATGGGCTTTTATCTCTTTCTTGGCAGCAACCTTCCTTATTCTGCTCACTCTCTTACAAACCATATCATCACTTGTAACAATTTTCAAATAA

Coding sequence (CDS)

ATGGAGAACGGCCCTGTTGAAGCATACGAACAAAACAACGAACGATATGATATCATAGATATTGATGAGACTGAACAAATTCGAGATGATGTAACAATATTCATTGAAGAAAAGCTTGAGAAAATGCCTCCGATTATTCCAGAATGTAGCATCTATCGAGTTCCGAAGCTGCTAATGGAGATGAATGAAATGGCGTATGTGCCGCAAGTCATTTCAATTGGTCCATTTCACCATGATCAAACTGTTTTGAAAGCCACAGAAGAGCTGAAGCTTCGACTTTTTAACAGTTATCGATGCCGCGTAGATATGGATATTCAGGGCATTGTTGAAATGGTTCGAAAATGGGAGAAAAGAGCTCGTCGATACTACTCTGAATTCATAGACATGAGCAGTGACGAGTTTGTTAAAATGATGGTTTTAGATGGTTGTTTCATAGTGGAGCTCTTGATAACTGATTATGGAAATTTTCCTAAAACTGAAAACATGGTAATTTCTTCCATCTATGACGCTATATACTTTTCTATATGTGGTGACTTGATGAAGCTTCAAAATCAACTTCCTTTCTTCGTTCTTGAAGGTCTATTTGACCAAGTTTCACAGAGTCCTGAGAATAGTGTGTCCTTTGAACGTGTTGTACGCGTATTTCTCAGTACGCGTGTGTTAACATGTTCATATGTCCAGCAACCTTCTAATCTGTGGAATATAAAGCCACGACACTTGTTGGATTTCTTAAGCTTTTACTTTGTCCCATTGGAGGGTGAAAACTATGTACACCAAAGGACTTACCTTCCCCCAACTGCAACAGAGCTTAGTGAGGCTGGTGTTGTCTTGAAAAAAGCCGAAGAAAAGCACATTATGGACATAAGTTTCGAAAATGGGGTTTTGAAAATCCCACCTTTTGAAATTAACGATTGCTTTGAAACAAACGTGCGGAACTTGTTGGCATTTGAAGAGTTTACGATGGAGAATTTTGCGGAGCAAAGTGACGAGGTGCCTCACATGGCAAGTAATAAAAAGTATGCAATACATTATTTTTTGTTCCTAGATGAGTTGATAACCACAAAGCAAGATGTGCGTTTACTTGTGAAGGAAGGAATTATAATTAATAGCATTGGCGGCAGTGACAAGGAAATTTCAAAATTATTTAATGATCTTTGTAAAAATGCCACTCTACATGGATATAACTTCTTCAGCCATATTAGCAAAGATTTAAGAGATCACTGTAAAAGACGACGAAACAGGTGGATGGCTTCATTGAAACACAACTATTTCAATACCCCATGGGCTTTTATCTCTTTCTTGGCAGCAACCTTCCTTATTCTGCTCACTCTCTTACAAACCATATCATCACTTGTAACAATTTTCAAATAA

Protein sequence

MENGPVEAYEQNNERYDIIDIDETEQIRDDVTIFIEEKLEKMPPIIPECSIYRVPKLLMEMNEMAYVPQVISIGPFHHDQTVLKATEELKLRLFNSYRCRVDMDIQGIVEMVRKWEKRARRYYSEFIDMSSDEFVKMMVLDGCFIVELLITDYGNFPKTENMVISSIYDAIYFSICGDLMKLQNQLPFFVLEGLFDQVSQSPENSVSFERVVRVFLSTRVLTCSYVQQPSNLWNIKPRHLLDFLSFYFVPLEGENYVHQRTYLPPTATELSEAGVVLKKAEEKHIMDISFENGVLKIPPFEINDCFETNVRNLLAFEEFTMENFAEQSDEVPHMASNKKYAIHYFLFLDELITTKQDVRLLVKEGIIINSIGGSDKEISKLFNDLCKNATLHGYNFFSHISKDLRDHCKRRRNRWMASLKHNYFNTPWAFISFLAATFLILLTLLQTISSLVTIFK
Homology
BLAST of Tan0009390 vs. ExPASy Swiss-Prot
Match: Q9SD53 (UPF0481 protein At3g47200 OS=Arabidopsis thaliana OX=3702 GN=At3g47200 PE=2 SV=1)

HSP 1 Score: 157.1 bits (396), Expect = 4.6e-37
Identity = 119/437 (27.23%), Postives = 217/437 (49.66%), Query Frame = 0

Query: 49  CSIYRVPKLLMEMNEMAYVPQVISIGPFHHDQTVLKATEELK---LRLFNSYRCRVDMDI 108
           C I+RVP+  + +N  AY P+V+SIGP+H+ +  L+  ++ K   L+LF     + D++ 
Sbjct: 46  CCIFRVPESFVALNPKAYKPKVVSIGPYHYGEKHLQMIQQHKPRLLQLFLDEAKKKDVEE 105

Query: 109 QGIVEMVRKWEKRARRYYSEFIDMSSDEFVKMMVLDGCFIVELLITDYGNFPKTENMVIS 168
             +V+ V   E + R+ YSE +    D  + MMVLDGCFI+ + +   GN   +E+ + S
Sbjct: 106 NVLVKAVVDLEDKIRKSYSEELKTGHD-LMFMMVLDGCFILMVFLIMSGNIELSEDPIFS 165

Query: 169 SIYDAIYFSICGDLMKLQNQLPFFVLEGLFDQVSQSPENSVSFERVVRVFLSTRVLTCSY 228
             +  +  SI  DL+ L+NQ+PFFVL+ L+  V      S    R+   F        + 
Sbjct: 166 IPW--LLSSIQSDLLLLENQVPFFVLQTLY--VGSKIGVSSDLNRIAFHFFK------NP 225

Query: 229 VQQPSNLW----NIKPRHLLDFLSFYFVPLEGEN----------YVHQ------------ 288
           + +  + W    N K +HLLD +   F+P   E+           +H+            
Sbjct: 226 IDKEGSYWEKHRNYKAKHLLDLIRETFLPNTSESDKASSPHVQVQLHEGKSGNVPSVDSK 285

Query: 289 RTYLPPTATELSEAGVV--LKKAEEKHIMDISFENGVLKIPPFEINDCFETNVRNLLAFE 348
              L  +A  L   G+   L++++E  I+++  +   L+IP    +    +   N +AFE
Sbjct: 286 AVPLILSAKRLRLQGIKFRLRRSKEDSILNVRLKKNKLQIPQLRFDGFISSFFLNCVAFE 345

Query: 349 EFTMENFAEQSDEVPHMASNKKYAIHYFLFLDELITTKQDVRLLVKEGIIINSIGGSDKE 408
           +F    + + S+E+            Y +F+  L+  ++DV  L  + +II +  GS+ E
Sbjct: 346 QF----YTDSSNEI----------TTYIVFMGCLLNNEEDVTFLRNDKLIIENHFGSNNE 405

Query: 409 ISKLFNDLCKNATLH-GYNFFSHISKDLRDHCKRRRNRWMASLKHNYFNTPWAFISFLAA 454
           +S+ F  + K+       ++ +++ K + ++ K+  N   A  +H +F +PW F+S  A 
Sbjct: 406 VSEFFKTISKDVVFEVDTSYLNNVFKGVNEYTKKWYNGLWAGFRHTHFESPWTFLSSCAV 457

BLAST of Tan0009390 vs. NCBI nr
Match: XP_022131634.1 (UPF0481 protein At3g47200-like [Momordica charantia])

HSP 1 Score: 374.0 bits (959), Expect = 1.8e-99
Identity = 218/469 (46.48%), Postives = 301/469 (64.18%), Query Frame = 0

Query: 10  EQNNERYDIIDIDE--------------TEQIRDDVTIFIEEKLEKM--PPIIPECSIYR 69
           EQ+   ++ IDID                EQ    V I IEE  +++  PPI PECSIYR
Sbjct: 2   EQSGITFECIDIDRMTGSSVNIANNNEVDEQHCRHVVISIEEMCKRLPPPPIDPECSIYR 61

Query: 70  VPKLLMEMNEMAYVPQVISIGPFHH-DQTVLKATEELKLRLFNSYRCRVDMDIQGIVEMV 129
           VPK L+ MN  AY PQVISIGPFHH +Q+ L  T++ KL+  +SY  RV M ++ +V++ 
Sbjct: 62  VPKRLLNMNRKAYTPQVISIGPFHHSNQSNLIVTQQHKLQALDSYLHRVKMTVEAVVKIT 121

Query: 130 RKWEKRARRYYSEFIDMSSDEFVKMMVLDGCFIVELLITDYGNFPKTENMVISSIYDAIY 189
           + WE RAR  Y E I M++D+FV M++LDGCF+V  LI DY N+   EN   SS Y+A+ 
Sbjct: 122 QNWENRARSCYGEPIKMNNDKFVTMLLLDGCFVVXFLILDYNNYETDENGFDSSFYEAMS 181

Query: 190 FSICGDLMKLQNQLPFFVLEGLFDQVSQSPE-NSVSFERVVRVFLSTRVLTCSYVQQPSN 249
             I GD+  L+NQLPFFVL+GL+D + +  E  + S  +++  F S    + +  + P +
Sbjct: 182 SDIYGDMTMLENQLPFFVLQGLYDLIPKDHEIKNNSLIQLIETFFSR---SMNNHEIPCH 241

Query: 250 LWNIKPRHLLDFLSFYFVPLEGENYVHQRT--YLPPTATELSEAGVVLKKAEEKH-IMDI 309
           +     +HL+D LS YF+P       H +    + P  TEL EAGV +KK +E   +MDI
Sbjct: 242 VSPPNVKHLVDLLSLYFLPPCDTKQQHDKDEYLVTPCVTELCEAGVTIKKGKEATCLMDI 301

Query: 310 SFENGVLKIPPFEINDCFETNVRNLLAFEEFTMENFAEQSDEVPHMASNKKYAIHYFLFL 369
           SF+NGVL+IPP +I+D FET VRNL+AFE +   N+              +Y I Y LFL
Sbjct: 302 SFKNGVLEIPPLDIDDHFETIVRNLMAFEHYPAANY-------------HRYTIQYALFL 361

Query: 370 DELITTKQDVRLLVKEGIIINSIGGSDKEISKLFNDLCKNATL-HGYNFFSHISKDLRDH 429
           D +I+T++DVRLLV+ GIIINSIGGSDKE+S+LFNDL K  ++  G ++ +HI+K L DH
Sbjct: 362 DYMISTEKDVRLLVEAGIIINSIGGSDKEVSRLFNDLGKYVSIPGGVHYLNHITKPLHDH 421

Query: 430 CKRRRNRWMASLKHNYFNTPWAFISFLAATFLILLTLLQTISSLVTIFK 457
           CK+   R  A+LK +YFN+PWAFIS +AAT++I+LTLLQTI + ++ FK
Sbjct: 422 CKKWWPRSKATLKRDYFNSPWAFISIVAATYIIILTLLQTIFTAISTFK 454

BLAST of Tan0009390 vs. NCBI nr
Match: XP_031736550.1 (UPF0481 protein At3g47200-like [Cucumis sativus] >XP_031736551.1 UPF0481 protein At3g47200-like [Cucumis sativus] >XP_031736552.1 UPF0481 protein At3g47200-like [Cucumis sativus])

HSP 1 Score: 364.4 bits (934), Expect = 1.4e-96
Identity = 217/475 (45.68%), Postives = 301/475 (63.37%), Query Frame = 0

Query: 10  EQNNERYDIIDIDETEQIRDDVTIFIEEKLEKMPPI-IPECSIYRVPKLLMEMNEMAYVP 69
           +++N   +I  +D+ + I D+V I IE+ L+++P     +CSIYRVPK L EMN  AY P
Sbjct: 22  QKSNNVVEISGVDQQQLICDNVVISIEKMLDQVPSAQEKQCSIYRVPKQLCEMNPKAYAP 81

Query: 70  QVISIGPF-HHDQTVLKATEELKLRLFNSYRCRVD----------MDIQGIVEMVRKWEK 129
           Q+ISIGPF +H    L A E+ KL+ FN++  RV+            +  +V+  + W K
Sbjct: 82  QLISIGPFYYHAHKNLIANEQYKLQGFNNFLHRVNKMSLEQQERTRSLNDLVKKAQSWVK 141

Query: 130 RARRYYSEFIDMSSDEFVKMMVLDGCFIVELLITDYGN--------FPKTENMVISSIYD 189
            AR  Y+E I+M+ ++F+KMM++DGCFIVE  I DY          FP+ EN V  S Y 
Sbjct: 142 EARNCYAESINMNDEDFIKMMLVDGCFIVEFFILDYEEYKEPHESLFPQIENNVSMSFYK 201

Query: 190 AIYFSICGDLMKLQNQLPFFVLEGLFDQVSQSPENSVSFERVVRVFLSTRVLTCSYVQQP 249
                I  DL+KL+NQLPFFVL+ LFD + +  +N   F+++   +L+   L  +Y  +P
Sbjct: 202 ERIPDIDDDLIKLENQLPFFVLQHLFDLIPKHNDNPNCFKQLTYKYLNMGWLE-NY--EP 261

Query: 250 SNLWNIKPRHLLDFLSFYFVP-------LEGENYVHQRTYLPPTATELSEAGVVLKKAEE 309
           S++ +IKP+H +DFLSFYFVP        E  +       +PP+ TEL EAGV +KKAE 
Sbjct: 262 SDILSIKPKHFIDFLSFYFVPHHRCEHDQESSDMKEWNVIIPPSITELCEAGVTIKKAEN 321

Query: 310 -KHIMDISFENGVLKIPPFEINDCFETNVRNLLAFEEFTMENFAEQSDEVPHMASNKKYA 369
            K +M+I FENG+L+IPP  I+D FE  +RNLLAFE F +E              N  Y 
Sbjct: 322 TKCLMNIRFENGILEIPPLHIDDYFEPMMRNLLAFEHFPVE-------------VNNTYV 381

Query: 370 IHYFLFLDELITTKQDVRLLVKEGIIINSIGGSDKEISKLFNDLCK-NATLHGYNFFSHI 429
           I Y  F+D LI+T++DV LLVKE IIIN IGGSD+E+S+LFN+LCK  ++    N+F++I
Sbjct: 382 IPYVTFMDYLISTEKDVNLLVKEKIIINDIGGSDREVSQLFNNLCKFVSSSPNDNYFNNI 441

Query: 430 SKDLRDHCKRRRNRWMASLKHNYFNTPWAFISFLAATFLILLTLLQTISSLVTIF 456
           S+ LR+HC R  N+  ASLKHNYFNTPWA ISF AAT L++LT+LQT+ S ++ F
Sbjct: 442 SEGLREHCDRWWNKAKASLKHNYFNTPWAAISFSAATVLLVLTILQTVFSAISAF 480

BLAST of Tan0009390 vs. NCBI nr
Match: XP_022132066.1 (UPF0481 protein At3g47200-like [Momordica charantia])

HSP 1 Score: 362.8 bits (930), Expect = 4.2e-96
Identity = 213/463 (46.00%), Postives = 288/463 (62.20%), Query Frame = 0

Query: 1   MENGPVEAYEQNNERYDIIDIDETEQIRDDVTIFIEEKLEKMPPIIPECSIYRVPKLLME 60
           ME+  +E Y+ N +      IDE E  +  VTI ++  LEK+ PI  ECSIYRV K L  
Sbjct: 1   MEDDHIETYDLNKK------IDEVELEQPHVTISMKNMLEKLHPISEECSIYRVSKRLHN 60

Query: 61  MNEMAYVPQVISIGPFHHDQTVLKATEELKLRLFNSYRCRVDMDIQGIVEMVRKWEKRAR 120
           +N+MAY PQ ISIGPFHH Q    A E+LKLR  ++Y  RV M I+   E+ + WE RAR
Sbjct: 61  INDMAYTPQAISIGPFHHGQKEFMAMEQLKLRFLDAYLRRVGMGIEDAFEIAQGWETRAR 120

Query: 121 RYYSEFIDMSSDEFVKMMVLDGCFIVELLITDYGNFPKTENMVISSIYDAIYFSICGDLM 180
           + Y+E IDM SD FVKMM++DG F+VE +   Y     T+  +  +++ AI+  I  DL+
Sbjct: 121 KCYAEHIDMKSDNFVKMMLVDGAFLVEFIRMHYQWATMTQPNLNYTLFQAIHVDIYRDLI 180

Query: 181 KLQNQLPFFVLEGLFDQVSQSPENSVSFERVVRVFLSTRVLTCSYVQQPSNLWNIKPRHL 240
            L+NQLPFF+LE L D+ S S    +      R +   R L          L   KP HL
Sbjct: 181 LLENQLPFFILECLLDKCSSSTPFVLFTSTFCRWYTGARELI------SDKLLTKKPNHL 240

Query: 241 LDFLSFYF----VPLEGENYVHQRTYLPPTATELSEAGVVLKKAEE--KHIMDISFENGV 300
           +DFLSFY+    V  + +   + +   PPTATEL EAGV  +KA E  + IMDI F++GV
Sbjct: 241 VDFLSFYYALPTVTGKNDKLKYNKRESPPTATELWEAGVEFQKATEDKRLIMDIRFKDGV 300

Query: 301 LKIPPFEINDCFETNVRNLLAFEEFTMENFAEQSDEVPHMASNKKYAIHYFLFLDELITT 360
           L IP  EI+D FET VRNLLA+E +             H+  +++  I Y  FLDELI+T
Sbjct: 301 LSIPHLEIHDAFETYVRNLLAYEHY-------------HIGDDERCLIQYVYFLDELIST 360

Query: 361 KQDVRLLVKEGIIINSIGGSDKEISKLFNDLCKNATLH-GYNFFSHISKDLRDHCKRRRN 420
           ++DV LLVK GII N+IGG+++++SKLFNDLCK+  +   + +++ IS DL  +C+   +
Sbjct: 361 ERDVSLLVKAGIITNNIGGNNEDVSKLFNDLCKDINISCDFYYYADISMDLHKYCETWWH 420

Query: 421 RWMASLKHNYFNTPWAFISFLAATFLILLTLLQTISSLVTIFK 457
           R MASL+ +YFNTPWAFISFLAATFL+LLT +Q I S ++  K
Sbjct: 421 RSMASLRRDYFNTPWAFISFLAATFLVLLTSMQAIYSAISYHK 438

BLAST of Tan0009390 vs. NCBI nr
Match: XP_008445188.1 (PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo])

HSP 1 Score: 359.0 bits (920), Expect = 6.1e-95
Identity = 218/493 (44.22%), Postives = 303/493 (61.46%), Query Frame = 0

Query: 2   ENGPVEAY----EQNNERYDIIDIDETEQIRDDVTIFIEEKLEKMPPIIP-ECSIYRVPK 61
           ENG  E +    +++N   +I  +D+ + + D+V I IE+ L+++PP    +CSIYRVPK
Sbjct: 8   ENGQQEVHGVSNQKSNNMVEISVVDQQQLVCDNVVISIEKMLDQVPPTHENQCSIYRVPK 67

Query: 62  LLMEMNEMAYVPQVISIGPFH-HDQTVLKATEELKLRLFNSYRCRV-----------DMD 121
            L EMN  AY PQ+ISIGPFH H    L A E+ KL+ F +Y  RV              
Sbjct: 68  QLREMNPKAYAPQLISIGPFHYHTHKNLIANEQYKLQGFINYLRRVYKMESLEQLVRTKS 127

Query: 122 IQGIVEMVRKWEKRARRYYSEFIDMSSDEFVKMMVLDGCFIVELLITDYGN--------F 181
           ++ +V+  + W + AR  Y+E I+M+ ++F+KMM++DGCFIVE  I D+          F
Sbjct: 128 VEDLVKRAQSWVEEARNCYAETINMNDEDFIKMMLVDGCFIVEFFILDFEEYNESHESLF 187

Query: 182 PKTENMVISSIYDAIYFSICGDLMKLQNQLPFFVLEGLFDQVSQSPENSVSFERVVRVFL 241
           P+ EN V  S Y      I  DL+KL+NQLPFFVL+ LFD + +  +    F++     L
Sbjct: 188 PQIENNVSMSFYKERIPDIDEDLIKLENQLPFFVLQHLFDLIPKHKDAPNCFKQ-----L 247

Query: 242 STRVLTCSYVQ--QPSNLWNIKPRHLLDFLSFYFVPLEGENYVHQR---------TYLPP 301
           +   LT  +++  +PS++ +IKP+H +DFLSFY VP     Y H +           +PP
Sbjct: 248 TYEYLTMGWLENYEPSDILSIKPKHFIDFLSFYLVP--EHQYEHDQKSNDEEEWNIIIPP 307

Query: 302 TATELSEAGVVLKKAEE--KHIMDISFENGVLKIPPFEINDCFETNVRNLLAFEEFTMEN 361
           + TE+ EAGV +KKA++  K +++I FENG+L+IPP  I+D FE  +RNLLAFE F +E 
Sbjct: 308 SITEICEAGVTIKKADKNTKCLLNIRFENGILEIPPLHIDDYFEPMMRNLLAFEHFPVE- 367

Query: 362 FAEQSDEVPHMASNKKYAIHYFLFLDELITTKQDVRLLVKEGIIINSIGGSDKEISKLFN 421
                           Y I Y  F+D LI T++DV LLVKE IIIN IGGSD+E+S+LFN
Sbjct: 368 ------------VKNTYVIPYLTFMDYLIITEKDVNLLVKEKIIINDIGGSDREVSQLFN 427

Query: 422 DLCK-NATLHGYNFFSHISKDLRDHCKRRRNRWMASLKHNYFNTPWAFISFLAATFLILL 456
           +LCK  ++    N+F+  SK LRDHC RR N+  ASLKHNYFNTPWA IS  AATFL++L
Sbjct: 428 NLCKFVSSSPNDNYFNDTSKALRDHCDRRWNKAKASLKHNYFNTPWAAISVSAATFLLVL 480

BLAST of Tan0009390 vs. NCBI nr
Match: XP_023546108.1 (UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 357.8 bits (917), Expect = 1.4e-94
Identity = 215/464 (46.34%), Postives = 292/464 (62.93%), Query Frame = 0

Query: 6   VEAYEQNNERYDIIDIDETEQIRD---DVTIFIEEKLEKMPPIIPECSIYRVPKLLMEMN 65
           +EA++ N+  Y++ +I E EQ+     DV I I++ +E++PP   ECSIYRVPKLL +MN
Sbjct: 1   MEAHQANDILYNMAEISEVEQVEQPCGDVVICIKKMMEQLPPANFECSIYRVPKLLRKMN 60

Query: 66  EMAYVPQVISIGPFHHDQTVLKATEELKLRLFNSYRCRVD---MDIQGIVEMVRKWEKRA 125
             AY PQVISIGPFHH +  L ATE+ KLR    +  R+D     ++ +V+  + W K A
Sbjct: 61  NAAYTPQVISIGPFHHRRKDLIATEQYKLRRCVDFLNRLDNKMNSLELLVKATQNWVKNA 120

Query: 126 RRYYSEFIDMSSDEFVKMMVLDGCFIVELLITDY----GNFPKTENMVISSIYDAIYFSI 185
           R YY+E I M   +F KMM++DGCFI+E LI  Y     N        +   +   Y  I
Sbjct: 121 RNYYAEPIHMCDKDFHKMMLVDGCFILEFLIQHYDQCLSNVSSETQKYLDLSFHQRYHEI 180

Query: 186 CGDLMKLQNQLPFFVLEGLFDQVSQSPENSVSFERVVRVFLSTRVLTCSYVQQPSNLWNI 245
             DL+ L+NQ+PFFVL+ LF  +  + + S+ F  + R FL  R++  +Y    S+L   
Sbjct: 181 YTDLVMLENQVPFFVLQSLFHLIPHN-KTSIPFLTLTREFLLPRLV--NYSLFDSSL--T 240

Query: 246 KPRHLLDFLSFYFVPLEGENYVHQRTY---LPPTATELSEAGVVLKKAE-EKHIMDISFE 305
           KP+H +DFLSFYFV        ++  Y    PP+ TEL EAGV +KKA+  +++MDI+F+
Sbjct: 241 KPKHFVDFLSFYFVVDNKSLKDNKNNYCLRTPPSITELYEAGVTIKKAKGARNMMDINFK 300

Query: 306 NGVLKIPPFEINDCFETNVRNLLAFEEFTMENFAEQSDEVPHMASNKKYAIHYFLFLDEL 365
           NG+L IPP  I+D FE  +RNL+AFE F +E              +K     Y  F+D+L
Sbjct: 301 NGILTIPPLMIDDFFEPIMRNLIAFEHFPLE--------------DKSKCTQYITFMDKL 360

Query: 366 ITTKQDVRLLVKEGIIINSIGGSDKEISKLFNDLCKNATLHGYNFFSHISKDLRDHCKRR 425
           I+T++DV LLV+  IIIN+IGGSDKE+SKLFN+LCK       + F+ ISK LRDHC +R
Sbjct: 361 ISTEKDVSLLVQAEIIINNIGGSDKEVSKLFNNLCKFVEERCDDKFNEISKALRDHCNKR 420

Query: 426 RNRWMASLKHNYFNTPWAFISFLAATFLILLTLLQTISSLVTIF 456
            N+  ASLKHNYFNTPWA ISF AAT LI+LTLLQTI S ++ F
Sbjct: 421 WNKAKASLKHNYFNTPWAAISFSAATLLIILTLLQTIFSAISAF 445

BLAST of Tan0009390 vs. ExPASy TrEMBL
Match: A0A6J1BQT6 (UPF0481 protein At3g47200-like OS=Momordica charantia OX=3673 GN=LOC111004764 PE=4 SV=1)

HSP 1 Score: 374.0 bits (959), Expect = 8.8e-100
Identity = 218/469 (46.48%), Postives = 301/469 (64.18%), Query Frame = 0

Query: 10  EQNNERYDIIDIDE--------------TEQIRDDVTIFIEEKLEKM--PPIIPECSIYR 69
           EQ+   ++ IDID                EQ    V I IEE  +++  PPI PECSIYR
Sbjct: 2   EQSGITFECIDIDRMTGSSVNIANNNEVDEQHCRHVVISIEEMCKRLPPPPIDPECSIYR 61

Query: 70  VPKLLMEMNEMAYVPQVISIGPFHH-DQTVLKATEELKLRLFNSYRCRVDMDIQGIVEMV 129
           VPK L+ MN  AY PQVISIGPFHH +Q+ L  T++ KL+  +SY  RV M ++ +V++ 
Sbjct: 62  VPKRLLNMNRKAYTPQVISIGPFHHSNQSNLIVTQQHKLQALDSYLHRVKMTVEAVVKIT 121

Query: 130 RKWEKRARRYYSEFIDMSSDEFVKMMVLDGCFIVELLITDYGNFPKTENMVISSIYDAIY 189
           + WE RAR  Y E I M++D+FV M++LDGCF+V  LI DY N+   EN   SS Y+A+ 
Sbjct: 122 QNWENRARSCYGEPIKMNNDKFVTMLLLDGCFVVXFLILDYNNYETDENGFDSSFYEAMS 181

Query: 190 FSICGDLMKLQNQLPFFVLEGLFDQVSQSPE-NSVSFERVVRVFLSTRVLTCSYVQQPSN 249
             I GD+  L+NQLPFFVL+GL+D + +  E  + S  +++  F S    + +  + P +
Sbjct: 182 SDIYGDMTMLENQLPFFVLQGLYDLIPKDHEIKNNSLIQLIETFFSR---SMNNHEIPCH 241

Query: 250 LWNIKPRHLLDFLSFYFVPLEGENYVHQRT--YLPPTATELSEAGVVLKKAEEKH-IMDI 309
           +     +HL+D LS YF+P       H +    + P  TEL EAGV +KK +E   +MDI
Sbjct: 242 VSPPNVKHLVDLLSLYFLPPCDTKQQHDKDEYLVTPCVTELCEAGVTIKKGKEATCLMDI 301

Query: 310 SFENGVLKIPPFEINDCFETNVRNLLAFEEFTMENFAEQSDEVPHMASNKKYAIHYFLFL 369
           SF+NGVL+IPP +I+D FET VRNL+AFE +   N+              +Y I Y LFL
Sbjct: 302 SFKNGVLEIPPLDIDDHFETIVRNLMAFEHYPAANY-------------HRYTIQYALFL 361

Query: 370 DELITTKQDVRLLVKEGIIINSIGGSDKEISKLFNDLCKNATL-HGYNFFSHISKDLRDH 429
           D +I+T++DVRLLV+ GIIINSIGGSDKE+S+LFNDL K  ++  G ++ +HI+K L DH
Sbjct: 362 DYMISTEKDVRLLVEAGIIINSIGGSDKEVSRLFNDLGKYVSIPGGVHYLNHITKPLHDH 421

Query: 430 CKRRRNRWMASLKHNYFNTPWAFISFLAATFLILLTLLQTISSLVTIFK 457
           CK+   R  A+LK +YFN+PWAFIS +AAT++I+LTLLQTI + ++ FK
Sbjct: 422 CKKWWPRSKATLKRDYFNSPWAFISIVAATYIIILTLLQTIFTAISTFK 454

BLAST of Tan0009390 vs. ExPASy TrEMBL
Match: A0A6J1BR71 (UPF0481 protein At3g47200-like OS=Momordica charantia OX=3673 GN=LOC111005028 PE=4 SV=1)

HSP 1 Score: 362.8 bits (930), Expect = 2.0e-96
Identity = 213/463 (46.00%), Postives = 288/463 (62.20%), Query Frame = 0

Query: 1   MENGPVEAYEQNNERYDIIDIDETEQIRDDVTIFIEEKLEKMPPIIPECSIYRVPKLLME 60
           ME+  +E Y+ N +      IDE E  +  VTI ++  LEK+ PI  ECSIYRV K L  
Sbjct: 1   MEDDHIETYDLNKK------IDEVELEQPHVTISMKNMLEKLHPISEECSIYRVSKRLHN 60

Query: 61  MNEMAYVPQVISIGPFHHDQTVLKATEELKLRLFNSYRCRVDMDIQGIVEMVRKWEKRAR 120
           +N+MAY PQ ISIGPFHH Q    A E+LKLR  ++Y  RV M I+   E+ + WE RAR
Sbjct: 61  INDMAYTPQAISIGPFHHGQKEFMAMEQLKLRFLDAYLRRVGMGIEDAFEIAQGWETRAR 120

Query: 121 RYYSEFIDMSSDEFVKMMVLDGCFIVELLITDYGNFPKTENMVISSIYDAIYFSICGDLM 180
           + Y+E IDM SD FVKMM++DG F+VE +   Y     T+  +  +++ AI+  I  DL+
Sbjct: 121 KCYAEHIDMKSDNFVKMMLVDGAFLVEFIRMHYQWATMTQPNLNYTLFQAIHVDIYRDLI 180

Query: 181 KLQNQLPFFVLEGLFDQVSQSPENSVSFERVVRVFLSTRVLTCSYVQQPSNLWNIKPRHL 240
            L+NQLPFF+LE L D+ S S    +      R +   R L          L   KP HL
Sbjct: 181 LLENQLPFFILECLLDKCSSSTPFVLFTSTFCRWYTGARELI------SDKLLTKKPNHL 240

Query: 241 LDFLSFYF----VPLEGENYVHQRTYLPPTATELSEAGVVLKKAEE--KHIMDISFENGV 300
           +DFLSFY+    V  + +   + +   PPTATEL EAGV  +KA E  + IMDI F++GV
Sbjct: 241 VDFLSFYYALPTVTGKNDKLKYNKRESPPTATELWEAGVEFQKATEDKRLIMDIRFKDGV 300

Query: 301 LKIPPFEINDCFETNVRNLLAFEEFTMENFAEQSDEVPHMASNKKYAIHYFLFLDELITT 360
           L IP  EI+D FET VRNLLA+E +             H+  +++  I Y  FLDELI+T
Sbjct: 301 LSIPHLEIHDAFETYVRNLLAYEHY-------------HIGDDERCLIQYVYFLDELIST 360

Query: 361 KQDVRLLVKEGIIINSIGGSDKEISKLFNDLCKNATLH-GYNFFSHISKDLRDHCKRRRN 420
           ++DV LLVK GII N+IGG+++++SKLFNDLCK+  +   + +++ IS DL  +C+   +
Sbjct: 361 ERDVSLLVKAGIITNNIGGNNEDVSKLFNDLCKDINISCDFYYYADISMDLHKYCETWWH 420

Query: 421 RWMASLKHNYFNTPWAFISFLAATFLILLTLLQTISSLVTIFK 457
           R MASL+ +YFNTPWAFISFLAATFL+LLT +Q I S ++  K
Sbjct: 421 RSMASLRRDYFNTPWAFISFLAATFLVLLTSMQAIYSAISYHK 438

BLAST of Tan0009390 vs. ExPASy TrEMBL
Match: A0A1S3BBL9 (UPF0481 protein At3g47200-like OS=Cucumis melo OX=3656 GN=LOC103488293 PE=4 SV=1)

HSP 1 Score: 359.0 bits (920), Expect = 2.9e-95
Identity = 218/493 (44.22%), Postives = 303/493 (61.46%), Query Frame = 0

Query: 2   ENGPVEAY----EQNNERYDIIDIDETEQIRDDVTIFIEEKLEKMPPIIP-ECSIYRVPK 61
           ENG  E +    +++N   +I  +D+ + + D+V I IE+ L+++PP    +CSIYRVPK
Sbjct: 8   ENGQQEVHGVSNQKSNNMVEISVVDQQQLVCDNVVISIEKMLDQVPPTHENQCSIYRVPK 67

Query: 62  LLMEMNEMAYVPQVISIGPFH-HDQTVLKATEELKLRLFNSYRCRV-----------DMD 121
            L EMN  AY PQ+ISIGPFH H    L A E+ KL+ F +Y  RV              
Sbjct: 68  QLREMNPKAYAPQLISIGPFHYHTHKNLIANEQYKLQGFINYLRRVYKMESLEQLVRTKS 127

Query: 122 IQGIVEMVRKWEKRARRYYSEFIDMSSDEFVKMMVLDGCFIVELLITDYGN--------F 181
           ++ +V+  + W + AR  Y+E I+M+ ++F+KMM++DGCFIVE  I D+          F
Sbjct: 128 VEDLVKRAQSWVEEARNCYAETINMNDEDFIKMMLVDGCFIVEFFILDFEEYNESHESLF 187

Query: 182 PKTENMVISSIYDAIYFSICGDLMKLQNQLPFFVLEGLFDQVSQSPENSVSFERVVRVFL 241
           P+ EN V  S Y      I  DL+KL+NQLPFFVL+ LFD + +  +    F++     L
Sbjct: 188 PQIENNVSMSFYKERIPDIDEDLIKLENQLPFFVLQHLFDLIPKHKDAPNCFKQ-----L 247

Query: 242 STRVLTCSYVQ--QPSNLWNIKPRHLLDFLSFYFVPLEGENYVHQR---------TYLPP 301
           +   LT  +++  +PS++ +IKP+H +DFLSFY VP     Y H +           +PP
Sbjct: 248 TYEYLTMGWLENYEPSDILSIKPKHFIDFLSFYLVP--EHQYEHDQKSNDEEEWNIIIPP 307

Query: 302 TATELSEAGVVLKKAEE--KHIMDISFENGVLKIPPFEINDCFETNVRNLLAFEEFTMEN 361
           + TE+ EAGV +KKA++  K +++I FENG+L+IPP  I+D FE  +RNLLAFE F +E 
Sbjct: 308 SITEICEAGVTIKKADKNTKCLLNIRFENGILEIPPLHIDDYFEPMMRNLLAFEHFPVE- 367

Query: 362 FAEQSDEVPHMASNKKYAIHYFLFLDELITTKQDVRLLVKEGIIINSIGGSDKEISKLFN 421
                           Y I Y  F+D LI T++DV LLVKE IIIN IGGSD+E+S+LFN
Sbjct: 368 ------------VKNTYVIPYLTFMDYLIITEKDVNLLVKEKIIINDIGGSDREVSQLFN 427

Query: 422 DLCK-NATLHGYNFFSHISKDLRDHCKRRRNRWMASLKHNYFNTPWAFISFLAATFLILL 456
           +LCK  ++    N+F+  SK LRDHC RR N+  ASLKHNYFNTPWA IS  AATFL++L
Sbjct: 428 NLCKFVSSSPNDNYFNDTSKALRDHCDRRWNKAKASLKHNYFNTPWAAISVSAATFLLVL 480

BLAST of Tan0009390 vs. ExPASy TrEMBL
Match: A0A6J1HDJ3 (UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111462533 PE=4 SV=1)

HSP 1 Score: 357.8 bits (917), Expect = 6.5e-95
Identity = 219/467 (46.90%), Postives = 298/467 (63.81%), Query Frame = 0

Query: 6   VEAYEQNNERYDIIDIDETEQIRD---DVTIFIEEKLEKMPPIIPECSIYRVPKLLMEMN 65
           +EA++ N+  Y++  I E EQ+     DV I I++ +E++PP   ECSIYRVPKLL +MN
Sbjct: 1   MEAHQANDILYNMAKISEVEQVEQPCGDVVISIKKMMEQLPPPNFECSIYRVPKLLRKMN 60

Query: 66  EMAYVPQVISIGPFHHDQTVLKATEELKLRLFNSYRCRVD---MDIQGIVEMVRKWEKRA 125
             AY PQVISIGPFHH +  L ATE+ KLR    +  R+D     ++ +V+  + W K A
Sbjct: 61  NAAYTPQVISIGPFHHRRKDLIATEQYKLRRCVDFLNRLDNKMNSLELLVKATQNWVKNA 120

Query: 126 RRYYSEFIDMSSDEFVKMMVLDGCFIVELLITDYGN-----FPKTENMVISSIYDAIYFS 185
           R YY+E I M   +F KMM++DGCFI+E LI  +         +T+N +  S +   Y  
Sbjct: 121 RNYYAEPIHMCDKDFHKMMLVDGCFILEFLIQHHDQCLSNVSSETQNNLDLSFHQR-YHE 180

Query: 186 ICGDLMKLQNQLPFFVLEGLFDQVSQSPENSVSFERVVRVFLSTRVLTCSYVQQPSNLWN 245
           I  DL+ L+NQ+PFFVL+ LF  + Q+ + S+ F  +   FL  R++  +Y    S+L  
Sbjct: 181 IYTDLVMLENQVPFFVLQSLFHLIPQN-KTSIPFLTLTHEFLLPRLV--NYSLFDSSL-- 240

Query: 246 IKPRHLLDFLSFYFV-----PLEGENYVHQRTYLPPTATELSEAGVVLKKAE-EKHIMDI 305
            KP+H +DFLSFYFV     P + +N    RT  PP+ TEL EAGV +KKA+  +++MDI
Sbjct: 241 TKPKHFVDFLSFYFVVDNKSPKDNKNNYCLRT--PPSITELYEAGVTIKKAKGARNMMDI 300

Query: 306 SFENGVLKIPPFEINDCFETNVRNLLAFEEFTMENFAEQSDEVPHMASNKKYAIHYFLFL 365
           +F+NG+L IPP  I+D FE  +RNL+AFE F +E              +K     Y  F+
Sbjct: 301 NFKNGILTIPPLMIDDFFEPIMRNLIAFEHFPLE--------------DKSKCTQYITFM 360

Query: 366 DELITTKQDVRLLVKEGIIINSIGGSDKEISKLFNDLCKNATLHGYNFFSHISKDLRDHC 425
           D+LI+T++DV LLV+  IIIN+IGGSDKE+SKLFN+LCK       + F+ ISK LRDHC
Sbjct: 361 DKLISTEKDVSLLVQAEIIINNIGGSDKEVSKLFNNLCKFVEERCDDKFNEISKALRDHC 420

Query: 426 KRRRNRWMASLKHNYFNTPWAFISFLAATFLILLTLLQTISSLVTIF 456
            +R N+  ASLKHNYFNTPWA ISF AAT LI+LTLLQTI S ++ F
Sbjct: 421 NKRWNKAKASLKHNYFNTPWAAISFSAATLLIILTLLQTIFSAISAF 445

BLAST of Tan0009390 vs. ExPASy TrEMBL
Match: A0A6J1HD72 (UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111462539 PE=4 SV=1)

HSP 1 Score: 357.1 bits (915), Expect = 1.1e-94
Identity = 211/444 (47.52%), Postives = 282/444 (63.51%), Query Frame = 0

Query: 18  IIDIDETEQIRDDVTIFIEEKLEKMPPIIPECSIYRVPKLLMEMNEMAYVPQVISIGPFH 77
           I + ++ E  R +V + I+E ++K+PP+  ECSI+RVPKLL  MN  AY PQVISIGPFH
Sbjct: 4   ISEAEQGEHPRRNVVLSIKEMIDKLPPVNVECSIFRVPKLLRNMNHRAYTPQVISIGPFH 63

Query: 78  HDQTVLKATEELKLRLFNSYRCRVDMDIQGIVEMVRK----WEKRARRYYSEFIDMSSDE 137
           H +  L ATE  KLR   ++  R+  + +  +E+++K    W K  R  Y E I+M+  E
Sbjct: 64  HYRKDLLATEPYKLRRCLNFLSRLGNE-RDSLELLKKNTQTWMKEVRNCYGEPINMNDKE 123

Query: 138 FVKMMVLDGCFIVELLITDYGNFPKTENMVISSIYDAIYFSICGDLMKLQNQLPFFVLEG 197
           FV MM++DGCF+VE LI ++  F    N+ ++    +I   +  DL+ L+NQ+PFF+LE 
Sbjct: 124 FVNMMIVDGCFLVEFLIQNHNGFQTPNNLDLTFHQRSI--ELFTDLIMLENQVPFFLLER 183

Query: 198 LFDQVSQSPENSVSFERVVRVFLSTR-VLTCSYVQQPSNLWNIKPRHLLDFLSFYFVPLE 257
           LF  +      SVSF+ +  +F     V  CS     SNL +IKP+HL+DFLSF+FV   
Sbjct: 184 LFGLIPNI--TSVSFKELTYIFFRQELVANCSL----SNLSSIKPKHLVDFLSFFFVSKT 243

Query: 258 G-ENYVHQRTYLPPTATELSEAGVVLKKAEE-KHIMDISFENGVLKIPPFEINDCFETNV 317
             EN  +     PPT TEL EAGV +KKA++   +MDI FEN +L IPP  I+D FE  +
Sbjct: 244 SLENNNNNSPITPPTITELYEAGVTIKKAKDFISMMDIRFENEILTIPPLVIDDLFEPTM 303

Query: 318 RNLLAFEEFTMENFAEQSDEVPHMASNKKYAIHYFLFLDELITTKQDVRLLVKEGIIINS 377
           RNL+AFE F +               NK   I Y +F+D+LI+T++DV LLVK GIIIN+
Sbjct: 304 RNLIAFEHFPLR--------------NKSNYIPYIVFMDDLISTEKDVNLLVKAGIIINN 363

Query: 378 IGGSDKEISKLFNDLCKNATLHGYNFFSHISKDLRDHCKRRRNRWMASLKHNYFNTPWAF 437
           IGGSDKE+SKLFN+LCK          ++IS  LR+HC RR N+  ASLKHNYFNTPWA 
Sbjct: 364 IGGSDKEVSKLFNNLCKFVETSSDYSLNNISNTLREHCNRRWNKAKASLKHNYFNTPWAI 423

Query: 438 ISFLAATFLILLTLLQTISSLVTI 455
           +SF AAT LI+LTLLQTI S+  I
Sbjct: 424 VSFFAATLLIILTLLQTIFSVFPI 424

BLAST of Tan0009390 vs. TAIR 10
Match: AT4G31980.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF247, plant (InterPro:IPR004158), Protein of unknown function DUF862, eukaryotic (InterPro:IPR008580); BEST Arabidopsis thaliana protein match is: Plant protein of unknown function (DUF247) (TAIR:AT5G11290.1); Has 1967 Blast hits to 1844 proteins in 183 species: Archae - 0; Bacteria - 6; Metazoa - 223; Fungi - 83; Plants - 1477; Viruses - 0; Other Eukaryotes - 178 (source: NCBI BLink). )

HSP 1 Score: 215.7 bits (548), Expect = 7.7e-56
Identity = 139/430 (32.33%), Postives = 227/430 (52.79%), Query Frame = 0

Query: 35  IEEKLEKMPPIIPECSIYRVPKLLMEMNEMAYVPQVISIGPFHHDQTVLKATEELKLRLF 94
           I+ KL  +  +  +C IY+VP  L  +N  AY P+++S GP H  +  L+A E+ K R  
Sbjct: 279 IKAKLAFLSSLSTKCCIYKVPNKLRRLNPDAYTPRLVSFGPLHRGKEELQAMEDQKYRYL 338

Query: 95  NSYRCRVDMDIQGIVEMVRKWEKRARRYYSEFIDMSSDEFVKMMVLDGCFIVELLITDYG 154
            S+  R +  ++ +V + R WE+ AR  Y+E + + SDEFV+M+V+DG F+VELL+  + 
Sbjct: 339 LSFIPRTNSSLEDLVRLARTWEQNARSCYAEDVKLHSDEFVEMLVVDGSFLVELLLRSHY 398

Query: 155 NFPKTENMVISSIYDAIYFSICGDLMKLQNQLPFFVLEGLF----DQVSQSPENSVSFER 214
              + EN  I      +   +C D++ ++NQLPFFV++ +F    +   Q   + +   +
Sbjct: 399 PRLRGENDRIFG-NSMMITDVCRDMILIENQLPFFVVKEIFLLLLNYYQQGTPSIIQLAQ 458

Query: 215 VVRVFLSTRVLTCSYVQQPSNLWNIKPRHLLDFLSFYFVP-----LEGENYVHQRTYLPP 274
               +  +R+    ++ +        P H +D L   ++P     LE   Y   +    P
Sbjct: 459 RHFSYFLSRIDDEKFITE--------PEHFVDLLRSCYLPQFPIKLE---YTTVKVDNAP 518

Query: 275 TATELSEAGVVLKKAEEKH-IMDISFENGVLKIPPFEINDCFETNVRNLLAFEEFTMENF 334
            ATEL  AGV  K AE    ++DISF +GVLKIP   ++D  E+  +N++ FE+      
Sbjct: 519 EATELHTAGVRFKPAETSSCLLDISFADGVLKIPTIVVDDLTESLYKNIIGFEQC----- 578

Query: 335 AEQSDEVPHMASNKKYAIHYFLFLDELITTKQDVRLLVKEGIIINSIGGSDKEISKLFND 394
                      SNK + + Y + L   I +  D  LL+  GII+N +G S  ++S LFN 
Sbjct: 579 ---------RCSNKNF-LDYIMLLGCFIKSPTDADLLIHSGIIVNYLGNS-VDVSNLFNS 638

Query: 395 LCKNATLHGYNFFSHISKDLRDHCKRRRNRWMASLKHNYFNTPWAFISFLAATFLILLTL 454
           + K        +FS +S++L+ +C    NRW A L+ +YF+ PWA  S  AA  L+LLT 
Sbjct: 639 ISKEVIYDRRFYFSMLSENLQAYCNTPWNRWKAILRRDYFHNPWAVASVFAALLLLLLTF 680

BLAST of Tan0009390 vs. TAIR 10
Match: AT3G50150.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 195.7 bits (496), Expect = 8.3e-50
Identity = 148/464 (31.90%), Postives = 231/464 (49.78%), Query Frame = 0

Query: 22  DETEQIRDDVTIFIEEKLEKMPPIIPECS-----IYRVPKLLMEMNEMAYVPQVISIGPF 81
           ++  + R++  I I++K+EK        S     IYRVP  L E ++ +Y+PQ +SIGP+
Sbjct: 58  EKPRETREEWVISIKDKMEKALSYDATNSWDKLCIYRVPFYLQENDKKSYLPQTVSIGPY 117

Query: 82  HHDQTVLKATEELKLRLFNSYRCRVDMDIQGIVEMVRKWEKRARRYYSEFIDM-SSDEFV 141
           HH +  L+  E  K R  N    R   +I+  ++ +++ E+ AR  Y   IDM +S+EF 
Sbjct: 118 HHGKVHLRPMERHKWRAVNMIMARTKHNIEMYIDAMKELEEEARACYQGPIDMKNSNEFT 177

Query: 142 KMMVLDGCFIVELLITDYGNFPKTENMVISSIY--DAIYFSICGDLMKLQNQLPFFVLEG 201
           +M+VLDGCF++EL       F K        ++    +  SI  D++ L+NQLP FVL+ 
Sbjct: 178 EMLVLDGCFVLELFKGTIQGFQKIGYARNDPVFAKRGLMHSIQRDMIMLENQLPLFVLDR 237

Query: 202 LFDQVSQSP-ENSVSFERVVRVFL----STRVLTCS-----YVQQPSNLWNIKPRHLLDF 261
           L    + +P +  +  E  VR F     ++ VLT S       ++   L +    H LD 
Sbjct: 238 LLGLQTGTPNQTGIVAEVAVRFFKTLMPTSEVLTKSERSLDSQEKSDELGDNGGLHCLDV 297

Query: 262 LSFYFV----------PLEGENYVHQRTYLPPTATELSEAGVVLKKAEEKHIMDISFENG 321
                +          P E  + V ++  L    TEL  AGV   + E   + DI F+NG
Sbjct: 298 FHRSLIQSSETTNQGTPYEDMSMVEKQQQLIHCVTELRGAGVNFMRKETGQLWDIEFKNG 357

Query: 322 VLKIPPFEINDCFETNVRNLLAFEEFTMENFAEQSDEVPHMASNKKYAIHYFLFLDELIT 381
            LKIP   I+D  ++   NL+AFE+              H  S+      Y +F+D LI 
Sbjct: 358 YLKIPKLLIHDGTKSLFSNLIAFEQC-------------HTQSSNNIT-SYIIFMDNLIN 417

Query: 382 TKQDVRLLVKEGIIINSIGGSDKEISKLFNDLCKNATLHGYN-FFSHISKDLRDHCKRRR 441
           + QDV  L  +GII + + GSD E++ LFN LCK       + + S +S+++  +  R+ 
Sbjct: 418 SSQDVSYLHHDGIIEHWL-GSDSEVADLFNRLCKEVIFDPKDGYLSQLSREVNRYYSRKW 477

Query: 442 NRWMASLKHNYFNTPWAFISFLAATFLILLTLLQTISSLVTIFK 457
           N   A+L+  YFN PWA+ SF AA  L+ LT  Q+  ++   +K
Sbjct: 478 NSLKATLRQKYFNNPWAYFSFSAAVILLFLTFFQSFFAVYAYYK 506

BLAST of Tan0009390 vs. TAIR 10
Match: AT3G50160.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 189.9 bits (481), Expect = 4.5e-48
Identity = 135/415 (32.53%), Postives = 208/415 (50.12%), Query Frame = 0

Query: 51  IYRVPKLLMEMNEMAYVPQVISIGPFHHDQTVLKATEELKLRLFNSYRCRVDMDIQGIVE 110
           IYRVP  L E +  +Y+PQ++SIGP+HH    L   E  K R  N    R   DI+  ++
Sbjct: 106 IYRVPPYLQENDTKSYMPQIVSIGPYHHGHKHLMPMERHKWRAVNMVMARAKHDIEMYID 165

Query: 111 MVRKWEKRARRYYSEFIDMSSDEFVKMMVLDGCFIVELLITDYGNFPKTENMVISSIYD- 170
            +++ E++AR  Y   I+M+ +EF++M+VLDG FI+E+       F +        ++  
Sbjct: 166 AMKELEEKARACYQGPINMNRNEFIEMLVLDGVFIIEIFKGTSEGFQEIGYAPNDPVFGM 225

Query: 171 -AIYFSICGDLMKLQNQLPFFVLEGLFDQVSQSPENSVSFERVVRVFLSTRVLTCSYVQQ 230
             +  SI  D++ L+NQLP+ VL+GL         + V+ + + + F    + T   + +
Sbjct: 226 RGLMQSIRRDMVMLENQLPWSVLKGLLQLQRPDVLDKVNVQ-LFQPFFQPLLPTREVLTE 285

Query: 231 PSNLWNIKPRHLLDFLSFYFVPLEGEN------YVHQRTYLPPTATELSEAGVVLKKAEE 290
              L      H LD L    +   G +         Q   L    TEL  AGV   + E 
Sbjct: 286 EGGL------HCLDVLRRGLLQSSGTSDEDMSMVNKQPQQLIHCVTELRNAGVEFMRKET 345

Query: 291 KHIMDISFENGVLKIPPFEINDCFETNVRNLLAFEEFTMENFAEQSDEVPHMASNKKYAI 350
            H  DI F+NG LKIP   I+D  ++   NL+AFE+              H+ S+KK   
Sbjct: 346 GHFWDIEFKNGYLKIPKLLIHDGTKSLFLNLIAFEQC-------------HIKSSKKIT- 405

Query: 351 HYFLFLDELITTKQDVRLLVKEGIIINSIGGSDKEISKLFNDLCKNATLH-GYNFFSHIS 410
            Y +F+D LI + +DV  L   GII N + GSD E+S LFN L K         + S ++
Sbjct: 406 SYIIFMDNLINSSEDVSYLHHYGIIENWL-GSDSEVSDLFNGLGKEVIFDPNDGYLSALT 465

Query: 411 KDLRDHCKRRRNRWMASLKHNYFNTPWAFISFLAATFLILLTLLQTISSLVTIFK 457
            ++  + +R+ N   A+L+H YFN PWA+ SF+AA  L++ T  Q+  ++   FK
Sbjct: 466 GEVNIYYRRKWNYLKATLRHKYFNNPWAYFSFIAAVTLLIFTFCQSFFAVFAYFK 498

BLAST of Tan0009390 vs. TAIR 10
Match: AT3G50140.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 181.4 bits (459), Expect = 1.6e-45
Identity = 142/474 (29.96%), Postives = 236/474 (49.79%), Query Frame = 0

Query: 23  ETEQIRDDVTIFIEEKLEKMPPIIPECS-----IYRVPKLLMEMNEMAYVPQVISIGPFH 82
           + E+ R++  I+I++K+E++       S     IYRVP  L + ++ +Y PQ +S+GP+H
Sbjct: 82  QPEETREEWVIWIKDKMEQVMRDAATTSWDKICIYRVPLSLKKSDKNSYFPQAVSLGPYH 141

Query: 83  HDQTVLKATEELKLRLFNSYRCRVDMDIQGIVEMVRKWEKRARRYYSEFIDMSSDEFVKM 142
           H    L+  +  K R  N    R    I+  ++ +++ E+RAR  Y   I +SS++F +M
Sbjct: 142 HGDEHLRPMDYHKWRAVNMVMKRTKQGIEMYIDAMKELEERARACYEGPIGLSSNKFTQM 201

Query: 143 MVLDGCFIVELLITDYGNFPK---TENMVISSIYDAIYFSICGDLMKLQNQLPFFVLEGL 202
           +VLDGCF+++L    Y  F K     N  + ++  +++ SI  D++ L+NQLP FVL  L
Sbjct: 202 LVLDGCFVLDLFRGAYEGFSKLGYDRNDPVFAMRGSMH-SIRRDMLMLENQLPLFVLNRL 261

Query: 203 FD-QVSQSPENSVSFERVVRVF--------LSTRV---------------------LTCS 262
            + Q+    +  +  +  VR F         ST++                     L C 
Sbjct: 262 LELQLGTQYQTGLVAQLAVRFFNPLMPTYMSSTKIENSQENNNKFFNPIADKEKEELHCL 321

Query: 263 YVQQPSNLW-NIKPRHLLDFLSFYFVPLEGENYVHQRTYLPPTATELSEAGVVLKKAEEK 322
            V + S L  ++KP   L    +   PL  +    Q   L    TEL EAG+  K+ +  
Sbjct: 322 DVFRRSLLQPSLKPDPRLSRSRWSRKPLVADKRQQQ---LLHCVTELREAGIKFKRRKSD 381

Query: 323 HIMDISFENGVLKIPPFEINDCFETNVRNLLAFEEFTMENFAEQSDEVPHMASNKKYAIH 382
              DI F+NG L+IP   I+D  ++   NL+A+E+  +++  + +               
Sbjct: 382 RFWDIQFKNGCLEIPKLLIHDGTKSLFSNLIAYEQCHIDSTNDITS-------------- 441

Query: 383 YFLFLDELITTKQDVRLLVKEGIIINSIGGSDKEISKLFNDLCKNATLHGYN-FFSHISK 442
           Y +F+D LI + +D+R L    II + + G+D E++ +FN LC+       N + S +S 
Sbjct: 442 YIIFMDNLIDSAEDIRYLHYYDIIEHWL-GNDSEVADVFNRLCQEVAFDLENTYLSELSN 501

Query: 443 DLRDHCKRRRNRWMASLKHNYFNTPWAFISFLAATFLILLTLLQTISSLVTIFK 457
            +  +  R+ N   A+LKH YF+ PWA+ SF AA  L+LLTL Q+  +    FK
Sbjct: 502 KVDRYYNRKWNVLKATLKHKYFSNPWAYFSFFAAVILLLLTLFQSFFTSYPYFK 536

BLAST of Tan0009390 vs. TAIR 10
Match: AT3G50120.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 181.4 bits (459), Expect = 1.6e-45
Identity = 140/471 (29.72%), Postives = 233/471 (49.47%), Query Frame = 0

Query: 28  RDDVTIFIEEKLEKM-----PPIIPECSIYRVPKLLMEMNEMAYVPQVISIGPFHHDQTV 87
           RDD  I I +KLE+        +  +  IYRVP  L E +  +Y PQ +S+GP+HH +  
Sbjct: 77  RDDWVISITDKLEQAHRDDDTTLWGKLCIYRVPYYLQENDNKSYFPQTVSLGPYHHGKKR 136

Query: 88  LKATEELKLRLFNSYRCRVDMDIQGIVEMVRKWEKRARRYYSEFIDMSSDEFVKMMVLDG 147
           L++ +  K R  N    R +  I+  ++ +R+ E++AR  Y   + +SS+EF++M+VLDG
Sbjct: 137 LRSMDRHKWRAVNRVLKRTNQGIKMYIDAMRELEEKARACYEGPLSLSSNEFIEMLVLDG 196

Query: 148 CFIVELL---ITDYGNFPKTENMVISSIYDAIYFSICGDLMKLQNQLPFFVLEGL----- 207
           CF++EL    +  +       N  + ++  +++ SI  D++ L+NQLP FVL  L     
Sbjct: 197 CFVLELFRGAVEGFTELGYARNDPVFAMRGSMH-SIQRDMVMLENQLPLFVLNRLLELQL 256

Query: 208 ----------------FDQVSQSP-----------ENSVSFERVVRVFLSTRVLTCSYVQ 267
                           FD +  +            ENS++ ++    F     L C  V 
Sbjct: 257 GTRNQTGLVAQLAIRFFDPLMPTDEPLTKSGQSKLENSLARDKSFDPFADMGELHCLDVF 316

Query: 268 QPSNLWNI-KPRHLLDFLSFYFVPLEGENYVHQRTYLPPTATELSEAGVVLKKAEEKHIM 327
           + S L +  KP   L    +       +    +R  L    TEL EAG+  ++ +     
Sbjct: 317 RRSLLRSSPKPEPRLTRKRWSRNTRVADK---RRQQLIHCVTELKEAGIKFRRRKTDRFW 376

Query: 328 DISFENGVLKIPPFEINDCFETNVRNLLAFEEFTMENFAEQSDEVPHMASNKKYAIHYFL 387
           D+ F+NG L+IP   I+D  ++   NL+AFE+  +++    S+++            Y +
Sbjct: 377 DMQFKNGYLEIPRLLIHDGTKSLFLNLIAFEQCHIDS----SNDI----------TSYII 436

Query: 388 FLDELITTKQDVRLLVKEGIIINSIGGSDKEISKLFNDLCKNATLHGY-NFFSHISKDLR 447
           F+D LI + +DV  L   GII + + GSD E++ LFN LC+        ++ S +S ++ 
Sbjct: 437 FMDNLIDSHEDVSYLHYCGIIEHWL-GSDSEVADLFNRLCQEVVFDTEDSYLSRLSIEVN 496

Query: 448 DHCKRRRNRWMASLKHNYFNTPWAFISFLAATFLILLTLLQTISSLVTIFK 457
            +   + N W A+LKH YFN PWA +SF AA  L++LT  Q+  ++   +K
Sbjct: 497 RYYDHKWNAWRATLKHKYFNNPWAIVSFCAAVILLVLTFSQSFYAVYAYYK 528

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9SD534.6e-3727.23UPF0481 protein At3g47200 OS=Arabidopsis thaliana OX=3702 GN=At3g47200 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
XP_022131634.11.8e-9946.48UPF0481 protein At3g47200-like [Momordica charantia][more]
XP_031736550.11.4e-9645.68UPF0481 protein At3g47200-like [Cucumis sativus] >XP_031736551.1 UPF0481 protein... [more]
XP_022132066.14.2e-9646.00UPF0481 protein At3g47200-like [Momordica charantia][more]
XP_008445188.16.1e-9544.22PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo][more]
XP_023546108.11.4e-9446.34UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
A0A6J1BQT68.8e-10046.48UPF0481 protein At3g47200-like OS=Momordica charantia OX=3673 GN=LOC111004764 PE... [more]
A0A6J1BR712.0e-9646.00UPF0481 protein At3g47200-like OS=Momordica charantia OX=3673 GN=LOC111005028 PE... [more]
A0A1S3BBL92.9e-9544.22UPF0481 protein At3g47200-like OS=Cucumis melo OX=3656 GN=LOC103488293 PE=4 SV=1[more]
A0A6J1HDJ36.5e-9546.90UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111462533 PE=... [more]
A0A6J1HD721.1e-9447.52UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111462539 PE=... [more]
Match NameE-valueIdentityDescription
AT4G31980.17.7e-5632.33unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF247,... [more]
AT3G50150.18.3e-5031.90Plant protein of unknown function (DUF247) [more]
AT3G50160.14.5e-4832.53Plant protein of unknown function (DUF247) [more]
AT3G50140.11.6e-4529.96Plant protein of unknown function (DUF247) [more]
AT3G50120.11.6e-4529.72Plant protein of unknown function (DUF247) [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004158Protein of unknown function DUF247, plantPFAMPF03140DUF247coord: 51..442
e-value: 1.2E-100
score: 337.7
NoneNo IPR availablePANTHERPTHR31170:SF13BNAC04G53230D PROTEINcoord: 29..452
NoneNo IPR availablePANTHERPTHR31170BNAC04G53230D PROTEINcoord: 29..452

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0009390.1Tan0009390.1mRNA