Tan0003819 (gene) Snake gourd v1

Overview
NameTan0003819
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPlant protein of unknown function (DUF247)
LocationLG10: 63044042 .. 63045813 (+)
RNA-Seq ExpressionTan0003819
SyntenyTan0003819
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAATGGAGCAGCATAAGGATGAGGGGGCCGTAGTTATTGAGCTCACGACTAAAAACCTGACATCGCGATTGGAAAATCTTCCCGACAACCTCAGTGAACTGGAAGTTATCAGTAAACCAATCCTTAAAATACCAAACGACATAAAAAAGGTTAATCCGAAAGCATTCTTGCCGCAACGGATGTCGTTCGGGCCATATCACCATGGAGTCGTTCATTTGTATCCGATGGAGAAAATGAAATTTCGATCGTTTCAAACATTTCAACGTTTTGTTGGACTGTCTGTTGAAGACATCGTGAAGCGCACGTGGGGCATGGTGGAAGATTTACAAAGAGCCTACGATGATCTTGATGATAAATGGATAAAAGAACCACAAAAATTCTTGGAGCTCATGATTGTGGATGGTTGTTGCATATTGCAACACCTACTCGATGGGGATTCAATCAATCAAATTGTGTTGCGGGATATGCTGCTGCTTGAGAATCAGGTGCCCATGACGCTTCTTCAGAAGCTGCATTTCATGGTAATAAACCAAAACATAGACAAGAAGGTAATCTCTCGCTTTCTACTTTCTAAGGCCCCCGTTTGATAACCATTTTGTCTTTTTGTTTTTTGTTTTTGAAATTTAAAACTAAAAATACTACTTCTACCCATGAGTTTATATGTCTTCTTATCTACTTTATACATATGTTTTCACAAACCAAGCTAACTTTTGAAAACTAAAAAAAATAGTTTTCAGAATTTCTACATTTAGATCCCAGAATCCGAAAACAGTTTTCGAAAACACATCTTCCAAACACATATTCACTAAATTCAGTGAATCTGAAAACATAAAACAGGATCTGGATTGCATACCAAACAGACCCTTACTTTCTTTTAAGAATTTGGCTAAGAGTTCAAAGGTTTAAAATAAGCTTAACTTTCAAAAACTAAAAACAAAAAACAAAATGGTTATCAAACGAGTCCTAACTAATTTATTTCCATTGAAACGCATGCACATAATTGTTGAGGATCCCACCTTGAAAAGTCGATATATGATATATGAGTTAATTCTTTTATTGTTAATTGGTTTTCAGGTAGATATATTAGTGAGAGGAGGTTGCAAACATCTTTTAGATATGTTCAGGCTAGAATTGATTCTTAGGAGACAAATGGAACCATTACTTCAAAGAAGGCTCATGGGACCGGGAAACGAAATTCAGCTAGCAACACTCTTCCGTAAAGCCGGGATCAAAATCAAGCAAGGCCCACTTGATTTTGATGAAAAACAAGGTGTGTTGAGGCTCCCATTCATCAATATGAATGCTCACATTGAATCAGCCTTGTTAAATGCAATGGCATTCGAGAAACTTTCAGGGATTTCCAAAGAAGCAAACTCTTTCATTATTCTGATGGGTAATCTGATAGAGAAAGATGAGATAGAGTCGTTCAATCAGTTGGCTAAATCTGAGGTTTTGGAAATGTGGAAGGAGGACACTTTTGTATACAATAAAGTGAGAAAGTATTGTAATAGGCCATGGAGAATATGGTGGACAAGGCTCAAAGATACAAACTTTCAAAATCCTTGGACCATTATCTCCACTCTTGCCGCTATCGTAGGCTTTGTGTTACTAATTCTCCAAACCTTATACGGAATCTATGGATACTACAAACCACATTCATCTTGATCCAACCGCCACTCATATGCTTTCATTTCAATGCTTTTTTTTTTTCAATTTTGTGTGAGGAAAGTTGCACTTATGAAAATCTAATTGCAAATTTAGCACAAGC

mRNA sequence

ATGGAAATGGAGCAGCATAAGGATGAGGGGGCCGTAGTTATTGAGCTCACGACTAAAAACCTGACATCGCGATTGGAAAATCTTCCCGACAACCTCAGTGAACTGGAAGTTATCAGTAAACCAATCCTTAAAATACCAAACGACATAAAAAAGGTTAATCCGAAAGCATTCTTGCCGCAACGGATGTCGTTCGGGCCATATCACCATGGAGTCGTTCATTTGTATCCGATGGAGAAAATGAAATTTCGATCGTTTCAAACATTTCAACGTTTTGTTGGACTGTCTGTTGAAGACATCGTGAAGCGCACGTGGGGCATGGTGGAAGATTTACAAAGAGCCTACGATGATCTTGATGATAAATGGATAAAAGAACCACAAAAATTCTTGGAGCTCATGATTGTGGATGGTTGTTGCATATTGCAACACCTACTCGATGGGGATTCAATCAATCAAATTGTGTTGCGGGATATGCTGCTGCTTGAGAATCAGGTGCCCATGACGCTTCTTCAGAAGCTGCATTTCATGGTAATAAACCAAAACATAGACAAGAAGGTAGATATATTAGTGAGAGGAGGTTGCAAACATCTTTTAGATATGTTCAGGCTAGAATTGATTCTTAGGAGACAAATGGAACCATTACTTCAAAGAAGGCTCATGGGACCGGGAAACGAAATTCAGCTAGCAACACTCTTCCGTAAAGCCGGGATCAAAATCAAGCAAGGCCCACTTGATTTTGATGAAAAACAAGGTGTGTTGAGGCTCCCATTCATCAATATGAATGCTCACATTGAATCAGCCTTGTTAAATGCAATGGCATTCGAGAAACTTTCAGGGATTTCCAAAGAAGCAAACTCTTTCATTATTCTGATGGGTAATCTGATAGAGAAAGATGAGATAGAGTCGTTCAATCAGTTGGCTAAATCTGAGGTTTTGGAAATGTGGAAGGAGGACACTTTTGTATACAATAAAGTGAGAAAGTATTGTAATAGGCCATGGAGAATATGGTGGACAAGGCTCAAAGATACAAACTTTCAAAATCCTTGGACCATTATCTCCACTCTTGCCGCTATCGTAGGCTTTGTGTTACTAATTCTCCAAACCTTATACGGAATCTATGGATACTACAAACCACATTCATCTTGATCCAACCGCCACTCATATGCTTTCATTTCAATGCTTTTTTTTTTTCAATTTTGTGTGAGGAAAGTTGCACTTATGAAAATCTAATTGCAAATTTAGCACAAGC

Coding sequence (CDS)

ATGGAAATGGAGCAGCATAAGGATGAGGGGGCCGTAGTTATTGAGCTCACGACTAAAAACCTGACATCGCGATTGGAAAATCTTCCCGACAACCTCAGTGAACTGGAAGTTATCAGTAAACCAATCCTTAAAATACCAAACGACATAAAAAAGGTTAATCCGAAAGCATTCTTGCCGCAACGGATGTCGTTCGGGCCATATCACCATGGAGTCGTTCATTTGTATCCGATGGAGAAAATGAAATTTCGATCGTTTCAAACATTTCAACGTTTTGTTGGACTGTCTGTTGAAGACATCGTGAAGCGCACGTGGGGCATGGTGGAAGATTTACAAAGAGCCTACGATGATCTTGATGATAAATGGATAAAAGAACCACAAAAATTCTTGGAGCTCATGATTGTGGATGGTTGTTGCATATTGCAACACCTACTCGATGGGGATTCAATCAATCAAATTGTGTTGCGGGATATGCTGCTGCTTGAGAATCAGGTGCCCATGACGCTTCTTCAGAAGCTGCATTTCATGGTAATAAACCAAAACATAGACAAGAAGGTAGATATATTAGTGAGAGGAGGTTGCAAACATCTTTTAGATATGTTCAGGCTAGAATTGATTCTTAGGAGACAAATGGAACCATTACTTCAAAGAAGGCTCATGGGACCGGGAAACGAAATTCAGCTAGCAACACTCTTCCGTAAAGCCGGGATCAAAATCAAGCAAGGCCCACTTGATTTTGATGAAAAACAAGGTGTGTTGAGGCTCCCATTCATCAATATGAATGCTCACATTGAATCAGCCTTGTTAAATGCAATGGCATTCGAGAAACTTTCAGGGATTTCCAAAGAAGCAAACTCTTTCATTATTCTGATGGGTAATCTGATAGAGAAAGATGAGATAGAGTCGTTCAATCAGTTGGCTAAATCTGAGGTTTTGGAAATGTGGAAGGAGGACACTTTTGTATACAATAAAGTGAGAAAGTATTGTAATAGGCCATGGAGAATATGGTGGACAAGGCTCAAAGATACAAACTTTCAAAATCCTTGGACCATTATCTCCACTCTTGCCGCTATCGTAGGCTTTGTGTTACTAATTCTCCAAACCTTATACGGAATCTATGGATACTACAAACCACATTCATCTTGA

Protein sequence

MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQIVLRDMLLLENQVPMTLLQKLHFMVINQNIDKKVDILVRGGCKHLLDMFRLELILRRQMEPLLQRRLMGPGNEIQLATLFRKAGIKIKQGPLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYKPHSS
Homology
BLAST of Tan0003819 vs. ExPASy Swiss-Prot
Match: Q9SD53 (UPF0481 protein At3g47200 OS=Arabidopsis thaliana OX=3702 GN=At3g47200 PE=2 SV=1)

HSP 1 Score: 102.8 bits (255), Expect = 8.6e-21
Identity = 112/419 (26.73%), Postives = 176/419 (42.00%), Query Frame = 0

Query: 42  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTF-QRFVGLSVED-- 101
           I ++P     +NPKA+ P+ +S GPYH+G  HL  +++ K R  Q F        VE+  
Sbjct: 48  IFRVPESFVALNPKAYKPKVVSIGPYHYGEKHLQMIQQHKPRLLQLFLDEAKKKDVEENV 107

Query: 102 IVKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHL--------LDGDSIN 161
           +VK    + + ++++Y +     +K     + +M++DGC IL           L  D I 
Sbjct: 108 LVKAVVDLEDKIRKSYSE----ELKTGHDLMFMMVLDGCFILMVFLIMSGNIELSEDPIF 167

Query: 162 QI------VLRDMLLLENQVPMTLLQKLH---------------FMVINQNIDKKVDILV 221
            I      +  D+LLLENQVP  +LQ L+               F      IDK+     
Sbjct: 168 SIPWLLSSIQSDLLLLENQVPFFVLQTLYVGSKIGVSSDLNRIAFHFFKNPIDKEGSYWE 227

Query: 222 RG---GCKHLLDMFRLELILRRQMEPLLQRRLMGPGNEIQL------------------- 281
           +      KHLLD+ R E  L    E     +   P  ++QL                   
Sbjct: 228 KHRNYKAKHLLDLIR-ETFLPNTSE---SDKASSPHVQVQLHEGKSGNVPSVDSKAVPLI 287

Query: 282 --ATLFRKAGIKIK------QGPLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL-SG 341
             A   R  GIK +         L+   K+  L++P +  +  I S  LN +AFE+  + 
Sbjct: 288 LSAKRLRLQGIKFRLRRSKEDSILNVRLKKNKLQIPQLRFDGFISSFFLNCVAFEQFYTD 347

Query: 342 ISKEANSFIILMGNLIEKDEIESFNQLAK----------SEVLEMWKE---------DTF 375
            S E  ++I+ MG L+  +E  +F +  K          +EV E +K          DT 
Sbjct: 348 SSNEITTYIVFMGCLLNNEEDVTFLRNDKLIIENHFGSNNEVSEFFKTISKDVVFEVDTS 407

BLAST of Tan0003819 vs. ExPASy Swiss-Prot
Match: P0C897 (Putative UPF0481 protein At3g02645 OS=Arabidopsis thaliana OX=3702 GN=At3g02645 PE=3 SV=1)

HSP 1 Score: 55.1 bits (131), Expect = 2.0e-06
Identity = 60/226 (26.55%), Postives = 111/226 (49.12%), Query Frame = 0

Query: 4   EQHK-DEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRM 63
           EQH+ DE   VI +  K+L + LE       +LE ++  I  +P  +   +P ++ P R+
Sbjct: 12  EQHRFDETRWVINV-QKSLDAELEE-----HDLEEVTVSIFNVPKALMCSHPDSYTPHRV 71

Query: 64  SFGPYHHGVVHLYPMEKMKFRSFQTFQ-RFVGLSVEDIVKRTWGMVEDLQRAYDDLDDKW 123
           S GPYH     L+ ME+ K    +  + ++      D+V++   M   ++  Y     K+
Sbjct: 72  SIGPYHCLKPELHEMERYKLMIARKIRNQYNSFRFHDLVEKLQSMEIKIRACY----HKY 131

Query: 124 IK-EPQKFLELMIVDGCCILQHL------LDGDSINQI----VLRDMLLLENQVPMTLLQ 183
           I    +  L +M VD   +++ L           IN++    +LRD++++ENQ+P+ +L+
Sbjct: 132 IGFNGETLLWIMAVDSSFLIEFLKIYSFRKVETLINRVGHNEILRDIMMIENQIPLFVLR 191

Query: 184 K-LHFMV-INQNIDKKVDILVRGGCKHLLDM---FRLELILRRQME 212
           K L F +   ++ D  +  ++ G CK L  +   F  + IL+ Q +
Sbjct: 192 KTLEFQLESTESADDLLLSVLTGLCKDLSPLVIKFDDDQILKAQFQ 227

BLAST of Tan0003819 vs. NCBI nr
Match: XP_023535324.1 (UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 423.3 bits (1087), Expect = 2.2e-114
Identity = 229/404 (56.68%), Postives = 288/404 (71.29%), Query Frame = 0

Query: 1   MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQ 60
           ME+ +  ++ A+V E+ T+NL S LENLPD+L E +     I +IP+ IKKV+PKAF P+
Sbjct: 1   MELSRLHNKDALV-EIITQNLNSHLENLPDDL-ERKGNGASIYRIPDHIKKVHPKAFKPK 60

Query: 61  RMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVKRTWGMVEDLQRAYDDLDDK 120
           R+SFGPYHHG +HL PMEKMK  + + F+R  GL VEDIV   W M+EDLQR+YD LDD+
Sbjct: 61  RVSFGPYHHGELHLSPMEKMKHLALRYFERRCGLCVEDIVNELWDMLEDLQRSYDKLDDE 120

Query: 121 WIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMV 180
           W K+P KFLE+MI+DGC ++Q LL+   +++     VLRDMLLLENQ+PM LL KL+ M 
Sbjct: 121 WKKKPTKFLEVMILDGCLMMQVLLEDTDVSKFTTTDVLRDMLLLENQLPMKLLDKLYLMS 180

Query: 181 INQNIDKKVDILV---------------RGGCKHLLDMFRLELILRRQME-PLLQRRLMG 240
           + ++ +KKV  LV                    HLLDM+R EL+  +  E   LQR  +G
Sbjct: 181 MPEH-NKKVKSLVWEFRNLPTDVKKTLLERDYSHLLDMYRAELVFYKGNEWKPLQRSHLG 240

Query: 241 PGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL 300
            G+EIQLAT F KAGIK K+G     + FD K+GVL LPFI MNAHIES LLNAMAFEKL
Sbjct: 241 MGHEIQLATRFHKAGIKFKKGCNLMDVYFDRKRGVLSLPFIEMNAHIESGLLNAMAFEKL 300

Query: 301 SGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWW 360
            GI    +SF+ILMGNL+EKDE++SFNQLAK EVL MW   T+VYN V ++C RPWRIWW
Sbjct: 301 YGIDNIVDSFVILMGNLMEKDEVDSFNQLAKGEVLGMWGHYTYVYNSVNEHCKRPWRIWW 360

Query: 361 TRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYKPHSS 381
           T LKD NFQ+PWTIISTL+A++GF  LI+QT+YG+YGYY P  S
Sbjct: 361 TTLKDVNFQSPWTIISTLSALIGFAFLIIQTVYGVYGYYLPRRS 401

BLAST of Tan0003819 vs. NCBI nr
Match: XP_022925442.1 (UPF0481 protein At3g47200-like [Cucurbita moschata])

HSP 1 Score: 418.7 bits (1075), Expect = 5.4e-113
Identity = 226/404 (55.94%), Postives = 287/404 (71.04%), Query Frame = 0

Query: 1   MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQ 60
           ME+ +  ++ A+V E+ T+NL S LENLPD+L E +     I +IP+ IKKV+PKAF P+
Sbjct: 1   MELSRLHNKDALV-EIITQNLNSHLENLPDDL-ERKDNGASIYRIPDHIKKVHPKAFKPK 60

Query: 61  RMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVKRTWGMVEDLQRAYDDLDDK 120
           R+SFGPYHHG +HL PMEKMK  + + F+R  GL VEDIV   W M+EDLQR+YD LDD+
Sbjct: 61  RVSFGPYHHGELHLSPMEKMKHLALRYFERRCGLCVEDIVNELWDMLEDLQRSYDKLDDE 120

Query: 121 WIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMV 180
           W K+P KFLE+MI+DGC ++Q LL+   +++     VLRDMLLLENQ+PM LL KL+ M 
Sbjct: 121 WKKKPTKFLEVMILDGCLMMQVLLEDTDVSKFTTTDVLRDMLLLENQLPMKLLDKLYLMS 180

Query: 181 INQNIDKKVDILV---------------RGGCKHLLDMFRLELILRRQME-PLLQRRLMG 240
           + ++ +KKV  LV                    HLLDM+R EL+  +  E   LQR  +G
Sbjct: 181 MPEH-NKKVKSLVWEFRNLPTEVKKTLLERDYLHLLDMYRAELVFYKGNEWKPLQRSHLG 240

Query: 241 PGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL 300
            G+EIQLAT F KAGIK K+G     + FD K+GVL LPFI MNAHI+S L+NAMAFEKL
Sbjct: 241 MGHEIQLATRFHKAGIKFKKGCNLMDVYFDRKRGVLSLPFIEMNAHIDSGLVNAMAFEKL 300

Query: 301 SGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWW 360
            GI    +SF+ILMGNL+EKDE++SFNQLAK EVL MW    +VYN V ++C RPWRIWW
Sbjct: 301 YGIDNIVDSFVILMGNLMEKDEVDSFNQLAKGEVLGMWGHYNYVYNSVNEHCKRPWRIWW 360

Query: 361 TRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYKPHSS 381
           T LKD NFQ+PWTIISTL+A++GF  LI+QT+YG+YGYY P  S
Sbjct: 361 TTLKDVNFQSPWTIISTLSALIGFAFLIIQTVYGVYGYYMPRRS 401

BLAST of Tan0003819 vs. NCBI nr
Match: KAG7025181.1 (UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 393.3 bits (1009), Expect = 2.4e-105
Identity = 215/389 (55.27%), Postives = 271/389 (69.67%), Query Frame = 0

Query: 1   MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQ 60
           ME+ +  ++ A+V E+ T+NL S LENLPD+L E +     I +IP+ IKKV+PKAF P+
Sbjct: 1   MELSRLHNKDALV-EIITQNLNSHLENLPDDL-ERKGNGASIYRIPDHIKKVHPKAFKPK 60

Query: 61  RMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVKRTWGMVEDLQRAYDDLDDK 120
           R+SFGPYHHG +HL PMEKMK  + + F+R  GL VEDIV   W M+EDLQR+YD LDD+
Sbjct: 61  RVSFGPYHHGELHLSPMEKMKHLALRYFERRCGLCVEDIVNELWDMLEDLQRSYDKLDDE 120

Query: 121 WIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMV 180
           W K+P KFLE+MI+DGC ++Q LL+   +++     VLRDM  L   V  TLL++ +   
Sbjct: 121 WKKKPTKFLEVMILDGCLMMQVLLEDTDVSKFTTTDVLRDM-NLPTDVKKTLLERDY--- 180

Query: 181 INQNIDKKVDILVRGGCKHLLDMFRLELILRRQME-PLLQRRLMGPGNEIQLATLFRKAG 240
                             HLLDM+R EL+  +  E   LQR  +G G+EIQLAT F KAG
Sbjct: 181 -----------------SHLLDMYRAELVFYKGNEWKPLQRSHLGMGHEIQLATRFHKAG 240

Query: 241 IKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMG 300
           IK K+G     + FD K+GVL LPFI MNAHIES LLNAMAFEKL GI    +SF+ILM 
Sbjct: 241 IKFKKGCNLMDVYFDRKRGVLSLPFIEMNAHIESGLLNAMAFEKLYGIDNIVDSFVILMD 300

Query: 301 NLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTII 360
           NL+EKDE++SFNQLAK EVL MW   T+VYN V ++C RPWRIW T LKD NFQ+PWTII
Sbjct: 301 NLMEKDEVDSFNQLAKGEVLGMWGHYTYVYNSVNEHCKRPWRIWGTTLKDVNFQSPWTII 360

Query: 361 STLAAIVGFVLLILQTLYGIYGYYKPHSS 381
           STL+A++GF  LI+QT+YG+YGYY PH S
Sbjct: 361 STLSALIGFAFLIIQTVYGVYGYYLPHRS 366

BLAST of Tan0003819 vs. NCBI nr
Match: XP_022925444.1 (UPF0481 protein At3g47200-like [Cucurbita moschata])

HSP 1 Score: 384.4 bits (986), Expect = 1.1e-102
Identity = 218/428 (50.93%), Postives = 277/428 (64.72%), Query Frame = 0

Query: 1   MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQ 60
           ME  + +++ AVV E+ T+N+ S L NLPD+L E +     I +IP  IKKVNP AF PQ
Sbjct: 1   MEQSRPRNKDAVV-EIVTQNVKSHLANLPDDL-ESKGNRASIYRIPEHIKKVNPNAFKPQ 60

Query: 61  RMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVKRTWGMVEDLQRAYDDLDDK 120
            +SFGPYHHG +HL P EK+K  +F+ F++  GLS+ED+V   W M+EDLQR+YD LDDK
Sbjct: 61  LISFGPYHHGELHLMPTEKVKHLAFRYFEKRCGLSIEDMVNEVWDMLEDLQRSYDKLDDK 120

Query: 121 WIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMV 180
           W  EP KFLE+MI+DGC I+Q LL    +  +    VLRD+LLLENQ+PM LL KL  M+
Sbjct: 121 WKTEPAKFLEIMILDGCFIIQVLLGRSPLWTLPLEDVLRDVLLLENQLPMKLLAKLCSML 180

Query: 181 I-------NQNIDKKV-----------DILVRGGCKHLLDMFRLELIL-------RRQME 240
           +       N+N++  V            +L+     H+LDM+R EL         R Q+ 
Sbjct: 181 MLEGEGEHNKNVESLVRESQKIPAYLKKMLMENDYSHILDMYRAELQYYEGNEQERHQIS 240

Query: 241 PLLQR-------------RL--MGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVL 300
               R             RL  +G G+EIQLA  F KAGIK+K+G     +DFD+ +GVL
Sbjct: 241 HTEMRFTHFEMIFGQLGMRLSHLGMGHEIQLARRFHKAGIKLKKGCNLRDVDFDQNKGVL 300

Query: 301 RLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLE 360
            LPFI MNA+IES LLNAMAFEKL GI     SF+ILMGNL+EKDE++SFNQLAK  VL 
Sbjct: 301 SLPFIEMNANIESGLLNAMAFEKLLGIGNIVGSFVILMGNLLEKDEVDSFNQLAKGTVLG 360

Query: 361 MWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIY 381
           +W     VY  V  +C RPW+IWWT LKD NFQ+PWTIIST  A++GF LLI+QT+YG+Y
Sbjct: 361 LWGHYPTVYKSVNNHCKRPWKIWWTTLKDVNFQSPWTIISTFYALIGFALLIIQTVYGVY 420

BLAST of Tan0003819 vs. NCBI nr
Match: KAG7025179.1 (UPF0481 protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 383.3 bits (983), Expect = 2.5e-102
Identity = 213/399 (53.38%), Postives = 270/399 (67.67%), Query Frame = 0

Query: 5   QHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSF 64
           Q+KD    V+E+TT+NL S + N+ D   E+      I +IP  IKKVNP AF PQ +SF
Sbjct: 7   QNKDR---VVEITTQNLKSNMSNVED--FEMNRQKASIYRIPEHIKKVNPNAFKPQLVSF 66

Query: 65  GPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVKRTWGMVEDLQRAYDDLDDKWIKE 124
           GPYHHG +HL PMEK+K  +    +R  GLS++D+V   W M+EDLQR+YD LDD+W  +
Sbjct: 67  GPYHHGELHLLPMEKIKLFALSDIERCFGLSIKDMVNEVWDMLEDLQRSYDKLDDEWKTK 126

Query: 125 PQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMVIN-- 184
           P KFLELMI+DGC ++Q LL    + ++    V RDMLLLENQ+PM LL KL+ M+ +  
Sbjct: 127 PAKFLELMILDGCFLIQLLLYNLPLFKLPLEDVQRDMLLLENQLPMKLLDKLYSMLKSDE 186

Query: 185 -QNIDKKV-----------DILVRGGCKHLLDMFRLELILRRQMEP-LLQRRLMGPGNEI 244
            +NI   V           +IL+     HLLDM+R EL      EP LLQR  +G G+EI
Sbjct: 187 KENIKSLVWRSPNLPTEVKEILMAMDYLHLLDMYRKELKFGTGNEPYLLQRSHLGMGHEI 246

Query: 245 QLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISK 304
           QLA  F KAGIK+K+G     +DFD+ +GVL LPFI MNA+IES LLNAM FEKL GI  
Sbjct: 247 QLARRFDKAGIKLKKGCNLKDVDFDQNKGVLSLPFIQMNANIESGLLNAMTFEKLVGIDN 306

Query: 305 EANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKD 364
              SF+ILMGNL+EKDE++SFNQLAK  VL MW     VY+ V ++C RPWRIWWT LKD
Sbjct: 307 IVGSFVILMGNLLEKDEVDSFNQLAKGTVLSMWDVYAHVYSSVNEHCKRPWRIWWTTLKD 366

Query: 365 TNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYKPHSS 381
            +F  PWTIIS+L A++GF LL++QT+YG+YGYY P  S
Sbjct: 367 -SFIGPWTIISSLYALIGFALLVIQTVYGVYGYYLPRRS 399

BLAST of Tan0003819 vs. ExPASy TrEMBL
Match: A0A6J1EBQ1 (UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111432742 PE=4 SV=1)

HSP 1 Score: 418.7 bits (1075), Expect = 2.6e-113
Identity = 226/404 (55.94%), Postives = 287/404 (71.04%), Query Frame = 0

Query: 1   MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQ 60
           ME+ +  ++ A+V E+ T+NL S LENLPD+L E +     I +IP+ IKKV+PKAF P+
Sbjct: 1   MELSRLHNKDALV-EIITQNLNSHLENLPDDL-ERKDNGASIYRIPDHIKKVHPKAFKPK 60

Query: 61  RMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVKRTWGMVEDLQRAYDDLDDK 120
           R+SFGPYHHG +HL PMEKMK  + + F+R  GL VEDIV   W M+EDLQR+YD LDD+
Sbjct: 61  RVSFGPYHHGELHLSPMEKMKHLALRYFERRCGLCVEDIVNELWDMLEDLQRSYDKLDDE 120

Query: 121 WIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMV 180
           W K+P KFLE+MI+DGC ++Q LL+   +++     VLRDMLLLENQ+PM LL KL+ M 
Sbjct: 121 WKKKPTKFLEVMILDGCLMMQVLLEDTDVSKFTTTDVLRDMLLLENQLPMKLLDKLYLMS 180

Query: 181 INQNIDKKVDILV---------------RGGCKHLLDMFRLELILRRQME-PLLQRRLMG 240
           + ++ +KKV  LV                    HLLDM+R EL+  +  E   LQR  +G
Sbjct: 181 MPEH-NKKVKSLVWEFRNLPTEVKKTLLERDYLHLLDMYRAELVFYKGNEWKPLQRSHLG 240

Query: 241 PGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL 300
            G+EIQLAT F KAGIK K+G     + FD K+GVL LPFI MNAHI+S L+NAMAFEKL
Sbjct: 241 MGHEIQLATRFHKAGIKFKKGCNLMDVYFDRKRGVLSLPFIEMNAHIDSGLVNAMAFEKL 300

Query: 301 SGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWW 360
            GI    +SF+ILMGNL+EKDE++SFNQLAK EVL MW    +VYN V ++C RPWRIWW
Sbjct: 301 YGIDNIVDSFVILMGNLMEKDEVDSFNQLAKGEVLGMWGHYNYVYNSVNEHCKRPWRIWW 360

Query: 361 TRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIYGYYKPHSS 381
           T LKD NFQ+PWTIISTL+A++GF  LI+QT+YG+YGYY P  S
Sbjct: 361 TTLKDVNFQSPWTIISTLSALIGFAFLIIQTVYGVYGYYMPRRS 401

BLAST of Tan0003819 vs. ExPASy TrEMBL
Match: A0A6J1EC69 (UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111432744 PE=4 SV=1)

HSP 1 Score: 384.4 bits (986), Expect = 5.4e-103
Identity = 218/428 (50.93%), Postives = 277/428 (64.72%), Query Frame = 0

Query: 1   MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQ 60
           ME  + +++ AVV E+ T+N+ S L NLPD+L E +     I +IP  IKKVNP AF PQ
Sbjct: 1   MEQSRPRNKDAVV-EIVTQNVKSHLANLPDDL-ESKGNRASIYRIPEHIKKVNPNAFKPQ 60

Query: 61  RMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVKRTWGMVEDLQRAYDDLDDK 120
            +SFGPYHHG +HL P EK+K  +F+ F++  GLS+ED+V   W M+EDLQR+YD LDDK
Sbjct: 61  LISFGPYHHGELHLMPTEKVKHLAFRYFEKRCGLSIEDMVNEVWDMLEDLQRSYDKLDDK 120

Query: 121 WIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMV 180
           W  EP KFLE+MI+DGC I+Q LL    +  +    VLRD+LLLENQ+PM LL KL  M+
Sbjct: 121 WKTEPAKFLEIMILDGCFIIQVLLGRSPLWTLPLEDVLRDVLLLENQLPMKLLAKLCSML 180

Query: 181 I-------NQNIDKKV-----------DILVRGGCKHLLDMFRLELIL-------RRQME 240
           +       N+N++  V            +L+     H+LDM+R EL         R Q+ 
Sbjct: 181 MLEGEGEHNKNVESLVRESQKIPAYLKKMLMENDYSHILDMYRAELQYYEGNEQERHQIS 240

Query: 241 PLLQR-------------RL--MGPGNEIQLATLFRKAGIKIKQG----PLDFDEKQGVL 300
               R             RL  +G G+EIQLA  F KAGIK+K+G     +DFD+ +GVL
Sbjct: 241 HTEMRFTHFEMIFGQLGMRLSHLGMGHEIQLARRFHKAGIKLKKGCNLRDVDFDQNKGVL 300

Query: 301 RLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMGNLIEKDEIESFNQLAKSEVLE 360
            LPFI MNA+IES LLNAMAFEKL GI     SF+ILMGNL+EKDE++SFNQLAK  VL 
Sbjct: 301 SLPFIEMNANIESGLLNAMAFEKLLGIGNIVGSFVILMGNLLEKDEVDSFNQLAKGTVLG 360

Query: 361 MWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLAAIVGFVLLILQTLYGIY 381
           +W     VY  V  +C RPW+IWWT LKD NFQ+PWTIIST  A++GF LLI+QT+YG+Y
Sbjct: 361 LWGHYPTVYKSVNNHCKRPWKIWWTTLKDVNFQSPWTIISTFYALIGFALLIIQTVYGVY 420

BLAST of Tan0003819 vs. ExPASy TrEMBL
Match: A0A6J1ECA1 (UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111432765 PE=4 SV=1)

HSP 1 Score: 362.5 bits (929), Expect = 2.2e-96
Identity = 208/392 (53.06%), Postives = 261/392 (66.58%), Query Frame = 0

Query: 1   MEMEQHKDEGAVVIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQ 60
           MEM    D+  +V  +TT+NL S LEN    +   + I   I +IP+ I KVNP AF PQ
Sbjct: 1   MEMPPLHDKHTMV-RITTQNLDSHLENHYVGVRS-KGIGASIYRIPDYIMKVNPNAFKPQ 60

Query: 61  RMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVKRTWGMVEDLQRAYDDLDDK 120
            +SFGPYHHG +HL PMEK K   F+ F+R  GL  EDIV   W M+EDLQ +YD L D+
Sbjct: 61  LVSFGPYHHGELHLLPMEKEKHLIFEDFKRCYGLCTEDIVNEVWDMLEDLQGSYDKLHDE 120

Query: 121 WIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMTLLQKLHFMV 180
           W  +P KFLE+MIVDG  +L  LL G     +    ++RDMLLLENQ+PM LL KL+ M 
Sbjct: 121 WKTKPDKFLEVMIVDGYFVLFALLLGIHKLSVYFTEIMRDMLLLENQLPMKLLDKLYSM- 180

Query: 181 INQNIDKKV--------DILVRGGCKHLLDMFRLEL-ILRRQMEPLLQRRLMGPGNEIQL 240
           +  + D+KV        + L+     HLLDM+R EL +     E  LQ   +G  +EIQL
Sbjct: 181 LRLDDDRKVKSLIWESNESLMEKDYLHLLDMYRQELKVDTSSRESKLQVSHLGTSHEIQL 240

Query: 241 ATLFRKAGIKIKQGP----LDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEA 300
           AT F KAGIK+++G     L F+E +GVL L FI MNA+IES LLNAM FEKLSGI    
Sbjct: 241 ATRFHKAGIKLEKGCGLDFLYFNENKGVLTLSFIEMNANIESGLLNAMTFEKLSGIDNMV 300

Query: 301 NSFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTN 360
            SF+ILMGNL+EKDE++SFNQLAK  VL MW +  +VY  V K+C RPWRIWWT LK+ +
Sbjct: 301 GSFVILMGNLLEKDEVDSFNQLAKGTVLSMWDKYAYVYYSVNKHCKRPWRIWWTTLKNVS 360

Query: 361 FQNPWTIISTLAAIVGFVLLILQTLYGIYGYY 376
           FQ+PW IIS L+AI+GFVLLI+QT+ G+YGYY
Sbjct: 361 FQSPWIIISALSAIIGFVLLIIQTISGVYGYY 389

BLAST of Tan0003819 vs. ExPASy TrEMBL
Match: A0A6J1EF72 (UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111432743 PE=4 SV=1)

HSP 1 Score: 360.1 bits (923), Expect = 1.1e-95
Identity = 211/445 (47.42%), Postives = 271/445 (60.90%), Query Frame = 0

Query: 1   MEMEQHKDEGAVVIELT---------TKNLTSRLENLPDNLSELEVISKPILKIPNDIKK 60
           ME+   +++  VV+EL+         T+NL  +LE+LPD   + ++    I +IP  IKK
Sbjct: 1   MELSSSRNKDTVVVELSPLQNAVVEITENLKVQLEDLPD-YRKRKINGASIYRIPEHIKK 60

Query: 61  VNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVKRTWGMVEDLQ 120
           VNP AF PQ +SFGPYHHG  HL PMEK K  + Q F+R  GLS ED+V   W M+EDLQ
Sbjct: 61  VNPNAFKPQHVSFGPYHHGEPHLLPMEKKKRLAVQDFERRCGLSTEDMVTELWDMLEDLQ 120

Query: 121 RAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDGDSINQI----VLRDMLLLENQVPMT 180
            +YD LDDKW  +P KFLELM++DGC ++  L +   +        LRDML+LENQ+P+ 
Sbjct: 121 TSYDKLDDKWKTKPAKFLELMMLDGCFMVLVLEEITEMKPFPPRDTLRDMLVLENQLPIK 180

Query: 181 LLQKLHFMVINQN--------------ID-----------------KKVDILVRG----- 240
           LL KL+ M+  +N              ID                   +  LV G     
Sbjct: 181 LLDKLYSMLKQENNQLGLFLLNSSPLLIDIVFFKLSFLNFSVRERFSTIKSLVWGTLNFP 240

Query: 241 ---GCK---------HLLDMFRLELILRRQMEPLLQRRLMGPGNEIQLATLFRKAGIKIK 300
              G K         H+L M+R E++   +   L Q  L G G+EIQLA  F KAGIK+K
Sbjct: 241 TPTGVKEMLMEEDYLHVLHMYRKEVMFYNEQNRLRQSHL-GMGHEIQLARRFHKAGIKLK 300

Query: 301 QG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMGNLIE 360
           +G     + FDEK+G L LPFI MNA++ES LLN MAFEKLSGI+    SF+ILMGNL E
Sbjct: 301 KGCNFMDVGFDEKKGGLTLPFIEMNANVESGLLNVMAFEKLSGIANIVGSFVILMGNLPE 360

Query: 361 KDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNFQNPWTIISTLA 381
           KDE++SFNQLAK  VLE+W     VY+ V K+  RPW+IWWT LKD NFQ+PWTIIST +
Sbjct: 361 KDEVDSFNQLAKGTVLELWGRFFNVYDSVNKHSKRPWKIWWTTLKDANFQSPWTIISTFS 420

BLAST of Tan0003819 vs. ExPASy TrEMBL
Match: A0A6J1EI35 (UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111432766 PE=4 SV=1)

HSP 1 Score: 351.3 bits (900), Expect = 5.1e-93
Identity = 206/396 (52.02%), Postives = 258/396 (65.15%), Query Frame = 0

Query: 13  VIELTTKNLTSRLENLPDNLSELEVISKPILKIPNDIKKVNPKAFLPQRMSFGPYHHGVV 72
           ++E   +NL S L NL     E + I   I +IP  I  VNP AF P+ +SFGPYHHG +
Sbjct: 1   MVETIIQNLESHLANLQS--IESKGIGASIYRIPEHIMNVNPNAFKPKLVSFGPYHHGEL 60

Query: 73  HLYPMEKMKFRSFQTFQRFV-GLSVEDIVKRTWGMVEDLQRAYDDLDDKWIKEPQKFLEL 132
           HL PMEK K  +   F++   GL+ E IV   W M+ DLQ +YD LDDKW KEP KFLEL
Sbjct: 61  HLMPMEKKKHEALWYFKKSCRGLTTEKIVSGLWNMLGDLQGSYDKLDDKWKKEPLKFLEL 120

Query: 133 MIVDGCCILQHLLDGDSI----NQIVLRDMLLLENQVPMTLLQKLH---------FMVIN 192
           MI+DGC I+   L+   +    N  V RDMLLLENQ+PM LL KL+         F++  
Sbjct: 121 MILDGCLIMHIFLEDKYLLKFNNVDVQRDMLLLENQLPMMLLDKLYSILKPGKNKFVMHP 180

Query: 193 QNIDKKV---DILVRGGCK----HLLDMFRLEL---ILRRQMEPLLQRRLMGPGNEIQLA 252
            N+   V   D +  G  +    HLLDM+R EL   +L R  +  LQ   +G  +EI+LA
Sbjct: 181 LNVKSLVWESDSVGTGAMEKDYLHLLDMYRWELNINVLPRWSK--LQVSHLGTSHEIRLA 240

Query: 253 TLFRKAGIKIKQG----PLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEAN 312
           T F KAGIK+++G     + FDE +G+L LPFI MNA+IES LLNAMAFEKLSGI     
Sbjct: 241 TRFHKAGIKLEKGWNLRAVSFDENKGILSLPFIEMNANIESGLLNAMAFEKLSGIDNIVG 300

Query: 313 SFIILMGNLIEKDEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTNF 372
           SF++LMGNL++K E++SFNQLAK EVL  + +   VY  V ++C RPWRIWWT LKD NF
Sbjct: 301 SFVVLMGNLLKKGEVDSFNQLAKGEVLS-FLDYYSVYRLVNEHCKRPWRIWWTTLKDVNF 360

Query: 373 QNPWTIISTLAAIVGFVLLILQTLYGIYGYYKPHSS 381
           QNPWTIISTL+A +GFVLLILQT+YG+YGYY P  S
Sbjct: 361 QNPWTIISTLSASIGFVLLILQTVYGMYGYYLPRRS 391

BLAST of Tan0003819 vs. TAIR 10
Match: AT3G50120.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 117.1 bits (292), Expect = 3.1e-26
Identity = 108/433 (24.94%), Postives = 181/433 (41.80%), Query Frame = 0

Query: 42  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVK 101
           I ++P  +++ + K++ PQ +S GPYHHG   L  M++ K+R+     +     ++  + 
Sbjct: 105 IYRVPYYLQENDNKSYFPQTVSLGPYHHGKKRLRSMDRHKWRAVNRVLKRTNQGIKMYID 164

Query: 102 RTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLLDG--DSINQI------- 161
               + E  +  Y   +        +F+E++++DGC +L+ L  G  +   ++       
Sbjct: 165 AMRELEEKARACY---EGPLSLSSNEFIEMLVLDGCFVLE-LFRGAVEGFTELGYARNDP 224

Query: 162 ----------VLRDMLLLENQVPMTLLQKLHFMVI---NQN------------------- 221
                     + RDM++LENQ+P+ +L +L  + +   NQ                    
Sbjct: 225 VFAMRGSMHSIQRDMVMLENQLPLFVLNRLLELQLGTRNQTGLVAQLAIRFFDPLMPTDE 284

Query: 222 ---------------IDKKVDILVRGGCKHLLDMFRLELILRR-QMEPLLQRRLMGPGNE 281
                           DK  D     G  H LD+FR  L+    + EP L R+       
Sbjct: 285 PLTKSGQSKLENSLARDKSFDPFADMGELHCLDVFRRSLLRSSPKPEPRLTRKRWSRNTR 344

Query: 282 ---------IQLATLFRKAGIKIKQGPL----DFDEKQGVLRLPFINMNAHIESALLNAM 341
                    I   T  ++AGIK ++       D   K G L +P + ++   +S  LN +
Sbjct: 345 VADKRRQQLIHCVTELKEAGIKFRRRKTDRFWDMQFKNGYLEIPRLLIHDGTKSLFLNLI 404

Query: 342 AFEKLS-GISKEANSFIILMGNLIEKDE---------------------IESFNQLAKSE 380
           AFE+     S +  S+II M NLI+  E                      + FN+L +  
Sbjct: 405 AFEQCHIDSSNDITSYIIFMDNLIDSHEDVSYLHYCGIIEHWLGSDSEVADLFNRLCQEV 464

BLAST of Tan0003819 vs. TAIR 10
Match: AT4G31980.1 (unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF247, plant (InterPro:IPR004158), Protein of unknown function DUF862, eukaryotic (InterPro:IPR008580); BEST Arabidopsis thaliana protein match is: Plant protein of unknown function (DUF247) (TAIR:AT5G11290.1); Has 1967 Blast hits to 1844 proteins in 183 species: Archae - 0; Bacteria - 6; Metazoa - 223; Fungi - 83; Plants - 1477; Viruses - 0; Other Eukaryotes - 178 (source: NCBI BLink). )

HSP 1 Score: 114.4 bits (285), Expect = 2.0e-25
Identity = 101/388 (26.03%), Postives = 171/388 (44.07%), Query Frame = 0

Query: 42  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTFQRFVGLSVEDIVK 101
           I K+PN ++++NP A+ P+ +SFGP H G   L  ME  K+R   +F      S+ED+V+
Sbjct: 295 IYKVPNKLRRLNPDAYTPRLVSFGPLHRGKEELQAMEDQKYRYLLSFIPRTNSSLEDLVR 354

Query: 102 --RTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLL---------DGDSI- 161
             RTW      Q A     +       +F+E+++VDG  +++ LL         + D I 
Sbjct: 355 LARTWE-----QNARSCYAEDVKLHSDEFVEMLVVDGSFLVELLLRSHYPRLRGENDRIF 414

Query: 162 -NQI----VLRDMLLLENQVPMTLLQKLHFMVIN---QNIDKKVDILVRGGCKHLLDMFR 221
            N +    V RDM+L+ENQ+P  +++++  +++N   Q     + +  R     L  +  
Sbjct: 415 GNSMMITDVCRDMILIENQLPFFVVKEIFLLLLNYYQQGTPSIIQLAQRHFSYFLSRIDD 474

Query: 222 LELILRRQMEPLLQRRLMGPGNEIQL------------ATLFRKAGIKIKQGP-----LD 281
            + I   +    L R    P   I+L            AT    AG++ K        LD
Sbjct: 475 EKFITEPEHFVDLLRSCYLPQFPIKLEYTTVKVDNAPEATELHTAGVRFKPAETSSCLLD 534

Query: 282 FDEKQGVLRLPFINMNAHIESALLNAMAFEKLSGISKEANSFIILMGNLIEK-------- 341
                GVL++P I ++   ES   N + FE+    +K    +I+L+G  I+         
Sbjct: 535 ISFADGVLKIPTIVVDDLTESLYKNIIGFEQCRCSNKNFLDYIMLLGCFIKSPTDADLLI 594

Query: 342 -------------DEIESFNQLAKSEVLEMWKEDTFVYNKVRKYCNRPWRIWWTRLKDTN 372
                        D    FN ++K  + +     + +   ++ YCN PW  W   L+   
Sbjct: 595 HSGIIVNYLGNSVDVSNLFNSISKEVIYDRRFYFSMLSENLQAYCNTPWNRWKAILRRDY 654

BLAST of Tan0003819 vs. TAIR 10
Match: AT3G47250.1 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 111.3 bits (277), Expect = 1.7e-24
Identity = 112/416 (26.92%), Postives = 184/416 (44.23%), Query Frame = 0

Query: 42  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTF-QRFVGLSV-EDI 101
           I +IP+ + +VNPKA+ P+ +S GPYH+G  HL  +++ KFR  + F  R     + E++
Sbjct: 63  IFRIPDSLAEVNPKAYKPKVVSIGPYHYGENHLQMIQQHKFRFLELFVDRATKKGMDENV 122

Query: 102 VKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLL----------DGDSI 161
           +    G ++   RA     ++   E  + + +MI+DGC IL  LL          + D I
Sbjct: 123 LYAAVGALQHKIRA--SYSEELRIEKSELVSMMILDGCFILMLLLIVSRKIDLDMNKDPI 182

Query: 162 NQI------VLRDMLLLENQVPMTLLQ---------------KLHFMVINQNIDKKVDIL 221
             I      +  D+LLLENQVP  +L+               ++ F   N +IDK     
Sbjct: 183 FTIPWILASIQSDLLLLENQVPFFVLRTIFDKSGIGSPGDLNRMAFSFFNLSIDKPDTYW 242

Query: 222 VR---GGCKHLLDMFRLELI--LRRQMEPLLQ-----RRLMGPGNEIQ----------LA 281
            +    G KHLLD+ R   I  +R   E         +   G   E+            A
Sbjct: 243 AKHRDRGAKHLLDLNRKTFIPSMRSMGESETTPSSKFQHSKGKSGEVSSSESTFPLILSA 302

Query: 282 TLFRKAGIKIK-----QGPLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL-SGISKE 341
              R  GIK +     +  LD   K+  L++P + ++  I S  LN +AFE+  +  + +
Sbjct: 303 KRLRLQGIKFRLRSDAESILDIKLKKNKLQIPLLRLDGFISSIFLNCVAFEQFYTESTND 362

Query: 342 ANSFIILMGNLIEKDEIESFNQLAK----------SEVLEMWKE-------DT------F 376
             S+++ MG L+   E  +F    K          +EV + +K        DT       
Sbjct: 363 ITSYVVFMGCLLNDQEDATFLNNDKRIIENYFGNENEVSQFFKTICKDVVFDTRRSYLRN 422

BLAST of Tan0003819 vs. TAIR 10
Match: AT3G47250.2 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 111.3 bits (277), Expect = 1.7e-24
Identity = 112/416 (26.92%), Postives = 184/416 (44.23%), Query Frame = 0

Query: 42  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTF-QRFVGLSV-EDI 101
           I +IP+ + +VNPKA+ P+ +S GPYH+G  HL  +++ KFR  + F  R     + E++
Sbjct: 63  IFRIPDSLAEVNPKAYKPKVVSIGPYHYGENHLQMIQQHKFRFLELFVDRATKKGMDENV 122

Query: 102 VKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLL----------DGDSI 161
           +    G ++   RA     ++   E  + + +MI+DGC IL  LL          + D I
Sbjct: 123 LYAAVGALQHKIRA--SYSEELRIEKSELVSMMILDGCFILMLLLIVSRKIDLDMNKDPI 182

Query: 162 NQI------VLRDMLLLENQVPMTLLQ---------------KLHFMVINQNIDKKVDIL 221
             I      +  D+LLLENQVP  +L+               ++ F   N +IDK     
Sbjct: 183 FTIPWILASIQSDLLLLENQVPFFVLRTIFDKSGIGSPGDLNRMAFSFFNLSIDKPDTYW 242

Query: 222 VR---GGCKHLLDMFRLELI--LRRQMEPLLQ-----RRLMGPGNEIQ----------LA 281
            +    G KHLLD+ R   I  +R   E         +   G   E+            A
Sbjct: 243 AKHRDRGAKHLLDLNRKTFIPSMRSMGESETTPSSKFQHSKGKSGEVSSSESTFPLILSA 302

Query: 282 TLFRKAGIKIK-----QGPLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL-SGISKE 341
              R  GIK +     +  LD   K+  L++P + ++  I S  LN +AFE+  +  + +
Sbjct: 303 KRLRLQGIKFRLRSDAESILDIKLKKNKLQIPLLRLDGFISSIFLNCVAFEQFYTESTND 362

Query: 342 ANSFIILMGNLIEKDEIESFNQLAK----------SEVLEMWKE-------DT------F 376
             S+++ MG L+   E  +F    K          +EV + +K        DT       
Sbjct: 363 ITSYVVFMGCLLNDQEDATFLNNDKRIIENYFGNENEVSQFFKTICKDVVFDTRRSYLRN 422

BLAST of Tan0003819 vs. TAIR 10
Match: AT3G47250.3 (Plant protein of unknown function (DUF247) )

HSP 1 Score: 111.3 bits (277), Expect = 1.7e-24
Identity = 112/416 (26.92%), Postives = 184/416 (44.23%), Query Frame = 0

Query: 42  ILKIPNDIKKVNPKAFLPQRMSFGPYHHGVVHLYPMEKMKFRSFQTF-QRFVGLSV-EDI 101
           I +IP+ + +VNPKA+ P+ +S GPYH+G  HL  +++ KFR  + F  R     + E++
Sbjct: 63  IFRIPDSLAEVNPKAYKPKVVSIGPYHYGENHLQMIQQHKFRFLELFVDRATKKGMDENV 122

Query: 102 VKRTWGMVEDLQRAYDDLDDKWIKEPQKFLELMIVDGCCILQHLL----------DGDSI 161
           +    G ++   RA     ++   E  + + +MI+DGC IL  LL          + D I
Sbjct: 123 LYAAVGALQHKIRA--SYSEELRIEKSELVSMMILDGCFILMLLLIVSRKIDLDMNKDPI 182

Query: 162 NQI------VLRDMLLLENQVPMTLLQ---------------KLHFMVINQNIDKKVDIL 221
             I      +  D+LLLENQVP  +L+               ++ F   N +IDK     
Sbjct: 183 FTIPWILASIQSDLLLLENQVPFFVLRTIFDKSGIGSPGDLNRMAFSFFNLSIDKPDTYW 242

Query: 222 VR---GGCKHLLDMFRLELI--LRRQMEPLLQ-----RRLMGPGNEIQ----------LA 281
            +    G KHLLD+ R   I  +R   E         +   G   E+            A
Sbjct: 243 AKHRDRGAKHLLDLNRKTFIPSMRSMGESETTPSSKFQHSKGKSGEVSSSESTFPLILSA 302

Query: 282 TLFRKAGIKIK-----QGPLDFDEKQGVLRLPFINMNAHIESALLNAMAFEKL-SGISKE 341
              R  GIK +     +  LD   K+  L++P + ++  I S  LN +AFE+  +  + +
Sbjct: 303 KRLRLQGIKFRLRSDAESILDIKLKKNKLQIPLLRLDGFISSIFLNCVAFEQFYTESTND 362

Query: 342 ANSFIILMGNLIEKDEIESFNQLAK----------SEVLEMWKE-------DT------F 376
             S+++ MG L+   E  +F    K          +EV + +K        DT       
Sbjct: 363 ITSYVVFMGCLLNDQEDATFLNNDKRIIENYFGNENEVSQFFKTICKDVVFDTRRSYLRN 422

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9SD538.6e-2126.73UPF0481 protein At3g47200 OS=Arabidopsis thaliana OX=3702 GN=At3g47200 PE=2 SV=1[more]
P0C8972.0e-0626.55Putative UPF0481 protein At3g02645 OS=Arabidopsis thaliana OX=3702 GN=At3g02645 ... [more]
Match NameE-valueIdentityDescription
XP_023535324.12.2e-11456.68UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo][more]
XP_022925442.15.4e-11355.94UPF0481 protein At3g47200-like [Cucurbita moschata][more]
KAG7025181.12.4e-10555.27UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_022925444.11.1e-10250.93UPF0481 protein At3g47200-like [Cucurbita moschata][more]
KAG7025179.12.5e-10253.38UPF0481 protein [Cucurbita argyrosperma subsp. argyrosperma][more]
Match NameE-valueIdentityDescription
A0A6J1EBQ12.6e-11355.94UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111432742 PE=... [more]
A0A6J1EC695.4e-10350.93UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111432744 PE=... [more]
A0A6J1ECA12.2e-9653.06UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111432765 PE=... [more]
A0A6J1EF721.1e-9547.42UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111432743 PE=... [more]
A0A6J1EI355.1e-9352.02UPF0481 protein At3g47200-like OS=Cucurbita moschata OX=3662 GN=LOC111432766 PE=... [more]
Match NameE-valueIdentityDescription
AT3G50120.13.1e-2624.94Plant protein of unknown function (DUF247) [more]
AT4G31980.12.0e-2526.03unknown protein; CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF247,... [more]
AT3G47250.11.7e-2426.92Plant protein of unknown function (DUF247) [more]
AT3G47250.21.7e-2426.92Plant protein of unknown function (DUF247) [more]
AT3G47250.31.7e-2426.92Plant protein of unknown function (DUF247) [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR004158Protein of unknown function DUF247, plantPFAMPF03140DUF247coord: 42..362
e-value: 3.3E-50
score: 171.5
IPR004158Protein of unknown function DUF247, plantPANTHERPTHR31549PROTEIN, PUTATIVE (DUF247)-RELATED-RELATEDcoord: 27..372

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0003819.1Tan0003819.1mRNA