Tan0012268 (gene) Snake gourd v1

Overview
NameTan0012268
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function DUF455
LocationLG11: 7693074 .. 7695557 (-)
RNA-Seq ExpressionTan0012268
SyntenyTan0012268
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCAGACGAGACCTTGGTCGAAGCGGCGCTTCGAGTTCTCAACACTTCCGATCCCTTCGAGAAGGCCGAACTCGGCGATAGCGTAGCTTCTCGATGGCTTAACGGCGCCATTTCTCGCGCTTACGATCCCTCCGCCGATCTTGCGGTCCCCGACCGCCCTGCCAGGCTCTCCGATGTAACAATCAACCCCTCTTCCTCATCTCTTCGTGTATATATATTGTATGTTCATTGCAATGGTGCAAGGTTTGAAGATTCTCTGCGTTCGAAATGGTTTAGGTGAAGTTGGTTTCGCCGAGTCTCATGCCGAAGCTGGGGAAGGCGGGAAGCTTACAGAGCCGGCAGGCTATTGTGCATAGTCTTGTCCACACTGAAAGTTGGGCTATCGATTTGTCGTGGGTACTTCGCCGTTTTATTTTTAAGTCGGATTTGTTCTGAATTCCGTATTTAGAGTACGCATTTAATTCTATACTCTTGTTATTTCATGGCATTCTGTTCGATTTTGGAGGTTCTAGTAAATCCCAAACCCTTTTTGTTGCTCTGTCTCTCTGTCTCTCTTCGTTGTTAATGAATCCCTTCTTCTTTCTCGAATGCCGTTATCGATTCTCTTTTGCCCATTAAGAGTTGAAATAATAATCAATTCGTCATTTCTCTTTGTGTAATCTGCTGTTATGAAAACCTCGTTAGTCGTTCCCCTTTTGCTGCTTCTTTTTATGTACAGTATACTATTAATTATTATACTTGTGCATACTTTTGCTTGTCCCTTCTATTTTGACTCTTTGTTTATTTTATGGCTGCTTGATATTCTCTTTCCCTTTCCTACAATGGAAAATTTACACCTAATTGCCTATAGTGAGGAAGAGAGTACCCTGTCTGATAATTGCAAGAAATATATCATTGAAGCCAGTTATTACCAGTAAATTGTCTTCTTGTGTTTTAAGTGTGTTTAATTTTTGCAATAACTACTAGTTATACTTCTCTATGGCATCCATTTCTGGGATATTTCTATTACCCTAGTTCTTCACCTATCATTTCATGTGACCGTTAATTTTTTTAGCATTATTGACTAGGCCAAGTTTTAATTTTCTGTAGGACATAATAGCTCGGTTCGGAAAACAAGAAGGAATGCCGAGGGAGTTCTTCACTGATTTCGTTAGGGTGGCTCAGGATGAAGGTAGACATTTCACTCTTCTAGCTGCACGACTTAAGGAACTGGGCTCTTTCTATGGAGCACTGCCCGCTCATGATGGCCTTTGGGATTCTGCTATTGCTACTTCCAAGGATTTATTAGCACGCTTGGCAATTGAGCATTGTGTCCACGAGGTGCGTGCTTGATTTATTTAGCTGATGATGAAGTAGAAAAGAATGAGCTTTATCAAGCGTATGCTTAAGTTCAATTTCTCTCTTTTAATTAACGAACTTTCAACATCAATGAATTTCTAAGATCGGGTGGTAGAATTGTGGGTAAAAATCTTATTTCGTTCTTTGAACTTTCACATGTGTTATGACGTTTCCATTGTACTAGTTTAACAAGAATCTTAAGTTGACAATCCGAGCATAGTTTAGTGGCTAAGGCACTATAACATCTAAAGGTTGATGGTCGAATCCTGCGATTGTTAGAACTCATAAAAAAAAAAAAGAACAAGAATTTAAAGTTATCTATTTTTAACTATTTTCTAATTTGATGAATAGACCCCTTATATGTGGTTAAGGACAGTCTCTGCAATAAGGAAATTATCACGCAAGGACCAGTATATACATTTATGGAAAATTGAAGGACCGAAACACCTGAAATAAAATAATATTTAAACCTGGAACATAAAATAAAATAGAACAAAATAATGATACTTGGGGGACTGAGTTATCTTTATAATTCGTTTGTGTAGATTTATTTATGATGATGGCTGGTCGCATATGAGAGTGTTTTTAAAAACATGGAAACTCCTTAATAGTACTTTGATATGTATTTTGAAGAAACTAAGGAAATATGCTATTTAAATTACTCAGATGGCCTATTGCACACCAATGATCTTGAAGCTCTTTTAGAGGTGTATGTATGTGTATGTTTATCGTACCGAAAAACCTGTTGTTGAATAAATTGTATGAACAGGCTAGAGGGCTCGATGTGCTTCCGACGACCATCTCGCGATTCAGAAATGGAGGTGACAATGAGACTGCAGATTTATTGGAGAAAGTAGTGTACCCAGAAGAAGTAACCCATTGTGGTGCAGGAGTAAAATGGTTTAAATATCTTTGCCAGAGAACTGGAAAACAGATAATGAATGGGACAGAAGAAGATAATGCAATGGAAATGGATGAGGAAGAAATCATTCACAAGTTTCATGAAATTGTGAGAAAGCACTTCAGGGGGCCATTGAAGCCACCTTTCAACGAAGTGGCAAGAAAAGCTGCTGGTTTTGGTCCTCGATGGTATGAACCACTTGCTTTCAAAAGAGACCCTACCTTAGATTCTGAATGTGAATGA

mRNA sequence

ATGGCAGACGAGACCTTGGTCGAAGCGGCGCTTCGAGTTCTCAACACTTCCGATCCCTTCGAGAAGGCCGAACTCGGCGATAGCGTAGCTTCTCGATGGCTTAACGGCGCCATTTCTCGCGCTTACGATCCCTCCGCCGATCTTGCGGTCCCCGACCGCCCTGCCAGGCTCTCCGATGTGAAGTTGGTTTCGCCGAGTCTCATGCCGAAGCTGGGGAAGGCGGGAAGCTTACAGAGCCGGCAGGCTATTGTGCATAGTCTTGTCCACACTGAAAGTTGGGCTATCGATTTGTCGTGGGACATAATAGCTCGGTTCGGAAAACAAGAAGGAATGCCGAGGGAGTTCTTCACTGATTTCGTTAGGGTGGCTCAGGATGAAGGTAGACATTTCACTCTTCTAGCTGCACGACTTAAGGAACTGGGCTCTTTCTATGGAGCACTGCCCGCTCATGATGGCCTTTGGGATTCTGCTATTGCTACTTCCAAGGATTTATTAGCACGCTTGGCAATTGAGCATTGTGTCCACGAGGCTAGAGGGCTCGATGTGCTTCCGACGACCATCTCGCGATTCAGAAATGGAGGTGACAATGAGACTGCAGATTTATTGGAGAAAGTAGTGTACCCAGAAGAAGTAACCCATTGTGGTGCAGGAGTAAAATGGTTTAAATATCTTTGCCAGAGAACTGGAAAACAGATAATGAATGGGACAGAAGAAGATAATGCAATGGAAATGGATGAGGAAGAAATCATTCACAAGTTTCATGAAATTGTGAGAAAGCACTTCAGGGGGCCATTGAAGCCACCTTTCAACGAAGTGGCAAGAAAAGCTGCTGGTTTTGGTCCTCGATGGTATGAACCACTTGCTTTCAAAAGAGACCCTACCTTAGATTCTGAATGTGAATGA

Coding sequence (CDS)

ATGGCAGACGAGACCTTGGTCGAAGCGGCGCTTCGAGTTCTCAACACTTCCGATCCCTTCGAGAAGGCCGAACTCGGCGATAGCGTAGCTTCTCGATGGCTTAACGGCGCCATTTCTCGCGCTTACGATCCCTCCGCCGATCTTGCGGTCCCCGACCGCCCTGCCAGGCTCTCCGATGTGAAGTTGGTTTCGCCGAGTCTCATGCCGAAGCTGGGGAAGGCGGGAAGCTTACAGAGCCGGCAGGCTATTGTGCATAGTCTTGTCCACACTGAAAGTTGGGCTATCGATTTGTCGTGGGACATAATAGCTCGGTTCGGAAAACAAGAAGGAATGCCGAGGGAGTTCTTCACTGATTTCGTTAGGGTGGCTCAGGATGAAGGTAGACATTTCACTCTTCTAGCTGCACGACTTAAGGAACTGGGCTCTTTCTATGGAGCACTGCCCGCTCATGATGGCCTTTGGGATTCTGCTATTGCTACTTCCAAGGATTTATTAGCACGCTTGGCAATTGAGCATTGTGTCCACGAGGCTAGAGGGCTCGATGTGCTTCCGACGACCATCTCGCGATTCAGAAATGGAGGTGACAATGAGACTGCAGATTTATTGGAGAAAGTAGTGTACCCAGAAGAAGTAACCCATTGTGGTGCAGGAGTAAAATGGTTTAAATATCTTTGCCAGAGAACTGGAAAACAGATAATGAATGGGACAGAAGAAGATAATGCAATGGAAATGGATGAGGAAGAAATCATTCACAAGTTTCATGAAATTGTGAGAAAGCACTTCAGGGGGCCATTGAAGCCACCTTTCAACGAAGTGGCAAGAAAAGCTGCTGGTTTTGGTCCTCGATGGTATGAACCACTTGCTTTCAAAAGAGACCCTACCTTAGATTCTGAATGTGAATGA

Protein sequence

MADETLVEAALRVLNTSDPFEKAELGDSVASRWLNGAISRAYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCGAGVKWFKYLCQRTGKQIMNGTEEDNAMEMDEEEIIHKFHEIVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAFKRDPTLDSECE
Homology
BLAST of Tan0012268 vs. ExPASy Swiss-Prot
Match: P43935 (Uncharacterized protein HI_0077 OS=Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) OX=71421 GN=HI_0077 PE=4 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 1.6e-22
Identity = 84/278 (30.22%), Postives = 132/278 (47.48%), Query Frame = 0

Query: 7   VEAALRVLNTSDPFEKAELGDSVASRWLNGAISRAYDPSADLAVPDRPARLSDVK-LVSP 66
           VE AL+   T++P EK  L + +    L        +   ++   D  A   +   LV+P
Sbjct: 12  VETALK---TANPQEKCRLVNDLYDNLLPQIQLIKLEDFPEIVPQDNIAAFPEKPLLVAP 71

Query: 67  SLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGK----QEGMPREFFTDFVR 126
             +PK   A + +   A +H++ H E  AI+L  D   RFG+    + G    F  D++R
Sbjct: 72  KDVPKRSFA-TEEGYAATLHAIAHIEFNAINLGLDAAWRFGRNAQEELGEGLAFVKDWLR 131

Query: 127 VAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLD 186
           VA++E  HF+L+   LK LG  YG   AH GLW+ A AT+ D+  R+A+   V EARGLD
Sbjct: 132 VAREESTHFSLVNEHLKTLGYQYGDFEAHAGLWEMAQATAHDIWERMALVPRVLEARGLD 191

Query: 187 VLPTTISRFRNGGDNETADLLEKVVYPEEVTHCGAGVKWFKYLCQRTGKQIMNGTEEDNA 246
             P    +     D    ++L+ ++  +E+ H   G  W+  L ++ G   M        
Sbjct: 192 ATPVLQEKIAQRKDFAAVNILD-IILRDEIGHVYIGNHWYHALSKKRGLDAMKCF----- 251

Query: 247 MEMDEEEIIHKFHEIVRKHFRGPLKPPFNEVARKAAGF 280
                 E++HK+  ++   F+G +    N  AR  AGF
Sbjct: 252 -----TELLHKYRIVI---FKGVI----NTDARIQAGF 267

BLAST of Tan0012268 vs. NCBI nr
Match: KAG6602270.1 (hypothetical protein SDJN03_07503, partial [Cucurbita argyrosperma subsp. sororia] >KAG7032955.1 hypothetical protein SDJN02_07006 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 563.1 bits (1450), Expect = 1.4e-156
Identity = 278/311 (89.39%), Postives = 287/311 (92.28%), Query Frame = 0

Query: 1   MADETLVEAALRVLNTSDPFEKAELGDSVASRWLNGAISRAYDPSADLAVPDRPARLSDV 60
           MADETLVEAALRVLNTSDPFEKAELGD VASRWLNG ISR YDPS+DLAVPDRPARLSDV
Sbjct: 1   MADETLVEAALRVLNTSDPFEKAELGDKVASRWLNGVISRPYDPSSDLAVPDRPARLSDV 60

Query: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV 120
           KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQE MPREFFTDFV
Sbjct: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEEMPREFFTDFV 120

Query: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180
           RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL
Sbjct: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180

Query: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCGAGVKWFKYLCQRTGKQIMNGT---- 240
           DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHC AG+KWFKYLCQR+GKQ ++G     
Sbjct: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCAAGLKWFKYLCQRSGKQTLDGDGLLL 240

Query: 241 ------EEDNA-MEMDEEEIIHKFHEIVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAF 300
                  EDNA +EM+ EEIIHKFH IVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAF
Sbjct: 241 LLHQDGAEDNASLEMENEEIIHKFHAIVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAF 300

BLAST of Tan0012268 vs. NCBI nr
Match: XP_022954428.1 (uncharacterized protein LOC111456685 [Cucurbita moschata])

HSP 1 Score: 563.1 bits (1450), Expect = 1.4e-156
Identity = 275/310 (88.71%), Postives = 287/310 (92.58%), Query Frame = 0

Query: 1   MADETLVEAALRVLNTSDPFEKAELGDSVASRWLNGAISRAYDPSADLAVPDRPARLSDV 60
           MADETLVEAALRVLNTSDPFEKAELGD VASRWLNG ISR YDPS+DLAVPDRPARLSDV
Sbjct: 1   MADETLVEAALRVLNTSDPFEKAELGDKVASRWLNGVISRPYDPSSDLAVPDRPARLSDV 60

Query: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV 120
           KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQE MPREFFTDFV
Sbjct: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEEMPREFFTDFV 120

Query: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180
           RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL
Sbjct: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180

Query: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCGAGVKWFKYLCQRTGKQIM------- 240
           DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHC AGVKWF+YLCQR+GKQ +       
Sbjct: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCAAGVKWFRYLCQRSGKQTLDRDGLLL 240

Query: 241 ---NGTEEDNAMEMDEEEIIHKFHEIVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAFK 300
              +G E++ ++EM+ EEIIHKFH IVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAFK
Sbjct: 241 LHQDGAEDNASLEMENEEIIHKFHAIVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAFK 300

BLAST of Tan0012268 vs. NCBI nr
Match: XP_023551934.1 (uncharacterized protein LOC111809758 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 559.3 bits (1440), Expect = 2.0e-155
Identity = 277/310 (89.35%), Postives = 285/310 (91.94%), Query Frame = 0

Query: 1   MADETLVEAALRVLNTSDPFEKAELGDSVASRWLNGAISRAYDPSADLAVPDRPARLSDV 60
           MADETLVEAALRVLNTSDPFEKAELGD VASRWLNG ISR YDPS+DLAVPDRPARLSDV
Sbjct: 18  MADETLVEAALRVLNTSDPFEKAELGDKVASRWLNGVISRPYDPSSDLAVPDRPARLSDV 77

Query: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV 120
           KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQE MPREFFTDFV
Sbjct: 78  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEEMPREFFTDFV 137

Query: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180
           RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL
Sbjct: 138 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 197

Query: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCGAGVKWFKYLCQRTGKQIMNGT---- 240
           DVLPTTISRFRNGG NETADLLEKVVYPEEVTHC AG+KWFKYLCQR+GKQ ++G     
Sbjct: 198 DVLPTTISRFRNGGVNETADLLEKVVYPEEVTHCAAGLKWFKYLCQRSGKQTLDGDGLLL 257

Query: 241 -----EEDNA-MEMDEEEIIHKFHEIVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAFK 300
                 EDNA +EM+ EEIIHKFH IVRKHFRGPLKPPFN VARKAAGFGPRWYEPLAFK
Sbjct: 258 LHEHGAEDNASLEMENEEIIHKFHAIVRKHFRGPLKPPFNVVARKAAGFGPRWYEPLAFK 317

BLAST of Tan0012268 vs. NCBI nr
Match: XP_038884770.1 (uncharacterized protein HI_0077 [Benincasa hispida])

HSP 1 Score: 556.2 bits (1432), Expect = 1.7e-154
Identity = 272/298 (91.28%), Postives = 283/298 (94.97%), Query Frame = 0

Query: 1   MADETLVEAALRVLNTSDPFEKAELGDSVASRWLNGAISRAYDPSADLAVPDRPARLSDV 60
           MA ETLVEAALRVLNTSDPFEKAELGD VASRWLNGAIS  YDPSADLAVPDRPARLSDV
Sbjct: 1   MAAETLVEAALRVLNTSDPFEKAELGDHVASRWLNGAISSPYDPSADLAVPDRPARLSDV 60

Query: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV 120
           KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV
Sbjct: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV 120

Query: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180
           RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL
Sbjct: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180

Query: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCGAGVKWFKYLCQRTGKQIMNGTE--- 240
           DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHC AGVKWFKY+CQR+G + ++  +   
Sbjct: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCAAGVKWFKYVCQRSGNRKLDEDDAGA 240

Query: 241 EDNAMEMDEEEIIHKFHEIVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAFKRDPTL 296
           +DNAMEM++EE IHKFHEIVRK+FRGPLKPPFNEVARKAAGFGP+WYEPLAFK DP L
Sbjct: 241 KDNAMEMEKEETIHKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPIL 298

BLAST of Tan0012268 vs. NCBI nr
Match: XP_022990415.1 (uncharacterized protein LOC111487281 [Cucurbita maxima])

HSP 1 Score: 552.7 bits (1423), Expect = 1.9e-153
Identity = 273/310 (88.06%), Postives = 285/310 (91.94%), Query Frame = 0

Query: 1   MADETLVEAALRVLNTSDPFEKAELGDSVASRWLNGAISRAYDPSADLAVPDRPARLSDV 60
           MADETLVEAALRVLNTSDPFEKAELGD VASRWLNG ISR YDPS+DL+VPDRPARLSDV
Sbjct: 1   MADETLVEAALRVLNTSDPFEKAELGDKVASRWLNGMISRPYDPSSDLSVPDRPARLSDV 60

Query: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV 120
           KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQE MPREFFTDFV
Sbjct: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEEMPREFFTDFV 120

Query: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180
           RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATS+DLLARLAIEHCVHEARGL
Sbjct: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSRDLLARLAIEHCVHEARGL 180

Query: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCGAGVKWFKYLCQRTGKQIMNGT---- 240
           DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHC AG+KWFKYL QR+GKQ ++G     
Sbjct: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCAAGLKWFKYLSQRSGKQTLDGDGLLL 240

Query: 241 -----EEDNA-MEMDEEEIIHKFHEIVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAFK 300
                 EDNA +EM+ EEIIHKFH IVR +FRGPLKPPFNEVARKAAGFGPRWYEPLAFK
Sbjct: 241 LHQDGAEDNASLEMENEEIIHKFHAIVRNYFRGPLKPPFNEVARKAAGFGPRWYEPLAFK 300

BLAST of Tan0012268 vs. ExPASy TrEMBL
Match: A0A6J1GR27 (uncharacterized protein LOC111456685 OS=Cucurbita moschata OX=3662 GN=LOC111456685 PE=4 SV=1)

HSP 1 Score: 563.1 bits (1450), Expect = 6.8e-157
Identity = 275/310 (88.71%), Postives = 287/310 (92.58%), Query Frame = 0

Query: 1   MADETLVEAALRVLNTSDPFEKAELGDSVASRWLNGAISRAYDPSADLAVPDRPARLSDV 60
           MADETLVEAALRVLNTSDPFEKAELGD VASRWLNG ISR YDPS+DLAVPDRPARLSDV
Sbjct: 1   MADETLVEAALRVLNTSDPFEKAELGDKVASRWLNGVISRPYDPSSDLAVPDRPARLSDV 60

Query: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV 120
           KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQE MPREFFTDFV
Sbjct: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEEMPREFFTDFV 120

Query: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180
           RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL
Sbjct: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180

Query: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCGAGVKWFKYLCQRTGKQIM------- 240
           DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHC AGVKWF+YLCQR+GKQ +       
Sbjct: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCAAGVKWFRYLCQRSGKQTLDRDGLLL 240

Query: 241 ---NGTEEDNAMEMDEEEIIHKFHEIVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAFK 300
              +G E++ ++EM+ EEIIHKFH IVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAFK
Sbjct: 241 LHQDGAEDNASLEMENEEIIHKFHAIVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAFK 300

BLAST of Tan0012268 vs. ExPASy TrEMBL
Match: A0A6J1JQ17 (uncharacterized protein LOC111487281 OS=Cucurbita maxima OX=3661 GN=LOC111487281 PE=4 SV=1)

HSP 1 Score: 552.7 bits (1423), Expect = 9.1e-154
Identity = 273/310 (88.06%), Postives = 285/310 (91.94%), Query Frame = 0

Query: 1   MADETLVEAALRVLNTSDPFEKAELGDSVASRWLNGAISRAYDPSADLAVPDRPARLSDV 60
           MADETLVEAALRVLNTSDPFEKAELGD VASRWLNG ISR YDPS+DL+VPDRPARLSDV
Sbjct: 1   MADETLVEAALRVLNTSDPFEKAELGDKVASRWLNGMISRPYDPSSDLSVPDRPARLSDV 60

Query: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV 120
           KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQE MPREFFTDFV
Sbjct: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEEMPREFFTDFV 120

Query: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180
           RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATS+DLLARLAIEHCVHEARGL
Sbjct: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSRDLLARLAIEHCVHEARGL 180

Query: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCGAGVKWFKYLCQRTGKQIMNGT---- 240
           DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHC AG+KWFKYL QR+GKQ ++G     
Sbjct: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCAAGLKWFKYLSQRSGKQTLDGDGLLL 240

Query: 241 -----EEDNA-MEMDEEEIIHKFHEIVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAFK 300
                 EDNA +EM+ EEIIHKFH IVR +FRGPLKPPFNEVARKAAGFGPRWYEPLAFK
Sbjct: 241 LHQDGAEDNASLEMENEEIIHKFHAIVRNYFRGPLKPPFNEVARKAAGFGPRWYEPLAFK 300

BLAST of Tan0012268 vs. ExPASy TrEMBL
Match: A0A0A0LMC3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G385090 PE=4 SV=1)

HSP 1 Score: 547.7 bits (1410), Expect = 2.9e-152
Identity = 268/297 (90.24%), Postives = 281/297 (94.61%), Query Frame = 0

Query: 1   MADETLVEAALRVLNTSDPFEKAELGDSVASRWLNGAISRAYDPSADLAVPDRPARLSDV 60
           MADETLVEAALRVLNTSDPFEKAELGD+VASRWLNGAIS  YDPSADL VPDRPARLS+V
Sbjct: 1   MADETLVEAALRVLNTSDPFEKAELGDNVASRWLNGAISNPYDPSADLPVPDRPARLSNV 60

Query: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV 120
           KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV
Sbjct: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV 120

Query: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180
           RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL
Sbjct: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180

Query: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCGAGVKWFKYLCQRTGKQIMNGTE--- 240
           DVLPTTI RFRNGGDNETADLLEKVVYPEEVTHC AGVKWFKYLCQR+  + ++  +   
Sbjct: 181 DVLPTTIYRFRNGGDNETADLLEKVVYPEEVTHCAAGVKWFKYLCQRSIDRKLDEDDDGA 240

Query: 241 EDNAMEMDEEEIIHKFHEIVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAFKRDPT 295
           E NAMEM++EE I+KFHE+VRK+FRGPLKPPFNEVARKAAGFGP+WYEPLAFK DPT
Sbjct: 241 ESNAMEMEKEETINKFHEVVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPT 297

BLAST of Tan0012268 vs. ExPASy TrEMBL
Match: A0A1S3C3M9 (uncharacterized protein HI_0077 OS=Cucumis melo OX=3656 GN=LOC103496484 PE=4 SV=1)

HSP 1 Score: 543.5 bits (1399), Expect = 5.5e-151
Identity = 267/297 (89.90%), Postives = 279/297 (93.94%), Query Frame = 0

Query: 1   MADETLVEAALRVLNTSDPFEKAELGDSVASRWLNGAISRAYDPSADLAVPDRPARLSDV 60
           MADETLVEAALRVLNTSDPFEKA LGD+VASRWLNGAIS  YDPSADL VPDRPARLS+V
Sbjct: 1   MADETLVEAALRVLNTSDPFEKAGLGDNVASRWLNGAISSLYDPSADLPVPDRPARLSNV 60

Query: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV 120
           KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV
Sbjct: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV 120

Query: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180
           RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL
Sbjct: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180

Query: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCGAGVKWFKYLCQRTGKQIMNGTE--- 240
           DVLPTTI RFRNGGDNETADLLEKVVYPEEVTHC AGVKWFKYLCQR+  + ++  +   
Sbjct: 181 DVLPTTIYRFRNGGDNETADLLEKVVYPEEVTHCAAGVKWFKYLCQRSMDRKLDEDDDGA 240

Query: 241 EDNAMEMDEEEIIHKFHEIVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAFKRDPT 295
           EDNAMEM++EE I KFHE+VRK+FRGPLKPPFNEVARKAAGFGP+WYEPLAFK D T
Sbjct: 241 EDNAMEMEKEETIKKFHEVVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDST 297

BLAST of Tan0012268 vs. ExPASy TrEMBL
Match: A0A6J1BXS8 (uncharacterized protein LOC111006681 OS=Momordica charantia OX=3673 GN=LOC111006681 PE=4 SV=1)

HSP 1 Score: 539.3 bits (1388), Expect = 1.0e-149
Identity = 259/292 (88.70%), Postives = 276/292 (94.52%), Query Frame = 0

Query: 1   MADETLVEAALRVLNTSDPFEKAELGDSVASRWLNGAISRAYDPSADLAVPDRPARLSDV 60
           M DETLVEAALRVLNTSDP+EKAELGDSVASRWL+GAIS  YDPS DL VPDRPARLSDV
Sbjct: 1   MGDETLVEAALRVLNTSDPYEKAELGDSVASRWLSGAISLPYDPSVDLQVPDRPARLSDV 60

Query: 61  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV 120
           KLVSPSLMPKLG+AGSL SRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV
Sbjct: 61  KLVSPSLMPKLGRAGSLHSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFV 120

Query: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGL 180
           RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSA ATSKDLLARLAIEHCVHEARGL
Sbjct: 121 RVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSATATSKDLLARLAIEHCVHEARGL 180

Query: 181 DVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCGAGVKWFKYLCQRTGKQIMNGTEEDN 240
           DVLPTTI RFRNGGD+ETADLLEKVVYPEEVTHCGAGVKWFKYLC+R+GK +++  +E N
Sbjct: 181 DVLPTTICRFRNGGDDETADLLEKVVYPEEVTHCGAGVKWFKYLCERSGKLMLDEEDEAN 240

Query: 241 AMEMDEEEIIHKFHEIVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAFKRD 293
           A++M++E+II+KFHEIVR +FRGPLKPPFNE ARKAAGFGPRWYEPLA K D
Sbjct: 241 AVKMEDEDIINKFHEIVRNYFRGPLKPPFNEAARKAAGFGPRWYEPLALKTD 292

BLAST of Tan0012268 vs. TAIR 10
Match: AT5G04520.1 (Protein of unknown function DUF455 )

HSP 1 Score: 476.1 bits (1224), Expect = 2.1e-134
Identity = 231/289 (79.93%), Postives = 255/289 (88.24%), Query Frame = 0

Query: 4   ETLVEAALRVLNTSDPFEKAELGDSVASRWLNGAISRAYDPSADLAVPDRPARLSDVKLV 63
           ETL+E+A+R+LNTSDP EKA LGDS+A +WL GAI+  YDP+ D  VPDRPARL  VKLV
Sbjct: 4   ETLIESAIRILNTSDPHEKARLGDSIAVKWLQGAIAEPYDPTVDFPVPDRPARL-PVKLV 63

Query: 64  SPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPREFFTDFVRVA 123
           SPSLMPKLG+AGSLQSRQAIVHSL HTESWAIDLSWDIIARFGKQE MPR+FFTDFVRVA
Sbjct: 64  SPSLMPKLGRAGSLQSRQAIVHSLAHTESWAIDLSWDIIARFGKQEKMPRDFFTDFVRVA 123

Query: 124 QDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVL 183
           QDEGRHFTLLAARL+E+GS YGALPAHDGLWDSA ATS DLLARLAIEHCVHEARGLDVL
Sbjct: 124 QDEGRHFTLLAARLEEIGSSYGALPAHDGLWDSATATSHDLLARLAIEHCVHEARGLDVL 183

Query: 184 PTTISRFRNGGDNETADLLEKVVYPEEVTHCGAGVKWFKYLCQRT--GKQIMNGTEEDNA 243
           PTTISRFRNGGDNETADLLEKVVYPEE+THC AGVKWFKYLC+R+   +  ++  E D++
Sbjct: 184 PTTISRFRNGGDNETADLLEKVVYPEEITHCAAGVKWFKYLCERSKDPEFTISSKESDDS 243

Query: 244 MEMDEEEIIHKFHEIVRKHFRGPLKPPFNEVARKAAGFGPRWYEPLAFK 291
                EEII+KFH +VR+HFRGPLKPPFN  ARKAAGFGP+WYEPLA K
Sbjct: 244 ----NEEIINKFHSVVREHFRGPLKPPFNAEARKAAGFGPQWYEPLAVK 287

BLAST of Tan0012268 vs. TAIR 10
Match: AT1G06240.1 (Protein of unknown function DUF455 )

HSP 1 Score: 154.5 bits (389), Expect = 1.4e-37
Identity = 102/287 (35.54%), Postives = 141/287 (49.13%), Query Frame = 0

Query: 2   ADETLVEAALRVLNTSDPFEKAELGDSVASRWLNGAISRAYDPSADLA-VPDRPARLSDV 61
           A  +L +    VL+TSDP  K+ +     SRW      R   P   ++ +P  PAR    
Sbjct: 78  AASSLADLGALVLSTSDPLSKSHISHLAFSRW-----RRENLPVGSISHLPSSPARPPKP 137

Query: 62  KLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGK-QEGMPREFFTDF 121
            LV+ + +P   K  +L     ++H+L H E  AIDL+WD +ARF    + +   FF DF
Sbjct: 138 LLVATNQVPN-PKDSNLPLNAHMLHNLAHVELNAIDLAWDTVARFSPFFDLLGHNFFDDF 197

Query: 122 VRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARG 181
             VA DE RHF   + RL ELG  YG +PA++ L      TS ++ ARLA    V EARG
Sbjct: 198 AHVADDESRHFLWCSQRLAELGFKYGDIPANNLLMRECEKTSNNVAARLACIPLVQEARG 257

Query: 182 LDVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCGAGVKWFKYLCQRTGKQIMNGTEED 241
           LD  P  + R    GDN T+ ++ K+   EEV H   GV WF  +CQ+     MN     
Sbjct: 258 LDAGPRLVKRLTGFGDNRTSKIVAKIA-EEEVAHVAVGVDWFLSVCQK-----MNRAPSP 317

Query: 242 NAMEMDEEEIIHKFHEIVRKHFRGPLKPPFNEVARKAAGFGPRWYEP 287
                        F +++ K +   L+ PFN  AR+ AG    WY+P
Sbjct: 318 T------------FKDLI-KEYGVELRGPFNHSAREVAGIPRDWYDP 339

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P439351.6e-2230.22Uncharacterized protein HI_0077 OS=Haemophilus influenzae (strain ATCC 51907 / D... [more]
Match NameE-valueIdentityDescription
KAG6602270.11.4e-15689.39hypothetical protein SDJN03_07503, partial [Cucurbita argyrosperma subsp. sorori... [more]
XP_022954428.11.4e-15688.71uncharacterized protein LOC111456685 [Cucurbita moschata][more]
XP_023551934.12.0e-15589.35uncharacterized protein LOC111809758 [Cucurbita pepo subsp. pepo][more]
XP_038884770.11.7e-15491.28uncharacterized protein HI_0077 [Benincasa hispida][more]
XP_022990415.11.9e-15388.06uncharacterized protein LOC111487281 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A6J1GR276.8e-15788.71uncharacterized protein LOC111456685 OS=Cucurbita moschata OX=3662 GN=LOC1114566... [more]
A0A6J1JQ179.1e-15488.06uncharacterized protein LOC111487281 OS=Cucurbita maxima OX=3661 GN=LOC111487281... [more]
A0A0A0LMC32.9e-15290.24Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G385090 PE=4 SV=1[more]
A0A1S3C3M95.5e-15189.90uncharacterized protein HI_0077 OS=Cucumis melo OX=3656 GN=LOC103496484 PE=4 SV=... [more]
A0A6J1BXS81.0e-14988.70uncharacterized protein LOC111006681 OS=Momordica charantia OX=3673 GN=LOC111006... [more]
Match NameE-valueIdentityDescription
AT5G04520.12.1e-13479.93Protein of unknown function DUF455 [more]
AT1G06240.11.4e-3735.54Protein of unknown function DUF455 [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR011197Uncharacterised conserved protein UCP012318PIRSFPIRSF012318UCP012318coord: 1..298
e-value: 3.6E-98
score: 326.3
IPR007402Protein of unknown function DUF455PFAMPF04305DUF455coord: 15..285
e-value: 3.4E-97
score: 324.8
NoneNo IPR availablePANTHERPTHR42782:SF2SI:CH73-314G15.3coord: 4..291
NoneNo IPR availablePANTHERPTHR42782SI:CH73-314G15.3coord: 4..291
NoneNo IPR availableCDDcd00657Ferritin_likecoord: 82..221
e-value: 2.53892E-15
score: 69.4472
IPR009078Ferritin-like superfamilySUPERFAMILY47240Ferritin-likecoord: 82..226

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0012268.1Tan0012268.1mRNA