Tan0011932 (gene) Snake gourd v1

Overview
Name: Tan0011932
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: SOUL heme-binding protein
Location: LG09: 66118463 .. 66122225 (-)
RNA-Seq Expression: Tan0011932
Synteny: Tan0011932
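The Location field above packs the linkage group, coordinates, and strand into one string. A minimal sketch of how it could be split into its parts is shown here; the function name `parse_location` and the assumption that coordinates are 1-based and inclusive are illustrative and are not stated by the source record.

```python
import re

# Pattern for strings such as "LG09: 66118463 .. 66122225 (-)".
LOCATION_RE = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text: str) -> dict:
    """Split a location string into sequence ID, start, end, strand, and span.

    Assumption: coordinates are 1-based and inclusive, so the span covers
    end - start + 1 bases.
    """
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seq_id, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seq_id": seq_id,
        "start": start,
        "end": end,
        "strand": strand,
        "span": end - start + 1,
    }


location = parse_location("LG09: 66118463 .. 66122225 (-)")
print(location)
```

Under the inclusive-coordinate assumption, the gene spans 3763 bases on the minus strand of LG09.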
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGCTGCCCAAGTTTCAATCCAAAACTTCCTCTCAATCCCAACCGTTGGTTTTGGTCTCCGGCCGAGGAAATCCGGCGGACCGACCGGCGTCGCACAAAGCAGAACCGTAGGCCAAAATTCGAAGTGGGTTATTCGATCAAAATTGGGAGATCATCAAAGGCCTCGGAAATCGACGGTGGACGTTGACCGATTGGTGGATTTCTTGTACGAGGATCTCCGGCATGTGTTCGATGAGCAGGGAATTGATCGGACGGCGTACGATGAACAAGTGAGATTTCGAGACCCAGTTACAAAGTATAATGGCATTACAGGGTATTTGCTGAATATTGCCCTGTTGCGAGAATTCTTCAGGCCTGAGATCATAGTGCACTGGGTCAAAAAGGTTCTTATCACTTCTCTTTTAGGTCTATGGATTATTTAGCCCTTCTTTTTTTAACTTCAATCATGATATCGAACTATCGACTTTTAAAATGGTAATAATTGTGTTTGTAGTATAAGTTGTTTTGTTGGTTGTCTATGGTATAGTGTTTAAGGGAGATGTCCAAGATCCGATATTTCTCTTATGGTGCTAATTTAAGTATACTTCGATGGATAAAACACAAATTACTATATCTTAAACGTTGATGATTCAATCCCCCACTTCTGTAATTGTTAAACTAAAAGAACATTTCTCATATGCTAAAATAATGGCCTTTTTTTTCTTTTGAATTCAACAATTGCGGAATAAGAGATTGAACCATCAACCTTTGGATGAAGCCCTGAACAGAAATCCTGATTCGCCACCGTTTAGGGTAGTAATTGGCCGTATTTTATTTATTGAGTTGTGTTTAGATTGACAATTAATAGTTGTGCTAAGTAATAAGATTAGTTGTTACATAGGTCTTTCACTTTTTAATTTTGTATCCGATAGATCTCTATCTAATAGATTTCTAAACATTTTGTGTCTATTAGACCCGTTTTAAGAGGGTCTAACAAATTTCTAAAATTTTAATTTTGTATTTATAAATTTCTGACATATTTGATATTTAAAAAAATAATTTATCTACTATACACAAAATTGAGAATATTAAACTTCTAAATTTATGTCTAATAGAACATTAAACATTTAATTTATTATTTAATAGGTTTGTAAATTTAGAAAATATTGAATAGGTTAGAATTTTTTTTTTTAAAAAAAGCCATTAAACTTTGTACTTTATTTCAAAAATACCTTGAACTTTCAAAATTAGTTTCATTTAATACCCTTTAACTTTCAAACGTTTTATTCATACCTTTAAACTTAAAAAAAAAATGTTAGTATATAGACAGAAATGGATAGTATCTCGTTTTAAAATATTTTTTGGACTTTCAAAAGTTGTATTATTATACCATTAACCCTTAAAAACAAAAGTTAAAAAATATTCTTACCATTAGTATATAGATGACTATCGTTTATCTACACTCATTACCCCTTTTTTCTATTTCATCTCTCCTGATTTGTTTTGATCTATAGTCTAGCAACTTCTTTATTTTTCTCATTTTTTTTCCTTTCTTTTTCAATACCCTCTTACTTTCTTTTTCTCTATTTTCCAATCTCTAAAACTTCTTCCATTTCTTCTATCTTCTTAAATATTAGAACTTTAAATAACCAAGAGGAGCTCTCTCTTATTTTTAAGTACATACTCAAACATAAATATTAGAAATTTCAACACCCAAATAGTAAGTTATCTTGAGTTGGAAAATAATCAATTTGAAAAATAACTCATATAGGTGCCTTTTTTTTTAATATATAAATACTGTCATTTTTACTCCACTTATCTAATTCTCAATCATGAGGAATCTTCTTAACTGTCCACAAAAACTAGACTAATTTGAACTATAAAAATCATTAATAAAATCTGCATCAACTACAAATTTAATTATATATGCATATGTTTTCTTTCTTTTCTCTCCCACTAAATGTCCTCGAAATTATGACCCAAGTTCTCAACATGTCTTTCTTTTTGTGTATTAAAT
CCCAATGTACTCATTCAAATTACTAATGTACAACTTTTGTTAAACCTAAAGTCCTAAGGTTCTTTGATGGTTCCATTATTCTCTCTCTCTTTTGCTATATTCTATAATTTGATTGAATAGGGCAGTCTTAGAGATTGAAAAATGGAAGAAAGAAGTAAAGAGGGTATTGAAAAAGAAAGAAAAAAATAGAGAGAGAAAAGGAGAGAAGTGTTAGACCAAAGATGGAAGCAAATCAAGAGAAAAAAGATATAGAAGTCTGAGGTAATGGAATGTAGATACATGTTTTTTCGGTTAAAGTGTGACGCTTGAGGGAGAGGGGTACATGAATCAACTGAGTCCATATCTGAATGAGAGTGATCCTGAGGTTATGAAAGTATGGATGAGAAAGGCTTGAAAAAATTGATAGCTACTACCTATACCAACAAAATGTACATTTTTTTTGTGGCTCAATCATAGGAACTTCTAAGTTAAACATGCTCGGCTTGGAGTAATTCTATGTTGGGTGACCTCTCGAAAATGTTTTTGTCACATATAAGAGTATTTTTGAACTTTTTTTTTTTAAAGTTCAAGGGTATTAATGAAAGTTTTGAAAGTTCAAGGGTATTTTTTAATTTTTTTTTTAGTTGAAGGGTATTAATGAAACTTTTGAAAGTTCAACGGTATTTTTGAAACAAAAGTACAAAGTTCAGAAGTATTTTTTTTATATATAATTTAGCCAATATGTTAGAACTTAGTAGACATGAAATAGAAAGTTATAGAATTTATTAGATCACAATCGTATCAAGTTCTATAATATGTGACACTATTTTTGTCACAGAAATGCTACTTAAGAGTTAAGAGAGAAAAGACCCTTTGTCCTTTTAAAACATTATTGACAATGAAATTTTCACAAGGCTCAACAACATCGGGTTAATTTTTCTTGGGAATGATACTAGATTCTAGAACCATATATTCGTGAATCCCTGAATTCTTTTTTTAGTTCAACAACTATATATATATATATATTATATATATATATATGGAGTAATGTGATTCCAACCTTGACTTCCCAGGTCGTGAATGGAAGGTATATATCCTAAATAATCAATTAAACTTTGTTCAAGTTGATTAATTATATAAAAATTAGACATAAGGAGGTGAAATAACATTACACAAGTTTAGACACAAGAGTCCAGACTTCTTACTCTGGCATCATGTTAAATCACCACTAAATTTAAAAACTTAGTTAGTGACAAGTAAATTTAATCTTAATCAAATGAAAAAAAAAACGTTAAGATCGTTTTTAGACATATCCTAACTTATTCAGATTGAGTTTGTCATAGAGTGGACCATATGAAATAACTACAAGATGGACTGCAGTGATGAAGTTCATCCTTCTACCATGGAAACCAGAATTAGTTTTGACTGGAACTTCCATTATGGGTATCAATTCACAGACGGGCAAGTTTTGTAGCCATGTGGTAATTAAAATGTCCTTCATATATTAATTGTCATCGCTGACTTATTACCCACTTCTTAATTTCAATTCCTAATTCATCTTTTTAATTAATTAATTCCAGGATCTTTGGGATTCAGTTCAAAATAATGACTACTTTTCTCTAGAAGGCTTATGGGATGTATTTAAACAGGTATATAATTCTCATGGTGAAAAATTAATCCACAAGTAAAGATTTTGTTATAAATTTTGATGTATTTTTTTTTCTCTTTTCAGTTTAGGTTTTATGAGACTCCAGAATTGGAATCACCCAAATATCAGATATTGA

mRNA sequence

ATGGCTGCTGCCCAAGTTTCAATCCAAAACTTCCTCTCAATCCCAACCGTTGGTTTTGGTCTCCGGCCGAGGAAATCCGGCGGACCGACCGGCGTCGCACAAAGCAGAACCGTAGGCCAAAATTCGAAGTGGGTTATTCGATCAAAATTGGGAGATCATCAAAGGCCTCGGAAATCGACGGTGGACGTTGACCGATTGGTGGATTTCTTGTACGAGGATCTCCGGCATGTGTTCGATGAGCAGGGAATTGATCGGACGGCGTACGATGAACAAGTGAGATTTCGAGACCCAGTTACAAAGTATAATGGCATTACAGGGTATTTGCTGAATATTGCCCTGTTGCGAGAATTCTTCAGGCCTGAGATCATAGTGCACTGGGTCAAAAAGAGTGGACCATATGAAATAACTACAAGATGGACTGCAGTGATGAAGTTCATCCTTCTACCATGGAAACCAGAATTAGTTTTGACTGGAACTTCCATTATGGGTATCAATTCACAGACGGGCAAGTTTTGTAGCCATGTGGATCTTTGGGATTCAGTTCAAAATAATGACTACTTTTCTCTAGAAGGCTTATGGGATGTATTTAAACAGAATTGGAATCACCCAAATATCAGATATTGA

Coding sequence (CDS)

ATGGCTGCTGCCCAAGTTTCAATCCAAAACTTCCTCTCAATCCCAACCGTTGGTTTTGGTCTCCGGCCGAGGAAATCCGGCGGACCGACCGGCGTCGCACAAAGCAGAACCGTAGGCCAAAATTCGAAGTGGGTTATTCGATCAAAATTGGGAGATCATCAAAGGCCTCGGAAATCGACGGTGGACGTTGACCGATTGGTGGATTTCTTGTACGAGGATCTCCGGCATGTGTTCGATGAGCAGGGAATTGATCGGACGGCGTACGATGAACAAGTGAGATTTCGAGACCCAGTTACAAAGTATAATGGCATTACAGGGTATTTGCTGAATATTGCCCTGTTGCGAGAATTCTTCAGGCCTGAGATCATAGTGCACTGGGTCAAAAAGAGTGGACCATATGAAATAACTACAAGATGGACTGCAGTGATGAAGTTCATCCTTCTACCATGGAAACCAGAATTAGTTTTGACTGGAACTTCCATTATGGGTATCAATTCACAGACGGGCAAGTTTTGTAGCCATGTGGATCTTTGGGATTCAGTTCAAAATAATGACTACTTTTCTCTAGAAGGCTTATGGGATGTATTTAAACAGAATTGGAATCACCCAAATATCAGATATTGA

Protein sequence

MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQNWNHPNIRY
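The CDS and protein sequence above can be cross-checked by translating codons. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1); the record itself does not state which table was used, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code (assumed), laid out in the conventional TCAG order:
# the first base varies slowest and the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First ten codons and last six codons of the CDS listed above.
cds_start = "ATGGCTGCTGCCCAAGTTTCAATCCAAAAC"
cds_end = "CCAAATATCAGATATTGA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MAAAQVSIQN and PNIRY followed by a stop, which agrees with the start and end of the listed protein sequence.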
Homology
BLAST of Tan0011932 vs. NCBI nr
Match: XP_022965046.1 (uncharacterized protein LOC111465022 [Cucurbita maxima])

HSP 1 Score: 350.5 bits (898), Expect = 9.8e-93
Identity = 167/198 (84.34%), Positives = 180/198 (90.91%), Query Frame = 0

Query: 1   MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKST 60
           MA AQVS QNFLSIPTV FG+RPRKS GPT  AQSRT   N KW IRS L D QR +K T
Sbjct: 1   MATAQVSFQNFLSIPTVDFGVRPRKSYGPTRAAQSRTASPNWKWSIRSTLAD-QRHQKPT 60

Query: 61  VDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLLNIALLREFFRP 120
           VDVDRLVDF+Y+DLRHVFDEQGIDRTAYDE+VRFRDP+TKY+GI+GY+LNIALLREFFRP
Sbjct: 61  VDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKYDGISGYMLNIALLREFFRP 120

Query: 121 EIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDS 180
           EII+HWVKK+GPYEITTRWTAVMKFILLPWKPELVLTGTSIMGIN QTGKFCSHVDLWDS
Sbjct: 121 EIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDS 180

Query: 181 VQNNDYFSLEGLWDVFKQ 199
           +QNNDYFS+E LWDVFKQ
Sbjct: 181 LQNNDYFSVEALWDVFKQ 197

BLAST of Tan0011932 vs. NCBI nr
Match: KAG7021789.1 (hypothetical protein SDJN02_15516 [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 345.9 bits (886), Expect = 2.4e-91
Identity = 167/198 (84.34%), Positives = 179/198 (90.40%), Query Frame = 0

Query: 1   MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKST 60
           MA AQVS QNFLSIPTV FG+RPRKS GPT  AQSRT   N K  IRS LGD  R +K T
Sbjct: 1   MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSR-QKPT 60

Query: 61  VDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLLNIALLREFFRP 120
           VDVDRLVDF+Y+DLRHVFDEQGIDRTAYDE+VRFRDP+TKY+GI+GY+LNIALLREFFRP
Sbjct: 61  VDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKYDGISGYMLNIALLREFFRP 120

Query: 121 EIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDS 180
           EII+HWVKK+GPYEITTRWTAVMKFILLPWKPELVLTGTSIMGIN QTGKFCSHVDLWDS
Sbjct: 121 EIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDS 180

Query: 181 VQNNDYFSLEGLWDVFKQ 199
           +QNNDYFSLE LWDVFKQ
Sbjct: 181 LQNNDYFSLEALWDVFKQ 197

BLAST of Tan0011932 vs. NCBI nr
Match: KAG6587902.1 (hypothetical protein SDJN03_16467, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 344.4 bits (882), Expect = 7.0e-91
Identity = 166/198 (83.84%), Positives = 179/198 (90.40%), Query Frame = 0

Query: 1   MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKST 60
           MA AQVS QNFLSIPTV FG+RPRKS GPT  AQSRT   N K  IRS LGD  R +K T
Sbjct: 1   MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSR-QKPT 60

Query: 61  VDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLLNIALLREFFRP 120
           VDVDRLVDF+Y+DLRHVFDEQGIDRTAYDE+VRFRDP+TKY+GI+GY+LNIALLREFFRP
Sbjct: 61  VDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKYDGISGYMLNIALLREFFRP 120

Query: 121 EIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDS 180
           EII+HWVKK+GPYEITTRWTAVMKFILLPWKPELVLTGTSIMGIN +TGKFCSHVDLWDS
Sbjct: 121 EIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPKTGKFCSHVDLWDS 180

Query: 181 VQNNDYFSLEGLWDVFKQ 199
           +QNNDYFSLE LWDVFKQ
Sbjct: 181 LQNNDYFSLEALWDVFKQ 197

BLAST of Tan0011932 vs. NCBI nr
Match: XP_023531546.1 (uncharacterized protein LOC111793749 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 341.7 bits (875), Expect = 4.6e-90
Identity = 165/198 (83.33%), Positives = 177/198 (89.39%), Query Frame = 0

Query: 1   MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKST 60
           MA AQVS QNFLSIPTV FG+RPRKS GPT  AQSRT   N K  IRS L D  R +K T
Sbjct: 1   MATAQVSFQNFLSIPTVDFGVRPRKSYGPTRAAQSRTGSPNWKSSIRSTLADQSR-QKPT 60

Query: 61  VDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLLNIALLREFFRP 120
           VDVDRLVDF+Y+DLRHVFDEQGIDRTAYDE+VRFRDP+TKY+GI+GY+LNIALLREFFRP
Sbjct: 61  VDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKYDGISGYMLNIALLREFFRP 120

Query: 121 EIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDS 180
           EII HWVKK+GPYEITTRWTAVMKFILLPWKPELVLTGTSIMGIN QTGKFCSHVD+WDS
Sbjct: 121 EIIFHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDVWDS 180

Query: 181 VQNNDYFSLEGLWDVFKQ 199
           +QNNDYFSLE LWDVFKQ
Sbjct: 181 LQNNDYFSLEALWDVFKQ 197

BLAST of Tan0011932 vs. NCBI nr
Match: XP_022933414.1 (uncharacterized protein LOC111440839 [Cucurbita moschata])

HSP 1 Score: 337.8 bits (865), Expect = 6.6e-89
Identity = 162/198 (81.82%), Positives = 177/198 (89.39%), Query Frame = 0

Query: 1   MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKST 60
           MA AQVS QNFLSIPTV  G+RPRKS GPT  AQSRT   N K  IRS L D  R +K T
Sbjct: 1   MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSR-QKPT 60

Query: 61  VDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLLNIALLREFFRP 120
           VDVDRLVDF+Y+DLRHVFDEQGIDRTAYD++VRFRDP+TKY+GI+GY+LNIALLREFFRP
Sbjct: 61  VDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKYDGISGYMLNIALLREFFRP 120

Query: 121 EIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDS 180
           EII+HWVKK+GPYEITTRWTA+MKFILLPWKPELVLTGTSIMGIN QTGKFCSHVDLWDS
Sbjct: 121 EIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDS 180

Query: 181 VQNNDYFSLEGLWDVFKQ 199
           +QNNDYFS+E LWDVFKQ
Sbjct: 181 LQNNDYFSVEALWDVFKQ 197

BLAST of Tan0011932 vs. ExPASy TrEMBL
Match: A0A6J1HKM5 (uncharacterized protein LOC111465022 OS=Cucurbita maxima OX=3661 GN=LOC111465022 PE=3 SV=1)

HSP 1 Score: 350.5 bits (898), Expect = 4.7e-93
Identity = 167/198 (84.34%), Positives = 180/198 (90.91%), Query Frame = 0

Query: 1   MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKST 60
           MA AQVS QNFLSIPTV FG+RPRKS GPT  AQSRT   N KW IRS L D QR +K T
Sbjct: 1   MATAQVSFQNFLSIPTVDFGVRPRKSYGPTRAAQSRTASPNWKWSIRSTLAD-QRHQKPT 60

Query: 61  VDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLLNIALLREFFRP 120
           VDVDRLVDF+Y+DLRHVFDEQGIDRTAYDE+VRFRDP+TKY+GI+GY+LNIALLREFFRP
Sbjct: 61  VDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKYDGISGYMLNIALLREFFRP 120

Query: 121 EIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDS 180
           EII+HWVKK+GPYEITTRWTAVMKFILLPWKPELVLTGTSIMGIN QTGKFCSHVDLWDS
Sbjct: 121 EIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDS 180

Query: 181 VQNNDYFSLEGLWDVFKQ 199
           +QNNDYFS+E LWDVFKQ
Sbjct: 181 LQNNDYFSVEALWDVFKQ 197

BLAST of Tan0011932 vs. ExPASy TrEMBL
Match: A0A6J1EZQ2 (uncharacterized protein LOC111440839 OS=Cucurbita moschata OX=3662 GN=LOC111440839 PE=3 SV=1)

HSP 1 Score: 337.8 bits (865), Expect = 3.2e-89
Identity = 162/198 (81.82%), Positives = 177/198 (89.39%), Query Frame = 0

Query: 1   MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTGVAQSRTVGQNSKWVIRSKLGDHQRPRKST 60
           MA AQVS QNFLSIPTV  G+RPRKS GPT  AQSRT   N K  IRS L D  R +K T
Sbjct: 1   MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSR-QKPT 60

Query: 61  VDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLLNIALLREFFRP 120
           VDVDRLVDF+Y+DLRHVFDEQGIDRTAYD++VRFRDP+TKY+GI+GY+LNIALLREFFRP
Sbjct: 61  VDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKYDGISGYMLNIALLREFFRP 120

Query: 121 EIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVDLWDS 180
           EII+HWVKK+GPYEITTRWTA+MKFILLPWKPELVLTGTSIMGIN QTGKFCSHVDLWDS
Sbjct: 121 EIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDS 180

Query: 181 VQNNDYFSLEGLWDVFKQ 199
           +QNNDYFS+E LWDVFKQ
Sbjct: 181 LQNNDYFSVEALWDVFKQ 197

BLAST of Tan0011932 vs. ExPASy TrEMBL
Match: A0A6J1CV62 (uncharacterized protein LOC111014503 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111014503 PE=3 SV=1)

HSP 1 Score: 314.3 bits (804), Expect = 3.8e-82
Identity = 152/202 (75.25%), Positives = 173/202 (85.64%), Query Frame = 0

Query: 1   MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTG----VAQSRTVGQNSKWVIRSKLGDHQRP 60
           M   QVS+QNFLSIPTVG G RP+KSG  TG    + +SRT  +  K V+RS+L D + P
Sbjct: 1   MGGGQVSLQNFLSIPTVGCGFRPKKSGRKTGPEPRLLRSRT--KVRKCVVRSRLAD-RSP 60

Query: 61  RKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLLNIALLRE 120
            KSTVDVDRLVDFLYEDLRHVFD QGID TAYDE VRFRDP+TKYNGI GY+LNIALLR+
Sbjct: 61  PKSTVDVDRLVDFLYEDLRHVFDAQGIDPTAYDEHVRFRDPITKYNGIRGYMLNIALLRQ 120

Query: 121 FFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKFCSHVD 180
            FRP+ ++HWVKK+GPYEITTRWTAVMKF+LLPWKPELVLTGTSIM I+ +TGKFC+HVD
Sbjct: 121 LFRPQFLLHWVKKTGPYEITTRWTAVMKFVLLPWKPELVLTGTSIMDIDPETGKFCNHVD 180

Query: 181 LWDSVQNNDYFSLEGLWDVFKQ 199
           LWDSVQNN+YFSLEGLWD+FKQ
Sbjct: 181 LWDSVQNNNYFSLEGLWDIFKQ 199

BLAST of Tan0011932 vs. ExPASy TrEMBL
Match: A0A6J1CUY2 (uncharacterized protein LOC111014503 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111014503 PE=3 SV=1)

HSP 1 Score: 307.4 bits (786), Expect = 4.6e-80
Identity = 149/209 (71.29%), Positives = 168/209 (80.38%), Query Frame = 0

Query: 1   MAAAQVSIQNFLSIPTVGFGLRPRKSGG------PTGVAQSRTV-----GQNSKWVIRSK 60
           MAA Q+S+QNFLS PT GFG RP KSGG      P  + +SRTV      +NSKW +R  
Sbjct: 1   MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLS 60

Query: 61  LGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLL 120
           L D Q P KS VDVDRLVDFLYEDLRH+FDEQGIDRTAYDE VRFRDP+TK++ I+GY  
Sbjct: 61  LVD-QSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFRDPITKHDTISGYSF 120

Query: 121 NIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTG 180
           NI+LLRE FRPE  +HWVK++GPYEITTRWT VMKF+LLPWKPE + TG SIMGIN +TG
Sbjct: 121 NISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETG 180

Query: 181 KFCSHVDLWDSVQNNDYFSLEGLWDVFKQ 199
           KFCSHVDLWDS+QNNDYFSLEGL DVFKQ
Sbjct: 181 KFCSHVDLWDSIQNNDYFSLEGLLDVFKQ 208

BLAST of Tan0011932 vs. ExPASy TrEMBL
Match: A0A1S3CJ12 (uncharacterized protein LOC103501513 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501513 PE=3 SV=1)

HSP 1 Score: 305.8 bits (782), Expect = 1.3e-79
Identity = 150/207 (72.46%), Positives = 169/207 (81.64%), Query Frame = 0

Query: 1   MAAAQVSIQNFLSIPTVGFGLRPRKSGGPTG----VAQSRTVG-----QNSKWVIRSKLG 60
           MAA Q+S+QNFLS PT+   LRP KSG  T     + QSRT       QNSKWV+R  L 
Sbjct: 1   MAALQLSLQNFLSTPTLTSVLRPPKSGRLTNLLPRLLQSRTPAVKPNTQNSKWVVRFNLV 60

Query: 61  DHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFRDPVTKYNGITGYLLNI 120
           D Q P KSTVDV RLVDFLYEDL H+FDEQGIDRTAYDEQVRFRDP+TK++ I+GYL NI
Sbjct: 61  D-QSPPKSTVDVGRLVDFLYEDLSHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNI 120

Query: 121 ALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKF 180
           +LLRE FRPE  +HWVK++GPYEITTRWT +MKF LLPWKPEL+ TGTSIMGIN +TGKF
Sbjct: 121 SLLREIFRPEFFLHWVKQTGPYEITTRWTMIMKFALLPWKPELIFTGTSIMGINPETGKF 180

Query: 181 CSHVDLWDSVQNNDYFSLEGLWDVFKQ 199
           CSHVDLWDS+QNNDYFS+EGLWDVFKQ
Sbjct: 181 CSHVDLWDSIQNNDYFSVEGLWDVFKQ 206

BLAST of Tan0011932 vs. TAIR 10
Match: AT5G20140.1 (SOUL heme-binding family protein )

HSP 1 Score: 225.3 bits (573), Expect = 4.4e-59
Identity = 101/163 (61.96%), Positives = 128/163 (78.53%), Query Frame = 0

Query: 36  RTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFR 95
           R V    + ++  ++G       STV+++ LV FLYEDL H+FD+QGID+TAYDE+V+FR
Sbjct: 34  RNVTTRLRPILSLEVGKEVASAPSTVNMEELVGFLYEDLPHLFDDQGIDKTAYDERVKFR 93

Query: 96  DPVTKYNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELV 155
           DP+TK++ I+GYL NIA L+  F P+  +HW K++GPYEITTRWT VMKFI LPWKPELV
Sbjct: 94  DPITKHDTISGYLFNIAFLKNIFTPQFQLHWAKQTGPYEITTRWTMVMKFIPLPWKPELV 153

Query: 156 LTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ 199
            TG SIM +N +T KFCSH+DLWDS++NNDYFSLEGL DVFKQ
Sbjct: 154 FTGLSIMEVNPETNKFCSHLDLWDSIKNNDYFSLEGLVDVFKQ 196

BLAST of Tan0011932 vs. TAIR 10
Match: AT5G20140.2 (SOUL heme-binding family protein )

HSP 1 Score: 225.3 bits (573), Expect = 4.4e-59
Identity = 101/163 (61.96%), Positives = 128/163 (78.53%), Query Frame = 0

Query: 36  RTVGQNSKWVIRSKLGDHQRPRKSTVDVDRLVDFLYEDLRHVFDEQGIDRTAYDEQVRFR 95
           R V    + ++  ++G       STV+++ LV FLYEDL H+FD+QGID+TAYDE+V+FR
Sbjct: 34  RNVTTRLRPILSLEVGKEVASAPSTVNMEELVGFLYEDLPHLFDDQGIDKTAYDERVKFR 93

Query: 96  DPVTKYNGITGYLLNIALLREFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELV 155
           DP+TK++ I+GYL NIA L+  F P+  +HW K++GPYEITTRWT VMKFI LPWKPELV
Sbjct: 94  DPITKHDTISGYLFNIAFLKNIFTPQFQLHWAKQTGPYEITTRWTMVMKFIPLPWKPELV 153

Query: 156 LTGTSIMGINSQTGKFCSHVDLWDSVQNNDYFSLEGLWDVFKQ 199
            TG SIM +N +T KFCSH+DLWDS++NNDYFSLEGL DVFKQ
Sbjct: 154 FTGLSIMEVNPETNKFCSHLDLWDSIKNNDYFSLEGLVDVFKQ 196

BLAST of Tan0011932 vs. TAIR 10
Match: AT2G46100.1 (Nuclear transport factor 2 (NTF2) family protein )

HSP 1 Score: 48.5 bits (114), Expect = 7.4e-06
Identity = 36/128 (28.12%), Positives = 59/128 (46.09%), Query Frame = 0

Query: 66  LVDFLYEDL-RHVFDEQGIDRTAYDEQVRFRDPVTKYNGIT---------GYLL---NIA 125
           +VD + +D  R  F    +    Y+E+  F DP   + G+          G L+   N+ 
Sbjct: 105 VVDSIKQDFKRSYFVTGNLTPEVYEEKCEFADPAGSFKGLARFKRNCTNFGSLIEKSNMK 164

Query: 126 LLR-EFFRPEIIVHWVKKSGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINSQTGKF 180
           L++ E F  + I HW           +++ VM F   PWKP L  TG +    ++++GK 
Sbjct: 165 LMKWENFEDKGIGHW-----------KFSCVMSF---PWKPILSATGYTEYYFDTESGKI 218

The following BLAST results are available for this feature:
Tan0011932 vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_022965046.1  9.8e-93  84.34         uncharacterized protein LOC111465022 [Cucurbita maxima]
KAG7021789.1    2.4e-91  84.34         hypothetical protein SDJN02_15516 [Cucurbita argyrosperma subsp. argyrosperma]
KAG6587902.1    7.0e-91  83.84         hypothetical protein SDJN03_16467, partial [Cucurbita argyrosperma subsp. sorori...
XP_023531546.1  4.6e-90  83.33         uncharacterized protein LOC111793749 [Cucurbita pepo subsp. pepo]
XP_022933414.1  6.6e-89  81.82         uncharacterized protein LOC111440839 [Cucurbita moschata]

Tan0011932 vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1HKM5      4.7e-93  84.34         uncharacterized protein LOC111465022 OS=Cucurbita maxima OX=3661 GN=LOC111465022...
A0A6J1EZQ2      3.2e-89  81.82         uncharacterized protein LOC111440839 OS=Cucurbita moschata OX=3662 GN=LOC1114408...
A0A6J1CV62      3.8e-82  75.25         uncharacterized protein LOC111014503 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1CUY2      4.6e-80  71.29         uncharacterized protein LOC111014503 isoform X1 OS=Momordica charantia OX=3673 G...
A0A1S3CJ12      1.3e-79  72.46         uncharacterized protein LOC103501513 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...

Tan0011932 vs. TAIR 10
Match Name      E-value  Identity (%)  Description
AT5G20140.1     4.4e-59  61.96         SOUL heme-binding family protein
AT5G20140.2     4.4e-59  61.96         SOUL heme-binding family protein
AT2G46100.1     7.4e-06  28.13         Nuclear transport factor 2 (NTF2) family protein
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                      Source       Source Term     Source Description                Alignment
IPR018790  Protein of unknown function DUF2358  PFAM         PF10184         DUF2358                           coord: 65..174; e-value: 1.8E-24; score: 86.2
IPR006917  SOUL haem-binding protein            PANTHER      PTHR11220       HEME-BINDING PROTEIN-RELATED      coord: 46..199
None       No IPR available                     PANTHER      PTHR11220:SF50  SOUL HEME-BINDING FAMILY PROTEIN  coord: 46..199
IPR032710  NTF2-like domain superfamily         SUPERFAMILY  54427           NTF2-like                         coord: 72..179

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0011932.1  Tan0011932.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0110165      cellular anatomical entity