Tan0000591 (gene) Snake gourd v1

Overview
Name: Tan0000591
Type: gene
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Adenine nucleotide alpha hydrolases-like superfamily protein, putative
Location: LG10: 5053603 .. 5056936 (+)
RNA-Seq Expression: Tan0000591
Synteny: Tan0000591
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CACAAACTTTGAGATAGGAATTGATTTAAAAAATAAAAAACAACAAAGGAGAAAAACTAGGGTTTTTTTGGTGGCGGAAGAAAGGGCGAGAAGATCGGAGAAGACGGTGACCGAATCAGAAGGAAGCAGAAAAATCGGAAGAAATGGAAGAGTTGAGGTCCGAAAAATGGAATGGTAAGCAAATAGCGAGAGGCGAGAGGAAACAGAAAAGAAAGGGGAGAAGAGAAGGAAAATTTGTTAATATAGTTGTGTTTTGAAACGGACTAAAACCAAAATGACCACCTCATATTCATATTGCTTCTCCTTCTCACTTTTGATTTTGCCTTTCCTTCACATCCACAGCCTGATTCCCCCCCTTCTCTTTGTTGTTTTGGGTCCAGCGGCGGCGGCTGCTGCTGTGGATGGTAGCATCGCCGGAGGTAGACGGCAGCCATGGGAGGAGGGCGGGCTATTAAATACTACTAATACTACTACGTCCACCACGTCACCAAAAAAGGTCATGGTCGTCGTAGATCCCACACGAGAGTCCGCCGCCGCGCTCCAGTACGCGCTTTCGCATGCTGTCATTGATAACGACGAGGTCATTCTTCTCCATGTTGATAACCCTAATTCTTGGAGGAACACCATTACTACATTCCTTAAGAGGCCCAATGGCGGATCCGCCAATGCTCATTCTCATGCTCATGCTCATGCTCATTCTCATGCCACGGCACCTGCAAATGCCGCCTCCGACGGCGGAGGAGCGGCGGAGGTCGATTTTCTTGAGGAGATGAAGAAGGTCTGCAAGGTTGCTCATCCCAAACTGAAAGTGCGCACGGCGAGGGTTGAATTGGAAGGCAAAGACAAGGCCGCCATGATTATGGCTCAAACCAAGGCTCTCGGTATTGATCTGCTAGTCATAGGGCAGAGGCGAAGTCTCTCCACAGCAATCTTAGGGTTTGCCTTCTTCCCTTCATCAAAAATTTTGGGATTAGTATTCTTCTTTTTTAACTTCAGATCAAATGCATTTTTGGTCCATGAATTGTATTGTTCTGCTTTGATTTTCGAGCTTTCTCAATAATCATTTTAATTTTAGCTCCTAAACTGGTTTTGGTCCCTGTTTAGCAGTTGAATGTGAGAGTTTTGAATATGCTTATTATACTCTATCGTAGTCAACCAACCACCATAGGTCGGATTAGTTGTCAATAAAGGCATATAAATAATAAAGAGCTTAAAGAGAATGGGGTAGAGTTAGATAATTGCCCGAATAGGCAATGTGCATAGATGCAGACACTCACGGATATTAATTTTTTTTTTTTTAAAAAAAGAGCATGTCTCGAACTCAAGTAGGTTGAGCTTGTATTGAAATCAAAGTATGTTAGGTTAGTTAAGATAATGTATTGACTTGTTTACTTGAGTAAATATTGAAGTCAAAGAAGGTTAGGTTGGTTAAGATAATATGTGTATTAACTTGTTGATTTGATTATGTATTGGAGTCAAATAGGTTAGATTGGTTAAGATAATATGTGTGTTGACTTGTTGGATTGAGTATGTATTGAAATTAAAGTATGTTAGGTTGGTTAAGATAATATGTGTATTGATTTATTGGCTTGAGTACATTGGAGTCAAGTAGGTTAGGTTAGTTAAGATGGATGTTAGGTAGAGAAAAATTAATGACAAAATAAAGACAATGACTTTTATGGATAACAAAATAAAGACAATGACAAAGTTTATTATTTTTTTAAAAAATTAAATAACAAAGAATGTAAAAGTTCATTGGCTAAAATGAGATTTAAACATTAACTTTATCAAACCTAAAAGCTTGAATACAATGTAGTGAAATTAATACATACCCTTTGTTGTTCGAGTAATTTTTTTGAAGATCCATACCTAAATGCTGAGTGTTTCTGTAATTGGGACTTGGGGTAAGCATAGTTCTACAAGTGTATATTTGACATCCTCTTTTGGTTGGACAAAAGCGATTTTATGGTAGGTTCGAGTTTTTTTTTCCTCCTCCTCTC
TTTTGGACCTTTTGCAATTATAATGGAAGTTTAGTTGACTGACCTTAGGATGATTCTTTTAAATAAAACATTGTATTATATAGATAAAAGTGTTGAAATCAAACTAGGAAACCATAAGTTATGAAACCATTAAGATGTTTTAATAGTTAGCTTTGGTCAACTGGATGAAAAGTTTTTGTTTGTGAAATTATTAAATGCAGATATAGACGGTCAGGAGGGCCAATGAAAGGGGCAAAAATGTTGGACACAGCAGAGTATTTGATTGAGAACAGCAAATGCACTTGTGTTGCTGTACAAAAGAAGGGTCAAAATGCAGGCTATCTTTTGAACACCAAAACCCACAGAAACTTCTGGCTGTTGGCTTGATATCATATCATCAACCTCTCTTCACCATTTTCCATTCTCCACAATCCACCACTGTTTTCAATTCATCACACTCTCTTTCTCTCTCTCTCTCTCTCTCTCTTTAAACTCAAAACAGATTGTCTTTCAATTTGGGTCACTTTCTTCTTCCCCCATTGTGAGATCCATTGCAAGCTCACCCAGTTGAATTTGTTGATTCATTCATCTCTAGGCTTGGCTTGGCTTGGTTTGGTTTGGATTGGTCACCCTCCTTTTCTTTCTTTCCAGGCAGTGCATCAACATTTATTTGTACATACTTTTGGACCCACAGAATTAACCAACTGAAAATGAGTTAATCCACTCTCCTTTTTCTTTATTGCTTTCTTTTTTTTTTTCTTCCTTTTGCTCCAAAGGATTTGTTTGTTAATCATGTGCTTATTTGTGTCAGTCTGTTTAATCTTGACATTGTAATTTTGTTCCTTTGAGCTTGAGTTTGTATAAGTGGTGAAATTAGTACTCTTCTTACAACCTAACCTAGAAAATTGCCCTGAAGCCAACTTAGTCAACTGTCGTGTCGAGTCGAGTAGTAGTTAAAAATGTCCAAACCGAACAAGAAAATATAACCTAACTACTGTAAAATCCCACATCATCTAGTAACAAAAAAAAAATGTCCACACCCAGTCTCAAATCTTCGAATGCCAGCAACTTTACTGTAAGAAGTTATTTATTTCACTGCTTATCCAAGTTCATGGGGTTCAATTTGATGAAAAGGTTAAATTGGACTCATTTAGACAGGTTCAAGAAGGAAAAATAGAAGTAAAAAGCCCTCCTTTTCTTTGTTTTGTTACTTTGTGTAAGATTGTTCTTATCCATATTGAAATAAAATGAAGTCTCAATGTTTGTAGGGATGATCTTGTTGAGCGAGAAAGAAGATCAAACCATCCAATTTTTAGGATGAGCATTTTCCATCCAGATTAATGAGATTAATAAAA

mRNA sequence

CACAAACTTTGAGATAGGAATTGATTTAAAAAATAAAAAACAACAAAGGAGAAAAACTAGGGTTTTTTTGGTGGCGGAAGAAAGGGCGAGAAGATCGGAGAAGACGGTGACCGAATCAGAAGGAAGCAGAAAAATCGGAAGAAATGGAAGAGTTGAGGTCCGAAAAATGGAATGCGGCGGCGGCTGCTGCTGTGGATGGTAGCATCGCCGGAGGTAGACGGCAGCCATGGGAGGAGGGCGGGCTATTAAATACTACTAATACTACTACGTCCACCACGTCACCAAAAAAGGTCATGGTCGTCGTAGATCCCACACGAGAGTCCGCCGCCGCGCTCCAGTACGCGCTTTCGCATGCTGTCATTGATAACGACGAGGTCATTCTTCTCCATGTTGATAACCCTAATTCTTGGAGGAACACCATTACTACATTCCTTAAGAGGCCCAATGGCGGATCCGCCAATGCTCATTCTCATGCTCATGCTCATGCTCATTCTCATGCCACGGCACCTGCAAATGCCGCCTCCGACGGCGGAGGAGCGGCGGAGGTCGATTTTCTTGAGGAGATGAAGAAGGTCTGCAAGGTTGCTCATCCCAAACTGAAAGTGCGCACGGCGAGGGTTGAATTGGAAGGCAAAGACAAGGCCGCCATGATTATGGCTCAAACCAAGGCTCTCGGTATTGATCTGCTAGTCATAGGGCAGAGGCGAAGTCTCTCCACAGCAATCTTAGGATATAGACGGTCAGGAGGGCCAATGAAAGGGGCAAAAATGTTGGACACAGCAGAGTATTTGATTGAGAACAGCAAATGCACTTGTGTTGCTGTACAAAAGAAGGGTCAAAATGCAGGCTATCTTTTGAACACCAAAACCCACAGAAACTTCTGGCTGTTGGCTTGATATCATATCATCAACCTCTCTTCACCATTTTCCATTCTCCACAATCCACCACTGTTTTCAATTCATCACACTCTCTTTCTCTCTCTCTCTCTCTCTCTCTTTAAACTCAAAACAGATTGTCTTTCAATTTGGGTCACTTTCTTCTTCCCCCATTGTGAGATCCATTGCAAGCTCACCCAGTTGAATTTGTTGATTCATTCATCTCTAGGCTTGGCTTGGCTTGGTTTGGTTTGGATTGGTCACCCTCCTTTTCTTTCTTTCCAGGCAGTGCATCAACATTTATTTGTACATACTTTTGGACCCACAGAATTAACCAACTGAAAATGAGTTAATCCACTCTCCTTTTTCTTTATTGCTTTCTTTTTTTTTTTCTTCCTTTTGCTCCAAAGGATTTGTTTGTTAATCATGTGCTTATTTGTGTCAGTCTGTTTAATCTTGACATTGTAATTTTGTTCCTTTGAGCTTGAGTTTGTATAAGTGGTGAAATTAGTACTCTTCTTACAACCTAACCTAGAAAATTGCCCTGAAGCCAACTTAGTCAACTGTCGTGTCGAGTCGAGTAGTAGTTAAAAATGTCCAAACCGAACAAGAAAATATAACCTAACTACTGTAAAATCCCACATCATCTAGTAACAAAAAAAAAATGTCCACACCCAGTCTCAAATCTTCGAATGCCAGCAACTTTACTGTAAGAAGTTATTTATTTCACTGCTTATCCAAGTTCATGGGGTTCAATTTGATGAAAAGGTTAAATTGGACTCATTTAGACAGGTTCAAGAAGGAAAAATAGAAGTAAAAAGCCCTCCTTTTCTTTGTTTTGTTACTTTGTGTAAGATTGTTCTTATCCATATTGAAATAAAATGAAGTCTCAATGTTTGTAGGGATGATCTTGTTGAGCGAGAAAGAAGATCAAACCATCCAATTTTTAGGATGAGCATTTTCCATCCAGATTAATGAGATTAATAAAA

Coding sequence (CDS)

ATGGAAGAGTTGAGGTCCGAAAAATGGAATGCGGCGGCGGCTGCTGCTGTGGATGGTAGCATCGCCGGAGGTAGACGGCAGCCATGGGAGGAGGGCGGGCTATTAAATACTACTAATACTACTACGTCCACCACGTCACCAAAAAAGGTCATGGTCGTCGTAGATCCCACACGAGAGTCCGCCGCCGCGCTCCAGTACGCGCTTTCGCATGCTGTCATTGATAACGACGAGGTCATTCTTCTCCATGTTGATAACCCTAATTCTTGGAGGAACACCATTACTACATTCCTTAAGAGGCCCAATGGCGGATCCGCCAATGCTCATTCTCATGCTCATGCTCATGCTCATTCTCATGCCACGGCACCTGCAAATGCCGCCTCCGACGGCGGAGGAGCGGCGGAGGTCGATTTTCTTGAGGAGATGAAGAAGGTCTGCAAGGTTGCTCATCCCAAACTGAAAGTGCGCACGGCGAGGGTTGAATTGGAAGGCAAAGACAAGGCCGCCATGATTATGGCTCAAACCAAGGCTCTCGGTATTGATCTGCTAGTCATAGGGCAGAGGCGAAGTCTCTCCACAGCAATCTTAGGATATAGACGGTCAGGAGGGCCAATGAAAGGGGCAAAAATGTTGGACACAGCAGAGTATTTGATTGAGAACAGCAAATGCACTTGTGTTGCTGTACAAAAGAAGGGTCAAAATGCAGGCTATCTTTTGAACACCAAAACCCACAGAAACTTCTGGCTGTTGGCTTGA

Protein sequence

MEELRSEKWNAAAAAAVDGSIAGGRRQPWEEGGLLNTTNTTTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA
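
As a quick consistency check, the protein above should follow from the CDS under the standard genetic code. The following is a minimal sketch in Python, assuming the standard code (NCBI translation table 1) applies to this nuclear gene. The function name `translate` and the two short test fragments are illustrative choices, not part of the database record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated
# with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten and last six codons of the CDS listed above.
print(translate("ATGGAAGAGTTGAGGTCCGAAAAATGGAAT"))
print(translate("TTCTGGCTGTTGGCTTGA"))
```

The two fragments translate to MEELRSEKWN and FWLLA, which match the start and end of the protein sequence. Passing the full CDS string to `translate` gives the complete comparison.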
Homology
BLAST of Tan0000591 vs. NCBI nr
Match: XP_038881793.1 (homeobox protein 5 [Benincasa hispida])

HSP 1 Score: 337.4 bits (864), Expect = 1.0e-88
Identity = 179/219 (81.74%), Positives = 189/219 (86.30%), Query Frame = 0

Query: 42  TSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN 101
           TST   +KVMVVVDPTRESAAALQYALSHAVIDND+VILLHVDNPNSW+N ITTFLKRPN
Sbjct: 6   TSTAPSRKVMVVVDPTRESAAALQYALSHAVIDNDQVILLHVDNPNSWKNAITTFLKRPN 65

Query: 102 GGSAN----------AHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHPK 161
           GGSAN           H++A+A A + ATA ++    GG  AEVDFLEEMKK CK AHPK
Sbjct: 66  GGSANNNNNNSNNNYNHNNANATAAAAATAASDGGQGGGPTAEVDFLEEMKKACKTAHPK 125

Query: 162 LKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLD 221
           LKV T RVELEGKDKA+MIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMK AKMLD
Sbjct: 126 LKVGTLRVELEGKDKASMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKAAKMLD 185

Query: 222 TAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA 251
           TAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA
Sbjct: 186 TAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA 224

BLAST of Tan0000591 vs. NCBI nr
Match: XP_008438448.1 (PREDICTED: uncharacterized protein LOC103483538 [Cucumis melo] >KAA0049204.1 uncharacterized protein E6C27_scaffold171G004420 [Cucumis melo var. makuwa] >TYK17355.1 uncharacterized protein E5676_scaffold434G002120 [Cucumis melo var. makuwa])

HSP 1 Score: 330.1 bits (845), Expect = 1.7e-86
Identity = 169/209 (80.86%), Positives = 188/209 (89.95%), Query Frame = 0

Query: 43  STTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN- 102
           ST   +KVMVVVDPTRESAAALQYA+SHAV+DND+VILLHVDNPNSWRN I+TFLKRPN 
Sbjct: 7   STAPSRKVMVVVDPTRESAAALQYAISHAVMDNDQVILLHVDNPNSWRNAISTFLKRPNG 66

Query: 103 GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHPKLKVRTARVEL 162
           GGS N++++ + HA + ATA ++    GG  A+VDFLEEMKK CKVAHPK+KV T RVEL
Sbjct: 67  GGSTNSNNNNNVHAAATATAASDGGQGGGATADVDFLEEMKKACKVAHPKVKVGTLRVEL 126

Query: 163 EGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSK 222
           EGKDKA+MIMAQTK+LG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSK
Sbjct: 127 EGKDKASMIMAQTKSLGVDLLVIGQRRSLSTAILGYRRTGGAMKGAKMLDTAEYLIENSK 186

Query: 223 CTCVAVQKKGQNAGYLLNTKTHRNFWLLA 251
           CTCVAVQKKGQNAGYLLNTKTHRNFWLLA
Sbjct: 187 CTCVAVQKKGQNAGYLLNTKTHRNFWLLA 215

BLAST of Tan0000591 vs. NCBI nr
Match: XP_004134033.1 (uncharacterized protein LOC101222608 [Cucumis sativus] >KGN56804.1 hypothetical protein Csa_011229 [Cucumis sativus])

HSP 1 Score: 329.7 bits (844), Expect = 2.2e-86
Identity = 169/210 (80.48%), Positives = 188/210 (89.52%), Query Frame = 0

Query: 42  TSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN 101
           TST   +KVMVVVDPTRESAAALQYALSHA++DND+VILLH+DNPNSWRN I+TFLKRPN
Sbjct: 6   TSTAPSRKVMVVVDPTRESAAALQYALSHALMDNDQVILLHIDNPNSWRNAISTFLKRPN 65

Query: 102 -GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHPKLKVRTARVE 161
            GGS N++++ + HA + ATA ++    GG  AEVDFLEEMKK CK AHPKL+V T RVE
Sbjct: 66  GGGSTNSNNNNNVHAAATATAASDGGQGGGATAEVDFLEEMKKACKKAHPKLEVGTLRVE 125

Query: 162 LEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENS 221
           LEGKDKA+MIMAQTK+LG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENS
Sbjct: 126 LEGKDKASMIMAQTKSLGVDLLVIGQRRSLSTAILGYRRTGGAMKGAKMLDTAEYLIENS 185

Query: 222 KCTCVAVQKKGQNAGYLLNTKTHRNFWLLA 251
           KCTCVAVQKKGQNAGYLLNTKTHRNFWLLA
Sbjct: 186 KCTCVAVQKKGQNAGYLLNTKTHRNFWLLA 215

BLAST of Tan0000591 vs. NCBI nr
Match: KAG7028797.1 (hypothetical protein SDJN02_09978, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 320.5 bits (820), Expect = 1.3e-83
Identity = 174/221 (78.73%), Positives = 187/221 (84.62%), Query Frame = 0

Query: 32  GGLLNTTNTTTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRN 91
           GG + T  TT S T+PKKVMVVVDPTRESAAALQY+LSHAVID DEVILLHVDNPN+W+N
Sbjct: 3   GGRITTNTTTASNTTPKKVMVVVDPTRESAAALQYSLSHAVIDCDEVILLHVDNPNTWKN 62

Query: 92  TITTFLKRPNGGSANAHSHAHAHAHSHATAPANAAS--DGGGAAEVDFLEEMKKVCKVAH 151
            ITTFLKRPNGG AN        +H+ AT   NAAS  +GGG  +VDFL+EMKKVC VA 
Sbjct: 63  AITTFLKRPNGGPAN--------SHAAATPTDNAASGAEGGGPTDVDFLKEMKKVCNVAR 122

Query: 152 PKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKM 211
           P+LKVRTARVELEGKDKAAMIMAQTKAL IDLLVIGQRRSLSTAILGY+R+G     AKM
Sbjct: 123 PELKVRTARVELEGKDKAAMIMAQTKALDIDLLVIGQRRSLSTAILGYKRAG----VAKM 182

Query: 212 LDTAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA 251
           LDTAEYLIENS CTCVAVQKKGQNAGYLLNTKTHRNFWLLA
Sbjct: 183 LDTAEYLIENSNCTCVAVQKKGQNAGYLLNTKTHRNFWLLA 211

BLAST of Tan0000591 vs. NCBI nr
Match: XP_023538863.1 (uncharacterized protein LOC111799663 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 319.7 bits (818), Expect = 2.2e-83
Identity = 175/221 (79.19%), Positives = 184/221 (83.26%), Query Frame = 0

Query: 32  GGLLNTTNTTTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRN 91
           GG + T  TT S T+PKKVMVVVDPTRESAAALQY+LSHAVID DEVILLHVDNPNSW+N
Sbjct: 3   GGRITTNTTTASNTTPKKVMVVVDPTRESAAALQYSLSHAVIDCDEVILLHVDNPNSWKN 62

Query: 92  TITTFLKRPNGGSANAHSHAHAHAHSHATAPANAASD--GGGAAEVDFLEEMKKVCKVAH 151
            ITTFLKRPNGG AN            AT   NAASD  GGG  +VDFL+EMKKVCKVA 
Sbjct: 63  AITTFLKRPNGGPANT-----------ATPTDNAASDAGGGGPMDVDFLKEMKKVCKVAR 122

Query: 152 PKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKM 211
           P+L VRTARVELEGKDKAAMIMAQTKAL IDLLVIGQRRSLSTAILGY+R+G     AKM
Sbjct: 123 PELNVRTARVELEGKDKAAMIMAQTKALDIDLLVIGQRRSLSTAILGYKRAG----VAKM 182

Query: 212 LDTAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA 251
           LDTAEYLIENS CTCVAVQKKGQNAGYLLNTKTHRNFWLLA
Sbjct: 183 LDTAEYLIENSNCTCVAVQKKGQNAGYLLNTKTHRNFWLLA 208

BLAST of Tan0000591 vs. ExPASy TrEMBL
Match: A0A5A7U1M3 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold434G002120 PE=4 SV=1)

HSP 1 Score: 330.1 bits (845), Expect = 8.0e-87
Identity = 169/209 (80.86%), Positives = 188/209 (89.95%), Query Frame = 0

Query: 43  STTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN- 102
           ST   +KVMVVVDPTRESAAALQYA+SHAV+DND+VILLHVDNPNSWRN I+TFLKRPN 
Sbjct: 7   STAPSRKVMVVVDPTRESAAALQYAISHAVMDNDQVILLHVDNPNSWRNAISTFLKRPNG 66

Query: 103 GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHPKLKVRTARVEL 162
           GGS N++++ + HA + ATA ++    GG  A+VDFLEEMKK CKVAHPK+KV T RVEL
Sbjct: 67  GGSTNSNNNNNVHAAATATAASDGGQGGGATADVDFLEEMKKACKVAHPKVKVGTLRVEL 126

Query: 163 EGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSK 222
           EGKDKA+MIMAQTK+LG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSK
Sbjct: 127 EGKDKASMIMAQTKSLGVDLLVIGQRRSLSTAILGYRRTGGAMKGAKMLDTAEYLIENSK 186

Query: 223 CTCVAVQKKGQNAGYLLNTKTHRNFWLLA 251
           CTCVAVQKKGQNAGYLLNTKTHRNFWLLA
Sbjct: 187 CTCVAVQKKGQNAGYLLNTKTHRNFWLLA 215

BLAST of Tan0000591 vs. ExPASy TrEMBL
Match: A0A1S3AWE0 (uncharacterized protein LOC103483538 OS=Cucumis melo OX=3656 GN=LOC103483538 PE=4 SV=1)

HSP 1 Score: 330.1 bits (845), Expect = 8.0e-87
Identity = 169/209 (80.86%), Positives = 188/209 (89.95%), Query Frame = 0

Query: 43  STTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN- 102
           ST   +KVMVVVDPTRESAAALQYA+SHAV+DND+VILLHVDNPNSWRN I+TFLKRPN 
Sbjct: 7   STAPSRKVMVVVDPTRESAAALQYAISHAVMDNDQVILLHVDNPNSWRNAISTFLKRPNG 66

Query: 103 GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHPKLKVRTARVEL 162
           GGS N++++ + HA + ATA ++    GG  A+VDFLEEMKK CKVAHPK+KV T RVEL
Sbjct: 67  GGSTNSNNNNNVHAAATATAASDGGQGGGATADVDFLEEMKKACKVAHPKVKVGTLRVEL 126

Query: 163 EGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSK 222
           EGKDKA+MIMAQTK+LG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENSK
Sbjct: 127 EGKDKASMIMAQTKSLGVDLLVIGQRRSLSTAILGYRRTGGAMKGAKMLDTAEYLIENSK 186

Query: 223 CTCVAVQKKGQNAGYLLNTKTHRNFWLLA 251
           CTCVAVQKKGQNAGYLLNTKTHRNFWLLA
Sbjct: 187 CTCVAVQKKGQNAGYLLNTKTHRNFWLLA 215

BLAST of Tan0000591 vs. ExPASy TrEMBL
Match: A0A0A0L6Y4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G134530 PE=4 SV=1)

HSP 1 Score: 329.7 bits (844), Expect = 1.0e-86
Identity = 169/210 (80.48%), Positives = 188/210 (89.52%), Query Frame = 0

Query: 42  TSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPN 101
           TST   +KVMVVVDPTRESAAALQYALSHA++DND+VILLH+DNPNSWRN I+TFLKRPN
Sbjct: 6   TSTAPSRKVMVVVDPTRESAAALQYALSHALMDNDQVILLHIDNPNSWRNAISTFLKRPN 65

Query: 102 -GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHPKLKVRTARVE 161
            GGS N++++ + HA + ATA ++    GG  AEVDFLEEMKK CK AHPKL+V T RVE
Sbjct: 66  GGGSTNSNNNNNVHAAATATAASDGGQGGGATAEVDFLEEMKKACKKAHPKLEVGTLRVE 125

Query: 162 LEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENS 221
           LEGKDKA+MIMAQTK+LG+DLLVIGQRRSLSTAILGYRR+GG MKGAKMLDTAEYLIENS
Sbjct: 126 LEGKDKASMIMAQTKSLGVDLLVIGQRRSLSTAILGYRRTGGAMKGAKMLDTAEYLIENS 185

Query: 222 KCTCVAVQKKGQNAGYLLNTKTHRNFWLLA 251
           KCTCVAVQKKGQNAGYLLNTKTHRNFWLLA
Sbjct: 186 KCTCVAVQKKGQNAGYLLNTKTHRNFWLLA 215

BLAST of Tan0000591 vs. ExPASy TrEMBL
Match: A0A6J1IE79 (uncharacterized protein LOC111473211 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111473211 PE=4 SV=1)

HSP 1 Score: 318.5 bits (815), Expect = 2.4e-83
Identity = 177/222 (79.73%), Positives = 188/222 (84.68%), Query Frame = 0

Query: 32  GGLLNTTNTTT-STTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWR 91
           GG   TTNTTT S T+PKKVMVVVDPTRESAAALQY+LSHAVID DEVILLHVDNPNSW+
Sbjct: 2   GGERITTNTTTASNTTPKKVMVVVDPTRESAAALQYSLSHAVIDCDEVILLHVDNPNSWK 61

Query: 92  NTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAASD--GGGAAEVDFLEEMKKVCKVA 151
           N ITTFLKRPNGG AN        +H+ AT   NAASD  GGG  +VDFL+EMKKVC VA
Sbjct: 62  NAITTFLKRPNGGPAN--------SHAAATPIDNAASDAGGGGPTDVDFLKEMKKVCNVA 121

Query: 152 HPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAK 211
            P+LKVRTARVE+EGKDKAAMIMAQTKAL IDLLVIGQRRSLSTAILGY+R+G      K
Sbjct: 122 RPELKVRTARVEMEGKDKAAMIMAQTKALDIDLLVIGQRRSLSTAILGYKRAG----VVK 181

Query: 212 MLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA 251
           MLDTAEYLIENS CTCVAVQKKGQNAGYLLNTKTHRNFWLLA
Sbjct: 182 MLDTAEYLIENSNCTCVAVQKKGQNAGYLLNTKTHRNFWLLA 211

BLAST of Tan0000591 vs. ExPASy TrEMBL
Match: A0A6J1FDQ7 (uncharacterized protein LOC111444812 OS=Cucurbita moschata OX=3662 GN=LOC111444812 PE=4 SV=1)

HSP 1 Score: 317.0 bits (811), Expect = 7.0e-83
Identity = 173/221 (78.28%), Positives = 185/221 (83.71%), Query Frame = 0

Query: 32  GGLLNTTNTTTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRN 91
           GG + T  TT S T+PKKVMVVVDPTRESAAALQY+LSHAVID DEVILLHVDNPN+W+N
Sbjct: 3   GGRITTNTTTASNTTPKKVMVVVDPTRESAAALQYSLSHAVIDCDEVILLHVDNPNTWKN 62

Query: 92  TITTFLKRPNGGSANAHSHAHAHAHSHATAPANAAS--DGGGAAEVDFLEEMKKVCKVAH 151
            ITTFLKRPNGG AN        +H+ AT   NAAS   GGG  +VDFL+EMKKVC VA 
Sbjct: 63  AITTFLKRPNGGPAN--------SHAAATPTDNAASGAGGGGPTDVDFLKEMKKVCNVAR 122

Query: 152 PKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKM 211
           P+LKVR ARVELEGKDKAAMIMAQTKAL IDLLVIGQRRSLSTAILGY+R+G     AKM
Sbjct: 123 PELKVRMARVELEGKDKAAMIMAQTKALDIDLLVIGQRRSLSTAILGYKRAG----VAKM 182

Query: 212 LDTAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA 251
           LDTAEYLIENS CTCVAVQKKGQNAGYLLNTKTHRNFWLLA
Sbjct: 183 LDTAEYLIENSNCTCVAVQKKGQNAGYLLNTKTHRNFWLLA 211

BLAST of Tan0000591 vs. TAIR 10
Match: AT4G13450.1 (Adenine nucleotide alpha hydrolases-like superfamily protein )

HSP 1 Score: 215.7 bits (548), Expect = 4.2e-56
Identity = 110/213 (51.64%), Positives = 159/213 (74.65%), Query Frame = 0

Query: 41  TTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNP-NSWRNTITTFLKR 100
           ++ST   +K+MV+ DPTRESAAALQYALSHAV++ DE+IL+H++N   SW+N  ++FL+ 
Sbjct: 8   SSSTPQSRKIMVIADPTRESAAALQYALSHAVLEQDELILVHIENSGGSWKNAFSSFLRL 67

Query: 101 PN--GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHPKLKVRTA 160
           P+    S++  S A     + + A ANA +   G  + +FLE+MK++C++A PK++V T 
Sbjct: 68  PSSISSSSSGSSPASNGTTTASNAAANALASEIGQGDGNFLEQMKRICEIAQPKVRVHTE 127

Query: 161 RVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLI 220
            + ++G  KA  I+     LG+D+++IGQRR++S+++LG RR GG ++G+K +DTAEYLI
Sbjct: 128 CIAIDGV-KATAILLHGDKLGVDVIIIGQRRTISSSLLGTRRPGGSLRGSKGVDTAEYLI 187

Query: 221 ENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA 251
           ENSKCTCV V KKGQN GY+LNTKTH+NFWLLA
Sbjct: 188 ENSKCTCVGVTKKGQNGGYVLNTKTHKNFWLLA 219

BLAST of Tan0000591 vs. TAIR 10
Match: AT4G13450.2 (Adenine nucleotide alpha hydrolases-like superfamily protein )

HSP 1 Score: 129.0 bits (323), Expect = 5.2e-30
Identity = 70/160 (43.75%), Positives = 113/160 (70.62%), Query Frame = 0

Query: 41  TTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNP-NSWRNTITTFLKR 100
           ++ST   +K+MV+ DPTRESAAALQYALSHAV++ DE+IL+H++N   SW+N  ++FL+ 
Sbjct: 8   SSSTPQSRKIMVIADPTRESAAALQYALSHAVLEQDELILVHIENSGGSWKNAFSSFLRL 67

Query: 101 PN--GGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHPKLKVRTA 160
           P+    S++  S A     + + A ANA +   G  + +FLE+MK++C++A PK++V T 
Sbjct: 68  PSSISSSSSGSSPASNGTTTASNAAANALASEIGQGDGNFLEQMKRICEIAQPKVRVHTE 127

Query: 161 RVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGY 198
            + ++G  KA  I+     LG+D+++IGQRR++S+++LGY
Sbjct: 128 CIAIDGV-KATAILLHGDKLGVDVIIIGQRRTISSSLLGY 166

BLAST of Tan0000591 vs. TAIR 10
Match: AT2G03720.1 (Adenine nucleotide alpha hydrolases-like superfamily protein )

HSP 1 Score: 90.9 bits (224), Expect = 1.6e-18
Identity = 66/202 (32.67%), Positives = 95/202 (47.03%), Query Frame = 0

Query: 51  MVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSH 110
           MVVVD T ++  ALQ+AL+H V D D + LLHV               R   G A   + 
Sbjct: 1   MVVVDTTSQTKNALQWALTHCVQDEDNITLLHV--------------TRTPVGQAIDETQ 60

Query: 111 AHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHPKLKVRTARVELEGKDKAAMI 170
              ++ +H                 + +  +K  C++  P +K     VE   ++K   I
Sbjct: 61  RERNSRAH-----------------ELVHPLKNFCQLKKPNVKTEIVVVE-TAEEKGKTI 120

Query: 171 MAQTKALGIDLLVIGQRRSLS--TAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQ 230
           + ++K  G  +LV+GQR+  S    I  +R  GG   G       EY I NS C  +AV+
Sbjct: 121 VEESKKQGAGVLVLGQRKRTSKWRVIWKWRTKGGMGGG-----VVEYCIHNSDCMAIAVR 165

Query: 231 KKGQNAGYLLNTKTHRNFWLLA 251
           KK  N GYL+ TK H++FWLLA
Sbjct: 181 KKSNNGGYLITTKRHKDFWLLA 165

BLAST of Tan0000591 vs. TAIR 10
Match: AT5G17390.1 (Adenine nucleotide alpha hydrolases-like superfamily protein )

HSP 1 Score: 77.4 bits (189), Expect = 1.8e-14
Identity = 59/203 (29.06%), Positives = 96/203 (47.29%), Query Frame = 0

Query: 49  KVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAH 108
           +VMVVVD    S  AL++A++H +   D + LL+   P  +R +     KR N       
Sbjct: 118 RVMVVVDKALASTGALEWAITHTLQPQDTLFLLYFAKP--FRKS-----KRKN------- 177

Query: 109 SHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHPKLKVRTARVELEGKDKAA 168
                             +D       + +  +KK+C+   P ++V   R+E + KDK  
Sbjct: 178 ------------RKREVKTD-------ELVHTLKKLCQTKRPGIEVEIRRLEGKDKDKGQ 237

Query: 169 MIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQ 228
            I+ ++K   + LLV+GQ +      L  R +    +G +     +Y +EN+ C  +AV+
Sbjct: 238 KIVEESKKQQVSLLVVGQEKKPPVWRLLKRWAWKRRRGHE--GVLKYCLENASCMTIAVK 285

Query: 229 KKGQN-AGYLLNTKTHRNFWLLA 251
            K +   GYL+ TK H+NFWLLA
Sbjct: 298 PKNRKLGGYLITTKRHKNFWLLA 285

BLAST of Tan0000591 vs. TAIR 10
Match: AT3G03290.1 (Adenine nucleotide alpha hydrolases-like superfamily protein )

HSP 1 Score: 73.2 bits (178), Expect = 3.4e-13
Identity = 57/208 (27.40%), Positives = 91/208 (43.75%), Query Frame = 0

Query: 44  TTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGG 103
           T +  +VMVVVD    S  AL++AL H +   D + LL+   P         F K   G 
Sbjct: 102 TEAGNRVMVVVDKVIASTGALEWALKHTLQSQDYLFLLYFSKP---------FRK---GK 161

Query: 104 SANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHPKLKVRTARVELEG 163
             N  S                          + +  +KK+C+   P ++V   R++ + 
Sbjct: 162 RKNRKSEVKTD---------------------ELVHTLKKLCQTKRPGIEVEIRRLQGKE 221

Query: 164 KDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCT 223
           K+K   I+ + K   + LLV+G+ +     +    +  G  K      T +Y +E + C 
Sbjct: 222 KEKGEKIVEEAKEQQVSLLVVGKEK--KPPVWRLLKRWGWKKRRGRAGTLKYCLEKASCM 274

Query: 224 CVAVQKKGQN-AGYLLNTKTHRNFWLLA 251
            +AV+ K +   GYL+ TK H+NFWLLA
Sbjct: 282 TIAVKPKNRKLGGYLITTKRHKNFWLLA 274

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038881793.1  1.0e-88  81.74         homeobox protein 5 [Benincasa hispida]
XP_008438448.1  1.7e-86  80.86         PREDICTED: uncharacterized protein LOC103483538 [Cucumis melo] >KAA0049204.1 unc...
XP_004134033.1  2.2e-86  80.48         uncharacterized protein LOC101222608 [Cucumis sativus] >KGN56804.1 hypothetical ...
KAG7028797.1    1.3e-83  78.73         hypothetical protein SDJN02_09978, partial [Cucurbita argyrosperma subsp. argyro...
XP_023538863.1  2.2e-83  79.19         uncharacterized protein LOC111799663 [Cucurbita pepo subsp. pepo]

ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A5A7U1M3      8.0e-87  80.86         Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3AWE0      8.0e-87  80.86         uncharacterized protein LOC103483538 OS=Cucumis melo OX=3656 GN=LOC103483538 PE=...
A0A0A0L6Y4      1.0e-86  80.48         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G134530 PE=4 SV=1
A0A6J1IE79      2.4e-83  79.73         uncharacterized protein LOC111473211 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FDQ7      7.0e-83  78.28         uncharacterized protein LOC111444812 OS=Cucurbita moschata OX=3662 GN=LOC1114448...

TAIR 10
Match Name      E-value  Identity (%)  Description
AT4G13450.1     4.2e-56  51.64         Adenine nucleotide alpha hydrolases-like superfamily protein
AT4G13450.2     5.2e-30  43.75         Adenine nucleotide alpha hydrolases-like superfamily protein
AT2G03720.1     1.6e-18  32.67         Adenine nucleotide alpha hydrolases-like superfamily protein
AT5G17390.1     1.8e-14  29.06         Adenine nucleotide alpha hydrolases-like superfamily protein
AT3G03290.1     3.4e-13  27.40         Adenine nucleotide alpha hydrolases-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25

IPR Term   IPR Description                              Source       Source Term    Source Description                                            Alignment
IPR006016  UspA                                         PFAM         PF00582        Usp                                                           coord: 48..196; e-value: 2.6E-13; score: 50.7
IPR014729  Rossmann-like alpha/beta/alpha sandwich fold GENE3D       3.40.50.620    HUPs                                                          coord: 32..197; e-value: 1.2E-14; score: 56.6
None       No IPR available                             MOBIDB_LITE  mobidb-lite    disorder_prediction                                           coord: 100..127
None       No IPR available                             MOBIDB_LITE  mobidb-lite    disorder_prediction                                           coord: 23..44
None       No IPR available                             PANTHER      PTHR47867      ADENINE NUCLEOTIDE ALPHA HYDROLASES-LIKE SUPERFAMILY PROTEIN  coord: 40..198
None       No IPR available                             PANTHER      PTHR47867:SF1  ADENINE NUCLEOTIDE ALPHA HYDROLASES-LIKE SUPERFAMILY PROTEIN  coord: 40..198
None       No IPR available                             CDD          cd00293        USP_Like                                                      coord: 49..196; e-value: 7.74054E-10; score: 53.5278
None       No IPR available                             SUPERFAMILY  52402          Adenine nucleotide alpha hydrolases-like                      coord: 46..188
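
The `coord` ranges above can be used to pull the matching region out of the protein sequence. The sketch below assumes the coordinates are 1-based and inclusive on the protein listed in the Sequences section, which is the usual convention for domain annotations but is not stated in this record. The helper name `subsequence` is hypothetical.

```python
def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]

# Protein sequence of Tan0000591, copied from the Sequences section.
PROTEIN = "MEELRSEKWNAAAAAAVDGSIAGGRRQPWEEGGLLNTTNTTTSTTSPKKVMVVVDPTRESAAALQYALSHAVIDNDEVILLHVDNPNSWRNTITTFLKRPNGGSANAHSHAHAHAHSHATAPANAASDGGGAAEVDFLEEMKKVCKVAHPKLKVRTARVELEGKDKAAMIMAQTKALGIDLLVIGQRRSLSTAILGYRRSGGPMKGAKMLDTAEYLIENSKCTCVAVQKKGQNAGYLLNTKTHRNFWLLA"

# Region covered by the Pfam PF00582 (Usp) hit, coord: 48..196.
usp = subsequence(PROTEIN, 48, 196)
print(len(usp), usp[:8])
```

Under that assumption the Usp hit spans 149 residues and begins at the KKVMVVVD stretch, which is also where the cucurbit BLAST alignments become near-identical.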

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Tan0000591.1  Tan0000591.1  mRNA
Tan0000591.2  Tan0000591.2  mRNA